git-server-git.apps.pok.os.sepia.ceph.com Git - ceph.git/log


commit | commitdiff | tree

Ronen Friedman [Sat, 17 Aug 2024 16:08:19 +0000 (11:08 -0500)]

osd/scrub: delay both targets on some failures

If the failure of a scrub-job is due to a condition that affects
both targets, both should be delayed. Otherwise, we may end up
with the following bogus scenario:

A high priority deep target is scheduled, but scrub session initiation
fails due to, for example, a concurrent snap trim. The deep target
will be delayed. A second initiation attempt may happen after the
snap trimming is done, but before the updated deep target not-before.
As a result - the lower priority target will be scheduled before the
higher priority one - which is a bug.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
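
The scenario above can be illustrated with a minimal C++ sketch. It is not the Ceph implementation: the Job, Target and FailureScope types and the delay_on_failure() helper are hypothetical names, assumed here only to show why a PG-wide failure should push back both not-before times.

```cpp
#include <algorithm>
#include <cassert>
#include <chrono>

using Clock = std::chrono::steady_clock;

enum class Level { shallow, deep };

// Hypothetical, simplified stand-ins for a scrub job and its two targets.
struct Target {
  Clock::time_point not_before;
};

struct Job {
  Target shallow;
  Target deep;
};

// Whether the failure concerns only the selected target or the whole PG
// (for example, a concurrent snap trim).
enum class FailureScope { target_only, whole_pg };

// Push the not-before of the failed target, or of both targets, to 'until'.
inline void delay_on_failure(Job& job, Level failed, FailureScope scope,
                             Clock::time_point until) {
  auto push_back = [until](Target& t) {
    t.not_before = std::max(t.not_before, until);
  };
  if (scope == FailureScope::whole_pg) {
    push_back(job.shallow);
    push_back(job.deep);
    // Neither target can now be selected ahead of the other before 'until'.
    assert(job.shallow.not_before >= until && job.deep.not_before >= until);
    return;
  }
  push_back(failed == Level::deep ? job.deep : job.shallow);
}
```

With FailureScope::target_only, a retry that happens before the deep target's new not-before would find only the shallow target ready, which is the ordering inversion the commit message describes.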

commit | commitdiff | tree

Ronen Friedman [Thu, 15 Aug 2024 13:17:48 +0000 (08:17 -0500)]

osd/scrub: reverse OSDRestrictions flags polarity

As most of the flags in OSDRestrictions are of 'true is bad' polarity,
reverse the two non-conforming flags - cpu load and time-of-day
restrictions - to match.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
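
As a rough illustration of why a uniform polarity helps, here is a small C++ sketch. The struct and its field names are hypothetical and are not taken from the Ceph sources; the point is only that when every flag means 'true is bad', a default-initialized object means 'no restrictions' and the flags can be OR-ed together.

```cpp
#include <cassert>

// Hypothetical restriction flags, all with 'true is bad' polarity.
struct Restrictions {
  bool max_concurrency_reached{false};
  bool recovery_in_progress{false};
  bool cpu_overloaded{false};   // load-related restriction
  bool restricted_time{false};  // time-of-day restriction
};

// With a single polarity, "is anything blocking us?" is a plain OR.
inline bool any_restriction(const Restrictions& r) {
  return r.max_concurrency_reached || r.recovery_in_progress ||
         r.cpu_overloaded || r.restricted_time;
}
```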

commit | commitdiff | tree

Ronen Friedman [Thu, 15 Aug 2024 12:51:15 +0000 (07:51 -0500)]

osd/scrub: fix the conditions for auto-repair scrubs

The conditions for auto-repair scrubs should have been changed
when need_auto lost some of its setters.

Also fix the rescheduling of repair scrubs
when the last scrub ended with errors.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>

commit | commitdiff | tree

Ronen Friedman [Thu, 8 Aug 2024 13:49:57 +0000 (08:49 -0500)]

osd/scrub: remove requested_scrub_t::deep_scrub_on_error

This flag was used to indicate that a deep scrub should
be performed if a shallow scrub finds an error. It was
always set true for shallow, regular, scrubs - if
can_autorepair flag was set. Thus, the ephemeral flag in
the requested_scrub_t object is not really needed.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>

commit | commitdiff | tree

Ronen Friedman [Tue, 6 Aug 2024 13:07:17 +0000 (08:07 -0500)]

qa/standalone/scrub: disable scrub_extended_sleep test

Disabling osd-scrub-test.sh::TEST_scrub_extended_sleep,
as the test is no longer valid (updated code no longer
produces the same logs or the same behavior).

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>

commit | commitdiff | tree

Ronen Friedman [Tue, 30 Jul 2024 12:12:54 +0000 (07:12 -0500)]

osd/scrub: remove non-display usage of target's is_high_priority()

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>

commit | commitdiff | tree

Ronen Friedman [Mon, 29 Jul 2024 04:34:32 +0000 (23:34 -0500)]

osd/scrub: remove 'calculated_to_deep' flag

as once a sched-target was selected, we know the level of the scrub.
Also removed: the ephemeral 'time_for_deep' flag.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>

commit | commitdiff | tree

Ronen Friedman [Sun, 28 Jul 2024 12:37:07 +0000 (07:37 -0500)]

osd/scrub: modify after-repair-scrub triggering

... to manipulate the relevant scrub target directly, instead
of using the 'planned scrub' flags.

The relevant condition flag was moved from the PG and into the scrubber.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>

commit | commitdiff | tree

Ronen Friedman [Sun, 28 Jul 2024 10:52:38 +0000 (05:52 -0500)]

osd/scrub: fix ReplicaReservations ctor to use correct query

when determining whether replica reservations are required.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>

commit | commitdiff | tree

Ronen Friedman [Sun, 28 Jul 2024 06:09:25 +0000 (01:09 -0500)]

osd/scrub: fix parameters validation on scrub start

... as the selected target already determines the
scrub level & type.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>

commit | commitdiff | tree

Ronen Friedman [Sun, 28 Jul 2024 10:20:38 +0000 (05:20 -0500)]

osd/scrub: fix reserve_local()

to use the correct method when determining whether we should
perform the reservation.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>

commit | commitdiff | tree

Ronen Friedman [Sat, 27 Jul 2024 17:59:46 +0000 (12:59 -0500)]

osd/scrub: fix initiation path of operator-commanded scrubs

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>

commit | commitdiff | tree

Ronen Friedman [Tue, 30 Jul 2024 10:59:00 +0000 (05:59 -0500)]

common/not_before_queue: extending the container's API

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>

commit | commitdiff | tree

Ronen Friedman [Wed, 24 Jul 2024 07:02:46 +0000 (02:02 -0500)]

osd/scrub: OSD's scrub queue now holds SchedEntry-s

The OSD's scrub queue now holds SchedEntry-s, instead of ScrubJob-s.
The queue itself is implemented using the 'not_before_queue_t' class.

Note: this is not a stable state of the scrubber code. In the next
commits:
- modifying the way sched targets are modified and updated, to match the
  new queue implementation.
- removing the 'planned scrub' flags.

Important note: the interaction of initiate_scrub() and pop_ready_pg()
is not changed by this commit. Namely:

Currently - pop..() loops over all eligible jobs, until it finds one
that matches the environment restrictions (which most of the time, as the
concurrency limit is usually reached, would be 'high-priority-only').

The other option is to maintain Sam's 'not_before_q' clean interface: we
always pop the top, and if that top fails the preconds tests - we delay and
re-push. This has the following troubling implications:

- it would take a long time to find a viable scrub job, if the problem
  is related to, for example, 'no scrub'.
- local resources failure (inc_scrubs() failure) must be handled
  separately, as we do not want to reshuffle the queue for this
  very very common case.
- but the real problem: unneeded shuffling of the queue, even as the
  problem is not with the scrub job itself, but with the environment
  (esp. no-scrub etc.).
  This is a common case, and it would be wrong to reshuffle the queue
  for that.
- and - remember that any change to a sched-entry must be done under PG
  lock.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
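
The first approach described above (scan the eligible entries and take the first one that the environment allows, leaving the others in place) might look roughly like the following C++ sketch. Candidate and pop_first_matching() are hypothetical names; this is not the not_before_queue_t API, and the vector is assumed to be already sorted in scheduling order.

```cpp
#include <cassert>
#include <functional>
#include <optional>
#include <vector>

// Hypothetical, simplified queue element.
struct Candidate {
  int pg;
  bool high_priority;
};

// Remove and return the first ready entry accepted by 'allowed'.
// Rejected entries keep their position: nothing is delayed or re-pushed
// just because the environment (not the job) is the problem.
inline std::optional<Candidate> pop_first_matching(
    std::vector<Candidate>& ready,
    const std::function<bool(const Candidate&)>& allowed) {
  for (auto it = ready.begin(); it != ready.end(); ++it) {
    if (allowed(*it)) {
      Candidate picked = *it;
      ready.erase(it);
      return picked;
    }
  }
  return std::nullopt;
}
```

Under a 'high-priority-only' restriction, the predicate simply tests high_priority; lower-priority entries ahead of the selected one stay queued in their original order.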

commit | commitdiff | tree

Ronen Friedman [Tue, 30 Jul 2024 10:54:59 +0000 (05:54 -0500)]

common/not_before_queue: move status_t out of container_t

for readability

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>

commit | commitdiff | tree

Ronen Friedman [Mon, 29 Jul 2024 03:58:22 +0000 (22:58 -0500)]

common/not_before_queue: some spelling fixes

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>

commit | commitdiff | tree

Samuel Just [Fri, 16 Dec 2022 18:30:18 +0000 (18:30 +0000)]

common: add not_before_queue_t

Signed-off-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Ronen Friedman [Fri, 12 Jul 2024 13:18:30 +0000 (08:18 -0500)]

osd/scrub: modify ScrubJob to hold two SchedTarget-s

ScrubJob will now hold two SchedTarget-s - two sets of scheduling
information (times, levels, etc.) for the next shallow and deep scrubs.

This is in preparation for the upcoming changes to the scheduling queue.
The change cannot stand on its own, as the partial implementation
creates some inconsistencies in the scheduling logic.

Specifically, here is what changes here, and how it differs from the
desired implementation:
- The OSD still maintains a queue of scrub jobs - one object only per
  PG.
  But now - each queue element holds two SchedTarget-s.
- When a scrub is initiated, the Scrubber is handed a ScrubJob object.
  Only in the next commit will it also receive the ID of the selected
  level. That causes some issues when re-determining the level of the
  initiated scrub. A failure to match the queue "intent" results in
  failures.
- the 'planned scrub' flags are still here, instead of directly
  encoding the characteristics of the next scrub in the relevant
  sched-entry.
- the 'urgency' levels do not cover the full required range of
  behaviors and priorities.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>

commit | commitdiff | tree

Ronen Friedman [Sun, 7 Jul 2024 17:46:25 +0000 (12:46 -0500)]

osd/scrub: introducing the concept of a SchedEntry

SchedEntry holds the scheduling details for scrubbing a specific PG at
a specific scrub level. Namely - it identifies the [pg,level]
combination, the 'urgency' attribute of the scheduled scrub
(which determines most of its behavior and scheduling decisions)
and the actual time attributes for scheduling (target,
deadline, not_before).

Added a table detailing, for each type of scrub, what limitations apply
to it, and what restrictions are waived.

The following commits will reshape the ScrubJob objects to hold
two instances of SchedTarget-s - two wrappers around SchedEntry-s,
one for the next shallow scrub and one for the next deep scrub.

Sched-entries (wrapped in sched-targets) have a defined order:

For ready-to-scrub entries (those whose not-before (n.b.) is in the past),
the order is first by urgency, then by target time (and then by
level - deep before shallow - and then by the n.b. itself).

'Future' entries are ordered by n.b., then urgency,
target time, and level.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
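
The ordering described above can be sketched as a comparison function. This is a simplified C++ illustration under stated assumptions, not the Ceph code: the Entry type is hypothetical, a larger urgency value is assumed to mean 'more urgent', and ready entries are assumed to sort ahead of future ones.

```cpp
#include <cassert>
#include <chrono>
#include <cstdint>
#include <tuple>

// Hypothetical, simplified types; deep is given the smaller value so that
// it sorts before shallow.
enum class Level : std::uint8_t { deep = 0, shallow = 1 };
using Time = std::chrono::system_clock::time_point;

struct Entry {
  int urgency;  // assumption: larger means more urgent
  Time target;
  Time not_before;
  Level level;
};

// Returns true if 'a' should be scheduled before 'b' at time 'now'.
inline bool scheduled_before(const Entry& a, const Entry& b, Time now) {
  const bool a_ready = a.not_before <= now;
  const bool b_ready = b.not_before <= now;
  if (a_ready != b_ready) {
    return a_ready;  // assumption: ready entries precede future ones
  }
  if (a_ready) {
    // Ready: urgency (higher first), target, level (deep first), not-before.
    return std::tie(b.urgency, a.target, a.level, a.not_before) <
           std::tie(a.urgency, b.target, b.level, b.not_before);
  }
  // Future: not-before, urgency (higher first), target, level.
  return std::tie(a.not_before, b.urgency, a.target, a.level) <
         std::tie(b.not_before, a.urgency, b.target, b.level);
}
```

Swapping a.urgency and b.urgency inside the tuples is what reverses the sort direction for that one field while keeping the comparison lexicographic.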

commit | commitdiff | tree

Zac Dover [Wed, 21 Aug 2024 11:26:54 +0000 (21:26 +1000)]

Merge pull request #59348 from zdover23/wip-doc-2024-08-20-rados-ops-cache-tiering

doc/rados: document unfound object cache-tiering scenario

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

commit | commitdiff | tree

Gil Bregman [Wed, 21 Aug 2024 05:46:29 +0000 (08:46 +0300)]

Merge pull request #59362 from gbregman/main

mgr/cephadm: change SPDK RPC fields in nvmeof configuration

commit | commitdiff | tree

Yuval Lifshitz [Wed, 21 Aug 2024 05:09:34 +0000 (08:09 +0300)]

Merge pull request #59323 from yuvalif/wip-yuval-67514

test/rgw/notifications: don't check for full queue if topics expired

Reviewed-By: Casey Bodley <cbodley@ibm.com>

commit | commitdiff | tree

NitzanMordhai [Tue, 20 Aug 2024 16:16:25 +0000 (19:16 +0300)]

Merge pull request #54984 from NitzanMordhai/wip-nitzan-restful-un-boundary-keep-requests

mgr/rest: Trim requests array and limit size

commit | commitdiff | tree

Gil Bregman [Tue, 20 Aug 2024 13:29:57 +0000 (16:29 +0300)]

mgr/cephadm: change SPDK RPC fields in nvmeof configuration
Fixes https://tracker.ceph.com/issues/67629

Signed-off-by: Gil Bregman <gbregman@il.ibm.com>

commit | commitdiff | tree

Gil Bregman [Tue, 20 Aug 2024 13:28:12 +0000 (16:28 +0300)]

python-common/ceph/deployment: change SPDK RPC fields in nvmeof configuration
Fixes https://tracker.ceph.com/issues/67629

Signed-off-by: Gil Bregman <gbregman@il.ibm.com>

commit | commitdiff | tree

Zac Dover [Tue, 20 Aug 2024 12:45:29 +0000 (22:45 +1000)]

doc/rados: document unfound object cache-tiering scenario

Explain how to deal with "unfound objects" when restarting OSDs in a
cache-tiered environment.

Fixes: https://tracker.ceph.com/issues/44286
Signed-off-by: Zac Dover <zac.dover@proton.me>

commit | commitdiff | tree

Adam King [Tue, 20 Aug 2024 12:35:44 +0000 (08:35 -0400)]

Merge pull request #58460 from rkachach/fix_issue_oauth2_support

adding support for SSO based on oauth2-proxy

Reviewed-by: Adam King <adking@redhat.com>

commit | commitdiff | tree

Adam King [Tue, 20 Aug 2024 12:20:02 +0000 (08:20 -0400)]

Merge pull request #58860 from adk3798/cephadm-nvmeof-require-group

mgr/cephadm: require "group" parameter in nvmeof specs

Reviewed-by: Redouane Kachach <rkachach@ibm.com>

commit | commitdiff | tree

NitzanMordhai [Tue, 20 Aug 2024 12:07:21 +0000 (15:07 +0300)]

Merge pull request #59165 from NitzanMordhai/wip-nitzan-test-rados-tools-newline-trim

test: test_rados_tools compare output without trimming newline

commit | commitdiff | tree

nmordech@redhat.com [Wed, 21 Feb 2024 10:01:25 +0000 (10:01 +0000)]

doc/mgr/restful: update max_request config

Signed-off-by: Nitzan Mordechai <nmordech@redhat.com>

commit | commitdiff | tree

nmordech@redhat.com [Wed, 21 Feb 2024 09:21:25 +0000 (09:21 +0000)]

PendingReleaseNotes: Adding note about rest module change and adding max_request option

Signed-off-by: Nitzan Mordechai <nmordech@redhat.com>

commit | commitdiff | tree

NitzanMordhai [Tue, 28 Nov 2023 09:52:05 +0000 (09:52 +0000)]

mgr/rest: Trim request array and limit size

Presently, the requests array in the REST module has the potential to grow
indefinitely, leading to excessive memory consumption, particularly when
dealing with lengthy and intricate request results.

To address this issue, a limit will be imposed on the requests array within
the REST module.
This limitation will be governed by the `mgr/restful/x/max_requests` configuration
parameter specific to the REST module.
When submit_request is called, we check whether the requests array exceeds the
max_request option. If it does, we check whether the requests about to be trimmed
have finished, and log an error message if we are trimming unfinished requests.

Fixes: https://tracker.ceph.com/issues/59580
Signed-off-by: Nitzan Mordechai <nmordech@redhat.com>
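
The trimming logic described above can be sketched as follows. The REST module itself is written in Python; this C++ sketch only illustrates the idea, and Request and submit_and_trim() are hypothetical names rather than the module's API.

```cpp
#include <cassert>
#include <cstddef>
#include <deque>
#include <string>
#include <utility>
#include <vector>

// Hypothetical, simplified request record.
struct Request {
  std::string id;
  bool finished;
};

// Append a request, then drop the oldest entries until at most
// 'max_requests' remain. Returns the ids of trimmed requests that had not
// finished, so that the caller can log an error for each of them.
inline std::vector<std::string> submit_and_trim(std::deque<Request>& requests,
                                                Request incoming,
                                                std::size_t max_requests) {
  requests.push_back(std::move(incoming));
  std::vector<std::string> trimmed_unfinished;
  while (requests.size() > max_requests) {
    if (!requests.front().finished) {
      trimmed_unfinished.push_back(requests.front().id);
    }
    requests.pop_front();
  }
  assert(requests.size() <= max_requests);
  return trimmed_unfinished;
}
```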

commit | commitdiff | tree

Ilya Dryomov [Tue, 20 Aug 2024 10:19:23 +0000 (12:19 +0200)]

Merge pull request #59153 from ajarr/wip-67436

rbd: fix CLI output of `rbd group snap info` command when a group snapshot has no member images

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
Reviewed-by: Sunil Angadi <Sunil.Angadi@ibm.com>

commit | commitdiff | tree

Yingxin [Tue, 20 Aug 2024 08:30:57 +0000 (16:30 +0800)]

Merge pull request #59292 from cyx1231st/wip-seastore-revert-decouple-ool-writes

Revert "crimson/os/seastore: wait ool writes in DeviceSubmission phase"

Reviewed-by: Xuehan Xu <xuxuehan@qianxin.com>
Reviewed-by: Myoungwon Oh <myoungwon.oh@samsung.com>

commit | commitdiff | tree

Casey Bodley [Mon, 19 Aug 2024 17:10:57 +0000 (13:10 -0400)]

Merge pull request #59241 from tobias-urdin/openstack-upperconstraints

qa: barbican: restrict python packages with upper-constraints

Reviewed-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

Yuval Lifshitz [Mon, 19 Aug 2024 16:48:29 +0000 (16:48 +0000)]

test/rgw/notifications: don't check for full queue if topics expired

There are other tests for queue length, so we can skip this check
if the test takes too long.
Also remove unnecessary delays from the test.

Fixes: https://tracker.ceph.com/issues/67514?tab=history
Signed-off-by: Yuval Lifshitz <ylifshit@ibm.com>

commit | commitdiff | tree

Yuri Weinstein [Mon, 19 Aug 2024 14:25:47 +0000 (07:25 -0700)]

Merge pull request #58961 from NitzanMordhai/wip-nitzan-dencoder-test-forward-incompat-fix

workunit/dencoder: dencoder test forward incompat fix

Reviewed-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Mon, 19 Aug 2024 14:24:56 +0000 (07:24 -0700)]

Merge pull request #58594 from jamiepryde/isa-xor-raid

erasure-code/isa: Use isa/raid's xor_gen() instead of the region_xor(…

Reviewed-by: Mark Nelson <mnelson@redhat.com>

commit | commitdiff | tree

Tobias Urdin [Thu, 15 Aug 2024 15:17:14 +0000 (17:17 +0200)]

qa: barbican: restrict python packages with upper-constraints

We install barbican by doing a pip install directly on the
cloned git repository, but we don't honor the upper-constraints
from the OpenStack Requirements project, which determines which
versions are supported.

This changes the pip install command that we issue when
installing barbican to honor the requirements for the
version (derived from the branch) that we use; in
this case it's the 2023.1 release upper-constraints [1].

This prevents us from pulling in untested Python packages.

This only updates Barbican because for the Keystone job
we don't issue pip directly but install using tox with the
`venv` environment, which already sets the
constraints by default, as you can see in [2].

[1] https://releases.openstack.org/constraints/upper/2023.1
[2] https://github.com/openstack/keystone/blob/stable/2023.1/tox.ini#L12

Fixes: https://tracker.ceph.com/issues/67444
Signed-off-by: Tobias Urdin <tobias.urdin@binero.com>

commit | commitdiff | tree

Yuval Lifshitz [Mon, 19 Aug 2024 10:37:07 +0000 (13:37 +0300)]

Merge pull request #59239 from yuvalif/wip-yuval-67513

Reviewed-By: Casey Bodley <cbodley@ibm.com>
test/rgw/notification: use real ip address instead of localhost

Based on this comment:
https://tracker.ceph.com/issues/67206#note-6
the address used by the endpoint is now the real IP address of the
host where the test script is running, and not localhost.

We also changed the rabbitmq-server configuration to allow the "guest"
user to connect over a non-localhost address.

Fixes: https://tracker.ceph.com/issues/67206
Signed-off-by: Yuval Lifshitz <ylifshit@ibm.com>

commit | commitdiff | tree

Igor Fedotov [Mon, 19 Aug 2024 09:47:40 +0000 (12:47 +0300)]

Merge pull request #59200 from ifed01/wip-ifed-fix-store-test-col-ref

test/store_test: fix assertions due to unclosed collection refs.

Reviewed-by: Pere Diaz Bou <pere-altea@hotmail.com>

commit | commitdiff | tree

Zac Dover [Mon, 19 Aug 2024 07:21:51 +0000 (17:21 +1000)]

Merge pull request #59256 from zdover23/wip-doc-2024-08-17-cephfs-ceph-dokan-mount-point

doc/cephfs: s/mountpoint/mount point/

Reviewed-by: Jos Collin <jcollin@redhat.com>
Reviewed-by: Kotresh HR <khiremat@redhat.com>

commit | commitdiff | tree

Nizamudeen A [Mon, 19 Aug 2024 05:49:52 +0000 (11:19 +0530)]

Merge pull request #58995 from rhcs-dashboard/fix-66844-main

qa/mgr/dashboard: fix test race condition

Reviewed-by: Nizamudeen A <nia@redhat.com>

commit | commitdiff | tree

Yingxin [Mon, 19 Aug 2024 02:18:32 +0000 (10:18 +0800)]

Merge pull request #59212 from cyx1231st/wip-seastore-more-reports

crimson/os/seastore/cache: report lru usage/in/out with trans and extent type

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Xuehan Xu <xuxuehan@qianxin.com>

commit | commitdiff | tree

Yingxin Cheng [Mon, 19 Aug 2024 01:48:28 +0000 (09:48 +0800)]

Revert "crimson/os/seastore: wait ool writes in DeviceSubmission phase"

This reverts commit c9e423facea79d42f0496264f267adee5d911b87.

The commit starts to submit OOL writes before submitting the journal
write, true, but it cannot guarantee that OOL writes finish before the
journal write.

Thus it is possible that during SeaStore restart, a journal record
appears valid but its dependent OOL records are partially written, which
leads to corruption.

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Anthony D'Atri [Sun, 18 Aug 2024 15:43:00 +0000 (08:43 -0700)]

Merge pull request #59290 from anthonyeleven/mountpoint

doc: Harmonize 'mountpoint'

commit | commitdiff | tree

Anthony D'Atri [Sun, 18 Aug 2024 15:23:39 +0000 (11:23 -0400)]

doc: Harmonize 'mountpoint'

Signed-off-by: Anthony D'Atri <anthonyeleven@users.noreply.github.com>

commit | commitdiff | tree

Zac Dover [Sat, 17 Aug 2024 20:00:23 +0000 (06:00 +1000)]

Merge pull request #59257 from zdover23/wip-doc-2024-08-17-cephfs-mount-point

doc/cephfs: s/mountpoint/mount point/

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

commit | commitdiff | tree

Zac Dover [Sat, 17 Aug 2024 03:44:30 +0000 (13:44 +1000)]

doc/cephfs: s/mountpoint/mount point/

Change the string "mountpoint" to "mount point" in English-language
strings (as opposed to in commands, where the string "mountpoint"
sometimes appears and is correct).

cf. https://github.com/ceph/ceph/pull/58908#discussion_r1697715486 in
which page 345 of The IBM Style Guide is referenced to back up this
change.

This commit alters only English-language text and example commands in
which the string "{mount point}" is meant to be replaced. No commands
meant for cutting-and-pasting have been altered in this commit.

Signed-off-by: Zac Dover <zac.dover@proton.me>

commit | commitdiff | tree

Zac Dover [Sat, 17 Aug 2024 03:37:58 +0000 (13:37 +1000)]

doc/cephfs: s/mountpoint/mount point/

Change the string "mountpoint" to "mount point" in English-language
strings (as opposed to in commands, where the string "mountpoint"
sometimes appears and is correct).

cf. https://github.com/ceph/ceph/pull/58908#discussion_r1697715486
in which page 345 of The IBM Style Guide is referenced to back up this
change.

Signed-off-by: Zac Dover <zac.dover@proton.me>

commit | commitdiff | tree

Venky Shankar [Fri, 16 Aug 2024 16:14:21 +0000 (21:44 +0530)]

Merge pull request #58355 from batrick/ceph-backport-fetchhead

script/ceph-backport: robustness adjustments for local git repo quirks

Reviewed-by: Venky Shankar <vshankar@redhat.com>

commit | commitdiff | tree

Yuval Lifshitz [Thu, 15 Aug 2024 14:34:57 +0000 (14:34 +0000)]

test/rgw/notification: use real ip address instead of localhost

Based on this comment:
https://tracker.ceph.com/issues/67206#note-6
the address used by the endpoint is now the real IP address of the
host where the test script is running, and not localhost.

We also changed the rabbitmq-server configuration to allow the "guest"
user to connect over a non-localhost address.

Fixes: https://tracker.ceph.com/issues/67206
Signed-off-by: Yuval Lifshitz <ylifshit@ibm.com>

commit | commitdiff | tree

Zac Dover [Fri, 16 Aug 2024 09:20:01 +0000 (19:20 +1000)]

Merge pull request #59167 from zdover23/wip-doc-2024-08-12-cephfs-file-layouts

doc/cephfs: improve "layout fields" text

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>

commit | commitdiff | tree

Venky Shankar [Fri, 16 Aug 2024 06:03:26 +0000 (11:33 +0530)]

Merge PR #58896 into main

* refs/pull/58896/head:
client: flush the caps release in filesystem sync

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Dhairya Parmar <dparmar@redhat.com>

commit | commitdiff | tree

Yingxin [Fri, 16 Aug 2024 05:48:27 +0000 (13:48 +0800)]

Merge pull request #59205 from xxhdx1985126/wip-seastore-find-pending-version

crimson/os/seastore/btree: fix minor corner case issue

Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Xiubo Li [Mon, 29 Jul 2024 06:20:41 +0000 (14:20 +0800)]

client: flush the caps release in filesystem sync

We have hit a race between cap releases and a cap revoke request
that causes check_caps() to miss sending a cap revoke ack
to the MDS. The client then depends on the cap release to release
the caps being revoked, which could be delayed for unknown reasons.

In Kclient we have figured out the root cause (RCA) of the race. Here we
need a way to explicitly trigger the cap release flush manually, which
could help get rid of the stuck caps revoke issue.

Fixes: https://tracker.ceph.com/issues/67221
Signed-off-by: Xiubo Li <xiubli@redhat.com>

commit | commitdiff | tree

Laura Flores [Thu, 15 Aug 2024 19:02:36 +0000 (14:02 -0500)]

Merge pull request #58415 from ljflores/wip-tracker-66809

qa/suites/upgrade: ignore PG_AVAILABILITY and MON_DOWN for quincy-x and reef-x upgrade suites

commit | commitdiff | tree

Ivo Almeida [Thu, 15 Aug 2024 17:07:47 +0000 (18:07 +0100)]

Merge pull request #59220 from ivoalmeida/carbon-datatable-cleanups

mgr/dashboard: carbon datatables impr and cleanups

Reviewed-by: Nizamudeen A <nia@redhat.com>

commit | commitdiff | tree

J. Eric Ivancich [Thu, 15 Aug 2024 14:22:09 +0000 (10:22 -0400)]

Merge pull request #59218 from yuvalif/wip-yuval-67525

rgw/notifications: fixing radosgw-admin notification json

Reviewed-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

Matan Breizman [Thu, 15 Aug 2024 11:02:38 +0000 (14:02 +0300)]

Merge pull request #59118 from xxhdx1985126/wip-crimson-backfill-cancellation

crimson/osd/backfill_state: support backfill cancellation

Reviewed-by: Matan Breizman <mbreizma@redhat.com>

commit | commitdiff | tree

Xuehan Xu [Sat, 10 Aug 2024 06:22:09 +0000 (14:22 +0800)]

crimson/osd/backfill_state: support backfilling cancellation

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>

commit | commitdiff | tree

Xuehan Xu [Sat, 10 Aug 2024 06:22:52 +0000 (14:22 +0800)]

crimson/osd/pg_recovery: reset backfill_state when backfill finished

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>

commit | commitdiff | tree

Matan Breizman [Thu, 15 Aug 2024 08:09:23 +0000 (11:09 +0300)]

Merge pull request #57966 from xxhdx1985126/wip-crimson-concurrent-recover-missing

crimson/osd/osd_operations: make the "recover_missing" phase concurrent

Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
Reviewed-by: Matan Breizman <mbreizma@redhat.com>

commit | commitdiff | tree

Matan Breizman [Thu, 15 Aug 2024 08:08:25 +0000 (11:08 +0300)]

Merge pull request #53151 from xxhdx1985126/wip-crimson-backfill-fixes

crimson/osd/backfill_state: fixes two corner cases in backfilling

Reviewed-by: Radosław Zarzyński <rzarzyns@redhat.com>

commit | commitdiff | tree

Yuval Lifshitz [Thu, 15 Aug 2024 08:00:20 +0000 (11:00 +0300)]

Merge pull request #58911 from yuvalif/wip-yuval-67229

test/cls_2pc_queue: prevent list+remove race between consumers

Reviewed-By: Casey Bodley <cbodley@ibm.com>

commit | commitdiff | tree

Yuval Lifshitz [Thu, 15 Aug 2024 07:58:50 +0000 (10:58 +0300)]

Merge pull request #59219 from yuvalif/wip-yuval-50610

doc/rgw/notification: persistent notification queue full behavior

Reviewed-By: Anthony D'Atri <anthony.datri@gmail.com>

commit | commitdiff | tree

Yingxin Cheng [Wed, 14 Aug 2024 05:22:10 +0000 (13:22 +0800)]

crimson/os/seastore/cache: report lru usage/in/out with trans and extent type

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Yingxin Cheng [Wed, 14 Aug 2024 05:20:30 +0000 (13:20 +0800)]

crimson/os/seastore: cleanup periodical reporting

Consolidate time into a single place per SeaStore::Shard.

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Yingxin Cheng [Fri, 9 Aug 2024 08:55:41 +0000 (16:55 +0800)]

crimson/os/seastore/cache/lru: renames

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Yingxin Cheng [Fri, 9 Aug 2024 08:13:48 +0000 (16:13 +0800)]

crimson/os/seastore/cache: refine lru logics

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Yingxin Cheng [Fri, 9 Aug 2024 08:01:39 +0000 (16:01 +0800)]

crimson/os/seastore: move counter_by_extent_t definition

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Yingxin Cheng [Fri, 9 Aug 2024 06:08:38 +0000 (14:08 +0800)]

crimson/os/seastore/seastore_types: unify checks to the extent types

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Anthony D'Atri [Thu, 15 Aug 2024 02:38:51 +0000 (19:38 -0700)]

Merge pull request #59225 from zdover23/wip-doc-2024-08-15-glossary-flapping-osd

doc/glossary: add "flapping OSD"

commit | commitdiff | tree

Casey Bodley [Wed, 14 Aug 2024 18:16:05 +0000 (14:16 -0400)]

Merge pull request #59028 from cbodley/wip-67326

rgw/notify: visit() returns copy of owner string

Reviewed-by: Adam Emerson <aemerson@redhat.com>

commit | commitdiff | tree

Zac Dover [Wed, 14 Aug 2024 18:08:14 +0000 (04:08 +1000)]

doc/glossary: add "flapping OSD"

Add an entry for "Flapping OSD" to the glossary.

Signed-off-by: Zac Dover <zac.dover@proton.me>

commit | commitdiff | tree

Casey Bodley [Wed, 14 Aug 2024 17:47:54 +0000 (13:47 -0400)]

Merge pull request #58448 from cbodley/wip-rgw-lc-async

cls/rgw: define lc ops in terms of ObjectOperation instead of IoCtx

Reviewed-by: Matt Benjamin <mbenjamin@redhat.com>

commit | commitdiff | tree

Ivo Almeida [Wed, 14 Aug 2024 11:15:36 +0000 (12:15 +0100)]

mgr/dashboard: carbon datatables impr and cleanups

Fixes: https://tracker.ceph.com/issues/67544
Fixes: https://tracker.ceph.com/issues/67538
Fixes: https://tracker.ceph.com/issues/67542
Fixes: https://tracker.ceph.com/issues/67545
Fixes: https://tracker.ceph.com/issues/67546

Signed-off-by: Ivo Almeida <ialmeida@redhat.com>

commit | commitdiff | tree

Casey Bodley [Wed, 14 Aug 2024 14:57:44 +0000 (10:57 -0400)]

Merge pull request #58965 from linuxbox2/wip-lcgt-typo

rgwlc: fix typo in getlc (ObjectSizeGreaterThan)

Reviewed-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

Yuval Lifshitz [Wed, 14 Aug 2024 11:02:09 +0000 (11:02 +0000)]

doc/rgw/notification: persistent notification queue full behavior

Fixes: https://tracker.ceph.com/issues/50610
Signed-off-by: Yuval Lifshitz <ylifshit@ibm.com>

commit | commitdiff | tree

Casey Bodley [Wed, 14 Aug 2024 13:14:13 +0000 (09:14 -0400)]

Merge pull request #59169 from cbodley/wip-67464

rgw: revert account-related changes to get_iam_policy_from_attr()

Reviewed-by: Pritha Srivastava <prsrivas@redhat.com>

commit | commitdiff | tree

Ronen Friedman [Wed, 14 Aug 2024 13:12:28 +0000 (16:12 +0300)]

Merge pull request #57888 from liangmingyuanneo/wip-standalone-test-pg-repair

qa/standalone: bugfix for latecy repair after scrub

Reviewed-by: Ronen Friedman <rfriedma@redhat.com>

commit | commitdiff | tree

naman munet [Wed, 14 Aug 2024 12:42:39 +0000 (18:12 +0530)]

Merge pull request #59210 from rhcs-dashboard/multi-cluster-overview-usage-template-fix

mgr/dashboard: fix multi-cluster usage bar error after carbon changes

commit | commitdiff | tree

Redouane Kachach [Tue, 2 Jul 2024 15:28:40 +0000 (17:28 +0200)]

mgr/cephadm: adding oauth2-proxy cephadm service

adding new oauth2-proxy service. The enable_auth flag enables SSO
authentication via the oauth2-proxy service. The user must ensure the
oauth2-proxy service is deployed before enabling this flag in the
mgmt-gateway service.

FQDN related changes: previously, we were obtaining the FQDN using a
call to the Python socket library run inside the container. While this
generally works, the FQDN returned inside a container can sometimes
differ from the one obtained outside the container. This discrepancy
could cause some issues. To ensure consistency, we now use the FQDN
from the inventory, which provides the correct value as recognized on the host.

Signed-off-by: Redouane Kachach <rkachach@ibm.com>

commit | commitdiff | tree

Yuval Lifshitz [Wed, 14 Aug 2024 10:41:18 +0000 (10:41 +0000)]

rgw/notifications: fixing radosgw-admin notification json

Fixes: https://tracker.ceph.com/issues/67525
Signed-off-by: Yuval Lifshitz <ylifshit@ibm.com>

commit | commitdiff | tree

Zac Dover [Wed, 14 Aug 2024 10:09:38 +0000 (20:09 +1000)]

Merge pull request #59168 from zdover23/wip-doc-2024-08-12-cephfs-cache-configuration

doc/cephfs: improve cache-configuration.rst

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>

commit | commitdiff | tree

Yingxin Cheng [Mon, 5 Aug 2024 03:09:14 +0000 (11:09 +0800)]

crimson/os/seastore/cache: pass missing src to touch_extent()

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Yingxin Cheng [Fri, 2 Aug 2024 07:06:38 +0000 (15:06 +0800)]

crimson/os/seastore/cache: cleanup add_extent()

Move add_to_dirty() and touch_extent() out of add_extent(). This removes
duplicated calls to touch_extent() from the on_cache callback.

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Yingxin Cheng [Thu, 1 Aug 2024 08:46:39 +0000 (16:46 +0800)]

crimson/os/seastore/cache: cleanup remove_from_dirty()

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Yingxin Cheng [Thu, 1 Aug 2024 08:44:48 +0000 (16:44 +0800)]

crimson/os/seastore: drop duplicated calls to touch_extent()

The extent is already PRESENT, which means it was already touched in
this transaction.

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Yingxin Cheng [Thu, 1 Aug 2024 08:43:41 +0000 (16:43 +0800)]

crimson/os/seastore/cached_extent: rename primary_ref_list

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Naman Munet [Wed, 14 Aug 2024 07:13:22 +0000 (12:43 +0530)]

mgr/dashboard: fix multi-cluster usage bar error after carbon changes

Fixes: https://tracker.ceph.com/issues/67536

Signed-off-by: Naman Munet <nmunet@redhat.com>

commit | commitdiff | tree

naman munet [Wed, 14 Aug 2024 05:57:02 +0000 (11:27 +0530)]

Merge pull request #59186 from rhcs-dashboard/replace-cluster-capacity-with-usage-bar

mgr/dashboard: replace individual cluster's capacity info with Usage bar in Multi-Cluster

commit | commitdiff | tree

Venky Shankar [Wed, 14 Aug 2024 04:58:09 +0000 (10:28 +0530)]

Merge PR #59025 into main

* refs/pull/59025/head:
tools/rados: Fix extra NL in getxattr

Reviewed-by: Rishabh Dave <ridave@redhat.com>
Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Ronen Friedman <rfriedma@redhat.com>
Reviewed-by: Casey Bodley <cbodley@redhat.com>
Reviewed-by: Gabriel Benhanokh <gbenhano@redhat.com>

commit | commitdiff | tree

Nizamudeen A [Wed, 14 Aug 2024 04:18:04 +0000 (09:48 +0530)]

Merge pull request #58485 from ivoalmeida/carbon-datatable

mgr/dashboard: replace ngx-datatable by carbon datatable

Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>

commit | commitdiff | tree

Xuehan Xu [Wed, 14 Aug 2024 03:00:00 +0000 (11:00 +0800)]

crimson/os/seastore/btree: fix minor corner case issue

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>

commit | commitdiff | tree

Laura Flores [Tue, 13 Aug 2024 22:58:17 +0000 (17:58 -0500)]

qa/suites/upgrade: ignore MON_DOWN and PG_AVAILABILITY warnings in upgrade tests

Signed-off-by: Laura Flores <lflores@ibm.com>

commit | commitdiff | tree

Ramana Raja [Sun, 11 Aug 2024 02:18:07 +0000 (22:18 -0400)]

rbd: fix CLI output of `rbd group snap info` command

... when a group snapshot has no member images.

A group snapshot can be created with no member images. For such a group
snapshot, omit the 'image snap' and 'images' fields from the
unformatted CLI output of `rbd group snap info` command so as to not
confuse the user. In the librbd C/C++ data structures representing a
group snapshot with no member images, set the 'image_snap_name' data
member to an empty string.

Fixes: https://tracker.ceph.com/issues/67436
Signed-off-by: Ramana Raja <rraja@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Tue, 13 Aug 2024 19:27:42 +0000 (21:27 +0200)]

Merge pull request #58140 from guits/cv-tpm2-support

ceph-volume: add TPM2 token enrollment support for encrypted OSDs

commit | commitdiff | tree

Guillaume Abrioux [Tue, 13 Aug 2024 19:18:33 +0000 (21:18 +0200)]

Merge pull request #58956 from ThomasLamprecht/ceph-volume-debian-dependency

debian pkg: record python3-packaging dependency for ceph-volume

commit | commitdiff | tree

Casey Bodley [Tue, 13 Aug 2024 17:06:32 +0000 (13:06 -0400)]

qa/s3tests: configure tenant name for 's3 tenant' section

Signed-off-by: Casey Bodley <cbodley@redhat.com>
