git.apps.os.sepia.ceph.com Git

]> git.apps.os.sepia.ceph.com Git - ceph.git/log

Kamoltat [Wed, 31 Jan 2024 15:25:10 +0000 (15:25 +0000)]

src/osd/OSDMap.cc: Fix encoder to produce same bytestream

Fixes: https://tracker.ceph.com/issues/63389
Signed-off-by: Kamoltat <ksirivad@redhat.com>
(cherry picked from commit a3bdffb4e0366af8288258539f17bafeae17e3c4)

commit | commitdiff | tree

Zac Dover [Wed, 27 Mar 2024 13:33:03 +0000 (23:33 +1000)]

Merge pull request #56494 from zdover23/wip-doc-2024-03-27-backport-56450-to-squid

squid: docs/rados: remove incorrect ceph command

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

commit | commitdiff | tree

Nizamudeen A [Wed, 27 Mar 2024 09:08:24 +0000 (14:38 +0530)]

Merge pull request #56442 from rhcs-dashboard/wip-65102-squid

squid: mgr/dashboard: rm warning/error threshold for cpu usage

Reviewed-by: afreen23 <NOT@FOUND>

commit | commitdiff | tree

Nizamudeen A [Wed, 27 Mar 2024 04:37:32 +0000 (10:07 +0530)]

Merge pull request #56464 from aaSharma14/wip-64608-squid

squid: mgr/dashboard: replace piechart plugin charts with native pie chart panel

Reviewed-by: Nizamudeen A <nia@redhat.com>

commit | commitdiff | tree

Taha Jahangir [Mon, 25 Mar 2024 10:02:12 +0000 (13:32 +0330)]

docs/rados: remove incorrect ceph command

The removed line was (incorrectly changed) output of the previous command.

Signed-off-by: Taha Jahangir <mtjahangir@gmail.com>
(cherry picked from commit 3cd39e3582ed830f86c55d2203ece72fa0f9426b)

commit | commitdiff | tree

Yuri Weinstein [Tue, 26 Mar 2024 18:08:28 +0000 (11:08 -0700)]

Merge pull request #56226 from cbodley/wip-64949-squid

squid: rgw/multisite: add x-rgw-replicated-at header to replicated objects

Reviewed-by: Shilpa Jagannath <smanjara@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Tue, 26 Mar 2024 18:07:52 +0000 (11:07 -0700)]

Merge pull request #55625 from rzarzynski/wip-end2end-tracing-squid

squid: tracer/osd/librados/build/rgw: rgw and osd end2end tracing using open…

Reviewed-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

Nizamudeen A [Tue, 26 Mar 2024 14:46:28 +0000 (20:16 +0530)]

Merge pull request #56358 from Svelar/wip-65027-squid

squid: mgr/dashboard/frontend:Ceph dashboard supports multiple languages

Reviewed-by: Nizamudeen A <nia@redhat.com>

commit | commitdiff | tree

Nizamudeen A [Tue, 26 Mar 2024 14:34:03 +0000 (20:04 +0530)]

Merge pull request #56386 from rhcs-dashboard/wip-65069-squid

squid: mgr/dashboard: fix subvolume group edit size

Reviewed-by: Nizamudeen A <nia@redhat.com>

commit | commitdiff | tree

Nizamudeen A [Tue, 26 Mar 2024 14:20:19 +0000 (19:50 +0530)]

Merge pull request #56473 from rhcs-dashboard/wip-65138-squid

squid: mgr/dashboard: add system users to rgw user form

Reviewed-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>
Reviewed-by: afreen23 <NOT@FOUND>

commit | commitdiff | tree

Anthony D'Atri [Tue, 26 Mar 2024 14:06:15 +0000 (10:06 -0400)]

Merge pull request #56465 from zdover23/wip-doc-2024-03-26-backport-56436-to-squid

squid: doc/start: improve MDS explanation

commit | commitdiff | tree

Guillaume Abrioux [Tue, 26 Mar 2024 12:42:45 +0000 (13:42 +0100)]

Merge pull request #56258 from guits/wip-64945-squid

squid: ceph-volume: add missing import

commit | commitdiff | tree

Venky Shankar [Tue, 26 Mar 2024 10:50:08 +0000 (16:20 +0530)]

Merge pull request #56035 from vshankar/wip-64779-squid

squid: mds,cephfs_mirror: add labelled per-client and replication metrics

Reviewed-by: Yuri Weinstein <yuriw@redhat.com>

commit | commitdiff | tree

Pedro Gonzalez Gomez [Fri, 22 Mar 2024 14:20:48 +0000 (15:20 +0100)]

mgr/dashboard: add system users to rgw user form

Fixes: https://tracker.ceph.com/issues/65074
Signed-off-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>
(cherry picked from commit b658bb9c32ad31608921e77603003c004ab0f15b)

commit | commitdiff | tree

Zac Dover [Sun, 24 Mar 2024 21:19:24 +0000 (07:19 +1000)]

doc/start: improve MDS explanation

Improve the explanation of MDS as requested by Anthony D'Atri here:

https://github.com/ceph/ceph/pull/56367#discussion_r1536667249

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit e7eff3cf8738ddd378c2fd70075ac61dbac3cf55)

commit | commitdiff | tree

Zac Dover [Tue, 26 Mar 2024 08:28:31 +0000 (18:28 +1000)]

Merge pull request #56434 from zdover23/wip-doc-2024-03-25-backport-56287-to-squid

squid: mon, doc: overriding ec profile requires --yes-i-really-mean-it

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

commit | commitdiff | tree

Aashish Sharma [Mon, 26 Feb 2024 09:32:19 +0000 (15:02 +0530)]

mgr/dashboard: replace piechart plugin charts with native pie chart
panel

Fixes: https://tracker.ceph.com/issues/64579
Signed-off-by: Aashish Sharma <aasharma@redhat.com>
(cherry picked from commit 6e5efb626f4b3503a12371a8f609fdb5175a3ebe)

commit | commitdiff | tree

Nizamudeen A [Tue, 26 Mar 2024 07:02:01 +0000 (12:32 +0530)]

Merge pull request #56451 from rhcs-dashboard/wip-65112-squid

squid: mgr/dashboard: sanitize dashboard user creation

Reviewed-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>
Reviewed-by: afreen23 <NOT@FOUND>

commit | commitdiff | tree

Venky Shankar [Tue, 26 Mar 2024 06:06:45 +0000 (11:36 +0530)]

Merge pull request #56123 from xhernandez/wip-64738-squid

squid: client: fix leak of file handles

Reviewed-by: Venky Shankar <vshankar@redhat.com>

commit | commitdiff | tree

Patrick Donnelly [Mon, 25 Mar 2024 18:00:17 +0000 (14:00 -0400)]

Merge PR #56391 into squid

* refs/pull/56391/head:
qa/suites/fs/nfs: use standard health ignorelist

Reviewed-by: Leonid Usov <leonid.usov@ibm.com>

commit | commitdiff | tree

Patrick Donnelly [Mon, 25 Mar 2024 17:59:55 +0000 (13:59 -0400)]

Merge PR #56395 into squid

* refs/pull/56395/head:
qa: change log-whitelist to log-ignorelist

Reviewed-by: Leonid Usov <leonid.usov@ibm.com>

commit | commitdiff | tree

Patrick Donnelly [Mon, 25 Mar 2024 17:59:23 +0000 (13:59 -0400)]

Merge PR #56397 into squid

* refs/pull/56397/head:
mds: ensure snapclient is synced before corruption check

Reviewed-by: Leonid Usov <leonid.usov@ibm.com>

commit | commitdiff | tree

Patrick Donnelly [Mon, 25 Mar 2024 17:59:04 +0000 (13:59 -0400)]

Merge PR #56405 into squid

* refs/pull/56405/head:
qa: test fuse/kclient for mds upgrade seq
qa: ignore OSD_DOWN during cephadm upgrades
qa: ignore warning "Replacing daemon"
qa: ignore MDS_INSUFFICIENT_STANDBY
qa: remove ignorelist error parenthesis
mds: update comment on kclient decoding of MDSMap
qa: do upgrades from quincy and older reef minor releases
mds: reverse MDSMap encoding of max_xattr_size/bal_rank_mask

Reviewed-by: Leonid Usov <leonid.usov@ibm.com>

commit | commitdiff | tree

Patrick Donnelly [Mon, 25 Mar 2024 17:58:13 +0000 (13:58 -0400)]

Merge PR #56406 into squid

* refs/pull/56406/head:
doc/dev: update quiesce developer document
qa: wrap quiesce verification to dump debugging on error
qa: update quiesce tests for control via locallock
qa: set archive path in vstart_runner
qa: refactor CephFSMount.kill_background to optionally kill all background jobs
qa: use kwarg for rank parameter
qa: simplify calls to (rank|mds)_(tell|asok)
Revert "pybind/mgr/volumes: block quiesce for critical .meta file"
mds: remove is_root indication on quiesce_inode op
mds: prevent new lock cache cons when invalidating an existing one
mds: use XLOCK_WAIT For local lock xlockers
mds: prevent new wrlocks on LocalLock if there exists any xlock waiter
mds: block import discover when parent directory inode is quiesced
mds: avoid issuing exclusive caps to clients lacking w caps
mds: print lock cache during invalidation
mds: use inodeno_t to track quiesce requests
mds: dispatch quiesce_inode ops after dir traversal
mds: remove quiescelock handling for SimpleLock type
mds: quiescelock as local lock + cap masking
qa: run quiesce unit tests in fs:functional
qa: add quiesce protocol unit tests
qa: detect partial migrations during large config of dist epin
qa: use stdin-killer to timeout run_shell_payload
qa: simplify run_shell argument processing
doc: add dev docs for quiesce protocol
pybind/mgr/volumes: block quiesce for critical .meta file
mds: add vxattr to block quiesce on an inode
mds: convert encoded ephemeral dist pin to flags
mds: add counter to throttle quiesce
mds: add quiesce set feature flag
mds: skip non-head inodes for quiesce
mds: add quiesce op
mds: print all SimpleLock flags in debug output
mds: pretty print mutation when dumping lock
mds: add new inode quiescelock
mds: use 128 bits for waiters on MDSCacheObject
mds: provide mechanism to authpin while freezing
mds: add command to get specific op
mds: finish request before completing internal req
mds: complete internal op if killed
mds: avoid killing dead requests
mds: add command to kill request
mds: add path argument to `ops` and `dump tree` to stream result to local file
mds: print internal_request filepaths if present
mds: add more information to debug message
mds: remove redundant parenthesis
mds: implement Mutation::dump method
mds: make LockType fields const
mds: annotate mdr with try_rdlock_snap_layout failure
mds: refactor if into switch
mds: call Locker method using this
mds: simplify assert
mds: dump locks passed to Locker::acquire_locks
mds: add LockOp::print method for debugging
mds: use new insert template via print
mds: add request result to mutation for analysis by tests
mds: add comment on locking order rules
mds: allow specifying rdlock position
mds: remove dead method
common: provide a template for object dumps
common: support long running ops without slow warnings
common: simplify loop
common: add JSONFormatterFile class
common: use more efficient vector for stack
include: use larger int for large gathers

Reviewed-by: Leonid Usov <leonid.usov@ibm.com>

commit | commitdiff | tree

Patrick Donnelly [Mon, 25 Mar 2024 17:57:14 +0000 (13:57 -0400)]

Merge PR #56407 into squid

* refs/pull/56407/head:
qa/cephfs: stop ignoring MON_DOWN globally
qa: extend mon timeout coming up after mondb creation
qa: update dashboard schema for mon_status
mon: do not log MON_DOWN if monitor uptime is less than threshold

Reviewed-by: Leonid Usov <leonid.usov@ibm.com>

commit | commitdiff | tree

Patrick Donnelly [Mon, 25 Mar 2024 17:56:52 +0000 (13:56 -0400)]

Merge PR #56416 into squid

* refs/pull/56416/head:
qa/cephfs: add mgr debugging

Reviewed-by: Leonid Usov <leonid.usov@ibm.com>

commit | commitdiff | tree

Pedro Gonzalez Gomez [Thu, 21 Mar 2024 07:56:58 +0000 (08:56 +0100)]

mgr/dashboard: sanitize dashboard user creation

Fixes: https://tracker.ceph.com/issues/65030
Signed-off-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>
(cherry picked from commit 9951a3a7f314c7cd1eb147fb1b398ac6c5212806)

commit | commitdiff | tree

Nizamudeen A [Mon, 25 Mar 2024 09:51:45 +0000 (15:21 +0530)]

Merge pull request #56440 from rhcs-dashboard/wip-65109-squid

squid: mgr/dashboard: fix empty tags

Reviewed-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>
Reviewed-by: afreen23 <NOT@FOUND>

commit | commitdiff | tree

Nizamudeen A [Tue, 19 Mar 2024 14:57:13 +0000 (20:27 +0530)]

mgr/dashboard: rm warning/error threshold for cpu usage

for multi-core cpu's the value can be more than 100% so it doesn't make
sense to show warning/error when the usage is at or more than 100%.
hence removing it

Fixes: https://tracker.ceph.com/issues/58838
Signed-off-by: Nizamudeen A <nia@redhat.com>
(cherry picked from commit 131a70754a6bfbbb2d1481ba74436953c6ecc226)

commit | commitdiff | tree

Pedro Gonzalez Gomez [Thu, 14 Mar 2024 06:58:36 +0000 (07:58 +0100)]

mgr/dashboard: fix empty tags

Fixes: https://tracker.ceph.com/issues/64911
Signed-off-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>
(cherry picked from commit a60f82566551e731987b4a02d1bb2bf4cfb9a9ce)

commit | commitdiff | tree

Venky Shankar [Mon, 25 Mar 2024 04:40:51 +0000 (10:10 +0530)]

Merge pull request #56018 from vshankar/wip-64755-squid

squid: qa: ignore container checkpoint/restore related selinux denials for centos9

Reviewed-by: Yuri Weinstein <yuriw@redhat.com>
Reviewed-by: Venky Shankar <vshankar@redhat.com>

commit | commitdiff | tree

Nizamudeen A [Mon, 25 Mar 2024 04:39:18 +0000 (10:09 +0530)]

Merge pull request #56418 from rhcs-dashboard/wip-65077-squid

squid: Snapshot schedule show subvolume path

Reviewed-by: Ankush Behl <cloudbehl@gmail.com>

commit | commitdiff | tree

Radoslaw Zarzynski [Tue, 19 Mar 2024 13:43:10 +0000 (13:43 +0000)]

mon, doc: overriding ec profile requires --yes-i-really-mean-it

This is per https://tracker.ceph.com/issues/64333#note-17 describing
driving factors of a catastrophic cluster failure.

Signed-off-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
(cherry picked from commit 629ba7bd349d48cdaa6d094751e7cfce651ba2bc)

commit | commitdiff | tree

Anthony D'Atri [Sun, 24 Mar 2024 13:27:16 +0000 (09:27 -0400)]

Merge pull request #56425 from zdover23/wip-doc-2024-08-24-backport-56367-to-squid

squid: doc/start: improve MDS explanation

commit | commitdiff | tree

Zac Dover [Sun, 24 Mar 2024 07:32:06 +0000 (17:32 +1000)]

Merge pull request #56409 from zdover23/wip-doc-2024-03-23-backport-56377-to-squid

squid: doc/start: link to mon map command

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

commit | commitdiff | tree

Zac Dover [Thu, 21 Mar 2024 12:48:10 +0000 (22:48 +1000)]

doc/start: improve MDS explanation

Remove a prolix "on behalf of" and streamline some sentences explaining
the benefits of MDS.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 2da1812a1de7e32395cb95c2e0ebeea689c1b810)

commit | commitdiff | tree

Ivo Almeida [Wed, 20 Mar 2024 10:35:52 +0000 (10:35 +0000)]

mgr/dashboard: show full subvol path snap schedule

Show full subvolume path when adding snap schedule

Fixes: https://tracker.ceph.com/issues/65000
Signed-off-by: Ivo Almeida <ialmeida@redhat.com>
(cherry picked from commit b6dbf0f24d189d5b75148f07f1dc7f332d23b64a)

commit | commitdiff | tree

Ivo Almeida [Fri, 22 Mar 2024 19:30:24 +0000 (19:30 +0000)]

Merge pull request #56388 from rhcs-dashboard/wip-64960-squid

squid: mgr/dashboard: fixed nfs attach command

Reviewed-by: Ankush Behl <cloudbehl@gmail.com>

commit | commitdiff | tree

Patrick Donnelly [Tue, 19 Mar 2024 15:01:48 +0000 (11:01 -0400)]

qa/cephfs: add mgr debugging

Fixes: https://tracker.ceph.com/issues/64985
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit d2d844b60be5b0881844cecf7b0a54124061152b)

commit | commitdiff | tree

Zac Dover [Thu, 21 Mar 2024 22:12:16 +0000 (08:12 +1000)]

doc/start: link to mon map command

Link to the "ceph mon stat" command when "Intro to Ceph" document first
mentions Monitor Maps.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 5d300db825316db25706111bbe577d41d41ab4b3)

commit | commitdiff | tree

Nizamudeen A [Fri, 22 Mar 2024 16:11:22 +0000 (21:41 +0530)]

Merge pull request #56214 from rhcs-dashboard/applitools-disable-squid

squid: mgr/dashboard: disable applitools e2e

Reviewed-by: ivoalmeida <NOT@FOUND>

commit | commitdiff | tree

Patrick Donnelly [Mon, 18 Mar 2024 18:32:01 +0000 (14:32 -0400)]

qa/cephfs: stop ignoring MON_DOWN globally

This should hopefully no longer be necessary with the addition of
mon_down_uptime_grace.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit ae96ed4afb288fbc5a7ed818ba556c21418c631c)

Conflicts:
qa/cephfs/overrides/ignorelist_wrongly_marked_down.yaml: already ignored

commit | commitdiff | tree

Patrick Donnelly [Tue, 19 Mar 2024 00:38:08 +0000 (20:38 -0400)]

qa: extend mon timeout coming up after mondb creation

Fixes: https://tracker.ceph.com/issues/64968
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit 57b9e161f44d2c685b4a0ab68aff1b43bfd63e66)

commit | commitdiff | tree

Patrick Donnelly [Mon, 18 Mar 2024 19:55:04 +0000 (15:55 -0400)]

qa: update dashboard schema for mon_status

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit 23de8e318f56b3052873fdd3744d313423e871bf)

commit | commitdiff | tree

Patrick Donnelly [Mon, 18 Mar 2024 15:03:23 +0000 (11:03 -0400)]

mon: do not log MON_DOWN if monitor uptime is less than threshold

Fixes: https://tracker.ceph.com/issues/64968
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit 4182362c1195f3f322ae50dc0054518e62fea73a)

commit | commitdiff | tree

Patrick Donnelly [Fri, 16 Feb 2024 18:25:52 +0000 (13:25 -0500)]

qa: test fuse/kclient for mds upgrade seq

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit 182f4c0f54385ab15ba13e1f06c1146ba16fd64b)

commit | commitdiff | tree

Patrick Donnelly [Fri, 16 Feb 2024 15:09:53 +0000 (10:09 -0500)]

qa: ignore OSD_DOWN during cephadm upgrades

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit 4bcaaa45eb45f12df2d1835831c01fe0eb6ca595)

commit | commitdiff | tree

Patrick Donnelly [Fri, 16 Feb 2024 14:46:18 +0000 (09:46 -0500)]

qa: ignore warning "Replacing daemon"

This is expected for cephadm deployments where join_fs is configured, causing
affinity replacements.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit 75d76f97b0d7c842bd52cb609b00ffe3508c01b5)

commit | commitdiff | tree

Patrick Donnelly [Fri, 16 Feb 2024 01:23:32 +0000 (20:23 -0500)]

qa: ignore MDS_INSUFFICIENT_STANDBY

This is expected when bringing a volume and its mds up initially.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit 560300f1c5ea44a3c81d87f3bf9d10727622901c)

commit | commitdiff | tree

Patrick Donnelly [Fri, 16 Feb 2024 01:16:20 +0000 (20:16 -0500)]

qa: remove ignorelist error parenthesis

Some messages are duplicated to the cluster log lookign like:

2024-02-15T22:54:31.244 INFO:teuthology.orchestra.run.smithi033.stdout:2024-02-15T22:50:00.000263+0000 mon.smithi033 (mon.0) 558 : cluster 4 [ERR] MDS_ALL_DOWN: 1 filesystem is offline

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit 0e5e847f087cde893cc49ed3434c78006055d0b5)

commit | commitdiff | tree

Patrick Donnelly [Thu, 15 Feb 2024 19:31:47 +0000 (14:31 -0500)]

mds: update comment on kclient decoding of MDSMap

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit 427ad7c0f93d374db45b86073f2c7787d2e8cb9a)

commit | commitdiff | tree

Patrick Donnelly [Thu, 15 Feb 2024 02:49:17 +0000 (21:49 -0500)]

qa: do upgrades from quincy and older reef minor releases

Fixes: https://tracker.ceph.com/issues/64441
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit 52c09aa1e50e3d9f36909354c43da89dec8f9bb5)

commit | commitdiff | tree

Patrick Donnelly [Thu, 15 Feb 2024 15:28:32 +0000 (10:28 -0500)]

mds: reverse MDSMap encoding of max_xattr_size/bal_rank_mask

Commit e134c890 adds the bal_rank_mask with encoded (ev) version 17.  This was
merged into main Oct 2022 and made it into the reef release normally.

Commit 7b8def5c adds the max_xattr_size also with encoded (ev) version 17 but
places it before bal_rank_mask. This is problematic as there were no plans to
backport e134c890 to quincy or pacific so piggybacking on the ev 17 bump would
not work and otherwise would require the backports to be done as a set to
ensure consistency (including with the kernel client).

However, the real issue is that 7b8def5c was not merged until after reef was
already cut. This required 7b8def5c to be backported separately in [1] which
was not merged until after v18.2.1 (current reef HEAD as of this commit).
Ultimately, this means that there are reef versions (v18.2.[01]) in the wild
which expect bal_rank_mask to be encoded at ev17 and not (max_xattr_size,
bal_rank_mask). Adding to the complications, the kernel client has already
merged code [2] expecting max_xattr_size for ev17.

It was decided in a github discussion [3] to move bal_rank_mask to ev18 to
avoid updating the kernel client which was done in the main branch via 36ee8e7e
and update the reef max_xattr_size backport with the same change (d8cebd67).

Unfortunately, this breaks upgrades v18.2.[01] to newer reef versions or to
main.  The reason is that monitors will encode v17 with bal_rank_mask
(max_xattr_size is not merged yet) and send that to upgraded mgrs (which are
upgraded first). The mgr will attempt to decode bal_rank_mask as a uint64_t
(max_xattr_size) but fail because an empty (by default) bal_rank_mask is simply
encoded as a signed 32-bit integer. Consequently, the mgr will fail decoding
with:

    failed to decode message of type 45 v1: End of buffer [buffer:2]

Of course the problem does not stop there, even if the mgr were able to handle
this, the monitors/mds/clients would fail in similar fashion.

So the only choice left is to fix max_xattr_size to be encoded at ev18.
Fortunately, v18.2.2 has not been released nor has any max_xattr_size backport
to quincy/pacific been merged. The main downside will be that kernels will
wrongly decode ev17 (which is already true for ceph clusters running
v18.2.[01]). A follow-up kernel fix will be required.

[1] https://tracker.ceph.com/issues/59405
[2] linux.git d93231a6bc8a452323d5fef16cca7107ce483a27
[3] https://github.com/ceph/ceph/pull/53340#discussion_r1399255031

Fixes: https://tracker.ceph.com/issues/64440
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit 78abfeaff27fee343fb664db633de5b221699a73)

commit | commitdiff | tree

Patrick Donnelly [Thu, 14 Mar 2024 18:29:58 +0000 (14:29 -0400)]

doc/dev: update quiesce developer document

To include changes relating to it now being a local lock that prevents mutable
caps.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit 719d30d2774ab05bd9f92b7902487aec859c5d99)

commit | commitdiff | tree

Patrick Donnelly [Fri, 8 Mar 2024 03:50:54 +0000 (22:50 -0500)]

qa: wrap quiesce verification to dump debugging on error

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit 48766b336d6d21051221f3cd33a5dd9b97df7740)

commit | commitdiff | tree

Patrick Donnelly [Tue, 27 Feb 2024 20:27:02 +0000 (15:27 -0500)]

qa: update quiesce tests for control via locallock

- Verify multirank quiesce incl. caps issued.
- Unset splitauth experimental

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit c0f0997e17e5639499bde09a82e8fbeef95ce006)

commit | commitdiff | tree

Patrick Donnelly [Sun, 3 Mar 2024 02:19:58 +0000 (21:19 -0500)]

qa: set archive path in vstart_runner

For storing misc. test artifacts.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit 561965131eefb092163fa000fda7da9afcdcfcd4)

commit | commitdiff | tree

Patrick Donnelly [Sat, 2 Mar 2024 02:51:36 +0000 (21:51 -0500)]

qa: refactor CephFSMount.kill_background to optionally kill all background jobs

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit bb71023ced2d97a2590b98c4b820e7af6f1801b3)

commit | commitdiff | tree

Patrick Donnelly [Thu, 14 Mar 2024 14:37:12 +0000 (10:37 -0400)]

qa: use kwarg for rank parameter

Otherwise it gets included in the *args list. This is necessary after commit
`qa: simplify calls to (rank|mds)_(tell|asok)`.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit 71fd9a34a911745957b8dbc4a904ab37616678b8)

commit | commitdiff | tree

Patrick Donnelly [Sat, 2 Mar 2024 02:49:53 +0000 (21:49 -0500)]

qa: simplify calls to (rank|mds)_(tell|asok)

Instead of requiring the caller to put the arguments in a list, allow passing
as regular arguments.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit 464ef51525d673d38187bb9a841c4802c0681e0f)

commit | commitdiff | tree

Patrick Donnelly [Tue, 5 Mar 2024 19:22:53 +0000 (14:22 -0500)]

Revert "pybind/mgr/volumes: block quiesce for critical .meta file"

This reverts commit 16e50abbd42a4b4f740816ba5f60bc259a6a96bf.

This flag is no longer necessary as the volumes plugin issues quiesce calls
against the data (i.e. root) directory of the subvolume rather than the
subvolume directory (with its associated .meta file).

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit 92f1ed3ed1fdba8a292dd4f625acd2b23bd6fdec)

commit | commitdiff | tree

Patrick Donnelly [Tue, 19 Mar 2024 21:08:54 +0000 (17:08 -0400)]

mds: remove is_root indication on quiesce_inode op

This is no longer necessary with the change to a LocalLock quiescelock.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit 667c9a9c5af8d342ce6da5e942efb219b67d980b)

commit | commitdiff | tree

Patrick Donnelly [Tue, 5 Mar 2024 17:27:00 +0000 (12:27 -0500)]

mds: prevent new lock cache cons when invalidating an existing one

The previous scheme invalidated a lock cache and then immediately removed it
from its Capability list. The lock cache would eventually be deleted but a new
one could be constructed shortly after. The main reason for this is that simply
invalidating the lock cache does not drive a state change in the local locks
preventing new writers. This is mostly important for acquiring the quiescelock.

This commit also corrects a bug where a MDLockCache would be created for a
given opcode type (like create) when the capability does not have the issued
cap (CEPH_CAP_DIR_CREATE). The bug would not cause any negative side-effects
but would hold locks unnecessarily when only MDS ops (and not the client
executing ops asynchronously) are acquiring the locks.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit 9b4a93f2b7b902e56a41f7e50fd21b8046134ef6)

commit | commitdiff | tree

Patrick Donnelly [Mon, 4 Mar 2024 20:44:45 +0000 (15:44 -0500)]

mds: use XLOCK_WAIT For local lock xlockers

This avoids waking up all waiters when only WAIT_XLOCK waiters should wake.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit f813b8c511a50a7fab673c2b99587f00fb506323)

commit | commitdiff | tree

Patrick Donnelly [Tue, 5 Mar 2024 17:19:42 +0000 (12:19 -0500)]

mds: prevent new wrlocks on LocalLock if there exists any xlock waiter

Otherwise, an xlock waiter can become starved as a LocalLock supports multiple
writers.

Strictly speaking, a new lock state would be appropriate for this but we cheat
frequently with the LocalLock -- there is only one state. All transition checks
are already manually performed by the Locker.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit bddd3c72014dc7aa4bb4f16e4633111a0f2e0bb0)

commit | commitdiff | tree

Patrick Donnelly [Sat, 2 Mar 2024 03:01:48 +0000 (22:01 -0500)]

mds: block import discover when parent directory inode is quiesced

This is to prevent two racing ranks quiescing some root from exporting a tree
under a completely quiesced directory (inode). The state of that imported tree
may take time to quiesce and cause the root to be QUIESCED before all inodes
under it are actually quiesced.

If a dirfrag to be imported is discovered before the parent is quiesced, then
the quiesce traversal will issue a quiesce_inode op normally for parent which
will attempt to authpin the parent. That will block if the export is still
in-progress (causing quiesce to wait for the export to finish or abort).

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit 1ec6817cc082b379fba86bcd0a73ff950d7d1152)

commit | commitdiff | tree

Patrick Donnelly [Tue, 27 Feb 2024 20:24:26 +0000 (15:24 -0500)]

mds: avoid issuing exclusive caps to clients lacking w caps

Signed-off-by: Leonid Usov <leonid.usov@ibm.com>
(cherry picked from commit 772f04afdf87d25de8102b7f6aa7e0a242cd566e)

commit | commitdiff | tree

Patrick Donnelly [Sun, 3 Mar 2024 03:22:19 +0000 (22:22 -0500)]

mds: print lock cache during invalidation

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit d86a5c2ab76b740da235aeb1f5a994a84bb3515f)

commit | commitdiff | tree

Patrick Donnelly [Thu, 7 Mar 2024 19:40:58 +0000 (14:40 -0500)]

mds: use inodeno_t to track quiesce requests

If a CInode is removed from cache before the quiesce_inode request can process
it (and pin it in cache), a new CInode may be created with the same address.
That pointer still exists in MutationImpl::quiesce_ops and would prevent
issuing a quiesce_inode op for the new inode.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit 1fa9c25d70c666259c300fc95e54ec70e8fd35a5)

commit | commitdiff | tree

Patrick Donnelly [Thu, 7 Mar 2024 19:30:35 +0000 (14:30 -0500)]

mds: dispatch quiesce_inode ops after dir traversal

Mostly to avoid the possibility of the CDir::items changing during iteration
when calling sub-ops.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit c90cc2d04bb2d6d4e558fe268bcd241ea7723880)

commit | commitdiff | tree

Patrick Donnelly [Tue, 5 Mar 2024 19:13:42 +0000 (14:13 -0500)]

mds: remove quiescelock handling for SimpleLock type

This is no longer necessary with the conversion to LocalLock.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit 7e75a9e96343b2acf8a2c77b71563dadbdefd37b)

commit | commitdiff | tree

Patrick Donnelly [Tue, 27 Feb 2024 20:17:28 +0000 (15:17 -0500)]

mds: quiescelock as local lock + cap masking

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Signed-off-by: Leonid Usov <leonid.usov@ibm.com>
(cherry picked from commit 7fa8bc8b29f22f3cbe3a15c34f86003ed7c73088)

commit | commitdiff | tree

Patrick Donnelly [Tue, 13 Feb 2024 17:37:07 +0000 (12:37 -0500)]

qa: run quiesce unit tests in fs:functional

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Fixes: https://tracker.ceph.com/issues/63664
(cherry picked from commit 51a04479726776c2852159fb9cfe28e60fe74d54)

commit | commitdiff | tree

Patrick Donnelly [Wed, 24 Jan 2024 02:26:46 +0000 (21:26 -0500)]

qa: add quiesce protocol unit tests

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Fixes: https://tracker.ceph.com/issues/63664
(cherry picked from commit 7cf14fb10a528761dd3139d813834207c8233145)

commit | commitdiff | tree

Patrick Donnelly [Sat, 17 Feb 2024 15:26:14 +0000 (10:26 -0500)]

qa: detect partial migrations during large config of dist epin

This method would wrongly "succeed" when looking for setup of distributed
ephemerally pinned directory fragments. If the migrator splits a subtree during
the course of migration (to reduce the migration size) then the operation may
not actually be complete.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit a1ac3e03a0227fc386f4341fc55b9a2c2a58d0bc)

commit | commitdiff | tree

Patrick Donnelly [Wed, 24 Jan 2024 02:25:35 +0000 (21:25 -0500)]

qa: use stdin-killer to timeout run_shell_payload

- simplify argument processing / forwarding
- use stdin-killer to kill all sub-processes of the shell
- do not needlessly use run_shell to execute the command as it adds a timeout
to the stdout/stderr processing
- provide a stdin (PIPE) by default otherwise teuthology's code closes stdin
and triggers stdin-killer to timeout the shell.
- use a 15 minute timeout by default

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit 4cfc5b802f4dcc4dec563b6dbc662feefb8ae951)

commit | commitdiff | tree

Patrick Donnelly [Thu, 7 Mar 2024 03:21:24 +0000 (22:21 -0500)]

qa: simplify run_shell argument processing

No functional changes.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit c3707b99c38ab153149684488276c76be6186d8f)

commit | commitdiff | tree

Patrick Donnelly [Thu, 11 Jan 2024 03:08:13 +0000 (22:08 -0500)]

doc: add dev docs for quiesce protocol

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit 51c9b86bf1f7be41f568ef1594b305133cd8b118)

commit | commitdiff | tree

Patrick Donnelly [Fri, 26 Jan 2024 21:37:15 +0000 (16:37 -0500)]

pybind/mgr/volumes: block quiesce for critical .meta file

This file must remain accessible during quiesce to allow snapshots to be taken
on a subvolume.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit 0308f215ae7a6392de4dbd658c6d4334c1c98e9d)

commit | commitdiff | tree

Patrick Donnelly [Fri, 26 Jan 2024 21:36:53 +0000 (16:36 -0500)]

mds: add vxattr to block quiesce on an inode

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit e2529fc74808b102b1ca85a3ee6644160da32e60)

commit | commitdiff | tree

Patrick Donnelly [Fri, 26 Jan 2024 21:34:41 +0000 (16:34 -0500)]

mds: convert encoded ephemeral dist pin to flags

To use space efficiently, convert this field to a 8 bit flags field that is
backward compatible.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit fb05dd1806fc0edf2e4c0908aae114d57099a6d6)

commit | commitdiff | tree

Patrick Donnelly [Wed, 21 Feb 2024 15:03:14 +0000 (10:03 -0500)]

mds: add counter to throttle quiesce

So a storm of quiesce operations do not affect normal MDS operations.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit f4eec857fc8bda4980b9046cde6314fa005122c2)

commit | commitdiff | tree

Patrick Donnelly [Tue, 31 Oct 2023 17:27:16 +0000 (13:27 -0400)]

mds: add quiesce set feature flag

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Fixes: https://tracker.ceph.com/issues/63664
(cherry picked from commit af073d2fba48a80a533e2a1be2f21716d56aa26c)

commit | commitdiff | tree

Patrick Donnelly [Wed, 21 Feb 2024 19:08:13 +0000 (14:08 -0500)]

mds: skip non-head inodes for quiesce

These are accessible only via snapshots.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit 92b5bcb44c0eacbc4c449a418350c0ff0506b7b5)

commit | commitdiff | tree

Patrick Donnelly [Tue, 14 Nov 2023 18:17:57 +0000 (13:17 -0500)]

mds: add quiesce op

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Fixes: https://tracker.ceph.com/issues/63664
(cherry picked from commit 06916dd2a7f30874d29b03e0cd998370cf595f58)

commit | commitdiff | tree

Patrick Donnelly [Tue, 27 Feb 2024 20:18:21 +0000 (15:18 -0500)]

mds: print all SimpleLock flags in debug output

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit 3980454db49cbd70c42a502b832262552d9c9c74)

commit | commitdiff | tree

Patrick Donnelly [Wed, 10 Jan 2024 19:08:11 +0000 (14:08 -0500)]

mds: pretty print mutation when dumping lock

Now looks like:

... (iquiesce xlock x=1 by request(mds.0:2785 nref=6)) ...

instead of:

... (iquiesce xlock x=1 by 0xdeadbeef)) ...

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit 7c59c3749021e9113dfb777c197f70709f416ce6)

commit | commitdiff | tree

Patrick Donnelly [Tue, 9 Jan 2024 19:04:45 +0000 (14:04 -0500)]

mds: add new inode quiescelock

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit a3e541f9992e5e264a8d9ecffd69787d722f88a0)

commit | commitdiff | tree

Patrick Donnelly [Tue, 20 Feb 2024 22:08:32 +0000 (17:08 -0500)]

mds: use 128 bits for waiters on MDSCacheObject

Adding a new inode lock will overflow inode wait bits into the MDSCacheObject
wait bits. Make space for the quiescelock.

This includes a minor refactor to no longer attempt scoping the set of masks we
test in MDSCacheObject::waiting when calling MDSCacheObject::is_waiter_for.
This optimization wasn't worth the overhead and would be awkard to keep as
std::bitset cannot be used as a key for a std::multimap (easily). Instead, we
use the sequence number as a key which helps us to avoid allocating another map
whenever we call MDSCacheObject::take_waiting.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit 84b33ea9fe6268adfaca1e17b282e68788e38bf4)

commit | commitdiff | tree

Patrick Donnelly [Tue, 13 Feb 2024 16:07:26 +0000 (11:07 -0500)]

mds: provide mechanism to authpin while freezing

When a subtree is freezing, it's no longer possible to acquire new authpins.
This is a problem when a compound request like quiescing a subtree is trying to
acquire authpins for each sub-op. This creates a situation where some quiesce
sub-ops complete with authpins (thereby preventing the tree from becoming
"frozen") and new sub-ops cannot acquire authpins (because the tree is
"freezing"). To circumvent this, allow some authpin requests to proceed if
FLAG_BYPASSFREEZING is set.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit a9964a7ccc4394f923fb0f1c76eb8fa03fe8733d)

commit | commitdiff | tree

Patrick Donnelly [Tue, 9 Jan 2024 18:55:48 +0000 (13:55 -0500)]

mds: add command to get specific op

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit 3197a975a521654500173175c6e856c54df497ef)

commit | commitdiff | tree

Patrick Donnelly [Tue, 13 Feb 2024 21:07:37 +0000 (16:07 -0500)]

mds: finish request before completing internal req

Importantly, this marks the request dead so it cannot be killed.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit c944eadaf6987a812e738459214872af1ff7678a)

commit | commitdiff | tree

Patrick Donnelly [Tue, 14 Nov 2023 18:50:53 +0000 (13:50 -0500)]

mds: complete internal op if killed

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit ba3dac94e6a90ed76970fef89b1a82dcedb4ec2b)

commit | commitdiff | tree

Patrick Donnelly [Tue, 13 Feb 2024 20:58:19 +0000 (15:58 -0500)]

mds: avoid killing dead requests

This is important when you have:

* sub-op A finishes and resumes parent
* parent kills all sub-ops (including sub-op A)

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit b72e153d2b36956e97b8686aeb6aaacffe928750)

commit | commitdiff | tree

Patrick Donnelly [Mon, 13 Nov 2023 19:23:54 +0000 (14:23 -0500)]

mds: add command to kill request

Fixes: https://tracker.ceph.com/issues/63374
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit 9956f9ad78a3b6ebe520dcd10e33a59c361c4d91)

commit | commitdiff | tree

Patrick Donnelly [Sat, 17 Feb 2024 15:23:43 +0000 (10:23 -0500)]

mds: add path argument to `ops` and `dump tree` to stream result to local file

This file can be collected during testing to avoid sending a large JSON result
over the `ceph tell`/asok interface.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit a1303a52af7504c3748a75ade9619c7a6c454831)

commit | commitdiff | tree

Patrick Donnelly [Wed, 31 Jan 2024 19:28:14 +0000 (14:28 -0500)]

mds: print internal_request filepaths if present

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit eaea93e05018294e893e0e5241b3293e10971f5e)

commit | commitdiff | tree

Patrick Donnelly [Tue, 14 Nov 2023 18:19:47 +0000 (13:19 -0500)]

mds: add more information to debug message

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit f8a26a61305ba8b9fbb9ce47b5221e3adb754c9b)

commit | commitdiff | tree

Patrick Donnelly [Sat, 14 Oct 2023 00:27:02 +0000 (20:27 -0400)]

mds: remove redundant parenthesis

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit 1513327c1a55614ac7e2f1a6da0de06e5fd2214d)

commit | commitdiff | tree

Patrick Donnelly [Tue, 14 Nov 2023 14:38:03 +0000 (09:38 -0500)]

mds: implement Mutation::dump method

So we can dump it from asok commands.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit ba7d1f5f2a2e4355155e00173b0d2ade562a8945)

Unnamed repository; edit this file 'description' to name the repository.

RSS Atom