git.apps.os.sepia.ceph.com Git - ceph-ci.git/log
19 months ago debian/cephadm.postinst: stop using adduser --gecos
Kefu Chai [Sat, 3 Feb 2024 05:46:05 +0000 (13:46 +0800)]
debian/cephadm.postinst: stop using adduser --gecos

the --gecos option of adduser is deprecated in debian/bookworm, and
will be removed in debian/trixie, see
https://manpages.debian.org/bookworm/adduser/adduser.8.en.html.
so to be future-proof, let's switch to `usermod --comment`. please
note that we still need to support ubuntu/jammy, which is used in
our CI, and the `adduser` shipped by ubuntu/jammy does not support
`--comment` yet, so we cannot use `adduser --comment` instead.

Signed-off-by: Kefu Chai <tchaikov@gmail.com>
(cherry picked from commit e74ec0b27dfb480a2951f85bf00072e95d944772)
(cherry picked from commit 5a236031951a2a6f8b358a25b9238dd1e8281f10)

19 months ago debian/cephadm.postinst: replace tab with spaces
Kefu Chai [Wed, 17 Jan 2024 15:52:48 +0000 (23:52 +0800)]
debian/cephadm.postinst: replace tab with spaces

for better readability, and to be more consistent with the rest
of this file, and other .postinst scripts of this project.

Signed-off-by: Kefu Chai <tchaikov@gmail.com>
(cherry picked from commit 013eebf5357666cf2509a37eecf69ef4ac349321)
(cherry picked from commit 7cff98556ecc407da17d62ca26b6989d1a2837d2)

19 months ago debian/cephadm.postinst: specify --home when adduser
Kefu Chai [Wed, 17 Jan 2024 15:47:39 +0000 (23:47 +0800)]
debian/cephadm.postinst: specify --home when adduser

quote from adduser/NEWS.Debian.gz:

>  System user home defaults to /nonexistent if --home is not specified.
>  Packages that call adduser to create system accounts should explicitly
>  specify a location for /home (see Lintian check
>  maintainer-script-lacks-home-in-adduser).

so let's follow this change in adduser. otherwise "cephadm"
would have a $HOME at `/nonexistent`.

Fixes: https://tracker.ceph.com/issues/64069
Signed-off-by: Kefu Chai <tchaikov@gmail.com>
(cherry picked from commit 90bc0369243077c2aaf67f0de2bab5810b217f4e)
(cherry picked from commit 81c71165defaeedaf127263b7e615938df493b8e)

19 months ago debian/cephadm.postinst: wrap command line lines which is too long
Kefu Chai [Wed, 17 Jan 2024 15:44:23 +0000 (23:44 +0800)]
debian/cephadm.postinst: wrap command line lines which is too long

for better readability.

Signed-off-by: Kefu Chai <tchaikov@gmail.com>
(cherry picked from commit c517d0b0e95b20cd9bd979405d0248b9419e5c85)
(cherry picked from commit 044bb8a92514c66c24393e184db3b70298c62cb3)

19 months ago debian/ceph-common.postinst: set user directory using adduser
Kefu Chai [Wed, 17 Jan 2024 15:36:12 +0000 (23:36 +0800)]
debian/ceph-common.postinst: set user directory using adduser

now that adduser allows us to set the user's home directory, we can do
this using adduser instead of usermod. this change also
silences the warning from lintian
"maintainer-script-lacks-home-in-adduser". lintian complains if
`adduser --system` is called without passing `--home` option.
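
The rule that lintian enforces here can be imitated with a much-simplified
Python sketch. This is not lintian's implementation; the function name, the
regex, and the example home directory are illustrative assumptions.

```python
import re


def lacks_home_in_adduser(script: str) -> bool:
    """Return True if any `adduser --system` command in `script` has no --home.

    Hypothetical, simplified imitation of the lintian check named in the
    commit message; it only looks at plain command lines.
    """
    # Join backslash-continued lines so each command sits on one logical line.
    joined = re.sub(r"\\\n", " ", script)
    for line in joined.splitlines():
        if re.search(r"\badduser\b", line) and "--system" in line:
            if "--home" not in line:
                return True
    return False
```

A maintainer script that passes `--home` on a continuation line is accepted,
while one that omits the option is flagged.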

also, take this opportunity to s/-c/--comment/ in the command line
of `usermod`, for better readability.

Fixes: https://tracker.ceph.com/issues/64069
Signed-off-by: Kefu Chai <tchaikov@gmail.com>
(cherry picked from commit a1a857698cad4f2feeea0b56fd781350e44078b7)
(cherry picked from commit 776a089b09c70ee45f2c84eec05279d6826abf15)

19 months ago debian/control: add adduser to Depends of cephadm and ceph-common
Kefu Chai [Wed, 17 Jan 2024 15:09:02 +0000 (23:09 +0800)]
debian/control: add adduser to Depends of cephadm and ceph-common

in `debian/ceph-common.postinst` and `debian/cephadm.postinst`, we
use `adduser --system` to create the system user when configuring
the corresponding package.

before this change, the dependency was not listed in the runtime
`Depends` section of ceph-common and cephadm.

in this change, the dependency is added. this is also suggested
by Securing Debian Manual, see
https://www.debian.org/doc/manuals/securing-debian-manual/bpp-lower-privs.en.html

Signed-off-by: Kefu Chai <tchaikov@gmail.com>
(cherry picked from commit 2a30ddccd04b204821f0496856847e8ffc07835d)
(cherry picked from commit eeaaeeadcdff5fff0732d6f4e89b9d6f7b626c2f)

19 months ago backport mgr/prometheus: fix orch check to prevent Prometheus crash
Redouane Kachach [Thu, 8 Feb 2024 07:25:37 +0000 (08:25 +0100)]
backport mgr/prometheus: fix orch check to prevent Prometheus crash
https://tracker.ceph.com/issues/64339

Signed-off-by: Redouane Kachach <rkachach@redhat.com>
(cherry picked from commit de7da4a77ddbd33bbe050c52c4c5bbbb6ab1919f)

19 months ago src/osd/OSDMap.cc: Fix encoder to produce same bytestream
Kamoltat [Wed, 31 Jan 2024 15:25:10 +0000 (15:25 +0000)]
src/osd/OSDMap.cc: Fix encoder to produce same bytestream

Fixes: https://tracker.ceph.com/issues/63389
Signed-off-by: Kamoltat <ksirivad@redhat.com>
(cherry picked from commit a3bdffb4e0366af8288258539f17bafeae17e3c4)
(cherry picked from commit e9880fefda543d9d785dba89fe90c5e5074bb62a)

22 months ago 18.2.1 v18.2.1
Ceph Release Team [Mon, 11 Dec 2023 21:55:38 +0000 (21:55 +0000)]
18.2.1

Signed-off-by: Ceph Release Team <ceph-maintainers@ceph.io>

22 months ago ceph-volume: fix mpath device support
Guillaume Abrioux [Wed, 6 Sep 2023 09:30:41 +0000 (09:30 +0000)]
ceph-volume: fix mpath device support

commit [1] broke mpath devices support in `disk.is_device()`

[1] https://github.com/ceph/ceph/commit/4fc6bc394dffaf3ad375ff29cbb0a3eb9e4dbefc

Fixes: https://tracker.ceph.com/issues/62722
Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>
(cherry picked from commit 28a12f8ef703a7349d9f2de40251c22e018fc475)
(cherry picked from commit 1e7223281fa044c9653633e305c0b344e4c9b3a4)

22 months ago os/bluestore: get rid off resulting lba alignment in allocators
Igor Fedotov [Fri, 15 Sep 2023 17:35:13 +0000 (20:35 +0300)]
os/bluestore: get rid off resulting lba alignment in allocators

Fixes: https://tracker.ceph.com/issues/62815
Signed-off-by: Igor Fedotov <igor.fedotov@croit.io>
(cherry picked from commit b0cb41a761dd7c32bde8e29eb12d44519a2988a5)
(cherry picked from commit a585ac27e151e2ffe1e0a347d1a09fb4c752e809)

22 months ago os/bluestore: adjust and validate bluefs_shared_alloc_size
Igor Fedotov [Wed, 13 Sep 2023 13:51:16 +0000 (16:51 +0300)]
os/bluestore: adjust and validate bluefs_shared_alloc_size

Make sure it's in sync with bluestore_min_alloc_size, meaning it's
greater than or equal to it and properly aligned.
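
A minimal Python sketch of such a check, assuming that "properly aligned"
means being a multiple of bluestore_min_alloc_size. The function name and the
round-up adjustment are assumptions made for illustration; the actual
handling lives in BlueStore and may differ.

```python
def adjust_shared_alloc_size(shared_alloc_size: int, min_alloc_size: int) -> int:
    """Return a size that is >= min_alloc_size and a multiple of it.

    Hypothetical helper; not the BlueStore code.
    """
    if min_alloc_size <= 0:
        raise ValueError("min_alloc_size must be positive")
    if shared_alloc_size < min_alloc_size:
        # Too small: raise it to the minimum allocation unit.
        return min_alloc_size
    remainder = shared_alloc_size % min_alloc_size
    if remainder:
        # Misaligned: round up to the next multiple of min_alloc_size.
        return shared_alloc_size + (min_alloc_size - remainder)
    return shared_alloc_size
```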

Fixes: https://tracker.ceph.com/issues/63618
Signed-off-by: Igor Fedotov <igor.fedotov@croit.io>
(cherry picked from commit 334e7fa0879512790e89d1f4d30cd6aaa81f28fa)
(cherry picked from commit f7b8b4d14b2baba0aef5563f4e0abe9d780d290a)

22 months ago qa/cephfs: try usual umount if client blocklisting cannot be inferred
Venky Shankar [Thu, 30 Nov 2023 05:31:05 +0000 (11:01 +0530)]
qa/cephfs: try usual umount if client blocklisting cannot be inferred

Signed-off-by: Venky Shankar <vshankar@redhat.com>
(cherry picked from commit 3d0a927ecf530ba7ef1d2636afc282af7c44435c)

22 months ago qa: don't fail on gather_mount_info() failure
Venky Shankar [Tue, 28 Nov 2023 05:44:19 +0000 (11:14 +0530)]
qa: don't fail on gather_mount_info() failure

kernel 5.4 (Ubuntu 20.04) is missing the following commits:

5a9e2f5d5590 ceph: add ceph.{cluster_fsid/client_id} vxattrs
247b1f19dbeb ceph: add status debugfs file

fs suite relies on these debugfs entries to gather mount information
(client-id, addr/inst) which are required by some tests. In fs suite,
the distro kernel gets overridden by the testing kernel and therefore
even if Ubuntu 20.04 is chosen as the distro, the testing kernel is
installed. However, with smoke suite, the distro kernel is used and
the missing patches cause certain essential information gathering
(client-id, etc.) to fail early on, so the test does not even start
execution. PR #54515 fixes a bug in the client-id fetching path but
isn't complete due to the missing patches - details here:

        https://tracker.ceph.com/issues/63488#note-8

But it's essential to have the smoke tests running since those tests
have lately uncovered bugs in the MDS (w/ distro kernels). In order
to benefit from those tests, this change ignores failures when
gathering mount information (which isn't used by the fs relevant
smoke tests). The tests (in fs suite) that rely on this piece of
information would fail when run with the 20.04 distro kernel (but the
fs suite overrides it with the testing kernel).
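
The idea of tolerating a failed information-gathering step can be sketched in
Python as below. The wrapper name and its callable argument are hypothetical;
the real change is in the qa code around gather_mount_info() and may be
structured differently.

```python
import logging

log = logging.getLogger(__name__)


def gather_mount_info_tolerant(gather):
    """Call `gather` and return its result, or None if it raises.

    Hypothetical wrapper: a failure is logged and swallowed so that tests
    which do not need the mount information can still start.
    """
    try:
        return gather()
    except Exception as e:
        log.warning("failed to gather mount info, continuing: %s", e)
        return None
```

Tests that do depend on the gathered values would then see None and fail
later, which matches the trade-off described in the message.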

Signed-off-by: Venky Shankar <vshankar@redhat.com>
(cherry picked from commit dde5bf4d4ce08f9f0d46b3bb709578561f580179)

22 months ago qa: add centos_latest (9.stream) and ubuntu_20.04 yamls to supported-all-distro
Venky Shankar [Mon, 27 Nov 2023 05:12:02 +0000 (10:42 +0530)]
qa: add centos_latest (9.stream) and ubuntu_20.04 yamls to supported-all-distro

A bug in Ceph MDS (MDS crash!) is seen with distros using a not-so-recent kernel
(5.4ish). This crash was first seen in quincy smoke run and the problematic
backport change was reverted. The smoke suite chooses a random distro for each
job, so to hit this bug, the appropriate distro needs to be (randomly) chosen.

This change points the smoke suite to run against all supported distros.

This affects suites that point to supported-all-distro (powercycle) since it
bloats up the number of jobs. E.g., currently, without --subset, powercycle:osd

          INFO:teuthology.suite.run:0/336 jobs were filtered out.

vs

(with this change)

          Unable to schedule 560 jobs, too many jobs, when maximum 500 jobs allowed.

For smoke suite

          INFO:teuthology.suite.run:Scheduled 24 jobs in total.

vs

(with this change)

          INFO:teuthology.suite.run:Scheduled 120 jobs in total.

Eventually, with PR #46882, the testing kernel will no longer override the
distro kernel in fs suite, so we should get good coverage then.

Signed-off-by: Venky Shankar <vshankar@redhat.com>
(cherry picked from commit 800333abd2a2e17ecf22ce61f66530d5317a790b)

22 months ago qa: point ubuntu_latest.yaml to latest (22.04)
Venky Shankar [Mon, 27 Nov 2023 05:10:43 +0000 (10:40 +0530)]
qa: point ubuntu_latest.yaml to latest (22.04)

Signed-off-by: Venky Shankar <vshankar@redhat.com>
(cherry picked from commit aa0b81920b89d2e6f61602516712c086ae8275ec)

22 months ago qa/smoke: run smoke suite with all supported distros
Venky Shankar [Mon, 27 Nov 2023 04:58:39 +0000 (10:28 +0530)]
qa/smoke: run smoke suite with all supported distros

Signed-off-by: Venky Shankar <vshankar@redhat.com>
(cherry picked from commit 6f8d9435b96fc5346942a7fcab34ecb9e1671f51)

22 months ago MClientRequest: handle owner_uid and owner_gid from ceph_mds_request_head_legacy
Alexander Mikhalitsyn [Mon, 23 Oct 2023 12:37:44 +0000 (14:37 +0200)]
MClientRequest: handle owner_uid and owner_gid from ceph_mds_request_head_legacy

When a client is too old and uses struct ceph_mds_request_head_legacy we must
fill new owner_uid and owner_gid fields from an old client_uid and client_gid.
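
As a rough Python illustration of this fallback (the real code is C++ in
MClientRequest): the field names follow the wording of this message, while
the class, its layout, and the use of None for "not sent by the client" are
assumptions.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class RequestHead:
    """Simplified, hypothetical stand-in for a decoded request head."""
    client_uid: int
    client_gid: int
    owner_uid: Optional[int] = None  # None: decoded from a legacy head
    owner_gid: Optional[int] = None


def fill_owner_ids(head: RequestHead) -> RequestHead:
    """Fill the owner fields from the old fields when a legacy client omitted them."""
    if head.owner_uid is None:
        head.owner_uid = head.client_uid
    if head.owner_gid is None:
        head.owner_gid = head.client_gid
    return head
```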

Fixes: https://github.com/ceph/ceph/pull/52575
Fixes: https://tracker.ceph.com/issues/63288
Fixes: commit 46cb244b9c839 ("ceph_fs.h: add separate owner_{u,g}id fields")
Signed-off-by: Alexander Mikhalitsyn <aleksandr.mikhalitsyn@canonical.com>
(cherry picked from commit a70a70f589214d6e2a5b477a61005b13ba2fec46)
(cherry picked from commit 65257baa62eddac0cc3df9d2ca3a57e7fd2b25e2)

22 months ago MClientRequest: handle ext_num_retry and ext_num_fwd from ceph_mds_request_head_legacy
Alexander Mikhalitsyn [Mon, 23 Oct 2023 12:34:26 +0000 (14:34 +0200)]
MClientRequest: handle ext_num_retry and ext_num_fwd from ceph_mds_request_head_legacy

When a client is too old and uses struct ceph_mds_request_head_legacy we must
fill new ext_num_retry and ext_num_fwd fields from an old num_retry and num_fwd.

Fixes: https://github.com/ceph/ceph/pull/45669
Fixes: https://tracker.ceph.com/issues/63288
Fixes: commit cbd7e3040208 ("ceph_fs.h: add 32 bits extended num_retry and num_fwd support")
Signed-off-by: Alexander Mikhalitsyn <aleksandr.mikhalitsyn@canonical.com>
(cherry picked from commit 43f32a46aa9095b19525357ba7ca215e842b4f77)
(cherry picked from commit 312bb5b9f1ada9646205a78f0a0fcc73d2530d5c)

23 months ago Merge pull request #54521 from guits/wip-63554-reef
Guillaume Abrioux [Thu, 16 Nov 2023 14:33:55 +0000 (15:33 +0100)]
Merge pull request #54521 from guits/wip-63554-reef

reef: ceph-volume: fix a regression in `raw list`

23 months ago ceph-volume: fix a regression in `raw list`
Guillaume Abrioux [Wed, 15 Nov 2023 15:03:10 +0000 (16:03 +0100)]
ceph-volume: fix a regression in `raw list`

'ceph-volume raw list' is broken for a specific use case (rook).

rook copies devices from /dev/ to /mnt for specific/internal needs.

when ceph-volume raw list is passed a device from /mnt then
ceph-volume ignores it and returns an empty dict.
That prevents rook from creating OSDs properly.
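
This message does not say how the fix works. The Python sketch below only
illustrates, under that caveat, how a check based on the path prefix would
skip a device node copied to /mnt, whereas inspecting the file type would
not. Both function names are hypothetical and neither is ceph-volume code.

```python
import os
import stat


def prefix_only_check(path: str) -> bool:
    """A naive filter that accepts only paths under /dev/."""
    return path.startswith("/dev/")


def is_block_device(path: str) -> bool:
    """Return True if `path` exists and is a block device node, wherever it lives."""
    try:
        mode = os.stat(path).st_mode
    except OSError:
        # Missing or unreadable path: treat it as "not a device".
        return False
    return stat.S_ISBLK(mode)
```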

https://tracker.ceph.com/issues/63545

Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>
(cherry picked from commit e67dce7120954fff9b7b936348bed0b2ac6c1155)

23 months ago Merge pull request #54509 from zdover23/wip-doc-2023-11-15-backport-54497-to-reef
Anthony D'Atri [Wed, 15 Nov 2023 13:40:10 +0000 (08:40 -0500)]
Merge pull request #54509 from zdover23/wip-doc-2023-11-15-backport-54497-to-reef

reef: doc/glossary: add "Quorum" to glossary

23 months ago doc/glossary: add "Quorum" to glossary
Zac Dover [Tue, 14 Nov 2023 13:40:42 +0000 (23:40 +1000)]
doc/glossary: add "Quorum" to glossary

Add the term "Quorum" to the glossary and link to the part of
architecture.rst concerning Monitors. The sticky header at the top of
the docs.ceph.com website gets in the way of the location linked to in
this commit, but fatigue and disgust prevent me from spending time today
trial-and-erroring my way through the hostile and ill-documented
wilderness of scroll-margin so that the link goes where it should.

Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit c2f6a770bf0e12296c334d99ac86ff4732ec29b7)

23 months ago Merge pull request #54502 from zdover23/wip-doc-2023-11-15-backport-54496-to-reef
Anthony D'Atri [Tue, 14 Nov 2023 19:07:38 +0000 (14:07 -0500)]
Merge pull request #54502 from zdover23/wip-doc-2023-11-15-backport-54496-to-reef

reef: doc/rados: edit "troubleshooting-mon"

23 months ago doc/rados: edit "troubleshooting-mon"
Zac Dover [Tue, 14 Nov 2023 13:03:28 +0000 (23:03 +1000)]
doc/rados: edit "troubleshooting-mon"

Edit the text in the "Initial Troubleshooting" section of
doc/rados/troubleshooting/troubleshooting-mon.rst.

Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit fa8129a7466b3b48b56124eaf1d9ca8e8fd51fbc)

23 months ago Merge pull request #54480 from zdover23/wip-doc-2023-11-14-backport-54479-to-reef
Anthony D'Atri [Mon, 13 Nov 2023 16:57:51 +0000 (11:57 -0500)]
Merge pull request #54480 from zdover23/wip-doc-2023-11-14-backport-54479-to-reef

reef: doc/start: fix typo in hardware-recommendations.rst

23 months ago doc/start: fix typo in hardware-recommendations.rst
Anthony D'Atri [Mon, 13 Nov 2023 13:58:44 +0000 (08:58 -0500)]
doc/start: fix typo in hardware-recommendations.rst

Signed-off-by: Anthony D'Atri <anthonyeleven@users.noreply.github.com>
(cherry picked from commit 48fd7ff1e13ccfeca04b9283b13641ce59316723)

23 months ago Merge pull request #54477 from zdover23/wip-doc-2023-11-13-backport-54473-to-reef
Anthony D'Atri [Mon, 13 Nov 2023 16:38:44 +0000 (11:38 -0500)]
Merge pull request #54477 from zdover23/wip-doc-2023-11-13-backport-54473-to-reef

reef: doc/rados: format "initial troubleshooting"

23 months ago doc/rados: format "initial troubleshooting"
Zac Dover [Mon, 13 Nov 2023 10:57:07 +0000 (20:57 +1000)]
doc/rados: format "initial troubleshooting"

Format the steps in the "Initial Troubleshooting" section of
doc/rados/troubleshooting/troubleshooting-mon.rst. A near-future PR (not
this one) will add context to this section and explain that the steps
described here are the first steps that you should undertake when you
determine that you have an unresponsive or down Monitor. This PR is
merely for formatting.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit c581018caf626fa0dd50bd244766bfa9755c9a16)

23 months ago Merge pull request #54224 from rkachach/fix_issue_63266
Adam King [Mon, 13 Nov 2023 13:40:32 +0000 (08:40 -0500)]
Merge pull request #54224 from rkachach/fix_issue_63266

reef: backport of rook orchestrator fixes and e2e automated testing

Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: Juan Miguel Olmo Martínez <jolmomar@redhat.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>

23 months ago Merge pull request #54463 from zdover23/wip-doc-2023-11-13-backport-54457-to-reef
Anthony D'Atri [Sun, 12 Nov 2023 16:58:24 +0000 (11:58 -0500)]
Merge pull request #54463 from zdover23/wip-doc-2023-11-13-backport-54457-to-reef

reef: doc/config: edit "ceph-conf.rst"

23 months ago Merge pull request #54461 from zdover23/wip-doc-2023-11-12-backport-54458-to-reef
Anthony D'Atri [Sun, 12 Nov 2023 16:56:56 +0000 (11:56 -0500)]
Merge pull request #54461 from zdover23/wip-doc-2023-11-12-backport-54458-to-reef

reef: doc/rados: parallelize t-mon headings

23 months ago doc/config: edit "ceph-conf.rst"
Zac Dover [Sun, 12 Nov 2023 10:21:41 +0000 (20:21 +1000)]
doc/config: edit "ceph-conf.rst"

Edit the first section of doc/rados/configuration/ceph-conf.rst.
Initially I just wanted to change "series" to "set", but once I got my
hands dirty I ended up simplifying some sentences.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit c886e05404ce9c50454d40e692ababdc55f346c6)

23 months ago doc/rados: parallelize t-mon headings
Zac Dover [Sun, 12 Nov 2023 10:52:09 +0000 (20:52 +1000)]
doc/rados: parallelize t-mon headings

Give parallel structure to the questions in the Q&A section of the "The
Cluster Has Quorum But At Least One Monitor Is Down" subsection of the
"Most Common Monitor Issues" section of
doc/rados/troubleshooting/troubleshooting-mon.rst.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 010e5a13b9e4e6a860c676f1b3090a495592d715)

23 months ago Merge pull request #54455 from zdover23/wip-doc-2023-11-11-backport-54452-to-reef
Anthony D'Atri [Sat, 11 Nov 2023 13:35:24 +0000 (08:35 -0500)]
Merge pull request #54455 from zdover23/wip-doc-2023-11-11-backport-54452-to-reef

reef: doc/rados: edit t-mon "common issues" (5 of x)

23 months ago doc/rados: edit t-mon "common issues" (5 of x)
Zac Dover [Sat, 11 Nov 2023 03:45:42 +0000 (13:45 +1000)]
doc/rados: edit t-mon "common issues" (5 of x)

Edit the fifth part of the section "Most Common Monitor Issues" in
doc/rados/troubleshooting/troubleshooting-mon.rst.

Follows https://github.com/ceph/ceph/pull/54440.

Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit d98222db9b4436841102d23c979f05a9d958bb1f)

23 months ago Merge pull request #54443 from zdover23/wip-doc-2023-11-10-backport-54440-to-reef
Anthony D'Atri [Thu, 9 Nov 2023 23:47:11 +0000 (18:47 -0500)]
Merge pull request #54443 from zdover23/wip-doc-2023-11-10-backport-54440-to-reef

reef: doc/rados: edit t-mon "common issues" (4 of x)

23 months ago doc/rados: edit t-mon "common issues" (4 of x)
Zac Dover [Thu, 9 Nov 2023 19:50:21 +0000 (05:50 +1000)]
doc/rados: edit t-mon "common issues" (4 of x)

Edit the fourth part of the section "Most Common Monitor Issues" in
doc/rados/troubleshooting/troubleshooting-mon.rst.

Follows https://github.com/ceph/ceph/pull/54432.

Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit f6b686ac3118507baab719f1587c8c360499e929)

23 months ago Merge pull request #54438 from zdover23/wip-doc-2023-11-10-backport-54432-to-reef
Anthony D'Atri [Thu, 9 Nov 2023 21:02:03 +0000 (16:02 -0500)]
Merge pull request #54438 from zdover23/wip-doc-2023-11-10-backport-54432-to-reef

reef: doc/rados: edit t-mon "common issues" (3 of x)

23 months ago doc/rados: edit t-mon "common issues" (3 of x)
Zac Dover [Thu, 9 Nov 2023 10:20:20 +0000 (20:20 +1000)]
doc/rados: edit t-mon "common issues" (3 of x)

Edit the third part of the section "Most Common Monitor Issues" in
doc/rados/troubleshooting/troubleshooting-mon.rst.

Follows https://github.com/ceph/ceph/pull/54417.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 91ed10bc734bd93605b60c87037393d2704a16bc)

23 months ago Merge pull request #54436 from ceph/wip-yuriw-quincy-x-reef
Yuri Weinstein [Thu, 9 Nov 2023 18:46:09 +0000 (10:46 -0800)]
Merge pull request #54436 from ceph/wip-yuriw-quincy-x-reef

qa/tests: added - \(POOL_APP_NOT_ENABLED\) to the ignore list

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
Reviewed-by: Prashant D <pdhange@redhat.com>

23 months ago qa/tests: added - \(POOL_APP_NOT_ENABLED\) to the ignore list
Yuri Weinstein [Thu, 9 Nov 2023 16:05:38 +0000 (08:05 -0800)]
qa/tests: added - \(POOL_APP_NOT_ENABLED\) to the ignore list

Signed-off-by: Yuri Weinstein <yweinste@redhat.com>

23 months ago Merge pull request #54429 from guits/wip-63491-reef
Guillaume Abrioux [Thu, 9 Nov 2023 13:59:47 +0000 (14:59 +0100)]
Merge pull request #54429 from guits/wip-63491-reef

reef: ceph-volume: Revert "ceph-volume: fix raw list for lvm devices"

23 months ago ceph-volume: fix raw list for non-existent device
Matthew Booth [Tue, 7 Nov 2023 19:58:16 +0000 (19:58 +0000)]
ceph-volume: fix raw list for non-existent device

ceph-volume should not crash when given a device which doesn't exist.

Fixes: https://tracker.ceph.com/issues/63391
Signed-off-by: Matthew Booth <mbooth@redhat.com>
(cherry picked from commit e009b1a319c834a504947423ae1dd55387809235)
(cherry picked from commit 0e17f8b5aebb51ee69ce0689dabe2a9111055716)

23 months ago ceph-volume: Revert "ceph-volume: fix raw list for lvm devices"
Guillaume Abrioux [Tue, 7 Nov 2023 13:39:50 +0000 (14:39 +0100)]
ceph-volume: Revert "ceph-volume: fix raw list for lvm devices"

This reverts commit e5e429617c1c27dcd631171f65d30571e32f7266.
This commit introduced a regression, see linked tracker for details.

Fixes: https://tracker.ceph.com/issues/63391
Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>
(cherry picked from commit 916a22ef031953056771eceb1f49cab7eb746978)

23 months ago Merge pull request #54427 from zdover23/wip-doc-2023-11-09-backport-54412-to-reef
Anthony D'Atri [Thu, 9 Nov 2023 04:45:09 +0000 (23:45 -0500)]
Merge pull request #54427 from zdover23/wip-doc-2023-11-09-backport-54412-to-reef

reef: doc/rados: edit t-shooting-mon.rst

23 months ago doc/rados: edit t-shooting-mon.rst
Zac Dover [Wed, 8 Nov 2023 07:56:29 +0000 (17:56 +1000)]
doc/rados: edit t-shooting-mon.rst

Improve the front matter of the section "Troubleshooting Monitors" in
doc/rados/troubleshooting/troubleshooting-mon.rst.

Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit e6e3fda2efb8f4a4345ba5f939baade844c5d582)

23 months ago Merge pull request #54421 from zdover23/wip-doc-2023-11-09-backport-54417-to-reef
Anthony D'Atri [Wed, 8 Nov 2023 16:15:45 +0000 (11:15 -0500)]
Merge pull request #54421 from zdover23/wip-doc-2023-11-09-backport-54417-to-reef

reef: doc/rados: edit t-mon "common issues" (2 of x)

23 months ago doc/rados: edit t-mon "common issues" (2 of x)
Zac Dover [Wed, 8 Nov 2023 13:24:06 +0000 (23:24 +1000)]
doc/rados: edit t-mon "common issues" (2 of x)

Edit the second part of the section "Most Common Monitor Issues" in
doc/rados/troubleshooting/troubleshooting-mon.rst.

Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 7dcfa9132ce6ca97d2c414014fc5de0a0c7e106e)

23 months ago Merge pull request #54418 from zdover23/wip-doc-2023-11-08-backport-54413-to-reef
zdover23 [Wed, 8 Nov 2023 15:07:49 +0000 (01:07 +1000)]
Merge pull request #54418 from zdover23/wip-doc-2023-11-08-backport-54413-to-reef

reef: doc/rados: edit t-mon "common issues" (1 of x)

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

23 months ago doc/rados: edit t-mon "common issues" (1 of x)
Zac Dover [Wed, 8 Nov 2023 09:07:40 +0000 (19:07 +1000)]
doc/rados: edit t-mon "common issues" (1 of x)

Edit the first part of the section "Most Common Monitor Issues" in
doc/rados/troubleshooting/troubleshooting-mon.rst.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 434d8967a37d49bdce3effae4e9b4c69a8586a56)

23 months ago backport: https://github.com/ceph/ceph/pull/54364
Redouane Kachach [Wed, 8 Nov 2023 13:02:31 +0000 (14:02 +0100)]
backport: https://github.com/ceph/ceph/pull/54364

Signed-off-by: Redouane Kachach <rkachach@redhat.com>

23 months ago Merge pull request #54395 from aaSharma14/wip-63470-reef
Aashish Sharma [Wed, 8 Nov 2023 04:41:19 +0000 (10:11 +0530)]
Merge pull request #54395 from aaSharma14/wip-63470-reef

reef: mgr/dashboard: fix rgw multi-site import form helper

Reviewed-by: Nizamudeen A <nia@redhat.com>

23 months ago Merge pull request #54380 from vshankar/tr-62539
Yuri Weinstein [Tue, 7 Nov 2023 17:17:47 +0000 (09:17 -0800)]
Merge pull request #54380 from vshankar/tr-62539

reef: qa: add POOL_APP_NOT_ENABLED to ignorelist for cephfs tests

23 months ago mgr/dashboard: fix rgw multi-site import form helper
Aashish Sharma [Tue, 7 Nov 2023 13:27:24 +0000 (18:57 +0530)]
mgr/dashboard: fix rgw multi-site import form helper

Before: To obtain the token, generate it from your primary Ceph cluster. This token includes encoded information about the secondary cluster's endpoint, access key, and secret key.
Fix: To obtain the token, generate it from your primary Ceph cluster. This token includes encoded information about the primary cluster's endpoint, access key, and secret key.

Fixes: https://tracker.ceph.com/issues/63469
Signed-off-by: Aashish Sharma <aasharma@redhat.com>
(cherry picked from commit 157f21022e5a065e1457e5ac534d0dd7c66aaab1)

23 months ago Merge pull request #54388 from zdover23/wip-doc-2023-11-07-backport-54384-to-reef
zdover23 [Tue, 7 Nov 2023 16:11:29 +0000 (02:11 +1000)]
Merge pull request #54388 from zdover23/wip-doc-2023-11-07-backport-54384-to-reef

reef: doc/cephadm: fix typo in set ssh key command

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

23 months ago Merge pull request #54376 from ljflores/wip-pool-app-reef
Yuri Weinstein [Tue, 7 Nov 2023 15:34:38 +0000 (07:34 -0800)]
Merge pull request #54376 from ljflores/wip-pool-app-reef

reef: qa/smoke,orch,perf-basic: add POOL_APP_NOT_ENABLED to ignorelist

Reviewed-by: Yuri Weinstein <yweinste@redhat.com>
Reviewed-by: Prashant D <pdhange@redhat.com>

23 months ago doc/cephadm: fix typo in set ssh key command
Piotr Parczewski [Tue, 7 Nov 2023 11:48:01 +0000 (12:48 +0100)]
doc/cephadm: fix typo in set ssh key command

Signed-off-by: Piotr Parczewski <piotr@stackhpc.com>
(cherry picked from commit e9654b77eda12a3862eefb93c65dc5f15715833d)

23 months ago Merge pull request #54371 from cbodley/wip-63452-reef
Casey Bodley [Tue, 7 Nov 2023 12:51:40 +0000 (12:51 +0000)]
Merge pull request #54371 from cbodley/wip-63452-reef

reef: rgw: fetch_remote_obj() uses uncompressed size for encrypted objects

Reviewed-by: Yuri Weinstein <yuriw@redhat.com>

23 months ago backport: https://github.com/ceph/ceph/pull/54056
Redouane Kachach [Tue, 7 Nov 2023 12:26:25 +0000 (13:26 +0100)]
backport: https://github.com/ceph/ceph/pull/54056

Signed-off-by: Redouane Kachach <rkachach@redhat.com>

23 months ago Revert "Revert "Merge PR #53077 into main""
Venky Shankar [Fri, 8 Sep 2023 08:41:13 +0000 (14:11 +0530)]
Revert "Revert "Merge PR #53077 into main""

This reverts commit af4558f0ac3b0dc23a8b3cbba63b9c8acc498ea4.

Fixes: http://tracker.ceph.com/issues/62482
Signed-off-by: Venky Shankar <vshankar@redhat.com>
(cherry picked from commit 7d5795dc37fc903121f2caa584bec55773fb61ee)

23 months ago qa: do not create rbd pool for CephFS QA
Patrick Donnelly [Thu, 17 Aug 2023 13:27:45 +0000 (09:27 -0400)]
qa: do not create rbd pool for CephFS QA

Fixes: https://tracker.ceph.com/issues/62482
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit 636d2a44c7a935426788ed61e9a77432dc49ca5e)

23 months ago Revert "Merge PR #53077 into main"
Patrick Donnelly [Mon, 28 Aug 2023 13:16:10 +0000 (09:16 -0400)]
Revert "Merge PR #53077 into main"

This reverts commit b8bf0c62b96cb76ae547d5d2f13d2a68417e79e9, reversing
changes made to fe07f64396e504c373bd1b621dda0b7893753fe8.

Silencing this health warning is unnecessary if we stop creating the rbd pool
in CephFS testing.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit af4558f0ac3b0dc23a8b3cbba63b9c8acc498ea4)

23 months ago qa: add `POOL_APP_NOT_ENABLED` to ignorelist for cephfs tests
Venky Shankar [Tue, 22 Aug 2023 06:14:55 +0000 (11:44 +0530)]
qa: add `POOL_APP_NOT_ENABLED` to ignorelist for cephfs tests

Fixes: http://tracker.ceph.com/issues/62508
Signed-off-by: Venky Shankar <vshankar@redhat.com>
(cherry picked from commit 97b76a480b9708f8557222de14058f644746a4a6)

23 months ago qa/smoke,orch,perf-basic: add POOL_APP_NOT_ENABLED to ignorelist
Prashant D [Wed, 18 Oct 2023 20:07:47 +0000 (16:07 -0400)]
qa/smoke,orch,perf-basic: add POOL_APP_NOT_ENABLED to ignorelist

Some of the smoke, orch and perf-basic tests are failing due
to POOL_APP_NOT_ENABLED health check failure. Add
POOL_APP_NOT_ENABLED to ignorelist for these tests.

Signed-off-by: Prashant D <pdhange@redhat.com>
(cherry picked from commit 4ecec7a777e90fce31317bc805fc70e1b40cba74)

23 months ago Merge pull request #54368 from zdover23/wip-doc-2023-11-07-backport-54342-to-reef
Anthony D'Atri [Mon, 6 Nov 2023 20:33:09 +0000 (15:33 -0500)]
Merge pull request #54368 from zdover23/wip-doc-2023-11-07-backport-54342-to-reef

reef: doc/radosgw: Improve dynamicresharding.rst

23 months ago Merge pull request #54366 from zdover23/wip-doc-2023-11-07-backport-54352-to-reef
Anthony D'Atri [Mon, 6 Nov 2023 20:03:04 +0000 (15:03 -0500)]
Merge pull request #54366 from zdover23/wip-doc-2023-11-07-backport-54352-to-reef

reef: doc/rados: format Q&A list in tshooting-mon.rst

23 months ago rgw: fetch_remote_obj() uses uncompressed size for encrypted objects
Casey Bodley [Tue, 24 Oct 2023 20:48:06 +0000 (16:48 -0400)]
rgw: fetch_remote_obj() uses uncompressed size for encrypted objects

use the original size from RGW_ATTR_COMPRESSION as the accounted size in
the bucket index for objects that were transferred in their
encrypted/compressed form
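
The size selection described above can be sketched in Python as follows. The
function and parameter names are hypothetical, and parsing of the
RGW_ATTR_COMPRESSION attribute is left out; the real logic is C++ in
fetch_remote_obj().

```python
from typing import Optional


def accounted_size(transferred_len: int, compression_orig_size: Optional[int]) -> int:
    """Pick the size to account for in the bucket index.

    `compression_orig_size` stands for the original size recorded in the
    compression attribute, or None when the object carries no such attribute
    (an assumption made for this sketch).
    """
    if compression_orig_size is not None:
        # The object arrived in compressed form: account for its original size.
        return compression_orig_size
    return transferred_len
```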

Fixes: https://tracker.ceph.com/issues/63306
Signed-off-by: Casey Bodley <cbodley@redhat.com>
(cherry picked from commit 526643826246fa8dbe35ea2f65bf9ad0ef9566ff)

Conflicts:
        src/rgw/driver/rados/rgw_rados.cc no req_context

23 months ago rgw: rename 'expected' to 'accounted' size in fetch_remote_obj()
Casey Bodley [Tue, 24 Oct 2023 20:45:47 +0000 (16:45 -0400)]
rgw: rename 'expected' to 'accounted' size in fetch_remote_obj()

and use it instead of cb.get_data_len() after testing for equality

Signed-off-by: Casey Bodley <cbodley@redhat.com>
(cherry picked from commit 999d539f76bdb4baa0b8342fff74a9e3fd1cdc12)

Conflicts:
        src/rgw/driver/rados/rgw_rados.cc no req_context

23 months ago doc/radosgw: Improve dynamicresharding.rst
Anthony D'Atri [Sat, 4 Nov 2023 15:06:25 +0000 (11:06 -0400)]
doc/radosgw: Improve dynamicresharding.rst

Signed-off-by: Anthony D'Atri <anthonyeleven@users.noreply.github.com>
(cherry picked from commit a9bf0c13e3dc866bd1cb724a10e23ae215e15437)

23 months ago doc/rados: format Q&A list in tshooting-mon.rst
Zac Dover [Mon, 6 Nov 2023 04:31:20 +0000 (14:31 +1000)]
doc/rados: format Q&A list in tshooting-mon.rst

Bold the questions in a Q&A list in the "Most Common Monitor Issues"
section of doc/rados/troubleshooting/troubleshooting-mon.rst.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 0d953d27cb6503dfde7f4f7ecb9a386c533c30c9)

23 months ago backport: https://github.com/ceph/ceph/pull/53747
Redouane Kachach [Mon, 6 Nov 2023 15:57:47 +0000 (16:57 +0100)]
backport: https://github.com/ceph/ceph/pull/53747
backport: https://github.com/ceph/ceph/pull/54151

Signed-off-by: Redouane Kachach <rkachach@redhat.com>
23 months ago backport of: https://github.com/ceph/ceph/pull/54307
Redouane Kachach [Mon, 6 Nov 2023 15:56:01 +0000 (16:56 +0100)]
backport of: https://github.com/ceph/ceph/pull/54307

Signed-off-by: Redouane Kachach <rkachach@redhat.com>
23 months ago Merge pull request #54353 from zdover23/wip-doc-2023-11-06-backport-54286-to-reef
Anthony D'Atri [Mon, 6 Nov 2023 14:03:49 +0000 (09:03 -0500)]
Merge pull request #54353 from zdover23/wip-doc-2023-11-06-backport-54286-to-reef

reef: doc/cephfs: edit mount-using-fuse.rst

23 months ago doc/cephfs: edit mount-using-fuse.rst
Jaanus Torp [Wed, 1 Nov 2023 10:23:47 +0000 (12:23 +0200)]
doc/cephfs: edit mount-using-fuse.rst

Fixes a missing dash that led to unnecessary debugging.

Signed-off-by: Jaanus Torp <jaanus@saun.ee>
(cherry picked from commit 9fcb98411bfeb9147f61bef4424a78d98c63cadc)

23 months ago Merge pull request #54349 from zdover23/wip-doc-2023-11-06-backport-54344-to-reef
Anthony D'Atri [Sun, 5 Nov 2023 16:43:37 +0000 (11:43 -0500)]
Merge pull request #54349 from zdover23/wip-doc-2023-11-06-backport-54344-to-reef

reef: doc/rados: edit t-mon.rst text

23 months ago doc/rados: edit t-mon.rst text
Zac Dover [Sun, 5 Nov 2023 12:28:39 +0000 (22:28 +1000)]
doc/rados: edit t-mon.rst text

Clarify the text in the "Clock Skew" section of
doc/rados/troubleshooting/troubleshooting-mon.rst.

Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 5496bd426f35c0ab91ae7d544ae92ed3b517c7eb)

23 months ago Merge pull request #54345 from zdover23/wip-doc-2023-11-05-backport-54343-to-reef
Anthony D'Atri [Sun, 5 Nov 2023 13:45:40 +0000 (08:45 -0500)]
Merge pull request #54345 from zdover23/wip-doc-2023-11-05-backport-54343-to-reef

reef: doc/rados: format Q&A list in t-mon.rst

23 months ago doc/rados: format Q&A list in t-mon.rst
Zac Dover [Sun, 5 Nov 2023 06:24:16 +0000 (16:24 +1000)]
doc/rados: format Q&A list in t-mon.rst

Format a Q&A list in doc/rados/troubleshooting/troubleshooting-mon.rst,
in the "Clock Skews" section.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit aadb87aeb5cd8b413937ddfe4562411d424c4bb1)

23 months ago Merge pull request #54161 from cbodley/wip-61352-reef
Yuri Weinstein [Sat, 4 Nov 2023 13:12:51 +0000 (06:12 -0700)]
Merge pull request #54161 from cbodley/wip-61352-reef

reef: rgw/s3: ListObjectsV2 returns correct object owners

Reviewed-by: Shilpa Jagannath <smanjara@redhat.com>
23 months ago Merge pull request #54267 from ronen-fr/wip-63371-reef
Yuri Weinstein [Sat, 4 Nov 2023 13:11:49 +0000 (06:11 -0700)]
Merge pull request #54267 from ronen-fr/wip-63371-reef

reef: osd: fix use-after-move in build_incremental_map_msg()

Reviewed-by: Samuel Just <sjust@redhat.com>
23 months ago Merge pull request #54191 from cfsnyder/wip-63252-reef
Yuri Weinstein [Sat, 4 Nov 2023 12:07:39 +0000 (05:07 -0700)]
Merge pull request #54191 from cfsnyder/wip-63252-reef

reef: rgw: add versioning info to radosgw-admin bucket stats output

Reviewed-by: Shilpa Jagannath <smanjara@redhat.com>
23 months ago Merge pull request #54163 from galsalomon66/wip-63297-reef
Yuri Weinstein [Sat, 4 Nov 2023 11:39:07 +0000 (04:39 -0700)]
Merge pull request #54163 from galsalomon66/wip-63297-reef

reef: rgw/s3select: fix per QE defect.

Reviewed-by: Shilpa Jagannath <smanjara@redhat.com>
23 months ago Merge pull request #54124 from cbodley/wip-63279-reef
Yuri Weinstein [Sat, 4 Nov 2023 11:36:24 +0000 (04:36 -0700)]
Merge pull request #54124 from cbodley/wip-63279-reef

reef: rgw: don't dereference nullopt in DeleteMultiObj

Reviewed-by: Shilpa Jagannath <smanjara@redhat.com>
Reviewed-by: Cory Snyder <csnyder@iland.com>
23 months ago Merge pull request #53516 from yuvalif/wip-61919-reef
Yuri Weinstein [Sat, 4 Nov 2023 11:34:43 +0000 (04:34 -0700)]
Merge pull request #53516 from yuvalif/wip-61919-reef

reef: rgw/amqp: remove possible race conditions with the amqp connections

Reviewed-by: Shilpa Jagannath <smanjara@redhat.com>
23 months ago Merge pull request #54254 from pkalever/wip-63350-reef
Yuri Weinstein [Fri, 3 Nov 2023 21:34:04 +0000 (14:34 -0700)]
Merge pull request #54254 from pkalever/wip-63350-reef

reef: rbd-nbd: fix stuck with disable request

Reviewed-by: Ilya Dryomov <idryomov@redhat.com>
Reviewed-by: Mykola Golub <mgolub@suse.com>
23 months ago Merge pull request #54276 from ljflores/wip-63375-reef
Yuri Weinstein [Fri, 3 Nov 2023 21:31:06 +0000 (14:31 -0700)]
Merge pull request #54276 from ljflores/wip-63375-reef

reef: osd: fix logic in check_pg_upmaps

Reviewed-by: Josh Salomon <josh.salomon@gmail.com>
23 months ago Merge pull request #54329 from zdover23/wip-doc-2023-11-04-backport-54274-to-reef
Anthony D'Atri [Fri, 3 Nov 2023 16:20:16 +0000 (12:20 -0400)]
Merge pull request #54329 from zdover23/wip-doc-2023-11-04-backport-54274-to-reef

reef: doc: Update dynamicresharding.rst

23 months ago Merge pull request #54320 from zdover23/wip-doc-2023-11-03-backport-54303-to-reef
Anthony D'Atri [Fri, 3 Nov 2023 16:18:29 +0000 (12:18 -0400)]
Merge pull request #54320 from zdover23/wip-doc-2023-11-03-backport-54303-to-reef

reef: doc/cephadm: edit troubleshooting.rst (2 of x)

23 months ago Merge pull request #54323 from zdover23/wip-doc-2023-11-03-backport-54322-to-reef
Anthony D'Atri [Fri, 3 Nov 2023 16:17:34 +0000 (12:17 -0400)]
Merge pull request #54323 from zdover23/wip-doc-2023-11-03-backport-54322-to-reef

reef: doc/cephadm/services: remove excess rendered indentation in osd.rst

23 months ago doc: Update dynamicresharding.rst
Aliaksei Makarau [Tue, 31 Oct 2023 15:20:05 +0000 (16:20 +0100)]
doc: Update dynamicresharding.rst

"3, After resharding completed:" should be
"3. After resharding completed:"

Signed-off-by: Aliaksei Makarau <aliaksei.makarau@ibm.com>
(cherry picked from commit e96cef78bb97491e6488999d6bc4a068cd1c5a8d)

23 months ago doc/cephadm/services: remove excess rendered indentation in osd.rst
Ville Ojamo [Fri, 3 Nov 2023 05:44:00 +0000 (12:44 +0700)]
doc/cephadm/services: remove excess rendered indentation in osd.rst

Start bash command blocks at the left margin, removing
excessive padding/indentation that would render the
block too far to the right.

At the same time, indent the source consistently:
- Two spaces for command blocks and output blocks.
- Four spaces for notes, code blocks.

There seems to be no uniform style for this: sometimes
commands are indented with three spaces, but two spaces
seems to be common. In the end it all renders the same,
I guess.

Signed-off-by: Ville Ojamo <14869000+bluikko@users.noreply.github.com>
(cherry picked from commit 329df4959d08e9bc90d6e1d83f99bd344a13dc1e)

23 months ago doc/cephadm: edit troubleshooting.rst (2 of x)
Zac Dover [Thu, 2 Nov 2023 06:07:39 +0000 (16:07 +1000)]
doc/cephadm: edit troubleshooting.rst (2 of x)

Edit doc/cephadm/troubleshooting.rst. This commit and the PR of which it
is a part were raised in response to https://github.com/ceph/ceph/pull/53976.

Co-authored-by: John Mulligan <jmulligan@redhat.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit b096c21c7c4df2bf177f3ba84093cc75ecd71d49)

23 months ago Merge pull request #54291 from ajarr/wip-63387-reef
Yuri Weinstein [Thu, 2 Nov 2023 18:18:56 +0000 (11:18 -0700)]
Merge pull request #54291 from ajarr/wip-63387-reef

reef: qa/suites/rbd: add test to check rbd_support module recovery

Reviewed-by: Ilya Dryomov <idryomov@redhat.com>
Reviewed-by: Mykola Golub <mgolub@suse.com>
23 months ago Merge pull request #54289 from ajarr/wip-63384-reef
Yuri Weinstein [Thu, 2 Nov 2023 18:17:42 +0000 (11:17 -0700)]
Merge pull request #54289 from ajarr/wip-63384-reef

reef: mgr/rbd_support: fix recursive locking on CreateSnapshotRequests lock

Reviewed-by: Mykola Golub <mgolub@suse.com>
Reviewed-by: Ilya Dryomov <idryomov@redhat.com>
23 months ago Merge pull request #54283 from zdover23/wip-doc-2023-11-01-backport-54281-to-reef
Anthony D'Atri [Wed, 1 Nov 2023 13:12:47 +0000 (09:12 -0400)]
Merge pull request #54283 from zdover23/wip-doc-2023-11-01-backport-54281-to-reef

reef: doc/cephadm: edit troubleshooting.rst (1 of x)

23 months ago qa/suites/rbd: add test to check rbd_support module recovery
Ramana Raja [Mon, 18 Sep 2023 02:52:56 +0000 (22:52 -0400)]
qa/suites/rbd: add test to check rbd_support module recovery

... on repeated blocklisting of its client.

There were issues with rbd_support module not being able to recover
from its RADOS client being repeatedly blocklisted. This occurred for
example in clusters with OSDs slow to process RBD requests while the
module's mirror_snapshot_scheduler was taking mirror snapshots by
requesting exclusive locks on the RBD images and workloads were running
on the snapshotted images via kernel clients.

Fixes: https://tracker.ceph.com/issues/62891
Signed-off-by: Ramana Raja <rraja@redhat.com>
(cherry picked from commit 2f2cd3bcff82afc3a4d251143eb462e700e7fc60)

23 months ago mgr/rbd_support: remove CreateSnapshotRequests __del__()
Ramana Raja [Mon, 30 Oct 2023 15:05:27 +0000 (11:05 -0400)]
mgr/rbd_support: remove CreateSnapshotRequests __del__()

There is no need for CreateSnapshotRequests.__del__() that calls
CreateSnapshotRequests.wait_for_pending().
MirrorSnapshotScheduleHandler.shutdown() already calls
CreateSnapshotRequests.wait_for_pending().

Signed-off-by: Ramana Raja <rraja@redhat.com>
(cherry picked from commit fed1e87685a698876cf167b3681327e5b0066ee6)
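
The reasoning in this commit (an explicit shutdown path already drains pending requests, so a finalizer is redundant) can be illustrated with a minimal Python sketch. `PendingRequests` and `Handler` are hypothetical stand-ins, not the actual rbd_support classes; only the names `wait_for_pending()` and `shutdown()` are borrowed from the commit message.

```python
import threading


class PendingRequests:
    """Hypothetical tracker for in-flight asynchronous requests."""

    def __init__(self):
        self.cond = threading.Condition()
        self.pending = 0

    def start(self):
        with self.cond:
            self.pending += 1

    def finish(self):
        with self.cond:
            self.pending -= 1
            self.cond.notify_all()

    def wait_for_pending(self):
        # Block until every started request has finished.
        with self.cond:
            self.cond.wait_for(lambda: self.pending == 0)


class Handler:
    """Hypothetical owner that drains requests in an explicit shutdown()."""

    def __init__(self):
        self.requests = PendingRequests()

    def shutdown(self):
        # Deterministic cleanup point; PendingRequests needs no __del__().
        self.requests.wait_for_pending()


handler = Handler()
handler.requests.start()
# Simulate an asynchronous completion arriving shortly afterwards.
worker = threading.Timer(0.05, handler.requests.finish)
worker.start()
handler.shutdown()
worker.join()
drained = handler.requests.pending == 0
```

Because `shutdown()` runs at a known point on a known thread, the wait does not depend on when the garbage collector happens to run.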

23 months ago mgr/rbd_support: fix recursive locking on CreateSnapshotRequests lock
Ramana Raja [Thu, 26 Oct 2023 17:18:52 +0000 (13:18 -0400)]
mgr/rbd_support: fix recursive locking on CreateSnapshotRequests lock

The MirrorSnapshotScheduleHandler's run thread issues asynchronous
create snapshot requests using a CreateSnapshotRequests instance. When
the thread invokes a CreateSnapshotRequests instance's get_ioctx(),
the instance's class variable lock is acquired. With the class
variable lock held, the garbage collection of a CreateSnapshotRequests
instance may race in the thread. The thread would then call
CreateSnapshotRequests __del__() that tries to acquire the class
variable lock that the thread already holds. Fix this
recursive deadlock by converting the CreateSnapshotRequests lock from
a class variable to an instance variable. There is no need to share
the lock across CreateSnapshotRequests instances.

Also convert those MirrorSnapshotScheduleHandler, PerfHandler and
TrashPurgeScheduleHandler class variables that don't need to be
shared across instances to instance variables.

Fixes: https://tracker.ceph.com/issues/62994
Signed-off-by: Ramana Raja <rraja@redhat.com>
Co-Authored-By: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit 4452bc22d1c6c8499cf55d6e39090adf7ae1dcbf)
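
The difference between a class-variable lock and an instance-variable lock can be sketched in Python as follows. This is an illustration under stated assumptions, not the rbd_support code: both classes and the `finalize()` method are hypothetical, `finalize()` stands in for `__del__()`, and a non-blocking `acquire()` is used so that the sketch reports the conflict instead of actually deadlocking.

```python
import threading


class SharedLockRequests:
    # Class variable: every instance shares this one non-reentrant lock.
    lock = threading.Lock()

    def finalize(self) -> bool:
        """Stand-in for __del__(); reports whether the lock was available."""
        acquired = self.lock.acquire(blocking=False)
        if acquired:
            self.lock.release()
        return acquired


class PerInstanceLockRequests:
    def __init__(self):
        # Instance variable: each object owns its own lock.
        self.lock = threading.Lock()

    def finalize(self) -> bool:
        """Stand-in for __del__(); reports whether the lock was available."""
        acquired = self.lock.acquire(blocking=False)
        if acquired:
            self.lock.release()
        return acquired


def finalize_other_while_locked(cls) -> bool:
    holder, collected = cls(), cls()
    with holder.lock:
        # Simulates garbage collection of `collected` running on the
        # same thread that currently holds `holder.lock`.
        return collected.finalize()


shared_ok = finalize_other_while_locked(SharedLockRequests)
per_instance_ok = finalize_other_while_locked(PerInstanceLockRequests)
```

With the shared class-level lock, the simulated finalizer cannot take the lock (a blocking `acquire()` would hang the thread); with per-instance locks, it succeeds because the two objects no longer contend for the same lock.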

23 months ago doc/cephadm: edit troubleshooting.rst (1 of x)
Zac Dover [Wed, 1 Nov 2023 01:53:59 +0000 (11:53 +1000)]
doc/cephadm: edit troubleshooting.rst (1 of x)

Edit doc/cephadm/troubleshooting.rst. This commit and the PR of which it
is a part were raised in response to
https://github.com/ceph/ceph/pull/53976. The limits of reStructuredText
are particularly visible here in every instance of a BASH for-loop and
in every instance of a command stretched over multiple lines.

Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 69472c26af5419faa9ed93c071ed5933d03fa67f)