]> git.apps.os.sepia.ceph.com Git - ceph.git/log
ceph.git
11 months agodoc/dev_guide: add needs-upgrade-testing label info 58730/head
Zac Dover [Mon, 22 Jul 2024 15:36:53 +0000 (01:36 +1000)]
doc/dev_guide: add needs-upgrade-testing label info

Add an explanation of the "needs-upgrade-testing" label and explain when
to use it.

Laura is the true author of this text.

Co-authored-by: Laura Flores <lflores@ibm.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 02d5a6a3371c9fa52a32124cf07da16b89a11e27)

11 months agoMerge pull request #58721 from zdover23/wip-doc-2024-07-22-backport-56517-to-reef
Anthony D'Atri [Mon, 22 Jul 2024 14:07:24 +0000 (10:07 -0400)]
Merge pull request #58721 from zdover23/wip-doc-2024-07-22-backport-56517-to-reef

reef: doc/start/os-recommendations: remove 16.2.z support for CentOS 7

11 months agodoc/start/os-recommendations: remove 16.2.z support for CentOS 7 58721/head
gukaifeng [Tue, 2 Apr 2024 07:28:04 +0000 (15:28 +0800)]
doc/start/os-recommendations: remove 16.2.z support for CentOS 7

(cherry picked from commit 92b07ddece5f717454e5aeaa00ec70e5649539c0)

11 months agoMerge pull request #58713 from zdover23/wip-doc-2024-07-22-backport-58639-to-reef
Anthony D'Atri [Mon, 22 Jul 2024 12:45:42 +0000 (08:45 -0400)]
Merge pull request #58713 from zdover23/wip-doc-2024-07-22-backport-58639-to-reef

reef: doc/start: add tested container host oses

11 months agoMerge pull request #57094 from aclamk/wip-aclamk-fix-teuthology-test-bluestore-b
Adam Kupczyk [Mon, 22 Jul 2024 12:17:44 +0000 (14:17 +0200)]
Merge pull request #57094 from aclamk/wip-aclamk-fix-teuthology-test-bluestore-b

teuthology/bluestore: Fix running of compressed tests

11 months agodoc/start: add tested container host oses 58713/head
Zac Dover [Wed, 17 Jul 2024 06:18:58 +0000 (16:18 +1000)]
doc/start: add tested container host oses

Add a list of OSes that have been tested as container hosts. This commit
is added in response to a request made by Josh Durgin on Slack.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 9485f9723fcf58a0efbc08def27dd85ffe645764)

11 months agoMerge pull request #58350 from chrisphoffman/wip-66732-reef
Xiubo Li [Mon, 22 Jul 2024 06:21:05 +0000 (14:21 +0800)]
Merge pull request #58350 from chrisphoffman/wip-66732-reef

reef: mds: Implement remove for ceph vxattrs

11 months agoMerge pull request #58556 from rishabh-d-dave/wip-66927-reef
Xiubo Li [Mon, 22 Jul 2024 06:20:59 +0000 (14:20 +0800)]
Merge pull request #58556 from rishabh-d-dave/wip-66927-reef

reef: mgr/vol: handle case where clone index entry goes missing

11 months agoMerge pull request #58560 from rishabh-d-dave/wip-66931-reef
Xiubo Li [Mon, 22 Jul 2024 06:20:37 +0000 (14:20 +0800)]
Merge pull request #58560 from rishabh-d-dave/wip-66931-reef

reef: qa/cephfs: fix test_single_path_authorize_on_nonalphanumeric_fsname

11 months agoMerge pull request #58563 from rishabh-d-dave/wip-66936-reef
Xiubo Li [Mon, 22 Jul 2024 06:20:11 +0000 (14:20 +0800)]
Merge pull request #58563 from rishabh-d-dave/wip-66936-reef

reef: qa/cephfs: improvements for "mds fail" and "fs fail"

11 months agoMerge pull request #58632 from joscollin/wip-66976-reef
Xiubo Li [Mon, 22 Jul 2024 06:17:47 +0000 (14:17 +0800)]
Merge pull request #58632 from joscollin/wip-66976-reef

reef: cephfs_mirror, qa: fix mirror daemon doesn't restart when blocklisted or failed

12 months agoMerge pull request #58697 from zdover23/wip-doc-2024-07-21-backport-58695-to-reef
Anthony D'Atri [Sat, 20 Jul 2024 19:13:03 +0000 (15:13 -0400)]
Merge pull request #58697 from zdover23/wip-doc-2024-07-21-backport-58695-to-reef

reef: doc: Amend dev mailing list subscribe instructions.

12 months agodoc: Amend dev mailing list subscribe instructions. 58697/head
Paulo E. Castro [Sat, 20 Jul 2024 16:32:17 +0000 (17:32 +0100)]
doc: Amend dev mailing list subscribe instructions.

Signed-off-by: Paulo E. Castro <pecastro@wormholenet.com>
(cherry picked from commit 3b0360bc2ea1e0b5f7a7c6b04ade483f77e4b5c6)

12 months agoMerge pull request #58681 from zdover23/wip-doc-2024-07-20-backport-58678-to-reef
Anthony D'Atri [Fri, 19 Jul 2024 17:50:05 +0000 (13:50 -0400)]
Merge pull request #58681 from zdover23/wip-doc-2024-07-20-backport-58678-to-reef

reef: doc/cephadm/upgrade: ceph-ci containers are hosted by quay.ceph.io

12 months agoMerge pull request #58684 from zdover23/wip-doc-2024-07-20-backport-58671-to-reef
Anthony D'Atri [Fri, 19 Jul 2024 17:44:14 +0000 (13:44 -0400)]
Merge pull request #58684 from zdover23/wip-doc-2024-07-20-backport-58671-to-reef

reef: doc/start: update mailing list links

12 months agodoc/start: update mailing list links 58684/head
Zac Dover [Fri, 19 Jul 2024 06:44:44 +0000 (16:44 +1000)]
doc/start: update mailing list links

Update the User and Dev mailing list subscription links. This commit is
made in response to a request from Dan van der Ster.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit a242fe3d940d5764cd766dba8e42449501986765)

12 months agodoc/cephadm/upgrade: ceph-ci containers are hosted by quay.ceph.io 58681/head
Casey Bodley [Fri, 19 Jul 2024 14:10:38 +0000 (10:10 -0400)]
doc/cephadm/upgrade: ceph-ci containers are hosted by quay.ceph.io

Signed-off-by: Casey Bodley <cbodley@redhat.com>
(cherry picked from commit 9c77963300fee804f249d512c980cdeac06f14f7)

12 months agoMerge pull request #58447 from ceph/wip-yuriw-reef-client-x
Yuri Weinstein [Thu, 18 Jul 2024 14:47:23 +0000 (07:47 -0700)]
Merge pull request #58447 from ceph/wip-yuriw-reef-client-x

qa/tests: added client-upgrade-reef-squid tests

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
12 months agoMerge pull request #58657 from rhcs-dashboard/wip-67003-reef
Avan [Thu, 18 Jul 2024 10:18:54 +0000 (15:48 +0530)]
Merge pull request #58657 from rhcs-dashboard/wip-67003-reef

reef: mgr/dashboard: add NFS export button for subvolume/ grp

Reviewed-by: Ankush Behl <cloudbehl@gmail.com>
12 months agomgr/dashboard: add NFS export route from subvolume/subvolume grp 58657/head
Avan Thakkar [Fri, 5 Jul 2024 09:29:13 +0000 (14:59 +0530)]
mgr/dashboard: add NFS export route from subvolume/subvolume grp

Signed-off-by: Avan Thakkar <athakkar@redhat.com>
(cherry picked from commit c0b2db74b2a99eb0ee4e4a37997d6516d8622855)

12 months agoMerge pull request #58615 from zdover23/wip-doc-2024-07-16-backport-58596-to-reef
Zac Dover [Thu, 18 Jul 2024 05:22:00 +0000 (15:22 +1000)]
Merge pull request #58615 from zdover23/wip-doc-2024-07-16-backport-58596-to-reef

reef: doc: clarify superuser note for ceph-fuse

Reviewed-by: Cole Mitchell <cole.mitchell.ceph@gmail.com>
12 months agoqa/tests: added client-upgrade-reef-squid tests 58447/head
Yuri Weinstein [Fri, 5 Jul 2024 18:01:24 +0000 (11:01 -0700)]
qa/tests: added client-upgrade-reef-squid tests

Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
12 months agoMerge pull request #57565 from dparmar18/wip-66061-reef
Xiubo Li [Wed, 17 Jul 2024 03:19:45 +0000 (11:19 +0800)]
Merge pull request #57565 from dparmar18/wip-66061-reef

reef: qa: add a YAML to ignore MGR_DOWN warning

12 months agoMerge pull request #57611 from vshankar/wip-65987-reef
Xiubo Li [Wed, 17 Jul 2024 03:19:35 +0000 (11:19 +0800)]
Merge pull request #57611 from vshankar/wip-65987-reef

reef: qa: ignore cluster warnings generated from forward-scrub task

12 months agoMerge pull request #58332 from mchangir/wip-66670-reef
Xiubo Li [Wed, 17 Jul 2024 03:19:23 +0000 (11:19 +0800)]
Merge pull request #58332 from mchangir/wip-66670-reef

reef: qa: cleanup snapshots before subvolume delete

12 months agoMerge pull request #58493 from joscollin/wip-66868-reef
Xiubo Li [Wed, 17 Jul 2024 03:19:00 +0000 (11:19 +0800)]
Merge pull request #58493 from joscollin/wip-66868-reef

reef: mds: prevent scrubbing for standby-replay MDS

12 months agoMerge pull request #58497 from joscollin/wip-66667-reef
Xiubo Li [Wed, 17 Jul 2024 03:18:34 +0000 (11:18 +0800)]
Merge pull request #58497 from joscollin/wip-66667-reef

reef: mds: set the correct WRLOCK flag always in wrlock_force()

12 months agoMerge pull request #58502 from joscollin/wip-66275-reef
Xiubo Li [Wed, 17 Jul 2024 03:14:18 +0000 (11:14 +0800)]
Merge pull request #58502 from joscollin/wip-66275-reef

reef: mds: relax divergent backtrace scrub failures for replicated ancestor inodes

12 months agoMerge pull request #58528 from joscollin/wip-66614-reef
Xiubo Li [Wed, 17 Jul 2024 03:14:01 +0000 (11:14 +0800)]
Merge pull request #58528 from joscollin/wip-66614-reef

reef: mds: set the proper extra bl for the create request

12 months agoMerge pull request #58529 from joscollin/wip-66200-reef
Xiubo Li [Wed, 17 Jul 2024 03:13:38 +0000 (11:13 +0800)]
Merge pull request #58529 from joscollin/wip-66200-reef

reef: client: set LIBMOUNT_FORCE_MOUNT2=always

12 months agoqa: Wait for mirror daemon restart before getting new rados_inst 58632/head
Jos Collin [Thu, 14 Mar 2024 14:42:46 +0000 (20:12 +0530)]
qa: Wait for mirror daemon restart before getting new rados_inst

After blocklisted/failed, wait for the mirror daemon restart
which is after 30 seconds timeout and then check for the new rados_inst.

Fixes: https://tracker.ceph.com/issues/64927
Signed-off-by: Jos Collin <jcollin@redhat.com>
(cherry picked from commit a9a56919ff95d2face436da2ca08fd26580fadb9)

12 months agocephfs_mirror: Fixed negative seconds
Jos Collin [Thu, 30 May 2024 10:18:08 +0000 (15:48 +0530)]
cephfs_mirror: Fixed negative seconds

Fixes: https://tracker.ceph.com/issues/64927
Fixes: https://tracker.ceph.com/issues/51964
Fixes: https://tracker.ceph.com/issues/63931
Fixes: https://tracker.ceph.com/issues/63089
Signed-off-by: Jos Collin <jcollin@redhat.com>
(cherry picked from commit 96c3f8f33d789a065c7ed418679afd53faffa3d8)

12 months agocephfs_mirror: Add ErrorListener to maintain blocklisted/failed timestamp in FSMirror
Jos Collin [Tue, 28 May 2024 14:57:55 +0000 (20:27 +0530)]
cephfs_mirror: Add ErrorListener to maintain blocklisted/failed timestamp in FSMirror

Have FSMirror register a listener with InstanceWatcher/MirrorWatcher which would get invoked when the mirror daemon is blocklisted or failed.
Thus FSMirror can maintain the last blocklisted/failed timestamp and use that for restarting the mirror daemon.

Fixes: https://tracker.ceph.com/issues/64927
Fixes: https://tracker.ceph.com/issues/51964
Fixes: https://tracker.ceph.com/issues/63931
Fixes: https://tracker.ceph.com/issues/63089
Signed-off-by: Jos Collin <jcollin@redhat.com>
(cherry picked from commit 77ec7bfde7a349b0e06b34ecdf328996c7642d43)

12 months agoMerge pull request #58232 from batrick/wip-66592-reef
Xiubo Li [Tue, 16 Jul 2024 05:55:38 +0000 (13:55 +0800)]
Merge pull request #58232 from batrick/wip-66592-reef

reef: tools/cephfs: recover alternate_name of dentries from journal

12 months agoMerge pull request #57920 from batrick/wip-66387-reef
Xiubo Li [Tue, 16 Jul 2024 05:54:33 +0000 (13:54 +0800)]
Merge pull request #57920 from batrick/wip-66387-reef

reef: qa: disable mon_warn_on_pool_no_app in fs suite

12 months agoMerge pull request #57761 from joscollin/wip-66278-reef
Xiubo Li [Tue, 16 Jul 2024 05:54:25 +0000 (13:54 +0800)]
Merge pull request #57761 from joscollin/wip-66278-reef

reef: cephfs-journal-tool: Add preventive measures to avoid fs corruption

12 months agoMerge pull request #57683 from batrick/wip-66194-reef
Xiubo Li [Tue, 16 Jul 2024 05:54:17 +0000 (13:54 +0800)]
Merge pull request #57683 from batrick/wip-66194-reef

reef: mds: use regular dispatch for processing beacons

12 months agoMerge pull request #57681 from batrick/wip-66190-reef
Xiubo Li [Tue, 16 Jul 2024 05:54:11 +0000 (13:54 +0800)]
Merge pull request #57681 from batrick/wip-66190-reef

reef: mds: use regular dispatch for processing metrics

12 months agoMerge pull request #57524 from batrick/wip-66043-reef
Xiubo Li [Tue, 16 Jul 2024 05:54:00 +0000 (13:54 +0800)]
Merge pull request #57524 from batrick/wip-66043-reef

reef: qa: unmount clients before damaging the fs

12 months agoMerge pull request #58157 from vshankar/wip-66324-reef
Xiubo Li [Tue, 16 Jul 2024 05:51:08 +0000 (13:51 +0800)]
Merge pull request #58157 from vshankar/wip-66324-reef

reef: qa: enable debug logs for fs:cephadm:multivolume subsuite

12 months agoMerge pull request #58172 from batrick/wip-65272-reef
Xiubo Li [Tue, 16 Jul 2024 05:50:50 +0000 (13:50 +0800)]
Merge pull request #58172 from batrick/wip-65272-reef

reef: qa: ignore PG health warnings in CephFS QA

12 months agoMerge pull request #58231 from batrick/wip-65845-reef
Xiubo Li [Tue, 16 Jul 2024 05:50:24 +0000 (13:50 +0800)]
Merge pull request #58231 from batrick/wip-65845-reef

reef: qa: ignore variation of PG_DEGRADED health warning

12 months agoMerge pull request #58251 from joscollin/wip-66673-reef
Xiubo Li [Tue, 16 Jul 2024 05:49:36 +0000 (13:49 +0800)]
Merge pull request #58251 from joscollin/wip-66673-reef

reef: qa: enhance per-client labelled perf counters test

12 months agoMerge pull request #58295 from batrick/wip-66625-reef
Xiubo Li [Tue, 16 Jul 2024 05:49:10 +0000 (13:49 +0800)]
Merge pull request #58295 from batrick/wip-66625-reef

reef: mds: do remove the cap when seqs equal or larger than last issue

12 months agoMerge pull request #58328 from joscollin/wip-65926-reef
Xiubo Li [Tue, 16 Jul 2024 05:48:43 +0000 (13:48 +0800)]
Merge pull request #58328 from joscollin/wip-65926-reef

reef: mds/purgequeue: add l_pq_executed_ops counter

12 months agodoc: elide superuser discussion from mount prereq 58615/head
Patrick Donnelly [Mon, 15 Jul 2024 14:18:15 +0000 (10:18 -0400)]
doc: elide superuser discussion from mount prereq

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit 7290bdce6f4dd93be88be6ad54030fa90a8c1ec0)

12 months agodoc: clarify superuser note for ceph-fuse
Patrick Donnelly [Mon, 15 Jul 2024 14:12:24 +0000 (10:12 -0400)]
doc: clarify superuser note for ceph-fuse

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit d100de9c9ce884beee6e9551b2b221c686b775b5)

12 months agoMerge pull request #58323 from zdover23/wip-doc-2024-06-28-backport-57685-to-reef
Zac Dover [Tue, 16 Jul 2024 03:25:12 +0000 (13:25 +1000)]
Merge pull request #58323 from zdover23/wip-doc-2024-06-28-backport-57685-to-reef

reef: cephadm: use importlib.metadata for querying ceph_iscsi's version

Reviewed-by: Adam King <adking@redhat.com>
12 months agoMerge pull request #58576 from zdover23/wip-doc-2024-07-14-backport-58451-to-reef
Zac Dover [Mon, 15 Jul 2024 13:21:12 +0000 (23:21 +1000)]
Merge pull request #58576 from zdover23/wip-doc-2024-07-14-backport-58451-to-reef

reef: doc/dev: edit "Principles for format change"

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
12 months agoMerge pull request #58475 from rhcs-dashboard/wip-66871-reef
Pedro Gonzalez Gomez [Mon, 15 Jul 2024 09:07:35 +0000 (11:07 +0200)]
Merge pull request #58475 from rhcs-dashboard/wip-66871-reef

reef: mgr/dashboard: nfs export enhancement for CEPHFS

Reviewed-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>
12 months agoMerge pull request #58579 from zdover23/wip-doc-2024-07-15-backport-58574-to-reef
Anthony D'Atri [Sun, 14 Jul 2024 17:22:43 +0000 (13:22 -0400)]
Merge pull request #58579 from zdover23/wip-doc-2024-07-15-backport-58574-to-reef

reef: doc/cephfs: edit "is mount helper present"

12 months agodoc/cephfs: edit "is mount helper present" 58579/head
Zac Dover [Sun, 14 Jul 2024 06:40:14 +0000 (16:40 +1000)]
doc/cephfs: edit "is mount helper present"

Edit the section called "Is mount helper present?", the title of which
prior to this commit was "Is mount helper is present?". Other small
disambiguating improvements have been made to the text in the section.
An unselectable prompt has been added before a command.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 0ebfb3eda8e2fc88662530bbddc84f2efcc4e13c)

12 months agodoc/dev: edit "Principles for format change" 58576/head
Zac Dover [Sun, 7 Jul 2024 03:59:15 +0000 (13:59 +1000)]
doc/dev: edit "Principles for format change"

Improve "Principles for format change" in doc/dev/encoding.rst. This
commit started as a response to Anthony D'Atri's suggestion here: https://github.com/ceph/ceph/pull/58299/files#r1656985564

Review of this section suggested to me that certain minor English usage
improvements would be of benefit. The numbered lists in this section
could still be made a bit clearer.

Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 570797e5588b67b8c72e5297b61f84d9aa48dc45)

12 months agoMerge pull request #58549 from idryomov/wip-test-cookie-reef
Ilya Dryomov [Fri, 12 Jul 2024 16:16:27 +0000 (18:16 +0200)]
Merge pull request #58549 from idryomov/wip-test-cookie-reef

reef: test/librbd: clean up unused TEST_COOKIE variable

Reviewed-by: Ramana Raja <rraja@redhat.com>
12 months agoMerge pull request #58245 from rhcs-dashboard/wip-66424-reef
Pedro Gonzalez Gomez [Fri, 12 Jul 2024 15:23:18 +0000 (17:23 +0200)]
Merge pull request #58245 from rhcs-dashboard/wip-66424-reef

reef: mgr/dashboard: fix edit bucket failing in other selected gateways

Reviewed-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>
12 months agoqa/cephfs: rename couple of test methods 58563/head
Rishabh Dave [Wed, 8 May 2024 14:24:28 +0000 (19:54 +0530)]
qa/cephfs: rename couple of test methods

New name will make it easier to find these tests by making it similar to
health warnings (MDS_CACHE_OVERSIZED) they do testing for.

Signed-off-by: Rishabh Dave <ridave@redhat.com>
(cherry picked from commit 0f41207dacbd56360689552097789e06dca55446)

12 months agoqa/cephfs: improve and move _get_unhealthy_mds_name to TestMDSFail
Rishabh Dave [Wed, 8 May 2024 13:09:35 +0000 (18:39 +0530)]
qa/cephfs: improve and move _get_unhealthy_mds_name to TestMDSFail

1. Instead of accepting health report as argument, get one directly.
2. Since it is not being used elsewhere move it to the class where it is
   being used.

Signed-off-by: Rishabh Dave <ridave@redhat.com>
(cherry picked from commit b1918686a383e47d51ae0115c8d79ef250ddd0ec)

12 months agoqa/cephfs: use wait_for_health() instead of the new method
Rishabh Dave [Wed, 8 May 2024 12:29:11 +0000 (17:59 +0530)]
qa/cephfs: use wait_for_health() instead of the new method

Signed-off-by: Rishabh Dave <ridave@redhat.com>
(cherry picked from commit 5972cafb7a3750cde83f09c9ac592b97c5f9a9b3)

12 months agoqa/cephfs: make code for generating health warnings reusable
Rishabh Dave [Wed, 8 May 2024 12:08:43 +0000 (17:38 +0530)]
qa/cephfs: make code for generating health warnings reusable

Code to generate MDS_TRIM and MDS_CACHE_OVERSIZED health warnings is
repeated in test methods of TestMDSFail and TestFSFail. Move this code
to separate helper methods so that it can be reused instead of
duplicating it. And move these helper methods to TestAdminCommands so
to make them conveniently available for reuse.

Signed-off-by: Rishabh Dave <ridave@redhat.com>
(cherry picked from commit 79b047b22e1031df74584cfa4b77d7b824b0e42e)

12 months agoqa/cephfs: fix test_single_path_authorize_on_nonalphanumeric_fsname 58560/head
Rishabh Dave [Wed, 26 Jun 2024 13:35:35 +0000 (19:05 +0530)]
qa/cephfs: fix test_single_path_authorize_on_nonalphanumeric_fsname

This test deletes the CephFS already present on the cluster at the very
beginning and unmounts the first client beforehand. But it leaves the
second client mounted on this deleted CephFS that doesn't exist for the
rest of the test. And then at the very end of this test it attempts to
remount the second client (during tearDown()) which hangs and causes
test runner to crash.

Unmount the second client beforehand to prevent the bug and delete
mount_b object to avoid confusion for the readers in future about
whether or not 2nd mountpoint exists.

Fixes: https://tracker.ceph.com/issues/66077
Signed-off-by: Rishabh Dave <ridave@redhat.com>
(cherry picked from commit 2130ec8ebc377364a11be7448ed2773b46b464c0)

12 months agomgr/vol: handle case where clone index entry goes missing 58556/head
Rishabh Dave [Mon, 17 Jun 2024 19:03:28 +0000 (00:33 +0530)]
mgr/vol: handle case where clone index entry goes missing

In `async_cloner.py`, clone index entry is fetched to get next clone job
that needs to be executed. It might happen that the clone job was
cancelled just when it was going to be picked for execution (IOW, when
it was about to move from pending state to in-progress state).

Currently, MGR hangs in such a case because exception `ObjectNotFound`
from CephFS Python bindings is raised and is left uncaught. To prevent
this issue catch the exception, log it and return None to tell
`get_job()` of `async_job.py` to look for next job in the queue.

Increase the scope of try-except in method `get_oldest_clone_entry()` of
`async_cloner.py` so that when exception `cephfs.Error` or any exception
under it is thrown by `self.fs.lstat()` is not left uncaught.

FS object is also passed to the method `list_one_entry_at_a_time()`, so
increasing scope of try-except is useful as it will not allow exceptions
raised in other calls to CephFS Python binding methods to be left
uncaught.

Fixes: https://tracker.ceph.com/issues/66560
Signed-off-by: Rishabh Dave <ridave@redhat.com>
(cherry picked from commit 3cff7251c86a4670768721f924b11b3de33f807b)

12 months agotest/ librbd: clean up unused 'TEST_COOKIE' 58549/head
Rongqi Sun [Tue, 9 Jul 2024 02:55:27 +0000 (02:55 +0000)]
test/ librbd: clean up unused 'TEST_COOKIE'

Clean them up because
```
/home/jenkins-build/build/workspace/ceph-pull-requests-arm64/src/test/librbd/exclusive_lock/test_mock_PreAcquireRequest.cc:47:26: error: unused variable 'TEST_COOKIE' [-Werror,-Wunused-const-variable]
static const std::string TEST_COOKIE("auto 123");
                         ^
1 error generated.
```

Ref: https://jenkins.ceph.com/job/ceph-pull-requests-arm64/59242/consoleFull#-5120798632a811ea2-3e7b-466b-84b4-d13df7e35809

Signed-off-by: Rongqi Sun <sunrongqi@huawei.com>
(cherry picked from commit 49869222c816023f696ff2a42898e2d9f3f54215)

12 months agoMerge pull request #56877 from rhcs-dashboard/wip-65170-reef
Nizamudeen A [Fri, 12 Jul 2024 07:21:44 +0000 (12:51 +0530)]
Merge pull request #56877 from rhcs-dashboard/wip-65170-reef

reef: mgr/dashboard: bump follow-redirects from 1.15.3 to 1.15.6 in /src/pybind/mgr/dashboard/frontend

Reviewed-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>
12 months agoMerge pull request #58338 from rhcs-dashboard/wip-66738-reef
Nizamudeen A [Fri, 12 Jul 2024 05:17:13 +0000 (10:47 +0530)]
Merge pull request #58338 from rhcs-dashboard/wip-66738-reef

reef: mgr/dashboard: fix clone async validators with different groups

Reviewed-by: Ankush Behl <cloudbehl@gmail.com>
Reviewed-by: nmunet <NOT@FOUND>
12 months agoMerge pull request #58522 from cbodley/wip-66896-reef
Casey Bodley [Thu, 11 Jul 2024 21:25:01 +0000 (22:25 +0100)]
Merge pull request #58522 from cbodley/wip-66896-reef

reef: install-deps.sh, do_cmake.sh: almalinux is another el flavour

Reviewed-by: Kyr Shatskyy <kyrylo.shatskyy@gmail.com>
Reviewed-by: Dan van der Ster <dan.vanderster@clyso.com>
12 months agoMerge pull request #58247 from rhcs-dashboard/wip-66487-reef
Pedro Gonzalez Gomez [Thu, 11 Jul 2024 10:21:37 +0000 (12:21 +0200)]
Merge pull request #58247 from rhcs-dashboard/wip-66487-reef

reef: mgr/dashboard: add dueTime to rgw bucket validator

Reviewed-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>
Reviewed-by: nmunet <NOT@FOUND>
12 months agoMerge pull request #57752 from rhcs-dashboard/wip-66246-reef
Pedro Gonzalez Gomez [Thu, 11 Jul 2024 10:19:31 +0000 (12:19 +0200)]
Merge pull request #57752 from rhcs-dashboard/wip-66246-reef

reef: mgr/dashboard: fix readonly landingpage

Reviewed-by: Nizamudeen A <nia@redhat.com>
12 months agomds: set the proper extra bl for the create request 58528/head
Xiubo Li [Tue, 28 May 2024 04:35:17 +0000 (12:35 +0800)]
mds: set the proper extra bl for the create request

Just in case a create request was totally completed but the client
didn't receive any unsafe and safe responses, and then when it's
retried in the MDS side it will be treated as a open request and
will miss setting the create ino for the reply message.

Finally for client it just sent a create request and then the MDS
just sent back a open reply, which will let the client miss setting
the CREATED flag and then the VFS will fail the create by returnning
a -EEXIST errno.

Fixes: https://tracker.ceph.com/issues/63265
Signed-off-by: Xiubo Li <xiubli@redhat.com>
(cherry picked from commit 3482100e8ce174d2fd3a2e7f7ed0333ae2c0a166)

12 months agoclient: set LIBMOUNT_FORCE_MOUNT2=always 58529/head
Jakob Haufe [Tue, 30 Apr 2024 15:45:24 +0000 (17:45 +0200)]
client: set LIBMOUNT_FORCE_MOUNT2=always

This is a workaround for libmount 2.38 or later on linux kernel 5.1 or
later.

libmount defaults to the new fsconfig interface which does not support
remounting the way the old mount(2) syscall did. Thus, dentry trimming
fails.

Fixes: https://tracker.ceph.com/issues/62664
Signed-off-by: Jakob Haufe <sur5r@sur5r.net>
(cherry picked from commit 9ca1d4be2b587ce1d3e6c10dc2a98195b2ab3ba9)

12 months agomds: encode the correct extra info depending on the feature bits
Xiubo Li [Tue, 28 May 2024 04:23:57 +0000 (12:23 +0800)]
mds: encode the correct extra info depending on the feature bits

In the client side it will decode the extra info depending on the
feature bits, but if we always encode it with the old version then
the client could crash or gets the corrupted data. More detail
please see "parse_reply_info_create()" in kclient code.

Fixes: https://tracker.ceph.com/issues/63265
Signed-off-by: Xiubo Li <xiubli@redhat.com>
(cherry picked from commit 1f1198b81b2442e5d6c0566bdade2f5d7dd23d12)

12 months agomds: add set_reply_extra_bl() helper support
Xiubo Li [Tue, 28 May 2024 04:18:06 +0000 (12:18 +0800)]
mds: add set_reply_extra_bl() helper support

Fixes: https://tracker.ceph.com/issues/63265
Signed-off-by: Xiubo Li <xiubli@redhat.com>
(cherry picked from commit 6615a568cdfbefb421c37ef790acba03ce20f843)

12 months agomds: cleanup the code to make it to be more readable
Xiubo Li [Tue, 28 May 2024 04:20:03 +0000 (12:20 +0800)]
mds: cleanup the code to make it to be more readable

Fixes: https://tracker.ceph.com/issues/63265
Signed-off-by: Xiubo Li <xiubli@redhat.com>
(cherry picked from commit d48b169e40a94b6df578c57bc595187c94ef48f6)

12 months agoinstall-deps.sh, do_cmake.sh: almalinux is another el flavour 58522/head
Dan van der Ster [Thu, 5 Oct 2023 16:50:36 +0000 (09:50 -0700)]
install-deps.sh, do_cmake.sh: almalinux is another el flavour

Signed-off-by: Dan van der Ster <dan.vanderster@clyso.com>
(cherry picked from commit 6e85f76385c3cffd860a8a3054bf14ce85a43b59)

Conflicts:
do_cmake.sh add rocky too

12 months agomds: relax divergent backtrace scrub failures for replicated ancestor inodes 58502/head
Venky Shankar [Fri, 5 Apr 2024 12:01:17 +0000 (08:01 -0400)]
mds: relax divergent backtrace scrub failures for replicated ancestor inodes

scrub could be verifying backtrace for an inode for which some of its
ancestors might be replicas, e.g. (from a custom debug build) some
ancestors of an inode with divergent backtrace were replicas:

```
[inode 0x3000000502f [...122,head] /volumes/qa/sv_0/b98de6ea-ed40-40d0-8e1a-9433a337a387/client.0/tmp/payload.2/multiple_rsync_payload.190107/firmware/ rep@0.1 fragtree_t(*^3) v6663 f(v493 m2024-05-01T06:38:16.403080+0000 388=289+99) n(v139 rc2024-05-01T06:55:35.239345+0000 b467915716 4880=4534+346) old_inodes=24 (inest mix) (ifile mix) | lock=0 importing=0 dirfrag=1 0x55a85d244680]
```

In such cases, the backpointer version (inode_backpointer_t::version) of the
in-memory (cache) inode can fall behind the on-disk version causing scrub to
consider the inode backtrace as divergent (memory version < on-disk version).

Sample:

```
"ondisk_value":"(2)0x30000005bba:

[<0x3000000502f/mwl8k v2126>,
<0x30000005026/firmware v6663>,
<0x30000005025/multiple_rsync_payload.190107 v3041>,
<0x10000005894/payload.2 v4873>,
<0x10000000005/tmp v6193>,<0x10000000003/client.0 v5964>,
<0x10000000002/b98de6ea-ed40-40d0-8e1a-9433a337a387 v5817>
,<0x10000000001/sv_0 v5837>,
<0x10000000000/qa v6241>,
<0x1/volumes v4036>]

"memoryvalue":"(2)0x30000005bba:

[<0x3000000502f/mwl8k v2126>,
<0x30000005026/firmware v6663>,
<0x30000005025/multiple_rsync_payload.190107 v3041>,
<0x10000005894/payload.2 v4873>,
<0x10000000005/tmp v6081>,
<0x10000000003/client.0 v5942>,
<0x10000000002/b98de6ea-ed40-40d0-8e1a-9433a337a387 v5709>,
<0x10000000001/sv_0 v5819>,
<0x10000000000/qa v6121>,
<0x1/volumes v4022>]
```

Fixes: http://tracker.ceph.com/issues/64730
Signed-off-by: Venky Shankar <vshankar@redhat.com>
(cherry picked from commit b98bb860a899ba86caf2d0e4d82b5882a13ae840)

12 months agomds: set the correct WRLOCK flag always in wrlock_force() 58497/head
Xiubo Li [Thu, 25 Apr 2024 04:06:25 +0000 (12:06 +0800)]
mds: set the correct WRLOCK flag always in wrlock_force()

The wrlock is not like the xlock, which needs to be acquired in
the CInode's auth always, and it is based on the CDir's auths instead.

When a remote_wrlock is acquired and the local MDS will add a lock
item and marks it as REMOTE_WRLOCK. And later when the local MDS try
to force wrlock in the emplace_lock() will just return the existing
lock item without updating the WRLOCK flag. So when cleaning the
requests later it will just release the remote locks and then removes
lock items directly, which will miss releasing the local wrlock
reference.

Fixes: https://tracker.ceph.com/issues/65630
Signed-off-by: Xiubo Li <xiubli@redhat.com>
(cherry picked from commit 9a6f249b96faed085a7dea251b0fd81a28330025)

12 months agoqa: adding test for preventing scrub when mds is inactive 58493/head
neeraj pratap singh [Tue, 10 Oct 2023 05:55:31 +0000 (11:25 +0530)]
qa: adding test for preventing scrub when mds is inactive

Fixes: https://tracker.ceph.com/issues/62537
Signed-off-by: Neeraj Pratap Singh <neesingh@redhat.com>
(cherry picked from commit b9a2d0571f784b671a64ba4404b12f89a3bf1686)

12 months agomds: prevent scrub start for standby-replay MDS
neeraj pratap singh [Tue, 5 Sep 2023 11:13:39 +0000 (16:43 +0530)]
mds: prevent scrub start for standby-replay MDS

Fixes: https://tracker.ceph.com/issues/62537
Signed-off-by: Neeraj Pratap Singh <neesingh@redhat.com>
(cherry picked from commit 525a3beeb71a62c468ee13a01f4c456e4d471871)

12 months agoMerge pull request #58489 from zdover23/wip-doc-2024-07-10-backport-58484-to-reef
Zac Dover [Wed, 10 Jul 2024 00:27:03 +0000 (10:27 +1000)]
Merge pull request #58489 from zdover23/wip-doc-2024-07-10-backport-58484-to-reef

reef: doc: Correct link to "Device management"

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
12 months agoMerge pull request #58469 from zdover23/wip-doc-2024-07-09-backport-58454-to-reef
Zac Dover [Wed, 10 Jul 2024 00:26:42 +0000 (10:26 +1000)]
Merge pull request #58469 from zdover23/wip-doc-2024-07-09-backport-58454-to-reef

reef: doc/cephfs: fix typo

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
12 months agodoc: Correct link to "Device management" 58489/head
Matthew Vernon [Tue, 9 Jul 2024 16:02:31 +0000 (17:02 +0100)]
doc: Correct link to "Device management"

The current link goes to `cephadm/services/rados/operations/devices` which is a 404, it instead needs to be `rados/operations/devices/`.

Signed-off-by: Matthew Vernon <mvernon@wikimedia.org>
(cherry picked from commit d1a7cc4978adacd4e11df2d3e0886b191b118cc1)

12 months agomgr/dashboard: nfs export enhancement for CEPHFS 58475/head
Avan Thakkar [Thu, 27 Jun 2024 10:06:42 +0000 (15:36 +0530)]
mgr/dashboard: nfs export enhancement for CEPHFS

Fixes: https://tracker.ceph.com/issues/66718
Signed-off-by: Avan Thakkar <athakkar@redhat.com>
(cherry picked from commit 69a45db9c3cc9ed4ac36e7f2d6db78bd7940530b)

12 months agodoc/cephfs: fix typo 58469/head
spdfnet [Mon, 8 Jul 2024 08:03:28 +0000 (09:03 +0100)]
doc/cephfs: fix typo

blanacer -> balancer

Signed-off-by: spdfnet <32593931+spdfnet@users.noreply.github.com>
(cherry picked from commit d94f90b757189e269882523178a401cd4411a946)

12 months agoMerge pull request #58207 from idryomov/wip-66300-reef
Yuri Weinstein [Mon, 8 Jul 2024 19:26:11 +0000 (12:26 -0700)]
Merge pull request #58207 from idryomov/wip-66300-reef

reef: librbd: disallow group snap rollback if memberships don't match

Reviewed-by: Ramana Raja <rraja@redhat.com>
12 months agoMerge pull request #58345 from idryomov/wip-66570-reef
Yuri Weinstein [Mon, 8 Jul 2024 19:25:23 +0000 (12:25 -0700)]
Merge pull request #58345 from idryomov/wip-66570-reef

reef: librbd: make diff-iterate in fast-diff mode aware of encryption

Reviewed-by: Ramana Raja <rraja@redhat.com>
12 months agolibrbd: make diff-iterate in fast-diff mode aware of encryption 58345/head
Ilya Dryomov [Thu, 20 Jun 2024 19:13:56 +0000 (21:13 +0200)]
librbd: make diff-iterate in fast-diff mode aware of encryption

diff-iterate wasn't updated when librbd was being prepared to support
encryption in commit 8d6a47933269 ("librbd: add crypto image dispatch
layer").  This is even noted in [1]:

> The two places I skipped for now are DiffIterate and TrimRequest.

CryptoImageDispatch has since been removed, but diff-iterate in
fast-diff mode is still unaware of encryption and just assumes that all
offsets are raw.  This means that the callback gets invoked with
incorrect image offsets when encryption is loaded.  For example, for
a LUKS1-formatted image with some data at offsets 0 and 20971520,
diff-iterate with encryption loaded reports

  0~4194304
  4194304~4194304
  25165824~4194304

instead of

  0~4194304
  20971520~4194304

as "exists".  For any piece of code that is using diff-iterate to
optimize block-by-block processing (e.g. copy an encrypted source image
to a differently-encrypted destination image), this is fatal: it would
skip processing block 20971520 which has data and instead process block
25165824 which doesn't have any data and was to be skipped, producing
a corrupted destination image.

[1] https://github.com/ceph/ceph/pull/37935#issue-735278403

Fixes: https://tracker.ceph.com/issues/66570
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit cdeb0efce3f9f857ad6d5b7ff3965f3292cb571a)

12 months agotest/librbd: extend DiffIterateTest.DiffIterateDeterministic{,PP}
Ilya Dryomov [Fri, 21 Jun 2024 13:56:27 +0000 (15:56 +0200)]
test/librbd: extend DiffIterateTest.DiffIterateDeterministic{,PP}

Currently we are laying data only at the beginning of an object.
Extend the skeletons to write to three different offsets in the middle
and also at the end of the object.

Separately, make C and C++ API test variants slightly different in
terms of offsets being targeted to not go through exactly the same
scenario twice.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit da69d1fdbf69cf37ba8b75ea9a230634bb25ae76)

12 months agotest/librbd: refactor DiffIterateTest.DiffIterateDeterministic{,PP}
Ilya Dryomov [Fri, 21 Jun 2024 13:53:05 +0000 (15:53 +0200)]
test/librbd: refactor DiffIterateTest.DiffIterateDeterministic{,PP}

In preparation for extending coverage, extract test logic into a couple
of parametrized helpers.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit 1df87998dc30be6e3b4c90f5833539c7ccc70911)

12 months agoMerge pull request #58211 from idryomov/wip-66418-reef
Yuri Weinstein [Mon, 8 Jul 2024 14:44:18 +0000 (07:44 -0700)]
Merge pull request #58211 from idryomov/wip-66418-reef

reef: librbd: diff-iterate shouldn't crash on an empty byte range

Reviewed-by: Mykola Golub <mgolub@suse.com>
12 months agoMerge pull request #57765 from idryomov/wip-66232-reef
Yuri Weinstein [Mon, 8 Jul 2024 14:43:25 +0000 (07:43 -0700)]
Merge pull request #57765 from idryomov/wip-66232-reef

reef: qa/suites/rbd: override extra_system_packages directly on install task

Reviewed-by: Ramana Raja <rraja@redhat.com>
12 months agoMerge pull request #57603 from ajarr/wip-66116-reef
Yuri Weinstein [Mon, 8 Jul 2024 14:42:41 +0000 (07:42 -0700)]
Merge pull request #57603 from ajarr/wip-66116-reef

reef: librbd: create rbd_trash object during pool initialization and namespace creation

Reviewed-by: Ilya Dryomov <idryomov@redhat.com>
12 months agoMerge pull request #57570 from idryomov/wip-65813-reef
Yuri Weinstein [Mon, 8 Jul 2024 14:41:20 +0000 (07:41 -0700)]
Merge pull request #57570 from idryomov/wip-65813-reef

reef: librbd: don't crash on a zero-length read if buffer is NULL

Reviewed-by: Ramana Raja <rraja@redhat.com>
12 months agotest/librbd: avoid caching effects TestGroup.add_snapshot{,PP} 58207/head
Ilya Dryomov [Wed, 26 Jun 2024 16:24:30 +0000 (18:24 +0200)]
test/librbd: avoid caching effects TestGroup.add_snapshot{,PP}

After rollback started being tested in commit b3977c53c930
("test/librbd: make rollback in TestGroup.add_snapshot{,PP}
meaningful"), these tests can fail on comparing post-rollback
data to expected data if run with exclusive lock disabled.
This doesn't occur with exclusive lock enabled because the RBD
cache gets invalidated implicitly before releasing the lock.

While at it, pass LIBRADOS_OP_FLAG_FADVISE_FUA to avoid relying
on any cache settings that happen to be in effect.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit a7e314c2ef899406eb3ade6c5f862622ce1e698f)

12 months agotest/librbd: give groups unique names in TestGroup tests
Ilya Dryomov [Wed, 26 Jun 2024 16:15:23 +0000 (18:15 +0200)]
test/librbd: give groups unique names in TestGroup tests

Otherwise, a failing add_snapshot test causes add_snapshotPP test to
fail too, for example.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit 4c69370f6b0e4c04e77fde67a9c199af6ea13649)

12 months agotest/librbd: drop unused variable in TestGroup.group_create
Ilya Dryomov [Wed, 26 Jun 2024 16:12:33 +0000 (18:12 +0200)]
test/librbd: drop unused variable in TestGroup.group_create

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit db52fca5b1fc826ce9d7cf63d7fd1aa86addccf0)

12 months agolibrbd: drop unused parameter on group_snap_rollback_by_record()
Ilya Dryomov [Fri, 14 Jun 2024 12:04:53 +0000 (14:04 +0200)]
librbd: drop unused parameter on group_snap_rollback_by_record()

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit afbb744ebe9bddc1d71c84598fee832bf7843c2b)

12 months agolibrbd: disallow group snap rollback if memberships don't match
Ilya Dryomov [Fri, 14 Jun 2024 12:04:39 +0000 (14:04 +0200)]
librbd: disallow group snap rollback if memberships don't match

Before proceeding with group rollback, ensure that the set of images
that took part in the group snapshot matches the set of images that are
currently part of the group.  Otherwise, because we preserve affected
snapshots when an image is removed from the group, data loss can ensue
where an image gets rolled back while part of another group or not part
of any group but long repurposed for something else.

Similarly, ensure that the group snapshot is complete.

Fixes: https://tracker.ceph.com/issues/66300
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit f212a9ca5b9238b80ae5c728a7bf99366ed910a1)

Conflicts:
src/test/pybind/test_rbd.py [ commit d7fd66ec9944 ("librbd: add
  rbd_clone4() API to take parent snapshot by ID") not in reef ]

12 months agoMerge pull request #58421 from guits/wip-66831-reef
Guillaume Abrioux [Fri, 5 Jul 2024 08:10:17 +0000 (10:10 +0200)]
Merge pull request #58421 from guits/wip-66831-reef

reef: qa: fix log errors for cephadm tests

12 months agoMerge pull request #58220 from guits/wip-66024-reef
Guillaume Abrioux [Fri, 5 Jul 2024 08:09:17 +0000 (10:09 +0200)]
Merge pull request #58220 from guits/wip-66024-reef

reef: ceph-volume: create LVs when using partitions

12 months agoMerge pull request #58432 from zdover23/wip-doc-2024-07-05-backport-58368-to-reef
Anthony D'Atri [Thu, 4 Jul 2024 20:31:18 +0000 (16:31 -0400)]
Merge pull request #58432 from zdover23/wip-doc-2024-07-05-backport-58368-to-reef

reef: doc/rados: document manually passing search domain

12 months agodoc/rados: document manually passing search domain 58432/head
Zac Dover [Mon, 1 Jul 2024 06:36:02 +0000 (16:36 +1000)]
doc/rados: document manually passing search domain

Document how to manually pass the search domain to "mon_dns_srv_name" in
doc/rados/configuration/mon-lookup-dns.rst.

This commit is made in response to a request by Lander Duncan that was made on the [ceph-users] mailing list, and can be seen here: https://lists.ceph.io/hyperkitty/list/ceph-users@ceph.io/thread/F7V4CWLIYCAJ4JXI2JLNY6QPCFPR4SLA/

Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 98938a0312dd0c8e0b293ed9aa2e0760cc9619fa)