]> git.apps.os.sepia.ceph.com Git - ceph.git/log
ceph.git
13 months agomgr/dashboard: fix duplicate grafana panels when on mgr failover 56929/head
Avan Thakkar [Tue, 2 Apr 2024 09:45:48 +0000 (15:15 +0530)]
mgr/dashboard: fix duplicate grafana panels when on mgr failover

Fixes: https://tracker.ceph.com/issues/64970
Signed-off-by: Avan Thakkar <athakkar@redhat.com>
(cherry picked from commit e6ff23d1297cbcf5da030178c53e14a6b6ce5f43)

Conflicts:
src/pybind/mgr/cephadm/templates/services/prometheus/prometheus.yml.j2
src/pybind/mgr/cephadm/tests/test_services.py

13 months agoMerge pull request #55857 from lxbsz/wip-64655
Jos Collin [Wed, 19 Jun 2024 04:38:40 +0000 (10:08 +0530)]
Merge pull request #55857 from lxbsz/wip-64655

reef: qa: switch to use the merge fragment for fscrypt

Reviewed-by: Jos Collin <jcollin@redhat.com>
13 months agoqa: switch to use the merge fragment for fscrypt 55857/head
Xiubo Li [Wed, 29 Mar 2023 05:04:47 +0000 (13:04 +0800)]
qa: switch to use the merge fragment for fscrypt

Currently only the upstream kclient supports fscrypt feature.

Fixes: https://tracker.ceph.com/issues/59195
Signed-off-by: Xiubo Li <xiubli@redhat.com>
(cherry picked from commit 04fab9b5f967eab51beef5429e2b6317909bde38)

13 months agoqa: switch to use .qa/cephfs/mount/ instead
Xiubo Li [Wed, 29 Mar 2023 04:59:27 +0000 (12:59 +0800)]
qa: switch to use .qa/cephfs/mount/ instead

Fixes: https://tracker.ceph.com/issues/59195
Signed-off-by: Xiubo Li <xiubli@redhat.com>
(cherry picked from commit e7272e7a7e46909e07d862e3c72375fe25de7bc6)

13 months agoqa: remove the FuseMount/KernelMount check in test_fscrypt.py
Xiubo Li [Wed, 29 Mar 2023 04:51:44 +0000 (12:51 +0800)]
qa: remove the FuseMount/KernelMount check in test_fscrypt.py

Will use the postmerge fragment to check this.

Fixes: https://tracker.ceph.com/issues/59195
Signed-off-by: Xiubo Li <xiubli@redhat.com>
(cherry picked from commit f52f1316b7a254d57b4ebbc844910a1dacb3174f)

13 months agoMerge pull request #53628 from lxbsz/wip-62866
Jos Collin [Tue, 18 Jun 2024 11:07:57 +0000 (16:37 +0530)]
Merge pull request #53628 from lxbsz/wip-62866

reef: qa: increase the http postBuffer size and disable sslVerify

Reviewed-by: Jos Collin <jcollin@redhat.com>
13 months agoMerge pull request #56415 from batrick/wip-65088-reef
Jos Collin [Tue, 18 Jun 2024 08:54:25 +0000 (14:24 +0530)]
Merge pull request #56415 from batrick/wip-65088-reef

reef: qa/cephfs: add mgr debugging

Reviewed-by: Jos Collin <jcollin@redhat.com>
13 months agoMerge pull request #57162 from batrick/i63782-v2
Jos Collin [Tue, 18 Jun 2024 08:52:38 +0000 (14:22 +0530)]
Merge pull request #57162 from batrick/i63782-v2

reef: qa: add support/qa for cephfs-shell on CentOS 9 / RHEL9

Reviewed-by: Jos Collin <jcollin@redhat.com>
13 months agoMerge pull request #56666 from batrick/wip-65105-reef
Jos Collin [Tue, 18 Jun 2024 08:44:03 +0000 (14:14 +0530)]
Merge pull request #56666 from batrick/wip-65105-reef

reef: qa/cephfs: add probabilistic ignorelist for pg_health

Reviewed-by: Jos Collin <jcollin@redhat.com>
13 months agoMerge pull request #56664 from batrick/wip-65274-reef
Jos Collin [Tue, 18 Jun 2024 08:41:24 +0000 (14:11 +0530)]
Merge pull request #56664 from batrick/wip-65274-reef

reef:  mds: some request errors come from errno.h rather than fs_types.h

Reviewed-by: Jos Collin <jcollin@redhat.com>
13 months agoMerge pull request #56886 from dparmar18/wip-65315-reef
Jos Collin [Tue, 18 Jun 2024 08:39:13 +0000 (14:09 +0530)]
Merge pull request #56886 from dparmar18/wip-65315-reef

reef: mds: CInode::item_caps used in two different lists

Reviewed-by: Jos Collin <jcollin@redhat.com>
13 months agoMerge pull request #56955 from batrick/wip-65325-reef
Jos Collin [Tue, 18 Jun 2024 08:36:12 +0000 (14:06 +0530)]
Merge pull request #56955 from batrick/wip-65325-reef

reef: client: log debug message when requesting unmount

Reviewed-by: Jos Collin <jcollin@redhat.com>
Reviewed-by: Venky Shankar <vshankar@redhat.com>
13 months agoMerge pull request #56953 from batrick/wip-65365-reef
Jos Collin [Tue, 18 Jun 2024 08:33:49 +0000 (14:03 +0530)]
Merge pull request #56953 from batrick/wip-65365-reef

reef: qa: test test_kill_mdstable for all mount types

Reviewed-by: Jos Collin <jcollin@redhat.com>
13 months agoMerge pull request #56836 from rishabh-d-dave/wip-65440-reef
Jos Collin [Tue, 18 Jun 2024 08:30:03 +0000 (14:00 +0530)]
Merge pull request #56836 from rishabh-d-dave/wip-65440-reef

reef: qa/cephfs: CephFSTestCase.create_client() must keyring

Reviewed-by: Jos Collin <jcollin@redhat.com>
13 months agoMerge pull request #56895 from chrisphoffman/wip-65488-reef
Jos Collin [Tue, 18 Jun 2024 08:26:51 +0000 (13:56 +0530)]
Merge pull request #56895 from chrisphoffman/wip-65488-reef

reef: mds: Add fragment to scrub

Reviewed-by: Jos Collin <jcollin@redhat.com>
13 months agoMerge pull request #57834 from rishabh-d-dave/wip-65921-reef
Jos Collin [Tue, 18 Jun 2024 08:23:38 +0000 (13:53 +0530)]
Merge pull request #57834 from rishabh-d-dave/wip-65921-reef

reef: mds: don't add counters in warning for standby-replay MDS

Reviewed-by: Jos Collin <jcollin@redhat.com>
13 months agoMerge pull request #56747 from lxbsz/wip-65321
Jos Collin [Tue, 18 Jun 2024 08:16:15 +0000 (13:46 +0530)]
Merge pull request #56747 from lxbsz/wip-65321

 reef: mds: defer trim() until after the last cache_rejoin ack being received

Reviewed-by: Jos Collin <jcollin@redhat.com>
13 months agoMerge pull request #56701 from joscollin/wip-65319-reef
Jos Collin [Tue, 18 Jun 2024 06:29:22 +0000 (11:59 +0530)]
Merge pull request #56701 from joscollin/wip-65319-reef

reef: cephfs-mirror: use monotonic clock

Reviewed-by: Leonid Usov <leonid.usov@ibm.com>
13 months agomds: defer trim() until after the last cache_rejoin ack being received 56747/head
Xiubo Li [Wed, 26 Jul 2023 06:34:01 +0000 (14:34 +0800)]
mds: defer trim() until after the last cache_rejoin ack being received

Just before the last cache_rejoin ack being received the entire
subtree, together with the inode subtree root belongs to, were
trimmed the isolated_inodes list couldn't be correctly erased. We
should defer calling the trim() until the last cache_rejoin ack
being received.

Fixes: https://tracker.ceph.com/issues/62036
Signed-off-by: Xiubo Li <xiubli@redhat.com>
(cherry picked from commit dd783803f44d8cc424fdf33fef0a6a9203be3447)

13 months agoMerge pull request #58106 from zdover23/wip-doc-2024-06-18-backport-58901-to-reef
Anthony D'Atri [Tue, 18 Jun 2024 03:55:16 +0000 (23:55 -0400)]
Merge pull request #58106 from zdover23/wip-doc-2024-06-18-backport-58901-to-reef

reef: doc/dev: link to ceph.io leads list

13 months agoMerge pull request #56019 from vshankar/wip-64756-reef
Jos Collin [Tue, 18 Jun 2024 03:43:18 +0000 (09:13 +0530)]
Merge pull request #56019 from vshankar/wip-64756-reef

reef: qa: ignore container checkpoint/restore related selinux denials for centos9

Reviewed-by: Jos Collin <jcollin@redhat.com>
13 months agodoc/dev: link to ceph.io leads list 58106/head
Zac Dover [Mon, 17 Jun 2024 14:39:10 +0000 (00:39 +1000)]
doc/dev: link to ceph.io leads list

Remove the docs.ceph.com list of team leads, and link to the list of
leads on ceph.io.

Fixes: https://tracker.ceph.com/issues/66392
Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 5134aeb78bb1d9d956442333ad372e1f397ff7b3)

13 months agoMerge pull request #57437 from joscollin/wip-65980-reef
Jos Collin [Mon, 17 Jun 2024 15:08:27 +0000 (20:38 +0530)]
Merge pull request #57437 from joscollin/wip-65980-reef

reef: cephfs_mirror: increment sync_failures when sync_perms() and sync_snaps() fails

Reviewed-by: Venky Shankar <vshankar@redhat.com>
13 months agoMerge pull request #57441 from joscollin/wip-65983-reef
Jos Collin [Mon, 17 Jun 2024 15:05:24 +0000 (20:35 +0530)]
Merge pull request #57441 from joscollin/wip-65983-reef

reef: mgr/stats: initialize mx_last_updated in FSPerfStats

Reviewed-by: Venky Shankar <vshankar@redhat.com>
13 months agoMerge pull request #57763 from joscollin/wip-66271-reef
Jos Collin [Mon, 17 Jun 2024 15:03:55 +0000 (20:33 +0530)]
Merge pull request #57763 from joscollin/wip-66271-reef

reef: pybind/mgr/mirroring: Fix KeyError: 'directory_count' in daemon status

Reviewed-by: Venky Shankar <vshankar@redhat.com>
13 months agoMerge pull request #57115 from lxbsz/wip-65676
Jos Collin [Mon, 17 Jun 2024 15:01:14 +0000 (20:31 +0530)]
Merge pull request #57115 from lxbsz/wip-65676

reef: mds: fix the description for inotable testing only options

Reviewed-by: Jos Collin <jcollin@redhat.com>
Reviewed-by: Leonid Usov <leonid.usov@ibm.com>
Reviewed-by: Anthony D Atri <anthony.datri@gmail.com>
13 months agoMerge pull request #57227 from kotreshhr/wip-65405-reef
Jos Collin [Mon, 17 Jun 2024 14:43:19 +0000 (20:13 +0530)]
Merge pull request #57227 from kotreshhr/wip-65405-reef

reef: mds: log at a lower level when stopping

Reviewed-by: Jos Collin <jcollin@redhat.com>
13 months agoMerge pull request #56403 from batrick/wip-65061-reef
Jos Collin [Mon, 17 Jun 2024 14:38:56 +0000 (20:08 +0530)]
Merge pull request #56403 from batrick/wip-65061-reef

reef: qa/cephfs: ignorelist clog of MDS_UP_LESS_THAN_MAX

Reviewed-by: Jos Collin <jcollin@redhat.com>
13 months agoMerge pull request #56022 from vshankar/wip-64763-reef
Jos Collin [Mon, 17 Jun 2024 14:36:42 +0000 (20:06 +0530)]
Merge pull request #56022 from vshankar/wip-64763-reef

reef: qa/cephfs: add more ignorelist entries

Reviewed-by: Jos Collin <jcollin@redhat.com>
13 months agoMerge pull request #55391 from dparmar18/wip-64066-reef
Jos Collin [Mon, 17 Jun 2024 13:22:08 +0000 (18:52 +0530)]
Merge pull request #55391 from dparmar18/wip-64066-reef

reef: qa: fix error reporting string in assert_cluster_log

Reviewed-by: Jos Collin <jcollin@redhat.com>
13 months agoMerge pull request #52985 from leonid-s-usov/backport/strip-nofail/reef
Jos Collin [Mon, 17 Jun 2024 13:11:55 +0000 (18:41 +0530)]
Merge pull request #52985 from leonid-s-usov/backport/strip-nofail/reef

reef: nofail option in fstab not supported

Reviewed-by: Jos Collin <jcollin@redhat.com>
13 months agoMerge pull request #56951 from batrick/wip-65520-reef
Jos Collin [Mon, 17 Jun 2024 11:05:10 +0000 (16:35 +0530)]
Merge pull request #56951 from batrick/wip-65520-reef

reef: qa: ignore human-friendly POOL_APP_NOT_ENABLED in clog

Reviewed-by: Jos Collin <jcollin@redhat.com>
13 months agoMerge pull request #57172 from batrick/wip-65707-reef
Jos Collin [Mon, 17 Jun 2024 10:17:40 +0000 (15:47 +0530)]
Merge pull request #57172 from batrick/wip-65707-reef

reef: qa: increase debugging for snap_schedule

Reviewed-by: Jos Collin <jcollin@redhat.com>
13 months agoMerge pull request #57174 from batrick/wip-65709-reef
Jos Collin [Mon, 17 Jun 2024 10:11:32 +0000 (15:41 +0530)]
Merge pull request #57174 from batrick/wip-65709-reef

reef: client: clear resend_mds only after sending request

Reviewed-by: Jos Collin <jcollin@redhat.com>
13 months agoMerge pull request #57446 from mchangir/wip-65898-reef
Jos Collin [Mon, 17 Jun 2024 10:03:35 +0000 (15:33 +0530)]
Merge pull request #57446 from mchangir/wip-65898-reef

reef: mgr/snap_schedule: restore yearly spec to lowercase y

Reviewed-by: Anthony D Atri <anthony.datri@gmail.com>
Reviewed-by: Jos Collin <jcollin@redhat.com>
13 months agoMerge pull request #57557 from lxbsz/wip-66053
Jos Collin [Mon, 17 Jun 2024 09:50:50 +0000 (15:20 +0530)]
Merge pull request #57557 from lxbsz/wip-66053

reef: qa/fsx: use a specified sha1 to build the xfstest-dev

Reviewed-by: Leonid Usov <leonid.usov@ibm.com>
Reviewed-by: Jos Collin <jcollin@redhat.com>
13 months agoMerge pull request #57837 from rishabh-d-dave/wip-65927-reef
Jos Collin [Mon, 17 Jun 2024 09:46:25 +0000 (15:16 +0530)]
Merge pull request #57837 from rishabh-d-dave/wip-65927-reef

reef: mon,cephfs: require confirmation flag to bring down unhealthy MDS

Reviewed-by: Anthony D Atri <anthony.datri@gmail.com>
Reviewed-by: Jos Collin <jcollin@redhat.com>
13 months agoMerge pull request #58076 from idryomov/wip-cephadm-flake8-fix-reef
Pedro Gonzalez Gomez [Mon, 17 Jun 2024 07:19:31 +0000 (09:19 +0200)]
Merge pull request #58076 from idryomov/wip-cephadm-flake8-fix-reef

reef: mgr/cephadm: fix flake8 test failures

Reviewed-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>
13 months agomgr/cephadm: fix flake8 test failures 58076/head
Nizamudeen A [Sun, 16 Jun 2024 09:11:04 +0000 (14:41 +0530)]
mgr/cephadm: fix flake8 test failures

after a recent update on flake8, some tests in cephadm started failing
```
flake8: commands[0]> flake8 --config=tox.ini alerts balancer cephadm cli_api crash devicehealth diskprediction_local hello iostat localpool nfs orchestrator prometheus rbd_support rgw selftest smb
cephadm/configchecks.py:94:6: E204 whitespace after decorator '@'
cephadm/module.py:3017:10: E204 whitespace after decorator '@'
cephadm/module.py:3344:144: E226 missing whitespace around arithmetic operator
cephadm/tests/test_cephadm.py:982:6: E204 whitespace after decorator '@'
3     E204 whitespace after decorator '@'
1     E226 missing whitespace around arithmetic operator
```

Signed-off-by: Nizamudeen A <nia@redhat.com>
(cherry picked from commit a9563d91c2ac405ae9101bd4e63fa9b749d54432)

13 months agoMerge pull request #58060 from zdover23/wip-doc-2024-06-16-backport-58057-to-reef
Anthony D'Atri [Sun, 16 Jun 2024 06:16:45 +0000 (02:16 -0400)]
Merge pull request #58060 from zdover23/wip-doc-2024-06-16-backport-58057-to-reef

reef: doc/rados: explain replaceable parts of command

13 months agodoc/rados: explain replaceable parts of command 58060/head
Zac Dover [Sat, 15 Jun 2024 11:55:18 +0000 (21:55 +1000)]
doc/rados: explain replaceable parts of command

Add an explanation that directs the reader to replace the "X" part of
the command "ceph tell mon.X mon_status" with the value specific to the
reader's Ceph cluster (which is (probably) not "X").

In the future, such replaceable strings in commands may be bounded by
angle brackets ("<" and ">").

This improvement to the documentation was suggested on the [ceph-users]
email list by Joel Davidow. This email, an absolute model of user
engagement with an upstream project, can be reviewed here:
https://lists.ceph.io/hyperkitty/list/ceph-users@ceph.io/message/KF67F5TXFSSTPXV7EKL6JKLA5KZQDLDQ/

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit d071ad2575c86f300a9ba39df3c4949e5dc9c47d)

13 months agoMerge pull request #57994 from ljflores/wip-66450-reef
Laura Flores [Fri, 14 Jun 2024 15:42:12 +0000 (10:42 -0500)]
Merge pull request #57994 from ljflores/wip-66450-reef

reef: mgr/k8sevents: update V1Events to CoreV1Events

13 months agoMerge pull request #58050 from zdover23/wip-doc-2024-06-14-backport-58007-to-reef
Anthony D'Atri [Fri, 14 Jun 2024 15:14:18 +0000 (11:14 -0400)]
Merge pull request #58050 from zdover23/wip-doc-2024-06-14-backport-58007-to-reef

reef: doc/rados: add pg-states and pg-concepts to tree

13 months agodoc/rados: add pg-states and pg-concepts to tree 58050/head
Zac Dover [Thu, 13 Jun 2024 11:29:47 +0000 (21:29 +1000)]
doc/rados: add pg-states and pg-concepts to tree

Add "pg-states" and "pg-concepts" to the left tree pane on
docs.ceph.com.

This commit has been made in response to a request from the upstream
made in https://pad.ceph.com/p/Report_Documentation_Bugs.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 0629f47faf73a5b88adbeceaf022ee23111bae7d)

13 months agomgr/k8sevents: update V1Events to CoreV1Events 57994/head
Nizamudeen A [Fri, 3 May 2024 08:56:19 +0000 (14:26 +0530)]
mgr/k8sevents: update V1Events to CoreV1Events

centos9 only provides kubernetes 26.1.0 as base dep and hence the
k8sevents code needs to be updated accordingly. the api changes happened
in kuberenetes while 19.0.0 was released

Fixes: https://tracker.ceph.com/issues/65627
Fixes: https://tracker.ceph.com/issues/64981
Signed-off-by: Nizamudeen A <nia@redhat.com>
(cherry picked from commit 6af964719217d720e6c2fd1ba2a607f6255d2604)

13 months agoMerge pull request #57983 from zdover23/wip-doc-2024-06-12-backport-57976-to-reef
Anthony D'Atri [Wed, 12 Jun 2024 13:06:46 +0000 (09:06 -0400)]
Merge pull request #57983 from zdover23/wip-doc-2024-06-12-backport-57976-to-reef

reef: doc/glossary: Add "S3"

13 months agoqa/cephfs: set joinable on FS before exiting tests in TestFSFail 57837/head
Rishabh Dave [Tue, 7 May 2024 14:50:55 +0000 (20:20 +0530)]
qa/cephfs: set joinable on FS before exiting tests in TestFSFail

After running TestFSFail, CephFSTestCase.tearDown() fails attempting
to unmount CephFS. Set joinable on FS and wait for the MDS to be up
before exiting the test. This will ensure that unmounting is
successful in teardown.

Fixes: https://tracker.ceph.com/issues/65841
Signed-off-by: Rishabh Dave <ridave@redhat.com>
(cherry picked from commit faa30e03f31551a71ebb8330dbbe7005d9ddd559)

13 months agoqa/cephfs: pass MDS name, not FS name, to "ceph mds fail" cmd
Rishabh Dave [Wed, 8 May 2024 13:59:11 +0000 (19:29 +0530)]
qa/cephfs: pass MDS name, not FS name, to "ceph mds fail" cmd

This issue was not caught in original QA run because "ceph mds fail"
returns 0 even though MDS name received by it in argument is
non-existent. This is done for the sake of idempotency, however it
caused this bug to go uncaught.

Fixea: https://tracker.ceph.com/issues/65864
Signed-off-by: Rishabh Dave <ridave@redhat.com>
(cherry picked from commit ab643f7a501797634a366fd29bf4acef6a8f0cf2)

13 months agoqa/cephfs: add tests failing MDS and FS when MDS is unhealthy
Rishabh Dave [Mon, 25 Mar 2024 12:05:38 +0000 (17:35 +0530)]
qa/cephfs: add tests failing MDS and FS when MDS is unhealthy

Add tests to verify that the confirmation flag is mandatory for running
commands "ceph mds fail" and "ceph fs fail" when MDS has one of the two
health warnings: MDS_CACHE_OVERSIZE or MDS_TRIM.

Also, add MDS_CACHE_OVERSIZE and MDS_TRIM to ignorelist for
test_admin.py so that QA jobs knows this an expected failure.

Signed-off-by: Rishabh Dave <ridave@redhat.com>
(cherry picked from commit 214d614309a75fe9926d1a46be5b184c7cd0cbc1)

13 months agoqa/cephfs: pass confirmation flag to fs fail in tear down code
Rishabh Dave [Mon, 25 Mar 2024 12:01:01 +0000 (17:31 +0530)]
qa/cephfs: pass confirmation flag to fs fail in tear down code

Since "ceph fs fail" command now requires the confirmation flag when
Ceph cluster has either health warning MDS_TRIM or MDS_CACHE_OVERSIZE,
update tear down in QA code. During the teardown, the CephFS should be
failed, regardless of whether or not Ceph cluster has health warnings,
since it is teardown.

Signed-off-by: Rishabh Dave <ridave@redhat.com>
(cherry picked from commit a1af1bf538a20ae7d021098f5787534ce61dbd7c)

13 months agoPendingReleaseNotes: note need of confirmation for "ceph fs fail"
Rishabh Dave [Fri, 19 Apr 2024 11:38:50 +0000 (17:08 +0530)]
PendingReleaseNotes: note need of confirmation for "ceph fs fail"

Signed-off-by: Rishabh Dave <ridave@redhat.com>
(cherry picked from commit 24816428e65069fe84bef22f5e134bb1b651ef52)

13 months agodoc/cephfs: mention need of confirmation for "ceph fs fail"
Rishabh Dave [Fri, 19 Apr 2024 11:34:54 +0000 (17:04 +0530)]
doc/cephfs: mention need of confirmation for "ceph fs fail"

Signed-off-by: Rishabh Dave <ridave@redhat.com>
(cherry picked from commit de18c5a1cc0c20be9cd1dc153d008828b5eadc9b)

13 months agocephfs,mon: require confirmation to fail unhealthy FS
Rishabh Dave [Wed, 13 Mar 2024 09:31:02 +0000 (15:01 +0530)]
cephfs,mon: require confirmation to fail unhealthy FS

Confirmation flag must be passed when running the command "ceph fs fail"
when the MDS for this FS has either of the two health warnings: MDS_TRIM
or MDS_CACHE_OVERSIZED. Else, the command will fail and print an
appropriate error message.

Restarting an MDS with these health warnings is not recommened since it
will have a slow recovery during restart which will create new problems.

Fixes: https://tracker.ceph.com/issues/61866
Signed-off-by: Rishabh Dave <ridave@redhat.com>
(cherry picked from commit b901616494a8359e59f7ec2cd661077c4aced01c)

Conflicts:
- src/mon/FSCommands.cc
  -  lines surrounding the patch are different in reef compared to main.
     the reef code was still accessing "mds_map" directly instead of
     accessing it using "get_mds_map()".
  - return value of get_filesystem() is different in main.

13 months agodoc/glossary: Add "S3" 57983/head
Zac Dover [Tue, 11 Jun 2024 23:11:35 +0000 (09:11 +1000)]
doc/glossary: Add "S3"

Add "S3" entry to the glossary.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit be6d0645c40431e8312244019c0331e4076bd5f2)

13 months agoMerge pull request #57959 from zdover23/wip-doc-2024-06-11-backport-57957-to-reef
Anthony D'Atri [Tue, 11 Jun 2024 01:39:41 +0000 (21:39 -0400)]
Merge pull request #57959 from zdover23/wip-doc-2024-06-11-backport-57957-to-reef

reef: doc/rados: improve leader/peon monitor explanation

13 months agodoc/rados: improve leader/peon monitor explanation 57959/head
Zac Dover [Mon, 10 Jun 2024 22:00:00 +0000 (08:00 +1000)]
doc/rados: improve leader/peon monitor explanation

Add an explanation of leader-peon conditions that obtain when the
cluster is in the "HEALTH_OK" state. Previously, the text discussed
these two monitor states only in the context of a health detail entry.

This improvement to the documentation was suggested on the [ceph-users]
email list by Joel Davidow. This email, an absolute model of user
engagement with an upstream project, can be reviewed here: https://lists.ceph.io/hyperkitty/list/ceph-users@ceph.io/message/KF67F5TXFSSTPXV7EKL6JKLA5KZQDLDQ/

I will list Joel Davidow here as the co-author for the sake of more
expediently getting this change into the documentation, but though he is
listed as the co-author, he is the true author.

Co-authored-by: Joel Davidow <jdavidow@nso.edu>
Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 6fb9a5ef817eda5184d51ebcb425a6091ca82299)

13 months agoMerge pull request #57949 from zdover23/wip-doc-2024-06-10-backport-57947-to-reef
Zac Dover [Sun, 9 Jun 2024 22:09:43 +0000 (08:09 +1000)]
Merge pull request #57949 from zdover23/wip-doc-2024-06-10-backport-57947-to-reef

reef: doc/start: remove "intro.rst"

Reviewed-by: Cole Mitchell <cole.mitchell.ceph@gmail.com>
13 months agodoc/start: remove "intro.rst" 57949/head
Zac Dover [Sun, 9 Jun 2024 18:55:13 +0000 (04:55 +1000)]
doc/start: remove "intro.rst"

Remove "start/intro.rst", which has been renamed "start/index.rst" in
order to follow the conventions followed elsewhere in the documentation.

Follows https://github.com/ceph/ceph/pull/57900.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 74cc624d002e51769da37c04b3bdc32e0077d370)

13 months agoMerge pull request #57937 from zdover23/wip-doc-2024-06-08-backport-57923-to-reef
Zac Dover [Sun, 9 Jun 2024 18:22:45 +0000 (04:22 +1000)]
Merge pull request #57937 from zdover23/wip-doc-2024-06-08-backport-57923-to-reef

reef: doc/src: add inline literals (``) to variables

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
13 months agoMerge pull request #57942 from zdover23/wip-doc-2024-06-09-backport-57939-to-reef
Anthony D'Atri [Sun, 9 Jun 2024 02:36:04 +0000 (22:36 -0400)]
Merge pull request #57942 from zdover23/wip-doc-2024-06-09-backport-57939-to-reef

reef: doc/glossary.rst: add "OpenStack Swift" and "Swift"

13 months agodoc/glossary.rst: add "OpenStack Swift" and "Swift" 57942/head
Zac Dover [Sat, 8 Jun 2024 20:24:43 +0000 (06:24 +1000)]
doc/glossary.rst: add "OpenStack Swift" and "Swift"

Add "OpenStack Swift" and "Swift" entries to the glossary.

Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit b2d413ee9db1d68392f29db148a7bc2e87a7b419)

13 months agodoc/src: add inline literals (``) to variables 57937/head
Zac Dover [Fri, 7 Jun 2024 01:24:00 +0000 (11:24 +1000)]
doc/src: add inline literals (``) to variables

Add inline-literal RST markup (``) (double backticks) around variable
names in src/common/options/global.yaml.in.

This follows https://github.com/ceph/ceph/pull/57887.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 6cdf9439c777ecac8915189a8e38a718f5013f7a)

13 months agoMerge pull request #57884 from zdover23/wip-doc-2024-06-05-backport-57868-to-reef
Zac Dover [Thu, 6 Jun 2024 23:39:46 +0000 (09:39 +1000)]
Merge pull request #57884 from zdover23/wip-doc-2024-06-05-backport-57868-to-reef

reef: doc: correct typo

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
13 months agoMerge pull request #57916 from zdover23/wip-doc-2024-06-07-backport-57887-to-reef
Zac Dover [Thu, 6 Jun 2024 20:13:28 +0000 (06:13 +1000)]
Merge pull request #57916 from zdover23/wip-doc-2024-06-07-backport-57887-to-reef

reef: doc/rados: add options to network config ref

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
13 months agoMerge pull request #57914 from zdover23/wip-doc-2024-06-07-backport-57886-to-reef
Ilya Dryomov [Thu, 6 Jun 2024 17:42:06 +0000 (19:42 +0200)]
Merge pull request #57914 from zdover23/wip-doc-2024-06-07-backport-57886-to-reef

reef: doc/dev: origin of Labeled Perf Counters

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
13 months agodoc/rados: add options to network config ref 57916/head
Zac Dover [Wed, 5 Jun 2024 04:30:01 +0000 (14:30 +1000)]
doc/rados: add options to network config ref

Add the following options to
doc/rados/configuration/network-config-ref.rst:

- public_network_interface
- cluster_network_interface

These additions were made in response to a request from Blaine Gardner.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 33bc1a0241cf29d0f1d12aa0a54c6cda5a469adc)

13 months agodoc/dev: origin of Labeled Perf Counters 57914/head
Zac Dover [Wed, 5 Jun 2024 01:18:46 +0000 (11:18 +1000)]
doc/dev: origin of Labeled Perf Counters

Note that Labeled Perf Counters were introduced in Reef.

Fixes: https://github.com/ceph/ceph/pull/57753#discussion_r1626483732
Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 36e09fb6894dcec56224d483d36a7315b8d19d60)

13 months agoMerge pull request #57794 from rzarzynski/wip-bug-61948-reef-backport
Laura Flores [Wed, 5 Jun 2024 20:51:54 +0000 (15:51 -0500)]
Merge pull request #57794 from rzarzynski/wip-bug-61948-reef-backport

reef: mon, osd, *: expose upmap-primary in OSDMap::get_features()

13 months agoMerge pull request #57903 from zdover23/wip-doc-2024-06-06-backport-57900-to-reef
Zac Dover [Wed, 5 Jun 2024 20:45:52 +0000 (06:45 +1000)]
Merge pull request #57903 from zdover23/wip-doc-2024-06-06-backport-57900-to-reef

reef: doc/start: s/intro.rst/index.rst/

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
13 months agodoc/start: s/intro.rst/index.rst/ 57903/head
Zac Dover [Wed, 5 Jun 2024 16:43:15 +0000 (02:43 +1000)]
doc/start: s/intro.rst/index.rst/

Change the filename "doc/start/intro.rst" to "doc/start/index.rst" so
that Sphinx finds the root filename for the "/start" directory in the
default location.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 84ce2212e87a4b6b2416eeab7e8e1718ae3ce87b)

13 months agoMerge pull request #57871 from zdover23/wip-doc-2024-06-05-backport-57867-to-reef
Zac Dover [Wed, 5 Jun 2024 01:11:19 +0000 (11:11 +1000)]
Merge pull request #57871 from zdover23/wip-doc-2024-06-05-backport-57867-to-reef

reef: doc/start: s/http/https/ in links

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
13 months agodoc: correct typo 57884/head
Matthew Vernon [Tue, 4 Jun 2024 13:46:57 +0000 (14:46 +0100)]
doc: correct typo

Signed-off-by: Matthew Vernon <mvernon@wikimedia.org>
(cherry picked from commit 4769493887e9f99f990122135d7cab6caee27f71)

13 months agodoc/start: s/http/https/ in links 57871/head
Zac Dover [Tue, 4 Jun 2024 13:37:27 +0000 (23:37 +1000)]
doc/start: s/http/https/ in links

Replace "http" with "https" in doc/start/get-involved.rst.

This commit is, in a way, a repeat of
https://github.com/ceph/ceph/pull/57213/
(1c5383b91bd7dbfa9670c6485fcc5ff28b79f40d), which targeted the Reef
branch instead of the main branch. When this commit has been merged and
backported, I will close https://github.com/ceph/ceph/pull/57213/.

I am listing Casey Cain here as the co-author, but he is in fact the
true author of this change.

Co-authored-by: Casey Cain <ccain@linuxfoundation.org>
Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 922f23f0f30da856a661376527f413dc9424382d)

13 months agoMerge pull request #57637 from rhcs-dashboard/wip-66175-reef
Avan [Tue, 4 Jun 2024 09:40:56 +0000 (15:10 +0530)]
Merge pull request #57637 from rhcs-dashboard/wip-66175-reef

reef: mgr/dashboard: add absolute path validation for pseudo path of nfs export

Reviewed-by: afreen23 <NOT@FOUND>
13 months agoMerge pull request #57851 from zdover23/wip-doc-2024-06-04-backport-57824-to-reef
Anthony D'Atri [Mon, 3 Jun 2024 19:58:18 +0000 (15:58 -0400)]
Merge pull request #57851 from zdover23/wip-doc-2024-06-04-backport-57824-to-reef

reef: doc/rados: add stop monitor command

13 months agodoc/rados: add stop monitor command 57851/head
Zac Dover [Mon, 3 Jun 2024 00:23:43 +0000 (10:23 +1000)]
doc/rados: add stop monitor command

Add the command for stopping a monitor to the procedure that explains
how to inject a monmap into a monitor.

Zac of the future: cf. 05 Aug 2023.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit c032188d66539a78ab0e4de2a5f5fc4329927bf6)

13 months agoMerge pull request #57845 from zdover23/wip-doc-2024-06-04-backport-57839-to-reef
Anthony D'Atri [Mon, 3 Jun 2024 15:54:45 +0000 (11:54 -0400)]
Merge pull request #57845 from zdover23/wip-doc-2024-06-04-backport-57839-to-reef

reef: doc/start: Edit Beginner's Guide

13 months agodoc/start: Edit Beginner's Guide 57845/head
Zac Dover [Mon, 3 Jun 2024 13:43:47 +0000 (23:43 +1000)]
doc/start: Edit Beginner's Guide

Make some improvements to the basic text of the Beginner's Guide.

Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit f484a156bed655909617f4e351b692d7a23d0e87)

13 months agoMerge pull request #57306 from idryomov/wip-65487-reef
Ilya Dryomov [Mon, 3 Jun 2024 14:02:12 +0000 (16:02 +0200)]
Merge pull request #57306 from idryomov/wip-65487-reef

reef: rbd-mirror: clean up stale pool replayers and callouts better

Reviewed-by: Ramana Raja <rraja@redhat.com>
13 months agoMerge pull request #57309 from idryomov/wip-65624-reef
Yuri Weinstein [Mon, 3 Jun 2024 13:56:08 +0000 (06:56 -0700)]
Merge pull request #57309 from idryomov/wip-65624-reef

reef: pybind/rbd: expose CLONE_FORMAT and FLATTEN image options

Reviewed-by: Ramana Raja <rraja@redhat.com>
13 months agoMerge pull request #57091 from idryomov/wip-65573-reef
Yuri Weinstein [Mon, 3 Jun 2024 13:55:32 +0000 (06:55 -0700)]
Merge pull request #57091 from idryomov/wip-65573-reef

reef: librbd: make group and group snapshot IDs more random

Reviewed-by: Mykola Golub <mgolub@suse.com>
13 months agoMerge pull request #57030 from idryomov/wip-65481-reef
Yuri Weinstein [Mon, 3 Jun 2024 13:54:40 +0000 (06:54 -0700)]
Merge pull request #57030 from idryomov/wip-65481-reef

reef: qa: fix krbd_msgr_segments and krbd_rxbounce failing on 8.stream

Reviewed-by: Ramana Raja <rraja@redhat.com>
13 months agoqa/cephfs: update filesystem.Filesystem.rank_fail()
Rishabh Dave [Fri, 12 Apr 2024 11:04:04 +0000 (16:34 +0530)]
qa/cephfs: update filesystem.Filesystem.rank_fail()

Since the command "ceph mds fail" now may require confirmation flag
("--yes-i-really-mean-it"), update this method to allow/disallow adding
this flag to the command arguments.

Signed-off-by: Rishabh Dave <ridave@redhat.com>
(cherry picked from commit 4f333e1d06f9e1de903dd3668ebe4754c00b4ac2)

13 months agoPendingReleaseNotes: note need of confirmation for "ceph mds fail"
Rishabh Dave [Fri, 19 Apr 2024 11:32:29 +0000 (17:02 +0530)]
PendingReleaseNotes: note need of confirmation for "ceph mds fail"

Signed-off-by: Rishabh Dave <ridave@redhat.com>
(cherry picked from commit f241a3c5d0c463679f746fcf241a4c06647b39f2)

13 months agodoc/cephfs: mention need of confirmation for "ceph mds fail"
Rishabh Dave [Fri, 19 Apr 2024 11:28:30 +0000 (16:58 +0530)]
doc/cephfs: mention need of confirmation for "ceph mds fail"

Update docs since command "ceph mds fail" will now fail if MDS has either
health warning MDS_TRIM or MDS_CACHE_OVERSIZED and if confirmation flag
is not passed.

Signed-off-by: Rishabh Dave <ridave@redhat.com>
(cherry picked from commit dea22203749bc87a209289f5aed1e78b503ab440)

13 months agocephfs,mon: require confirmation to fail unhealthy MDS
Rishabh Dave [Fri, 8 Mar 2024 15:39:18 +0000 (21:09 +0530)]
cephfs,mon: require confirmation to fail unhealthy MDS

When running the command "ceph mds fail" for an MDS that is unhealthy
due to, MDS_CACHE_OVERSIZED or MDS_TRIM, user must pass confirmation
flag. Else, the command will fail and print an appropriate error
message.

Restarting an MDS with such health warnings is not recommended since it
will have a slow reocvery during restart which will create new problems.

Fixes: https://tracker.ceph.com/issues/61866
Signed-off-by: Rishabh Dave <ridave@redhat.com>
(cherry picked from commit eeda00eea5043d3ba806695a207b732cb53b35c4)

13 months agoqa/cephfs: test that counters are not printed for SR MDS 57834/head
Rishabh Dave [Tue, 26 Mar 2024 06:04:23 +0000 (11:34 +0530)]
qa/cephfs: test that counters are not printed for SR MDS

- Add tests to verify that inode and stray counters are not
  replayed/included in the health warnings printed for the
  standby-replay MDS.

- Add "MDS_CACHE_OVERSIZED" health warning to ignorelist to
  failover.yaml.

- Add a helper method to qa.tasks.cephfs.filesystem.Filesystem to get
  MDS name of standby-replay MDS.

Signed-off-by: Rishabh Dave <ridave@redhat.com>
(cherry picked from commit 2784e224e7af38d5b96c573df7dfb373de53937b)

13 months agomds: add no counters in warning for standby-replay MDS
Rishabh Dave [Fri, 8 Mar 2024 15:31:51 +0000 (21:01 +0530)]
mds: add no counters in warning for standby-replay MDS

Don't include inode and stray counters in the health warnings printed
for standby-replay MDSs. Since these counters are present in the health
warnings only due to replay, it can confuse users, and therefore, do not
include them.

Fixes: https://tracker.ceph.com/issues/63514
Signed-off-by: Rishabh Dave <ridave@redhat.com>
(cherry picked from commit 03dcdc1329e471aa4aa403519ea5131db2f99b23)

13 months agoqa/workunits/rbd: wait for replaying status in bootstrap tests 57306/head
Ilya Dryomov [Mon, 6 May 2024 06:16:01 +0000 (08:16 +0200)]
qa/workunits/rbd: wait for replaying status in bootstrap tests

wait_for_replay_complete() doesn't wait for image status to get
updated.  This didn't matter previously because these tests are run on
two different pools and nothing else was following.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit d1d848276f0a33b20482321088c8c7470a84d0b3)

Conflicts:
qa/workunits/rbd/rbd_mirror_bootstrap.sh [ commit 3fd8a0388735
  ("qa/workunits/rbd: merge journal and snapshot test scripts")
  not in reef ]

13 months agoMerge pull request #57822 from zdover23/wip-doc-2024-06-02-backport-57820-to-reef
Zac Dover [Sun, 2 Jun 2024 23:27:28 +0000 (09:27 +1000)]
Merge pull request #57822 from zdover23/wip-doc-2024-06-02-backport-57820-to-reef

reef: doc/start: Add Beginner's Guide

Reviewed-by: Cole Mitchell <cole.mitchell.ceph@gmail.com>
Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
13 months agodoc/start: Add Beginner's Guide 57822/head
Zac Dover [Sun, 2 Jun 2024 11:06:25 +0000 (21:06 +1000)]
doc/start: Add Beginner's Guide

Add a Beginner's Guide to docs.ceph.com.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 91aafc6a7f72c105fbf3aa8419863e931d5b9e00)

13 months agoMerge pull request #57815 from zdover23/wip-doc-2024-06-01-backport-57804-to-reef
Anthony D'Atri [Sun, 2 Jun 2024 00:02:15 +0000 (20:02 -0400)]
Merge pull request #57815 from zdover23/wip-doc-2024-06-01-backport-57804-to-reef

reef: doc/cephfs: edit vstart warning text

13 months agodoc/cephfs: edit vstart warning text 57815/head
Zac Dover [Fri, 31 May 2024 12:19:59 +0000 (22:19 +1000)]
doc/cephfs: edit vstart warning text

Improve the English in the vstart warning in doc/cephfs/mantle.rst.

Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 9ce7f9bd6c006ede6e1d563f4273376e2dbc1d03)

13 months agoMerge pull request #57792 from zdover23/wip-doc-2024-05-30-backport-57790-to-reef
Anthony D'Atri [Fri, 31 May 2024 01:24:11 +0000 (21:24 -0400)]
Merge pull request #57792 from zdover23/wip-doc-2024-05-30-backport-57790-to-reef

reef: doc/cephfs: edit front matter in mantle.rst

13 months agomon: validate SERVER_REEF on set-require-min-compat-client 57794/head
Radoslaw Zarzynski [Thu, 30 May 2024 08:44:54 +0000 (08:44 +0000)]
mon: validate SERVER_REEF on set-require-min-compat-client

Unit testing
-------------
```
[rzarzynski@o06 build]$ bin/unittest_features
...
[ RUN      ] features.release_features
1 argonaut features 0x40000 looks like argonaut
2 bobtail features 0x40000 looks like argonaut
3 cuttlefish features 0x40000 looks like argonaut
4 dumpling features 0x42040000 looks like dumpling
5 emperor features 0x42040000 looks like dumpling
6 firefly features 0x20842040000 looks like firefly
7 giant features 0x20842040000 looks like firefly
8 hammer features 0x1020842040000 looks like hammer
9 infernalis features 0x1020842040000 looks like hammer
10 jewel features 0x401020842040000 looks like jewel
11 kraken features 0xc01020842040000 looks like kraken
12 luminous features 0xe01020842240000 looks like luminous
13 mimic features 0xe01020842240000 looks like luminous
14 nautilus features 0xe01020842240000 looks like luminous
15 octopus features 0xe01020842240000 looks like luminous
16 pacific features 0xe01020842240000 looks like luminous
17 quincy features 0xe01020842240000 looks like luminous
18 reef features 0xe010208d2240000 looks like reef
19 squid features 0xe010208d2240000 looks like reef
[       OK ] features.release_features (0 ms)
```

Manual testing
--------------
\### 'quincy` client connected to `main` cluster
There was `ceph -w` from `quincy` running in the background.

```
[rzarzynski@o06 build]$ bin/ceph osd set-require-min-compat-client reef
Error EPERM: cannot set require_min_compat_client to reef: 1 connected client(s) look like luminous (missing 0x80000000); add --yes-i-really-mean-it to do it anyway
```

\### Only `main` clients connected to `main` cluster
```
[rzarzynski@o06 build]$ bin/ceph osd get-require-min-compat-client
luminous
[rzarzynski@o06 build]$ bin/ceph daemon mon.a sessions | jq  -jr '.[] | .name, "\t", .con_features, "\t", .con_features_hex, "\n"' | grep client
client.?        4540701547738038271     3f03cffffffdffff
client.?        4540701547738038271     3f03cffffffdffff
[rzarzynski@o06 build]$ bin/ceph osd set-require-min-compat-client reef
set require_min_compat_client to reef
```

Fixes: https://tracker.ceph.com/issues/61948
Signed-off-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
(cherry picked from commit 4d74ff6327ebdc7deb44737cac08125204b2fec6)

13 months agomon, osd, *: expose upmap-primary in OSDMap::get_features()
Radoslaw Zarzynski [Mon, 27 May 2024 16:58:07 +0000 (16:58 +0000)]
mon, osd, *: expose upmap-primary in OSDMap::get_features()

This is a minimal fix to ensure only peers understanding
`pg-upmap-primary` are able to connect, and thus to exclude
the possibility of running into the `pg_upmap_primaries.empty()`
assertion in encoders.

Fixes for other problems will follow up.

The intention is to ship this patch in the very next minor
release of reef.

Manual testing
--------------

\### start using upmap-primar is presence of `quincy` client
NOTE: incompatible clients aren't disconnected but this is
known and expected as we lack the machinery.

```
[rzarzynski@o06 build]$ bin/ceph osd get-require-min-compat-client
reef
[rzarzynski@o06 build]$ bin/ceph daemon mon.a sessions | jq  -jr '.[] | .name, "\t", .con_features, "\t", .con_features_hex, "\n"' | grep client
client.?        4540701547738038271     3f03cffffffdffff
client.?        4540138320759226367     3f01cfbf7ffdffff
[rzarzynski@o06 build]$ bin/ceph osd pool create test_pool 1 1
pool 'test_pool' created
[rzarzynski@o06 build]$ bin/ceph osd pg-upmap-primary 1.0 2
change primary for pg 1.0 to osd.2
[rzarzynski@o06 build]$ bin/ceph daemon mon.a sessions | jq  -jr '.[] | .name, "\t", .con_features, "\t", .con_features_hex, "\n"' | grep client
client.?        4540701547738038271     3f03cffffffdffff
client.?        4540138320759226367     3f01cfbf7ffdffff
```

\### `main` client is still able to connect
```
[rzarzynski@o06 build]$ bin/ceph -w
  cluster:
    id:     d570a7cd-84ca-4fd0-aafb-80138762c6af
    health: HEALTH_WARN
            11 mgr modules have failed dependencies
            1 pool(s) do not have an application enabled

  services:
    mon: 1 daemons, quorum a (age 64m)
    mgr: x(active, since 64m)
    osd: 3 osds: 3 up (since 64m), 3 in (since 64m)

  data:
    pools:   1 pools, 1 pgs
    objects: 0 objects, 0 B
    usage:   3.0 GiB used, 300 GiB / 303 GiB avail
    pgs:     1 active+clean
```

\### `quincy` client is refused
```
[rzarzynski@o06 build-quincy]$ bin/ceph -s -c /home/rzarzynski/ceph2/build/ceph.conf
2024-05-30T08:59:42.411+0000 7f0911a9b700 -1 --2- 127.0.0.1:0/2812481872 >> [v2:127.0.0.1:40536/0,v1:127.0.0.1:40537/0] conn(0x7f090c111500 0x7f090c1118f0 secure :-1 s=SESSION_CONNECTING pgs=0 cs=0 l=0 rev1=1 crypto rx=0x7f08fc0048c0 tx=0x7f08fc009e30 comp rx=0 tx=0).handle_ident_missing_features client does not support all server features: 80000000
2024-05-30T08:59:42.612+0000 7f0911a9b700  0 --2- 127.0.0.1:0/2812481872 >> [v2:127.0.0.1:40536/0,v1:127.0.0.1:40537/0] conn(0x7f090c111500 0x7f090c1118f0 unknown :-1 s=AUTH_CONNECTING pgs=0 cs=0 l=0 rev1=1 crypto rx=0 tx=0 comp rx=0 tx=0).send_auth_request get_initial_auth_request returned -2
```

\### stop using upmap-primary
```
[rzarzynski@o06 build]$ bin/ceph osd rm-pg-upmap-primary 1.0
clear 1.0 pg_upmap_primary mapping
```

\### `quincy` client may connect again
```
[rzarzynski@o06 build-quincy]$ bin/ceph -s -c /home/rzarzynski/ceph2/build/ceph.conf
  cluster:
    id:     d570a7cd-84ca-4fd0-aafb-80138762c6af
    health: HEALTH_WARN
            11 mgr modules have failed dependencies
            1 pool(s) do not have an application enabled

  services:
    mon: 1 daemons, quorum a (age 77m)
    mgr: x(active, since 77m)
    osd: 3 osds: 3 up (since 76m), 3 in (since 76m)

  data:
    pools:   1 pools, 1 pgs
    objects: 0 objects, 0 B
    usage:   3.0 GiB used, 300 GiB / 303 GiB avail
    pgs:     1 active+clean

```

Fixes: https://tracker.ceph.com/issues/61948
Signed-off-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
(cherry picked from commit 5dbb7c4b12707b9c6d1334055cdd9dd820ffc1a6)

Conflicts:
src/osd/OSDMap.cc -- reef lacks MSR

13 months agodoc/cephfs: edit front matter in mantle.rst 57792/head
Zac Dover [Thu, 30 May 2024 13:28:56 +0000 (23:28 +1000)]
doc/cephfs: edit front matter in mantle.rst

Improve the structure and grammar of the front matter in the
doc/cephfs/mantle.rst file.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 559d4849ecd6f93b5812f3d8d0448115c5b5beab)

13 months agoMerge pull request #57658 from rhcs-dashboard/wip-65995-reef
Avan [Thu, 30 May 2024 10:01:02 +0000 (15:31 +0530)]
Merge pull request #57658 from rhcs-dashboard/wip-65995-reef

reef: exporter: fix regex for rgw sync metrics

Reviewed-by: Juan Miguel Olmo <jolmomar@redhat.com>
Reviewed-by: Pere Diaz Bou <pdiazbou@redhat.com>
13 months agoMerge pull request #57758 from zdover23/wip-doc-2024-05-29-backport-57753-to-reef
Zac Dover [Wed, 29 May 2024 13:50:00 +0000 (23:50 +1000)]
Merge pull request #57758 from zdover23/wip-doc-2024-05-29-backport-57753-to-reef

reef: doc/dev: add note about intro of perf counters

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
13 months agopybind/mgr/mirroring: Fix KeyError: 'directory_count' in daemon status 57763/head
Jos Collin [Mon, 6 May 2024 12:47:29 +0000 (18:17 +0530)]
pybind/mgr/mirroring: Fix KeyError: 'directory_count' in daemon status

The directory_count key is missing in self.mgr.get_daemon_status() output json,
intermittently when there is a delay caused by m_listener.handle_mirroring_enabled() to update the
directory_count, which results in ServiceDaemon::update_status() creates a json with out 'directory_count' key/value.
But the mgr/mirroring -> daemon_status() always expects the 'directory_count' key to be present in the json returned by
self.mgr.get_daemon_status().

This issue occurs intermittently when we enable/disable mirroring and check the 'daemon status' in between.
This patch fixes this issue by setting a default value 0 for 'directory_count' in doemon_status().

Fixes: https://tracker.ceph.com/issues/65795
Signed-off-by: Jos Collin <jcollin@redhat.com>
(cherry picked from commit b78baa23e562742b8bdc5a75f82e3b6fbf55a8a5)