]> git.apps.os.sepia.ceph.com Git - ceph-ci.git/log
ceph-ci.git
15 months agoMerge pull request #58110 from rhcs-dashboard/bring-back-cluster-swticher
Nizamudeen A [Tue, 25 Jun 2024 08:27:42 +0000 (13:57 +0530)]
Merge pull request #58110 from rhcs-dashboard/bring-back-cluster-swticher

mgr/dashboard: fix (multi) cluster switcher disappearance after carbon UI shell

Reviewed-by: Aashish Sharma <aasharma@redhat.com>
Reviewed-by: Ankush Behl <cloudbehl@gmail.com>
Reviewed-by: ivoalmeida <NOT@FOUND>
15 months agoMerge pull request #57930 from rhcs-dashboard/get-replication-info
Nizamudeen A [Tue, 25 Jun 2024 07:43:05 +0000 (13:13 +0530)]
Merge pull request #57930 from rhcs-dashboard/get-replication-info

mgr/dashboard: get and update bucket replication policies

Reviewed-by: Ankush Behl <cloudbehl@gmail.com>
15 months agoMerge PR #56052 into main
Venky Shankar [Tue, 25 Jun 2024 07:10:25 +0000 (12:40 +0530)]
Merge PR #56052 into main

* refs/pull/56052/head:
qa/suites: ignore unresponsive client when the test passes
qa: enhance per-client labelled perf counters test

Reviewed-by: Venky Shankar <vshankar@redhat.com>
15 months agoMerge PR #56429 into main
Venky Shankar [Tue, 25 Jun 2024 07:09:33 +0000 (12:39 +0530)]
Merge PR #56429 into main

* refs/pull/56429/head:
mds: fix rank root doesn't insert root ino into its subtree map when starting
mds: flush mds log before finishing STATE_STARTING
mds/FSMap: go back to STARTING state when rank doesn't make it pass STARTING

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
15 months agoMerge PR #57034 into main
Venky Shankar [Tue, 25 Jun 2024 07:07:56 +0000 (12:37 +0530)]
Merge PR #57034 into main

* refs/pull/57034/head:
qa: cleanup snapshots before subvolume delete

Reviewed-by: Venky Shankar <vshankar@redhat.com>
15 months agoMerge PR #57085 into main
Venky Shankar [Tue, 25 Jun 2024 07:06:29 +0000 (12:36 +0530)]
Merge PR #57085 into main

* refs/pull/57085/head:
mds: set the correct WRLOCK flag always in wrlock_force()

Reviewed-by: Venky Shankar <vshankar@redhat.com>
15 months agoMerge PR #57576 into main
Venky Shankar [Tue, 25 Jun 2024 07:04:22 +0000 (12:34 +0530)]
Merge PR #57576 into main

* refs/pull/57576/head:
src/test: add testcase file for low level sync io code paths

Reviewed-by: Venky Shankar <vshankar@redhat.com>
15 months agomgr/dashboard: fix (multi) cluster switcher disappearance after carbon UI
Nizamudeen A [Tue, 18 Jun 2024 07:44:02 +0000 (13:14 +0530)]
mgr/dashboard: fix (multi) cluster switcher disappearance after carbon UI

Fixes: https://tracker.ceph.com/issues/66538
Signed-off-by: Nizamudeen A <nia@redhat.com>
15 months agoMerge pull request #57001 from rhcs-dashboard/cheroot-upgrade
Nizamudeen A [Tue, 25 Jun 2024 05:11:16 +0000 (10:41 +0530)]
Merge pull request #57001 from rhcs-dashboard/cheroot-upgrade

mgr/dashboard: bump cheroot to > 10.0

Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: afreen23 <NOT@FOUND>
16 months agoMerge pull request #57977 from athanatos/sjust/wip-66308-obc-locking
Samuel Just [Mon, 24 Jun 2024 20:05:44 +0000 (13:05 -0700)]
Merge pull request #57977 from athanatos/sjust/wip-66308-obc-locking

crimson: simplify obc loading by locking excl for load and demoting to needed lock

Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
Reviewed-by: Matan Breizman <mbreizma@redhat.com>
16 months agoMerge pull request #58142 from adk3798/rgw-zone-create-error-handling
Adam King [Mon, 24 Jun 2024 12:38:34 +0000 (08:38 -0400)]
Merge pull request #58142 from adk3798/rgw-zone-create-error-handling

mgr/rgw: fix error handling in rgw zone create

Reviewed-by: John Mulligan <jmulligan@redhat.com>
16 months agoMerge pull request #57339 from phlogistonjohn/jjm-smb-login-control
Adam King [Mon, 24 Jun 2024 12:35:29 +0000 (08:35 -0400)]
Merge pull request #57339 from phlogistonjohn/jjm-smb-login-control

smb: add login control access parameters to share resource

Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: Avan Thakkar <athakkar@redhat.com>
16 months agoMerge pull request #57927 from rhcs-dashboard/expand-cluster-improvements
Nizamudeen A [Mon, 24 Jun 2024 10:01:09 +0000 (15:31 +0530)]
Merge pull request #57927 from rhcs-dashboard/expand-cluster-improvements

mgr/dashboard: Expand Cluster improvements

Reviewed-by: Ankush Behl <cloudbehl@gmail.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>
16 months agoMerge pull request #56658 from afreen23/wip-nfs-route
Nizamudeen A [Mon, 24 Jun 2024 09:35:09 +0000 (15:05 +0530)]
Merge pull request #56658 from afreen23/wip-nfs-route

mgr/dashboard: Fix NFS routing

Reviewed-by: Ankush Behl <cloudbehl@gmail.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>
16 months agocrimson/.../object_context_loader: add missing dpp argument to debug statement
Samuel Just [Fri, 21 Jun 2024 01:54:58 +0000 (18:54 -0700)]
crimson/.../object_context_loader: add missing dpp argument to debug statement

Signed-off-by: Samuel Just <sjust@redhat.com>
16 months agocrimson/.../tri_mutex: use hobject_t for tri_mutex::name debug value
Samuel Just [Fri, 21 Jun 2024 00:48:04 +0000 (17:48 -0700)]
crimson/.../tri_mutex: use hobject_t for tri_mutex::name debug value

Useful for distinguishing between tri_mutex instances on different clones.

Signed-off-by: Samuel Just <sjust@redhat.com>
16 months agocrimson/.../tri_mutex: no need to store waiter_name on waiter_t
Samuel Just [Fri, 21 Jun 2024 00:46:11 +0000 (17:46 -0700)]
crimson/.../tri_mutex: no need to store waiter_name on waiter_t

It's going to be the same as tri_mutex::name for any waiter on
a tri_mutex.

Signed-off-by: Samuel Just <sjust@redhat.com>
16 months agocrimson/.../object_context: drop recovery_read_marker
Samuel Just [Wed, 19 Jun 2024 04:10:34 +0000 (21:10 -0700)]
crimson/.../object_context: drop recovery_read_marker

This doesn't seem to serve a purpose with current crimson.  classic
uses ObjectState::recovery_read_marker to indicate that backfill
should be requeued upon wakeup, but that hasn't been necessary so
far in crimson.  We can reintroduce this if it becomes useful.

Signed-off-by: Samuel Just <sjust@redhat.com>
16 months agocrimson/.../object_context: drop unused maybe_get_excl
Samuel Just [Wed, 19 Jun 2024 04:09:04 +0000 (21:09 -0700)]
crimson/.../object_context: drop unused maybe_get_excl

Signed-off-by: Samuel Just <sjust@redhat.com>
16 months agocrimson/.../object_context: remove loading_mutex
Samuel Just [Mon, 10 Jun 2024 21:15:16 +0000 (21:15 +0000)]
crimson/.../object_context: remove loading_mutex

Now that we acquire and demote an exclusive lock during loading without
blocking, a seperate loading_mutex shouldn't be necessary.

Signed-off-by: Samuel Just <sjust@redhat.com>
16 months agocrimson/.../tri_mutex: lock() methods return normal future
Samuel Just [Thu, 13 Jun 2024 00:47:08 +0000 (00:47 +0000)]
crimson/.../tri_mutex: lock() methods return normal future

f63d76a2 modified the lock() variants on tri_mutex so that the obc
loading pathway wouldn't invoke .then() on returned future known
statically to be ready.  Now that the loading pathway uses demotion
mechanisms that cannot block and do not return futures, we no longer
have any users like that and can drop the extra std::nullopt
possibility.

In a larger sense, if lock() *can* return a non-ready future in a
particular pathway, there's no semantic difference between returning
std::optional<future<>> and future<> as the caller would still have to
deal with a possible non-ready future return even if std::nullopt is
also possible.  If the pathway can be demonstrated statically to be
non-blocking, as with the obc loading mechanism, we really want to use a
mechanism that obviously cannot block rather relying on a mechanism with
a return signature of std::optional<future<>> to return std::nullopt.

Signed-off-by: Samuel Just <sjust@redhat.com>
16 months agoMerge PR #58125 into main
Patrick Donnelly [Sun, 23 Jun 2024 18:32:41 +0000 (14:32 -0400)]
Merge PR #58125 into main

* refs/pull/58125/head:
common/TrackedOp: do not count the ops marked as nowarn

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Venky Shankar <vshankar@redhat.com>
16 months agoMerge PR #58121 into main
Patrick Donnelly [Sun, 23 Jun 2024 18:32:00 +0000 (14:32 -0400)]
Merge PR #58121 into main

* refs/pull/58121/head:
doc: add documentation for `ceph auth rotate`
PendingReleaseNotes: add note for new `auth rotate`
qa: test `auth rotate`
mon/AuthMonitor: add `ceph auth rotate` command

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
16 months agoMerge PR #57993 into main
Patrick Donnelly [Sun, 23 Jun 2024 18:31:28 +0000 (14:31 -0400)]
Merge PR #57993 into main

* refs/pull/57993/head:
mds: initialize epoch for quiescedb

Reviewed-by: Leonid Usov <leonid.usov@ibm.com>
16 months agoMerge PR #57754 into main
Patrick Donnelly [Sun, 23 Jun 2024 18:29:53 +0000 (14:29 -0400)]
Merge PR #57754 into main

* refs/pull/57754/head:
mds: set the proper extra bl for the create request
mds: encode the correct extra info depending on the feature bits
mds: add set_reply_extra_bl() helper support
mds: cleanup the code to make it to be more readable

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Greg Farnum <gfarnum@redhat.com>
16 months agoMerge PR #57084 into main
Patrick Donnelly [Sun, 23 Jun 2024 18:29:13 +0000 (14:29 -0400)]
Merge PR #57084 into main

* refs/pull/57084/head:
mds: drop client metrics during recovery

Reviewed-by: Rishabh Dave <ridave@redhat.com>
Reviewed-by: Xiubo Li <xiubli@redhat.com>
Reviewed-by: Leonid Usov <leonid.usov@ibm.com>
16 months agoMerge PR #56828 into main
Patrick Donnelly [Sun, 23 Jun 2024 18:28:38 +0000 (14:28 -0400)]
Merge PR #56828 into main

* refs/pull/56828/head:
mds: do remove the cap when seqs equal or larger than last issue

Reviewed-by: Leonid Usov <leonid.usov@ibm.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
16 months agoMerge pull request #58200 from zdover23/wip-doc-2024-06-22-start-begin-links
Zac Dover [Sun, 23 Jun 2024 08:15:26 +0000 (18:15 +1000)]
Merge pull request #58200 from zdover23/wip-doc-2024-06-22-start-begin-links

doc/start: add links to Beginner's Guide

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
16 months agodoc/start: add links to Beginner's Guide
Zac Dover [Sat, 22 Jun 2024 06:28:12 +0000 (16:28 +1000)]
doc/start: add links to Beginner's Guide

Add two links to Sage Weil's overview talks about Ceph.

Signed-off-by: Zac Dover <zac.dover@proton.me>
16 months agoMerge pull request #58099 from athanatos/sjust/wip-66461-obc-with-lock
Samuel Just [Sun, 23 Jun 2024 00:42:37 +0000 (17:42 -0700)]
Merge pull request #58099 from athanatos/sjust/wip-66461-obc-with-lock

crimson: fix ObjectContext::_with_lock to only unlock if lock is taken

Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
Reviewed-by: Matan Breizman <mbreizma@redhat.com>
16 months agoMerge pull request #57866 from rhcs-dashboard/fix-66348-main
Nizamudeen A [Sat, 22 Jun 2024 11:18:58 +0000 (16:48 +0530)]
Merge pull request #57866 from rhcs-dashboard/fix-66348-main

mgr/dashboard: Block -> Images -> Create form improvements

Reviewed-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>
Reviewed-by: afreen23 <NOT@FOUND>
Reviewed-by: Ankush Behl <cloudbehl@gmail.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>
Reviewed-by: Sunil Angadi <Sunil.Angadi@ibm.com>
16 months agoMerge pull request #58156 from zdover23/wip-doc-2024-06-20-rados-troubleshooting-mon
Zac Dover [Sat, 22 Jun 2024 06:21:20 +0000 (16:21 +1000)]
Merge pull request #58156 from zdover23/wip-doc-2024-06-20-rados-troubleshooting-mon

doc/rados: followup to PR#58057

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
16 months agocrimson: eliminate lock promotion from object_context and tri_mutex
Samuel Just [Mon, 10 Jun 2024 20:47:07 +0000 (20:47 +0000)]
crimson: eliminate lock promotion from object_context and tri_mutex

Since we now load obc's via ObjectContext::load_then_with_lock, we no
longer need to promote locks.  Eliminate support for now.

Signed-off-by: Samuel Just <sjust@redhat.com>
16 months agocrimson/.../object_context_loader: simplify obc loading
Samuel Just [Mon, 10 Jun 2024 20:40:43 +0000 (20:40 +0000)]
crimson/.../object_context_loader: simplify obc loading

Because we just constructed the obc, we know that we can get an
exclusive lock without blocking.  Introduce
ObjectContext::load_then_with_lock to take an exclusive lock
unconditionally, load, downgrade (which we also know must be safe), and
then run the passed function.

Signed-off-by: Samuel Just <sjust@redhat.com>
16 months agocrimson/.../object_context_loader: consolidate obc loading
Samuel Just [Sat, 8 Jun 2024 01:30:03 +0000 (01:30 +0000)]
crimson/.../object_context_loader: consolidate obc loading

Signed-off-by: Samuel Just <sjust@redhat.com>
16 months agocrimson/.../object_context_loader: remove impossible branch in get_or_load_obc
Samuel Just [Sat, 8 Jun 2024 00:31:53 +0000 (00:31 +0000)]
crimson/.../object_context_loader: remove impossible branch in get_or_load_obc

Signed-off-by: Samuel Just <sjust@redhat.com>
16 months agocrimson: fix ObjectContext::_with_lock to only unlock if lock is taken
Samuel Just [Thu, 13 Jun 2024 00:41:55 +0000 (00:41 +0000)]
crimson: fix ObjectContext::_with_lock to only unlock if lock is taken

Fixes: https://tracker.ceph.com/issues/66461
Signed-off-by: Samuel Just <sjust@redhat.com>
16 months agoMerge pull request #57974 from ljflores/wip-tracker-65189
Laura Flores [Fri, 21 Jun 2024 21:48:27 +0000 (16:48 -0500)]
Merge pull request #57974 from ljflores/wip-tracker-65189

Telemetry upgrade test fixes

16 months agoqa/suites/upgrade/telemetry-upgrade/quincy-x: update image reference
Laura Flores [Wed, 19 Jun 2024 23:01:31 +0000 (18:01 -0500)]
qa/suites/upgrade/telemetry-upgrade/quincy-x: update image reference

`quay.io/ceph/daemon-base:latest-quincy` points to the latest quincy tag (v17.2.7),
not the tip of quincy, which is actually what we want to test.

Pulling the daemon-base:latest-quincy image shows it is running v17.2.7:
```
lflores:ceph$ podman run --rm -it daemon-base:latest-quincy
[root@e4eb265e7fa0 /]# ceph -v
ceph version 17.2.7 (b12291d110049b2f35e32e0de30d70e9a4c060d2) quincy (stable)
```

It is important that we test the tip of quincy in case there is a fix there that
we want to pick up.

I referenced this commit to make the change: https://github.com/ceph/ceph/commit/d0381ed7e68a6de7c9ff1e4220698ed8eab28ac6

Signed-off-by: Laura Flores <lflores@ibm.com>
16 months agoqa/suites/upgrade/telemetry-upgrade/reef-x: update how cephadm is pulled and change...
Laura Flores [Wed, 19 Jun 2024 21:57:45 +0000 (16:57 -0500)]
qa/suites/upgrade/telemetry-upgrade/reef-x: update how cephadm is pulled and change image reference

Update how cephadm is pulled:

`cephadm_git_url` and `cephadm_branch` are used in releases older than reef
to install cephadm. Both of these keys are needed to install it from the github
repo.

However, in reef and on, the compiled zipapp cephadm needs to be pulled differently
than the old single python script `cephadm` from earlier releases.

This commit is what I referenced for this change: https://github.com/ceph/ceph/commit/967716f2e6c8cd3f7d810a5f8e5e49448717e513

-------

Change image reference:

`quay.io/ceph/daemon-base:latest-reef` points to the latest reef tag (v18.2.2),
not the tip of reef, which is actually what we want to test.

Pulling the daemon-base:latest-reef image shows it is running v18.2.2:
```
lflores:ceph$ podman run --rm -it daemon-base:latest-reef
[root@d4d9a0eb04be /]# ceph -v
ceph version 18.2.2 (531c0d11a1c5d39fbfe6aa8a521f023abf3bf3e2) reef (stable)
```

It is important that we test the tip of reef in case there is a fix there that we
want to pick up.

I referenced this commit to make the change: https://github.com/ceph/ceph/commit/d0381ed7e68a6de7c9ff1e4220698ed8eab28ac6

Signed-off-by: Laura Flores <lflores@ibm.com>
16 months agoqa/suites/upgrade/telemetry-upgrade: add more ignorelist items and require_osd_releas...
Laura Flores [Wed, 19 Jun 2024 21:07:31 +0000 (16:07 -0500)]
qa/suites/upgrade/telemetry-upgrade: add more ignorelist items and require_osd_release=squid

The warnings added to the ignorelist show up in the cluster log, but they are
expected during upgrades and should thus be ignored.

We also need to set require_osd_release=squid to avoid this warning:
```
cluster [WRN] Health check failed: all OSDs are running squid or later but require_osd_release < squid (OSD_UPGRADE_FINISHED)
```

Signed-off-by: Laura Flores <lflores@ibm.com>
16 months agoMerge pull request #58178 from ljflores/wip-tracker-66556
Radoslaw Zarzynski [Fri, 21 Jun 2024 14:41:12 +0000 (16:41 +0200)]
Merge pull request #58178 from ljflores/wip-tracker-66556

Revert "test: reproduce flap peering bug (wip)"

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
Reviewed-by: Dan van der Ster <daniel.vanderster@cern.ch>
16 months agoMerge pull request #58175 from batrick/label-scripts
Ilya Dryomov [Fri, 21 Jun 2024 13:21:40 +0000 (15:21 +0200)]
Merge pull request #58175 from batrick/label-scripts

.github/labeler: add script label to script changes

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
16 months agomgr/dashboard: edit replication policy
Nizamudeen A [Fri, 7 Jun 2024 13:22:19 +0000 (18:52 +0530)]
mgr/dashboard: edit replication policy

Fixes: https://tracker.ceph.com/issues/66240
Signed-off-by: Nizamudeen A <nia@redhat.com>
16 months agomgr/dashboard: get bucket replication info
Nizamudeen A [Fri, 7 Jun 2024 13:13:00 +0000 (18:43 +0530)]
mgr/dashboard: get  bucket replication info

- Show bucket replication status and replication policy in the details
view

Fixes: https://tracker.ceph.com/issues/66241
Signed-off-by: Nizamudeen A <nia@redhat.com>
16 months agomgr/dashboard: select default daemon based on the default zonegroup
Nizamudeen A [Fri, 7 Jun 2024 07:45:06 +0000 (13:15 +0530)]
mgr/dashboard: select default daemon based on the default zonegroup

if multisite is configured, the default daemon needs to be selected
based on the default zonegroup. Otherwise dashboard gives you incorrect
details when doing the period commit

The issue occurs when you do a period update --commit and you reload one
of the block page, the api assigns the zonegroup of the second gateway
because for a moment, the first gateway reflects the period changes...
This is not true because the default zonegroup is of the previous active
gateway but even though the back-end correctly says the active
zonegroup, the dashboard api says it wrongly.

Fixes: https://tracker.ceph.com/issues/66394
Signed-off-by: Nizamudeen A <nia@redhat.com>
16 months agoMerge pull request #55675 from douglasawh/patch-2
Zac Dover [Fri, 21 Jun 2024 09:17:54 +0000 (19:17 +1000)]
Merge pull request #55675 from douglasawh/patch-2

doc/src: invadvisable is not a word

Reviewed-by: Zac Dover <zac.dover@proton.me>
16 months agomgr/dashboard: Block -> Images -> Create form improvements
Aashish Sharma [Tue, 4 Jun 2024 12:29:34 +0000 (17:59 +0530)]
mgr/dashboard: Block -> Images -> Create form improvements

Fixes: https://tracker.ceph.com/issues/66348
Signed-off-by: Aashish Sharma <aasharma@redhat.com>
16 months agoMerge pull request #58109 from zdover23/wip-doc-2024-06-18-rados-ops-stretch-mode
Zac Dover [Fri, 21 Jun 2024 05:49:01 +0000 (15:49 +1000)]
Merge pull request #58109 from zdover23/wip-doc-2024-06-18-rados-ops-stretch-mode

doc/rados: add stretch_rule workaround

Reviewed-by: Prashant D <pdhange@redhat.com>
16 months agoMerge PR #58174 into main
Patrick Donnelly [Fri, 21 Jun 2024 01:50:25 +0000 (21:50 -0400)]
Merge PR #58174 into main

* refs/pull/58174/head:
script/backport-create-issue: update tag custom field

Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
16 months agomgr/dashboard: Expand Cluster improvements
Naman Munet [Fri, 7 Jun 2024 10:49:23 +0000 (16:19 +0530)]
mgr/dashboard: Expand Cluster improvements

worked on expand cluster screen hide/show and persisting osd form values

Fixes: https://tracker.ceph.com/issues/66344
Signed-off-by: Naman Munet <nmunet@redhat.com>
16 months agodoc: add documentation for `ceph auth rotate`
Patrick Donnelly [Tue, 18 Jun 2024 18:07:31 +0000 (14:07 -0400)]
doc: add documentation for `ceph auth rotate`

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
16 months agoPendingReleaseNotes: add note for new `auth rotate`
Patrick Donnelly [Tue, 18 Jun 2024 17:47:29 +0000 (13:47 -0400)]
PendingReleaseNotes: add note for new `auth rotate`

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
16 months agoqa: test `auth rotate`
Patrick Donnelly [Tue, 18 Jun 2024 18:17:00 +0000 (14:17 -0400)]
qa: test `auth rotate`

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
16 months agomon/AuthMonitor: add `ceph auth rotate` command
Patrick Donnelly [Tue, 18 Jun 2024 17:31:14 +0000 (13:31 -0400)]
mon/AuthMonitor: add `ceph auth rotate` command

Add command to rotate the permanent key of an entity. This avoids the need to
delete / recreate the key when it is compromised, lost, or just scheduled for
rotation.

Fixes: https://tracker.ceph.com/issues/66509
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
16 months agodoc/src: invadvisable is not a word
Doug Whitfield [Tue, 20 Feb 2024 21:13:14 +0000 (15:13 -0600)]
doc/src: invadvisable is not a word

changed to inadvisable on line 4997

Signed-off-by: Doug Whitfield <douglasawh@gmail.com>
16 months agoRevert "test: reproduce flap peering bug (wip)"
Laura Flores [Thu, 20 Jun 2024 18:52:07 +0000 (13:52 -0500)]
Revert "test: reproduce flap peering bug (wip)"

This reverts commit c89f9c4e80a4ac45ae5ea86ea7e9c41999299f27.

Note from Dan van der Ster:
This is a test that should succeed,
it definitely used to succeed back in the L/O days of Ceph. At some
point peering code changed and this behaviour regressed. In short,
an OSD goes down then comes up, and no objects were modified in the mean time.
There should be no degraded PGs in this case.

As this commit is currently breaking make check on all PRs, I think it should
be re-evaluated and merged so whatever fix is needed along with this test to
make it work are merged together.

Fixes: https://tracker.ceph.com/issues/66556
Signed-off-by: Laura Flores <lflores@ibm.com>
16 months ago.github/labeler: add script label to script changes
Patrick Donnelly [Thu, 20 Jun 2024 16:00:05 +0000 (12:00 -0400)]
.github/labeler: add script label to script changes

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
16 months agoscript/backport-create-issue: update tag custom field
Patrick Donnelly [Thu, 20 Jun 2024 15:43:17 +0000 (11:43 -0400)]
script/backport-create-issue: update tag custom field

The redmineup_tags was added and, apparently, the old "Tags" custom field with
id == 3 was deleted and then recreated with id == 31. This broke the script.
Update and refactor the id for that "Tags" custom field.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
16 months agoMerge pull request #57412 from adk3798/stray-laundry2
Adam King [Thu, 20 Jun 2024 16:06:18 +0000 (12:06 -0400)]
Merge pull request #57412 from adk3798/stray-laundry2

qa/cephadm: fix ignorelist of CEPHADM_STRAY_DAEMON for rados_api_tests

Reviewed-by: Laura Flores <lflores@ibm.com>
16 months agoMerge pull request #58055 from zmc/qemu-task-version
Ilya Dryomov [Thu, 20 Jun 2024 15:05:11 +0000 (17:05 +0200)]
Merge pull request #58055 from zmc/qemu-task-version

qa/tasks/qemu: Fix OS version comparison

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
16 months agoMerge pull request #57079 from kchheda3/wip-lc-notification
Casey Bodley [Thu, 20 Jun 2024 14:19:18 +0000 (15:19 +0100)]
Merge pull request #57079 from kchheda3/wip-lc-notification

rgw/lifecycle-notification: Do not block lc processing for notification errors.

Reviewed-by: Yuval Lifshitz <ylifshit@ibm.com>
16 months agoMerge pull request #57933 from samarahu/d4n_copy_test_fix
Casey Bodley [Thu, 20 Jun 2024 14:04:47 +0000 (15:04 +0100)]
Merge pull request #57933 from samarahu/d4n_copy_test_fix

d4n/directory: Comment out `copy` unit tests and remove unnecessary `copy_object` implementation in filter

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>
Reviewed-by: Casey Bodley <cbodley@redhat.com>
16 months agoMerge pull request #58147 from zdover23/wip-doc-2024-06-20-start-beginners-guide...
Zac Dover [Thu, 20 Jun 2024 13:53:42 +0000 (23:53 +1000)]
Merge pull request #58147 from zdover23/wip-doc-2024-06-20-start-beginners-guide-wiki

doc/start: add wiki to Beginner's Guide

Reviewed-by: Cole Mitchell <cole.mitchell.ceph@gmail.com>
16 months agomgr/dashboard: Fix NFS routing
Afreen [Wed, 3 Apr 2024 02:15:32 +0000 (07:45 +0530)]
mgr/dashboard: Fix NFS routing

Fixes https://tracker.ceph.com/issues/65310

The NFS tab in object and File nav uses same route due to which both
gets activated when one of them is clicked.
Hence, this PR separates the routing for Object and File nav.
Object-> NFS: /rgw/nfs
File-> NFS: /cephfs/nfs

Both routes use same NFS List and Form component but under different
routes as mentioned above.

Changes summary
- updated route for File from "/fs" to "/cephfs/<any_other_sub_route>"
  to support both fs and nfs tabs. Since using `/fs` and `/fs/nfs` will
activate both paths and it will be an undesirable user experience.
- `getFsalRouteFromPath` helper function to set the storage backend from
  route.
- removed `stoarge-backend` field from nfs form as now route decides teh
  storage backend
- breadcrumbs redirect to respective navs
- updated e2e tests
- updated unit tests
- changes list page of object-> nfs page to say Bucket instead of Path

Signed-off-by: Afreen <afreen23.git@gmail.com>
16 months agoMerge pull request #58069 from rhcs-dashboard/multiple-alert-panel-dashboard
Nizamudeen A [Thu, 20 Jun 2024 12:48:24 +0000 (18:18 +0530)]
Merge pull request #58069 from rhcs-dashboard/multiple-alert-panel-dashboard

mgr/dashboard: fix alert broken for multiple alerts

Reviewed-by: afreen23 <NOT@FOUND>
Reviewed-by: ivoalmeida <NOT@FOUND>
16 months agoMerge pull request #51566 from rhcs-dashboard/fix-pool-application
Avan [Thu, 20 Jun 2024 12:29:05 +0000 (17:59 +0530)]
Merge pull request #51566 from rhcs-dashboard/fix-pool-application

mgr/dashboard: make application field for pool creation mandatory

Reviewed-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>
Reviewed-by: afreen23 <NOT@FOUND>
Reviewed-by: Ankush Behl <cloudbehl@gmail.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>
Reviewed-by: Pere Diaz Bou <pdiazbou@redhat.com>
Reviewed-by: Sunil Angadi <Sunil.Angadi@ibm.com>
16 months agodoc/rados: followup to PR#58057
Zac Dover [Thu, 20 Jun 2024 11:43:53 +0000 (21:43 +1000)]
doc/rados: followup to PR#58057

Incorporate Anthony D'Atri's suggestions in
https://github.com/ceph/ceph/pull/58057

Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
16 months agoMerge pull request #58068 from rhcs-dashboard/disable-telemetry-notifications-in-e2e
Nizamudeen A [Thu, 20 Jun 2024 06:25:44 +0000 (11:55 +0530)]
Merge pull request #58068 from rhcs-dashboard/disable-telemetry-notifications-in-e2e

mgr/dashboard: disable telemetry notification in e2e

Reviewed-by: afreen23 <NOT@FOUND>
Reviewed-by: nmunet <NOT@FOUND>
16 months agoMerge pull request #58138 from guits/fix-dmcrypt-check
Guillaume Abrioux [Thu, 20 Jun 2024 05:45:03 +0000 (07:45 +0200)]
Merge pull request #58138 from guits/fix-dmcrypt-check

ceph-volume: fix regex usage in `set_dmcrypt_no_workqueue`

16 months agoMerge pull request #57946 from ceph/wip-lusov-qdb-bootstrap-event
Leonid S. Usov [Thu, 20 Jun 2024 04:56:09 +0000 (07:56 +0300)]
Merge pull request #57946 from ceph/wip-lusov-qdb-bootstrap-event

mds: QuiesceDbManager: mark next retry event during bootstrap

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
16 months agoMerge pull request #57980 from ceph/wip-lusov-qdb-root-ttl
Leonid S. Usov [Thu, 20 Jun 2024 04:55:36 +0000 (07:55 +0300)]
Merge pull request #57980 from ceph/wip-lusov-qdb-root-ttl

mds/quiesce-db: calculate_quiesce_map: aggregate quiesce roots' TTL as `max`

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
16 months agodoc/start: add wiki to Beginner's Guide
Zac Dover [Thu, 20 Jun 2024 03:54:24 +0000 (13:54 +1000)]
doc/start: add wiki to Beginner's Guide

Add the Redmine Ceph Wiki to the Ceph Beginner's Guide.

Signed-off-by: Zac Dover <zac.dover@proton.me>
16 months agocommon/TrackedOp: do not count the ops marked as nowarn
Xiubo Li [Wed, 19 Jun 2024 03:27:33 +0000 (11:27 +0800)]
common/TrackedOp: do not count the ops marked as nowarn

If an op is marked as nowarn then it won't be counted as the slow
requests, but currently it will count the initiated time when
iterating the inflight ops.

For example:

[WRN] : 1 slow requests, 1 included below; oldest blocked for > 38.764892 secs
[WRN] : slow request 33.875059 seconds old, received at 2024-06-17T14:14:34.228261+0000: client_request(client.78109915:11369251 mkdir #0x1008ecedea2/chk-89588 2024-06-17T14:14:34.097825+0000 caller_uid=1002960000, caller_gid=0{0,1002960000,}) currently failed to wrlock, waiting

The oldest blocked request is 38.764892 old, but the oldest slow
request reported is 33.875059 old.

Fixes: commit e4160d7e783 ("mds: don't report slow request for blocked filelock request")
Fixes: https://tracker.ceph.com/issues/66557
Signed-off-by: Xiubo Li <xiubli@redhat.com>
16 months agoMerge PR #55792 into main
Patrick Donnelly [Wed, 19 Jun 2024 19:19:56 +0000 (15:19 -0400)]
Merge PR #55792 into main

* refs/pull/55792/head:
tools/cephfs: recover alternate_name of dentries from journal
qa: add test to verify recovery of alternate_name from journal
tools/cephfs/JournalTool: add some more debugging
tools/cephfs/JournalTool: remove extraneous 0x in debug output
mds: dump alternate_name to formatter
mds: add warning about encoding new fields

Reviewed-by: Christopher Hoffman <choffman@redhat.com>
Reviewed-by: Xiubo Li <xiubli@redhat.com>
16 months agomgr/rgw: fix error handling in rgw zone create
Adam King [Wed, 19 Jun 2024 19:04:21 +0000 (15:04 -0400)]
mgr/rgw: fix error handling in rgw zone create

This was returning either a list of strings or
a HandleCommandResult and in the latter case
it would error out trying to build the final
return message which covered up the original
error

Fixes: https://tracker.ceph.com/issues/66568
Signed-off-by: Adam King <adking@redhat.com>
16 months agopython-common/rgw: fix broken f-string in zone_create func
Adam King [Wed, 19 Jun 2024 18:46:24 +0000 (14:46 -0400)]
python-common/rgw: fix broken f-string in zone_create func

Or I guess I should say make it an f-string since the
"f" was mising despite using f-string syntax

Signed-off-by: Adam King <adking@redhat.com>
16 months agoMerge pull request #57973 from idryomov/wip-66418
Ilya Dryomov [Wed, 19 Jun 2024 18:17:15 +0000 (20:17 +0200)]
Merge pull request #57973 from idryomov/wip-66418

librbd: diff-iterate shouldn't crash on an empty byte range

Reviewed-by: Ramana Raja <rraja@redhat.com>
Reviewed-by: Mykola Golub <mgolub@suse.com>
16 months agoMerge pull request #58074 from idryomov/wip-66300
Ilya Dryomov [Wed, 19 Jun 2024 18:15:35 +0000 (20:15 +0200)]
Merge pull request #58074 from idryomov/wip-66300

librbd: disallow group snap rollback if memberships don't match

Reviewed-by: Ramana Raja <rraja@redhat.com>
16 months agoceph-volume: fix regex usage in `set_dmcrypt_no_workqueue`
Guillaume Abrioux [Wed, 19 Jun 2024 13:06:52 +0000 (15:06 +0200)]
ceph-volume: fix regex usage in `set_dmcrypt_no_workqueue`

- Updated the regex pattern to `r'(\d+\.?)+'` to more accurately
  capture version numbers.

- Replaced `re.match` with `re.search` to properly match the cryptsetup
  version in the output.

- `re.match` only checks for a match at the beginning of the string,
   while `re.search` looks for a match anywhere in the string.

This fix ensures that the function correctly retrieves the
cryptsetup version from the output.

Fixes: https://tracker.ceph.com/issues/66393
Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>
16 months agomgr/dashboard: fix alert broken for multiple alerts
Nizamudeen A [Sun, 16 Jun 2024 12:46:52 +0000 (18:16 +0530)]
mgr/dashboard: fix alert broken for multiple alerts

After carbon the alert panel was broken when there are multiple alerts
present (telemetry, motd, password expiration).

Applying carbon banner to the existing alert banner

Fixes: https://tracker.ceph.com/issues/66512
Signed-off-by: Nizamudeen A <nia@redhat.com>
16 months agodoc: document smb share login_control and restrict_access options
John Mulligan [Mon, 6 May 2024 23:29:25 +0000 (19:29 -0400)]
doc: document smb share login_control and restrict_access options

Signed-off-by: John Mulligan <jmulligan@redhat.com>
16 months agomgr/smb: add handler unit tests for login_control in share generation
John Mulligan [Mon, 6 May 2024 23:20:57 +0000 (19:20 -0400)]
mgr/smb: add handler unit tests for login_control in share generation

Signed-off-by: John Mulligan <jmulligan@redhat.com>
16 months agomgr/smb: add unit tests for share login_control attribute
John Mulligan [Mon, 6 May 2024 23:20:45 +0000 (19:20 -0400)]
mgr/smb: add unit tests for share login_control attribute

Signed-off-by: John Mulligan <jmulligan@redhat.com>
16 months agomgr/smb: add validation unit tests for login names
John Mulligan [Mon, 6 May 2024 23:20:35 +0000 (19:20 -0400)]
mgr/smb: add validation unit tests for login names

Signed-off-by: John Mulligan <jmulligan@redhat.com>
16 months agomgr/smb: include login_control content when generating share config
John Mulligan [Mon, 6 May 2024 20:35:31 +0000 (16:35 -0400)]
mgr/smb: include login_control content when generating share config

The login_control list (modified by restrict_access) defines the
smb.conf params 'read list', 'write list', 'admin users', 'invalid
users', and 'valid users'.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
16 months agomgr/smb: add login_control and restrict_access attributes to share
John Mulligan [Mon, 6 May 2024 20:35:41 +0000 (16:35 -0400)]
mgr/smb: add login_control and restrict_access attributes to share

The login_control attribute contains a list of LoginAccessEntry objects.
These will be mapped to samba share access parameters.
The restrict_access attribute is a boolean that makes access to the
share contingent on being listed in the login_control list, as
either a listed user or member of a listed group, for any access
level other than 'none' (as that always denies access).

Signed-off-by: John Mulligan <jmulligan@redhat.com>
16 months agomgr/smb: add validation function for share login names
John Mulligan [Mon, 6 May 2024 20:35:18 +0000 (16:35 -0400)]
mgr/smb: add validation function for share login names

Add a validation function that will be used for a future share login
control attribute.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
16 months agomgr/smb: add enums for managing share login access control
John Mulligan [Mon, 6 May 2024 20:35:06 +0000 (16:35 -0400)]
mgr/smb: add enums for managing share login access control

Add enums that will be used to create a share login control attribute.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
16 months agoMerge pull request #57294 from phlogistonjohn/jjm-smb-free-customize
Adam King [Wed, 19 Jun 2024 13:17:51 +0000 (09:17 -0400)]
Merge pull request #57294 from phlogistonjohn/jjm-smb-free-customize

mgr/smb: add custom config options to share and cluster resources

Reviewed-by: Adam King <adking@redhat.com>
16 months agoMerge pull request #57430 from tchaikov/wip-rgw-driver-posi
Casey Bodley [Wed, 19 Jun 2024 12:14:30 +0000 (13:14 +0100)]
Merge pull request #57430 from tchaikov/wip-rgw-driver-posi

rgw/driver/posix: s/is_value()/success()/

Reviewed-by: Matt Benjamin <mbenjamin@redhat.com>
16 months agoMerge pull request #57691 from xxhdx1985126/wip-66216
Matan Breizman [Wed, 19 Jun 2024 10:21:25 +0000 (13:21 +0300)]
Merge pull request #57691 from xxhdx1985126/wip-66216

crimson/osd/osd_operations/client_request_common: `PeeringState::needs_recovery()` may fail if the object is under backfill

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Matan Breizman <mbreizma@redhat.com>
16 months agomgr/dashboard: fix service page e2e tests
Afreen Misbah [Fri, 14 Jun 2024 05:21:47 +0000 (10:51 +0530)]
mgr/dashboard: fix service page e2e tests

- service page now uses defaults value for the placement count due to which mds test failing
- in test we pass "1" while "2" which is the default count for mds is already populated, making it 21 and causing unable to create mds service

Fixes: https://tracker.ceph.com/issues/66540
Signed-off-by: Afreen Misbah <afreen23.git@gmail.com>
(cherry picked from commit 5a718229dc43953f140a16342f19523a1c075863)

16 months agoMerge pull request #53832 from rhcs-dashboard/fix-cephadm-daemon-failed-alert
Nizamudeen A [Wed, 19 Jun 2024 06:49:56 +0000 (12:19 +0530)]
Merge pull request #53832 from rhcs-dashboard/fix-cephadm-daemon-failed-alert

mgr/dashboard: Show which daemons failed in CEPHADM_FAILED_DAEMON healthcheck

Reviewed-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>
Reviewed-by: Ankush Behl <cloudbehl@gmail.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>
16 months agoMerge pull request #58083 from myoungwon/wip-seastore-rbm-stat
Yingxin [Wed, 19 Jun 2024 03:10:44 +0000 (11:10 +0800)]
Merge pull request #58083 from myoungwon/wip-seastore-rbm-stat

crimson/os/seastore: add writer level stats to RBM

Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
16 months agoqa/tasks/qemu: Fix OS version comparison
Zack Cerza [Fri, 14 Jun 2024 19:37:16 +0000 (13:37 -0600)]
qa/tasks/qemu: Fix OS version comparison

See: https://sentry.ceph.com/share/issue/21ed88d705854238bdafbf6711e795ee/
They're strings, not floats.
This surfaced as a result of https://github.com/ceph/teuthology/pull/1953

Signed-off-by: Zack Cerza <zack@redhat.com>
16 months agoMerge pull request #58123 from gsoneill/clarify-ec-architecture
Anthony D'Atri [Wed, 19 Jun 2024 00:50:21 +0000 (20:50 -0400)]
Merge pull request #58123 from gsoneill/clarify-ec-architecture

doc: clarify availability vs integrity

16 months agodoc: clarify availability vs integrity
Gregory O'Neill [Tue, 18 Jun 2024 21:17:00 +0000 (17:17 -0400)]
doc: clarify availability vs integrity

16 months agomds: initialize epoch for quiescedb
Patrick Donnelly [Wed, 12 Jun 2024 16:09:11 +0000 (12:09 -0400)]
mds: initialize epoch for quiescedb

Fixes: https://tracker.ceph.com/issues/66449
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
16 months agoMerge pull request #58112 from nbalacha/wip-doc-typo-180624
Ilya Dryomov [Tue, 18 Jun 2024 14:18:35 +0000 (16:18 +0200)]
Merge pull request #58112 from nbalacha/wip-doc-typo-180624

doc/rbd: fix typos in NVMe-oF docs

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>