]> git.apps.os.sepia.ceph.com Git - ceph.git/log
ceph.git
3 months agoMerge pull request #61468 from Nordix/fix-69610-sunnat 63818/head
Adam King [Tue, 1 Jul 2025 14:23:02 +0000 (10:23 -0400)]
Merge pull request #61468 from Nordix/fix-69610-sunnat

mgr/prometheus: Make prometheus TLS config work with Rook orchestrator

Reviewed-by: Redouane Kachach <rkachach@ibm.com>
3 months agoMerge pull request #64255 from bluikko/doc-config-ceph-conf-improvements-rados
Anthony D'Atri [Tue, 1 Jul 2025 13:24:18 +0000 (09:24 -0400)]
Merge pull request #64255 from bluikko/doc-config-ceph-conf-improvements-rados

doc/rados/configuration: Small improvements in ceph-conf.rst

3 months agoMerge pull request #64239 from tchaikov/wip-doc-mgr-crash
Kefu Chai [Tue, 1 Jul 2025 07:59:14 +0000 (15:59 +0800)]
Merge pull request #64239 from tchaikov/wip-doc-mgr-crash

doc/mgr/crash.rst: remove outdated module enabling instructions

Reviewed-by: Zac Dover <zac.dover@proton.me>
3 months agoMerge pull request #64065 from tchaikov/wip-test-mon-election-fix-leaks
Kefu Chai [Tue, 1 Jul 2025 04:31:34 +0000 (12:31 +0800)]
Merge pull request #64065 from tchaikov/wip-test-mon-election-fix-leaks

test/mon/test_election: fix memory leaks of Owner and ConnectionTracker

Reviewed-by: Ronen Friedman <rfriedma@redhat.com>
Reviewed-by: Kamoltat Sirivadhna <ksirivad@redhat.com>
3 months agoMerge pull request #60416 from rhcs-dashboard/carbonize-usage-bar
Aashish Sharma [Tue, 1 Jul 2025 04:17:23 +0000 (09:47 +0530)]
Merge pull request #60416 from rhcs-dashboard/carbonize-usage-bar

mgr/dashboard: replace usage or progress bar with carbon meter chart

Reviewed-by: Aashish Sharma <aasharma@redhat.com>
3 months agoMerge pull request #64244 from ronen-fr/wip-rf-noshard2
Ronen Friedman [Mon, 30 Jun 2025 16:28:45 +0000 (19:28 +0300)]
Merge pull request #64244 from ronen-fr/wip-rf-noshard2

osd/scrub: fixing additional log messages to name pgids

Reviewed-by: Matan Breizman <mbreizma@redhat.com>
3 months agoMerge pull request #63760 from mkogan1/wip-tst-perl-fix
Casey Bodley [Mon, 30 Jun 2025 12:53:03 +0000 (08:53 -0400)]
Merge pull request #63760 from mkogan1/wip-tst-perl-fix

qa/rgw: fix perl tests missing Amazon::S3 module

Reviewed-by: Adam C. Emerson <aemerson@redhat.com>
Reviewed-by: Casey Bodley <cbodley@redhat.com>
3 months agoMerge pull request #62929 from rhcs-dashboard/fix-71033-main
Aashish Sharma [Mon, 30 Jun 2025 11:48:12 +0000 (17:18 +0530)]
Merge pull request #62929 from rhcs-dashboard/fix-71033-main

mgr/dashboard: Enable rgw module automatically in the primary and secondary cluster if not enabled during multi-site automation

Reviewed-by: Naman Munet <naman.munet@ibm.com>
3 months agoMerge pull request #64010 from yuvalif/wip-yuval-70756
Yuval Lifshitz [Mon, 30 Jun 2025 11:10:02 +0000 (14:10 +0300)]
Merge pull request #64010 from yuvalif/wip-yuval-70756

rgw/notifications: prevent deletion of skipped notifications

3 months agoosd/scrub: fixing additional log messages to name pgids 64244/head
Ronen Friedman [Sun, 29 Jun 2025 12:41:14 +0000 (07:41 -0500)]
osd/scrub: fixing additional log messages to name pgids

... instead of shards

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
3 months agoMerge pull request #63560 from benhanokh/dedup_pool
Yuval Lifshitz [Mon, 30 Jun 2025 08:40:08 +0000 (11:40 +0300)]
Merge pull request #63560 from benhanokh/dedup_pool

rgw/dedup: full object dedup continuous work

3 months agodoc/rados/configuration: Small improvements in ceph-conf.rst 64255/head
Ville Ojamo [Mon, 30 Jun 2025 07:30:34 +0000 (14:30 +0700)]
doc/rados/configuration: Small improvements in ceph-conf.rst

Fix a missing space typo.

Use title case in section titles.

Use ref for hyperlink to a section that already has a label instead of
linking to title text of the section.

Use privileged command prompt for all ceph commands except ceph commands
that invoke help.

Use INI preformatted block for last remaining block that was just using
plain preformatted.

Signed-off-by: Ville Ojamo <14869000+bluikko@users.noreply.github.com>
3 months agoMerge pull request #64243 from ronen-fr/wip-rf-getreq
Ronen Friedman [Mon, 30 Jun 2025 06:16:07 +0000 (09:16 +0300)]
Merge pull request #64243 from ronen-fr/wip-rf-getreq

osd: delete duplicate data in enqueue_op() log message

Reviewed-by: Matan Breizman <mbreizma@redhat.com>
3 months agomgr/dashboard: Enable rgw module automatically in the primary and secondary cluster... 62929/head
Aashish Sharma [Wed, 23 Apr 2025 09:23:24 +0000 (14:53 +0530)]
mgr/dashboard: Enable rgw module automatically in the primary and secondary cluster if not enabled during multi-site automation

1. Enable rgw module automatically in the primary and secondary cluster if not enabled during multi-site automation
2. Improve progress bar descriptions and add sub-descriptions for steps

Fixes: https://tracker.ceph.com/issues/71033
Signed-off-by: Aashish Sharma <aasharma@redhat.com>
3 months agoMerge pull request #62785 from bluikko/pybind-doc-empty-lines
Kefu Chai [Mon, 30 Jun 2025 05:19:00 +0000 (13:19 +0800)]
Merge pull request #62785 from bluikko/pybind-doc-empty-lines

pybind/mgr: Fix missing empty lines in documentation in mgr_module.py

Reviewed-by: Matan Breizman <mbreizma@redhat.com>
Reviewed-by: Kefu Chai <tchaikov@gmail.com>
3 months agoosd: delete duplicate data in enqueue_op() log message 64243/head
Ronen Friedman [Sun, 29 Jun 2025 12:48:06 +0000 (07:48 -0500)]
osd: delete duplicate data in enqueue_op() log message

Following f1b7854, get_req() was logged twice in one of the lines.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
3 months agodoc/mgr/crash.rst: remove outdated module enabling instructions 64239/head
Kefu Chai [Sun, 29 Jun 2025 02:55:30 +0000 (10:55 +0800)]
doc/mgr/crash.rst: remove outdated module enabling instructions

The crash module has been enabled by default since commit 18f253aa in
Nautilus and is now in the always_on_modules list. However, the
documentation still contained instructions for manually enabling it.

When users followed these outdated instructions, they encountered:
```
  module 'crash' is already enabled (always-on)
```

The module cannot be disabled either. Running:
```
  ceph mgr module disable crash
```

Returns the error:
```
  Error EINVAL: module 'crash' cannot be disabled (always-on)
```
In this change, we remove the obsolete enabling instructions and clarify
that this module is always active and cannot be disabled.

Signed-off-by: Kefu Chai <tchaikov@gmail.com>
3 months agoMerge pull request #64211 from ronen-fr/wip-rf-noshard
Ronen Friedman [Fri, 27 Jun 2025 10:52:57 +0000 (13:52 +0300)]
Merge pull request #64211 from ronen-fr/wip-rf-noshard

osd/scrub: 'starts' messages should name PGs, not shards

Reviewed-by: Aishwarya Mathuria <amathuri@redhat.com>
3 months agoqa/rgw: fix perl tests missing Amazon::S3 module 63760/head
Mark Kogan [Wed, 25 Jun 2025 12:21:49 +0000 (12:21 +0000)]
qa/rgw: fix perl tests missing Amazon::S3 module

and a second case where perl tests can fail without error output

1. fix errors like: `Can't locate Amazon/S3.pm in @INC (you may need to
   install the Amazon::S3 module)`
by priming the perl tests with installing the Amazon::S3 module from cpan

ex:
```
2025-06-23T19:18:40.162 INFO:tasks.workunit.client.0.smithi090.stderr:Can't locate Amazon/S3.pm in @INC (you may need to install the Amazon::S3 module) (@INC contains: /usr/local/lib64/perl5/5.32 ...
```

2. log an error when RGW process is not detected

Fixes: https://tracker.ceph.com/issues/71577
Signed-off-by: Mark Kogan <mkogan@redhat.com>
3 months agoMerge pull request #64164 from tchaikov/wip-dashboard-api
Nizamudeen A [Fri, 27 Jun 2025 05:06:08 +0000 (10:36 +0530)]
Merge pull request #64164 from tchaikov/wip-dashboard-api

mgr/dashboard: Fix inline markup warning in API documentation

3 months agoMerge pull request #64123 from phlogistonjohn/jjm-el10-replace-pathfix-spec
David Galloway [Thu, 26 Jun 2025 19:41:36 +0000 (15:41 -0400)]
Merge pull request #64123 from phlogistonjohn/jjm-el10-replace-pathfix-spec

ceph.spec.in: use rpm macro for python shebang pathfix

3 months agoMerge pull request #62944 from lbausch/common-fix-config-description-typo
Zac Dover [Thu, 26 Jun 2025 18:40:29 +0000 (04:40 +1000)]
Merge pull request #62944 from lbausch/common-fix-config-description-typo

common/options: fix typo in option description

Reviewed-by: Zac Dover <zac.dover@proton.me>
3 months agoMerge pull request #63564 from Hezko/nvmeof-cli-update-nvme-submodule
Hezko [Thu, 26 Jun 2025 17:24:19 +0000 (20:24 +0300)]
Merge pull request #63564 from Hezko/nvmeof-cli-update-nvme-submodule

mgr/dashboard update ceph nvmeof proto files

3 months agoMerge pull request #63102 from ceph/wip-release-branch-protection
David Galloway [Thu, 26 Jun 2025 15:05:36 +0000 (11:05 -0400)]
Merge pull request #63102 from ceph/wip-release-branch-protection

doc: Add branch protection step to new release checklist

3 months agoMerge pull request #63179 from tchaikov/rgw-std-variant
Casey Bodley [Thu, 26 Jun 2025 15:03:13 +0000 (11:03 -0400)]
Merge pull request #63179 from tchaikov/rgw-std-variant

rgw: migrate from boost::variant to std::variant

Reviewed-by: Matt Benjamin <mbenjamin@redhat.com>
Reviewed-by: Alex Wojno <awojno@bloomberg.net>
3 months agoMerge pull request #63930 from cbodley/wip-cls-rgw-BucketIndexAioManager
Casey Bodley [Thu, 26 Jun 2025 15:02:22 +0000 (11:02 -0400)]
Merge pull request #63930 from cbodley/wip-cls-rgw-BucketIndexAioManager

cls/rgw: remove unused class BucketIndexAioManager

Reviewed-by: Adam C. Emerson <aemerson@redhat.com>
3 months agoMerge pull request #63926 from cbodley/wip-71667
Casey Bodley [Thu, 26 Jun 2025 14:41:43 +0000 (10:41 -0400)]
Merge pull request #63926 from cbodley/wip-71667

rgw/sal: Serializer::unlock() takes optional_yield

Reviewed-by: Adam C. Emerson <aemerson@redhat.com>
Reviewed-by: Daniel Gryniewicz <dang@redhat.com>
3 months agoosd/scrub: 'starts' messages should name PGs, not shards 64211/head
Ronen Friedman [Thu, 26 Jun 2025 13:27:57 +0000 (08:27 -0500)]
osd/scrub: 'starts' messages should name PGs, not shards

By mistake, the 'scrub starts' message included the shard ID
of the primary OSD, instead of just the PG ID.

Fixes: https://tracker.ceph.com/issues/71780
Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
3 months agomgr/dashboard: replace usage or progress bar with carbon meter chart 60416/head
Naman Munet [Mon, 21 Oct 2024 16:55:41 +0000 (22:25 +0530)]
mgr/dashboard: replace usage or progress bar with carbon meter chart

Fixes: https://tracker.ceph.com/issues/68258
Changes affect the following files:
- rbd-list.component.html
- cephfs-detail.component.html
- cephfs-subvolume-group.component.html
- cephfs-subvolume-list.componenet.html
- multi-cluster.component.html
- osd-list.component.html
- service-daemon-list.component.html
- pool-list.component.html
- rgw-bucket-list.component.html
- rgw-user-list.component.html

Signed-off-by: Naman Munet <naman.munet@ibm.com>
4 months agoMerge pull request #64182 from ronen-fr/wip-rf-perfcprio
Ronen Friedman [Thu, 26 Jun 2025 10:34:01 +0000 (13:34 +0300)]
Merge pull request #64182 from ronen-fr/wip-rf-perfcprio

osd/scrub: some perf counters had their priority set to '0'

Reviewed-by: Adam Kupczyk <akupczyk@ibm.com>
4 months agoMerge pull request #64181 from tchaikov/wip-rgw-iwyu
Kefu Chai [Thu, 26 Jun 2025 08:46:58 +0000 (16:46 +0800)]
Merge pull request #64181 from tchaikov/wip-rgw-iwyu

rgw: do not include unused header

Reviewed-by: Casey Bodley <cbodley@redhat.com>
4 months agoMerge pull request #62668 from rishabh-d-dave/vols-namespace
Rishabh Dave [Thu, 26 Jun 2025 08:23:32 +0000 (13:53 +0530)]
Merge pull request #62668 from rishabh-d-dave/vols-namespace

mgr/vol: include group name in subvolume's pool namespace name

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Kotresh HR <khiremat@redhat.com>
4 months agomgr/dashboard update ceph nvmoef proto files 63564/head
Tomer Haskalovitch [Tue, 27 May 2025 09:04:49 +0000 (12:04 +0300)]
mgr/dashboard update ceph nvmoef proto files

Signed-off-by: Tomer Haskalovitch <il033030@Tomers-MBP.lan>
4 months agoMerge pull request #63936 from tchaikov/wip-mds-flock-exclude-dump
Kefu Chai [Thu, 26 Jun 2025 06:18:30 +0000 (14:18 +0800)]
Merge pull request #63936 from tchaikov/wip-mds-flock-exclude-dump

mds/flock: exclude non-persisted fields from ceph_lock_state_t::dump()

Reviewed-by: Nitzan Mordechai <nmordech@redhat.com>
4 months agoMerge pull request #64174 from idryomov/wip-71838
Ilya Dryomov [Thu, 26 Jun 2025 06:01:06 +0000 (08:01 +0200)]
Merge pull request #64174 from idryomov/wip-71838

librbd/migration/QCOWFormat: don't complete read_clusters() inline

Reviewed-by: Ramana Raja <rraja@redhat.com>
4 months agoMerge pull request #63374 from Matan-B/wip-matanb-use-ceph-assert
SrinivasaBharathKanta [Wed, 25 Jun 2025 23:37:06 +0000 (05:07 +0530)]
Merge pull request #63374 from Matan-B/wip-matanb-use-ceph-assert

osd, os, mon: s/assert/ceph_assert/g'

4 months agoMerge pull request #63839 from aainscow/setattr_fix
Alex Ainscow [Wed, 25 Jun 2025 22:25:54 +0000 (23:25 +0100)]
Merge pull request #63839 from aainscow/setattr_fix

bluestore: Fix _setattr() with rare memory alignments

Reviewed-by: akupczyk@redhat.com
Reviewed-by: rzarzyns@redhat.com
4 months agoMerge pull request #63970 from tchaikov/wip-osd-scrubber-localtime
SrinivasaBharathKanta [Wed, 25 Jun 2025 21:50:19 +0000 (03:20 +0530)]
Merge pull request #63970 from tchaikov/wip-osd-scrubber-localtime

osd/scrubber: replace deprecated fmt::localtime() with localtime_r()

4 months agoMerge pull request #63792 from kamoltat/wip-ksirivad-small-refractor-68550
SrinivasaBharathKanta [Wed, 25 Jun 2025 21:49:48 +0000 (03:19 +0530)]
Merge pull request #63792 from kamoltat/wip-ksirivad-small-refractor-68550

src/mgr/ClusterState.cc: micro improve ingest_pgstats

4 months agoMerge pull request #63552 from shraddhaag/wip-shraddhaag-availability-3
SrinivasaBharathKanta [Wed, 25 Jun 2025 21:49:02 +0000 (03:19 +0530)]
Merge pull request #63552 from shraddhaag/wip-shraddhaag-availability-3

mon: add command osd pool clear-availability-status

4 months agorgw/notifications: prevent infinite loop in case of malformed notifications 64010/head
Yuval Lifshitz [Tue, 24 Jun 2025 10:02:57 +0000 (10:02 +0000)]
rgw/notifications: prevent infinite loop in case of malformed notifications

if we fail to decode a notification entry we should remove it.
o/w we will keep failing on that entry

Signed-off-by: Yuval Lifshitz <ylifshit@ibm.com>
4 months agorgw/notifications: make queue idle when all notifications are in "sleep" state
Yuval Lifshitz [Wed, 18 Jun 2025 14:56:01 +0000 (14:56 +0000)]
rgw/notifications: make queue idle when all notifications are in "sleep" state

this will prevent re-reading the queue when there is no work to do
also, put into "idle" state in case of failure with -EBUSY error code

Signed-off-by: Yuval Lifshitz <ylifshit@ibm.com>
4 months agotest/rgw/notifications: prevent client retries to avoid duplicates
Yuval Lifshitz [Wed, 18 Jun 2025 12:11:46 +0000 (12:11 +0000)]
test/rgw/notifications: prevent client retries to avoid duplicates

if the RGW is slow, and the client retry, it may cause test to fail
since the number of notifications would be off.
in addition, in slow RGW, we need to verify that the expiry time did
not pass before checking the queue, so we see the expected number of
entries in the queue before they expire.

Fixes: https://tracker.ceph.com/issues/70756
Signed-off-by: Yuval Lifshitz <ylifshit@ibm.com>
4 months agorgw/notifications: stop processing when we reach a skipped notifications
Yuval Lifshitz [Wed, 18 Jun 2025 12:09:12 +0000 (12:09 +0000)]
rgw/notifications: stop processing when we reach a skipped notifications

if a notification retry should be skipped, we should stop processing
all notifications. if we successfully processing another notification
it will not be removed (as we will remove only up to the marker of the
skipped notification). as a result, the successfull notification will be
processed again.

Fixes: https://tracker.ceph.com/issues/70756
Signed-off-by: Yuval Lifshitz <ylifshit@ibm.com>
4 months agoosd/scrub: some perf counters priority was '0' 64182/head
Ronen Friedman [Wed, 25 Jun 2025 14:25:08 +0000 (09:25 -0500)]
osd/scrub: some perf counters priority was '0'

Some scrub perf counters were created without specifying
individual priorities, assuming by mistake that the
default priority is '_INTERESTING'. That was not the case,
and those perf counters were not reported.

Fixes: https://tracker.ceph.com/issues/71842
Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
4 months agoMerge pull request #63523 from Matan-B/wip-matanb-crimson-asock-race-fixes
Matan Breizman [Wed, 25 Jun 2025 14:17:24 +0000 (17:17 +0300)]
Merge pull request #63523 from Matan-B/wip-matanb-crimson-asock-race-fixes

crimson/osd: Admin Socket fixes

Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
4 months agorgw: do not include unused header 64181/head
Kefu Chai [Wed, 25 Jun 2025 13:51:04 +0000 (21:51 +0800)]
rgw: do not include unused header

previously, when building cls_rgw, we could have following build
failure:

```
In file included from /home/jenkins-build/build/workspace/ceph-dashboard-pull-requests/src/cls/rgw/cls_rgw_types.cc:4:
In file included from /home/jenkins-build/build/workspace/ceph-dashboard-pull-requests/src/cls/rgw/cls_rgw_types.h:15:
In file included from /home/jenkins-build/build/workspace/ceph-dashboard-pull-requests/src/rgw/rgw_basic_types.h:32:
In file included from /home/jenkins-build/build/workspace/ceph-dashboard-pull-requests/src/rgw/rgw_user_types.h:27:
In file included from /home/jenkins-build/build/workspace/ceph-dashboard-pull-requests/src/common/dout.h:29:
In file included from /home/jenkins-build/build/workspace/ceph-dashboard-pull-requests/src/common/ceph_context.h:41:
In file included from /home/jenkins-build/build/workspace/ceph-dashboard-pull-requests/src/common/config_proxy.h:7:
In file included from /home/jenkins-build/build/workspace/ceph-dashboard-pull-requests/src/common/config.h:28:
In file included from /home/jenkins-build/build/workspace/ceph-dashboard-pull-requests/src/common/config_values.h:59:
/home/jenkins-build/build/workspace/ceph-dashboard-pull-requests/src/common/options/legacy_config_opts.h:1:10: fatal error: 'global_legacy_options.h' file not found
    1 | #include "global_legacy_options.h"
      |          ^~~~~~~~~~~~~~~~~~~~~~~~~
```

but it turned out that `cls_rgw_types.h` does not use `dout.h` at all.
so, in this change, we just drop this include. this helps to reduce
the build dependency.

Signed-off-by: Kefu Chai <tchaikov@gmail.com>
4 months agoMerge pull request #64172 from zdover23/wip-doc-2025-06-25-tr70758-radosgw-bucket_logging
Zac Dover [Wed, 25 Jun 2025 13:44:50 +0000 (23:44 +1000)]
Merge pull request #64172 from zdover23/wip-doc-2025-06-25-tr70758-radosgw-bucket_logging

doc/radosgw: line edit bucket_logging.rst

Reviewed-by: Yuval Lifshitz <ylifshit@ibm.com>
4 months agoMerge pull request #64161 from tchaikov/wip-doc-mgr-module-config
Kefu Chai [Wed, 25 Jun 2025 11:45:15 +0000 (19:45 +0800)]
Merge pull request #64161 from tchaikov/wip-doc-mgr-module-config

doc: Document ceph-mgr module configuration options

Reviewed-by: Zac Dover <zac.dover@proton.me>
Reviewed-by: Ville Ojamo <14869000+bluikko@users.noreply.github.com>
4 months agoMerge pull request #64162 from tchaikov/wip-doc-drop-typed-ast
Kefu Chai [Wed, 25 Jun 2025 11:43:41 +0000 (19:43 +0800)]
Merge pull request #64162 from tchaikov/wip-doc-drop-typed-ast

doc: do not depend on typed-ast

Reviewed-by: Zac Dover <zac.dover@proton.me>
4 months agomgr/dashboard: Fix inline markup warning in API documentation 64164/head
Kefu Chai [Wed, 25 Jun 2025 04:14:36 +0000 (12:14 +0800)]
mgr/dashboard: Fix inline markup warning in API documentation

Remove trailing space from summary field that was causing Sphinx build
warning.

Sphinx was generating a warning due to malformed inline markup:

```
/home/kefu/dev/ceph/doc/mgr/ceph_api/index.rst:3349: WARNING: Inline strong start-string without end-string.`
```

The openapi directive appears to convert trailing spaces into asterisk
markers, creating unterminated strong markup. This change removes the
trailing space to eliminate the warning and maintain consistency with
other entries in the file.

Signed-off-by: Kefu Chai <tchaikov@gmail.com>
4 months agolibrbd/migration/QCOWFormat: don't complete read_clusters() inline 64174/head
Ilya Dryomov [Wed, 25 Jun 2025 10:24:35 +0000 (12:24 +0200)]
librbd/migration/QCOWFormat: don't complete read_clusters() inline

When the cluster needs to be read, the completion is posted to ASIO.
However, in the two special cases (cluster DNE and zero cluster), the
completion is completed inline at the moment.  This violates invariants
and can eventually lead to a lockup.  For example, in a scenario of
a read from a clone image whose parent is under migration:

  io::ObjectReadRequest::read_parent()
    io::util::read_parent()
      < image_lock is taken for read >
      io::ImageDispatchSpec::send()
        migration::ImageDispatch::read()
          migration::QCOWFormat::ReadRequest::send()
            ...
            migration::QCOWFormat::ReadRequest::read_clusters()
              < cluster DNE >
              migration::QCOWFormat::ReadRequest::handle_read_clusters()
                io::AioCompletion::complete()
                  io::ObjectReadRequest::copyup()
                    is_copy_on_read()
                      < image_lock is taken for read >

copyup() expects to be called with no locks held, but going through
QCOWFormat in the "cluster DNE" case essentially maintains image_lock
taken in read_parent() and then it's taken again by the same thread in
is_copy_on_read().  Under pthreads, it's not a problem:

  A thread may hold multiple concurrent read locks on rwlock (that is,
  successfully call the pthread_rwlock_rdlock() function n times). If
  so, the thread must perform matching unlocks (that is, it must call
  the pthread_rwlock_unlock() function n times).

But according to C++ standard it's undefined behavior:

  If lock_shared is called by a thread that already owns the mutex in
  any mode (exclusive or shared), the behavior is undefined.

Other, longer and more elaborate, call chains are possible too and
there it may end up being a write lock, a tripped assertion, etc.  To
avoid this, make the special cases in read_clusters() behave the same
as the main path.

Fixes: https://tracker.ceph.com/issues/71838
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
4 months agolibrbd/migration/QCOWFormat: rename handle_read_cluster()
Ilya Dryomov [Wed, 25 Jun 2025 10:55:06 +0000 (12:55 +0200)]
librbd/migration/QCOWFormat: rename handle_read_cluster()

... to handle_read_clusters() as it's a callback for read_clusters().

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
4 months agodoc/radosgw: line edit bucket_logging.rst 64172/head
Zac Dover [Wed, 25 Jun 2025 09:19:49 +0000 (19:19 +1000)]
doc/radosgw: line edit bucket_logging.rst

Edit doc/radosgw/bucket_logging.rst so that it is not solecistic and so
that its punctuation is corrected and its use of articles is corrected.
This file remains in my judgment demotic and maybe demotic enough to
warrant another editorial pass in the future.

Fixes: https://tracker.ceph.com/issues/70758
Signed-off-by: Zac Dover <zac.dover@proton.me>
4 months agoMerge pull request #64163 from tchaikov/wip-doc-inline-literal
Zac Dover [Wed, 25 Jun 2025 08:40:11 +0000 (18:40 +1000)]
Merge pull request #64163 from tchaikov/wip-doc-inline-literal

doc: Fix unterminated inline literal in ceph-conf.rst

Reviewed-by: Ronen Friedman <rfriedma@redhat.com>
Reviewed-by: Zac Dover <zac.dover@proton.me>
4 months agoMerge pull request #63673 from Matan-B/wip-matanb-crimson-signals-aborts
Matan Breizman [Wed, 25 Jun 2025 07:36:59 +0000 (10:36 +0300)]
Merge pull request #63673 from Matan-B/wip-matanb-crimson-signals-aborts

crimson/common/fatal_signal: Rework signals

Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
4 months agoMerge pull request #63972 from cyx1231st/wip-seastore-refine-extent-states-p2
Yingxin Cheng [Wed, 25 Jun 2025 07:07:10 +0000 (15:07 +0800)]
Merge pull request #63972 from cyx1231st/wip-seastore-refine-extent-states-p2

crimson/os/seastore: update extent states from pending to stable upon prepare_record()

Reviewed-by: Xuehan Xu <xuxuehan@qianxin.com>
4 months agoMerge PR #59435 into main
Venky Shankar [Wed, 25 Jun 2025 06:39:39 +0000 (12:09 +0530)]
Merge PR #59435 into main

* refs/pull/59435/head:
mgr/volumes: Fix json.loads for test on mon caps
mgr/volumes: Add test for mon caps if auth key has remaining mds/osd caps
mgr/volumes: Keep mon caps if auth key has remaining mds/osd caps

Reviewed-by: Milind Changire <mchangir@redhat.com>
Reviewed-by: Venky Shankar <vshankar@redhat.com>
4 months agodoc: Document ceph-mgr module configuration options 64161/head
Kefu Chai [Wed, 25 Jun 2025 02:26:58 +0000 (10:26 +0800)]
doc: Document ceph-mgr module configuration options

Add comprehensive documentation for defining configuration options in
ceph-mgr modules, including all supported properties and their usage.

Previously, the documentation did not explain how to define ceph-mgr
module configuration options, despite subtle differences from other Ceph
components. This change documents all supported Option properties, their
types, and provides clear examples to help module developers properly
configure their options.

Signed-off-by: Kefu Chai <tchaikov@gmail.com>
4 months agoMerge pull request #64160 from tchaikov/wip-doc-dev-config
Zac Dover [Wed, 25 Jun 2025 05:11:42 +0000 (15:11 +1000)]
Merge pull request #64160 from tchaikov/wip-doc-dev-config

doc/dev/config: Document how to use :confval: directive for config options

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
Reviewed-by: Zac Dover <zac.dover@proton.me>
4 months agodoc: Fix unterminated inline literal in ceph-conf.rst 64163/head
Kefu Chai [Wed, 25 Jun 2025 03:34:50 +0000 (11:34 +0800)]
doc: Fix unterminated inline literal in ceph-conf.rst

Add missing space after closing backticks to properly terminate inline literal.

Sphinx was generating a warning due to an unterminated inline literal
string:

```
/home/kefu/dev/ceph/doc/rados/configuration/ceph-conf.rst:320: WARNING: Inline literal start-string without end-string.
```

This fix ensures the inline literal is properly closed and eliminates the
build warning.

Signed-off-by: Kefu Chai <tchaikov@gmail.com>
4 months agodoc: do not depend on typed-ast 64162/head
Kefu Chai [Wed, 25 Jun 2025 03:02:46 +0000 (11:02 +0800)]
doc: do not depend on typed-ast

the typed-ast project was marked end of life since July 2023, and
not maintained anymore. since we build the document using readthedocs'
service, and in .readtherdocs.yml we use python 3.9, which comes with
ast module included by its standard library.

the typed-ast dependency was originally added in 30d41597, but now that
we are using python 3.9, there is no need to use this module anymore.

Signed-off-by: Kefu Chai <tchaikov@gmail.com>
4 months agodoc/dev/config: Document how to use :confval: directive for config options 64160/head
Kefu Chai [Wed, 25 Jun 2025 03:50:24 +0000 (11:50 +0800)]
doc/dev/config: Document how to use :confval: directive for config options

Add comprehensive guide for documenting configuration options using the
:confval: directive, including naming conventions and cross-referencing.

Previously, the documentation lacked guidance on using the :confval:
directive and the important distinction between regular config options
and mgr module options (which require the mgr/<module>/ namespace
prefix). This change provides detailed examples and best practices for
properly documenting and referencing both types of configuration options.

Signed-off-by: Kefu Chai <tchaikov@gmail.com>
4 months agoMerge PR #64153 into main
Patrick Donnelly [Wed, 25 Jun 2025 01:06:14 +0000 (21:06 -0400)]
Merge PR #64153 into main

* refs/pull/64153/head:
Revert "Merge pull request #64111 from zdover23/wip-doc-2025-06-23-tr71796-rados-ops-balancer"

Reviewed-by: Zac Dover <zac.dover@gmail.com>
Reviewed-by: Greg Farnum <gfarnum@redhat.com>
4 months agoRevert "Merge pull request #64111 from zdover23/wip-doc-2025-06-23-tr71796-rados... 64153/head
Patrick Donnelly [Tue, 24 Jun 2025 18:52:09 +0000 (14:52 -0400)]
Revert "Merge pull request #64111 from zdover23/wip-doc-2025-06-23-tr71796-rados-ops-balancer"

This reverts commit f2b7c0465ea2727537ea58b44b2f17c6dda7d556, reversing
changes made to e5868d88dd3ada55a7cacd7a5faec5aa9bc96aff.

Signed-off-by: Patrick Donnelly <pdonnell@ibm.com>
4 months agoMerge pull request #64107 from zdover23/wip-doc-2025-06-23-tr71789-radosgw-notifications
Zac Dover [Tue, 24 Jun 2025 18:51:11 +0000 (04:51 +1000)]
Merge pull request #64107 from zdover23/wip-doc-2025-06-23-tr71789-radosgw-notifications

doc/radosgw: remove "pubsub_event_triggered"

Reviewed-by: Yuval Lifshitz <ylifshit@ibm.com>
4 months agoMerge pull request #64137 from tchaikov/wip-rbd-with-krbd
Ilya Dryomov [Tue, 24 Jun 2025 18:37:34 +0000 (20:37 +0200)]
Merge pull request #64137 from tchaikov/wip-rbd-with-krbd

rbd: fix unused function warning when WITH_KRBD is disabled

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
4 months agodoc/radosgw: remove "pubsub_event_triggered" 64107/head
Zac Dover [Mon, 23 Jun 2025 08:07:40 +0000 (18:07 +1000)]
doc/radosgw: remove "pubsub_event_triggered"

Remove "pubsub_event_triggered" from the list of "Notification
Performance Statistics". It is obsolete.

Fixes: https://tracker.ceph.com/issues/71789
Signed-off-by: Zac Dover <zac.dover@proton.me>
4 months agoMerge pull request #64111 from zdover23/wip-doc-2025-06-23-tr71796-rados-ops-balancer
Zac Dover [Tue, 24 Jun 2025 18:09:16 +0000 (04:09 +1000)]
Merge pull request #64111 from zdover23/wip-doc-2025-06-23-tr71796-rados-ops-balancer

doc/rados: clarify "upmap_max_deviation"

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
4 months agoMerge pull request #64105 from zdover23/wip-doc-2025-06-23-tr71791-radosgw-notifications
Zac Dover [Tue, 24 Jun 2025 17:56:49 +0000 (03:56 +1000)]
Merge pull request #64105 from zdover23/wip-doc-2025-06-23-tr71791-radosgw-notifications

doc/radosgw: add "persistent_topic_len"

Reviewed-by: Yuval Lifshitz <ylifshit@ibm.com>
4 months agodoc/radosgw: add "persistent_topic_len" 64105/head
Zac Dover [Mon, 23 Jun 2025 08:26:09 +0000 (18:26 +1000)]
doc/radosgw: add "persistent_topic_len"

Add the labeled counter "persistent_topic_len" to the list of
"Notification Performance Statistics" in doc/radosgw/notifications.rst.

Fixes: https://tracker.ceph.com/issues/71791
Signed-off-by: Zac Dover <zac.dover@proton.me>
4 months agoMerge pull request #64106 from zdover23/wip-doc-2025-06-23-tr71792-radosgw-notifications
Zac Dover [Tue, 24 Jun 2025 15:42:10 +0000 (01:42 +1000)]
Merge pull request #64106 from zdover23/wip-doc-2025-06-23-tr71792-radosgw-notifications

doc/radosgw: add "persistent_topic_size"

Reviewed-by: Yuval Lifshitz <ylifshit@ibm.com>
4 months agorbd: fix unused function warning when WITH_KRBD is disabled 64137/head
Kefu Chai [Tue, 24 Jun 2025 14:38:13 +0000 (22:38 +0800)]
rbd: fix unused function warning when WITH_KRBD is disabled

Guard print_error_description() and get_unsupported_features() with
`#ifdef WITH_KRBD` to prevent compiler warnings when KRBD support is
not enabled.

These functions are only called by do_kernel_map(), which is itself
conditionally compiled. When WITH_KRBD is not defined, the compiler
generates unused function warnings for these helper functions.

Fixes warning:

```
/home/kefu/dev/ceph/src/tools/rbd/action/Kernel.cc:305:13: warning: ‘void rbd::action::kernel::print_error_description(const char*, const char*, const char*, const char*, int)’ defined but not used [-Wunused-function]
  305 | static void print_error_description(const char *poolname,
      |             ^~~~~~~~~~~~~~~~~~~~~~~
```

this silences the warning.

Signed-off-by: Kefu Chai <tchaikov@gmail.com>
4 months agoMerge PR #62675 into main
Venky Shankar [Tue, 24 Jun 2025 11:08:48 +0000 (16:38 +0530)]
Merge PR #62675 into main

* refs/pull/62675/head:
test/mds: Fix catch by value

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Kefu Chai <tchaikov@gmail.com>
Reviewed-by: Patrick Donnelly <pdonnell@ibm.com>
4 months agocrimson/common/smp_helpers: cleanup invoke_on_all_seq 63523/head
Matan Breizman [Tue, 24 Jun 2025 10:25:36 +0000 (10:25 +0000)]
crimson/common/smp_helpers: cleanup invoke_on_all_seq

Signed-off-by: Matan Breizman <mbreizma@redhat.com>
4 months agoMerge pull request #64063 from tchaikov/wip-ec-cache-refactor
Alex Ainscow [Tue, 24 Jun 2025 10:04:15 +0000 (11:04 +0100)]
Merge pull request #64063 from tchaikov/wip-ec-cache-refactor

osd/ECExtentCache: cleanup and optimization

4 months agoMerge pull request #60797 from rhcs-dashboard/carbon-icons-subtask
Pedro Gonzalez Gomez [Tue, 24 Jun 2025 09:55:28 +0000 (11:55 +0200)]
Merge pull request #60797 from rhcs-dashboard/carbon-icons-subtask

mgr/dashboard: Carbonize Icons for Modules (Block, Object, File, Observability, Administration)

Reviewed-by: Afreen Misbah <afreen@ibm.com>
Reviewed-by: Pedro Gonzalez <pegonzal@ibm.com>
4 months agoMerge pull request #63963 from afreen23/wip-label-hosts
afreen23 [Tue, 24 Jun 2025 08:30:43 +0000 (14:00 +0530)]
Merge pull request #63963 from afreen23/wip-label-hosts

mgr/dashboard: Allow host with labels in listener form

Reviewed-by: Aashish Sharma <aasharma@redhat.com>
Reviewed-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>
4 months agoMerge pull request #63961 from afreen23/wip-force-listener
afreen23 [Tue, 24 Jun 2025 08:30:12 +0000 (14:00 +0530)]
Merge pull request #63961 from afreen23/wip-force-listener

mgr/dashboard: Add --force flag for listeners

Reviewed-by: Aashish Sharma <aasharma@redhat.com>
Reviewed-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>
4 months agoMerge pull request #64088 from InstelligenceIO/patch-2
Zac Dover [Tue, 24 Jun 2025 07:07:12 +0000 (17:07 +1000)]
Merge pull request #64088 from InstelligenceIO/patch-2

[Docs]: Corrected spelling mistake

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
4 months agodoc/rados: clarify "upmap_max_deviation" 64111/head
Zac Dover [Mon, 23 Jun 2025 12:50:03 +0000 (22:50 +1000)]
doc/rados: clarify "upmap_max_deviation"

Clarify the threshold set by "upmap_max_deviation" and add the
information about this configurable that is currently in
src/pybind/mgr/balancer/module.py to src/common/options/global.yaml.in,
so that it will be accessible by means of ".. confval::" declarations.

Fixes: https://tracker.ceph.com/issues/71796
Signed-off-by: Zac Dover <zac.dover@proton.me>
4 months agoMerge pull request #64104 from zdover23/wip-doc-2025-06-23-tr71790-radosgw-notifications
Zac Dover [Tue, 24 Jun 2025 04:46:11 +0000 (14:46 +1000)]
Merge pull request #64104 from zdover23/wip-doc-2025-06-23-tr71790-radosgw-notifications

doc/radosgw: remove "pubsub_event_lost"

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
4 months agoceph.spec.in: use rpm macro for python shebang pathfix 64123/head
John Mulligan [Thu, 13 Feb 2025 20:59:42 +0000 (15:59 -0500)]
ceph.spec.in: use rpm macro for python shebang pathfix

To support EL 10 distros, update the source of the pathfix tool (on EL
9+ distros) and use the macro for updating python shebangs that has been
available since at least EL 9.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
4 months agoMerge pull request #64072 from phlogistonjohn/jjm-wip-el10
Dan Mick [Mon, 23 Jun 2025 16:42:44 +0000 (09:42 -0700)]
Merge pull request #64072 from phlogistonjohn/jjm-wip-el10

script/build-with-container.: fixes and improvments towards el10

4 months agoMerge pull request #63883 from leonidc/fix_upgrade_gw_lost
SrinivasaBharathKanta [Mon, 23 Jun 2025 16:12:40 +0000 (21:42 +0530)]
Merge pull request #63883 from leonidc/fix_upgrade_gw_lost

nvmeofgw: fix sending acks during upgrade

4 months agoMerge pull request #63796 from tchaikov/wip-memstore-silence-warning
Kefu Chai [Mon, 23 Jun 2025 14:07:25 +0000 (22:07 +0800)]
Merge pull request #63796 from tchaikov/wip-memstore-silence-warning

cmake: suppress -Wmaybe-uninitialized warning in memstore PageSet

Reviewed-by: Matan Breizman <mbreizma@redhat.com>
4 months agoMerge pull request #64075 from nh2/docs-balancer-variable-name
Zac Dover [Mon, 23 Jun 2025 14:02:25 +0000 (00:02 +1000)]
Merge pull request #64075 from nh2/docs-balancer-variable-name

doc/rados/operations: Actually mention `upmap_max_deviation` setting name

Reviewed-by: Zac Dover <zac.dover@proton.me>
4 months agodoc/radosgw: add "persistent_topic_size" 64106/head
Zac Dover [Mon, 23 Jun 2025 08:35:05 +0000 (18:35 +1000)]
doc/radosgw: add "persistent_topic_size"

Add "persistent_topic_size" to the list of "Notification Performance
Statistics" in doc/radosgw/notifications.rst.

Fixes: https://tracker.ceph.com/issues/71792
Signed-off-by: Zac Dover <zac.dover@proton.me>
4 months agoMerge pull request #64071 from tchaikov/wip-static_ptr-alignas
Casey Bodley [Mon, 23 Jun 2025 13:12:33 +0000 (09:12 -0400)]
Merge pull request #64071 from tchaikov/wip-static_ptr-alignas

common/static_ptr: pass an integer to alignas to fix GCC-11 build fai…

Reviewed-by: Casey Bodley <cbodley@redhat.com>
4 months agoMerge pull request #64103 from zdover23/wip-doc-2025-06-23-tr71793-radosgw-notifications
Zac Dover [Mon, 23 Jun 2025 13:02:42 +0000 (23:02 +1000)]
Merge pull request #64103 from zdover23/wip-doc-2025-06-23-tr71793-radosgw-notifications

doc/radosgw: improve "pubsub_push_pending" info

Reviewed-by: Yuval Lifshitz <ylifshit@ibm.com>
4 months agoMerge pull request #63398 from NitzanMordhai/wip-nitzan-calc-pool-availability-valgri...
NitzanMordhai [Mon, 23 Jun 2025 12:20:14 +0000 (15:20 +0300)]
Merge pull request #63398 from NitzanMordhai/wip-nitzan-calc-pool-availability-valgrind-invalid-read

src/mon/MgrStatMonitor: fix invalid iterator increment in calc_pool_availability()

4 months agodoc/rados/operations: Actually mention `upmap_max_deviation` setting name 64075/head
Niklas Hambüchen [Sat, 21 Jun 2025 17:53:34 +0000 (19:53 +0200)]
doc/rados/operations: Actually mention `upmap_max_deviation` setting name

Signed-off-by: Niklas Hambüchen <mail@nh2.me>
4 months agodoc: Fixed a spelling error. 64088/head
Stellios Williams [Mon, 23 Jun 2025 10:54:31 +0000 (20:54 +1000)]
doc: Fixed a spelling error.

Corrected communicte to communicate

Signed-off-by: Instelligence.io <github@instelligence.io>
4 months agocrimson/common/smp_helpers: cleanup sharded_map_seq
Matan Breizman [Mon, 23 Jun 2025 10:18:23 +0000 (10:18 +0000)]
crimson/common/smp_helpers: cleanup sharded_map_seq

Let PGShardManager::invoke_on_each_shard_seq pass the local shard_services
instance instead of using an additional helper.

The downside of dropping the generic sharded_map_seq helper is that it is
able to support *any* (seastar::)sharded object. However, as shard_services
is the only user of it - directly using the local instance without the
helper seems easier to read.

Signed-off-by: Matan Breizman <mbreizma@redhat.com>
4 months agocrimson/common/smp_helpers: rename reactor_map_seq
Matan Breizman [Mon, 23 Jun 2025 10:02:37 +0000 (10:02 +0000)]
crimson/common/smp_helpers: rename reactor_map_seq

Signed-off-by: Matan Breizman <mbreizma@redhat.com>
4 months agocrimson/common/smp_helpers: fix reactor_map_seq
Matan Breizman [Sun, 22 Jun 2025 10:10:10 +0000 (10:10 +0000)]
crimson/common/smp_helpers: fix reactor_map_seq

Copy f into reactor_map_seq which would be kept alive
due to this method being a coroutine. That way, we can ensure
the lambdas passed to each core that are capturing f by
reference would be safe.
Alternatively, we can also copy f by using it's copy ctor and
pass a copy to each shard:
co_await crimson::submit_to(core, F(f))
However, avoiding the copy is possible here due to the sequential
traversal. Note, seastar's invoke_on_all do copy each callback to
every shard and is running the invocation in parallel.

The above would have fixed f's captures to be invalid and result
in a segfaults on diffrent shards.

Fixes: https://tracker.ceph.com/issues/71457
Signed-off-by: Matan Breizman <mbreizma@redhat.com>
4 months agocrimson/osd/shard_services: PerShardState::dump_ops_in_flight debug
Matan Breizman [Wed, 18 Jun 2025 10:57:37 +0000 (10:57 +0000)]
crimson/osd/shard_services: PerShardState::dump_ops_in_flight debug

Signed-off-by: Matan Breizman <mbreizma@redhat.com>
4 months agocrimson/admin/osd_admin: move to coroutines
Matan Breizman [Tue, 17 Jun 2025 13:23:26 +0000 (13:23 +0000)]
crimson/admin/osd_admin: move to coroutines

Signed-off-by: Matan Breizman <mbreizma@redhat.com>
4 months agocrimson/admin/osd_admin: add logs when calling hooks
Matan Breizman [Thu, 29 May 2025 09:18:32 +0000 (09:18 +0000)]
crimson/admin/osd_admin: add logs when calling hooks

Signed-off-by: Matan Breizman <mbreizma@redhat.com>
4 months agocrimson/admin/admin_socket: add logs, move to ceph_assert
Matan Breizman [Thu, 29 May 2025 09:18:01 +0000 (09:18 +0000)]
crimson/admin/admin_socket: add logs, move to ceph_assert

Signed-off-by: Matan Breizman <mbreizma@redhat.com>