git-server-git.apps.pok.os.sepia.ceph.com Git - ceph.git/log
ceph.git
2 days ago mgr/cephadm: Fixed test_scheduling.py 68438/head
Shweta Bhosale [Fri, 17 Apr 2026 09:30:20 +0000 (15:00 +0530)]
mgr/cephadm: Fixed test_scheduling.py
Fixes: https://tracker.ceph.com/issues/76079
Signed-off-by: Shweta Bhosale <Shweta.Bhosale1@ibm.com>
2 days ago Merge pull request #68392 from rkachach/fix_issue_cepahdm_qa_task
Redouane Kachach [Fri, 17 Apr 2026 08:50:46 +0000 (10:50 +0200)]
Merge pull request #68392 from rkachach/fix_issue_cepahdm_qa_task

qa: fix misleading "in cluster log" failures during cluster log scan

Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: Zack Cerza <zack@redhat.com>
Reviewed-by: Yuri Weinstein <yweins@redhat.com>
2 days ago Merge pull request #68297 from tchaikov/wip-feedback-without-tracker
Kefu Chai [Fri, 17 Apr 2026 05:34:51 +0000 (13:34 +0800)]
Merge pull request #68297 from tchaikov/wip-feedback-without-tracker

mgr/feedback: fix flaky test_issue_tracker_create_with_invalid_key

Reviewed-by: Nizamudeen A <nia@redhat.com>
Reviewed-by: Afreen Misbah <afreen@ibm.com>
3 days ago Merge PR #68245 into main
Patrick Donnelly [Thu, 16 Apr 2026 23:58:06 +0000 (19:58 -0400)]
Merge PR #68245 into main

* refs/pull/68245/head:
mon/MonClient: check stopping for auth request handling

Reviewed-by: Brad Hubbard <bhubbard@redhat.com>
3 days ago Merge pull request #68289 from cbodley/wip-75945
Casey Bodley [Thu, 16 Apr 2026 20:18:38 +0000 (16:18 -0400)]
Merge pull request #68289 from cbodley/wip-75945

qa/valgrind: generalize suppressions for gcc-14 MismatchedFree

Reviewed-by: Adam C. Emerson <aemerson@redhat.com>
Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
3 days ago Merge pull request #68113 from benhanokh/dedup_split_head_with_tail_objects
Gabriel Benhanokh [Thu, 16 Apr 2026 18:01:29 +0000 (21:01 +0300)]
Merge pull request #68113 from benhanokh/dedup_split_head_with_tail_objects

rgw/dedup: split-head for objects with tails

3 days ago Merge PR #67823 into main
Patrick Donnelly [Thu, 16 Apr 2026 16:29:26 +0000 (12:29 -0400)]
Merge PR #67823 into main

* refs/pull/67823/head:
qa: remove unused qa_scripts

Reviewed-by: Ilya Dryomov <idryomov@redhat.com>
Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
3 days ago Merge PR #67822 into main
Patrick Donnelly [Thu, 16 Apr 2026 16:28:52 +0000 (12:28 -0400)]
Merge PR #67822 into main

* refs/pull/67822/head:
qa: remove vestiges of ceph-deploy
doc: remove references to ceph-deploy

Reviewed-by: Anthony D Atri <anthony.datri@gmail.com>
3 days ago Merge pull request #67493 from kginonredhat/Bug-56660-Haproxy-error-for-rgw-service...
Redouane Kachach [Thu, 16 Apr 2026 14:21:49 +0000 (16:21 +0200)]
Merge pull request #67493 from kginonredhat/Bug-56660-Haproxy-error-for-rgw-service-with-ipv6

added code to fix failure on Haproxy error for rgw service with ipv6

Reviewed-by: Redouane Kachach <rkachach@ibm.com>
3 days ago Merge pull request #66257 from ShwetaBhosale1/fix_issue_73851_cephadm_crashes_when_ga...
Redouane Kachach [Thu, 16 Apr 2026 13:48:08 +0000 (15:48 +0200)]
Merge pull request #66257 from ShwetaBhosale1/fix_issue_73851_cephadm_crashes_when_ganesha-rados-grace_fails

mgr/cephadm: Handle ganesha-rados-grace tool failure

Reviewed-by: Redouane Kachach <rkachach@ibm.com>
Reviewed-by: Adam King <adking@redhat.com>
3 days ago Merge pull request #66313 from ShwetaBhosale1/fix_issue_73912_prometheus_cannot_acces...
Redouane Kachach [Thu, 16 Apr 2026 13:47:00 +0000 (15:47 +0200)]
Merge pull request #66313 from ShwetaBhosale1/fix_issue_73912_prometheus_cannot_access_nfs_metrics_endpoints

mgr/cephadm: Allow NFS monitoring port through firewall

Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: Redouane Kachach <rkachach@ibm.com>
3 days ago Merge pull request #66381 from ShwetaBhosale1/fix_issue_73949_nfs_with_keepalived_only
Redouane Kachach [Thu, 16 Apr 2026 13:45:58 +0000 (15:45 +0200)]
Merge pull request #66381 from ShwetaBhosale1/fix_issue_73949_nfs_with_keepalived_only

mgr/cephadm: Fix NFS to work properly in keepalived-only ingress mode

Reviewed-by: Redouane Kachach <rkachach@ibm.com>
Reviewed-by: Adam King <adking@redhat.com>
3 days ago Merge pull request #68391 from ifed01/wip-ifed-fix-fcm
Igor Fedotov [Thu, 16 Apr 2026 13:19:12 +0000 (16:19 +0300)]
Merge pull request #68391 from ifed01/wip-ifed-fix-fcm

extblkdev/fcm: do not abort on multi-device volume before we discover…

Reviewed-by: Adam Kupczyk <akupczyk@ibm.com>
3 days ago mon/MonClient: check stopping for auth request handling 68245/head
Patrick Donnelly [Tue, 31 Mar 2026 13:10:08 +0000 (18:40 +0530)]
mon/MonClient: check stopping for auth request handling

When the MonClient is shutting down, it is no longer safe to
access MonClient::auth and other members. The AuthClient
methods should be checking the stopping flag in this case.
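
The guard described above can be illustrated with a minimal Python
sketch. It is not the MonClient code: the class, the lock, the field
names and the -1 return value are all hypothetical stand-ins chosen
only to show a handler refusing to touch auth state once shutdown has
begun.

```python
import threading


class AuthClientSketch:
    """Hypothetical stand-in for a client whose auth state is torn down on shutdown."""

    def __init__(self):
        self._lock = threading.Lock()
        self._stopping = False
        self._auth = {"global_id": 1}  # hypothetical auth state

    def shutdown(self):
        # Set the flag and release the auth state under the same lock
        # that the handlers take.
        with self._lock:
            self._stopping = True
            self._auth = None

    def handle_auth_done(self, payload):
        with self._lock:
            if self._stopping:
                # Auth state may already be gone; bail out instead of
                # dereferencing it. -1 is a placeholder error value.
                return -1
            self._auth["last_payload"] = payload
            return 0
```

Without the `_stopping` check, a late handler call after `shutdown()`
would index into `None`, which is the Python analogue of the
use-after-teardown seen in the backtrace.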

The key bit from the segfault backtrace (thanks Brad Hubbard!) is here:

#13 0x00007f921ee23c40 in ProtocolV2::handle_auth_done (this=0x7f91cc0945f0, payload=...) at /usr/include/c++/12/bits/shared_ptr_base.h:1665
#14 0x00007f921ee16a29 in ProtocolV2::run_continuation (this=0x7f91cc0945f0, continuation=...) at msg/./src/msg/async/ProtocolV2.cc:54
#15 0x00007f921edee56e in std::function<void (char*, long)>::operator()(char*, long) const (__args#1=0, __args#0=<optimized out>, this=0x7f91cc0744d8) at /usr/include/c++/12/bits/std_function.h:591
#16 AsyncConnection::process (this=0x7f91cc074140) at msg/./src/msg/async/AsyncConnection.cc:485
#17 0x00007f921ee3300c in EventCenter::process_events (this=0x55efc9d0a058, timeout_microseconds=<optimized out>, working_dur=0x7f921a891d88) at msg/./src/msg/async/Event.cc:465
#18 0x00007f921ee38bf9 in operator() (__closure=<optimized out>) at msg/./src/msg/async/Stack.cc:50
#19 std::__invoke_impl<void, NetworkStack::add_thread(Worker*)::<lambda()>&> (__f=...) at /usr/include/c++/12/bits/invoke.h:61
#20 std::__invoke_r<void, NetworkStack::add_thread(Worker*)::<lambda()>&> (__fn=...) at /usr/include/c++/12/bits/invoke.h:111
#21 std::_Function_handler<void(), NetworkStack::add_thread(Worker*)::<lambda()> >::_M_invoke(const std::_Any_data &) (__functor=...) at /usr/include/c++/12/bits/std_function.h:290
#22 0x00007f921e81f253 in std::execute_native_thread_routine (__p=0x55efc9e9c5f0) at ../../../../../src/libstdc++-v3/src/c++11/thread.cc:82
#23 0x00007f921f5e8ac3 in start_thread (arg=<optimized out>) at ./nptl/pthread_create.c:442
#24 0x00007f921f67a8d0 in clone3 () at ../sysdeps/unix/sysv/linux/x86_64/clone3.S:81

I originally thought this might be the issue causing [1]; however, that
turned out to be caused by OpenSSL's use of atexit handlers.

I still think there is a bug here so I am continuing with this change.

[1] https://tracker.ceph.com/issues/59335

Fixes: https://tracker.ceph.com/issues/76017
Signed-off-by: Patrick Donnelly <pdonnell@ibm.com>
3 days ago Merge pull request #68205 from idryomov/wip-transient-policy-release-note
Ilya Dryomov [Thu, 16 Apr 2026 09:00:58 +0000 (11:00 +0200)]
Merge pull request #68205 from idryomov/wip-transient-policy-release-note

doc: add RBD_LOCK_MODE_EXCLUSIVE_TRANSIENT release note

Reviewed-by: Miki Patel <miki.patel132@gmail.com>
3 days ago rgw/dedup: This PR extends the RGW dedup split-head feature to support objects that... 68113/head
benhanokh [Mon, 30 Mar 2026 08:22:51 +0000 (11:22 +0300)]
rgw/dedup: This PR extends the RGW dedup split-head feature to support objects that already have tail RADOS objects (i.e. objects larger than the head chunk size).
Previously, split-head was restricted to objects whose entire data fit in the head (≤4 MiB).
It also migrates the split-head manifest representation from the legacy explicit-objs format to the prefix+index rules-based format.

Refactored should_split_head():
Now performs a larger set of eligibility checks:
 * d_split_head flag is set
 * single-part object only
 * non-empty head
 * not a legacy manifest
 * not an Alibaba Cloud OSS AppendObject

Explicit skips for unsupported manifest types:
 * old-style explicit-objs manifests
 * OSS AppendObject manifests (detected via non-empty override_prefix)
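
As a rough illustration of the eligibility checks listed above, the
predicate can be sketched in Python. This is not the RGW code: the
ObjInfo fields are hypothetical names invented for the example, and
only the order and meaning of the checks follow the description.

```python
from dataclasses import dataclass


@dataclass
class ObjInfo:
    # All field names are hypothetical; they model the properties
    # named in the commit message, not real RGW structures.
    is_multipart: bool
    head_size: int
    explicit_objs: bool      # old-style explicit-objs manifest
    override_prefix: str     # non-empty for OSS AppendObject manifests


def should_split_head(split_head_enabled: bool, obj: ObjInfo) -> bool:
    """Return True only if every eligibility check passes."""
    if not split_head_enabled:      # split-head flag is set
        return False
    if obj.is_multipart:            # single-part object only
        return False
    if obj.head_size == 0:          # non-empty head
        return False
    if obj.explicit_objs:           # not a legacy manifest
        return False
    if obj.override_prefix:         # not an OSS AppendObject
        return False
    return True
```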

New config option: rgw_dedup_split_obj_head:
  Default is true (split-head enabled).
  Setting to false disables split-head entirely.

Tail object lookup via manifest iterator:
  Replaces the old get_tail_ioctx() which manually constructed the tail OID via generate_split_head_tail_name().
  The new function simply calls manifest.obj_begin() and resolves the first tail object location through the standard manifest iterator.

Stats cleanup:
Removed the "Potential Dedup" stats section (small_objs_stat, dup_head_bytes, dup_head_bytes_estimate, ingress_skip_too_small_64KB*)
 which tracked 64KB–4MB objects as potential-but-skipped candidates.
 Since split-head now covers all sizes, this distinction is no longer meaningful. calc_deduped_bytes() is simplified accordingly.

Signed-off-by: benhanokh <gbenhano@redhat.com>
3 days ago Merge pull request #68383 from amathuria/wip-amat-fix-bad-statemachine-event
Matan Breizman [Thu, 16 Apr 2026 08:14:11 +0000 (11:14 +0300)]
Merge pull request #68383 from amathuria/wip-amat-fix-bad-statemachine-event

crimson/osd: fix race between AllReplicasRecovered and DeferRecovery

Reviewed-by: Matan Breizman <mbreizma@redhat.com>
3 days ago qa: fix misleading "in cluster log" failures during cluster log scan 68392/head
Redouane Kachach [Wed, 15 Apr 2026 16:05:36 +0000 (18:05 +0200)]
qa: fix misleading "in cluster log" failures during cluster log scan

Summary:

Fix misleading failure reasons reported as `"… in cluster log"` when
no such log entry actually exists.

The cephadm task currently treats `grep` errors from the cluster log
scan as if they were actual log matches. This can produce bogus
failure summaries when `ceph.log` is missing, especially after early
failures such as image pull or bootstrap problems.

Problem:

first_in_ceph_log() currently:

- returns stdout if a match is found
- otherwise returns stderr

The caller then treats any non-None value as a real cluster log hit and formats it as:

"<value>" in cluster log

That means an error like:

  grep: /var/log/ceph/<fsid>/ceph.log: No such file or directory

can be misreported as if it came from the cluster log.

This change makes cluster log scanning robust and accurate by:

- checking whether /var/log/ceph/<fsid>/ceph.log exists before scanning
- using check_status=False for the grep pipeline
- treating only stdout as a real log match
- treating stderr as a scan error instead of log content
- avoiding overwrite of a more accurate pre-existing failure_reason
- reporting scan failures separately as cluster log scan failed
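
The behaviour listed above can be sketched as follows. This is a
simplified, local illustration and not the cephadm task code: it reads
a file directly with Python instead of running a remote grep pipeline,
and both function names and the summary dict are assumptions made for
the example.

```python
import os
import re


def first_in_cluster_log(path, pattern):
    """Return (match, error); at most one of the two is not None."""
    # Check that the log exists before scanning it.
    if not os.path.exists(path):
        return None, f"{path} does not exist"
    try:
        with open(path, errors="replace") as f:
            for line in f:
                if re.search(pattern, line):
                    # Only real log content counts as a match.
                    return line.rstrip("\n"), None
    except OSError as e:
        # A read problem is a scan error, never log content.
        return None, str(e)
    return None, None


def update_failure_reason(summary, match, error):
    """Record a failure reason without clobbering an earlier one."""
    if summary.get("failure_reason"):
        return summary
    if match is not None:
        summary["failure_reason"] = f'"{match}" in cluster log'
    elif error is not None:
        summary["failure_reason"] = f"cluster log scan failed: {error}"
    return summary
```

Keeping the match and the error in separate return slots is what
prevents a message such as "No such file or directory" from being
formatted as if it had been found in the cluster log.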

Fixes: https://tracker.ceph.com/issues/76051
Signed-off-by: Redouane Kachach <rkachach@ibm.com>
3 days ago Merge pull request #68154 from leiwen2025/rv64-crc32c
Kefu Chai [Thu, 16 Apr 2026 06:55:48 +0000 (14:55 +0800)]
Merge pull request #68154 from leiwen2025/rv64-crc32c

src/common: optimize crc32c using zbc extension for riscv64

Reviewed-by: Kefu Chai <k.chai@proxmox.com>
3 days ago Merge pull request #68301 from rhcs-dashboard/inline-edit-emitter
Nizamudeen A [Thu, 16 Apr 2026 05:59:12 +0000 (11:29 +0530)]
Merge pull request #68301 from rhcs-dashboard/inline-edit-emitter

mgr/dashboard: table cell inline edit emit editing state

Reviewed-by: Aashish Sharma <aasharma@redhat.com>
3 days ago Merge pull request #68282 from tchaikov/wip-mgr-module-neg-exit-code
Kefu Chai [Thu, 16 Apr 2026 05:13:05 +0000 (13:13 +0800)]
Merge pull request #68282 from tchaikov/wip-mgr-module-neg-exit-code

mgr/crash, mgr/status: return negative errno to fix CLI exit code

Reviewed-by: Dan Mick <dmick@ibm.com>
3 days ago Merge pull request #68283 from tchaikov/wip-ceph-crash-less-noisy
Kefu Chai [Thu, 16 Apr 2026 05:11:59 +0000 (13:11 +0800)]
Merge pull request #68283 from tchaikov/wip-ceph-crash-less-noisy

ceph-crash: reduce log noise from auth fallback in post_crash()

Reviewed-by: Dan Mick <dmick@ibm.com>
3 days ago Merge pull request #68181 from cloudbehl/subvolume-fixes-for-double-values
Aashish Sharma [Thu, 16 Apr 2026 05:00:07 +0000 (10:30 +0530)]
Merge pull request #68181 from cloudbehl/subvolume-fixes-for-double-values

Fixes for subvolume overview in grafana

Reviewed-by: Aashish Sharma <aasharma@redhat.com>
3 days ago Merge pull request #68403 from phlogistonjohn/jjm-codeowners-build-sig
Dan Mick [Thu, 16 Apr 2026 03:45:49 +0000 (20:45 -0700)]
Merge pull request #68403 from phlogistonjohn/jjm-codeowners-build-sig

CODEOWNERS: add a build-sig group for various build / test files

4 days ago CODEOWNERS: add a build-sig group for various build / test files 68403/head
John Mulligan [Wed, 15 Apr 2026 21:15:03 +0000 (17:15 -0400)]
CODEOWNERS: add a build-sig group for various build / test files

Add a new build-sig group that covers some of the high level tools and
scripts used in the build and CI processes. This should help PRs not
pass by without notifying people who care about these things.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
4 days ago Revert "vstart: disable extblkdev plugins for file-backed OSDs" 68391/head
Igor Fedotov [Wed, 15 Apr 2026 18:42:46 +0000 (21:42 +0300)]
Revert "vstart: disable extblkdev plugins for file-backed OSDs"

This reverts commit 92e902ecfe2cfed217136dc64e47500ec50f9c07.

Signed-off-by: Igor Fedotov <igor.fedotov@croit.io>
4 days ago extblkdev/fcm: do not abort on multi-device volume before we discovered it's FCM one
Igor Fedotov [Wed, 15 Apr 2026 15:46:56 +0000 (18:46 +0300)]
extblkdev/fcm: do not abort on multi-device volume before we discovered it's FCM one

https://tracker.ceph.com/issues/75819

Signed-off-by: Igor Fedotov <igor.fedotov@croit.io>
4 days ago qa/valgrind: generalize and group MismatchedFree suppressions 68289/head
Casey Bodley [Tue, 14 Apr 2026 18:50:04 +0000 (14:50 -0400)]
qa/valgrind: generalize and group MismatchedFree suppressions

combine the various MismatchedFree suppressions into unconditional ones
for each function

Signed-off-by: Casey Bodley <cbodley@redhat.com>
4 days ago Fixes for subvolume overview in grafana 68181/head
Ankush Behl [Thu, 2 Apr 2026 11:00:09 +0000 (16:30 +0530)]
Fixes for subvolume overview in grafana
- multiple values were shown in graph and single state
- Remove All selection from subvolume path

fixes: https://tracker.ceph.com/issues/75849

Signed-off-by: Ankush Behl <cloudbehl@gmail.com>
4 days ago Merge pull request #68355 from nhoad/pretty-format-docs
Casey Bodley [Wed, 15 Apr 2026 13:29:08 +0000 (09:29 -0400)]
Merge pull request #68355 from nhoad/pretty-format-docs

rgw: Add documentation for the --pretty-format option

Reviewed-by: Casey Bodley <cbodley@redhat.com>
4 days ago Merge pull request #68062 from anthonyeleven/global.yaml.in
Anthony D'Atri [Wed, 15 Apr 2026 13:22:37 +0000 (09:22 -0400)]
Merge pull request #68062 from anthonyeleven/global.yaml.in

src/common/options: Modernize language in global.yaml.in

4 days ago Merge pull request #65986 from HeinleinSupport/wip-cephadm-72696
Guillaume Abrioux [Wed, 15 Apr 2026 13:18:26 +0000 (15:18 +0200)]
Merge pull request #65986 from HeinleinSupport/wip-cephadm-72696

mgr/cephadm: renames ceph_device to ceph_device_lvm

4 days ago Merge PR #67032 into main
Patrick Donnelly [Wed, 15 Apr 2026 13:11:34 +0000 (09:11 -0400)]
Merge PR #67032 into main

* refs/pull/67032/head:
qa: add trivial cephfs-tool bench test
doc/cephfs: add cephfs-tool documentation
tools/cephfs: add new cephfs-tool

Reviewed-by: Venky Shankar <vshankar@redhat.com>
4 days ago mgr/dashboard: table cell inline edit emit editing state 68301/head
Nizamudeen A [Fri, 10 Apr 2026 06:25:27 +0000 (11:55 +0530)]
mgr/dashboard: table cell inline edit emit editing state

- Emit the editing state so that the consuming component can manipulate
that to add some extra validations

- Replace button with cds-icon-button.
- Replace submit button with tertiary instead of ghost for visibility.
- Also added a cancel button to cancel the ongoing edit

Fixes: https://tracker.ceph.com/issues/75949
Signed-off-by: Nizamudeen A <nia@redhat.com>
4 days ago Merge pull request #68322 from anthonyeleven/percent
Anthony D'Atri [Wed, 15 Apr 2026 12:07:45 +0000 (08:07 -0400)]
Merge pull request #68322 from anthonyeleven/percent

doc/rados/configuration: Update bluestore-config-ref.rst WAL+DB sizing

4 days ago crimson/osd: fix race between AllReplicasRecovered and DeferRecovery 68383/head
Aishwarya Mathuria [Tue, 14 Apr 2026 07:59:36 +0000 (13:29 +0530)]
crimson/osd: fix race between AllReplicasRecovered and DeferRecovery

Fixes a crash where AllReplicasRecovered event arrives at NotRecovering
state due to async event delivery race with DeferRecovery preemption.

The issue occurs when:
1. Recovery completes and AllReplicasRecovered is queued asynchronously
2. A higher priority operation (e.g., client I/O) triggers AsyncReserver
   to preempt recovery, posting DeferRecovery event
3. DeferRecovery is processed first, transitioning PG to NotRecovering
4. AllReplicasRecovered arrives at wrong state → crash with "bad state
   machine event" because NotRecovering doesn't handle it

The fix follows Classic OSD's approach in PrimaryLogPG::start_recovery_ops():
clear PG_STATE_RECOVERING before posting recovery completion events. This
makes the existing safety check in PeeringState::Recovering::react() work:
when DeferRecovery arrives and sees !state_test(PG_STATE_RECOVERING), it
discards itself, preventing the state transition that would cause the crash.
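
The ordering problem and the fix can be modelled with a toy event queue
in Python. This is only a sketch of the reasoning above, not crimson or
Classic OSD code: the PG class, its string states and the boolean that
stands in for PG_STATE_RECOVERING are all simplifications made for the
example.

```python
class PG:
    """Toy model of the Recovering / NotRecovering race."""

    def __init__(self):
        self.state = "Recovering"
        self.recovering_flag = True  # stands in for PG_STATE_RECOVERING
        self.queue = []

    def finish_recovery(self, clear_flag_first):
        # The fix: clear the flag before posting the completion event.
        if clear_flag_first:
            self.recovering_flag = False
        self.queue.append("AllReplicasRecovered")

    def preempt(self):
        # Model the race: DeferRecovery is processed ahead of the
        # already queued completion event.
        self.queue.insert(0, "DeferRecovery")

    def run(self):
        while self.queue:
            ev = self.queue.pop(0)
            if self.state != "Recovering":
                raise RuntimeError(
                    f"bad state machine event {ev} in {self.state}")
            if ev == "DeferRecovery":
                if not self.recovering_flag:
                    continue  # safety check: discard the stale event
                self.state = "NotRecovering"
            elif ev == "AllReplicasRecovered":
                self.state = "Recovered"
```

With `clear_flag_first=True` the stale DeferRecovery is dropped and the
PG reaches "Recovered"; with `False` the same event order raises, which
mirrors the crash described in steps 1 to 4.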

Fixes: https://tracker.ceph.com/issues/73314
Signed-off-by: Aishwarya Mathuria <amathuri@redhat.com>
4 days ago cephadm: fix HAProxy/RGW IPv6 failure (ip_nonlocal_bind) 67493/head
kginon [Tue, 24 Feb 2026 18:28:56 +0000 (20:28 +0200)]
cephadm: fix HAProxy/RGW IPv6 failure (ip_nonlocal_bind)

Fixes: https://tracker.ceph.com/issues/56660
Signed-off-by: Kobi Ginon <kginon@redhat.com>
4 days ago Merge pull request #68367 from guits/fix-orch-osd-add-raw
Guillaume Abrioux [Wed, 15 Apr 2026 07:05:04 +0000 (09:05 +0200)]
Merge pull request #68367 from guits/fix-orch-osd-add-raw

cephadm: wait for latest osd map after ceph-volume before OSD deploy

4 days ago Merge pull request #68336 from rhcs-dashboard/rm-golang-gh-prom
Nizamudeen A [Wed, 15 Apr 2026 06:31:49 +0000 (12:01 +0530)]
Merge pull request #68336 from rhcs-dashboard/rm-golang-gh-prom

ceph.spec.in: replace golang github prometheus with promtool binary path

Reviewed-by: Kefu Chai <kchai@redhat.com>
Reviewed-by: John Mulligan
5 days ago Merge pull request #68298 from tchaikov/wip-rgw-inotify-ctor
Casey Bodley [Tue, 14 Apr 2026 18:38:58 +0000 (14:38 -0400)]
Merge pull request #68298 from tchaikov/wip-rgw-inotify-ctor

rgw/posix: fix Inotify member initialization order race

Reviewed-by: Matt Benjamin <mbenjamin@redhat.com>
5 days ago Merge pull request #68351 from joscollin/wip-TestCorruptedSubvolumes-fix
Venky Shankar [Tue, 14 Apr 2026 18:02:22 +0000 (23:32 +0530)]
Merge pull request #68351 from joscollin/wip-TestCorruptedSubvolumes-fix

qa: update yaml file for TestCorruptedSubvolumes

Reviewed-by: Rishabh Dave <ridave@redhat.com>
Reviewed-by: Venky Shankar <vshankar@redhat.com>
5 days ago Merge PR #66294 into main
Patrick Donnelly [Tue, 14 Apr 2026 15:46:54 +0000 (11:46 -0400)]
Merge PR #66294 into main

* refs/pull/66294/head:
qa: enforce centos9 for test
qa: rename distro
qa/suites/fs/bugs: use centos9 for squid upgrade test
qa: remove unused variables
qa: use centos9 for fs suites using k-testing
qa: update fs suite to rocky10
qa: skip dashboard install due to dependency noise
qa: only setup nat rules during bridge creation
qa: correct wording of comment
qa: use nft instead iptables
qa: use py3 builtin ipaddress module

Reviewed-by: Venky Shankar <vshankar@redhat.com>
5 days ago Merge pull request #62351 from vshankar/wip-revert-referent-inodes
Venky Shankar [Tue, 14 Apr 2026 15:35:25 +0000 (21:05 +0530)]
Merge pull request #62351 from vshankar/wip-revert-referent-inodes

mds: revert referent inode work

Reviewed-by: Patrick Donnelly <pdonnell@ibm.com>
5 days ago Merge pull request #68034 from timqn22/add-multi-line-to-yaml-representer
Redouane Kachach [Tue, 14 Apr 2026 14:20:40 +0000 (16:20 +0200)]
Merge pull request #68034 from timqn22/add-multi-line-to-yaml-representer

python-common: multi-line for yaml

Reviewed-by: Redouane Kachach <rkachach@redhat.com>
Reviewed-by: Shweta Bhosale <Shweta.Bhosale1@ibm.com>
5 days ago Merge pull request #68239 from Kushal-deb/fix-nvme-apply-path
Redouane Kachach [Tue, 14 Apr 2026 14:16:49 +0000 (16:16 +0200)]
Merge pull request #68239 from Kushal-deb/fix-nvme-apply-path

mgr/cephadm: create default pool for nvmeof

Reviewed-by: Redouane Kachach <rkachach@redhat.com>
5 days ago qa: update yaml file for TestCorruptedSubvolumes 68351/head
Jos Collin [Mon, 13 Apr 2026 10:30:57 +0000 (16:00 +0530)]
qa: update yaml file for TestCorruptedSubvolumes

Update yaml file to include TestCorruptedSubvolumes in qa tests.

Fixes: https://tracker.ceph.com/issues/76001
Signed-off-by: Jos Collin <jcollin@redhat.com>
5 days ago qa: remove vestiges of ceph-deploy 67822/head
Patrick Donnelly [Mon, 16 Mar 2026 16:12:44 +0000 (12:12 -0400)]
qa: remove vestiges of ceph-deploy

Signed-off-by: Patrick Donnelly <pdonnell@ibm.com>
5 days ago doc: remove references to ceph-deploy
Patrick Donnelly [Mon, 16 Mar 2026 15:56:43 +0000 (11:56 -0400)]
doc: remove references to ceph-deploy

Signed-off-by: Patrick Donnelly <pdonnell@ibm.com>
5 days ago qa: add trivial cephfs-tool bench test 67032/head
Patrick Donnelly [Tue, 14 Apr 2026 12:57:33 +0000 (08:57 -0400)]
qa: add trivial cephfs-tool bench test

Signed-off-by: Patrick Donnelly <pdonnell@ibm.com>
5 days ago doc/cephfs: add cephfs-tool documentation
Patrick Donnelly [Wed, 21 Jan 2026 18:37:50 +0000 (13:37 -0500)]
doc/cephfs: add cephfs-tool documentation

AI-Assisted: this doc was generated from the help text / initial commit message
Signed-off-by: Patrick Donnelly <pdonnell@ibm.com>
5 days ago tools/cephfs: add new cephfs-tool
Patrick Donnelly [Wed, 21 Jan 2026 17:25:31 +0000 (12:25 -0500)]
tools/cephfs: add new cephfs-tool

This patch introduces `cephfs-tool`, a new standalone C++ utility
designed to interact directly with `libcephfs`.

While the tool is architected to support various subcommands in the
future, the initial implementation focuses on a `bench` command to
measure library performance. This allows developers and administrators
to benchmark the userspace library isolated from FUSE or Kernel client
overheads.

Key features include:
* Multi-threaded Read/Write throughput benchmarking.
* Configurable block sizes, file counts, and fsync intervals.
* Detailed statistical reporting (Mean, Std Dev, Min/Max) for throughput and IOPS.
* Support for specific CephFS user/group impersonation (UID/GID) via `ceph_mount_perms_set`.

As an example test on a "trial" sepia machine against the new LRC, I
used a command like:

    pdonnell@trial154:~$ env CEPH_ARGS="--log-to-stderr=false --log-to-file=false --log-file=/tmp/bench.log" ./cephfs-tool -c ~/ceph.conf -k ~/keyring -i scratch --filesystem scratch  bench --root-path=/pdonnell --files 256 --size=$(( 128 * 2 ** 20 )) --threads=8 --iterations 3
    Benchmark Configuration:
      Threads: 8 | Iterations: 3
      Files: 256 | Size: 134217728
      Filesystem: scratch
      Root: /pdonnell
      Subdirectory: bench_run_d942
      UID: -1
      GID: -1

    --- Iteration 1 of 3 ---
    Starting Write Phase...
      Write: 2761.97 MB/s, 21.5779 files/s (11.864s)
    Starting Read Phase...
      Read:  2684.36 MB/s, 20.9716 files/s (12.207s)

    --- Iteration 2 of 3 ---
    Starting Write Phase...
      Write: 2698.51 MB/s, 21.0821 files/s (12.143s)
    Starting Read Phase...
      Read:  2682.16 MB/s, 20.9544 files/s (12.217s)

    --- Iteration 3 of 3 ---
    Starting Write Phase...
      Write: 2720.69 MB/s, 21.2554 files/s (12.044s)
    Starting Read Phase...
      Read:  2695.18 MB/s, 21.0561 files/s (12.158s)

    *** Final Report ***

    Write Throughput Statistics (3 runs):
      Mean:    2727.06 MB/s
      Std Dev: 26.2954 MB/s
      Min:     2698.51 MB/s
      Max:     2761.97 MB/s

    Read Throughput Statistics (3 runs):
      Mean:    2687.24 MB/s
      Std Dev: 5.68904 MB/s
      Min:     2682.16 MB/s
      Max:     2695.18 MB/s

    File Creates Statistics (3 runs):
      Mean:    21.3051 files/s
      Std Dev: 0.205433 files/s
      Min:     21.0821 files/s
      Max:     21.5779 files/s

    File Reads (Opens) Statistics (3 runs):
      Mean:    20.994 files/s
      Std Dev: 0.0444456 files/s
      Min:     20.9544 files/s
      Max:     21.0561 files/s

    Cleaning up...
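
The summary figures in the report above can be cross-checked with a few
lines of arithmetic. The Python sketch below is not part of cephfs-tool;
it recomputes the write statistics from the three per-iteration values
shown above. Two points are inferred from the numbers rather than stated
by the tool: that the reported "Std Dev" behaves like a population
standard deviation, and that "MB" means 2**20 bytes (files/s multiplied
by the 128 MiB file size reproduces the MB/s column).

```python
import statistics

# Per-iteration write throughput copied from the report above.
write_mb_s = [2761.97, 2698.51, 2720.69]

mean = statistics.mean(write_mb_s)
pstdev = statistics.pstdev(write_mb_s)  # population std dev (assumption)

# File size passed on the command line: 128 * 2 ** 20 bytes.
file_size = 128 * 2 ** 20

# Iteration 1: files/s times file size in MiB gives the MB/s figure,
# which suggests the tool's "MB" is 2**20 bytes.
mb_s_from_files = 21.5779 * (file_size / 2 ** 20)

# Convert the mean to decimal gigabits per second for comparison with
# the NIC's nominal line rate.
gbit_s = mean * 2 ** 20 * 8 / 1e9

print(f"mean={mean:.2f} pstdev={pstdev:.4f}")
print(f"derived MB/s={mb_s_from_files:.2f} line rate={gbit_s:.2f} Gb/s")
```

The small difference between the recomputed standard deviation and the
reported one is expected, because the inputs here are already rounded
to two decimals.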

For a 25Gb NIC, this is just about saturating the sticker bandwidth with
a single shared mount and 8 threads. For a per-thread mount:

    pdonnell@trial154:~$ env CEPH_ARGS="--log-to-stderr=false --log-to-file=false --log-file=/tmp/bench.log" ./cephfs-tool -c ~/ceph.conf -k ~/keyring -i scratch --filesystem scratch  bench --root-path=/pdonnell --files 256 --size=$(( 128 * 2 ** 20 )) --threads=8 --iterations 3 --per-thread-mount
    Benchmark Configuration:
      Threads: 8 | Iterations: 3
      Files: 256 | Size: 134217728
      Filesystem: scratch
      Root: /pdonnell
      Subdirectory: bench_run_9d1c
      UID: -1
      GID: -1

    --- Iteration 1 of 3 ---
    Starting Write Phase...
      Write: 2691.2 MB/s, 21.025 files/s (12.176s)
    Starting Read Phase...
      Read:  2486.76 MB/s, 19.4278 files/s (13.177s)

    --- Iteration 2 of 3 ---
    Starting Write Phase...
      Write: 2688.77 MB/s, 21.006 files/s (12.187s)
    Starting Read Phase...
      Read:  2496.42 MB/s, 19.5033 files/s (13.126s)

    --- Iteration 3 of 3 ---
    Starting Write Phase...
      Write: 2692.08 MB/s, 21.0319 files/s (12.172s)
    Starting Read Phase...
      Read:  2488.27 MB/s, 19.4396 files/s (13.169s)
    *** Final Report ***

    Write Throughput Statistics (3 runs):
      Mean:    2690.68 MB/s
      Std Dev: 1.40086 MB/s
      Min:     2688.77 MB/s
      Max:     2692.08 MB/s

    Read Throughput Statistics (3 runs):
      Mean:    2490.48 MB/s
      Std Dev: 4.24374 MB/s
      Min:     2486.76 MB/s
      Max:     2496.42 MB/s

    File Creates Statistics (3 runs):
      Mean:    21.0209 files/s
      Std Dev: 0.0109442 files/s
      Min:     21.006 files/s
      Max:     21.0319 files/s

    File Reads (Opens) Statistics (3 runs):
      Mean:    19.4569 files/s
      Std Dev: 0.0331542 files/s
      Min:     19.4278 files/s
      Max:     19.5033 files/s

    Cleaning up...

Or to measure file create performance:

    pdonnell@trial154:~$ env CEPH_ARGS="--log-to-stderr=false --log-to-file=false --log-file=/tmp/bench.log" ./cephfs-tool -c ~/ceph.conf -k ~/keyring -i scratch --filesystem scratch  bench --root-path=/pdonnell --files=$(( 2 ** 16 )) --size=$(( 0 * 2 ** 20 )) --threads=8 --iterations 3
    Benchmark Configuration:
      Threads: 8 | Iterations: 3
      Files: 65536 | Size: 0
      Filesystem: scratch
      Root: /pdonnell
      Subdirectory: bench_run_d435
      UID: -1
      GID: -1

    --- Iteration 1 of 3 ---
    Starting Write Phase...
      Write: 3974.77 files/s (16.488s)
    Starting Read Phase...
      Read:  14537.7 files/s (4.508s)
    Cleaning up for next iteration...

    --- Iteration 2 of 3 ---
    Starting Write Phase...
      Write: 4167.1 files/s (15.727s)
    Starting Read Phase...
      Read:  13636.3 files/s (4.806s)
    Cleaning up for next iteration...

    --- Iteration 3 of 3 ---
    Starting Write Phase...
      Write: 3863.7 files/s (16.962s)
    Starting Read Phase...
      Read:  14972.8 files/s (4.377s)

    *** Final Report ***

    File Creates Statistics (3 runs):
      Mean:    4001.86 files/s
      Std Dev: 125.337 files/s
      Min:     3863.7 files/s
      Max:     4167.1 files/s

    File Reads (Opens) Statistics (3 runs):
      Mean:    14382.3 files/s
      Std Dev: 556.594 files/s
      Min:     13636.3 files/s
      Max:     14972.8 files/s

    Cleaning up...

Here is the current help text:

    Usage: cephfs-bench [general-options] <command> [command-options]

    Commands:
      bench      Run IO benchmark

    Allowed options:

    General Options:
      -h [ --help ]                   Produce help message
      -c [ --conf ] arg               Ceph config file path
      -i [ --id ] arg (=admin)        Client ID
      -k [ --keyring ] arg            Path to keyring file
      --filesystem arg                CephFS filesystem name to mount
      --uid arg (=-1)                 User ID to mount as
      --gid arg (=-1)                 Group ID to mount as

    Benchmark Options (used with 'bench' command):
      --threads arg (=1)              Number of threads
      --iterations arg (=1)           Number of iterations
      --files arg (=100)              Total number of files
      --size arg (=4MB)               File size (e.g. 4MB, 0 for creates only)
      --block-size arg (=4MB)         IO block size (e.g. 1MB)
      --fsync-every arg (=0)          Call fsync every N bytes
      --prefix arg (=benchmark_)      Filename prefix
      --dir-prefix arg (=bench_run_)  Directory prefix
      --root-path arg (=/)            Root path in CephFS
      --per-thread-mount              Use separate mount per thread
      --no-cleanup                    Disable cleanup of files

AI-Assisted: significant portions of this code were AI-generated through dozens of iterative prompts.
Signed-off-by: Patrick Donnelly <pdonnell@ibm.com>
5 days ago Merge PR #67555 into main
Patrick Donnelly [Tue, 14 Apr 2026 12:18:07 +0000 (08:18 -0400)]
Merge PR #67555 into main

* refs/pull/67555/head:
qa/tasks/quiescer: remove racy assertion

Reviewed-by: Rishabh Dave <ridave@redhat.com>
5 days ago Merge PR #68360 into main
Patrick Donnelly [Tue, 14 Apr 2026 12:17:28 +0000 (08:17 -0400)]
Merge PR #68360 into main

* refs/pull/68360/head:
qa: ignore expected MON_DOWN

Reviewed-by: Venky Shankar <vshankar@redhat.com>
5 days ago Merge pull request #66495 from rkachach/fix_issue_nvmeof_ssl
Redouane Kachach [Tue, 14 Apr 2026 11:53:11 +0000 (13:53 +0200)]
Merge pull request #66495 from rkachach/fix_issue_nvmeof_ssl

mgr/cephadm: fix nvmeof TLS handling and add coverage for ssl/mTLS

Reviewed-by: Adam King <adking@redhat.com>
5 days ago Merge pull request #60156 from neesingh-rh/wip-fix-traceback-issue
Venky Shankar [Tue, 14 Apr 2026 10:31:27 +0000 (16:01 +0530)]
Merge pull request #60156 from neesingh-rh/wip-fix-traceback-issue

src: remove showing traceback from assert_valid_host when it fails

Reviewed-by: Venky Shankar <vshankar@redhat.com>
5 days ago Merge pull request #67645 from rhcs-dashboard/75312-EC-profile-default-value
Pedro Gonzalez Gomez [Tue, 14 Apr 2026 10:02:34 +0000 (12:02 +0200)]
Merge pull request #67645 from rhcs-dashboard/75312-EC-profile-default-value

mgr/dashboard: 75312 Making EC profile default plugin value to ISA

Reviewed-by: Nizamudeen A <nia@redhat.com>
Reviewed-by: Abhishek Desai <abhishek.desai1@ibm.com>
Reviewed-by: Pedro Gonzalez Gomez <pegonzal@ibm.com>
5 days ago Merge pull request #67268 from perezjosibm/wip-perezjos-tracker74444
Jose Juan Palacios-Perez [Tue, 14 Apr 2026 09:27:41 +0000 (10:27 +0100)]
Merge pull request #67268 from perezjosibm/wip-perezjos-tracker74444

qa/suites/crimson-rados: first version of cephfs/

5 days ago Merge pull request #67358 from knrt10/wip-crimson-rgw-sts
Matan Breizman [Tue, 14 Apr 2026 08:32:11 +0000 (11:32 +0300)]
Merge pull request #67358 from knrt10/wip-crimson-rgw-sts

qa: Add rgw sts suites to crimson-rados

Reviewed-by: Matan Breizman <mbreizma@redhat.com>
5 days ago Merge pull request #68363 from gbregman/main
Gil Bregman [Tue, 14 Apr 2026 07:59:49 +0000 (10:59 +0300)]
Merge pull request #68363 from gbregman/main

mgr/dashboard: Add port and secure-listeners to subsystem add NVMeoF CLI command

5 days ago cephadm: wait for latest osd map after ceph-volume before OSD deploy 68367/head
Guillaume Abrioux [Fri, 10 Apr 2026 14:37:58 +0000 (16:37 +0200)]
cephadm: wait for latest osd map after ceph-volume before OSD deploy

After ceph-volume creates an OSD, the cached osd map of the mgr can
lag behind the monitors. get_osd_uuid_map() then misses the new osd
id and deploy_osd_daemons_for_existing_osds() skips deploying the
cephadm daemon, which reports a misleading "Created no osd(s)" even
though the osd exists.

This behavior is often seen with raw devices (lvm list returns quicker).

This also fixes get_osd_uuid_map(only_up=True) as the previous branch
never populated the map when 'only_up' was true.
Now we only include osds with 'up==1' so a new OSD created (but still down)
is not treated as already present.

Fixes: https://tracker.ceph.com/issues/75965
Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>
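
The only_up filtering described in the commit message above can be sketched as follows. This is a minimal illustration, not the cephadm code: the helper name build_osd_uuid_map and the input shape (a list of dicts with 'osd', 'uuid' and 'up' keys) are assumptions made for the example.

```python
from typing import Dict, Iterable, Mapping


def build_osd_uuid_map(osds: Iterable[Mapping], only_up: bool = False) -> Dict[str, str]:
    """Map OSD id (as a string) to its uuid.

    Hypothetical helper: when only_up is true, OSDs whose 'up' field is
    not 1 are left out, so a freshly created (still down) OSD is not
    treated as already present.
    """
    result: Dict[str, str] = {}
    for osd in osds:
        if only_up and osd.get('up') != 1:
            continue
        result[str(osd['osd'])] = osd['uuid']
    return result


# Illustrative data only; not taken from a real cluster.
osd_dump = [
    {'osd': 0, 'uuid': 'aaaa', 'up': 1},
    {'osd': 1, 'uuid': 'bbbb', 'up': 0},  # just created, not yet up
]
all_osds = build_osd_uuid_map(osd_dump)
up_osds = build_osd_uuid_map(osd_dump, only_up=True)
```

With only_up=True the map is still populated, but only with the OSD that is up.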
5 days agomgr/cephadm: create default pool for nvmeof 68239/head
Kushal Deb [Tue, 7 Apr 2026 14:00:22 +0000 (19:30 +0530)]
mgr/cephadm: create default pool for nvmeof

Signed-off-by: Kushal Deb <Kushal.Deb@ibm.com>
5 days agoMerge pull request #65508 from MaxKellermann/mgr_includes
Kefu Chai [Tue, 14 Apr 2026 03:58:21 +0000 (11:58 +0800)]
Merge pull request #65508 from MaxKellermann/mgr_includes

mgr: reduce header dependencies

Reviewed-by: Ronen Friedman <rfriedma@redhat.com>
Reviewed-by: Kefu Chai <k.chai@proxmox.com>
5 days agoqa: enforce centos9 for test 66294/head
Patrick Donnelly [Tue, 14 Apr 2026 01:22:16 +0000 (21:22 -0400)]
qa: enforce centos9 for test

Avoids problem where rocky10 packages do not exist for squid.

Signed-off-by: Patrick Donnelly <pdonnell@ibm.com>
5 days agoqa: rename distro
Patrick Donnelly [Tue, 14 Apr 2026 00:47:43 +0000 (20:47 -0400)]
qa: rename distro

The kernel mount overrides for the distro have no effect if they are
applied before `supported-random-distro`.

Fixes:

    2026-04-13T19:06:13.603 INFO:teuthology.task.pexec:sudo dnf remove nvme-cli -y
    2026-04-13T19:06:13.603 INFO:teuthology.task.pexec:sudo dnf install nvmetcli nvme-cli -y
    2026-04-13T19:06:13.626 INFO:teuthology.task.pexec:Running commands on host ubuntu@trial005.front.sepia.ceph.com
    2026-04-13T19:06:13.627 INFO:teuthology.task.pexec:sudo dnf remove nvme-cli -y
    2026-04-13T19:06:13.627 INFO:teuthology.task.pexec:sudo dnf install nvmetcli nvme-cli -y
    2026-04-13T19:06:13.652 INFO:teuthology.orchestra.run.trial148.stderr:sudo: dnf: command not found
    2026-04-13T19:06:13.653 DEBUG:teuthology.orchestra.run:got remote process result: 1
    2026-04-13T19:06:13.654 ERROR:teuthology.run_tasks:Saw exception from tasks.
    Traceback (most recent call last):
      File "/home/teuthworker/src/git.ceph.com_teuthology_426ec63bc4a39bba882efb593125294667afc593/teuthology/run_tasks.py", line 105, in run_tasks
        manager = run_one_task(taskname, ctx=ctx, config=config)
                  ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
      File "/home/teuthworker/src/git.ceph.com_teuthology_426ec63bc4a39bba882efb593125294667afc593/teuthology/run_tasks.py", line 83, in run_one_task
        return task(**kwargs)
               ^^^^^^^^^^^^^^
      File "/home/teuthworker/src/git.ceph.com_teuthology_426ec63bc4a39bba882efb593125294667afc593/teuthology/task/pexec.py", line 149, in task
        with parallel() as p:
      File "/home/teuthworker/src/git.ceph.com_teuthology_426ec63bc4a39bba882efb593125294667afc593/teuthology/parallel.py", line 84, in __exit__
        for result in self:
      File "/home/teuthworker/src/git.ceph.com_teuthology_426ec63bc4a39bba882efb593125294667afc593/teuthology/parallel.py", line 98, in __next__
        resurrect_traceback(result)
      File "/home/teuthworker/src/git.ceph.com_teuthology_426ec63bc4a39bba882efb593125294667afc593/teuthology/parallel.py", line 30, in resurrect_traceback
        raise exc.exc_info[1]
      File "/home/teuthworker/src/git.ceph.com_teuthology_426ec63bc4a39bba882efb593125294667afc593/teuthology/parallel.py", line 23, in capture_traceback
        return func(*args, **kwargs)
               ^^^^^^^^^^^^^^^^^^^^^
      File "/home/teuthworker/src/git.ceph.com_teuthology_426ec63bc4a39bba882efb593125294667afc593/teuthology/task/pexec.py", line 62, in _exec_host
        tor.wait([r])
      File "/home/teuthworker/src/git.ceph.com_teuthology_426ec63bc4a39bba882efb593125294667afc593/teuthology/orchestra/run.py", line 485, in wait
        proc.wait()
      File "/home/teuthworker/src/git.ceph.com_teuthology_426ec63bc4a39bba882efb593125294667afc593/teuthology/orchestra/run.py", line 161, in wait
        self._raise_for_status()
      File "/home/teuthworker/src/git.ceph.com_teuthology_426ec63bc4a39bba882efb593125294667afc593/teuthology/orchestra/run.py", line 181, in _raise_for_status
        raise CommandFailedError(
    teuthology.exceptions.CommandFailedError: Command failed on trial148 with status 1: 'TESTDIR=/home/ubuntu/cephtest bash -s'

This happened because the dnf commands were pulled from rocky10.yaml in the kclient overrides while ubuntu_latest was used for the random distro.

Signed-off-by: Patrick Donnelly <pdonnell@ibm.com>
5 days agoqa/suites/fs/bugs: use centos9 for squid upgrade test
Patrick Donnelly [Mon, 13 Apr 2026 18:37:44 +0000 (14:37 -0400)]
qa/suites/fs/bugs: use centos9 for squid upgrade test

To avoid a missing-package error for rocky10.

Signed-off-by: Patrick Donnelly <pdonnell@ibm.com>
5 days agoqa: remove unused variables
Patrick Donnelly [Fri, 20 Mar 2026 21:46:29 +0000 (17:46 -0400)]
qa: remove unused variables

To make tox-qa happy.

Signed-off-by: Patrick Donnelly <pdonnell@ibm.com>
5 days agoqa: use centos9 for fs suites using k-testing
Patrick Donnelly [Thu, 12 Feb 2026 15:36:29 +0000 (10:36 -0500)]
qa: use centos9 for fs suites using k-testing

A better approach would be to include centos9 OR rocky10 for
distribution choice. Then we can just filter out rocky10 when we're
testing the `testing` kernel but keep rocky10 coverage for other
testing.

Signed-off-by: Patrick Donnelly <pdonnell@ibm.com>
5 days agoqa: update fs suite to rocky10
Patrick Donnelly [Wed, 12 Nov 2025 00:20:22 +0000 (19:20 -0500)]
qa: update fs suite to rocky10

Signed-off-by: Patrick Donnelly <pdonnell@ibm.com>
5 days agoqa: skip dashboard install due to dependency noise
Patrick Donnelly [Wed, 19 Nov 2025 17:25:45 +0000 (12:25 -0500)]
qa: skip dashboard install due to dependency noise

    2025-11-18T19:46:46.226 INFO:teuthology.orchestra.run.smithi008.stdout:/usr/bin/ceph: stderr Error ENOTSUP: Module 'alerts' is not enabled/loaded (required by command 'dashboard set-ssl-certificate'): use `ceph mgr module enable alerts` to enable it

Signed-off-by: Patrick Donnelly <pdonnell@ibm.com>
5 days agoqa: only setup nat rules during bridge creation
Patrick Donnelly [Wed, 11 Mar 2026 06:16:35 +0000 (11:46 +0530)]
qa: only setup nat rules during bridge creation

Currently the code recreates these NAT rules for every mount. This only
needs to be done once by the first mount.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
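
The "set up once, on the first mount" idea from the commit above can be sketched like this. It is an illustration under stated assumptions: the BridgeSetup class, its method names and the placeholder commands are hypothetical and do not come from the qa code.

```python
from typing import Callable, List


class BridgeSetup:
    """Hypothetical stand-in for the per-host bridge setup shared by mounts."""

    def __init__(self, run: Callable[[List[str]], None]) -> None:
        self._run = run               # callable that executes one command
        self._bridge_created = False

    def ensure_bridge(self) -> None:
        """Create the bridge and its NAT rules once; later calls do nothing."""
        if self._bridge_created:
            return
        self._run(['create-bridge'])    # placeholder command
        self._run(['setup-nat-rules'])  # placeholder command
        self._bridge_created = True

    def mount(self, name: str) -> None:
        self.ensure_bridge()
        self._run(['mount', name])      # placeholder command


# Record the commands instead of executing them.
executed: List[List[str]] = []
setup = BridgeSetup(executed.append)
setup.mount('a')
setup.mount('b')
```

Tying the NAT rules to bridge creation means the second mount only issues its own mount command.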
5 days agoqa: correct wording of comment
Patrick Donnelly [Wed, 11 Mar 2026 06:14:39 +0000 (11:44 +0530)]
qa: correct wording of comment

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
5 days agoqa: use nft instead of iptables
Patrick Donnelly [Mon, 17 Nov 2025 18:14:47 +0000 (13:14 -0500)]
qa: use nft instead of iptables

rocky10 does not support iptables with MASQUERADE targets. (Or maybe it
does with more prodding, but it's easier to just switch to nft.)

Signed-off-by: Patrick Donnelly <pdonnell@ibm.com>
5 days agoqa: use py3 builtin ipaddress module
Patrick Donnelly [Mon, 17 Nov 2025 18:13:37 +0000 (13:13 -0500)]
qa: use py3 builtin ipaddress module

Signed-off-by: Patrick Donnelly <pdonnell@ibm.com>
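
For readers unfamiliar with the standard-library ipaddress module named in the commit above, a short sketch of the kind of address arithmetic it covers. The network value is illustrative and is not taken from the qa code.

```python
import ipaddress

# Illustrative network; any valid CIDR string works here.
net = ipaddress.ip_network('192.168.144.0/20')

gateway = net[1]                   # first address after the network address
netmask = net.netmask              # dotted-quad netmask for the prefix length
broadcast = net.broadcast_address  # last address in the network
```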
6 days agomgr/dashboard: Add port and secure-listeners to subsystem add NVMeoF CLI command 68363/head
Gil Bregman [Mon, 13 Apr 2026 21:41:25 +0000 (00:41 +0300)]
mgr/dashboard: Add port and secure-listeners to subsystem add NVMeoF CLI command
Fixes: https://tracker.ceph.com/issues/75998
Signed-off-by: Gil Bregman <gbregman@il.ibm.com>
6 days agoMerge pull request #67661 from jzhu116-bloomberg/wip-75340
Shilpa Jagannath [Mon, 13 Apr 2026 21:06:23 +0000 (14:06 -0700)]
Merge pull request #67661 from jzhu116-bloomberg/wip-75340

rgw: break the coupling of olh epoch and epochs of olh ops

6 days agoMerge pull request #68167 from smanjara/wip-fix-75465
Shilpa Jagannath [Mon, 13 Apr 2026 20:06:27 +0000 (13:06 -0700)]
Merge pull request #68167 from smanjara/wip-fix-75465

qa/multisite: fix test_bucket_remove failure

6 days agoqa: ignore expected MON_DOWN 68360/head
Patrick Donnelly [Mon, 13 Apr 2026 18:22:35 +0000 (14:22 -0400)]
qa: ignore expected MON_DOWN

tasks.cephfs.test_misc.TestMisc.test_dump_inmemory_log_on_missed_internal_heartbeats
deliberately stops the monitors.

Fixes: https://tracker.ceph.com/issues/75992
Signed-off-by: Patrick Donnelly <pdonnell@ibm.com>
6 days agoMerge pull request #67253 from VinayakTiwari1103/osd-fix-build-incremental-map-msg...
Radoslaw Zarzynski [Mon, 13 Apr 2026 18:00:17 +0000 (20:00 +0200)]
Merge pull request #67253 from VinayakTiwari1103/osd-fix-build-incremental-map-msg-abort

osd: avoid ceph_abort in build_incremental_map_msg when newest_map is missing

Reviewed-by: Kamoltat Sirivadhna <ksirivad@redhat.com>
Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
6 days agoMerge pull request #68342 from Matan-B/wip-matanb-objecter-branch
Matan Breizman [Mon, 13 Apr 2026 16:20:40 +0000 (19:20 +0300)]
Merge pull request #68342 from Matan-B/wip-matanb-objecter-branch

qa/tasks/admin_socket.py: replace git.ceph.com with GitHub raw URLs

Reviewed-by: Shraddha Agrawal <shraddhaag@ibm.com>
6 days agodoc/rados/configuration: Update bluestore-config-ref.rst WAL+DB sizing 68322/head
Anthony D'Atri [Fri, 10 Apr 2026 13:38:49 +0000 (09:38 -0400)]
doc/rados/configuration: Update bluestore-config-ref.rst WAL+DB sizing

Signed-off-by: Anthony D'Atri <anthonyeleven@users.noreply.github.com>
6 days agoMerge pull request #67744 from SHASHI9705/dashboard-align-no-notification-icon
Afreen Misbah [Mon, 13 Apr 2026 15:29:48 +0000 (20:59 +0530)]
Merge pull request #67744 from SHASHI9705/dashboard-align-no-notification-icon

dashboard: align 'no notifications available' icon in notifications page

Reviewed-by: Afreen Misbah <afreen@ibm.com>
6 days agorgw: Add documentation for the --pretty-format option 68355/head
Nathan Hoad [Mon, 13 Apr 2026 14:26:34 +0000 (10:26 -0400)]
rgw: Add documentation for the --pretty-format option

Fixes: http://tracker.ceph.com/issues/75988
Signed-off-by: Nathan Hoad <nhoad@bloomberg.net>
6 days agomgr/cephadm: fix nvmeof TLS handling and add coverage for ssl/mTLS 66495/head
Redouane Kachach [Wed, 3 Dec 2025 09:36:25 +0000 (10:36 +0100)]
mgr/cephadm: fix nvmeof TLS handling and add coverage for ssl/mTLS

This PR fixes the value of the `ssl` field on `NvmeofServiceSpec` (it
was always set to enable_auth) and adds unit tests to make sure that
specs with ssl only and specs with mTLS enabled (enable_auth) both
generate the expected daemon configuration.

Fixes: https://tracker.ceph.com/issues/74073
Signed-off-by: Redouane Kachach <rkachach@ibm.com>
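
The class of bug described in the commit above (one field silently mirroring another) can be sketched as follows. This is a reduced, hypothetical model: TlsSpec, tls_config and the output keys are assumptions for illustration and do not reproduce NvmeofServiceSpec or the generated daemon configuration.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass
class TlsSpec:
    """Hypothetical, reduced stand-in for the TLS-related spec fields."""
    ssl: bool = False
    enable_auth: bool = False  # mTLS


def tls_config(spec: TlsSpec) -> Dict[str, bool]:
    # Each output value is derived from its own field. Reading
    # spec.enable_auth for both values would reproduce the kind of
    # mistake the commit message describes.
    return {'ssl': spec.ssl, 'mtls': spec.enable_auth}


ssl_only = tls_config(TlsSpec(ssl=True))
with_mtls = tls_config(TlsSpec(ssl=True, enable_auth=True))
```

Checking both the ssl-only case and the mTLS case, as the commit's unit tests are said to do, is what distinguishes the two fields.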
6 days agoMerge pull request #68275 from VallariAg/fix-top-tool
Afreen Misbah [Mon, 13 Apr 2026 13:20:15 +0000 (18:50 +0530)]
Merge pull request #68275 from VallariAg/fix-top-tool

mgr/dashboard: validate args in nvmeof top cmds

Reviewed-by: Afreen Misbah <afreen@ibm.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>
6 days agoMerge pull request #68306 from shraddhaag/wip-shraddhaag-bad-machine-state-logs
Shraddha Agrawal [Mon, 13 Apr 2026 13:12:00 +0000 (18:42 +0530)]
Merge pull request #68306 from shraddhaag/wip-shraddhaag-bad-machine-state-logs

src/osd: add more logs to the AllReplicasRecovered handler

6 days agoceph.spec.in: replace golang github prometheus with promtool binary path 68336/head
Nizamudeen A [Sun, 12 Apr 2026 06:06:30 +0000 (11:36 +0530)]
ceph.spec.in: replace golang github prometheus with promtool binary path

I no longer see golang-github-prometheus available for centos or for
other distros. Different packages provide promtool on different distros,
so instead of identifying every corresponding package and its name, this
replaces the package name with the binary path so that it works across
distros without distro-specific conditions.

Some build failures were captured recently in our internal runs:
https://github.com/rhcs-dashboard/ceph-dev/actions/runs/24298848427/job/70949666821

Signed-off-by: Nizamudeen A <nia@redhat.com>
6 days agoMerge pull request #65917 from MattyWilliams22/wip-select-data-generator
SrinivasaBharathKanta [Mon, 13 Apr 2026 12:24:26 +0000 (17:54 +0530)]
Merge pull request #65917 from MattyWilliams22/wip-select-data-generator

test: Add option to select data generation type

6 days agoMerge pull request #68329 from tchaikov/wip-mgr-codecs.open
Kefu Chai [Mon, 13 Apr 2026 10:11:27 +0000 (18:11 +0800)]
Merge pull request #68329 from tchaikov/wip-mgr-codecs.open

mgr/dashboard: replace deprecated codecs.open with open

Reviewed-by: Nizamudeen A <nia@redhat.com>
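
The replacement named in the PR title above can be sketched in general terms. This is a generic example, not the dashboard code: the file path and contents are made up for illustration.

```python
import os
import tempfile

# Illustrative file in a fresh temporary directory.
path = os.path.join(tempfile.mkdtemp(), 'example.txt')

# Previously: codecs.open(path, 'w', encoding='utf-8')
# The builtin open() accepts the same encoding argument in text mode.
with open(path, 'w', encoding='utf-8') as f:
    f.write('caf\u00e9\n')

with open(path, 'r', encoding='utf-8') as f:
    content = f.read()
```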
6 days agoMerge pull request #68166 from pdvian/wip-fix-mon-crash
SrinivasaBharathKanta [Mon, 13 Apr 2026 09:58:24 +0000 (15:28 +0530)]
Merge pull request #68166 from pdvian/wip-fix-mon-crash

ceph_mon: Fix shutdown order to destroy Monitor before closing mon store

6 days agoMerge pull request #68043 from dheart-joe/master
SrinivasaBharathKanta [Mon, 13 Apr 2026 09:55:10 +0000 (15:25 +0530)]
Merge pull request #68043 from dheart-joe/master

BlueFS: _flush_and_sync_log_jump_D should validate that jump_to >= log.writer->pos

6 days agoMerge pull request #66460 from MattyWilliams22/omap-rm-range
SrinivasaBharathKanta [Mon, 13 Apr 2026 09:52:41 +0000 (15:22 +0530)]
Merge pull request #66460 from MattyWilliams22/omap-rm-range

librados: Add omap_rm_range function to ObjectWriteOperation

6 days agoMerge pull request #68048 from sseshasa/wip-fix-max-criteria-in-phase1
Sridhar Seshasayee [Mon, 13 Apr 2026 09:44:24 +0000 (15:14 +0530)]
Merge pull request #68048 from sseshasa/wip-fix-max-criteria-in-phase1

mgr/DaemonServer: Limit search for OSDs to upgrade within the crush bucket

Reviewed-by: Nitzan Mordechai <nmordech@redhat.com>
Reviewed-by: Kamoltat Sirivadhna <ksirivad@redhat.com>
6 days agoMerge pull request #67388 from asm0deuz/stop_start_v2
Redouane Kachach [Mon, 13 Apr 2026 09:22:12 +0000 (11:22 +0200)]
Merge pull request #67388 from asm0deuz/stop_start_v2

cephadm: Automates the procedure to shutdown and bring up an entire Ceph cluster

Reviewed-by: Redouane Kachach <rkachach@ibm.com>
6 days agoMerge pull request #68305 from rkachach/fix_teuthology_nfs_issue
Redouane Kachach [Mon, 13 Apr 2026 08:33:58 +0000 (10:33 +0200)]
Merge pull request #68305 from rkachach/fix_teuthology_nfs_issue

qa/cephadm: fix NFS ganesha startup failure in containers

Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: Shweta Bhosale <Shweta.Bhosale1@ibm.com>
6 days agoMerge pull request #67622 from tchaikov/wip-qa-cephadm-ignore-failed-daemon
Kefu Chai [Mon, 13 Apr 2026 08:27:11 +0000 (16:27 +0800)]
Merge pull request #67622 from tchaikov/wip-qa-cephadm-ignore-failed-daemon

qa/cephadm: ignore transient CEPHADM_FAILED_DAEMON in smoke-singlehost

Reviewed-by: Laura Flores <lflores@redhat.com>
Reviewed-by: Adam King <adking@redhat.com>
6 days agoMerge pull request #67584 from aainscow/osd-backend-test-infrastructure
Alex Ainscow [Mon, 13 Apr 2026 07:15:25 +0000 (08:15 +0100)]
Merge pull request #67584 from aainscow/osd-backend-test-infrastructure

OSD: Add EC backend test infrastructure with peering support

Reviewed-by: Bill Scales <bill_scales@uk.ibm.com>
6 days agoMerge pull request #67443 from aainscow/ec-stretch-cluster-design
Alex Ainscow [Mon, 13 Apr 2026 07:13:31 +0000 (08:13 +0100)]
Merge pull request #67443 from aainscow/ec-stretch-cluster-design

docs: First pass at EC stretch cluster design

Reviewed-by: Bill Scales <bill_scales@uk.ibm.com>
Reviewed-by: Anthony D'Atri <unknown@unknown.com>
6 days agoMerge pull request #67951 from leonidc/propagate_quorum_features_to_client
Aviv Caro [Mon, 13 Apr 2026 07:07:38 +0000 (10:07 +0300)]
Merge pull request #67951 from leonidc/propagate_quorum_features_to_client

Propagate quorum features to client

6 days agoMerge pull request #67872 from rhcs-dashboard/75547-adding-msr-ec-profile-via-dashboard
Nizamudeen A [Mon, 13 Apr 2026 05:28:39 +0000 (10:58 +0530)]
Merge pull request #67872 from rhcs-dashboard/75547-adding-msr-ec-profile-via-dashboard

mgr/dashboard: Adding MSR EC Profile via dashboard

Reviewed-by: Nizamudeen A <nia@redhat.com>
Reviewed-by: Afreen Misbah <afreen@ibm.com>
Reviewed-by: Abhishek Desai <abhishek.desai1@ibm.com>
7 days agoMerge pull request #68343 from gbregman/main
Gil Bregman [Sun, 12 Apr 2026 19:41:32 +0000 (22:41 +0300)]
Merge pull request #68343 from gbregman/main

mgr/dashboard: Add location to gateway info command in NVMeoF CLI