]> git-server-git.apps.pok.os.sepia.ceph.com Git - ceph.git/log
ceph.git
9 days agoosd/test: Add EC peering test infrastructure and recovery test cases 68697/head
Alex Ainscow [Mon, 27 Apr 2026 13:24:45 +0000 (14:24 +0100)]
osd/test: Add EC peering test infrastructure and recovery test cases

This commit enhances the EC peering test framework and adds test cases
for erasure-coded pool recovery scenarios:

NOTE: Many of the tests cases are disabled as they recreate certain
problems. Later commits will enable these tests and fix the production
issues, but under different PRs.

Test Infrastructure Improvements:
- Add MockStore wrapper with read error injection capabilities for testing
  error handling in EC recovery
- Enhance ECPeeringTestFixture with recovery callback verification
- Add support for pg_upmap to better simulate OSD placement
- Implement write_attribute() for testing partial vs full stripe writes
- Add read_shard_object_info() to verify on-disk version consistency
- Improve logging with missing object stats (m=, u=, mbc=)
- Add support for doing object recovery in Fast EC.
- Add set_config() helper for runtime configuration changes
- Preserve xinfo features when marking OSDs up/down
- Fix pg_temp handling for EC pools with optimizations

Mock Object Enhancements:
- Update MockPGBackendListener with recovery callback tracking
- Add on_local_recover, on_peer_recover, on_global_recover tracking
- Implement proper stats publishing (pg_stats_publish)
- Add is_missing_object() implementation
- Enhance should_send_op() with async_recovery_target logic
- Add apply_stats() to update PeeringState statistics

Test Cases Added:
- ECRecoveryTest: Verifies recovery with missing objects after OSD failure
- ECSequentialOSDFailoverTest: Tests sequential OSD failure/recovery cycles
- MultiObjectRecoveryReadCrash: Reproduces bug #75432 (multi-object reads)
- RollbackVersionMismatch: Reproduces bug #76213 (version mismatch)
- RollbackAfterMixedBlockedWrites: Reproduces bug #75211 (rollback issues)

These tests validate EC recovery mechanisms including:
- Object version tracking across shards
- Recovery callback invocation (local, peer, global)
- Handling of read errors during recovery
- Rollback behavior after blocked writes
- Multi-object recovery with partial failures

Assisted-by: IBM Bob, using Claude Sonnet
Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
11 days agoMerge pull request #68540 from nbalacha/wip-nbalacha-76206
Yuval Lifshitz [Tue, 28 Apr 2026 07:35:08 +0000 (10:35 +0300)]
Merge pull request #68540 from nbalacha/wip-nbalacha-76206

rgw/bucket-logging: handle SigV2 presigned URLs

11 days agoMerge pull request #68424 from NitzanMordhai/wip-nitzan-rados-perf-test-epel10-pdsh...
Shraddha Agrawal [Tue, 28 Apr 2026 06:29:05 +0000 (11:59 +0530)]
Merge pull request #68424 from NitzanMordhai/wip-nitzan-rados-perf-test-epel10-pdsh-missing

qa/tasks/cbt: install pdsh from el9 RPMs on el10 systems

11 days agoMerge pull request #68046 from rhcs-dashboard/fix-service-name
Afreen Misbah [Mon, 27 Apr 2026 20:49:05 +0000 (02:19 +0530)]
Merge pull request #68046 from rhcs-dashboard/fix-service-name

mgr/dashboard: Fix service name disablemnt in servcie form UI

Reviewed-by: Nizamudeen A <nia@redhat.com>
11 days agoMerge pull request #68416 from aclamk/aclamk-fcm-health-warnings
Jaya Prakash [Mon, 27 Apr 2026 19:51:06 +0000 (01:21 +0530)]
Merge pull request #68416 from aclamk/aclamk-fcm-health-warnings

bluestore, extblkdev: Now plugins can raise health warnings

Reviewed-by: Igor Fedotov <igor.fedotov@croit.io>
11 days agoMerge pull request #68376 from aclamk/aclamk-rocksdb-prio-fix
Jaya Prakash [Mon, 27 Apr 2026 19:50:28 +0000 (01:20 +0530)]
Merge pull request #68376 from aclamk/aclamk-rocksdb-prio-fix

kv/rocksdb: Fix priority of rocksdb cache perf counters

Reviewed-by: Jaya Prakash <jayaprakash@ibm.com>
12 days agoMerge pull request #68578 from phlogistonjohn/jjm-codeowners-smb
Dan Mick [Mon, 27 Apr 2026 16:59:28 +0000 (09:59 -0700)]
Merge pull request #68578 from phlogistonjohn/jjm-codeowners-smb

CODEOWNERS: add an smb group for various smb related files

12 days agoMerge pull request #68278 from cheese-cakee/wip-rgw-inline-help-rm-delete
Yuval Lifshitz [Mon, 27 Apr 2026 15:55:44 +0000 (18:55 +0300)]
Merge pull request #68278 from cheese-cakee/wip-rgw-inline-help-rm-delete

rgw: align period/zone commands in radosgw-admin help

12 days agoMerge pull request #68228 from cheese-cakee/wip-doc-fix-period-zone-commands
Yuval Lifshitz [Mon, 27 Apr 2026 15:55:28 +0000 (18:55 +0300)]
Merge pull request #68228 from cheese-cakee/wip-doc-fix-period-zone-commands

doc: fix period and zone command names in radosgw-admin man page

12 days agoMerge pull request #68570 from rkachach/fix_issue_adding_coverage_target
Redouane Kachach [Mon, 27 Apr 2026 15:30:30 +0000 (17:30 +0200)]
Merge pull request #68570 from rkachach/fix_issue_adding_coverage_target

mgr/tox: adding coverage target to the main tox.ini

Reviewed-by: Shweta Bhosale <Shweta.Bhosale1@ibm.com>
12 days agoMerge pull request #68543 from myoungwon/wip-myoungwon-email
Anthony M [Mon, 27 Apr 2026 15:01:24 +0000 (10:01 -0500)]
Merge pull request #68543 from myoungwon/wip-myoungwon-email

doc: update governance.rst to change email address

12 days agoMerge pull request #68595 from guits/cv-fcm-setkeepcaps
Guillaume Abrioux [Mon, 27 Apr 2026 14:19:37 +0000 (16:19 +0200)]
Merge pull request #68595 from guits/cv-fcm-setkeepcaps

ceph-volume: pass --set-keepcaps for FCM crush device class on mkfs

12 days agoMerge pull request #68429 from cbodley/wip-74595
Casey Bodley [Mon, 27 Apr 2026 13:14:56 +0000 (09:14 -0400)]
Merge pull request #68429 from cbodley/wip-74595

rgw/sns: ListTopics uses account root arn for policy evaluation

Reviewed-by: Yuval Lifshitz <ylifshit@redhat.com>
12 days agoMerge pull request #68550 from shraddhaag/wip-shraddhaag-block-size-error
Shraddha Agrawal [Mon, 27 Apr 2026 13:02:28 +0000 (18:32 +0530)]
Merge pull request #68550 from shraddhaag/wip-shraddhaag-block-size-error

src/crimson/seastore: improve block size validation assert message

12 days agoMerge pull request #68407 from knrt10/crimson-rgw-lifecycle
Kautilya Tripathi [Mon, 27 Apr 2026 12:07:34 +0000 (17:37 +0530)]
Merge pull request #68407 from knrt10/crimson-rgw-lifecycle

qa: Port rgw lifecycle test to crimson-rados

12 days agoMerge pull request #68618 from oduameh/wip-doc-nvmeof-subcommands
Ilya Dryomov [Mon, 27 Apr 2026 09:20:33 +0000 (11:20 +0200)]
Merge pull request #68618 from oduameh/wip-doc-nvmeof-subcommands

doc: document ceph nvmeof CLI subcommands for target configuration

Reviewed-by: Aviv Caro <Aviv.Caro@ibm.com>
12 days agoMerge pull request #68338 from fivetime/ceph-volume-raw-osd-fsid
Guillaume Abrioux [Mon, 27 Apr 2026 08:29:04 +0000 (10:29 +0200)]
Merge pull request #68338 from fivetime/ceph-volume-raw-osd-fsid

ceph-volume: add --osd-fsid support to raw mode prepare

12 days agoMerge pull request #66586 from ShwetaBhosale1/execute_command_retry_logic_if_command_...
Redouane Kachach [Mon, 27 Apr 2026 08:25:24 +0000 (10:25 +0200)]
Merge pull request #66586 from ShwetaBhosale1/execute_command_retry_logic_if_command_fails_with_connection_error

mgr/cephadm: Added retry logic for execute command if command fails with connection error

Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: Redouane Kachach <rkachach@ibm.com>
12 days agoMerge pull request #67651 from ShwetaBhosale1/fix_issue_75189_nfs_rdma_support
Redouane Kachach [Mon, 27 Apr 2026 08:20:18 +0000 (10:20 +0200)]
Merge pull request #67651 from ShwetaBhosale1/fix_issue_75189_nfs_rdma_support

mgr/cephadm: nfs rdma support

Reviewed-by: Redouane Kachach <rkachach@ibm.com>
12 days agoMerge pull request #68152 from ShwetaBhosale1/fix_issue_75704_make_haproxy_peer_confi...
Redouane Kachach [Mon, 27 Apr 2026 08:19:13 +0000 (10:19 +0200)]
Merge pull request #68152 from ShwetaBhosale1/fix_issue_75704_make_haproxy_peer_configuration_port_configurable

mgr/cephadm: Provide option to configure custom haproxy peer communication port

Reviewed-by: Kushal Deb <Kushal.Deb@ibm.com>
12 days agoMerge pull request #67906 from ShwetaBhosale1/update_nfs
Redouane Kachach [Mon, 27 Apr 2026 08:17:58 +0000 (10:17 +0200)]
Merge pull request #67906 from ShwetaBhosale1/update_nfs

Bump NFS Version in Ceph

Reviewed-by: Shubha Jain <SHUBHA.JAIN1@ibm.com>
12 days agoMerge pull request #68549 from afreen23/service-form-fix
Afreen Misbah [Mon, 27 Apr 2026 08:07:26 +0000 (13:37 +0530)]
Merge pull request #68549 from afreen23/service-form-fix

Reviewed-by: Nizamudeen A <nia@redhat.com>
Reviewed-by: Sagar Gopale <sagar.gopale@ibm.com>
12 days agoMerge pull request #66331 from ujjawal-ibm/daemon-upgrade-issue-with-space
Redouane Kachach [Mon, 27 Apr 2026 08:03:54 +0000 (10:03 +0200)]
Merge pull request #66331 from ujjawal-ibm/daemon-upgrade-issue-with-space

mgr/cephadm: Handle whitespace in upgrade daemon types

Reviewed-by: Adam King <adking@redhat.com>
12 days agoMerge pull request #68164 from phlogistonjohn/jjm-cephadm-smb-deps-action
Redouane Kachach [Mon, 27 Apr 2026 07:51:58 +0000 (09:51 +0200)]
Merge pull request #68164 from phlogistonjohn/jjm-cephadm-smb-deps-action

cephadm: add get_dependencies to smb service class

Reviewed-by: Redouane Kachach <rkachach@ibm.com>
12 days agoMerge pull request #68608 from tchaikov/wip-minimal-release
Kefu Chai [Mon, 27 Apr 2026 06:01:12 +0000 (14:01 +0800)]
Merge pull request #68608 from tchaikov/wip-minimal-release

doc/rados: document require_osd_release and require_min_compat_client

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
Reviewed-by: Ville Ojamo <git2233+ceph@ojamo.eu>
12 days agoqa: Port rgw lifecycle test to crimson-rados 68407/head
Kautilya Tripathi [Tue, 9 Dec 2025 02:25:35 +0000 (02:25 +0000)]
qa: Port rgw lifecycle test to crimson-rados

This adds lifecycle qa tests of rgw to crimson-rados suite

Signed-off-by: Kautilya Tripathi <kautilya.tripathi@ibm.com>
12 days agoMerge pull request #67372 from knrt10/wip-crimson-rgw-multifs
Kautilya Tripathi [Mon, 27 Apr 2026 04:11:14 +0000 (09:41 +0530)]
Merge pull request #67372 from knrt10/wip-crimson-rgw-multifs

qa: Port rgw multifs test to crimson-rados

12 days agoMerge pull request #68341 from ronen-fr/wip-rf-dev3-crimson
Ronen Friedman [Mon, 27 Apr 2026 04:09:55 +0000 (07:09 +0300)]
Merge pull request #68341 from ronen-fr/wip-rf-dev3-crimson

crimson/osd: move to a unified on-device super-block structure

Reviewed-by: Matan Breizman <mbreizma@redhat.com>
12 days agodoc/rados: document require_osd_release 68608/head
Kefu Chai [Fri, 24 Apr 2026 13:41:29 +0000 (21:41 +0800)]
doc/rados: document require_osd_release

Add a standalone concept page for the OSDMap require_osd_release field,
the upgrade-gate counterpart to require_min_compat_client. Cover:

 - how to set it and how to check it;
 - the full set of pre-commit guards the monitor runs, rendered as a
   table with each guard's error text and bypass status;
 - which commands and features become available as the flag is raised,
   per release;
 - the OSD boot window that refuses OSDs more than two releases ahead
   of the flag;
 - the OSD_UPGRADE_FINISHED health warning that prompts admins to set
   the flag after an upgrade;
 - the initial value on new clusters and the two mon_debug_* knobs
   that override it for testing.

Also cross-link the new page from the related-flags table on
require-min-compat-client.rst, and from the rados operations index.

Signed-off-by: Kefu Chai <k.chai@proxmox.com>
12 days agodoc/rados: document require_min_compat_client
Kefu Chai [Fri, 24 Apr 2026 13:16:00 +0000 (21:16 +0800)]
doc/rados: document require_min_compat_client

Add a standalone concept page for the OSDMap require_min_compat_client
field, covering: how to set and check it, the non-monotonic lowering
behavior (with the features-in-use floor derived from
OSDMap::get_min_compat_client()), and the operator commands it gates.
Include tables for the floor-pinning features and the flag-gated
commands, so operators can reason about transitions without reading
OSDMonitor.cc.

Cross-reference to the CephFS per-filesystem required_client_features
mechanism, which is the MDSMap-side equivalent for client-protocol
features. Add an anchor on the existing CephFS Required Client Features
section so the cross-reference resolves.

Link the new page from the rados operations index.

Signed-off-by: Kefu Chai <k.chai@proxmox.com>
13 days agoMerge pull request #68394 from yuvalif/wip-yuval-75928
Yuval Lifshitz [Sun, 26 Apr 2026 14:38:24 +0000 (17:38 +0300)]
Merge pull request #68394 from yuvalif/wip-yuval-75928

rgw/kafka: setting maximum batch size when sending messages

13 days agodoc: document ceph nvmeof CLI subcommands for target configuration 68618/head
oduameh [Sun, 26 Apr 2026 13:22:55 +0000 (14:22 +0100)]
doc: document ceph nvmeof CLI subcommands for target configuration

Replaces verbose podman run container commands with native ceph nvmeof
CLI subcommands. The nvmeof-cli container approach is preserved as an
alternative in a note block, with a clarification that its option names
differ from the ceph nvmeof CLI.

Fixes: https://tracker.ceph.com/issues/76042
Signed-off-by: Emmanuel Ameh <oduameh@users.noreply.github.com>
Signed-off-by: oduameh <oduameh@users.noreply.github.com>
13 days agoMerge pull request #68127 from Matan-B/wip-matanb-seastore-conflict-user-lba
Matan Breizman [Sun, 26 Apr 2026 07:36:53 +0000 (10:36 +0300)]
Merge pull request #68127 from Matan-B/wip-matanb-seastore-conflict-user-lba

crimson/os/seastore: Prepare no invalidation to LBA user txns

Reviewed-by: Xuehan Xu <xuxuehan@qianxin.com>
13 days agotest/rgw/kafka: make batch size tests manual 68394/head
Yuval Lifshitz [Sun, 26 Apr 2026 07:15:03 +0000 (07:15 +0000)]
test/rgw/kafka: make batch size tests manual

currently failing in teuthology

Signed-off-by: Yuval Lifshitz <ylifshit@ibm.com>
13 days agorgw/kafka: setting maximum batch size when sending messages
Yuval Lifshitz [Mon, 20 Apr 2026 14:06:25 +0000 (14:06 +0000)]
rgw/kafka: setting maximum batch size when sending messages

Fixes: https://tracker.ceph.com/issues/75928
Signed-off-by: Yuval Lifshitz <ylifshit@ibm.com>
2 weeks agoMerge PR #68582 into main
Patrick Donnelly [Sat, 25 Apr 2026 14:05:32 +0000 (19:35 +0530)]
Merge PR #68582 into main

* refs/pull/68582/head:
script/ptl-tool: improve help messages during startup

Reviewed-by: John Mulligan <jmulligan@redhat.com>
Reviewed-by: Yaarit Hatuka <yaarithatuka@gmail.com>
2 weeks agoMerge pull request #68606 from tchaikov/wip-doc-silence-warning
Kefu Chai [Sat, 25 Apr 2026 05:06:44 +0000 (13:06 +0800)]
Merge pull request #68606 from tchaikov/wip-doc-silence-warning

doc/scripts: use raw string for regex in gen_state_diagram.py

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
2 weeks agoMerge pull request #68366 from ljflores/wip-tracker-74517
Laura Flores [Fri, 24 Apr 2026 19:12:03 +0000 (14:12 -0500)]
Merge pull request #68366 from ljflores/wip-tracker-74517

qa/suites/rados: temporarily disable ceph-post-file test

Reviewed-by: Sridhar Seshasayee <Sridhar.Seshasayee@ibm.com>
2 weeks agodoc/scripts: use raw string for regex in gen_state_diagram.py 68606/head
Kefu Chai [Fri, 24 Apr 2026 13:34:43 +0000 (21:34 +0800)]
doc/scripts: use raw string for regex in gen_state_diagram.py

Python 3.12 emits SyntaxWarning for invalid escape sequences in ordinary
string literals. The re.search() call on line 162 was the only pattern
in the file passed as a non-raw string, causing:

  doc/scripts/gen_state_diagram.py:162: SyntaxWarning: invalid escape
    sequence '\s'
    i = re.search("return\s+transit<\s*(\w*)\s*>()", line)

Add the r"" prefix to match the other re.search / re.finditer / re.sub
call sites in the same file. No behavior change; \s was already being
interpreted as a regex whitespace class because Python leaves unknown
escapes untouched, but this will become a SyntaxError in a future
Python release.

Signed-off-by: Kefu Chai <k.chai@proxmox.com>
2 weeks agoMerge pull request #68576 from rhcs-dashboard/fix-76239-main
Aashish Sharma [Fri, 24 Apr 2026 12:14:18 +0000 (17:44 +0530)]
Merge pull request #68576 from rhcs-dashboard/fix-76239-main

mr/dashboard: remove rgw_servers filter from radosgw-sync-overview grafana dashboard

Reviewed-by: Afreen Misbah <afreen@ibm.com>
2 weeks agoceph-volume: pass --set-keepcaps for FCM crush device class on mkfs 68595/head
Guillaume Abrioux [Fri, 24 Apr 2026 09:08:09 +0000 (11:08 +0200)]
ceph-volume: pass --set-keepcaps for FCM crush device class on mkfs

When preparing the ceph-osd --mkfs command for BlueStore OSDs, add
--set-keepcaps true if crush_device_class is fcm.

Fixes: https://tracker.ceph.com/issues/76252
Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>
2 weeks agoMerge pull request #68483 from rhcs-dashboard/fix-smb-mgrdb
Pedro Gonzalez Gomez [Fri, 24 Apr 2026 08:48:36 +0000 (10:48 +0200)]
Merge pull request #68483 from rhcs-dashboard/fix-smb-mgrdb

mgr: fix continous smb MgrDBNotReady

Reviewed-by: John Mulligan <jmulligan@redhat.com>
2 weeks agoceph-volume: add tests for raw prepare --osd-fsid 68338/head
Guillaume Abrioux [Fri, 24 Apr 2026 08:19:11 +0000 (10:19 +0200)]
ceph-volume: add tests for raw prepare --osd-fsid

Add unit tests to cover the raw prepare help text for --osd-fsid,
assert generate_uuid is used when no osd_fsid is supplied and
assert an externally provided osd_fsid is passed through to
create_id without generating a new UUID.

Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>
2 weeks agoceph-volume: add --osd-fsid support to raw mode prepare
Duanming Zhou [Sun, 12 Apr 2026 07:40:40 +0000 (07:40 +0000)]
ceph-volume: add --osd-fsid support to raw mode prepare

The LVM mode already supports --osd-fsid to allow external tools
(e.g., Kubernetes operators) to pre-register an OSD ID+UUID via
"ceph osd new" and then pass both to ceph-volume, ensuring the
operator retains full control of the OSD ID lifecycle and can
reliably clean up on prepare failure (no orphan OSDs).

The raw mode was missing this support: prepare() unconditionally
called system.generate_uuid(), ignoring any --osd-fsid value.
When an operator pre-registered osd.N with uuid_A and then ran
"ceph-volume raw prepare --osd-id N --dmcrypt", ceph-volume
generated uuid_B internally and called "ceph osd new uuid_B N",
which failed with EINVAL because the ID was already registered
with a different UUID.

This commit:
- Adds --osd-fsid argument to the raw mode argument parser
  (devices/raw/common.py), consistent with the LVM mode.
- Changes raw.prepare() to honor an externally provided osd_fsid,
  falling back to generate_uuid() only when none is given
  (objectstore/raw.py), consistent with the LVM mode.

Signed-off-by: Duanming Zhou <zhouduanming@gmail.com>
2 weeks agocrimson/tools: add dump-superblock command to crimson_objectstore_tool 68341/head
Ronen Friedman [Mon, 30 Mar 2026 06:55:58 +0000 (06:55 +0000)]
crimson/tools: add dump-superblock command to crimson_objectstore_tool

We now also check magic before attempting full decode
so that non-crimson or corrupt devices produce a clean
error instead of a decode exception.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
2 weeks agocrimson/tests: fix unitest-seastore-randomblock-manager
Ronen Friedman [Mon, 16 Mar 2026 13:47:27 +0000 (13:47 +0000)]
crimson/tests: fix unitest-seastore-randomblock-manager

test following a change to the device header block.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
2 weeks agocrimson/osd: use a unified super-block for devices
Ronen Friedman [Mon, 9 Mar 2026 17:23:18 +0000 (17:23 +0000)]
crimson/osd: use a unified super-block for devices

This commit refactors the on-hardware super-block structure
used by the seastore to a unified format that
can accommodate all three device types (HDD, ZBD, RBM).

All devices now have a 60 bytes header at address 0,
similar to the existing BlueStore layout. A 23-byte magic
string ("CRIMSON_DEVICE") is placed at the beginning of
the header, followed by 37 bytes of null padding (to
match the existing 60 bytes of the super-block), and
then the DENC-encoded device_superblock_t structure starting
at offset 60.

A unified device_config_t is now used for all device types.

The per-shard data structure is also unified, now including a union
of all relevant fields for each device type.

We are also adding a check for the super-block magic value in the
RBMDevice::read_rbm_superblock() method, similar to the existing check
in SegmentManager::read_segment_manager_superblock().

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
2 weeks agorgw/bucket-logging: handle SigV2 presigned URLs 68540/head
Nithya Balachandran [Wed, 22 Apr 2026 09:31:34 +0000 (09:31 +0000)]
rgw/bucket-logging: handle SigV2 presigned URLs

Presigned URLs using SigV2 do not contain x-amz-credential causing the
log record field Authentication Type to be incorrectly set to '-'.
This has been fixed to check for the presence of the x-amz-expires and Expires
parameters instead.

Fixes: https://tracker.ceph.com/issues/76206
Signed-off-by: Nithya Balachandran <nithya.balachandran@ibm.com>
2 weeks agoMerge PR #68555 into main
Patrick Donnelly [Thu, 23 Apr 2026 23:42:05 +0000 (19:42 -0400)]
Merge PR #68555 into main

* refs/pull/68555/head:
script/redmine-upkeep: resolve comment misformatting

Reviewed-by: John Mulligan <jmulligan@redhat.com>
2 weeks agoscript/ptl-tool: improve help messages during startup 68582/head
Patrick Donnelly [Thu, 23 Apr 2026 23:40:13 +0000 (19:40 -0400)]
script/ptl-tool: improve help messages during startup

For newer folks who've never run it before.

Signed-off-by: Patrick Donnelly <pdonnell@ibm.com>
2 weeks agoCODEOWNERS: add an smb group for various smb related files 68578/head
John Mulligan [Thu, 23 Apr 2026 21:37:28 +0000 (17:37 -0400)]
CODEOWNERS: add an smb group for various smb related files

Add a new smb group that covers parts of orch that manage smb as well as
the cephfs proxy. This will help automatically notify smb focused devs
on PRs.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
2 weeks agoMerge pull request #68414 from guits/has_bs_repl
Guillaume Abrioux [Thu, 23 Apr 2026 20:16:28 +0000 (22:16 +0200)]
Merge pull request #68414 from guits/has_bs_repl

ceph-volume: has_bluestore_label checks all bluestore label replica o…

2 weeks agoMerge pull request #66704 from ivancich/wip-document-object-manifest-cmd
J. Eric Ivancich [Thu, 23 Apr 2026 18:31:19 +0000 (14:31 -0400)]
Merge pull request #66704 from ivancich/wip-document-object-manifest-cmd

rgw: add internal docs for `radosgw-admin object manifest ...` sub-command

Reviewed-by: Casey Bodley <cbodley@redhat.com>
2 weeks agomr/dashboard: remove rgw_servers filter from radosgw-sync-overview grafana dashboard 68576/head
Aashish Sharma [Thu, 23 Apr 2026 16:17:41 +0000 (21:47 +0530)]
mr/dashboard: remove rgw_servers filter from radosgw-sync-overview grafana dashboard

Fixes: https://tracker.ceph.com/issues/76239
Signed-off-by: Aashish Sharma <aasharma@redhat.com>
2 weeks agoMerge pull request #66432 from stzuraski898/unittests-mgr
stzuraski898 [Thu, 23 Apr 2026 14:52:23 +0000 (09:52 -0500)]
Merge pull request #66432 from stzuraski898/unittests-mgr

test/mgr: Improve unit-test coverage for ceph-mgr

Reviewed by: Nitzan Mordechai <nmordech@ibm.com>
Reviewed by: Radoslaw Zarzynski <rzarzyns@redhat.com>
Reviewed by: Ronen Friedman <rfriedma@redhat.com>

2 weeks agoMerge pull request #68472 from yuvalif/wip-yuval-76127
Yuval Lifshitz [Thu, 23 Apr 2026 14:04:17 +0000 (17:04 +0300)]
Merge pull request #68472 from yuvalif/wip-yuval-76127

test/rgw/kafka: cleanup kafka tgz file

2 weeks agoqa/cephadm: zap raw devices before OSD deployment 68414/head
Guillaume Abrioux [Wed, 22 Apr 2026 21:02:10 +0000 (23:02 +0200)]
qa/cephadm: zap raw devices before OSD deployment

follow up on commit f3c938f5032885434dd2ab77f9b2e6bc05917f3c

```
2026-04-22T10:16:21.350 DEBUG:teuthology.orchestra.run.trial118:> sudo ceph-bluestore-tool zap-device --dev /dev/nvme3n1 --yes-i-really-really-mean-it
2026-04-22T10:16:21.358 INFO:teuthology.orchestra.run.trial118.stderr:sudo: ceph-bluestore-tool: command not found
```

ceph-bluestore-tool is not available on the host and must be run from within a container.

Instead, use `ceph-volume lvm zap` which internally calls `ceph-bluestore-tool zap-device`.

Fixes: https://tracker.ceph.com/issues/76238
Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>
2 weeks agoFix unit test 68549/head
Afreen Misbah [Thu, 23 Apr 2026 13:23:25 +0000 (18:53 +0530)]
Fix unit test

Signed-off-by: Afreen Misbah <afreen@ibm.com>
2 weeks agomgr/tox: adding coverage target to the main tox.ini 68570/head
Redouane Kachach [Thu, 23 Apr 2026 11:15:19 +0000 (13:15 +0200)]
mgr/tox: adding coverage target to the main tox.ini

Let's add a coverage target so we can measure this metric for mgr modules.

Usage examples:

Run coverage for a single module (percentages only):
  tox -e coverage -- cephadm
  tox -e coverage -- smb

Show uncovered line numbers:
  COVERAGE_REPORT=term-missing tox -e coverage -- cephadm

Generate a visual HTML report:
  COVERAGE_REPORT=html tox -e coverage -- cephadm
  firefox htmlcov/index.html

Signed-off-by: Redouane Kachach <rkachach@ibm.com>
2 weeks agoMerge pull request #68541 from rhcs-dashboard/rgw-nfs-export-path-issue
Afreen Misbah [Thu, 23 Apr 2026 11:42:33 +0000 (17:12 +0530)]
Merge pull request #68541 from rhcs-dashboard/rgw-nfs-export-path-issue

mgr/dashboard: Difference in "path" value observed when rgw user level export created via dashboard vs cli

Reviewed-by: Afreen Misbah <afreen@ibm.com>
Reviewed-by: Abhishek Desai <abhishek.desai1@ibm.com>
2 weeks agomgr/dashboard: Nvmeof gateway group should account for labels
Afreen Misbah [Thu, 23 Apr 2026 10:42:26 +0000 (16:12 +0530)]
mgr/dashboard: Nvmeof gateway group should account for labels

- updates gateway group sections to account for placement label

Signed-off-by: Afreen Misbah <afreen@ibm.com>
2 weeks agoMerge PR #65656 into main
Venky Shankar [Thu, 23 Apr 2026 10:19:55 +0000 (15:49 +0530)]
Merge PR #65656 into main

* refs/pull/65656/head:
client: do not allow zero‑length reads
src/test: test zero-length async-fsync read using ceph_ll_nonblocking_readv_writev
src/test: test zero-length async-fsync read using ll_preadv_pwritev

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Christopher Hoffman <choffman@redhat.com>
2 weeks agocrimson/os/seastore: introduce stage_visibility_handoff 68127/head
Matan Breizman [Tue, 31 Mar 2026 09:30:25 +0000 (12:30 +0300)]
crimson/os/seastore: introduce stage_visibility_handoff

Generalize prepare_rewrite_publish_to_prior() into stage_visibility_handoff().

* introduce should_use_no_conflict_publish

* Replace is_rewrite_transaction() checks with should_use_no_conflict_publish(),
  so adding new no-conflict users becomes straightforward.

* Stop committing metadata (commit_state + sync_checksum) during prepare_record()
  (pre-commit). While it is correct for rewrite, doing it pre-commit doesn't buy
  us anything today because readers are still blocked until the publish finishes.

  Moving metadata commit to the after commit phase would also make future
  non-rewrite users easier to support.

This is a prep step for expanding no-conflict publish coverage.

Signed-off-by: Matan Breizman <mbreizma@redhat.com>
2 weeks agoMerge pull request #68562 from bluikko/wip-doc-sphinx-warnings-202604b 60683/head
bluikko [Thu, 23 Apr 2026 09:21:28 +0000 (16:21 +0700)]
Merge pull request #68562 from bluikko/wip-doc-sphinx-warnings-202604b

doc: fix Sphinx complaints

2 weeks agoMerge pull request #68529 from rhcs-dashboard/fix-step-3
Afreen Misbah [Thu, 23 Apr 2026 08:51:41 +0000 (14:21 +0530)]
Merge pull request #68529 from rhcs-dashboard/fix-step-3

mgr/dashboard: Allow moving to next when Bidirectional mode is chosen

Reviewed-by: Nizamudeen A <nia@redhat.com>
2 weeks agodoc: fix Sphinx complaints 68562/head
Ville Ojamo [Thu, 23 Apr 2026 07:55:38 +0000 (14:55 +0700)]
doc: fix Sphinx complaints

Fix section title underlines in ec_stretch_cluster.rst.
Fix table that was not rendered at all in lua-scripting.rst.

Signed-off-by: Ville Ojamo <git2233+ceph@ojamo.eu>
2 weeks agoMerge pull request #68469 from VallariAg/fix-top-image-name
Vallari Agrawal [Thu, 23 Apr 2026 07:12:48 +0000 (12:42 +0530)]
Merge pull request #68469 from VallariAg/fix-top-image-name

mgr/dashboard: show rados ns in 'ceph nvmeof top io'

2 weeks agoMerge pull request #68535 from bluikko/wip-doc-rados-troubleshooting-mon-backticks
bluikko [Thu, 23 Apr 2026 03:39:15 +0000 (10:39 +0700)]
Merge pull request #68535 from bluikko/wip-doc-rados-troubleshooting-mon-backticks

doc/rados: fix markup in troubleshooting-mon.rst

2 weeks agomgr/dashboard: Persist labels and hosts in service form
Afreen Misbah [Thu, 23 Apr 2026 01:09:29 +0000 (06:39 +0530)]
mgr/dashboard: Persist labels and hosts in service form

Fixes https://tracker.ceph.com/issues/75280

Signed-off-by: Afreen Misbah <afreen@ibm.com>
2 weeks agomgr/dashboard: Fix service form to take into account labels
Afreen Misbah [Wed, 22 Apr 2026 13:06:21 +0000 (18:36 +0530)]
mgr/dashboard: Fix service form to take into account labels

Fixes https://tracker.ceph.com/issues/76215

- placement label for services takes only a single label
- when multiple labels or an array of labels is passed the service was not getting created

Signed-off-by: Afreen Misbah <afreen@ibm.com>
2 weeks agoMerge pull request #68547 from rhcs-dashboard/encryption-fix
Afreen Misbah [Thu, 23 Apr 2026 00:01:56 +0000 (05:31 +0530)]
Merge pull request #68547 from rhcs-dashboard/encryption-fix

mgr/dashboard: Encryption key not being passed in gateway group form

Reviewed-by: Nizamudeen A <nia@redhat.com>
2 weeks agoMerge pull request #68531 from rhcs-dashboard/fix-step-2
Afreen Misbah [Wed, 22 Apr 2026 23:58:23 +0000 (05:28 +0530)]
Merge pull request #68531 from rhcs-dashboard/fix-step-2

mgr/dashboard: Validate no added hosts in second step

Reviewed-by: Nizamudeen A <nia@redhat.com>
2 weeks agoMerge pull request #68525 from rhcs-dashboard/y-axis-are
Afreen Misbah [Wed, 22 Apr 2026 23:57:34 +0000 (05:27 +0530)]
Merge pull request #68525 from rhcs-dashboard/y-axis-are

mgr/dashboard: Round off y-axis value of area chart

Reviewed-by: Nizamudeen A <nia@redhat.com>
2 weeks agotest/cli/radosgw-admin: align help golden with period/zone delete 68278/head
cheese-cakee [Wed, 22 Apr 2026 22:55:07 +0000 (04:25 +0530)]
test/cli/radosgw-admin: align help golden with period/zone delete

Update help.t expected output to match the inline help text updated in this PR: period rm -> period delete and zone rm -> zone delete.\n\nThis keeps the CLI golden test consistent with radosgw-admin --help output and addresses make check failures for this branch.

Signed-off-by: cheese-cakee <farzanaman99@gmail.com>
2 weeks agoceph-volume: has_bluestore_label checks all bluestore label replica offsets
Guillaume Abrioux [Thu, 16 Apr 2026 07:29:23 +0000 (09:29 +0200)]
ceph-volume: has_bluestore_label checks all bluestore label replica offsets

BlueStore replicates the block device label at fixed offsets (0 and
multiples of 1Gb up to 1000gb). has_bluestore_label() only read the
first 22 bytes, so disks with a wiped primary label but intact
replicas are missed.

with this commit, has_bluestore_label() scans each known offset with
seek/read and compares the ASCII prefix as bytes.

Fixes: https://tracker.ceph.com/issues/76060
Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>
2 weeks agoscript/redmine-upkeep: resolve comment misformatting 68555/head
Patrick Donnelly [Wed, 22 Apr 2026 17:43:26 +0000 (13:43 -0400)]
script/redmine-upkeep: resolve comment misformatting

See for example [1].

[1] https://github.com/ceph/ceph/pull/67343#issuecomment-4298533030

Signed-off-by: Patrick Donnelly <pdonnell@ibm.com>
2 weeks agodoc/rados: fix markup in troubleshooting-mon.rst 68535/head
Ville Ojamo [Wed, 22 Apr 2026 05:01:42 +0000 (12:01 +0700)]
doc/rados: fix markup in troubleshooting-mon.rst

Double backticks do not stack with strong emphasis.
Use automatic emphasis instead.

Signed-off-by: Ville Ojamo <git2233+ceph@ojamo.eu>
2 weeks agoMerge pull request #67019 from Jayaprakash-ibm/wip-object-fragmentation
Jaya Prakash [Wed, 22 Apr 2026 16:53:36 +0000 (22:23 +0530)]
Merge pull request #67019 from Jayaprakash-ibm/wip-object-fragmentation

os/bluestore: add static and runtime object fragmentation tracking

Reviewed-by: Adam Kupczyk <akupczyk@ibm.com>
2 weeks agoMerge pull request #68183 from rhcs-dashboard/set-motd
Aashish Sharma [Wed, 22 Apr 2026 16:43:47 +0000 (22:13 +0530)]
Merge pull request #68183 from rhcs-dashboard/set-motd

mgr/dashboard: Add option to set motd via api

Reviewed-by: Nizamudeen A <nia@redhat.com>
2 weeks agoMerge pull request #68467 from ujjawal-ibm/filter-virtual-ram-devices
Guillaume Abrioux [Wed, 22 Apr 2026 15:28:25 +0000 (17:28 +0200)]
Merge pull request #68467 from ujjawal-ibm/filter-virtual-ram-devices

ceph-volume: skip /dev/ram* devices in inventory

2 weeks agoMerge pull request #68280 from xhernandez/proxy-fix-fscrypt-buffers
John Mulligan [Wed, 22 Apr 2026 14:51:47 +0000 (10:51 -0400)]
Merge pull request #68280 from xhernandez/proxy-fix-fscrypt-buffers

libcephfs_proxy: fix passed buffers for fscrypt functions

Reviewed-by: John Mulligan <jmulligan@redhat.com>
Reviewed-by: Anoop C S <anoopcs@cryptolab.net>
2 weeks agoos/bluestore: Fix collect_alerts 68416/head
Adam Kupczyk [Thu, 16 Apr 2026 09:38:44 +0000 (09:38 +0000)]
os/bluestore: Fix collect_alerts

In some cases bdev->collect_alerts() was called twice on the same device.
It happened when DB and SLOW are the same device.
Now only called once.

It made no difference when we just set alert string,
but with EXTBLKDEV we append warning message.

Signed-off-by: Adam Kupczyk <akupczyk@ibm.com>
2 weeks agoos/bluestore: extblkdev related health warnings
Adam Kupczyk [Thu, 16 Apr 2026 09:36:03 +0000 (09:36 +0000)]
os/bluestore: extblkdev related health warnings

Replaced start error with health warnings:
EXTBLKDEV: plugin xxx not loaded
EXTBLKDEV: plugin xxx used on mkfs, but now uses plugin yyy

Signed-off-by: Adam Kupczyk <akupczyk@ibm.com>
2 weeks agoextblkdev/fcm: Replace errors with health warning
Adam Kupczyk [Thu, 16 Apr 2026 09:25:19 +0000 (09:25 +0000)]
extblkdev/fcm: Replace errors with health warning

Now plugin does not assert or fail to load,
but instead raises following health warnings:
EXTBLKDEV: multivolume fcm will not work properly
EXTBLKDEV: failed accessing FCM utilization log
EXTBLKDEV: bdev_enable_discard not enabled - free space will leak

Signed-off-by: Adam Kupczyk <akupczyk@ibm.com>
2 weeks agoextblkdev/fcm: Cleanup
Adam Kupczyk [Thu, 16 Apr 2026 09:22:08 +0000 (09:22 +0000)]
extblkdev/fcm: Cleanup

Add `override` to virtual interface functions.

Signed-off-by: Adam Kupczyk <akupczyk@ibm.com>
2 weeks agoextblkdev: Fix preload
Adam Kupczyk [Thu, 16 Apr 2026 09:18:26 +0000 (09:18 +0000)]
extblkdev: Fix preload

One execution path did not save preload_result value.
Fixed now.

Signed-off-by: Adam Kupczyk <akupczyk@ibm.com>
2 weeks agoblk/extblkdev: Health warns from ExtBlkDev plugins
Adam Kupczyk [Wed, 15 Apr 2026 10:59:46 +0000 (10:59 +0000)]
blk/extblkdev: Health warns from ExtBlkDev plugins

Expanded interface by collect_alerts.
Created category "EBD_PROBLEMS" for ExtBlkDev problems.

Signed-off-by: Adam Kupczyk <akupczyk@ibm.com>
2 weeks agoblk: Expand collect_alerts to allow specialization
Adam Kupczyk [Tue, 14 Apr 2026 17:57:42 +0000 (17:57 +0000)]
blk: Expand collect_alerts to allow specialization

Previously we had BlockDevice::collect_alerts that had fixed
implementation.
Expanded BlockDevice::collect_alerts into virtual, so KernelDevice can
override it.

Signed-off-by: Adam Kupczyk <akupczyk@ibm.com>
2 weeks agosrc/crimson/seastore: improve block size validation assert message 68550/head
Shraddha Agrawal [Wed, 22 Apr 2026 13:29:27 +0000 (18:59 +0530)]
src/crimson/seastore: improve block size validation assert message

This commit changes the error message emitted when the device's block
size is lesser than the minimum expected by seastore. This is done to
improve usability and provide an actionable error message.

Fixes: https://tracker.ceph.com/issues/76123
Signed-off-by: Shraddha Agrawal <shraddha.agrawal000@gmail.com>
2 weeks agoMerge pull request #68493 from rhcs-dashboard/76158-rgw-restart-stop-bug
Afreen Misbah [Wed, 22 Apr 2026 12:37:03 +0000 (18:07 +0530)]
Merge pull request #68493 from rhcs-dashboard/76158-rgw-restart-stop-bug

mgr/dashboard : Fix services restart/stop issue

Reviewed-by: Afreen Misbah <afreen@ibm.com>
Reviewed-by: Dnyaneshwari Talwekar <dtalweka@redhat.com>
2 weeks agomgr/dashboard: Difference in "path" value observed when rgw user level export created... 68541/head
Dnyaneshwari Talwekar [Wed, 22 Apr 2026 09:50:18 +0000 (15:20 +0530)]
mgr/dashboard: Difference in "path" value observed when rgw user level export created via dashboard vs cli

Fixes: https://tracker.ceph.com/issues/75425
Signed-off-by: Dnyaneshwari Talwekar <dtalweka@redhat.com>
2 weeks agoMerge pull request #68036 from ifed01/wip-ifed-print-effective-vsel-extra
Igor Fedotov [Wed, 22 Apr 2026 11:33:50 +0000 (14:33 +0300)]
Merge pull request #68036 from ifed01/wip-ifed-print-effective-vsel-extra

os/bluestore: print effective extra in 'bluefs stats' report

Reviewed-by: Adam Kupczyk <akupczyk@ibm.com>
2 weeks agoMerge pull request #68177 from ifed01/wip-ifed-better-ebd-detect
Igor Fedotov [Wed, 22 Apr 2026 11:33:04 +0000 (14:33 +0300)]
Merge pull request #68177 from ifed01/wip-ifed-better-ebd-detect

blk,os/bluestore: do not call extblkdev::detect_device on every

Reviewed-by: Adam Kupczyk <akupczyk@ibm.com>
2 weeks agomgr/dashboard: Encryption key not being passed in gateway group form 68547/head
Afreen Misbah [Wed, 22 Apr 2026 11:13:46 +0000 (16:43 +0530)]
mgr/dashboard: Encryption key not being passed in gateway group form

- encryption_key despite fo being added by user is not added in service spec
- this blocks both bi and uni directional auth

Fixes https://tracker.ceph.com/issues/76214

Signed-off-by: Afreen Misbah <afreen@ibm.com>
2 weeks agomgr/dashboard: Allow moving to next when Bidirectional mode is chosen 68529/head
Afreen Misbah [Tue, 21 Apr 2026 21:48:55 +0000 (03:18 +0530)]
mgr/dashboard: Allow moving to next when Bidirectional mode is chosen

- there is an issue with bi-auth , where the validation is getting empty due to some state refresh
- fixed by removing re-setting of form state

Fixes https://tracker.ceph.com/issues/76194

Signed-off-by: Afreen Misbah <afreen@ibm.com>
2 weeks agoMerge pull request #68353 from Jayaprakash-ibm/wip-clean-discard-warn
Jaya Prakash [Wed, 22 Apr 2026 10:46:42 +0000 (16:16 +0530)]
Merge pull request #68353 from Jayaprakash-ibm/wip-clean-discard-warn

mon: simplify discard queue health warning

Reviewed-by: Adam Kupczyk <akupczyk@ibm.com>
Reviewed-by: Igor Fedotov <igor.fedotov@croit.io>
Reviewed-by: Kefu Chai <tchaikov@gmail.com>
2 weeks agoMerge pull request #68441 from rhcs-dashboard/mgr/dashboard/carbonize-multisite-wizar...
Afreen Misbah [Wed, 22 Apr 2026 10:38:50 +0000 (16:08 +0530)]
Merge pull request #68441 from rhcs-dashboard/mgr/dashboard/carbonize-multisite-wizard-realm-token

mgr/dashboard: mgr/dashboard: Carbonize Realm Name and Token block in Multi-site Replication Wizard

Reviewed-by: Afreen Misbah <afreen@ibm.com>
2 weeks agomgr/dashboard: Fix review page showing auth details when all hosts selected 68531/head
Afreen Misbah [Wed, 22 Apr 2026 00:25:06 +0000 (05:55 +0530)]
mgr/dashboard: Fix review page showing auth details when all hosts selected

Signed-off-by: Afreen Misbah <afreen@ibm.com>
2 weeks agomgr/dashboard: Round off y-axis value of area chart 68525/head
Afreen Misbah [Tue, 21 Apr 2026 20:14:31 +0000 (01:44 +0530)]
mgr/dashboard: Round off y-axis value of area chart

- by default y-axos set to 1 for all
- the value round off for area chart is seperated from y-axis ticks
- also fixes a bug where all IOPS y-ticks being repeated 1,1,0,0

The following values are set for now:

IOPS: valueDecimals=0, axisDecimals=1
Latency: valueDecimals=2, axisDecimals=1
Throughput: valueDecimals=2, axisDecimals=1
Consumption: valueDecimals=2, axisDecimals=1

Fixes https://tracker.ceph.com/issues/76191

Signed-off-by: Afreen Misbah <afreen@ibm.com>
2 weeks agomgr/dashboard: Add option to set motd via api 68183/head
Aashish Sharma [Thu, 2 Apr 2026 13:29:40 +0000 (18:59 +0530)]
mgr/dashboard: Add option to set motd via api

Signed-off-by: Aashish Sharma <aasharma@redhat.com>