git-server-git.apps.pok.os.sepia.ceph.com Git - ceph.git/log
ceph.git
7 weeks agomgr: Bucket scoped OSD upgrades using ok-to-upgrade
Ashwin M. Joshi [Tue, 7 Apr 2026 10:05:10 +0000 (15:35 +0530)]
mgr: Bucket scoped OSD upgrades using ok-to-upgrade

Fixes: https://tracker.ceph.com/issues/75603
Signed-off-by: Ashwin M. Joshi <ashjosh1@in.ibm.com>
 Conflicts:
src/pybind/mgr/orchestrator/module.py

7 weeks agoMerge pull request #68561 from bluikko/wip-doc-rados-troubleshooting-mon-improve
bluikko [Wed, 29 Apr 2026 05:53:02 +0000 (12:53 +0700)]
Merge pull request #68561 from bluikko/wip-doc-rados-troubleshooting-mon-improve

doc/rados: improve troubleshooting-mon.rst

7 weeks agorgw: add RestoreStatus support to object listings 67547/head
Matthew N. Heler [Thu, 26 Feb 2026 01:03:56 +0000 (19:03 -0600)]
rgw: add RestoreStatus support to object listings

S3 clients can request restore status in listing responses through the
x-amz-optional-object-attributes header, but we had no support for it.
This stores the restore state in the bucket index so listings can
include <RestoreStatus> without having to read each object's attrs
individually.

Signed-off-by: Matthew N. Heler <matthew.heler@hotmail.com>
7 weeks agosrc/script: init test_stretch crush_collisions.sh
Kamoltat (Junior) Sirivadhna [Tue, 28 Apr 2026 20:13:10 +0000 (20:13 +0000)]
src/script: init test_stretch crush_collisions.sh

Add script to test for CRUSH retry exhaustion in stretch mode with
2 datacenters. Tests unbiased stretch rules by running multiple
iterations of PG mappings and checking for collisions that exceed
the 50-try limit.

Also add --show-retry-exhaustion flag to crushtool to detect and
report when CRUSH mapping hits the maximum retry limit.

Signed-off-by: Kamoltat (Junior) Sirivadhna <ksirivad@redhat.com>
8 weeks agoscript/ptl-tool: allow PR numbers as GH urls 68655/head
Patrick Donnelly [Tue, 28 Apr 2026 14:55:06 +0000 (10:55 -0400)]
script/ptl-tool: allow PR numbers as GH urls

For easier pasting.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
8 weeks agoscript/ptl-tool: get git dir via git command 68639/head
Patrick Donnelly [Mon, 27 Apr 2026 19:37:43 +0000 (15:37 -0400)]
script/ptl-tool: get git dir via git command

Rather than a manual process.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
8 weeks agoMerge pull request #66258 from aainscow/read_only_execs
Alex Ainscow [Tue, 28 Apr 2026 12:56:07 +0000 (13:56 +0100)]
Merge pull request #66258 from aainscow/read_only_execs

osd/rados/rgw/cephfs: Modernize cls interface with compile time safety

Reviewed-by: Bill Scales <bill_scales@uk.ibm.com>
Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Adam Emerson <aemerson@ibm.com>
8 weeks agoMerge pull request #68026 from rhcs-dashboard/fix-theme
Afreen Misbah [Tue, 28 Apr 2026 12:45:35 +0000 (18:15 +0530)]
Merge pull request #68026 from rhcs-dashboard/fix-theme

mgr/dashboard: Enable gray 10 theme as per carbon standards

Reviewed-by: Abhishek Desai <abhishek.desai1@ibm.com>
8 weeks agoMerge pull request #68577 from cbodley/wip-74398
Casey Bodley [Tue, 28 Apr 2026 12:00:01 +0000 (08:00 -0400)]
Merge pull request #68577 from cbodley/wip-74398

rgw: read_obj_policy() consults s3:prefix when deciding between 403/404

Reviewed-by: Oguzhan Ozmen <oozmen@bloomberg.net>
8 weeks agocrimson/seastore: fixing some 'unused' warnings 67484/head
Ronen Friedman [Tue, 24 Feb 2026 12:59:54 +0000 (12:59 +0000)]
crimson/seastore: fixing some 'unused' warnings

in btree_types compilation.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
8 weeks agomgr/cephadm: replace md5_hash with FIPS-safe config_hash 68638/head
Kobi Ginon [Mon, 27 Apr 2026 19:08:54 +0000 (22:08 +0300)]
mgr/cephadm: replace md5_hash with FIPS-safe config_hash

Replace md5_hash() usages in cephadm dependency hashing with an
algorithm-agnostic config_hash() helper. config_hash() is backed by
SHA-256, making dependency hash generation unconditionally FIPS-safe
while preserving change-detection behavior.

Fixes: https://tracker.ceph.com/issues/76185
Signed-off-by: Kobi Ginon <kginon@redhat.com>
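The idea in the commit above can be sketched as follows. This is only an illustration: the signature of `config_hash()` and the choice to hash a sequence of strings are assumptions, and the real helper in cephadm may be shaped differently. The only fact taken from the commit message is that the helper is backed by SHA-256.

```python
import hashlib
from typing import Iterable


def config_hash(parts: Iterable[str]) -> str:
    """Hypothetical helper: hex digest over a sequence of strings.

    SHA-256 is a FIPS-approved algorithm, so this keeps working on hosts
    where hashlib.md5() may be refused.
    """
    h = hashlib.sha256()
    for part in parts:
        data = part.encode("utf-8")
        # Length-prefix each part so ["ab", "c"] and ["a", "bc"] differ.
        h.update(len(data).to_bytes(8, "big"))
        h.update(data)
    return h.hexdigest()


# Change detection: a different dependency value yields a different digest.
old = config_hash(["mon_host=10.0.0.1", "port=9283"])
new = config_hash(["mon_host=10.0.0.1", "port=9284"])
print(old != new)
```

Callers only compare digests for equality, so swapping the underlying algorithm does not change how change detection behaves.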
8 weeks agoMerge pull request #66993 from ceph/crimson-pg-subcommands
Kautilya Tripathi [Tue, 28 Apr 2026 11:14:10 +0000 (16:44 +0530)]
Merge pull request #66993 from ceph/crimson-pg-subcommands

crimson: add pg subcommands support in CLI

Reviewed-by: Aishwarya Mathuria <aishwarya.mathuria@ibm.com>
Reviewed-by: Kefu Chai <tchaikov@gmail.com>
8 weeks agoMerge pull request #68502 from ifed01/wip-ifed-more-zoned-remove
Igor Fedotov [Tue, 28 Apr 2026 09:59:08 +0000 (12:59 +0300)]
Merge pull request #68502 from ifed01/wip-ifed-more-zoned-remove

os/bluestore: remove obsolete "zoned" freelist type

Reviewed-by: Adam Kupczyk <akupczyk@ibm.com>
Reviewed-by: Jaya Prakash <jayaprakash@ibm.com>
8 weeks agodoc/rados: improve troubleshooting-mon.rst 68561/head
Ville Ojamo [Wed, 22 Apr 2026 06:51:34 +0000 (13:51 +0700)]
doc/rados: improve troubleshooting-mon.rst

Don't ceph tell mon_status and then claim it passes the help command.
Improve language and link to cephadm doc on asok usage. Add label and
note about accessing asok from the host in troubleshooting.rst.
Capitalize and use double backticks consistently.
Add some missing articles and other minor word changes.
Fix indentation.
Use ref and link definitions consistently, use automatic bold.
Use privileged prompts for CLI commands where necessary.
Remove spaces at end of lines and change tabs to four spaces.

Signed-off-by: Ville Ojamo <git2233+ceph@ojamo.eu>
8 weeks agomgr/dashboard: Fixed modal forms background color 68026/head
Afreen Misbah [Mon, 13 Apr 2026 23:09:51 +0000 (04:39 +0530)]
mgr/dashboard: Fixed modal forms background color

Signed-off-by: Afreen Misbah <afreen@ibm.com>
8 weeks agomgr/dashboard: Fix grid issues in notifications page and password form
Afreen Misbah [Thu, 2 Apr 2026 22:27:59 +0000 (03:57 +0530)]
mgr/dashboard: Fix grid issues in notifications page and password form

Signed-off-by: Afreen Misbah <afreen@ibm.com>
8 weeks agomgr/dashboard: Add gray10 theme base color to all pages
Afreen Misbah [Fri, 27 Mar 2026 16:06:38 +0000 (21:36 +0530)]
mgr/dashboard: Add gray10 theme base color to all pages

- applies #f4f4f4 - $background to all pages as base page
- earlier the base color of page was white
- also updates tabs/navs/tables css to adapt
- some fixes of spacings in alerts tabs, nvmeof

Signed-off-by: Afreen Misbah <afreen@ibm.com>
8 weeks agomgr/dashboard: Add gray10 theme background to overview and rgw page
Afreen Misbah [Fri, 27 Mar 2026 09:16:27 +0000 (14:46 +0530)]
mgr/dashboard: Add gray10 theme background to overview and rgw page

Fixes https://tracker.ceph.com/issues/75752

Signed-off-by: Afreen Misbah <afreen@ibm.com>
8 weeks agomgr/dashboard: Remove dashboard overrides
Afreen Misbah [Thu, 26 Mar 2026 13:34:56 +0000 (19:04 +0530)]
mgr/dashboard: Remove dashboard overrides

- we have responsive layout now so removing overrides
- also removing duplicate spacings css

Signed-off-by: Afreen Misbah <afreen@ibm.com>
8 weeks agomgr/dashboard: Remove modal defaults
Afreen Misbah [Thu, 26 Mar 2026 13:31:43 +0000 (19:01 +0530)]
mgr/dashboard: Remove modal defaults

Signed-off-by: Afreen Misbah <afreen@ibm.com>
8 weeks agomgr/dashboard: Remove tooltip and popover defaults
Afreen Misbah [Thu, 26 Mar 2026 13:25:18 +0000 (18:55 +0530)]
mgr/dashboard: Remove tooltip and popover defaults

Fixes https://tracker.ceph.com/issues/75410

These defaults are not required, as carbon adds a blackish color to tooltips, and going forward we want to align with CDS.
If anything breaks, add the fix in the component that uses it.

Signed-off-by: Afreen Misbah <afreen@ibm.com>
8 weeks agomgr/dashboard: Enable gray 10 theme as per carbon standards
Afreen Misbah [Thu, 26 Mar 2026 13:01:54 +0000 (18:31 +0530)]
mgr/dashboard: Enable gray 10 theme as per carbon standards

- this keeps only branding related colors and removes other colors

Signed-off-by: Afreen Misbah <afreen@ibm.com>
8 weeks agoMerge pull request #68540 from nbalacha/wip-nbalacha-76206
Yuval Lifshitz [Tue, 28 Apr 2026 07:35:08 +0000 (10:35 +0300)]
Merge pull request #68540 from nbalacha/wip-nbalacha-76206

rgw/bucket-logging: handle SigV2 presigned URLs

8 weeks agomgr/dashboard : Fixes EC profile used pool empty 68648/head
Abhishek Desai [Tue, 28 Apr 2026 07:15:16 +0000 (12:45 +0530)]
mgr/dashboard : Fixes EC profile used pool empty
fixes : https://tracker.ceph.com/issues/76288
Signed-off-by: Abhishek Desai <abhishek.desai1@ibm.com>
8 weeks agoMerge pull request #68424 from NitzanMordhai/wip-nitzan-rados-perf-test-epel10-pdsh...
Shraddha Agrawal [Tue, 28 Apr 2026 06:29:05 +0000 (11:59 +0530)]
Merge pull request #68424 from NitzanMordhai/wip-nitzan-rados-perf-test-epel10-pdsh-missing

qa/tasks/cbt: install pdsh from el9 RPMs on el10 systems

8 weeks agomgr/cephadm: log full tracebacks for upgrade exceptions 68646/head
Shweta Bhosale [Tue, 28 Apr 2026 05:48:15 +0000 (11:18 +0530)]
mgr/cephadm: log full tracebacks for upgrade exceptions

Fixes: https://tracker.ceph.com/issues/76284
Signed-off-by: Shweta Bhosale <Shweta.Bhosale1@ibm.com>
8 weeks agocrimson/qa/objectstore-tool: reduce segments size 68474/head
Ronen Friedman [Mon, 20 Apr 2026 15:10:46 +0000 (15:10 +0000)]
crimson/qa/objectstore-tool: reduce segments size

used in testing. This translates into more segments, which helps
in preventing test failures due to insufficient free segments for mounting.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
8 weeks agoqa/tasks: add timeout to 'GC' ceph_objectstore_tool calls
Ronen Friedman [Sat, 18 Apr 2026 16:32:03 +0000 (16:32 +0000)]
qa/tasks: add timeout to 'GC' ceph_objectstore_tool calls

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
8 weeks agoqa/tasks/ceph_objectstore_tool.py: add gc_before_restart option
Ronen Friedman [Thu, 16 Apr 2026 18:02:21 +0000 (18:02 +0000)]
qa/tasks/ceph_objectstore_tool.py: add gc_before_restart option

The objectstore tool tests restart the OSDs without allowing enough
time for GC to run, which can lead to no-OOL-segments conditions on restart. This
adds a gc_before_restart option to the test config, which when set
to true will run crimson-objectstore-tool --op gc on each OSD
before restarting them.

Fixes: https://tracker.ceph.com/issues/73101
Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
8 weeks agocrimson/tools/objectstore: add GC operation to crimson-objectstore-tool
Ronen Friedman [Thu, 16 Apr 2026 17:58:09 +0000 (17:58 +0000)]
crimson/tools/objectstore: add GC operation to crimson-objectstore-tool

This adds a GC operation to the crimson-objectstore-tool, allowing
us to trigger GC cycles on demand during testing. This will
help reduce segment pressure and avoid 'no-segments' conditions.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
8 weeks agocrimson/os: add GC operation to Seastore
Ronen Friedman [Thu, 16 Apr 2026 17:55:22 +0000 (17:55 +0000)]
crimson/os: add GC operation to Seastore

Will be used to force immediate GC cycles in Seastore during testing, to
reduce segment pressure and avoid missing-OOL-segments conditions.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
8 weeks agomgr/cephadm: Skip RDMA device check for NFS during upgrade
Shweta Bhosale [Tue, 28 Apr 2026 04:52:08 +0000 (10:22 +0530)]
mgr/cephadm: Skip RDMA device check for NFS during upgrade

During image upgrade, prepare_create runs on the asyncio event-loop
thread while an outer wait_async is active. Calling wait_async again for
cephadm list-rdma on that thread blocks the loop and can hang or time out.

Fixes: https://tracker.ceph.com/issues/76284
Signed-off-by: Shweta Bhosale <Shweta.Bhosale1@ibm.com>
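The hang described in the commit above comes from blocking on a coroutine from the very thread that has to run it. The sketch below is a simplified model, not cephadm code: `EventLoopThread`, `list_rdma` and the `skip_rdma_check` parameter are hypothetical names, and the guard that raises instead of blocking is added here purely for illustration.

```python
import asyncio
import threading


class EventLoopThread:
    """Hypothetical helper that runs coroutines on a background loop."""

    def __init__(self) -> None:
        self.loop = asyncio.new_event_loop()
        self.thread = threading.Thread(target=self.loop.run_forever, daemon=True)
        self.thread.start()

    def wait_async(self, coro, timeout=5.0):
        # Blocking here from the loop's own thread would stall the loop,
        # so the coroutine could never run and the wait would never finish.
        if threading.current_thread() is self.thread:
            coro.close()  # avoid a "never awaited" warning
            raise RuntimeError("wait_async called from the event-loop thread")
        return asyncio.run_coroutine_threadsafe(coro, self.loop).result(timeout)


async def list_rdma():
    return ["mlx5_0"]  # made-up device name


async def prepare_create(runner, skip_rdma_check):
    # Runs on the loop thread, like the upgrade path in the commit message.
    if skip_rdma_check:
        return []
    return runner.wait_async(list_rdma())  # nested wait on the loop thread


runner = EventLoopThread()
print(runner.wait_async(list_rdma()))
print(runner.wait_async(prepare_create(runner, skip_rdma_check=True)))
```

Skipping the nested call is the simplest way out when the result is not needed during upgrade. The alternative would be to `await` the inner coroutine directly instead of waiting on it synchronously.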
8 weeks agoMerge pull request #68046 from rhcs-dashboard/fix-service-name
Afreen Misbah [Mon, 27 Apr 2026 20:49:05 +0000 (02:19 +0530)]
Merge pull request #68046 from rhcs-dashboard/fix-service-name

mgr/dashboard: Fix service name disablement in service form UI

Reviewed-by: Nizamudeen A <nia@redhat.com>
8 weeks agoscript/ptl-tool: add option to not create a tag 68641/head
Patrick Donnelly [Mon, 27 Apr 2026 20:25:27 +0000 (16:25 -0400)]
script/ptl-tool: add option to not create a tag

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
8 weeks agorgw/lc: drop per-bucket LC counters to PRIO_DEBUGONLY 66064/head
Matthew Heler [Sun, 26 Apr 2026 21:00:44 +0000 (16:00 -0500)]
rgw/lc: drop per-bucket LC counters to PRIO_DEBUGONLY

mgr's perf-schema bridge silently drops labeled counters on the way
out, so shipping the per-bucket LC counters up through MgrReport just
costs ingest memory for data mgr can't expose anyway. ceph-exporter
already handles labeled counters via the daemon admin socket, so make
that the only path.

Signed-off-by: Matthew N. Heler <matthew.heler@hotmail.com>
8 weeks agoMerge pull request #68416 from aclamk/aclamk-fcm-health-warnings
Jaya Prakash [Mon, 27 Apr 2026 19:51:06 +0000 (01:21 +0530)]
Merge pull request #68416 from aclamk/aclamk-fcm-health-warnings

bluestore, extblkdev: Now plugins can raise health warnings

Reviewed-by: Igor Fedotov <igor.fedotov@croit.io>
8 weeks agoMerge pull request #68376 from aclamk/aclamk-rocksdb-prio-fix
Jaya Prakash [Mon, 27 Apr 2026 19:50:28 +0000 (01:20 +0530)]
Merge pull request #68376 from aclamk/aclamk-rocksdb-prio-fix

kv/rocksdb: Fix priority of rocksdb cache perf counters

Reviewed-by: Jaya Prakash <jayaprakash@ibm.com>
8 weeks agomgr/cephadm: remove SSH error logs from health detail when host is unreachable 67070/head
Joshua Blanch [Sat, 24 Jan 2026 16:53:14 +0000 (16:53 +0000)]
mgr/cephadm: remove SSH error logs from health detail when host is unreachable

The HostConnectionError exception includes verbose logs from asyncssh, which
create noise when looking at ceph health detail. This moves the SSH logs
to log.exception() and removes them from `health detail`.

Fixes: https://tracker.ceph.com/issues/74551
Signed-off-by: Joshua Blanch <joshua.blanch@clyso.com>
8 weeks agoMerge pull request #68578 from phlogistonjohn/jjm-codeowners-smb
Dan Mick [Mon, 27 Apr 2026 16:59:28 +0000 (09:59 -0700)]
Merge pull request #68578 from phlogistonjohn/jjm-codeowners-smb

CODEOWNERS: add an smb group for various smb related files

8 weeks agomgr/dashboard: use cephadm root CA for RGW SSL and improve error handling 67857/head
yaelazulay-redhat [Mon, 16 Mar 2026 20:37:22 +0000 (22:37 +0200)]
mgr/dashboard: use cephadm root CA for RGW SSL and improve error handling

Problem: Dashboard fails to access object pages when RGW is deployed with SSL using cephadm-signed certificates.

Root cause: RGW REST API connection fails with SSL certificate verification error because the cephadm root CA certificate that signed the RGW SSL certificates is not in the dashboard's trust store.

Code Fixes:
1. rgw_client.py:
Added _get_ssl_ca_bundle() which fetches the cephadm root CA certificate from the cert store and writes it atomically (via a temp file and os.replace) to a fixed path (/tmp/ceph-dashboard-ca/rgw-cephadm-root-ca.pem), returning the file path for SSL verification.
Notes:
- The file is written once per mgr process lifetime and reused by all RgwClient instances. On mgr restart it is refetched and overwritten.
- A dedicated subdirectory (/tmp/ceph-dashboard-ca/) is used because /tmp has the sticky bit set, which prevents os.replace from overwriting files owned by a different user.

2. rest_client.py
Fixed a secondary crash in handle_connection_error: when the initial SSL error occurred, the error handler itself crashed while processing the exception, because it assumed reason.args[0] was always a string, but for SSL errors it is an SSLError object.

Fixes: https://tracker.ceph.com/issues/74393
Signed-off-by: Yael Azulay <yazulay@redhat.com>
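The atomic write described for rgw_client.py in the commit above (temp file plus os.replace inside a dedicated directory) could look roughly like this. It is a sketch under assumptions: `write_atomically` is a hypothetical name, the directory mode is a guess, and a throwaway temp directory stands in for the fixed path named in the commit message.

```python
import os
import tempfile


def write_atomically(path: str, data: str) -> str:
    """Write data to path so readers never observe a partial file."""
    directory = os.path.dirname(path)
    os.makedirs(directory, mode=0o700, exist_ok=True)
    # The temp file lives in the same directory so os.replace stays on
    # one filesystem and can swap the file in a single step.
    fd, tmp_path = tempfile.mkstemp(dir=directory, suffix=".tmp")
    try:
        with os.fdopen(fd, "w") as f:
            f.write(data)
        os.replace(tmp_path, path)
    except BaseException:
        if os.path.exists(tmp_path):
            os.unlink(tmp_path)
        raise
    return path


base = tempfile.mkdtemp()  # stand-in for the dedicated directory
target = os.path.join(base, "ca", "root-ca.pem")
write_atomically(target, "first version\n")
write_atomically(target, "second version\n")  # overwrites in place
print(open(target).read())
```

The returned path is what would then be handed to the HTTP client as the CA bundle for certificate verification.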
8 weeks agocephadm: replace call_throws with call in command_inspect_image 68538/head
yaelazulay-redhat [Wed, 22 Apr 2026 06:37:57 +0000 (09:37 +0300)]
cephadm: replace call_throws with call in command_inspect_image

Problem:
During the upgrade, when inspecting the new ceph image for the first time, an error is printed to the ceph-mgr log instead of displaying a user-friendly message.

Root cause: During an upgrade, inspect-image is called on each node to check if the target image exists locally before pulling it. This flow, where inspect-image always precedes the pull, occurs on nodes other than the first.

Code Fixes:
1. src/cephadm/cephadm.py:
   Replace call_throws with call in command_inspect_image. call_throws raises a RuntimeError on any non-zero exit code, producing a full traceback in the logs. call returns the exit code instead of raising, so the function exits cleanly with errno.ENOENT when the image is not found.

Fixes: https://tracker.ceph.com/issues/75448
Signed-off-by: Yael Azulay <yazulay@redhat.com>
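The difference between the two helpers named in the commit above can be illustrated with simplified stand-ins. The functions below only borrow the names `call` and `call_throws`. Their signatures are assumptions and do not match the real cephadm helpers, and a failing Python subprocess stands in for the container engine's inspect command.

```python
import errno
import subprocess
import sys
from typing import List, Tuple


def call(cmd: List[str]) -> Tuple[str, str, int]:
    """Stand-in: run a command and hand back its exit code."""
    proc = subprocess.run(cmd, capture_output=True, text=True)
    return proc.stdout, proc.stderr, proc.returncode


def call_throws(cmd: List[str]) -> Tuple[str, str, int]:
    """Stand-in: same as call(), but raise on any non-zero exit code."""
    out, err, code = call(cmd)
    if code != 0:
        raise RuntimeError(f"Failed command: {' '.join(cmd)}")
    return out, err, code


def inspect_image(cmd: List[str]) -> int:
    # Using call() lets a missing image be reported as a plain error code
    # instead of an exception with a traceback.
    out, err, code = call(cmd)
    if code != 0:
        print("image not found locally", file=sys.stderr)
        return errno.ENOENT
    print(out.strip())
    return 0


missing = [sys.executable, "-c", "import sys; sys.exit(1)"]
print(inspect_image(missing) == errno.ENOENT)
```

A "not found" result is an expected outcome before the first pull, so treating it as a return value rather than an exception keeps the logs quiet.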
8 weeks agocephadm: convert lists back to tuples when loading last_client_files 68553/head
yaelazulay-redhat [Wed, 22 Apr 2026 16:08:22 +0000 (19:08 +0300)]
cephadm: convert lists back to tuples when loading last_client_files

Problem: A `ceph mgr fail` or a restart of the active ceph mgr causes client files to be recreated unnecessarily on _admin hosts. Files such as /etc/ceph/ceph.conf and /etc/ceph/ceph.client.admin.keyring are rewritten even when their content has not changed.

Root cause:
update_client_file() stores client file metadata as a Python tuple (digest, mode, uid, gid).
When save_host() persists this to the mon store via json.dumps(), the tuple is serialized as a JSON array since JSON has no tuple type.
On mgr failover or restart, cache.load() deserializes the data with json.loads(), which returns a Python list instead of a tuple.
The comparison in _write_client_files(): match = old_files[path] == (digest, mode, uid, gid) then compares a list (from JSON) against a tuple (freshly built), which always evaluates to False.
This causes every client file to be rewritten on every mgr failover or restart.

Code Fixes:
1. src/pybind/mgr/cephadm/inventory.py:
    convert the deserialized lists back to tuples when loading last_client_files

Fixes: https://tracker.ceph.com/issues/76176
Signed-off-by: Yael Azulay <yazulay@redhat.com>
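The root cause in the commit above can be reproduced with the standard json module alone. The digest value and the dictionary layout below are made up for illustration, and the dict comprehension is only one possible way to restore the tuples on load.

```python
import json

# Metadata as built in memory: (digest, mode, uid, gid)
entry = ("abc123", 0o600, 0, 0)

# Persisting and reloading turns the tuple into a list.
stored = json.dumps({"/etc/ceph/ceph.conf": entry})
loaded = json.loads(stored)
print(loaded["/etc/ceph/ceph.conf"] == entry)  # False: list vs tuple

# Converting back to tuples on load makes the comparison meaningful again.
fixed = {path: tuple(meta) for path, meta in loaded.items()}
print(fixed["/etc/ceph/ceph.conf"] == entry)  # True
```

In Python a list never compares equal to a tuple, even with identical elements, which is why the unchanged files looked different after every failover.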
8 weeks agoMerge pull request #68278 from cheese-cakee/wip-rgw-inline-help-rm-delete
Yuval Lifshitz [Mon, 27 Apr 2026 15:55:44 +0000 (18:55 +0300)]
Merge pull request #68278 from cheese-cakee/wip-rgw-inline-help-rm-delete

rgw: align period/zone commands in radosgw-admin help

8 weeks agoMerge pull request #68228 from cheese-cakee/wip-doc-fix-period-zone-commands
Yuval Lifshitz [Mon, 27 Apr 2026 15:55:28 +0000 (18:55 +0300)]
Merge pull request #68228 from cheese-cakee/wip-doc-fix-period-zone-commands

doc: fix period and zone command names in radosgw-admin man page

8 weeks agoMerge pull request #68570 from rkachach/fix_issue_adding_coverage_target
Redouane Kachach [Mon, 27 Apr 2026 15:30:30 +0000 (17:30 +0200)]
Merge pull request #68570 from rkachach/fix_issue_adding_coverage_target

mgr/tox: adding coverage target to the main tox.ini

Reviewed-by: Shweta Bhosale <Shweta.Bhosale1@ibm.com>
8 weeks agoMerge pull request #68543 from myoungwon/wip-myoungwon-email
Anthony M [Mon, 27 Apr 2026 15:01:24 +0000 (10:01 -0500)]
Merge pull request #68543 from myoungwon/wip-myoungwon-email

doc: update governance.rst to change email address

8 weeks agoMerge pull request #68595 from guits/cv-fcm-setkeepcaps
Guillaume Abrioux [Mon, 27 Apr 2026 14:19:37 +0000 (16:19 +0200)]
Merge pull request #68595 from guits/cv-fcm-setkeepcaps

ceph-volume: pass --set-keepcaps for FCM crush device class on mkfs

8 weeks agomgr/cephadm: Handle daemon colocation port for cluster qos port 61826/head
Shweta Bhosale [Mon, 27 Apr 2026 11:43:10 +0000 (17:13 +0530)]
mgr/cephadm: Handle daemon colocation port for cluster qos port

Fixes: https://tracker.ceph.com/issues/69861
Signed-off-by: Shweta Bhosale <Shweta.Bhosale1@ibm.com>
8 weeks agoMerge pull request #68429 from cbodley/wip-74595
Casey Bodley [Mon, 27 Apr 2026 13:14:56 +0000 (09:14 -0400)]
Merge pull request #68429 from cbodley/wip-74595

rgw/sns: ListTopics uses account root arn for policy evaluation

Reviewed-by: Yuval Lifshitz <ylifshit@redhat.com>
8 weeks agoMerge pull request #68550 from shraddhaag/wip-shraddhaag-block-size-error
Shraddha Agrawal [Mon, 27 Apr 2026 13:02:28 +0000 (18:32 +0530)]
Merge pull request #68550 from shraddhaag/wip-shraddhaag-block-size-error

src/crimson/seastore: improve block size validation assert message

8 weeks agomgr/cephadm: Fixed scheduling test for nfs qos port
Shweta Bhosale [Wed, 22 Apr 2026 09:37:18 +0000 (15:07 +0530)]
mgr/cephadm: Fixed scheduling test for nfs qos port

Fixes: https://tracker.ceph.com/issues/69861
Signed-off-by: Shweta Bhosale <Shweta.Bhosale1@ibm.com>
8 weeks agomgr/cephadm: For colocating nfs daemons, check daemon qos ports instead of fetching...
Shweta Bhosale [Wed, 22 Apr 2026 09:35:08 +0000 (15:05 +0530)]
mgr/cephadm: For colocating nfs daemons, check daemon qos ports instead of fetching port from spec

Fixes: https://tracker.ceph.com/issues/69861
Signed-off-by: Shweta Bhosale <Shweta.Bhosale1@ibm.com>
8 weeks agomgr/nfs: 1. Removed the option to enable and disable cluster wide qos, it will be...
Shweta Bhosale [Wed, 18 Feb 2026 14:29:58 +0000 (19:59 +0530)]
mgr/nfs: 1. Removed the option to enable and disable cluster wide qos, it will be enabled by default
2. Removed the cluster_enable_qos field from the cluster-level block as it was causing confusion for the user.
3. Use "global" instead of "cluster" when showing cluster level qos values in export qos get

Fixes: https://tracker.ceph.com/issues/69861
Signed-off-by: Shweta Bhosale <Shweta.Bhosale1@ibm.com>
8 weeks agomgr/nfs: Update cluster qos_type from int to str
Shweta Bhosale [Mon, 16 Feb 2026 17:03:00 +0000 (22:33 +0530)]
mgr/nfs: Update cluster qos_type from int to str

Fixes: https://tracker.ceph.com/issues/69861
Signed-off-by: Shweta Bhosale <Shweta.Bhosale1@ibm.com>
8 weeks agomgr/cephadm: support nfs cluster level qos
Shweta Bhosale [Thu, 6 Nov 2025 13:04:19 +0000 (18:34 +0530)]
mgr/cephadm: support nfs cluster level qos
Added the CEPH_NODES_LIST block below to ganesha.conf, and enable_cluster_qos to the cluster level QoS block
CEPH_NODES_LIST {
        Ceph_Nodes = 192.168.100.100, 192.168.100.101, 192.168.100.102;
}

Fixes: https://tracker.ceph.com/issues/69861
Signed-off-by: Shweta Bhosale <Shweta.Bhosale1@ibm.com>
mgr/cephadm: Changes to add NFS cluster qos inter node communication port in spec

mgr/nfs: Addressed review comments for cluster level qos support

mgr/nfs: add enable_cluster_qos = true while enabling qos

8 weeks agomgr/cephadm: NFSSpec changes to apply cluster level QOS while nfs service creation
Shweta Bhosale [Wed, 16 Jul 2025 07:23:05 +0000 (12:53 +0530)]
mgr/cephadm: NFSSpec changes to apply cluster level QOS while nfs service creation

Fixes: https://tracker.ceph.com/issues/69458
Signed-off-by: Shweta Bhosale <Shweta.Bhosale1@ibm.com>
8 weeks agomgr/nfs: update QOS minimum BandWidth value and IOPS max value
Shweta Bhosale [Fri, 30 May 2025 11:35:29 +0000 (17:05 +0530)]
mgr/nfs: update QOS minimum BandWidth value and IOPS max value

Fixes: https://tracker.ceph.com/issues/69458
Signed-off-by: Shweta Bhosale <Shweta.Bhosale1@ibm.com>
8 weeks agomgr/nfs: Changes for get QoS command to return bandwidth in GiB instead of GB
Shweta Bhosale [Mon, 14 Apr 2025 09:30:28 +0000 (15:00 +0530)]
mgr/nfs: Changes for get QoS command to return bandwidth in GiB instead of GB

Fixes: https://tracker.ceph.com/issues/69458
Signed-off-by: Shweta Bhosale <Shweta.Bhosale1@ibm.com>
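The distinction the commit above relies on is purely arithmetic: GB is a decimal unit and GiB a binary one. The helper names below are hypothetical and are not taken from the nfs module.

```python
GB = 1000 ** 3   # decimal gigabyte, in bytes
GIB = 1024 ** 3  # binary gibibyte, in bytes


def bytes_to_gb(n: int) -> float:
    return n / GB


def bytes_to_gib(n: int) -> float:
    return n / GIB


bandwidth = 100 * GIB  # a limit stored in bytes
print(bytes_to_gib(bandwidth))         # 100.0
print(round(bytes_to_gb(bandwidth), 2))  # 107.37
```

Reporting the same byte count in the unit it was configured in avoids the roughly 7% discrepancy between the two readings.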
8 weeks agomgr/nfs: Changed IOPS max limit to 409600 and Bandwidth max limit to 100GB
Shweta Bhosale [Fri, 11 Apr 2025 04:09:43 +0000 (09:39 +0530)]
mgr/nfs: Changed IOPS max limit to 409600 and Bandwidth max limit to 100GB

Fixes: https://tracker.ceph.com/issues/69458
Signed-off-by: Shweta Bhosale <Shweta.Bhosale1@ibm.com>
8 weeks agomgr/nfs: Addressed review comments
Shweta Bhosale [Sat, 5 Apr 2025 04:30:57 +0000 (10:00 +0530)]
mgr/nfs: Addressed review comments

Fixes: https://tracker.ceph.com/issues/69458
Signed-off-by: Shweta Bhosale <Shweta.Bhosale1@ibm.com>
8 weeks agonfs/mgr: Updated logs and exception messages
Shweta Bhosale [Tue, 25 Mar 2025 10:05:28 +0000 (15:35 +0530)]
nfs/mgr: Updated logs and exception messages

fixes: https://tracker.ceph.com/issues/69458
Signed-off-by: Shweta Bhosale <Shweta.Bhosale1@ibm.com>
8 weeks agomgr/nfs: Added skip_notify_nfs_server parameter to update export commands
Shweta Bhosale [Tue, 25 Mar 2025 08:19:53 +0000 (13:49 +0530)]
mgr/nfs: Added skip_notify_nfs_server parameter to update export commands

fixes: https://tracker.ceph.com/issues/69458
Signed-off-by: Shweta Bhosale <Shweta.Bhosale1@ibm.com>
8 weeks agomgr/nfs: When cluster level qos is disabled and export still has qos parameters,...
Shweta Bhosale [Wed, 19 Mar 2025 11:16:10 +0000 (16:46 +0530)]
mgr/nfs: When cluster level qos is disabled and export still has qos parameters, then allow nfs export apply command if file has same qos block which is already set

fixes: https://tracker.ceph.com/issues/69458
Signed-off-by: Shweta Bhosale <Shweta.Bhosale1@ibm.com>
8 weeks agomgr/nfs: Add cluster level qos status to export get qos command
Shweta Bhosale [Mon, 10 Mar 2025 08:10:34 +0000 (13:40 +0530)]
mgr/nfs: Add cluster level qos status to export get qos command

fixes: https://tracker.ceph.com/issues/69458
Signed-off-by: Shweta Bhosale <Shweta.Bhosale1@ibm.com>
8 weeks agomgr/nfs: Updated get qos to return bandwidth in bytes as dashboard needs bw in bytes
Shweta Bhosale [Mon, 24 Feb 2025 06:21:11 +0000 (11:51 +0530)]
mgr/nfs: Updated get qos to return bandwidth in bytes as dashboard needs bw in bytes

Fixes: https://tracker.ceph.com/issues/69458
Signed-off-by: Shweta Bhosale <Shweta.Bhosale1@ibm.com>
8 weeks agomgr/nfs: Updating QoS min and max limit of bandwidth and iops values
Shweta Bhosale [Mon, 24 Feb 2025 05:52:20 +0000 (11:22 +0530)]
mgr/nfs: Updating QoS min and max limit of bandwidth and iops values

fixes: https://tracker.ceph.com/issues/69458
Signed-off-by: Shweta Bhosale <Shweta.Bhosale1@ibm.com>
8 weeks agomgr/nfs: NFS cluster and export commands to enable and disable ops/s control
Shweta Bhosale [Wed, 12 Feb 2025 14:08:07 +0000 (19:38 +0530)]
mgr/nfs: NFS cluster and export commands to enable and disable ops/s control

Fixes: https://tracker.ceph.com/issues/69861
Signed-off-by: Shweta Bhosale <Shweta.Bhosale1@ibm.com>
8 weeks agomgr/nfs: NFS commands to enable, disable and get QOS config for cluster and export
Shweta Bhosale [Thu, 9 Jan 2025 13:53:37 +0000 (19:23 +0530)]
mgr/nfs: NFS commands to enable, disable and get QOS config for cluster and export

fixes: https://tracker.ceph.com/issues/69458
Signed-off-by: Shweta Bhosale <Shweta.Bhosale1@ibm.com>
8 weeks agomgr/cephadm: Handled stick table for ipv6 setup, also removed stick table expiry 67720/head
Shweta Bhosale [Wed, 25 Mar 2026 10:29:40 +0000 (15:59 +0530)]
mgr/cephadm: Handled stick table for ipv6 setup, also removed stick table expiry

Fixes: https://tracker.ceph.com/issues/74970
Signed-off-by: Shweta Bhosale <Shweta.Bhosale1@ibm.com>
8 weeks agomgr/cephadm: Update haproxy.cfg template to support nfs active active deployment
Shweta Bhosale [Mon, 16 Feb 2026 17:27:47 +0000 (22:57 +0530)]
mgr/cephadm: Update haproxy.cfg template to support nfs active active deployment

Fixes: https://tracker.ceph.com/issues/74970
Signed-off-by: Shweta Bhosale <Shweta.Bhosale1@ibm.com>
8 weeks agoMerge pull request #68407 from knrt10/crimson-rgw-lifecycle
Kautilya Tripathi [Mon, 27 Apr 2026 12:07:34 +0000 (17:37 +0530)]
Merge pull request #68407 from knrt10/crimson-rgw-lifecycle

qa: Port rgw lifecycle test to crimson-rados

8 weeks agorgw/lc: add per-bucket lifecycle performance monitoring
Matthew N. Heler [Sun, 5 Oct 2025 15:43:26 +0000 (10:43 -0500)]
rgw/lc: add per-bucket lifecycle performance monitoring

Implements real-time per-bucket lifecycle (LC) monitoring via performance
counters exposed through the admin socket.

Signed-off-by: Matthew N. Heler <matthew.heler@hotmail.com>
8 weeks agomgr/cephadm: plumb force_delete_data through daemon/service removal 68428/head
Redouane Kachach [Wed, 3 Dec 2025 08:28:08 +0000 (09:28 +0100)]
mgr/cephadm: plumb force_delete_data through daemon/service removal

This PR wires the existing `force_delete_data` flag in the
binary through cephadm’s daemon and service removal paths, so that
commands such as `ceph orch rm service` or equivalent daemon removal
can explicitly ask for data deletion instead of the default "move
under <fsid>/removed/" for daemons such as Prometheus, osd and mon.

Fixes: https://tracker.ceph.com/issues/74058
Signed-off-by: Kobi Ginon <kginon@redhat.com>
8 weeks agoMerge pull request #68618 from oduameh/wip-doc-nvmeof-subcommands
Ilya Dryomov [Mon, 27 Apr 2026 09:20:33 +0000 (11:20 +0200)]
Merge pull request #68618 from oduameh/wip-doc-nvmeof-subcommands

doc: document ceph nvmeof CLI subcommands for target configuration

Reviewed-by: Aviv Caro <Aviv.Caro@ibm.com>
8 weeks agoosdc: add missing include 68629/head
Max Kellermann [Fri, 24 Apr 2026 08:33:17 +0000 (10:33 +0200)]
osdc: add missing include

Signed-off-by: Max Kellermann <max.kellermann@ionos.com>
8 weeks agoMerge pull request #68338 from fivetime/ceph-volume-raw-osd-fsid
Guillaume Abrioux [Mon, 27 Apr 2026 08:29:04 +0000 (10:29 +0200)]
Merge pull request #68338 from fivetime/ceph-volume-raw-osd-fsid

ceph-volume: add --osd-fsid support to raw mode prepare

8 weeks agoMerge pull request #66586 from ShwetaBhosale1/execute_command_retry_logic_if_command_...
Redouane Kachach [Mon, 27 Apr 2026 08:25:24 +0000 (10:25 +0200)]
Merge pull request #66586 from ShwetaBhosale1/execute_command_retry_logic_if_command_fails_with_connection_error

mgr/cephadm: Added retry logic for execute command if command fails with connection error

Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: Redouane Kachach <rkachach@ibm.com>
8 weeks agoMerge pull request #67651 from ShwetaBhosale1/fix_issue_75189_nfs_rdma_support
Redouane Kachach [Mon, 27 Apr 2026 08:20:18 +0000 (10:20 +0200)]
Merge pull request #67651 from ShwetaBhosale1/fix_issue_75189_nfs_rdma_support

mgr/cephadm: nfs rdma support

Reviewed-by: Redouane Kachach <rkachach@ibm.com>
8 weeks agoMerge pull request #68152 from ShwetaBhosale1/fix_issue_75704_make_haproxy_peer_confi...
Redouane Kachach [Mon, 27 Apr 2026 08:19:13 +0000 (10:19 +0200)]
Merge pull request #68152 from ShwetaBhosale1/fix_issue_75704_make_haproxy_peer_configuration_port_configurable

mgr/cephadm: Provide option to configure custom haproxy peer communication port

Reviewed-by: Kushal Deb <Kushal.Deb@ibm.com>
8 weeks agoMerge pull request #67906 from ShwetaBhosale1/update_nfs
Redouane Kachach [Mon, 27 Apr 2026 08:17:58 +0000 (10:17 +0200)]
Merge pull request #67906 from ShwetaBhosale1/update_nfs

Bump NFS Version in Ceph

Reviewed-by: Shubha Jain <SHUBHA.JAIN1@ibm.com>
8 weeks agoMerge pull request #68549 from afreen23/service-form-fix
Afreen Misbah [Mon, 27 Apr 2026 08:07:26 +0000 (13:37 +0530)]
Merge pull request #68549 from afreen23/service-form-fix

Reviewed-by: Nizamudeen A <nia@redhat.com>
Reviewed-by: Sagar Gopale <sagar.gopale@ibm.com>
8 weeks agoMerge pull request #66331 from ujjawal-ibm/daemon-upgrade-issue-with-space
Redouane Kachach [Mon, 27 Apr 2026 08:03:54 +0000 (10:03 +0200)]
Merge pull request #66331 from ujjawal-ibm/daemon-upgrade-issue-with-space

mgr/cephadm: Handle whitespace in upgrade daemon types

Reviewed-by: Adam King <adking@redhat.com>
8 weeks agoMerge pull request #68164 from phlogistonjohn/jjm-cephadm-smb-deps-action
Redouane Kachach [Mon, 27 Apr 2026 07:51:58 +0000 (09:51 +0200)]
Merge pull request #68164 from phlogistonjohn/jjm-cephadm-smb-deps-action

cephadm: add get_dependencies to smb service class

Reviewed-by: Redouane Kachach <rkachach@ibm.com>
8 weeks agoMerge pull request #68608 from tchaikov/wip-minimal-release
Kefu Chai [Mon, 27 Apr 2026 06:01:12 +0000 (14:01 +0800)]
Merge pull request #68608 from tchaikov/wip-minimal-release

doc/rados: document require_osd_release and require_min_compat_client

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
Reviewed-by: Ville Ojamo <git2233+ceph@ojamo.eu>
8 weeks agoqa: Port rgw lifecycle test to crimson-rados 68407/head
Kautilya Tripathi [Tue, 9 Dec 2025 02:25:35 +0000 (02:25 +0000)]
qa: Port rgw lifecycle test to crimson-rados

This adds the rgw lifecycle qa tests to the crimson-rados suite.

Signed-off-by: Kautilya Tripathi <kautilya.tripathi@ibm.com>
8 weeks agocrimson/admin: avoid pg asok crashes 66993/head
Kautilya Tripathi [Mon, 27 Apr 2026 04:36:49 +0000 (10:06 +0530)]
crimson/admin: avoid pg asok crashes

Keep parsed command data alive while running hooks to avoid a
stack-use-after-return in Formatter::create().
Return -EAGAIN from PGCommand when the OSDMap is not ready.

Signed-off-by: Kautilya Tripathi <kautilya.tripathi@ibm.com>
8 weeks agoqa/crimson: add pg subcommands workunit
Kautilya Tripathi [Wed, 8 Apr 2026 05:01:59 +0000 (10:31 +0530)]
qa/crimson: add pg subcommands workunit

Add a crimson workunit that validates pg query and list_unfound
including offset behavior, and wire it into the singleton suite.

Signed-off-by: Kautilya Tripathi <kautilya.tripathi@ibm.com>
8 weeks agoqa/suites: add crimson pg-subcommands suite
Kautilya Tripathi [Tue, 31 Mar 2026 14:04:19 +0000 (19:34 +0530)]
qa/suites: add crimson pg-subcommands suite

Add a dedicated crimson-rados singleton suite yaml to run the
osd/pg-subcommands.sh standalone test.

Signed-off-by: Kautilya Tripathi <kautilya.tripathi@ibm.com>
8 weeks agoqa/standalone: Add pg_subcommands test
Kautilya Tripathi [Tue, 27 Jan 2026 03:07:20 +0000 (08:37 +0530)]
qa/standalone: Add pg_subcommands test

This adds a test script for pg subcommands such as query, log, scrub,
deep-scrub, and list-unfound. It compares the output between classic and
crimson.

Signed-off-by: Kautilya Tripathi <kautilya.tripathi@ibm.com>
8 weeks agocrimson: Add missing pg subcommands
Kautilya Tripathi [Tue, 27 Jan 2026 04:32:04 +0000 (10:02 +0530)]
crimson: Add missing pg subcommands

This adds the log, deep-scrub, and listunfound pg subcommands to crimson.

Signed-off-by: Kautilya Tripathi <kautilya.tripathi@ibm.com>
8 weeks agocrimson: add pg subcommands support in CLI
Kautilya Tripathi [Tue, 20 Jan 2026 13:24:38 +0000 (18:54 +0530)]
crimson: add pg subcommands support in CLI

Crimson OSD was missing the PG admin/tell hooks that classic OSD exposes, and it
did not accept the legacy `rados_pg_command()` / `ceph pg <pgid> <cmd>` JSON form
(e.g. `{"prefix":"pg","pgid":"1.0","cmd":"query",...}`), so `ceph pg <pgid> query`
failed.

Adds a `pg` old-form wrapper hook that exists to advertise the classic
`pgid` + `cmd` + optional `arg` signature. The runtime dispatch rewrites
this to the real subcommand.

This updates parse_cmd to rewrite `prefix=pg` requests to the requested
subcommand and remap the generic `arg` field to the concrete parameter
names (`offset` for `list_unfound`, `mulcmd` for `mark_unfound_lost`)
so validation/parsing is unambiguous.
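
The rewrite described above can be illustrated with a minimal Python sketch.
It is not the crimson implementation (which is C++): the function name
`rewrite_legacy_pg_command`, the `ARG_REMAP` table, and the choice to use the
bare subcommand as the new prefix are assumptions made for illustration. Only
the `offset` and `mulcmd` mappings come from the text above.

```python
# Hypothetical table: which concrete parameter the generic "arg" stands for.
# Only these two pairs are taken from the commit message.
ARG_REMAP = {
    "list_unfound": "offset",
    "mark_unfound_lost": "mulcmd",
}


def rewrite_legacy_pg_command(cmd: dict) -> dict:
    """Rewrite an old-form {"prefix": "pg", "cmd": ...} request.

    Requests that do not use the legacy prefix are returned as an
    unchanged copy.
    """
    if cmd.get("prefix") != "pg":
        return dict(cmd)

    subcommand = cmd["cmd"]
    # Drop the legacy-only fields, then point the prefix at the subcommand
    # (assumption: the real prefix may be spelled differently).
    rewritten = {k: v for k, v in cmd.items() if k not in ("cmd", "arg")}
    rewritten["prefix"] = subcommand

    # Rename the generic "arg" when a concrete name is known; keep it otherwise.
    if "arg" in cmd:
        rewritten[ARG_REMAP.get(subcommand, "arg")] = cmd["arg"]
    return rewritten


legacy = {"prefix": "pg", "pgid": "1.0", "cmd": "list_unfound", "arg": "5"}
print(rewrite_legacy_pg_command(legacy))
```

Doing the rename before validation means each subcommand only ever sees its
own parameter names, which is the unambiguity the message refers to.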

Fixes: https://tracker.ceph.com/issues/73266
Signed-off-by: Kautilya Tripathi <kautilya.tripathi@ibm.com>
8 weeks agoMerge pull request #67372 from knrt10/wip-crimson-rgw-multifs
Kautilya Tripathi [Mon, 27 Apr 2026 04:11:14 +0000 (09:41 +0530)]
Merge pull request #67372 from knrt10/wip-crimson-rgw-multifs

qa: Port rgw multifs test to crimson-rados

8 weeks agoMerge pull request #68341 from ronen-fr/wip-rf-dev3-crimson
Ronen Friedman [Mon, 27 Apr 2026 04:09:55 +0000 (07:09 +0300)]
Merge pull request #68341 from ronen-fr/wip-rf-dev3-crimson

crimson/osd: move to a unified on-device super-block structure

Reviewed-by: Matan Breizman <mbreizma@redhat.com>
8 weeks agodoc/rados: document require_osd_release 68608/head
Kefu Chai [Fri, 24 Apr 2026 13:41:29 +0000 (21:41 +0800)]
doc/rados: document require_osd_release

Add a standalone concept page for the OSDMap require_osd_release field,
the upgrade-gate counterpart to require_min_compat_client. Cover:

 - how to set it and how to check it;
 - the full set of pre-commit guards the monitor runs, rendered as a
   table with each guard's error text and bypass status;
 - which commands and features become available as the flag is raised,
   per release;
 - the OSD boot window that refuses OSDs more than two releases ahead
   of the flag;
 - the OSD_UPGRADE_FINISHED health warning that prompts admins to set
   the flag after an upgrade;
 - the initial value on new clusters and the two mon_debug_* knobs
   that override it for testing.

Also cross-link the new page from the related-flags table on
require-min-compat-client.rst, and from the rados operations index.
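
The OSD boot window from the list above can be pictured with a small Python
sketch. It models only the "more than two releases ahead" rule as stated
here. The function name `osd_may_boot` and the integer release ordinals are
hypothetical, and the real monitor check may apply further conditions.

```python
def osd_may_boot(osd_release: int, require_osd_release: int, window: int = 2) -> bool:
    """Return False when the OSD is more than `window` releases ahead of the flag.

    Releases are modelled as increasing integers (an assumption for this
    sketch); OSDs older than the flag are not covered here.
    """
    return osd_release - require_osd_release <= window


# With the flag at ordinal 18, an OSD at 20 is inside the window, 21 is not.
print(osd_may_boot(20, 18))
print(osd_may_boot(21, 18))
```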

Signed-off-by: Kefu Chai <k.chai@proxmox.com>
8 weeks agodoc/rados: document require_min_compat_client
Kefu Chai [Fri, 24 Apr 2026 13:16:00 +0000 (21:16 +0800)]
doc/rados: document require_min_compat_client

Add a standalone concept page for the OSDMap require_min_compat_client
field, covering: how to set and check it, the non-monotonic lowering
behavior (with the features-in-use floor derived from
OSDMap::get_min_compat_client()), and the operator commands it gates.
Include tables for the floor-pinning features and the flag-gated
commands, so operators can reason about transitions without reading
OSDMonitor.cc.
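
As a rough illustration of the lowering behavior, the Python sketch below
treats the floor as the highest minimum release demanded by any feature in
use, and allows the flag to move to any value at or above that floor. The
feature names, the release ordinals, and both function names are hypothetical;
the real floor is computed by OSDMap::get_min_compat_client().

```python
# Hypothetical feature -> minimum client release ordinal (not Ceph's real data).
FEATURE_MIN_RELEASE = {"feature_a": 12, "feature_b": 18}


def features_in_use_floor(features_in_use: list) -> int:
    """Lowest value the flag may take given the features currently in use."""
    return max((FEATURE_MIN_RELEASE[f] for f in features_in_use), default=0)


def can_set_min_compat_client(requested: int, features_in_use: list) -> bool:
    """The flag may be raised or lowered, but never below the floor."""
    return requested >= features_in_use_floor(features_in_use)


# Lowering to 14 is refused while feature_b is in use, allowed once it is not.
print(can_set_min_compat_client(14, ["feature_a", "feature_b"]))
print(can_set_min_compat_client(14, ["feature_a"]))
```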

Cross-reference to the CephFS per-filesystem required_client_features
mechanism, which is the MDSMap-side equivalent for client-protocol
features. Add an anchor on the existing CephFS Required Client Features
section so the cross-reference resolves.

Link the new page from the rados operations index.

Signed-off-by: Kefu Chai <k.chai@proxmox.com>
8 weeks agoMerge pull request #68394 from yuvalif/wip-yuval-75928
Yuval Lifshitz [Sun, 26 Apr 2026 14:38:24 +0000 (17:38 +0300)]
Merge pull request #68394 from yuvalif/wip-yuval-75928

rgw/kafka: setting maximum batch size when sending messages

8 weeks agodoc: document ceph nvmeof CLI subcommands for target configuration 68618/head
oduameh [Sun, 26 Apr 2026 13:22:55 +0000 (14:22 +0100)]
doc: document ceph nvmeof CLI subcommands for target configuration

Replaces verbose podman run container commands with native ceph nvmeof
CLI subcommands. The nvmeof-cli container approach is preserved as an
alternative in a note block, with a clarification that its option names
differ from the ceph nvmeof CLI.

Fixes: https://tracker.ceph.com/issues/76042
Signed-off-by: Emmanuel Ameh <oduameh@users.noreply.github.com>
Signed-off-by: oduameh <oduameh@users.noreply.github.com>
8 weeks agoMerge pull request #68127 from Matan-B/wip-matanb-seastore-conflict-user-lba
Matan Breizman [Sun, 26 Apr 2026 07:36:53 +0000 (10:36 +0300)]
Merge pull request #68127 from Matan-B/wip-matanb-seastore-conflict-user-lba

crimson/os/seastore: Prepare no invalidation to LBA user txns

Reviewed-by: Xuehan Xu <xuxuehan@qianxin.com>
8 weeks agotest/rgw/kafka: make batch size tests manual 68394/head
Yuval Lifshitz [Sun, 26 Apr 2026 07:15:03 +0000 (07:15 +0000)]
test/rgw/kafka: make batch size tests manual

The tests are currently failing in teuthology.

Signed-off-by: Yuval Lifshitz <ylifshit@ibm.com>
8 weeks agorgw/kafka: setting maximum batch size when sending messages
Yuval Lifshitz [Mon, 20 Apr 2026 14:06:25 +0000 (14:06 +0000)]
rgw/kafka: setting maximum batch size when sending messages

Fixes: https://tracker.ceph.com/issues/75928
Signed-off-by: Yuval Lifshitz <ylifshit@ibm.com>