]> git-server-git.apps.pok.os.sepia.ceph.com Git - ceph.git/log
ceph.git
5 days agoosd/SnapMapper::update_snaps() to handle a missing OBJ_ record 68707/head
Ronen Friedman [Thu, 30 Apr 2026 16:46:58 +0000 (16:46 +0000)]
osd/SnapMapper::update_snaps() to handle a missing OBJ_ record

by falling back to add_oid() instead of silently creating an
inconsistent state (OBJ_ without matching SNA_ entries). This
was observed on replicas that had recently recovered objects:
the snap mapper entries created during recovery were not visible
to a subsequent snap-trim repop's update_snaps() call, leaving
the clone with no snap mapper entries. Scrub would then detect
and report the inconsistency as an error.

Promote snap mapper remove_oid/clear_snaps logging to dout(10)
and add apply_op_stats tracing to aid diagnosis of any remaining
stat or snap mapper drift.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
5 days agoMerge pull request #68626 from shraddhaag/wip-shraddhaag-btree-node-courotuine
Shraddha Agrawal [Mon, 4 May 2026 18:27:57 +0000 (23:57 +0530)]
Merge pull request #68626 from shraddhaag/wip-shraddhaag-btree-node-courotuine

seastore/omap_manager/btree: change node insert/del funcs to coroutines

5 days agoMerge pull request #68625 from shraddhaag/wip-shraddhaag-btree-manager-coroutine
Shraddha Agrawal [Mon, 4 May 2026 18:27:44 +0000 (23:57 +0530)]
Merge pull request #68625 from shraddhaag/wip-shraddhaag-btree-manager-coroutine

seastore/omap_manager/btree: change omap manager funcs to coroutines

5 days agoMerge PR #68211 into main
Venky Shankar [Mon, 4 May 2026 15:18:05 +0000 (20:48 +0530)]
Merge PR #68211 into main

* refs/pull/68211/head:
doc/cephfs/fs-volumes.rst: Correct volume creation with pre-existing pools

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Ville Ojamo <git2233+ceph@ojamo.eu>
5 days agoMerge PR #64561 into main
Patrick Donnelly [Mon, 4 May 2026 15:16:33 +0000 (08:16 -0700)]
Merge PR #64561 into main

* refs/pull/64561/head:
ceph-exporter: fix systemd unit

Reviewed-by: Patrick Donnelly <pdonnell@ibm.com>
Reviewed-by: Kefu Chai <k.chai@proxmox.com>
5 days agoMerge pull request #68694 from rhcs-dashboard/bump-lodash
Afreen Misbah [Mon, 4 May 2026 14:23:31 +0000 (19:53 +0530)]
Merge pull request #68694 from rhcs-dashboard/bump-lodash

mgr/dashboard: Bump lodash

Reviewed-by: Naman Munet <nmunet@redhat.com>
6 days agoMerge pull request #68533 from xxhdx1985126/wip-76196
Matan Breizman [Mon, 4 May 2026 10:41:48 +0000 (13:41 +0300)]
Merge pull request #68533 from xxhdx1985126/wip-76196

crimson/osd/pg: skip replicas that's still missing the object being

Reviewed-by: Aishwarya Mathuria <amathuri@redhat.com>
Reviewed-by: Matan Breizman <mbreizma@redhat.com>
Reviewed-by: Mohit Agrawal <moagrawa@redhat.com>
6 days agoMerge pull request #67787 from SHASHI9705/fix-notification-footer-alignment
Afreen Misbah [Mon, 4 May 2026 08:39:17 +0000 (14:09 +0530)]
Merge pull request #67787 from SHASHI9705/fix-notification-footer-alignment

dashboard: align notifications footer to bottom when body has fewer items

Reviewed-by: Afreen Misbah <afreen@ibm.com>
6 days agoseastore/omap_manager/btree: change omap manager funcs to coroutines 68625/head
Shraddha Agrawal [Mon, 27 Apr 2026 06:31:17 +0000 (12:01 +0530)]
seastore/omap_manager/btree: change omap manager funcs to coroutines

This commit changes funcs in BTree OMap manager to coroutines. Apart
from cleaner code that's easier to follow this is done to fix ASan
heap-use-after-free asserts.

Example QA job with the error: https://pulpito.ceph.com/shraddhaag-2026-04-20_07:04:25-crimson-rados-main-distro-debug-trial/164374/
Signed-off-by: Shraddha Agrawal <shraddha.agrawal000@gmail.com>
6 days agoMerge pull request #68479 from Naveenaidu/wip-naveen-revert-67299-and-68111
Naveen Naidu [Mon, 4 May 2026 06:13:44 +0000 (11:43 +0530)]
Merge pull request #68479 from Naveenaidu/wip-naveen-revert-67299-and-68111

Revert common/version: strip trailing newline from vendor release file

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
6 days agoseastore/omap_manager/btree: change node insert/del funcs to coroutines 68626/head
Shraddha Agrawal [Mon, 27 Apr 2026 07:33:14 +0000 (13:03 +0530)]
seastore/omap_manager/btree: change node insert/del funcs to coroutines

This commit changes OMapLeafNode and OMapInnerNode funcs to coroutines
to improve readability and prevent any ASan heap-use-after-free asserts.
Signed-off-by: Shraddha Agrawal <shraddha.agrawal000@gmail.com>
6 days agoMerge pull request #66695 from bluikko/wip-doc-rados-ops-health-checks-fixes4
bluikko [Mon, 4 May 2026 05:33:10 +0000 (12:33 +0700)]
Merge pull request #66695 from bluikko/wip-doc-rados-ops-health-checks-fixes4

doc/rados: Fix minor issues, improvements in health-checks.rst (4 of x)

7 days agoMerge PR #67978 into main
Patrick Donnelly [Sat, 2 May 2026 23:21:55 +0000 (16:21 -0700)]
Merge PR #67978 into main

* refs/pull/67978/head:
rgw/kafka: honor verify-ssl and separate cached connections

Reviewed-by: Yuval Lifshitz <ylifshit@redhat.com>
7 days agoMerge PR #67536 into main
Patrick Donnelly [Sat, 2 May 2026 23:21:19 +0000 (16:21 -0700)]
Merge PR #67536 into main

* refs/pull/67536/head:
qa/multisite: enable the multisite test for oidc.
rgw/oidc: plumb RGWObjVersionTracker through load/store for race detection
rgw/oidc rados: add rgwrados::oidcs namespace abstraction for cls_user for accounts.
rgw/rest-oidc: Forward all oidc mutation request to master zone.
rgw/oidc: add rgwrados::oidc interface to support multisite.

Reviewed-by: Casey Bodley <cbodley@redhat.com>
Reviewed-by: Shilpa Jagannath <smanjara@redhat.com>
8 days agoMerge pull request #68701 from shraddhaag/wip-shraddhaag-fix-omap-log
Shraddha Agrawal [Sat, 2 May 2026 11:52:10 +0000 (17:22 +0530)]
Merge pull request #68701 from shraddhaag/wip-shraddhaag-fix-omap-log

sestore/omap_manager/btree: prevent heap buffer overflow in log

8 days agoMerge pull request #67484 from ronen-fr/wip-rf-btreeunused
Ronen Friedman [Sat, 2 May 2026 09:53:16 +0000 (12:53 +0300)]
Merge pull request #67484 from ronen-fr/wip-rf-btreeunused

crimson/seastore: fixing some 'unused' warnings

Reviewed-by: Matan Breizman <mbreizma@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
8 days agoMerge pull request #68284 from aainscow/rmissing_second_fix
Alex Ainscow [Fri, 1 May 2026 16:11:07 +0000 (17:11 +0100)]
Merge pull request #68284 from aainscow/rmissing_second_fix

    osd: Allow multiple objects with same version in missing list.

Reviewed-by: Bill Scales <bill_scales@uk.ibm.com>
Reviewed-by: Josh Durgin <jdurgin@ibm.com>
8 days agoMerge pull request #67974 from aainscow/twiddle_twiddle
Alex Ainscow [Fri, 1 May 2026 16:03:58 +0000 (17:03 +0100)]
Merge pull request #67974 from aainscow/twiddle_twiddle

osd: Twiddle should create a full sized vector for optimized EC

Reviewed-by: Bill Scales <bill_scales@uk.ibm.com>
Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
9 days agoMerge pull request #68474 from ronen-fr/wip-rf-smallsegs-crimson
Ronen Friedman [Fri, 1 May 2026 11:51:20 +0000 (14:51 +0300)]
Merge pull request #68474 from ronen-fr/wip-rf-smallsegs-crimson

crimson/os,tools,qa: fix crimson-objectstore-tool tests

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
9 days agoMerge pull request #68697 from aainscow/wip-update-to-test-harness
Alex Ainscow [Fri, 1 May 2026 07:16:30 +0000 (08:16 +0100)]
Merge pull request #68697 from aainscow/wip-update-to-test-harness

osd/test: Add EC peering test infrastructure and recovery test cases

Reviewed-by: Bill Scales <bill_scales@uk.ibm.com>
9 days agoMerge pull request #68437 from aainscow/ceph_test_rados_docs
Alex Ainscow [Fri, 1 May 2026 07:09:57 +0000 (08:09 +0100)]
Merge pull request #68437 from aainscow/ceph_test_rados_docs

Add documentation for ceph_test_rados

Reviewed-by: Bill Scales <bill_scales@uk.ibm.com>
Reviewed-by: Anthony D'Atri
9 days agotest: Add help to ceph_test_rados 68437/head
Alex Ainscow [Fri, 17 Apr 2026 07:52:38 +0000 (08:52 +0100)]
test: Add help to ceph_test_rados

Basic help text to compliment the full docs in the
previous commit.

Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
9 days agodocs: Add documentation for ceph_test_rados
Alex Ainscow [Fri, 17 Apr 2026 07:50:27 +0000 (08:50 +0100)]
docs: Add documentation for ceph_test_rados

No documentation existed for ceph_test_rados.

This commit adds that documentation, as generated by Claude Code.

Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
9 days agoMerge PR #68665 into main
Patrick Donnelly [Thu, 30 Apr 2026 18:33:28 +0000 (11:33 -0700)]
Merge PR #68665 into main

* refs/pull/68665/head:
doc/start/os-recommendations: update for Umbrella and future releases

Reviewed-by: Casey Bodley <cbodley@redhat.com>
Reviewed-by: Anthony D Atri <anthony.datri@gmail.com>
9 days agosestore/omap_manager/btree: prevent heap buffer overflow in log 68701/head
Shraddha Agrawal [Wed, 29 Apr 2026 15:59:07 +0000 (21:29 +0530)]
sestore/omap_manager/btree: prevent heap buffer overflow in log

This commit fixes a heap overflow in omap_btree_node_impl when
logging the full bufferlist. This issue was already tracked in
https://tracker.ceph.com/issues/71524. To prevent this from happening,
we log the length of the bufferlist instead of the full log.

Signed-off-by: Shraddha Agrawal <shraddha.agrawal000@gmail.com>
9 days agoMerge PR #68640 into main
Patrick Donnelly [Thu, 30 Apr 2026 17:26:05 +0000 (10:26 -0700)]
Merge PR #68640 into main

* refs/pull/68640/head:
script/ptl-tool: source githumap from main branch

Reviewed-by: John Mulligan <jmulligan@redhat.com>
9 days agoMerge pull request #68007 from cbodley/wip-75722
Casey Bodley [Thu, 30 Apr 2026 16:54:10 +0000 (12:54 -0400)]
Merge pull request #68007 from cbodley/wip-75722

rgw/iam: User/Group/Role apis map ECANCELED to ERR_CONCURRENT_MODIFICATION

Reviewed-by: Shilpa Jagannath <smanjara@redhat.com>
9 days agoosd/test: Add EC peering test infrastructure and recovery test cases 68697/head
Alex Ainscow [Mon, 27 Apr 2026 13:24:45 +0000 (14:24 +0100)]
osd/test: Add EC peering test infrastructure and recovery test cases

This commit enhances the EC peering test framework and adds test cases
for erasure-coded pool recovery scenarios:

NOTE: Many of the tests cases are disabled as they recreate certain
problems. Later commits will enable these tests and fix the production
issues, but under different PRs.

Test Infrastructure Improvements:
- Add MockStore wrapper with read error injection capabilities for testing
  error handling in EC recovery
- Enhance ECPeeringTestFixture with recovery callback verification
- Add support for pg_upmap to better simulate OSD placement
- Implement write_attribute() for testing partial vs full stripe writes
- Add read_shard_object_info() to verify on-disk version consistency
- Improve logging with missing object stats (m=, u=, mbc=)
- Add support for doing object recovery in Fast EC.
- Add set_config() helper for runtime configuration changes
- Preserve xinfo features when marking OSDs up/down
- Fix pg_temp handling for EC pools with optimizations

Mock Object Enhancements:
- Update MockPGBackendListener with recovery callback tracking
- Add on_local_recover, on_peer_recover, on_global_recover tracking
- Implement proper stats publishing (pg_stats_publish)
- Add is_missing_object() implementation
- Enhance should_send_op() with async_recovery_target logic
- Add apply_stats() to update PeeringState statistics

Test Cases Added:
- ECRecoveryTest: Verifies recovery with missing objects after OSD failure
- ECSequentialOSDFailoverTest: Tests sequential OSD failure/recovery cycles
- MultiObjectRecoveryReadCrash: Reproduces bug #75432 (multi-object reads)
- RollbackVersionMismatch: Reproduces bug #76213 (version mismatch)
- RollbackAfterMixedBlockedWrites: Reproduces bug #75211 (rollback issues)

These tests validate EC recovery mechanisms including:
- Object version tracking across shards
- Recovery callback invocation (local, peer, global)
- Handling of read errors during recovery
- Rollback behavior after blocked writes
- Multi-object recovery with partial failures

Assisted-by: IBM Bob, using Claude Sonnet
Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
9 days agodoc/start/os-recommendations: update for Umbrella and future releases 68665/head
Patrick Donnelly [Tue, 28 Apr 2026 22:25:44 +0000 (15:25 -0700)]
doc/start/os-recommendations: update for Umbrella and future releases

Overhaul the OS recommendations documentation to reflect deployment
practices and map out the support matrices for upcoming releases through
Ceph X (24.x).

Key changes include:

* Emphasized container-based deployments: Added a new section strongly
  recommending containerized deployments via `cephadm` over legacy
  package-based installations to simplify upgrades and avoid host-level
  dependency conflicts.
* Expanded support tables: Updated the Platforms and Container Hosts
  tables to include Umbrella (21.x), Vampire (22.x), W (23.x), and
  X (24.x). Removed EOL releases like Reef.
* Added EOL visibility: Included End-of-Life dates for Linux
  distributions and anticipated EOL dates for Ceph releases to help
  administrators plan lifecycle events.
* Updated OS targets: Added support tracking for Ubuntu 24.04 (Noble),
  Ubuntu 26.04, Ubuntu 28.04, Rocky Linux 10, and Rocky Linux 11.
* Addressed CentOS transition: Added a warning that CentOS 10+ will no
  longer be built or tested by upstream. Documented that Rocky Linux 10
  is the new default container base image for Umbrella, while clarifying
  that the bare-metal host OS can remain any supported distribution.
* Added horizontal upgrade guidance: Introduced a new section outlining
  safe "horizontal" bare-metal OS upgrade paths (e.g., CentOS 9 to
  Rocky 10, Ubuntu 22.04 to 24.04) so users can safely migrate their
  nodes outside of Ceph version upgrade windows.

AI-Assisted: Gemini Pro, through numerous prompts
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
10 days agoMerge pull request #68453 from mheler/wip-coroutine-cloud-transition
J. Eric Ivancich [Thu, 30 Apr 2026 14:17:00 +0000 (10:17 -0400)]
Merge pull request #68453 from mheler/wip-coroutine-cloud-transition

rgw/lc: add coroutine support for cloud-transition and cloud-restore

Reviewed-by: Casey Bodley <cbodley@redhat.com>
10 days agoMerge pull request #67856 from cbodley/wip-75568
Casey Bodley [Thu, 30 Apr 2026 14:12:07 +0000 (10:12 -0400)]
Merge pull request #67856 from cbodley/wip-75568

rgw/beast: add frontend option 'tls_groups'

Reviewed-by: Matt Benjamin <mbenjamin@redhat.com>
10 days agomgr/dashboard: Bump lodash 68694/head
Afreen Misbah [Thu, 30 Apr 2026 14:07:11 +0000 (19:37 +0530)]
mgr/dashboard: Bump lodash

Fixes https://tracker.ceph.com/issues/76370

Signed-off-by: Afreen Misbah <afreen@ibm.com>
10 days agoMerge pull request #68622 from yuvalif/wip-yuval-76262
Yuval Lifshitz [Thu, 30 Apr 2026 13:33:58 +0000 (16:33 +0300)]
Merge pull request #68622 from yuvalif/wip-yuval-76262

rgw/notifications: relax topic names validation

10 days agoMerge pull request #66170 from kchheda3/wip-fix-account-acls-backward-compatbility
Casey Bodley [Thu, 30 Apr 2026 13:14:17 +0000 (09:14 -0400)]
Merge pull request #66170 from kchheda3/wip-fix-account-acls-backward-compatbility

rgw/account: Support backward compatibility for s3:PutAcls calls for users migrated to account.

Reviewed-by: Casey Bodley <cbodley@redhat.com>
10 days agoMerge pull request #67962 from smanjara/wip-async-lock
Casey Bodley [Thu, 30 Apr 2026 13:13:23 +0000 (09:13 -0400)]
Merge pull request #67962 from smanjara/wip-async-lock

rgw/multisite: convert lock/unlock coroutines to use aio_operate

Reviewed-by: Casey Bodley <cbodley@redhat.com>
10 days agoMerge pull request #68055 from cbodley/wip-rgw-req-state-keys
Casey Bodley [Thu, 30 Apr 2026 13:09:30 +0000 (09:09 -0400)]
Merge pull request #68055 from cbodley/wip-rgw-req-state-keys

rgw: authorization avoids sal::Object::get_instance()

Reviewed-by: Pritha Srivastava <prsrivas@redhat.com>
10 days agoMerge pull request #68210 from lumir-sliva/rgw/ratelimit-response-improvements
Casey Bodley [Thu, 30 Apr 2026 13:07:22 +0000 (09:07 -0400)]
Merge pull request #68210 from lumir-sliva/rgw/ratelimit-response-improvements

rgw: add Retry-After header and configurable rate-limit response

Reviewed-by: Ville Ojamo <git2233+ceph@ojamo.eu>
Reviewed-by: Casey Bodley <cbodley@redhat.com>
10 days agoMerge pull request #66146 from tobias-urdin/keystone-cache-miss
Casey Bodley [Thu, 30 Apr 2026 12:51:50 +0000 (08:51 -0400)]
Merge pull request #66146 from tobias-urdin/keystone-cache-miss

rgw/keystone: perf counter for cache hit wrong

Reviewed-by: Casey Bodley <cbodley@redhat.com>
10 days agoMerge pull request #68553 from yaelazulay-redhat/issue_76176_ceph_mgr_fail_or_active_...
Redouane Kachach [Thu, 30 Apr 2026 12:19:15 +0000 (14:19 +0200)]
Merge pull request #68553 from yaelazulay-redhat/issue_76176_ceph_mgr_fail_or_active_ceph_mgr_restart_causes_unnecessary_client_files_recreation_on_admin_hosts

cephadm: ceph mgr fail or active ceph mgr restart causes unnecessary …

Reviewed-by: Redouane Kachach <rkachach@ibm.com>
Reviewed-by: Adam King <adking@redhat.com>
10 days agoMerge pull request #67417 from webalexeu/feat/mgmt_sso_improvements
Redouane Kachach [Thu, 30 Apr 2026 11:10:42 +0000 (13:10 +0200)]
Merge pull request #67417 from webalexeu/feat/mgmt_sso_improvements

mgr/dashboard: Improve oauth2 sso configuration

Reviewed-by: Redouane Kachach <rkachach@ibm.com>
Reviewed-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>
10 days agoMerge pull request #68452 from soumyakoduri/wip-skoduri-restore-crash
Soumya Koduri [Thu, 30 Apr 2026 11:08:00 +0000 (16:38 +0530)]
Merge pull request #68452 from soumyakoduri/wip-skoduri-restore-crash

rgw/cloud-restore: Fix the restore workers' shutdown order

Reviewed by: Casey Bodley <cbodley@redhat.com>
Reviewed by: Matthew N. Heler <matthew.heler@hotmail.com>

10 days agoMerge pull request #68658 from rhcs-dashboard/pool-permissions
Afreen Misbah [Thu, 30 Apr 2026 09:58:13 +0000 (15:28 +0530)]
Merge pull request #68658 from rhcs-dashboard/pool-permissions

mgr/dashboard: Update permissions for pool-manager role

Reviewed-by: Nizamudeen A <nia@redhat.com>
10 days agoMerge pull request #67049 from adk3798/cgroup-cleanup-retry
Redouane Kachach [Thu, 30 Apr 2026 08:48:49 +0000 (10:48 +0200)]
Merge pull request #67049 from adk3798/cgroup-cleanup-retry

cephadm: retry cleaning old cgroups when it fails

Reviewed-by: Redouane Kachach <rkachach@ibm.com>
10 days agorgw/notifications: relax topic names validation 68622/head
Yuval Lifshitz [Sun, 26 Apr 2026 15:17:54 +0000 (15:17 +0000)]
rgw/notifications: relax topic names validation

Fixes: https://tracker.ceph.com/issues/76262
Signed-off-by: Yuval Lifshitz <ylifshit@ibm.com>
10 days agoMerge pull request #61826 from ShwetaBhosale1/fix_issue_69861_NFS_commands_to_enable_...
Redouane Kachach [Thu, 30 Apr 2026 08:20:57 +0000 (10:20 +0200)]
Merge pull request #61826 from ShwetaBhosale1/fix_issue_69861_NFS_commands_to_enable_disable_ops_limiting

mgr/nfs: NFS cluster and export commands to enable and disable ops control

Reviewed-by: Redouane Kachach <rkachach@ibm.com>
Reviewed-by: Adam King <adking@redhat.com>
10 days agoMerge pull request #67720 from ShwetaBhosale1/fix_issue_74970_update_haproxy.cfg_to_s...
Redouane Kachach [Thu, 30 Apr 2026 08:19:45 +0000 (10:19 +0200)]
Merge pull request #67720 from ShwetaBhosale1/fix_issue_74970_update_haproxy.cfg_to_support_nfs_active_active_deployment

mgr/cephadm: Update haproxy.cfg template to support nfs active active deployment

Reviewed-by: Redouane Kachach <rkachach@ibm.com>
10 days agoMerge pull request #68657 from guits/fix-generic-activate-tpm2
Guillaume Abrioux [Thu, 30 Apr 2026 08:12:11 +0000 (10:12 +0200)]
Merge pull request #68657 from guits/fix-generic-activate-tpm2

ceph-volume: raw activate should ignore lvm backed OSD devices

10 days agoMerge pull request #68670 from guits/cv-fix-tpm2-pcrs
Guillaume Abrioux [Thu, 30 Apr 2026 08:11:29 +0000 (10:11 +0200)]
Merge pull request #68670 from guits/cv-fix-tpm2-pcrs

ceph-volume: make TPM2 PCR policy configurable (default to PCR 7)

10 days agoMerge pull request #68538 from yaelazulay-redhat/issue_75448_during_upgrade_an_error_...
Redouane Kachach [Thu, 30 Apr 2026 08:03:43 +0000 (10:03 +0200)]
Merge pull request #68538 from yaelazulay-redhat/issue_75448_during_upgrade_an_error_is_printed_when_inspecting_the_new_ceph_image_for_the_first_time

cephadm: During the upgrade, when inspecting the new ceph image for t…

Reviewed-by: Redouane Kachach <rkachach@ibm.com>
Reviewed-by: Adam King <adking@redhat.com>
10 days agoMerge pull request #68638 from kginonredhat/issue-76185-enable-mgmt-gateway-on-a...
Redouane Kachach [Thu, 30 Apr 2026 08:00:47 +0000 (10:00 +0200)]
Merge pull request #68638 from kginonredhat/issue-76185-enable-mgmt-gateway-on-a-FIPS-cluster-failed

Issue 76185 enable mgmt gateway on a fips cluster failed

Reviewed-by: Redouane Kachach <rkachach@ibm.com>
10 days agoMerge pull request #67984 from cheese-cakee/wip-75416-fix-log-req-id
Yuval Lifshitz [Thu, 30 Apr 2026 07:59:38 +0000 (10:59 +0300)]
Merge pull request #67984 from cheese-cakee/wip-75416-fix-log-req-id

rgw/logging: use trans_id for standard access log record

10 days agoMerge pull request #67825 from bluikko/wip-doc-rados-spelling3
bluikko [Thu, 30 Apr 2026 07:05:36 +0000 (14:05 +0700)]
Merge pull request #67825 from bluikko/wip-doc-rados-spelling3

doc/rados: Fix spelling errors (3 of 3)

10 days agocrimson/osd/pg: unify the current should_send_op method implementation 68533/head
Xuehan Xu [Wed, 22 Apr 2026 02:14:58 +0000 (10:14 +0800)]
crimson/osd/pg: unify the current should_send_op method implementation

Fixes: https://tracker.ceph.com/issues/76196
Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
10 days agoMerge PR #68639 into main
Patrick Donnelly [Wed, 29 Apr 2026 21:56:45 +0000 (14:56 -0700)]
Merge PR #68639 into main

* refs/pull/68639/head:
script/ptl-tool: get git dir via git command

Reviewed-by: John Mulligan <jmulligan@redhat.com>
10 days agoMerge PR #68641 into main
Patrick Donnelly [Wed, 29 Apr 2026 20:39:07 +0000 (13:39 -0700)]
Merge PR #68641 into main

* refs/pull/68641/head:
script/ptl-tool: add option to not create a tag

Reviewed-by: John Mulligan <jmulligan@redhat.com>
10 days agoMerge PR #68655 into main
Patrick Donnelly [Wed, 29 Apr 2026 19:29:38 +0000 (12:29 -0700)]
Merge PR #68655 into main

* refs/pull/68655/head:
script/ptl-tool: allow PR numbers as GH urls

Reviewed-by: John Mulligan <jmulligan@redhat.com>
10 days agoscript/ptl-tool: source githumap from main branch 68640/head
Patrick Donnelly [Mon, 27 Apr 2026 20:08:56 +0000 (16:08 -0400)]
script/ptl-tool: source githumap from main branch

To make it authoritative.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
10 days agodoc/cephfs/fs-volumes.rst: Correct volume creation with pre-existing pools 68211/head
adatri [Sat, 4 Apr 2026 00:42:20 +0000 (20:42 -0400)]
doc/cephfs/fs-volumes.rst: Correct volume creation with pre-existing pools

Signed-off-by: adatri <anthony.datri@gmail.com>
10 days agoMerge pull request #68186 from cbodley/wip-75534
Casey Bodley [Wed, 29 Apr 2026 17:15:53 +0000 (13:15 -0400)]
Merge pull request #68186 from cbodley/wip-75534

rgw: CompleteMultipartUpload can fail with 404 NoSuchUpload

Reviewed-by: Mark Kogan <mkogan@redhat.com>
10 days agoMerge pull request #68010 from phlogistonjohn/jjm-smb-mgr-incorrect-type-err
John Mulligan [Wed, 29 Apr 2026 15:24:56 +0000 (11:24 -0400)]
Merge pull request #68010 from phlogistonjohn/jjm-smb-mgr-incorrect-type-err

smb: improve smb mgr module resource type error handling

Reviewed-by: Avan Thakkar <athakkar@redhat.com>
Reviewed-by: Shachar Sharon <ssharon@redhat.com>
11 days agoMerge pull request #68648 from rhcs-dashboard/76288-fix-ec-profile-pool
Afreen Misbah [Wed, 29 Apr 2026 13:44:07 +0000 (19:14 +0530)]
Merge pull request #68648 from rhcs-dashboard/76288-fix-ec-profile-pool

mgr/dashboard : Fixes EC profile used pool empty

Reviewed-by: Afreen Misbah <afreen@ibm.com>
Reviewed-by: Devika Babrekar <devika.babrekar@ibm.com>
11 days agoceph-volume: make TPM2 PCR policy configurable (default to PCR 7) 68670/head
Guillaume Abrioux [Wed, 29 Apr 2026 09:17:23 +0000 (11:17 +0200)]
ceph-volume: make TPM2 PCR policy configurable (default to PCR 7)

tpm enrollment for dmcrypt OSDs is hardcoded to systemd-cryptenroll
--tpm2-pcrs 9+12 which ties the LUKS key to initrd and kernel
command line measurements, which is brittle on RHEL image mode
systems: after a bootc switch, the kernel, initrd, or cmdline often
change, the PCRs move, and the volume won't unlock until you re-enroll
or fall back to another key.

typical error:

```
Apr 27 14:17:25 ceph-jx5fq20u bash[4289]: Running command: nsenter --mount=/rootfs/proc/1/ns/mnt --ipc=/rootfs/proc/1/ns/ipc --net=/rootfs/proc/1/ns/net --uts=/rootfs/proc/1/ns/uts /usr/lib/systemd/systemd-cryptsetup attach M3zE7r-qsGZ-xs0T-610d-SJNZ-U89x-J0cJq8 /dev/ceph-cac05fb6-51d3-4a60-9fc1-4958c568b433/osd-block-b1a495a0-e1a4-4888-baf9-7990f45f1e56 - tpm2-device=auto,discard,headless=true,nofail
Apr 27 14:17:26 ceph-jx5fq20u ceph-e5520e2c-420d-11f1-a7b9-5254001191fb-osd-0-activate[4300]:  stderr: Failed to unseal secret using TPM2: Operation not permitted
Apr 27 14:17:26 ceph-jx5fq20u bash[4289]:  stderr: Failed to unseal secret using TPM2: Operation not permitted
```

The patch makes the PCR set configurable and defaults to 7 so bootc style
deployments behave correctly.

Fixes: https://tracker.ceph.com/issues/76318
Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>
11 days agomgr/dashboard: Update permissions for pool-manager role 68658/head
Afreen Misbah [Tue, 28 Apr 2026 16:55:32 +0000 (22:25 +0530)]
mgr/dashboard: Update permissions for pool-manager role

Fixes https://tracker.ceph.com/issues/76307

-  says denied access when clicked on create pool table action
- this was happening due to the failing monitor API added for stretch cluster configuration
- also updates overview nav permissions

Signed-off-by: Afreen Misbah <afreen@ibm.com>
11 days agodoc/rados: Fix spelling errors (3 of 3) 67825/head
Ville Ojamo [Mon, 16 Mar 2026 16:54:06 +0000 (23:54 +0700)]
doc/rados: Fix spelling errors (3 of 3)

Signed-off-by: Ville Ojamo <git2233+ceph@ojamo.eu>
11 days agoceph-volume: raw activate should ignore lvm backed OSD devices 68657/head
Guillaume Abrioux [Tue, 28 Apr 2026 15:10:59 +0000 (17:10 +0200)]
ceph-volume: raw activate should ignore lvm backed OSD devices

the generic activate (`ceph-volume activate`) runs the
raw path before LVM. Raw.activate was walking lsblk / raw
list entries and could hit block devices that are actually
logical volumes from `ceph-volume lvm prepare` or `lvm batch`
(with ceph lvm tags on the lv).
That made raw activation poke at LVM backed OSDs instead of
leaving it to `lvm activate`.

with this commit ceph-volume now builds the set of LV paths
that carry those tags once (`lvs` via ceph_volume_lvm_prepare_lv_paths)
and skip any candidate path that matches, so only real raw
OSDs go through the 'raw activate path'.

Also, we now pass `with_tpm` through luks_open() calls for db and
wal so encrypted metadata uses the same systemd-cryptsetup path
as the block LV when ceph.with_tpm is set.

Fixes: https://tracker.ceph.com/issues/76305
Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>
11 days agoMerge pull request #68561 from bluikko/wip-doc-rados-troubleshooting-mon-improve
bluikko [Wed, 29 Apr 2026 05:53:02 +0000 (12:53 +0700)]
Merge pull request #68561 from bluikko/wip-doc-rados-troubleshooting-mon-improve

doc/rados: improve troubleshooting-mon.rst

11 days agoscript/ptl-tool: allow PR numbers as GH urls 68655/head
Patrick Donnelly [Tue, 28 Apr 2026 14:55:06 +0000 (10:55 -0400)]
script/ptl-tool: allow PR numbers as GH urls

For easier pasting.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
11 days agoscript/ptl-tool: get git dir via git command 68639/head
Patrick Donnelly [Mon, 27 Apr 2026 19:37:43 +0000 (15:37 -0400)]
script/ptl-tool: get git dir via git command

Rather than a manual process.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
12 days agoMerge pull request #66258 from aainscow/read_only_execs
Alex Ainscow [Tue, 28 Apr 2026 12:56:07 +0000 (13:56 +0100)]
Merge pull request #66258 from aainscow/read_only_execs

osd/rados/rgw/cephfs: Modernize cls interface with compile time safety

Reviewed-by: Bill Scales <bill_scales@uk.ibm.com>
Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Adam Emerson <aemerson@ibm.com>
12 days agoMerge pull request #68026 from rhcs-dashboard/fix-theme
Afreen Misbah [Tue, 28 Apr 2026 12:45:35 +0000 (18:15 +0530)]
Merge pull request #68026 from rhcs-dashboard/fix-theme

mgr/dashboard: Enable gray 10 theme as per carbon standards

Reviewed-by: Abhishek Desai <abhishek.desai1@ibm.com>
12 days agoMerge pull request #68577 from cbodley/wip-74398
Casey Bodley [Tue, 28 Apr 2026 12:00:01 +0000 (08:00 -0400)]
Merge pull request #68577 from cbodley/wip-74398

rgw: read_obj_policy() consults s3:prefix when deciding between 403/404

Reviewed-by: Oguzhan Ozmen <oozmen@bloomberg.net>
12 days agocrimson/seastore: fixing some 'unused' warnings 67484/head
Ronen Friedman [Tue, 24 Feb 2026 12:59:54 +0000 (12:59 +0000)]
crimson/seastore: fixing some 'unused' warnings

in btree_types compilation.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
12 days agomgr/cephadm: replace md5_hash with FIPS-safe config_hash 68638/head
Kobi Ginon [Mon, 27 Apr 2026 19:08:54 +0000 (22:08 +0300)]
mgr/cephadm: replace md5_hash with FIPS-safe config_hash

Replace md5_hash() usages in cephadm dependency hashing with an
algorithm-agnostic config_hash() helper. config_hash() is backed by
SHA-256, making dependency hash generation unconditionally FIPS-safe
while preserving change-detection behavior.

Fixes: https://tracker.ceph.com/issues/76185
Signed-off-by: Kobi Ginon <kginon@redhat.com>
12 days agoMerge pull request #66993 from ceph/crimson-pg-subcommands
Kautilya Tripathi [Tue, 28 Apr 2026 11:14:10 +0000 (16:44 +0530)]
Merge pull request #66993 from ceph/crimson-pg-subcommands

crimson: add pg subcommands support in CLI

Reviewed-by: Aishwarya Mathuria aishwarya.mathuria@ibm.com
Reviewed-by: Kefu Chai tchaikov@gmail.com
12 days agoMerge pull request #68502 from ifed01/wip-ifed-more-zoned-remove
Igor Fedotov [Tue, 28 Apr 2026 09:59:08 +0000 (12:59 +0300)]
Merge pull request #68502 from ifed01/wip-ifed-more-zoned-remove

os/bluestore: remove obsolete "zoned" freelist type

Reviewed-by: Adam Kupczyk <akupczyk@ibm.com>
Reviewed-by: Jaya Prakash <jayaprakash@ibm.com>
12 days agodoc/rados: improve troubleshooting-mon.rst 68561/head
Ville Ojamo [Wed, 22 Apr 2026 06:51:34 +0000 (13:51 +0700)]
doc/rados: improve troubleshooting-mon.rst

Don't ceph tell mon_status and then claim it passes the help command.
Improve language and link to cephadm doc on asok usage. Add label and
note about accessing asok from the host in troubleshooting.rst.
Capitalize and use double backticks consistently.
Add some missing articles and other minor word changes.
Fix indentation.
Use ref and link definitions consistently, use automatic bold.
Use privileged prompts for CLI commands where necessary.
Remove spaces at end of lines and change tabs to four spaces.

Signed-off-by: Ville Ojamo <git2233+ceph@ojamo.eu>
12 days agomgr/dashboard: Fixed modal forms background color 68026/head
Afreen Misbah [Mon, 13 Apr 2026 23:09:51 +0000 (04:39 +0530)]
mgr/dashboard: Fixed modal forms background color

Signed-off-by: Afreen Misbah <afreen@ibm.com>
12 days agomgr/dashboard: Fix grid issues in notifications page and password form
Afreen Misbah [Thu, 2 Apr 2026 22:27:59 +0000 (03:57 +0530)]
mgr/dashboard: Fix grid issues in notifications page and password form

Signed-off-by: Afreen Misbah <afreen@ibm.com>
12 days agomgr/dashboard: Add gray10 theme base color to all pages
Afreen Misbah [Fri, 27 Mar 2026 16:06:38 +0000 (21:36 +0530)]
mgr/dashboard: Add gray10 theme base color to all pages

- applies #f4f4f4 - $background to all pages as base page
- earlier the base color of page was white
- also updates tabs/navs/tables css to adapt
- some fixes of spacings in alerts tabs, nvmeof

Signed-off-by: Afreen Misbah <afreen@ibm.com>
12 days agomgr/dashboard: Add gray10 theme background to overview and rgw page
Afreen Misbah [Fri, 27 Mar 2026 09:16:27 +0000 (14:46 +0530)]
mgr/dashboard: Add gray10 theme background to overview and rgw page

Fixes https://tracker.ceph.com/issues/75752

Signed-off-by: Afreen Misbah <afreen@ibm.com>
12 days agomgr/dashboard: Remove dashboard overrides
Afreen Misbah [Thu, 26 Mar 2026 13:34:56 +0000 (19:04 +0530)]
mgr/dashboard: Remove dashboard overrides

- we have responsive layout now so removing overrides
- also removing duplicate spacings css

Signed-off-by: Afreen Misbah <afreen@ibm.com>
12 days agomgr/dashboard: Remove modal defaults
Afreen Misbah [Thu, 26 Mar 2026 13:31:43 +0000 (19:01 +0530)]
mgr/dashboard: Remove modal defaults

Signed-off-by: Afreen Misbah <afreen@ibm.com>
12 days agomgr/dashboard: Remove tooltip and popover defaults
Afreen Misbah [Thu, 26 Mar 2026 13:25:18 +0000 (18:55 +0530)]
mgr/dashboard: Remove tooltip and popover defaults

Fixes https://tracker.ceph.com/issues/75410

These defaults are not required as carbon adds blackish color to tooltips and moving forward we want to align to CDS.
If anything breaks then add / fix in the used component

Signed-off-by: Afreen Misbah <afreen@ibm.com>
12 days agomgr/dashboard: Enable gray 10 theme as per carbon standards
Afreen Misbah [Thu, 26 Mar 2026 13:01:54 +0000 (18:31 +0530)]
mgr/dashboard: Enable gray 10 theme as per carbon standards

- this keeps only branding related colors and removes other colors

Signed-off-by: Afreen Misbah <afreen@ibm.com>
12 days agoMerge pull request #68540 from nbalacha/wip-nbalacha-76206
Yuval Lifshitz [Tue, 28 Apr 2026 07:35:08 +0000 (10:35 +0300)]
Merge pull request #68540 from nbalacha/wip-nbalacha-76206

rgw/bucket-logging: handle SigV2 presigned URLs

12 days agomgr/dashboard : Fixes EC profile used pool empty 68648/head
Abhishek Desai [Tue, 28 Apr 2026 07:15:16 +0000 (12:45 +0530)]
mgr/dashboard : Fixes EC profile used pool empty
fixes : https://tracker.ceph.com/issues/76288
Signed-off-by: Abhishek Desai <abhishek.desai1@ibm.com>
12 days agoMerge pull request #68424 from NitzanMordhai/wip-nitzan-rados-perf-test-epel10-pdsh...
Shraddha Agrawal [Tue, 28 Apr 2026 06:29:05 +0000 (11:59 +0530)]
Merge pull request #68424 from NitzanMordhai/wip-nitzan-rados-perf-test-epel10-pdsh-missing

qa/tasks/cbt: install pdsh from el9 RPMs on el10 systems

12 days agocrimson/qa/objectstore-tool: reduce segments size 68474/head
Ronen Friedman [Mon, 20 Apr 2026 15:10:46 +0000 (15:10 +0000)]
crimson/qa/objectstore-tool: reduce segments size

used in testing. This translates into more segments, which helps
in preventing test failures due to insufficient free segments for mounting.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
12 days agoqa/tasks: add timeout to 'GC' ceph_objectstore_tool calls
Ronen Friedman [Sat, 18 Apr 2026 16:32:03 +0000 (16:32 +0000)]
qa/tasks: add timeout to 'GC' ceph_objectstore_tool calls

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
12 days agoqa/tasks/ceph_objectstore_tool.py: add gc_before_restart option
Ronen Friedman [Thu, 16 Apr 2026 18:02:21 +0000 (18:02 +0000)]
qa/tasks/ceph_objectstore_tool.py: add gc_before_restart option

The objectstore tool tests restart the OSDs without allowing enough
time for GC to run, which can lead to no-OOL-segments conditions on restart. This
adds a gc_before_restart option to the test config, which when set
to true will run crimson-objectstore-tool --op gc on each OSD
before restarting them.

Fixes: https://tracker.ceph.com/issues/73101
Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
12 days agocrimson/tools/objectstore: add GC operation to crimson-objectstore-tool
Ronen Friedman [Thu, 16 Apr 2026 17:58:09 +0000 (17:58 +0000)]
crimson/tools/objectstore: add GC operation to crimson-objectstore-tool

This adds a GC operation to the crimson-objectstore-tool, allowing
us to trigger GC cycles on demand during testing. This will
help reduce segment pressure and avoid 'no-segments' conditions.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
12 days agocrimson/os: add GC operation to Seastore
Ronen Friedman [Thu, 16 Apr 2026 17:55:22 +0000 (17:55 +0000)]
crimson/os: add GC operation to Seastore

Will be used to force immediate GC cycles in Seastore during testing, to
reduce segment pressure and avoid missing-OOL-segments conditions.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
12 days agoMerge pull request #68046 from rhcs-dashboard/fix-service-name
Afreen Misbah [Mon, 27 Apr 2026 20:49:05 +0000 (02:19 +0530)]
Merge pull request #68046 from rhcs-dashboard/fix-service-name

mgr/dashboard: Fix service name disablemnt in servcie form UI

Reviewed-by: Nizamudeen A <nia@redhat.com>
12 days agoscript/ptl-tool: add option to not create a tag 68641/head
Patrick Donnelly [Mon, 27 Apr 2026 20:25:27 +0000 (16:25 -0400)]
script/ptl-tool: add option to not create a tag

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
12 days agoMerge pull request #68416 from aclamk/aclamk-fcm-health-warnings
Jaya Prakash [Mon, 27 Apr 2026 19:51:06 +0000 (01:21 +0530)]
Merge pull request #68416 from aclamk/aclamk-fcm-health-warnings

bluestore, extblkdev: Now plugins can raise health warnings

Reviewed-by: Igor Fedotov <igor.fedotov@croit.io>
12 days agoMerge pull request #68376 from aclamk/aclamk-rocksdb-prio-fix
Jaya Prakash [Mon, 27 Apr 2026 19:50:28 +0000 (01:20 +0530)]
Merge pull request #68376 from aclamk/aclamk-rocksdb-prio-fix

kv/rocksdb: Fix priority of rocksdb cache perf counters

Reviewed-by: Jaya Prakash <jayaprakash@ibm.com>
12 days agoMerge pull request #68578 from phlogistonjohn/jjm-codeowners-smb
Dan Mick [Mon, 27 Apr 2026 16:59:28 +0000 (09:59 -0700)]
Merge pull request #68578 from phlogistonjohn/jjm-codeowners-smb

CODEOWNERS: add an smb group for various smb related files

12 days agocephadm: replace call_throws with call in command_inspect_image 68538/head
yaelazulay-redhat [Wed, 22 Apr 2026 06:37:57 +0000 (09:37 +0300)]
cephadm: replace call_throws with call in command_inspect_image

Problem:
During the upgrade, when inspecting the new ceph image for the first time, an error is printed to the ceph-mgr log instead of displaying a user-friendly message.

Root cause: During an upgrade, inspect-image is called on each node to check if the target image exists locally before pulling it. This flow, where inspect-image always precedes the pull, occurs on nodes other than the first.

Code Fixes:
1. src/cephadm/cephadm.py:
   Replace call_throws with call in command_inspect_image. call_throws raises a RuntimeError on any non-zero exit code, producing a full traceback in the logs. call returns the exit code instead of raising, so the function exits cleanly with errno.ENOENT when the image is not found.

Fixes: https://tracker.ceph.com/issues/75448
Signed-off-by: Yael Azulay <yazulay@redhat.com>
12 days agocephadm: convert lists back to tuples when loading last_client_files 68553/head
yaelazulay-redhat [Wed, 22 Apr 2026 16:08:22 +0000 (19:08 +0300)]
cephadm: convert lists back to tuples when loading last_client_files

Problem: ceph mgr fail or active ceph mgr restart causes unnecessary client files recreation on _admin hosts. Files such as /etc/ceph/ceph.conf and /etc/ceph/ceph.client.admin.keyring are rewritten even when their content has not changed.

Root cause:
update_client_file() stores client file metadata as a Python tuple (digest, mode, uid, gid).
When save_host() persists this to the mon store via json.dumps(), the tuple is serialized as a JSON array since JSON has no tuple type.
On mgr failover or restart, cache.load() deserializes the data with json.loads(), which returns a Python list instead of a tuple.
The comparison in _write_client_files(): match = old_files[path] == (digest, mode, uid, gid) then compares a list (from JSON) against a tuple (freshly built), which always evaluates to False.
This causes every client file to be rewritten on every mgr failover or restart.

Code Fixes:
1. src/pybind/mgr/cephadm/inventory.py:
    convert the deserialized lists back to tuples when loading last_client_files

Fixes: https://tracker.ceph.com/issues/76176
Signed-off-by: Yael Azulay <yazulay@redhat.com>
12 days agoMerge pull request #68278 from cheese-cakee/wip-rgw-inline-help-rm-delete
Yuval Lifshitz [Mon, 27 Apr 2026 15:55:44 +0000 (18:55 +0300)]
Merge pull request #68278 from cheese-cakee/wip-rgw-inline-help-rm-delete

rgw: align period/zone commands in radosgw-admin help