git-server-git.apps.pok.os.sepia.ceph.com Git - ceph.git/log
ceph.git
8 days agocrimson/os/seastore/transaction: should consider non-aligned remapped 67589/head
Xuehan Xu [Wed, 4 Mar 2026 15:12:51 +0000 (23:12 +0800)]
crimson/os/seastore/transaction: should consider non-aligned remapped
extents when updating paddrs for TRIM_DIRTY/CLEANER transactions

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
8 days agocrimson/os/seastore/lba: TRIM/CLEANER trans to adjust deltas of
Xuehan Xu [Tue, 24 Feb 2026 07:35:58 +0000 (15:35 +0800)]
crimson/os/seastore/lba: TRIM/CLEANER trans to adjust deltas of
LBALeafNodes when committing them.

This is to deal with the following scenario:
1. A client transaction modifies the value in the LBALeafNode, changing
   a field other than the pladdr;
2. A TRIM/CLEANER transaction concurrently modifies the pladdr for the
   same laddr_t.

In the old approach, the client transaction may overwrite the pladdr with
the outdated value after the TRIM/CLEANER transaction commits.

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
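
The race described in the commit above can be illustrated with a small
Python model. This is only a sketch under stated assumptions, not the
SeaStore implementation (which is C++): MapVal, its refcount field,
cleaner_commit, client_commit and the adjust_pending flag are all
hypothetical names chosen for illustration. With adjust_pending=False the
client's whole-value write restores the stale pladdr; with
adjust_pending=True the TRIM/CLEANER commit also patches the pending client
copy, so only the client's own field change survives.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class MapVal:
    """Hypothetical stand-in for an LBA leaf node mapping value."""
    pladdr: int    # physical address of the mapped extent
    refcount: int  # stands in for "some other field" the client changes


def cleaner_commit(stable, pending_views, laddr, new_pladdr, adjust_pending):
    """Commit a TRIM/CLEANER-style rewrite that moves the extent."""
    stable[laddr] = replace(stable[laddr], pladdr=new_pladdr)
    if adjust_pending:
        # New approach (as modelled here): patch the pladdr in every
        # pending client copy so it no longer carries the stale address.
        for view in pending_views:
            if laddr in view:
                view[laddr] = replace(view[laddr], pladdr=new_pladdr)


def client_commit(stable, pending):
    """Client commit writes back whole values, not single fields."""
    stable.update(pending)


def run(adjust_pending):
    laddr = 0x1000
    stable = {laddr: MapVal(pladdr=100, refcount=1)}
    # The client copies the value and changes only refcount.
    client_pending = {laddr: replace(stable[laddr], refcount=2)}
    # The rewrite commits first and moves the extent to pladdr 200.
    cleaner_commit(stable, [client_pending], laddr, 200, adjust_pending)
    client_commit(stable, client_pending)
    return stable[laddr]


old = run(adjust_pending=False)  # pladdr falls back to the stale value
new = run(adjust_pending=True)   # pladdr keeps the rewritten value
```

In this model the only difference between the two runs is whether the
rewrite commit touches the pending client copy, which is the behaviour the
commit message attributes to the new approach.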
8 days agocrimson/os/seastore: correct the exception condition when merging
Xuehan Xu [Wed, 11 Feb 2026 06:50:27 +0000 (14:50 +0800)]
crimson/os/seastore: correct the exception condition when merging
rewritten fixed kv nodes

Fixes: https://tracker.ceph.com/issues/74798
Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
8 days agocrimson/os/seastore/btree: make updates of lba leaf nodes ptrs
Xuehan Xu [Sun, 1 Mar 2026 04:42:49 +0000 (12:42 +0800)]
crimson/os/seastore/btree: make updates of lba leaf nodes ptrs
synchronous with contents updates

Since we need to merge the contents of lba leaf nodes when committing
trim/cleaner transactions, and we rely on the child ptrs to determine
whether to modify mappings of pending leaf nodes, we must make sure
the ptr updates and node content updates are synchronous.

See LBALeafNode::merge_content_to() for details.

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
8 days agocrimson/os/seastore/extent_pinboard: reset 2q_state when removing
Xuehan Xu [Thu, 11 Dec 2025 08:11:26 +0000 (16:11 +0800)]
crimson/os/seastore/extent_pinboard: reset 2q_state when removing
extents

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
8 days agocrimson/os/seastore/transaction_manager: block client transactions if
Xuehan Xu [Mon, 1 Dec 2025 09:44:45 +0000 (17:44 +0800)]
crimson/os/seastore/transaction_manager: block client transactions if
they conflict with rewriting transactions until the rewriting
transactions finishes

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
8 days agocrimson/os/seastore/cached_extent: treat extents under rewrite io as
Xuehan Xu [Mon, 1 Dec 2025 09:41:21 +0000 (17:41 +0800)]
crimson/os/seastore/cached_extent: treat extents under rewrite io as
stable too

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
8 days agocrimson/os/seastore: disable linked tree node operations when committing
Xuehan Xu [Wed, 26 Nov 2025 08:39:37 +0000 (16:39 +0800)]
crimson/os/seastore: disable linked tree node operations when committing
rewriting transactions

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
8 days agocrimson/os/seastore/cache: rewrite transactions don't invalidate other
Xuehan Xu [Fri, 21 Nov 2025 13:01:15 +0000 (21:01 +0800)]
crimson/os/seastore/cache: rewrite transactions don't invalidate other
transactions anymore

Fixes: https://tracker.ceph.com/issues/73070
Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
8 days agocrimson/os/seastore/cache: drop unused last_commit
Xuehan Xu [Fri, 21 Nov 2025 08:49:00 +0000 (16:49 +0800)]
crimson/os/seastore/cache: drop unused last_commit

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
8 days agocrimson/os/seastore/cache: since extent committer will also set
Xuehan Xu [Fri, 21 Nov 2025 07:20:24 +0000 (15:20 +0800)]
crimson/os/seastore/cache: since extent committer will also set
CachedExtent::prior_poffset, remove invalid asserts

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
8 days agocrimson/os/seastore/cache: unlink mutated extents from the stable
Xuehan Xu [Fri, 21 Nov 2025 07:13:11 +0000 (15:13 +0800)]
crimson/os/seastore/cache: unlink mutated extents from the stable
extents' transaction views when committing or invalidating the
transaction

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
8 days agocrimson/os/seastore/cache: add facilities to synchronize data and states
Xuehan Xu [Thu, 20 Nov 2025 09:16:57 +0000 (17:16 +0800)]
crimson/os/seastore/cache: add facilities to synchronize data and states
between rewriting transactions and others when committing

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
8 days agocrimson/os/seastore/seastore_types: define rewriting transactions
Xuehan Xu [Wed, 19 Nov 2025 10:21:52 +0000 (18:21 +0800)]
crimson/os/seastore/seastore_types: define rewriting transactions

These are the transactions that only rewrite extents and mutate lba
nodes, e.g. TRIM_DIRTY and CLEANER transactions.

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
8 days agocrimson/os/seastore/async_cleaner: renew backref cursors when they are
Xuehan Xu [Tue, 14 Oct 2025 02:48:09 +0000 (10:48 +0800)]
crimson/os/seastore/async_cleaner: renew backref cursors when they are
generated by backref retrieval transactions and used by reclaim
transactions

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
8 days agocrimson/os/seastore/lba_manager: make sure alloc_extents return viewable
Xuehan Xu [Sat, 11 Oct 2025 02:39:26 +0000 (10:39 +0800)]
crimson/os/seastore/lba_manager: make sure alloc_extents return viewable
mappings

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
8 days agocrimson/os/seastore/async_cleaner: avoid its header dependence on
Xuehan Xu [Tue, 14 Oct 2025 03:08:53 +0000 (11:08 +0800)]
crimson/os/seastore/async_cleaner: avoid its header dependence on
backref_manager.h

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
8 days agocrimson/os/seastore/btree_types: BtreeCursors don't hold local copies of
Xuehan Xu [Tue, 14 Oct 2025 03:05:19 +0000 (11:05 +0800)]
crimson/os/seastore/btree_types: BtreeCursors don't hold local copies of
lba/backref values

Since lba mapping values might change during the execution of
client transactions once we allow background transactions to be
submitted without invalidating client ones, we want to prevent other
components that use lba/backref mappings from keeping local copies, to
avoid potential problems.

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
8 days agocrimson/os/seastore/lba_mapping: don't allow classes above
Xuehan Xu [Mon, 6 Oct 2025 04:00:07 +0000 (12:00 +0800)]
crimson/os/seastore/lba_mapping: don't allow classes above
TransactionManager to retrieve lba mappings' paddrs.

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
8 days agoMerge pull request #67457 from Matan-B/wip-matanb-reactor-type
Matan Breizman [Mon, 9 Mar 2026 09:52:36 +0000 (11:52 +0200)]
Merge pull request #67457 from Matan-B/wip-matanb-reactor-type

crimson/admin/osd_admin: introduce reactor_backend command

Reviewed-by: Mohit Agrawal <moagrawa@redhat.com>
8 days agoMerge pull request #67689 from kotreshhr/cephfs-mirror-remove-extra-wait
Venky Shankar [Mon, 9 Mar 2026 09:33:47 +0000 (15:03 +0530)]
Merge pull request #67689 from kotreshhr/cephfs-mirror-remove-extra-wait

tools/cephfs_mirror: Remove additional wait in pop_dataq_entry

Reviewed-by: Venky Shankar <vshankar@redhat.com>
8 days agoMerge pull request #65858 from sajibreadd/wip-71167-scrub-improvement
Venky Shankar [Mon, 9 Mar 2026 07:52:43 +0000 (13:22 +0530)]
Merge pull request #65858 from sajibreadd/wip-71167-scrub-improvement

mds: scrub pins more inodes than the mds_cache_memory_limit

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Igor Fedotov <igor.fedotov@croit.io>
9 days agocrimson/admin/osd_admin: introduce reactor_backend command 67457/head
Matan Breizman [Sun, 22 Feb 2026 11:37:36 +0000 (11:37 +0000)]
crimson/admin/osd_admin: introduce reactor_backend command

follow-up to: https://github.com/ceph/ceph/pull/67165

Blocked by: https://github.com/scylladb/seastar/pull/3266

Signed-off-by: Matan Breizman <mbreizma@redhat.com>
9 days agoMerge pull request #65423 from benhanokh/split_head_simple
Gabriel Benhanokh [Sat, 7 Mar 2026 13:55:17 +0000 (15:55 +0200)]
Merge pull request #65423 from benhanokh/split_head_simple

rgw/dedup: split-head mechanism

10 days agoMerge pull request #67541 from afreen23/resiliency-card
Afreen Misbah [Sat, 7 Mar 2026 09:38:26 +0000 (15:08 +0530)]
Merge pull request #67541 from afreen23/resiliency-card

mgr/dashboard: Add resiliency card

Reviewed-by: Nizamudeen A <nia@redhat.com>
Reviewed-by: Aashish Sharma <aasharma@redhat.com>
10 days agoMerge pull request #67435 from avanthakkar/qos-clusterwide
Avan [Sat, 7 Mar 2026 06:23:13 +0000 (11:53 +0530)]
Merge pull request #67435 from avanthakkar/qos-clusterwide

mgr/smb: QoS bandwidth pass-through and burst_mult parameter

Reviewed-by: Anoop C S <anoopcs@cryptolab.net>
Reviewed-by: John Mulligan <jmulligan@redhat.com>
10 days agoMerge pull request #66181 from anshuman-agarwala/ppc64-ci
Zack Cerza [Fri, 6 Mar 2026 23:08:01 +0000 (16:08 -0700)]
Merge pull request #66181 from anshuman-agarwala/ppc64-ci

[run-make] Added flag for Dashboard and WError

10 days agoMerge PR #67630 into main
Patrick Donnelly [Fri, 6 Mar 2026 17:54:12 +0000 (12:54 -0500)]
Merge PR #67630 into main

* refs/pull/67630/head:
.github: limit what CI checks run for only doc/qa changes

Reviewed-by: Patrick Donnelly <pdonnell@ibm.com>
10 days agoMerge PR #67682 into main
Patrick Donnelly [Fri, 6 Mar 2026 17:43:41 +0000 (12:43 -0500)]
Merge PR #67682 into main

* refs/pull/67682/head:
qa: remove ceph-deploy configs with no effect
qa: remove long retired ceph-deploy

Reviewed-by: Casey Bodley <cbodley@redhat.com>
Reviewed-by: Ilya Dryomov <idryomov@redhat.com>
Reviewed-by: Yuri Weinstein <yweins@redhat.com>
Reviewed-by: Venky Shankar <vshankar@redhat.com>
10 days ago.github: limit what CI checks run for only doc/qa changes 67630/head
Shraddha Agrawal [Tue, 3 Mar 2026 06:47:04 +0000 (12:17 +0530)]
.github: limit what CI checks run for only doc/qa changes

Signed-off-by: Shraddha Agrawal <shraddha.agrawal000@gmail.com>
10 days agoMerge pull request #67115 from ShreeJejurikar/wip-74491
Yuval Lifshitz [Fri, 6 Mar 2026 15:54:55 +0000 (17:54 +0200)]
Merge pull request #67115 from ShreeJejurikar/wip-74491

rgw: Add bucket logging pytest test suite

10 days agoMerge pull request #65513 from gardran/wip-gardran-plogpg-optimize
Igor Fedotov [Fri, 6 Mar 2026 13:57:44 +0000 (16:57 +0300)]
Merge pull request #65513 from gardran/wip-gardran-plogpg-optimize

osd/PrimaryLogPG: avoid redundant container clones and lookups

Reviewed-by: Kefu Chai <tchaikov@gmail.com>
11 days agoMerge pull request #66368 from adamemerson/wip-neorados-leak
Ilya Dryomov [Fri, 6 Mar 2026 12:41:53 +0000 (13:41 +0100)]
Merge pull request #66368 from adamemerson/wip-neorados-leak

neorados: Fix Neorados CephContext leak and prevent future ones

Reviewed-by: Casey Bodley <cbodley@redhat.com>
Reviewed-by: Bill Scales <bill_scales@uk.ibm.com>
Reviewed-by: Patrick Donnelly <pdonnell@ibm.com>
11 days agorgw/dedup split-head 65423/head
benhanokh [Mon, 23 Feb 2026 09:26:17 +0000 (11:26 +0200)]
rgw/dedup split-head
Simplified check for shared-tail-objects.
Added test for copy after dedup
Use tail-ioctx when removing newly created tail-head

Signed-off-by: benhanokh <gbenhano@redhat.com>
11 days agorgw/dedup split-head
Gabriel BenHanokh [Mon, 1 Dec 2025 06:48:57 +0000 (06:48 +0000)]
rgw/dedup split-head
Limit Split-Head to RGW-Objects without existing tail-objects (i.e.
obj_size <= 4MB)

Signed-off-by: benhanokh <gbenhano@redhat.com>
11 days agorgw/dedup: split-head mechanism
Gabriel BenHanokh [Mon, 15 Sep 2025 19:01:02 +0000 (19:01 +0000)]
rgw/dedup: split-head mechanism
Split head object into 2 objects - one with attributes and no data and
a new tail-object with only data.
The new tail-object will be deduped (unlike the head objects which can't
be deduped).
We will split head for objects with size 16MB or less.

A few extra improvements:
Skip objects created by server-side-copy
Use reftag for comp-swap instead of manifest
Skip shared-manifest objects after reading attributes
Made max_obj_size_for_split and min_obj_size_for_dedup config values in
rgw.yaml.in

refined test: validate size after dedup
TBD: add rados ls -l to report object size in bulk to speed up the process
improved tests - verify refcounts are working, validate objects, remove
duplicates and then verify the last remaining object making sure it was
not deleted

Signed-off-by: Gabriel BenHanokh <gbenhano@redhat.com>
11 days agorados/src/common: use /etc/ceph_version to append vendor release version to the versi...
Naveen Naidu [Fri, 6 Mar 2026 11:39:09 +0000 (17:09 +0530)]
rados/src/common: use /etc/ceph_version to append vendor release version to the version string

11 days agotools/cephfs_mirror: Remove additional wait in pop_dataq_entry 67689/head
Kotresh HR [Fri, 6 Mar 2026 07:28:38 +0000 (12:58 +0530)]
tools/cephfs_mirror: Remove additional wait in pop_dataq_entry

An additional wait sneaked in while popping a job from
syncm's data_q. When the conditional wait was converted to
a timed wait as part of f6a6e781b887b01a640d6321a2c085577d9ba07e,
this should have been removed. The extra wait causes no
harm in most workflows but might cause issues when
the mirror daemon is stopped. So it should be removed.

This patch removes the extra cond wait.

Introduced-by: f6a6e781b887b01a640d6321a2c085577d9ba07e
Signed-off-by: Kotresh HR <khiremat@redhat.com>
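
The pattern the commit above restores, a single timed wait guarded by a
predicate, can be sketched in Python as follows. This is an illustrative
model only: the real code is C++ in cephfs_mirror, and the DataQueue class
and its push/stop/pop methods are hypothetical names. The point is that pop
performs exactly one timed wait, so a stop request or a timeout is never
followed by a second, unconditional wait.

```python
import threading
from collections import deque


class DataQueue:
    """Hypothetical job queue popped by sync worker threads."""

    def __init__(self):
        self._cond = threading.Condition()
        self._q = deque()
        self._stopping = False

    def push(self, job):
        with self._cond:
            self._q.append(job)
            self._cond.notify()

    def stop(self):
        with self._cond:
            self._stopping = True
            self._cond.notify_all()

    def pop(self, timeout=0.1):
        """Return the next job, or None on timeout or shutdown."""
        with self._cond:
            # One timed wait only: it returns as soon as a job is queued,
            # a stop is requested, or the timeout expires.
            self._cond.wait_for(lambda: self._q or self._stopping,
                                timeout=timeout)
            if self._stopping or not self._q:
                return None
            return self._q.popleft()


q = DataQueue()
q.push("job-1")
first = q.pop()               # a job is available, returned immediately
empty = q.pop(timeout=0.01)   # nothing queued, times out
q.stop()
after_stop = q.pop(timeout=5) # returns at once because stop was requested
```

Adding a second plain wait after the timed one would block a worker that
has already been told to stop, which matches the shutdown concern the
commit message raises.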
11 days agomgr/dashboard: Fix snapshot Api firing twice 67541/head
Afreen Misbah [Tue, 3 Mar 2026 16:45:48 +0000 (22:15 +0530)]
mgr/dashboard: Fix snapshot Api firing twice

- two subs being created

Signed-off-by: Afreen Misbah <afreen@ibm.com>
11 days agomgr/dashboard: Add data resiliency panel
Afreen Misbah [Thu, 26 Feb 2026 01:38:44 +0000 (07:08 +0530)]
mgr/dashboard: Add data resiliency panel

- adds table to show PG states and counts
- adds recovery IO, read/write IO

Signed-off-by: Afreen Misbah <afreen@ibm.com>
11 days agomgr/dashboard: Add data resiliency card
Afreen Misbah [Wed, 25 Feb 2026 15:18:57 +0000 (20:48 +0530)]
mgr/dashboard: Add data resiliency card

- shows data resiliency status
- shows active+clean PGs donut chart
- shows reasons for missing active+clean PGs

Fixes: https://tracker.ceph.com/issues/75067

Signed-off-by: Afreen Misbah <afreen@ibm.com>
11 days agoMerge pull request #67668 from bluikko/wip-doc-crimson-fix-and-improvements
bluikko [Fri, 6 Mar 2026 05:17:08 +0000 (12:17 +0700)]
Merge pull request #67668 from bluikko/wip-doc-crimson-fix-and-improvements

doc: Fix link and improve Crimson doc

11 days agodoc: Fix link and improve Crimson doc 67668/head
Ville Ojamo [Thu, 5 Mar 2026 06:02:55 +0000 (13:02 +0700)]
doc: Fix link and improve Crimson doc

Fix Seastar external link that was not working.
Capitalize consistently as Crimson, SeaStore in text.
Fix typos including in a label and in a ref using it.
Wrap text at column 80.
Remove unused highlight directive.
Fix article and hyphenation.
Try to reduce amount of commas in text and improve language.
Use already existing label and ref instead of section title for link.
Use confval role for configuration keys in text.
Use an autoclass reference instead of hardcoding URL.
Trim spaces at end of lines and convert tabs to spaces.
Use a colon instead of a hyphen pretending to be an em dash.

Signed-off-by: Ville Ojamo <git2233+ceph@ojamo.eu>
11 days agoqa: remove ceph-deploy configs with no effect 67682/head
Patrick Donnelly [Fri, 6 Mar 2026 02:07:08 +0000 (21:07 -0500)]
qa: remove ceph-deploy configs with no effect

Signed-off-by: Patrick Donnelly <pdonnell@ibm.com>
11 days agoqa: remove long retired ceph-deploy
Patrick Donnelly [Thu, 5 Mar 2026 20:50:16 +0000 (15:50 -0500)]
qa: remove long retired ceph-deploy

Long live cephadm!

Signed-off-by: Patrick Donnelly <pdonnell@ibm.com>
11 days agoqa/workunits/smb: update QoS tests for burst multipliers and bandwidth units 67435/head
Avan Thakkar [Mon, 2 Mar 2026 09:36:38 +0000 (15:06 +0530)]
qa/workunits/smb: update QoS tests for burst multipliers and bandwidth units

Signed-off-by: Avan Thakkar <athakkar@redhat.com>
11 days agodoc/mgr/smb: update QoS doc with burst multipliers and bandwidth units
Avan Thakkar [Mon, 2 Mar 2026 08:44:27 +0000 (14:14 +0530)]
doc/mgr/smb: update QoS doc with burst multipliers and bandwidth units

- Add read/write_burst_mult parameters (10-100 range, 15 default)
- Document human-readable bandwidth units (K,M,G,T)
- Add burst behavior explanation
- Remove obsolete delay parameters

Signed-off-by: Avan Thakkar <athakkar@redhat.com>
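
As a rough illustration of the two documented items in the commit above
(unit suffixes K, M, G, T and a burst multiplier with a 10-100 range and a
default of 15), the following Python sketch parses and validates such
values. It is not the mgr/smb code: parse_bandwidth and check_burst_mult
are hypothetical helpers, and treating the suffixes as binary multiples
(1024-based) of an integer count is an assumption the commit message does
not confirm.

```python
# Assumption: suffixes are binary multiples; the source does not say.
_UNITS = {"K": 1024, "M": 1024 ** 2, "G": 1024 ** 3, "T": 1024 ** 4}

DEFAULT_BURST_MULT = 15   # default stated in the commit message
BURST_MULT_RANGE = (10, 100)  # range stated in the commit message


def parse_bandwidth(text):
    """Convert a value such as '10M' or '512' to a plain integer."""
    text = text.strip().upper()
    if text and text[-1] in _UNITS:
        return int(text[:-1]) * _UNITS[text[-1]]
    return int(text)


def check_burst_mult(value=None):
    """Return a valid burst multiplier, falling back to the default."""
    if value is None:
        return DEFAULT_BURST_MULT
    low, high = BURST_MULT_RANGE
    if not low <= value <= high:
        raise ValueError(f"burst_mult must be in {low}-{high}, got {value}")
    return value
```

How the multiplier is applied to the configured bandwidth is not described
in the commit message, so the sketch stops at validation.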
11 days agomgr/smb: QoS bandwidth pass-through and burst_mult parameter
Avan Thakkar [Thu, 29 Jan 2026 07:18:59 +0000 (12:48 +0530)]
mgr/smb: QoS bandwidth pass-through and burst_mult parameter

Replace delay_max with burst_mult and add human-readable bandwidth
format support for QoS configuration.

Signed-off-by: Avan Thakkar <athakkar@redhat.com>
11 days agorgw: add teuthology integration for bucket logging tests 67115/head
ShreeJejurikar [Thu, 5 Mar 2026 15:23:47 +0000 (20:53 +0530)]
rgw: add teuthology integration for bucket logging tests

Signed-off-by: ShreeJejurikar <shreemj8@gmail.com>
11 days agorgw: add bucket logging pytest suite
ShreeJejurikar [Thu, 26 Feb 2026 07:57:55 +0000 (13:27 +0530)]
rgw: add bucket logging pytest suite

Add a pytest-based test suite for RGW bucket logging that exercises the
radosgw-admin bucket logging CLI commands (list, info, flush) and
verifies the associated S3-level cleanup behavior.

Fixes: https://tracker.ceph.com/issues/74491
Signed-off-by: ShreeJejurikar <shreemj8@gmail.com>
11 days agoMerge pull request #67672 from bluikko/wip-doc-start-rbd-improve
Ilya Dryomov [Thu, 5 Mar 2026 13:42:57 +0000 (14:42 +0100)]
Merge pull request #67672 from bluikko/wip-doc-start-rbd-improve

doc: Improve start/quick-rbd.rst

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
11 days agoMerge pull request #67571 from phlogistonjohn/jjm-smb-remotectl-local
John Mulligan [Thu, 5 Mar 2026 13:30:37 +0000 (08:30 -0500)]
Merge pull request #67571 from phlogistonjohn/jjm-smb-remotectl-local

smb: add remote-control local mode feature

Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: Avan Thakkar <athakkar@redhat.com>
Reviewed-by: Anoop C S <anoopcs@cryptolab.net>
Reviewed-by: Xavi Hernandez <xhernandez@gmail.com>
12 days agodoc: Improve start/quick-rbd.rst 67672/head
Ville Ojamo [Thu, 5 Mar 2026 09:02:42 +0000 (16:02 +0700)]
doc: Improve start/quick-rbd.rst

Remove mention of FAQ with a broken link.
Use ref for intra-docs links and add labels in destination documents.
Promptify all CLI example commands.
Use standard angle brackets for mandatory arguments in commands.
Remove an unused external link definition.
Trim spaces at end of lines and convert tabs to spaces.

Signed-off-by: Ville Ojamo <git2233+ceph@ojamo.eu>
12 days agoMerge pull request #67378 from ivancich/wip-add-datalog-error
anrao19 [Thu, 5 Mar 2026 10:06:18 +0000 (15:36 +0530)]
Merge pull request #67378 from ivancich/wip-add-datalog-error

rgw: only log errors to add_datalog_entry when error

12 days agoMerge pull request #67081 from adamemerson/wip-gcc16-clang21
Kefu Chai [Thu, 5 Mar 2026 09:32:49 +0000 (17:32 +0800)]
Merge pull request #67081 from adamemerson/wip-gcc16-clang21

Fixes for GCC 16 and Clang 21

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
Reviewed-by: Casey Bodley <cbodley@redhat.com>
Reviewed-by: Kefu Chai <k.chai@proxmox.com>
12 days agoMerge pull request #67658 from batrick/qa-symlinks-workflow
Ernesto Puerta [Thu, 5 Mar 2026 08:53:21 +0000 (09:53 +0100)]
Merge pull request #67658 from batrick/qa-symlinks-workflow

.github: mitigate possible "hackerbot-claw" exploit

12 days agoMerge pull request #66571 from NitzanMordhai/wip-nitzan-prometheus-HealthHistory...
NitzanMordhai [Thu, 5 Mar 2026 06:48:39 +0000 (08:48 +0200)]
Merge pull request #66571 from NitzanMordhai/wip-nitzan-prometheus-HealthHistory-deadlock

mgr/prometheus: Use RLock to fix deadlock in HealthHistory

12 days ago.github: mitigate possible "hackerbot-claw" exploit 67658/head
Patrick Donnelly [Wed, 4 Mar 2026 21:21:31 +0000 (16:21 -0500)]
.github: mitigate possible "hackerbot-claw" exploit

There's no reason to believe this script is actually vulnerable but
now it's best practice to avoid using pull_request_target.

Signed-off-by: Patrick Donnelly <pdonnell@ibm.com>
12 days agoMerge pull request #67653 from yuvalif/wip-yuval-75323
Yuval Lifshitz [Wed, 4 Mar 2026 19:23:47 +0000 (21:23 +0200)]
Merge pull request #67653 from yuvalif/wip-yuval-75323

test/rgw/kafka: fix kafka release to more recent one

12 days agodoc: document the new locally_enabled field 67571/head
John Mulligan [Fri, 27 Feb 2026 16:04:19 +0000 (11:04 -0500)]
doc: document the new locally_enabled field

Document the new locally_enabled field for the remote_control
subsection of the Cluster resource config.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
12 days agomgr/smb: configure smb service for new remote control local feature
John Mulligan [Mon, 23 Feb 2026 17:24:04 +0000 (12:24 -0500)]
mgr/smb: configure smb service for new remote control local feature

Signed-off-by: John Mulligan <jmulligan@redhat.com>
12 days agomgr/smb: add an option to enable the local variation of remotectl
John Mulligan [Mon, 23 Feb 2026 17:23:57 +0000 (12:23 -0500)]
mgr/smb: add an option to enable the local variation of remotectl

Signed-off-by: John Mulligan <jmulligan@redhat.com>
12 days agopython-common/smb: add the remote-control-local feature
John Mulligan [Mon, 23 Feb 2026 17:23:38 +0000 (12:23 -0500)]
python-common/smb: add the remote-control-local feature

See previous commit for the meaning of this feature flag.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
12 days agocephadm: add support for a remote control local socket
John Mulligan [Mon, 23 Feb 2026 17:23:06 +0000 (12:23 -0500)]
cephadm: add support for a remote control local socket

It's not an oxymoron, it's Remote Control Local Socket (tm)!
This allows processes on the ceph host to use a unix domain socket
without mTLS to communicate with the remote control sidecar server
in the samba service.

At the higher level we treat the 2nd listener as a "feature" even
though it really configures the same sidecar as "remote-control".
This way it's easy to have one of "remote-control",
"remote-control-local" or both in the service spec configuring the
smb service.

NOTE: This service does have the ability to verify that the client has
admin-ish access to ceph services by needing the client to pass
the ceph user name and key over the grpc headers.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
12 days agoMerge pull request #67615 from phlogistonjohn/jjm-exo-show-fix
John Mulligan [Wed, 4 Mar 2026 17:45:12 +0000 (12:45 -0500)]
Merge pull request #67615 from phlogistonjohn/jjm-exo-show-fix

smb: fix ceph smb show ceph.smb.ext.cluster

Reviewed-by: Anoop C S <anoopcs@cryptolab.net>
Reviewed-by: Avan Thakkar <athakkar@redhat.com>
Reviewed-by: Adam King <adking@redhat.com>
12 days agoMerge pull request #67534 from phlogistonjohn/jjm-smb-debug-opts
John Mulligan [Wed, 4 Mar 2026 17:44:32 +0000 (12:44 -0500)]
Merge pull request #67534 from phlogistonjohn/jjm-smb-debug-opts

smb: add debug level options to smb cluster resource

Reviewed-by: Xavi Hernandez <xhernandez@gmail.com>
Reviewed-by: Avan Thakkar <athakkar@redhat.com>
Reviewed-by: Anoop C S <anoopcs@cryptolab.net>
Reviewed-by: Adam King <adking@redhat.com>
12 days agotest/rgw/kafka: fix kafka release to more recent one 67653/head
Yuval Lifshitz [Wed, 4 Mar 2026 14:53:13 +0000 (14:53 +0000)]
test/rgw/kafka: fix kafka release to more recent one

Fixes: https://tracker.ceph.com/issues/75323
Signed-off-by: Yuval Lifshitz <ylifshit@ibm.com>
12 days agoMerge pull request #67225 from amathuria/wip-amat-fix-74504
Matan Breizman [Wed, 4 Mar 2026 13:34:40 +0000 (15:34 +0200)]
Merge pull request #67225 from amathuria/wip-amat-fix-74504

crimson/osd: fix PG splitting logic during map gaps

Reviewed-by: Matan Breizman <mbreizma@redhat.com>
13 days agoMerge pull request #67629 from idryomov/wip-75239
Ilya Dryomov [Wed, 4 Mar 2026 12:35:12 +0000 (13:35 +0100)]
Merge pull request #67629 from idryomov/wip-75239

qa: rbd_mirror_fsx_compare.sh doesn't error out as expected

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
13 days agoMerge pull request #66383 from bluikko/src-mgmt-gateway-spec-descs-python-common
Redouane Kachach [Wed, 4 Mar 2026 10:53:09 +0000 (11:53 +0100)]
Merge pull request #66383 from bluikko/src-mgmt-gateway-spec-descs-python-common

python-common: Improve mgmt-gateway descriptions in service_spec.py

Reviewed-by: Redouane Kachach <rkachach@ibm.com>
13 days agoMerge pull request #66758 from ajarr/wip-stagger-schedules
Ilya Dryomov [Wed, 4 Mar 2026 09:32:51 +0000 (10:32 +0100)]
Merge pull request #66758 from ajarr/wip-stagger-schedules

mgr/rbd_support: Stagger mirror snapshot and trash purge schedules

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
13 days agoMerge pull request #67590 from idryomov/wip-75200
Ilya Dryomov [Wed, 4 Mar 2026 09:22:20 +0000 (10:22 +0100)]
Merge pull request #67590 from idryomov/wip-75200

qa/tasks: make rbd_mirror_thrash inherit from ThrasherGreenlet

Reviewed-by: Miki Patel <miki.patel132@gmail.com>
13 days agoMerge pull request #65911 from kotreshhr/mirror-json-pretty
Kotresh HR [Wed, 4 Mar 2026 08:11:06 +0000 (13:41 +0530)]
Merge pull request #65911 from kotreshhr/mirror-json-pretty

mgr/mirroring: json pretty formatting

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Jos Collin <jcollin@redhat.com>
13 days agoMerge pull request #67599 from bluikko/wip-doc-start-get-involved-fixes
bluikko [Wed, 4 Mar 2026 05:50:15 +0000 (12:50 +0700)]
Merge pull request #67599 from bluikko/wip-doc-start-get-involved-fixes

doc/start: Update and fix get-involved.rst

13 days agopython-common: Improve mgmt-gateway descriptions in service_spec.py 66383/head
Ville Ojamo [Mon, 24 Nov 2025 09:56:48 +0000 (16:56 +0700)]
python-common: Improve mgmt-gateway descriptions in service_spec.py

Fix one description that referred to time while the nginx config doesn't
configure time.

Use consistent language. Remove reference to "multioption" which is a
simple bool.

Signed-off-by: Ville Ojamo <git2233+ceph@ojamo.eu>
13 days agomgr/rbd_support: Stagger mirror snapshot and trash purge schedules 66758/head
Ramana Raja [Mon, 29 Dec 2025 22:17:28 +0000 (17:17 -0500)]
mgr/rbd_support: Stagger mirror snapshot and trash purge schedules

Previously, multiple images or namespaces scheduled with the same
interval ran mirror snapshots or trash purges at around the same time,
creating spikes in cluster activity.

This change staggers scheduled jobs by:

- Adding a deterministic phase offset per image or namespace when no
  start-time is set.
- Picking a random element from the queue at each scheduled time, rather
  than always the first.

Together, these changes spread snapshot and trash purge operations more
evenly over time and improve cluster stability.

Fixes: https://tracker.ceph.com/issues/74288
Signed-off-by: Ramana Raja <rraja@redhat.com>
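
The two staggering ideas in the commit above can be sketched in Python as
follows. This is a minimal illustration under stated assumptions, not the
rbd_support module's code: phase_offset, next_run and pick_due are
hypothetical names, and deriving the offset from a SHA-256 hash of the
image or namespace name is one possible way to get a deterministic
per-entity phase. The commit message does not say how the offset is
computed.

```python
import hashlib
import random


def phase_offset(name, interval_s):
    """Deterministic offset in [0, interval_s) derived from the name."""
    digest = hashlib.sha256(name.encode()).digest()
    return int.from_bytes(digest[:8], "big") % interval_s


def next_run(now_s, name, interval_s):
    """First time >= now_s on the grid offset + k * interval_s."""
    off = phase_offset(name, interval_s)
    k = -(-(now_s - off) // interval_s)  # ceiling division
    return off + k * interval_s


def pick_due(due, rng=random):
    """Pick a random entry among those due at the same scheduled time."""
    return rng.choice(due)


interval = 3600
runs = {img: next_run(10_000, img, interval)
        for img in ("pool/img-a", "pool/img-b", "pool/img-c")}
```

Because the offset depends only on the name, an image keeps the same phase
across restarts, while different images with the same interval are spread
across it instead of all firing together.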
13 days agoMerge PR #67563 into main
Patrick Donnelly [Tue, 3 Mar 2026 17:03:46 +0000 (12:03 -0500)]
Merge PR #67563 into main

* refs/pull/67563/head:
script/redmine-upkeep: do not warn for backport PRs
script/redmine-upkeep: do not raise backport warnings for backports

Reviewed-by: Casey Bodley <cbodley@redhat.com>
13 days agoMerge pull request #67613 from ronen-fr/wip-rf-75270by1
Ronen Friedman [Tue, 3 Mar 2026 16:53:40 +0000 (18:53 +0200)]
Merge pull request #67613 from ronen-fr/wip-rf-75270by1

osd/scrub: extract_crc_from_bufferlist: fix off-by-1 error

Reviewed-by: Jon Bailey <jonathan.bailey1@ibm.com>
13 days agomgr/smb: reimplement part of the _search_resources function 67615/head
John Mulligan [Mon, 2 Mar 2026 21:09:16 +0000 (16:09 -0500)]
mgr/smb: reimplement part of the _search_resources function

Reimplement part of the _search_resources function to avoid using yet
another static mapping between the SMBResource type and its partner
entry type, which is one more place you can forget to update when you
add a new type. Now, the type mapping is based on the matcher class
and the type mapping function provided by the internal.py module.

Fixes: 5712016c2133870da3f704d8457358ad06efc87f
Signed-off-by: John Mulligan <jmulligan@redhat.com>
13 days agomgr/smb: rename func to map_resource_entry to make it public
John Mulligan [Mon, 2 Mar 2026 21:07:54 +0000 (16:07 -0500)]
mgr/smb: rename func to map_resource_entry to make it public

Rename the _map_resource_entry to map_resource_entry to make it a public
function and enable easier dynamic mapping between smb resource types
and their partner entry types.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
13 days agoMerge pull request #67400 from vmackie/victoriam-stretch-cluster
vmackie [Tue, 3 Mar 2026 14:42:57 +0000 (14:42 +0000)]
Merge pull request #67400 from vmackie/victoriam-stretch-cluster

dashboard: add location field to NVMeoF namespace and gateway group APIs

13 days agoMerge pull request #67301 from rhcs-dashboard/nfs-toggle-snapshot
Afreen Misbah [Tue, 3 Mar 2026 14:14:09 +0000 (19:44 +0530)]
Merge pull request #67301 from rhcs-dashboard/nfs-toggle-snapshot

mgr/dashboard: NFS: Toggle visibility of CephFS snapshots

Reviewed-by: Afreen Misbah <afreen@ibm.com>
Reviewed-by: Ankush Behl <cloudbehl@gmail.com>
Reviewed-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>
2 weeks agomgr/mirroring: json pretty formatting 65911/head
Kotresh HR [Wed, 18 Feb 2026 10:48:51 +0000 (16:18 +0530)]
mgr/mirroring: json pretty formatting

The output of the 'daemon status' and 'peer_list' commands
doesn't support the json-pretty format and isn't reader
friendly. This patch adds support for 'json-pretty'
when format='json-pretty' is passed.

Fixes: https://tracker.ceph.com/issues/73516
Signed-off-by: Kotresh HR <khiremat@redhat.com>
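
The difference between the two output formats in the commit above can be
shown with Python's standard json module. This is only a sketch: the
format_output helper and the sample status dictionary are hypothetical, and
the indent width and key sorting are assumptions rather than what the
mirroring module emits.

```python
import json


def format_output(data, fmt="json"):
    """Serialize data compactly, or indented for 'json-pretty'."""
    if fmt == "json-pretty":
        return json.dumps(data, indent=4, sort_keys=True)
    return json.dumps(data)


# Hypothetical sample payload, for illustration only.
status = {"daemon_id": 1, "filesystems": [{"name": "a", "peers": []}]}
compact = format_output(status)
pretty = format_output(status, "json-pretty")
```

Both strings decode to the same object; only the whitespace differs, so
existing consumers of the plain json format are unaffected.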
2 weeks agoosd/scrub: extract_crc_from_bufferlist: fix off-by-1 error 67613/head
Ronen Friedman [Mon, 2 Mar 2026 17:51:09 +0000 (19:51 +0200)]
osd/scrub: extract_crc_from_bufferlist: fix off-by-1 error

Fixes: https://tracker.ceph.com/issues/75270
Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
2 weeks agomds: scrub pins more inodes than the mds_cache_memory_limit 65858/head
sajibreadd [Thu, 9 Oct 2025 11:48:35 +0000 (13:48 +0200)]
mds: scrub pins more inodes than the mds_cache_memory_limit

When scrubbing a dirfrag we push children to the back of the scrub
stack. Instead we can follow the same strategy as for scrubbing a
directory and push children to the front of the scrub stack, and in
kick_off_scrubs always start scrubbing from the front of the stack. This
prevents ScrubStack from pinning a whole level of the file-system tree.

Fixes: https://tracker.ceph.com/issues/71167
Signed-off-by: Md Mahamudur Rahaman Sajib <mahamudur.sajib@croit.io>
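
Why the push direction matters in the commit above can be seen in a small
Python model. It is a simplified sketch, not the MDS ScrubStack: the
max_pinned function is hypothetical, the tree is assumed to be a complete
tree with a fixed fanout, and every entry sitting in the stack is counted
as one pinned inode. Entries are always taken from the front. Pushing
children to the back makes the traversal breadth-first, so the stack grows
to a whole tree level; pushing to the front makes it depth-first, so the
stack stays proportional to depth times fanout.

```python
from collections import deque


def max_pinned(fanout, depth, push_front):
    """Peak number of queued entries while walking a complete tree."""
    stack = deque([0])  # each entry is the depth of a queued directory
    peak = 1
    while stack:
        level = stack.popleft()       # always scrub from the front
        if level < depth:
            children = [level + 1] * fanout
            if push_front:
                stack.extendleft(children)  # depth-first order
            else:
                stack.extend(children)      # breadth-first order
        peak = max(peak, len(stack))
    return peak


breadth_first_peak = max_pinned(fanout=4, depth=5, push_front=False)
depth_first_peak = max_pinned(fanout=4, depth=5, push_front=True)
```

With a fanout of 4 and a depth of 5, the breadth-first order peaks at the
size of the deepest level (4**5 entries), while the depth-first order peaks
at depth * (fanout - 1) + 1 entries.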
2 weeks agoMerge pull request #65878 from edwinzrodriguez/ceph-wip-73424
Venky Shankar [Tue, 3 Mar 2026 10:56:18 +0000 (16:26 +0530)]
Merge pull request #65878 from edwinzrodriguez/ceph-wip-73424

mds: Refactor cache dump path formatting

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2 weeks agoMerge pull request #67335 from kotreshhr/fs_mirror_daemon_status
Venky Shankar [Tue, 3 Mar 2026 10:55:50 +0000 (16:25 +0530)]
Merge pull request #67335 from kotreshhr/fs_mirror_daemon_status

mgr/mirroring: Display mon_host and fsid in daemon status command

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2 weeks agoMerge pull request #66551 from salieri11/wip-igolikov-new-metrics-74135-73700
Venky Shankar [Tue, 3 Mar 2026 10:55:27 +0000 (16:25 +0530)]
Merge pull request #66551 from salieri11/wip-igolikov-new-metrics-74135-73700

mds: add new performance and subvolume utilization metrics

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2 weeks ago Merge pull request #67346 from rhcs-dashboard/custom-image-to-bootstrap
Nizamudeen A [Tue, 3 Mar 2026 10:50:08 +0000 (16:20 +0530)]
Merge pull request #67346 from rhcs-dashboard/custom-image-to-bootstrap

mgr/dashboard: support custom image for bootstrapping

Reviewed-by: Abhishek Desai <abhishek.desai1@ibm.com>
2 weeks ago qa/workunits/rbd: fix unbound variable in status() 67629/head
Ilya Dryomov [Mon, 2 Mar 2026 11:07:48 +0000 (12:07 +0100)]
qa/workunits/rbd: fix unbound variable in status()

It was missed in commit 5fe64fa806f3 ("qa: rbd_mirror.sh: change
parameters to cluster rather than daemon name").

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2 weeks ago qa/workunits/rbd: short-circuit status() if "ceph -s" fails
Ilya Dryomov [Sun, 1 Mar 2026 21:55:52 +0000 (22:55 +0100)]
qa/workunits/rbd: short-circuit status() if "ceph -s" fails

In mirror-thrash tests, status() can be invoked after one of the
clusters is effectively stopped due to a watchdog bark:

2026-03-01T22:27:38.633 INFO:tasks.daemonwatchdog.daemon_watchdog:thrasher.rbd_mirror.[cluster2] failed
2026-03-01T22:27:38.633 INFO:tasks.daemonwatchdog.daemon_watchdog:BARK! unmounting mounts and killing all daemons
...
2026-03-01T22:32:46.964 INFO:tasks.workunit.cluster1.client.mirror.trial199.stderr:+ status
2026-03-01T22:32:46.964 INFO:tasks.workunit.cluster1.client.mirror.trial199.stderr:+ local cluster daemon image_pool image_ns image
2026-03-01T22:32:46.964 INFO:tasks.workunit.cluster1.client.mirror.trial199.stderr:+ for cluster in ${CLUSTER1} ${CLUSTER2}

In this scenario all commands that are invoked from the loop body
are going to time out anyway.
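
The idea can be sketched as follows. This is an illustration only: the
actual helper is a shell function, and the Python names below
(cluster_reachable, status, collect) are hypothetical. It is also an
assumption that the short-circuit returns from status() entirely rather
than skipping just the unreachable cluster.

```python
import subprocess


def cluster_reachable(cluster, timeout=30):
    """Probe a cluster with "ceph -s"; treat any failure as unreachable."""
    try:
        result = subprocess.run(
            ["ceph", "--cluster", cluster, "-s"],
            capture_output=True,
            timeout=timeout,
        )
    except (subprocess.TimeoutExpired, OSError):
        return False
    return result.returncode == 0


def status(clusters, collect, reachable=cluster_reachable):
    """Collect per-cluster details, stopping at the first failed probe.

    collect is a hypothetical callable standing in for the commands in the
    loop body; it is never invoked for a cluster whose probe failed.
    """
    reports = {}
    for cluster in clusters:
        if not reachable(cluster):
            # Everything else in the loop body would time out as well.
            return reports, False
        reports[cluster] = collect(cluster)
    return reports, True
```

Passing the probe in as a parameter keeps the sketch testable without a
running cluster.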

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2 weeks ago qa: rbd_mirror_fsx_compare.sh doesn't error out as expected
Ilya Dryomov [Sun, 1 Mar 2026 16:45:51 +0000 (17:45 +0100)]
qa: rbd_mirror_fsx_compare.sh doesn't error out as expected

In mirror-thrash tests, one of the clusters can be effectively stopped
due to a watchdog bark while rbd_mirror_fsx_compare.sh is running and is
in the middle of the "wait for all images" loop:

2026-03-01T12:55:35.059 INFO:tasks.workunit.cluster1.client.mirror.trial055.stderr:+ retrying_seconds=1040
2026-03-01T12:55:35.060 INFO:tasks.workunit.cluster1.client.mirror.trial055.stderr:+ '[' 1040 -le 7200 ']'
2026-03-01T12:55:35.060 INFO:tasks.workunit.cluster1.client.mirror.trial055.stderr:++ rbd --cluster cluster2 --pool mirror ls
2026-03-01T12:55:35.060 INFO:tasks.workunit.cluster1.client.mirror.trial055.stderr:++ wc -l
2026-03-01T12:55:35.084 INFO:tasks.workunit.cluster1.client.mirror.trial055.stderr:+ '[' 290 -ge 292 ']'
2026-03-01T12:55:35.084 INFO:tasks.workunit.cluster1.client.mirror.trial055.stderr:+ sleep 10
...
2026-03-01T12:55:49.568 INFO:tasks.daemonwatchdog.daemon_watchdog:thrasher.rbd_mirror.[cluster2] failed
2026-03-01T12:55:49.568 INFO:tasks.daemonwatchdog.daemon_watchdog:BARK! unmounting mounts and killing all daemons

In this scenario "rbd ls" is going to time out repeatedly, turning the
loop into up to a ~60-hour sleep (up to 720 iterations with a 5-minute
timeout + 10-second sleep per iteration).
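
The estimate can be checked with a few lines of arithmetic. The iteration
count, timeout and sleep are the figures quoted above; the function name
worst_case_hours is only for illustration.

```python
def worst_case_hours(iterations, timeout_seconds, sleep_seconds):
    """Upper bound on loop duration when every command times out."""
    total_seconds = iterations * (timeout_seconds + sleep_seconds)
    return total_seconds / 3600


if __name__ == "__main__":
    # 720 iterations, 5-minute timeout, 10-second sleep per iteration
    print(worst_case_hours(720, 5 * 60, 10))
```

Each iteration costs 310 seconds in the worst case, so 720 iterations add
up to 223200 seconds, or 62 hours.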

Fixes: https://tracker.ceph.com/issues/75239
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2 weeks ago Merge pull request #67136 from JonBailey1993/stats_fix_part_1
SrinivasaBharathKanta [Tue, 3 Mar 2026 10:27:21 +0000 (15:57 +0530)]
Merge pull request #67136 from JonBailey1993/stats_fix_part_1

osd: Fix for num_bytes mismatch occurring from snapshot workloads with partial writes in fast_ec

2 weeks ago dashboard: add location field to NVMeoF namespace and gateway group APIs 67400/head
Victoria Mackie [Fri, 13 Feb 2026 21:40:01 +0000 (21:40 +0000)]
dashboard: add location field to NVMeoF namespace and gateway group APIs

Namespace location:
- Add location field to Namespace model in nvmeof.py
- Add location parameter to PATCH /api/nvmeof/subsystem/{nqn}/namespace/{nsid}
- Location can now be retrieved via GET and set via PATCH

Gateway group locations:
- Add locations array to gateway group endpoint response
- Extract locations from all gateways in a service group
- Add _get_gateway_locations() helper method using nvme-gw show command
- Locations appear in placement.locations for each service

Signed-off-by: Victoria Mackie <victoriam@uk.ibm.com>
2 weeks ago Merge pull request #67626 from tchaikov/wip-qa-remove-unused-var
Kefu Chai [Tue, 3 Mar 2026 10:08:56 +0000 (18:08 +0800)]
Merge pull request #67626 from tchaikov/wip-qa-remove-unused-var

qa/tasks/keycloak: drop unused local variable

Reviewed-by: Igor Fedotov <igor.fedotov@croit.io>
Reviewed-by: Avan Thakkar <athakkar@redhat.com>
2 weeks ago Merge pull request #67604 from amathuria/wip-amat-update-seastar-march-26
Matan Breizman [Tue, 3 Mar 2026 08:42:20 +0000 (10:42 +0200)]
Merge pull request #67604 from amathuria/wip-amat-update-seastar-march-26

seastar: bump up seastar submodule

Reviewed-by: Matan Breizman <mbreizma@redhat.com>
2 weeks ago Merge pull request #67227 from rhcs-dashboard/isolate-cherrypy
Nizamudeen A [Tue, 3 Mar 2026 07:14:50 +0000 (12:44 +0530)]
Merge pull request #67227 from rhcs-dashboard/isolate-cherrypy

mgr: isolated CherryPy to prevent global state sharing

Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Redouane Kachach <rkachach@redhat.com>
Reviewed-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Ankush Behl <cloudbehl@gmail.com>
2 weeks ago qa/tasks/keycloak: drop unused local variable 67626/head
Kefu Chai [Tue, 3 Mar 2026 06:54:36 +0000 (14:54 +0800)]
qa/tasks/keycloak: drop unused local variable

Regression introduced by ee710390d277784ddac3d70c9e11e427f46f363d
("rgw/tests: add os-specific java 1.7 install commands to keycloak
task").

```
283/322 Test #301: run-tox-qa ................................***Failed 92.31 sec
...
flake8: install_deps /ceph/qa> python -I -m pip install flake8
  flake8: commands[0] /ceph/qa> flake8 --select=F,E9 --exclude=venv,.tox
  ./tasks/keycloak.py:51:5: F841 local variable 'os_version' is assigned to but never used
```
Remove the unused os_version assignment to fix flake8 F841 in run-tox-qa.

Signed-off-by: Kefu Chai <k.chai@proxmox.com>
2 weeks ago Merge pull request #67291 from rhcs-dashboard/add-smb-share-qos
Nizamudeen A [Tue, 3 Mar 2026 06:43:03 +0000 (12:13 +0530)]
Merge pull request #67291 from rhcs-dashboard/add-smb-share-qos

mgr/dashboard: add SMB share QoS rate limiting

Reviewed-by: Dnyaneshwari talwekar <dtalweka@redhat.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>
Reviewed-by: Ankush Behl <cloudbehl@gmail.com>
2 weeks ago Merge pull request #67602 from bluikko/wip-doc-radosgw-adminops-fixes
bluikko [Tue, 3 Mar 2026 06:09:02 +0000 (13:09 +0700)]
Merge pull request #67602 from bluikko/wip-doc-radosgw-adminops-fixes

doc/radosgw: Fix markup in adminops.rst