]> git-server-git.apps.pok.os.sepia.ceph.com Git - ceph.git/log
ceph.git
2 weeks agoMerge pull request #67865 from tchaikov/doc-runtime-updatable
Kefu Chai [Wed, 18 Mar 2026 02:13:08 +0000 (10:13 +0800)]
Merge pull request #67865 from tchaikov/doc-runtime-updatable

doc: show runtime-updatable config metadata

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
2 weeks agodoc: show runtime-updatable config metadata 67865/head
Kefu Chai [Tue, 17 Mar 2026 10:33:42 +0000 (18:33 +0800)]
doc: show runtime-updatable config metadata

Extend the confval Sphinx extension to render whether an option
can be updated at runtime.

Use the same policy as Option::can_update_at_runtime() so the
documentation reflects both native config options and mgr module
options consistently.

Signed-off-by: Kefu Chai <k.chai@proxmox.com>
2 weeks agorgw: config options for mp lock error/delay injection
Casey Bodley [Fri, 13 Mar 2026 15:10:45 +0000 (11:10 -0400)]
rgw: config options for mp lock error/delay injection

rgw_mp_lock_inject_delay can inject delays in multipart complete to make
sure lock renewal requests get issued

rgw_mp_lock_inject_renewal_error can inject lock renewal errors to test
the error handling

Signed-off-by: Casey Bodley <cbodley@redhat.com>
2 weeks agorgw/rados: MPRadosSerializer spawns lock renewal coroutine
Casey Bodley [Wed, 11 Mar 2026 21:45:55 +0000 (17:45 -0400)]
rgw/rados: MPRadosSerializer spawns lock renewal coroutine

CompleteMultipartUpload depends on this lock to ensure consistency of
uploads and protect against data loss, so we should try very hard to
hold this lock as long as it takes to complete successfully

MPRadosSerializer accomplishes this by spawning a background lock
renewal coroutine. this coroutine is started during a successful call to
try_lock(), and stopped before unlock() releases the lock

Fixes: https://tracker.ceph.com/issues/75375
Signed-off-by: Casey Bodley <cbodley@redhat.com>
2 weeks agocommon/async: async_cond Executor defaults to any_io_executor
Casey Bodley [Mon, 16 Mar 2026 20:14:24 +0000 (16:14 -0400)]
common/async: async_cond Executor defaults to any_io_executor

Signed-off-by: Casey Bodley <cbodley@redhat.com>
2 weeks agorgw/sal: Object::get_serializer() takes optional_yield
Casey Bodley [Tue, 17 Mar 2026 16:54:18 +0000 (12:54 -0400)]
rgw/sal: Object::get_serializer() takes optional_yield

Signed-off-by: Casey Bodley <cbodley@redhat.com>
2 weeks agomgr/dashboard: Add capacity thresholds 67807/head
Afreen Misbah [Mon, 16 Mar 2026 14:20:51 +0000 (19:50 +0530)]
mgr/dashboard: Add capacity thresholds

Signed-off-by: Afreen Misbah <afreen@ibm.com>
2 weeks agorgw/sal: Serializer::try_lock() takes ceph::timespan
Casey Bodley [Wed, 11 Mar 2026 20:21:11 +0000 (16:21 -0400)]
rgw/sal: Serializer::try_lock() takes ceph::timespan

this duration ultimately gets passed down to cls_lock's set_duration()
function, which has overloads for both utime_t and ceph::timespan.
prefer ceph::timespan because it also works with boost asio timers

Signed-off-by: Casey Bodley <cbodley@redhat.com>
2 weeks agorgw/sal: StoreMPSerializer uses atomic bool locked
Casey Bodley [Thu, 12 Mar 2026 19:44:22 +0000 (15:44 -0400)]
rgw/sal: StoreMPSerializer uses atomic bool locked

make it safe for other threads to call clear_locked() and is_locked()

Signed-off-by: Casey Bodley <cbodley@redhat.com>
2 weeks agorgw: check for broken lock before multipart complete
Casey Bodley [Thu, 12 Mar 2026 14:39:02 +0000 (10:39 -0400)]
rgw: check for broken lock before multipart complete

if lock renewal fails, is_locked() will return false. check that just
before upload->complete() goes on to write/overwrite the head object,
and return the same ERR_INTERNAL_ERROR from lock contention

Fixes: https://tracker.ceph.com/issues/75375
Signed-off-by: Casey Bodley <cbodley@redhat.com>
2 weeks agomgr/dashboard: services page breaking for test_orchestrator module 67847/head
Naman Munet [Tue, 17 Mar 2026 13:33:28 +0000 (19:03 +0530)]
mgr/dashboard: services page breaking for test_orchestrator module

Fixes: https://tracker.ceph.com/issues/75561
Signed-off-by: Naman Munet <naman.munet@ibm.com>
2 weeks agoMerge pull request #67124 from batrick/i74652
Venky Shankar [Tue, 17 Mar 2026 09:57:46 +0000 (15:27 +0530)]
Merge pull request #67124 from batrick/i74652

 mds: improve debugging for snaprealms marked subvolume

Venky Shankar <vshankar@redhat.com>

2 weeks agoMerge pull request #67125 from batrick/multifs-auth-fix
Venky Shankar [Tue, 17 Mar 2026 09:57:25 +0000 (15:27 +0530)]
Merge pull request #67125 from batrick/multifs-auth-fix

qa: clean up and simplify client 2 caps

Venky Shankar <vshankar@redhat.com>
Reviewed-by: Jos Collin <jcollin@redhat.com>
2 weeks agoMerge pull request #67377 from chrisphoffman/wip-74934
Venky Shankar [Tue, 17 Mar 2026 09:56:43 +0000 (15:26 +0530)]
Merge pull request #67377 from chrisphoffman/wip-74934

client: During encryption of short case-insensitive file names, store raw ciphertext

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2 weeks agoMerge pull request #67496 from batrick/i75143
Venky Shankar [Tue, 17 Mar 2026 09:55:54 +0000 (15:25 +0530)]
Merge pull request #67496 from batrick/i75143

mds: use SimpleLock::WAIT_ALL for wait mask

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2 weeks agoMerge pull request #67556 from batrick/i70580
Venky Shankar [Tue, 17 Mar 2026 09:55:20 +0000 (15:25 +0530)]
Merge pull request #67556 from batrick/i70580

qa: allow multiple mgr sessions during eviction test

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2 weeks agoMerge pull request #67610 from kginonredhat/75267-service-is-not-getting-up-when...
Redouane Kachach [Tue, 17 Mar 2026 09:34:18 +0000 (10:34 +0100)]
Merge pull request #67610 from kginonredhat/75267-service-is-not-getting-up-when-being-added-with-ipv6-vip-address

Fixing issue of service mgmt-gateway not coming up when configured to…

Reviewed-by: Redouane Kachach <rkachach@ibm.com>
2 weeks agocrimson/os/seastore: handle duplicate keys in LogNode::remove_entry 64439/head
myoungwon oh [Sat, 7 Mar 2026 11:38:53 +0000 (20:38 +0900)]
crimson/os/seastore: handle duplicate keys in LogNode::remove_entry

Previously, LogNode::remove_entry returned early when a log_key was
found, assuming uniqueness. However, duplicate keys can exist in the
node if an older entry was previously removed.
This commit also adds a unit test to verify this scenario.

Signed-off-by: Myoungwon Oh <ohmyoungwon@gmail.com>
2 weeks agocrimson/os/seastore: reload head if modified
myoungwon oh [Tue, 3 Mar 2026 15:42:51 +0000 (00:42 +0900)]
crimson/os/seastore: reload head if modified

This commit also fixes the test case to verify that
the head is correctly allocated and updated
during omap_set_keys operations involving multiple keys.

Signed-off-by: Myoungwon Oh <ohmyoungwon@gmail.com>
2 weeks agocrimson/os/seastore, osd/PGLog: handle omap_iterate retry to avoid duplicate entries
myoungwon oh [Sat, 28 Feb 2026 04:38:16 +0000 (13:38 +0900)]
crimson/os/seastore, osd/PGLog: handle omap_iterate retry to avoid duplicate entries

Seastore omap_iterate may retry internally on conflicts, which can
cause PGLog to process the same entries multiple times when entries
are handled directly in the iteration callback.

Introduce a conflict hook in omap_iterate so callers can reset
iteration state on retry. PGLog now buffers entries during iteration and
applies process_entry() only after a successful pass, clearing the buffer
on retry to avoid duplicates.

Signed-off-by: Myoungwon Oh <ohmyoungwon@gmail.com>
2 weeks agocrimson/os/seastore: ensure data integrity with deep copy in omap_get_value
myoungwon oh [Fri, 27 Feb 2026 08:01:59 +0000 (17:01 +0900)]
crimson/os/seastore: ensure data integrity with deep copy in omap_get_value

Previously, omap_get_value could return a bufferlist pointing to
memory without guaranteed lifetime. This patch introduces LogNode::copy_t
to distinguish between DEEP and SHALLOW copies.

- Default get_value to DEEP copy for external safety.
- Use SHALLOW copy in internal paths (e.g., remove_kv) to maintain performance.
- Refactor LogManager::omap_get_value to simplify coroutine flow.

Signed-off-by: Myoungwon Oh <ohmyoungwon@gmail.com>
2 weeks agocrimson/os/seastore: ensure extent is loaded if missing from cache
myoungwon oh [Wed, 25 Feb 2026 12:58:53 +0000 (21:58 +0900)]
crimson/os/seastore: ensure extent is loaded if missing from cache

Signed-off-by: Myoungwon Oh <ohmyoungwon@gmail.com>
2 weeks agocrimson/os/seastore: correct UPPER_BOUND behavior to start looking up items exceeding...
myoungwon oh [Wed, 25 Feb 2026 12:41:12 +0000 (21:41 +0900)]
crimson/os/seastore: correct UPPER_BOUND behavior to start looking up items exceeding the target

Signed-off-by: Myoungwon Oh <ohmyoungwon@gmail.com>
2 weeks agocrimson/os/seastore: fix crash by checking for nullopt in search input
myoungwon oh [Tue, 24 Feb 2026 05:56:46 +0000 (14:56 +0900)]
crimson/os/seastore: fix crash by checking for nullopt in search input

Signed-off-by: Myoungwon Oh <ohmyoungwon@gmail.com>
2 weeks agocrimson/os/seastore: return correct result for non-logkey entries
myoungwon oh [Tue, 24 Feb 2026 04:03:01 +0000 (13:03 +0900)]
crimson/os/seastore: return correct result for non-logkey entries

Signed-off-by: Myoungwon Oh <ohmyoungwon@gmail.com>
2 weeks agocrimson/os/seastore: use do_with to avoid stack-use-after-return
myoungwon oh [Thu, 19 Feb 2026 05:01:22 +0000 (14:01 +0900)]
crimson/os/seastore: use do_with to avoid stack-use-after-return

Signed-off-by: Myoungwon Oh <ohmyoungwon@gmail.com>
2 weeks agoosd/PGLog: set dirty_to to max to avoid partial writes
myoungwon oh [Sat, 14 Feb 2026 08:07:39 +0000 (17:07 +0900)]
osd/PGLog: set dirty_to to max to avoid partial writes

Signed-off-by: Myoungwon Oh <ohmyoungwon@gmail.com>
2 weeks agocrimson/os/seastore: add test for log_manager
myoungwon oh [Thu, 28 Aug 2025 02:50:44 +0000 (11:50 +0900)]
crimson/os/seastore: add test for log_manager

Signed-off-by: Myoungwon Oh <ohmyoungwon@gmail.com>
2 weeks agocrimson/os/seastore: support for large kv pair in LogNode
myoungwon oh [Fri, 13 Feb 2026 02:06:02 +0000 (11:06 +0900)]
crimson/os/seastore: support for large kv pair in LogNode

Each log_key_t contains a chunk_idx field to manage values
that span multiple LogNodes when the value size exceeds the
maximum capacity of a single LogNode.
See detailed description in log_manager.h.

Signed-off-by: Myoungwon Oh <ohmyoungwon@gmail.com>
2 weeks agocrimson/os/seastore: optimize handling of batched requests
myoungwon oh [Mon, 19 Jan 2026 17:14:24 +0000 (02:14 +0900)]
crimson/os/seastore: optimize handling of batched requests

During 4KB random write workloads, SeaStore  receives
batched dup_* entries in both omap_set_keys.
This change enables efficient batch processing of these
requests to reduce overhead.

Signed-off-by: Myoungwon Oh <ohmyoungwon@gmail.com>
2 weeks agocrimson/os/seastore: introduce omap_rm_keys interface in omap_manager
myoungwon oh [Sat, 30 Aug 2025 12:18:12 +0000 (21:18 +0900)]
crimson/os/seastore: introduce omap_rm_keys interface in omap_manager

Deletion of pg_log_entry_t entries is performed by omap_rm_keys using a set.
For example, omap_rm_keys might be called with a set containing
pg_log_entry_t entries ranging from 0011.0001 to 0011.0010.
In this case, calling omap_rm_key individually for each entry is inefficient,
because each call triggers a traversal of the entire list.

To avoid this, omap_rm_keys with a set is introduced in omap_manager
to handle removal request more efficiently.

Signed-off-by: Myoungwon Oh <ohmyoungwon@gmail.com>
2 weeks agocrimson/os/seastore: adjust select_log_omap_root to return a single root
myoungwon oh [Sat, 3 Jan 2026 06:59:33 +0000 (15:59 +0900)]
crimson/os/seastore: adjust select_log_omap_root to return a single root

Signed-off-by: Myoungwon Oh <ohmyoungwon@gmail.com>
2 weeks agocrimson/os/seastore: invoke managers depending types based if branch
myoungwon oh [Fri, 2 Jan 2026 06:57:37 +0000 (15:57 +0900)]
crimson/os/seastore: invoke managers depending types based if branch

Signed-off-by: Myoungwon Oh <ohmyoungwon@gmail.com>
2 weeks agocrimson/os/seastore: remove duplicate keys for non-log entries
myoungwon oh [Fri, 13 Feb 2026 05:04:14 +0000 (14:04 +0900)]
crimson/os/seastore: remove duplicate keys for non-log entries

When writing a non-log key, remove any existing duplicate key
before inserting the new KV pair. With this change, full list
traversal is no longer required during remove_kv.

Signed-off-by: Myoungwon Oh <ohmyoungwon@gmail.com>
2 weeks agocrimson/os/seastore: make _fastinfo overwritable to minimize space overhead
myoungwon oh [Thu, 1 Jan 2026 09:23:47 +0000 (18:23 +0900)]
crimson/os/seastore: make _fastinfo overwritable to minimize space overhead

This commit forces _fastinfo to be stored at the last position of a LogNode.
By doing so, _fastinfo can be overwritten by the next pg_log_entry.

Since _fastinfo has a fixed key with varying contents and is included in
every write transaction, placing it at the tail enables efficient overwrites.

As a result, this change reduces LogNode allocation and deallocation,
thereby lowering space overhead. Moreover, garbage collection for obsolete
key-value pairs is unnecessary due to overwrite semantics.

Signed-off-by: Myoungwon Oh <ohmyoungwon@gmail.com>
2 weeks agocrimson/os/seastore: add log_manager to handle pgmeta_oid's key-value pairs separately
myoungwon oh [Thu, 28 Aug 2025 02:49:50 +0000 (11:49 +0900)]
crimson/os/seastore: add log_manager to handle pgmeta_oid's key-value pairs separately

Signed-off-by: Myoungwon Oh <ohmyoungwon@gmail.com>
2 weeks agoMerge pull request #65541 from MaxKellermann/auth_includes2
Kefu Chai [Tue, 17 Mar 2026 07:22:34 +0000 (15:22 +0800)]
Merge pull request #65541 from MaxKellermann/auth_includes2

auth: reduce header dependencies

Reviewed-by: Kefu Chai <k.chai@proxmox.com>
2 weeks agocrimson/os/seastore: remove log_root to maintain a single root for OMAP and LOG
myoungwon oh [Fri, 2 Jan 2026 05:49:16 +0000 (14:49 +0900)]
crimson/os/seastore: remove log_root to maintain a single root for OMAP and LOG

Signed-off-by: Myoungwon Oh <ohmyoungwon@gmail.com>
2 weeks agoMerge pull request #66261 from aclamk/aclamk-bs-cputrace-for-write-path
Jaya Prakash [Tue, 17 Mar 2026 07:05:02 +0000 (12:35 +0530)]
Merge pull request #66261 from aclamk/aclamk-bs-cputrace-for-write-path

os/bluestore: Cputrace measurements for core BlueStore functions

Reviewed-by: Igor Fedotov <igor.fedotov@croit.io>
Reviewed-by: Jaya Prakash <jayaprakash@ibm.com>
2 weeks agoMerge pull request #67716 from rhcs-dashboard/fix-namespace-issue
Nizamudeen A [Tue, 17 Mar 2026 06:42:11 +0000 (12:12 +0530)]
Merge pull request #67716 from rhcs-dashboard/fix-namespace-issue

mgr/dashboard: fix add host API path for namespace mapping in NVMe/TCP

Reviewed-by: Puja Shahu <pshahu@redhat.com>
Reviewed-by: Afreen Misbah <afreen@ibm.com>
2 weeks agoMerge pull request #67769 from rhcs-dashboard/fix-nvmeof-subsystem-create-firefox...
Nizamudeen A [Tue, 17 Mar 2026 06:41:48 +0000 (12:11 +0530)]
Merge pull request #67769 from rhcs-dashboard/fix-nvmeof-subsystem-create-firefox-next

mgr/dashboard: fix-nvmeof-subsystem-create-firefox-next

Reviewed-by: Puja Shahu <pshahu@redhat.com>
Reviewed-by: Afreen Misbah <afreen@ibm.com>
2 weeks agoMerge pull request #67810 from gbregman/main
Gil Bregman [Tue, 17 Mar 2026 06:41:30 +0000 (08:41 +0200)]
Merge pull request #67810 from gbregman/main

mgr/dashboard: Add "connection get_io_statistics" to NVMeoF CLI

2 weeks agoqa/tasks/backfill_toofull.py: Fix assert failures with & without compression 67829/head
Sridhar Seshasayee [Mon, 9 Mar 2026 09:31:54 +0000 (15:01 +0530)]
qa/tasks/backfill_toofull.py: Fix assert failures with & without compression

The following issues with the test are addressed:

1. The test was encountering assertion failure (assert backfillfull < 0.9) with
   compression enabled. This was because the condition was not factoring in the
   compression ratio. Without it the backfillfull ratio can easily exceed 1. By
   factoring in the compression ratio, the backfillfull ratio will be in the
   range (0 - n), where n can vary depending on the type of compression used.

2. The main contributing factor for (1) above is the amount of data written to
   the pool. The writes were time-bound earlier leading to excess data and
   eventually the assertion failure. By limiting the data written to the OSDs
   to 50% of the OSD capacity in the first phase and only 20% in the re-write
   phase, the outcome of the test is more deterministic regardless of
   compression being enabled or not.

3. A potential false cluster error is avoided by swapping the setting of
   the nearfull-ratio and backfill-ratio after the re-write phase.

4. Fix a couple of typos - s/tartget/target.

Fixes: https://tracker.ceph.com/issues/71005
Signed-off-by: Sridhar Seshasayee <sridhar.seshasayee@ibm.com>
2 weeks agoMerge pull request #67742 from VallariAg/wip-nvmeof-upgrade-fix
Vallari Agrawal [Tue, 17 Mar 2026 04:50:06 +0000 (10:20 +0530)]
Merge pull request #67742 from VallariAg/wip-nvmeof-upgrade-fix

qa: Add nvmeof upgrade from v20.2.0 and tentacle

2 weeks agomgr/dashboard: Fix loading states in storage overview card
Afreen Misbah [Mon, 16 Mar 2026 09:04:30 +0000 (14:34 +0530)]
mgr/dashboard: Fix loading states in storage overview card

Fixes https://tracker.ceph.com/issues/75321
Fixes https://tracker.ceph.com/issues/75299

- removes storage type
- stabilizes overview card for loading data
- raw capcity shown when promethues not there
- multiple refresh intervals which may vcause sync issues and bugs hence moved the logic to parent - overview component
- Now all queries are updated at 5 s interval except data consumption - using promethues interval. This needs more refactor hence would do in a later PR

Signed-off-by: Afreen Misbah <afreen@ibm.com>
2 weeks agomgr/dashboard: Add tootltip to storage overview
Afreen Misbah [Mon, 16 Mar 2026 07:53:08 +0000 (13:23 +0530)]
mgr/dashboard: Add tootltip to storage overview

Signed-off-by: Afreen Misbah <afreen@ibm.com>
2 weeks agoMerge pull request #67812 from bluikko/wip-doc-cephadm-spelling2
bluikko [Tue, 17 Mar 2026 02:53:35 +0000 (09:53 +0700)]
Merge pull request #67812 from bluikko/wip-doc-cephadm-spelling2

doc/cephadm: Fix more spelling errors

2 weeks agoMerge pull request #67783 from phlogistonjohn/jjm-bwc-configure-args
Dan Mick [Tue, 17 Mar 2026 02:18:20 +0000 (19:18 -0700)]
Merge pull request #67783 from phlogistonjohn/jjm-bwc-configure-args

script/build-with-container: add CONFIGURE_ARGS env var to configure step

2 weeks agocommon,arch,cmake: add RISC-V crc32c support 66026/head
lvshuo2016 [Wed, 22 Oct 2025 10:09:52 +0000 (18:09 +0800)]
common,arch,cmake: add RISC-V crc32c support

This adds hardware-accelerated crc32c support for the RISC-V
architecture. It includes the feature implementation, necessary
CMake configuration, and plumbing in src/arch/riscv.c to correctly
detect and select the optimized instructions.

Signed-off-by: lvshuo2016 <lv.shuo@sanechips.com.cn>
2 weeks agoqa/workunits: add crimson/ and move test_crimson_dump_metrics.sh. Edit osd-dump-metri... 67723/head
Jose J Palacios-Perez [Mon, 9 Mar 2026 16:50:15 +0000 (16:50 +0000)]
qa/workunits: add crimson/ and move test_crimson_dump_metrics.sh. Edit osd-dump-metrics.yaml accordingly

Signed-off-by: Jose J Palacios-Perez <perezjos@uk.ibm.com>
2 weeks agomgr/dashboard: Add "connection get_io_statistics" to NVMeoF CLI 67810/head
Gil Bregman [Mon, 16 Mar 2026 10:59:06 +0000 (12:59 +0200)]
mgr/dashboard: Add "connection get_io_statistics" to NVMeoF CLI

Fixes: https://tracker.ceph.com/issues/74964
Signed-off-by: Gil Bregman <gbregman@il.ibm.com>
2 weeks agoMerge pull request #67122 from Naveenaidu/wip-naveen-fix-RemoteReservationRevoked...
Radoslaw Zarzynski [Mon, 16 Mar 2026 18:50:04 +0000 (19:50 +0100)]
Merge pull request #67122 from Naveenaidu/wip-naveen-fix-RemoteReservationRevoked-race-condition

osd/PeeringState: handle race condition of RemoteReservationRevoked event for Backfilling state

Reviewed-by: Bill Scales <bill_scales@uk.ibm.com>
Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
2 weeks agoMerge pull request #67754 from idryomov/wip-kclient-features-update
Ilya Dryomov [Mon, 16 Mar 2026 18:35:23 +0000 (19:35 +0100)]
Merge pull request #67754 from idryomov/wip-kclient-features-update

include/ceph_features: note more kernel versions

Reviewed-by: Miki Patel <miki.patel132@gmail.com>
2 weeks agoMerge pull request #67244 from bill-scales/issue73891
Kamoltat (Junior) Sirivadhna [Mon, 16 Mar 2026 17:45:48 +0000 (13:45 -0400)]
Merge pull request #67244 from bill-scales/issue73891

osd: Avoid pwlc spanning intervals

Reviewed-by: Samuel Just <sjust@ibm.com>
Reviewed-by: Radoslaw Zarzynski<rzarzyns@redhat.com>
Reviewed-by: Alex Ainscow <aainscow@uk.ibm.com>
2 weeks agoMerge pull request #62227 from atta/fix-lua-mkdtemp-permissions
Yuval Lifshitz [Mon, 16 Mar 2026 16:47:03 +0000 (18:47 +0200)]
Merge pull request #62227 from atta/fix-lua-mkdtemp-permissions

rgw: fix luarocks directory permissions

2 weeks agomgr/cephadm: fix mgmt-gateway startup on IPv6 VIP 67610/head
kginon [Mon, 2 Mar 2026 14:44:05 +0000 (16:44 +0200)]
mgr/cephadm: fix mgmt-gateway startup on IPv6 VIP

Ensure mgmt-gateway service starts correctly when it is configured
to listen on an IPv6 VIP address.

Fixes: https://tracker.ceph.com/issues/75267
Signed-off-by: Kobi Ginon <kginon@redhat.com>
2 weeks agoMerge pull request #67436 from adk3798/cephadm-fix-mon-crush-loc-text
Redouane Kachach [Mon, 16 Mar 2026 16:19:04 +0000 (17:19 +0100)]
Merge pull request #67436 from adk3798/cephadm-fix-mon-crush-loc-text

qa/cephadm: fix test_set_mon_crush_locations test

Reviewed-by: Redouane Kachach <rkachach@ibm.com>
2 weeks agoMerge pull request #67776 from bluikko/wip-doc-radosgw-spelling
bluikko [Mon, 16 Mar 2026 15:43:46 +0000 (22:43 +0700)]
Merge pull request #67776 from bluikko/wip-doc-radosgw-spelling

doc/radosgw: Fix spelling errors

2 weeks agorgw/pubsub: fix uninitialized num_shards causing topic deletion hang wip-oozmen-75466 67798/head
Oguzhan Ozmen [Fri, 13 Mar 2026 21:56:18 +0000 (21:56 +0000)]
rgw/pubsub: fix uninitialized num_shards causing topic deletion hang

The num_shards member of rgw_pubsub_dest was not included in JSON
serialization (dump/decode_json), causing garbage values when topic
metadata synced between zones. This resulted in topic deletion
iterating millions of times over non-existent shards, blocking
frontend pause during realm reload for extended periods.

Fixes: https://tracker.ceph.com/issues/75466
Signed-off-by: Oguzhan Ozmen <oozmen@bloomberg.net>
2 weeks agodoc/cephadm: Fix more spelling errors 67812/head
Ville Ojamo [Mon, 16 Mar 2026 09:39:16 +0000 (16:39 +0700)]
doc/cephadm: Fix more spelling errors

And other such minor obvious issues, including a spelling error
introduced in the previous commit 2565579caa1a118e9032283b55e969f9badcd6b6

Signed-off-by: Ville Ojamo <git2233+ceph@ojamo.eu>
2 weeks agoMerge pull request #67619 from tchaikov/wip-cmake-cephfs-journal-tool-without-liburing
Kefu Chai [Mon, 16 Mar 2026 14:25:41 +0000 (22:25 +0800)]
Merge pull request #67619 from tchaikov/wip-cmake-cephfs-journal-tool-without-liburing

cmake: link uring::uring to Boost::boost and fix cephfs tools linkage

Reviewed-by: Casey Bodley <cbodley@redhat.com>
Reviewed-by: Edwin Rodriguez <edwin.rodriguez1@ibm.com>
2 weeks agoMerge pull request #67708 from rhcs-dashboard/fix-miscellaneous
Afreen Misbah [Mon, 16 Mar 2026 14:19:47 +0000 (19:49 +0530)]
Merge pull request #67708 from rhcs-dashboard/fix-miscellaneous

mgr/dashboard: show System metadata used

Reviewed-by: Afreen Misbah <afreen@ibm.com>
2 weeks agoMerge pull request #66450 from rkachach/fix_issue_74015
Redouane Kachach [Mon, 16 Mar 2026 14:14:49 +0000 (15:14 +0100)]
Merge pull request #66450 from rkachach/fix_issue_74015

mgr/cephadm: Fix mgmt-gateway default port in get_port_start()

Reviewed-by: Adam King <adking@redhat.com>
2 weeks agoMerge pull request #67530 from rkachach/fix_ipv6_with_brackets
Redouane Kachach [Mon, 16 Mar 2026 14:08:42 +0000 (15:08 +0100)]
Merge pull request #67530 from rkachach/fix_ipv6_with_brackets

cephadm: fix EndPoint to handle bracketed IPv6 addresses

Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: Guillaume Abrioux <gabrioux@ibm.com>
2 weeks agoMerge pull request #67690 from rhcs-dashboard/75322-Updating-empty-state-message...
Afreen Misbah [Mon, 16 Mar 2026 13:53:32 +0000 (19:23 +0530)]
Merge pull request #67690 from rhcs-dashboard/75322-Updating-empty-state-message-on-performance-chart

mgr/dashboard: Fixing message when prometheus is disabled in performa…

Reviewed-by: Afreen Misbah <afreen@ibm.com>
2 weeks agodoc/radosgw: Fix spelling errors 67776/head
Ville Ojamo [Fri, 13 Mar 2026 08:48:01 +0000 (15:48 +0700)]
doc/radosgw: Fix spelling errors

Signed-off-by: Ville Ojamo <git2233+ceph@ojamo.eu>
2 weeks agoMerge pull request #67421 from ifed01/wip-ifed-raise-min-comp-size-hdd
Igor Fedotov [Mon, 16 Mar 2026 11:35:11 +0000 (14:35 +0300)]
Merge pull request #67421 from ifed01/wip-ifed-raise-min-comp-size-hdd

options/global: raise bluestore_compression_min_blob_size_hdd to 64K

Reviewed-by: Adam Kupczyk <akupczyk@ibm.com>
2 weeks agoMerge pull request #67695 from rkachach/fix_issue_74860
Redouane Kachach [Mon, 16 Mar 2026 10:48:41 +0000 (11:48 +0100)]
Merge pull request #67695 from rkachach/fix_issue_74860

qa: fixing cephadm mgmt-gateway test to remove openssl dependency

Reviewed-by: Shweta Bhosale <Shweta.Bhosale1@ibm.com>
2 weeks agoMerge pull request #67433 from ifed01/wip-ifed-fix-min-blob-size-tracking
Igor Fedotov [Mon, 16 Mar 2026 10:20:26 +0000 (13:20 +0300)]
Merge pull request #67433 from ifed01/wip-ifed-fix-min-blob-size-tracking

os/bluestore: track compression_*blob_size* parameters for online update

Signed-off-by: Adam Kupczyk <akupczyk@ibm.com>
2 weeks agomgr/cephadm: Fix mgmt-gateway default port in get_port_start() 66450/head
Redouane Kachach [Fri, 28 Nov 2025 08:38:45 +0000 (09:38 +0100)]
mgr/cephadm: Fix mgmt-gateway default port in get_port_start()

The mgmt-gateway port was already defaulted to 443 in most places, but
get_port_start() did not apply this default. Since the output of
get_port_start() is used both to configure the daemon ports which are
later used to to open them in firewalld, this inconsistency meant the
HTTPS port was not opened when firewalld service was active.

This change makes get_port_start() also default to port 443, ensuring
the daemon is configured correctly and the corresponding firewalld port
is opened as expected.

Fixes: https://tracker.ceph.com/issues/74015
Signed-off-by: Redouane Kachach <rkachach@ibm.com>
2 weeks agomgr/dashboard: fix-add-node-issue 67809/head
Sagar Gopale [Mon, 16 Mar 2026 09:44:21 +0000 (15:14 +0530)]
mgr/dashboard: fix-add-node-issue

Fixes:https://tracker.ceph.com/issues/75500

Signed-off-by: Sagar Gopale <sagar.gopale@ibm.com>
2 weeks agomgr/dashboard: show miscellaneous data used 67708/head
Aashish Sharma [Mon, 9 Mar 2026 05:37:45 +0000 (11:07 +0530)]
mgr/dashboard: show miscellaneous data used

Fixes: https://tracker.ceph.com/issues/75320
Signed-off-by: Aashish Sharma <aasharma@redhat.com>
2 weeks agoMerge pull request #64975 from liu-chunmei/shard_osd_final
Matan Breizman [Mon, 16 Mar 2026 09:27:40 +0000 (11:27 +0200)]
Merge pull request #64975 from liu-chunmei/shard_osd_final

osd shard changes upon restart

Reviewed-by: Samuel Just <sjust@redhat.com>
2 weeks agoMerge pull request #65912 from MattyWilliams22/wip-dont-delete-objects
SrinivasaBharathKanta [Mon, 16 Mar 2026 09:26:03 +0000 (14:56 +0530)]
Merge pull request #65912 from MattyWilliams22/wip-dont-delete-objects

test: Add option to not delete created objects

2 weeks agoqa: Add nvmeof upgrade from v20.2.0 67742/head
Vallari Agrawal [Wed, 11 Mar 2026 11:58:22 +0000 (17:28 +0530)]
qa: Add nvmeof upgrade from v20.2.0

Also update install task for nvmeof/upgrade test

Fixes: https://tracker.ceph.com/issues/75453
Signed-off-by: Vallari Agrawal <vallari.agrawal@ibm.com>
2 weeks agomgr/dashboard: Fixing message when prometheus is disabled in performance charts 67690/head
Devika Babrekar [Fri, 6 Mar 2026 07:58:45 +0000 (13:28 +0530)]
mgr/dashboard: Fixing message when prometheus is disabled in performance charts
Fixes: https://tracker.ceph.com/issues/75322
Signed-off-by: Devika Babrekar <devika.babrekar@ibm.com>
2 weeks agoMerge pull request #67734 from rhcs-dashboard/fix-namespace-size
Nizamudeen A [Mon, 16 Mar 2026 05:39:42 +0000 (11:09 +0530)]
Merge pull request #67734 from rhcs-dashboard/fix-namespace-size

mgr/dashboard: Namespace size should accept GiB in NVMe/TCP

Reviewed-by: Puja Shahu <pshahu@redhat.com>
Reviewed-by: Afreen Misbah <afreen@ibm.com>
2 weeks agoqa/cephfs: do not validate error string in "fs authorize" tests 67406/head
Venky Shankar [Mon, 16 Mar 2026 04:48:22 +0000 (10:18 +0530)]
qa/cephfs: do not validate error string in "fs authorize" tests

Error string validation is prone to failures when error string
changes. errno (retval) validation suffices for tests.

Signed-off-by: Venky Shankar <vshankar@redhat.com>
2 weeks agomon/AuthMonitor: add osd w cap for superuser client
Patrick Donnelly [Wed, 18 Feb 2026 20:27:30 +0000 (15:27 -0500)]
mon/AuthMonitor: add osd w cap for superuser client

Right now only a client with "rw" permissions on an MDS gets "rw" on an
OSD.

[@vshankar: fixed malformed OSD cap when authorizing multiple paths]

Reported-by: John Mulligan <jmulligan@redhat.com>
Fixes: https://tracker.ceph.com/issues/75013
Signed-off-by: Patrick Donnelly <pdonnell@ibm.com>
Signed-off-by: Venky Shankar <vshankar@redhat.com>
2 weeks agoMerge pull request #67782 from rkachach/fix_issue_75492
Redouane Kachach [Sat, 14 Mar 2026 09:46:02 +0000 (10:46 +0100)]
Merge pull request #67782 from rkachach/fix_issue_75492

mgr/nvmeof: Adding missing CLICommand field to nvmeof mgr module

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: John Mulligan <jmulligan@redhat.com>
Reviewed-by: Adam King <adking@redhat.com>
2 weeks agoqa: ignore NVMEOF_GATEWAY_DOWN in nvmeof_scalability.yaml 67804/head
Vallari Agrawal [Fri, 13 Mar 2026 08:47:46 +0000 (14:17 +0530)]
qa: ignore NVMEOF_GATEWAY_DOWN in nvmeof_scalability.yaml

Sometimes during scale-up/scale-down, a gateway goes in
UNAVAILABLE state (which triggers NVMEOF_GATEWAY_DOWN warning)
for a couple of seconds and self-recovers.
In this, none of the scale test asserts fail.

So NVMEOF_GATEWAY_DOWN can be ignorelist, because scale test asserts
on expected gw count and checks if all expected gws are AVAILABLE
between each iteration of scale-up/scale-down.

Fixes: https://tracker.ceph.com/issues/75179
Signed-off-by: Vallari Agrawal <vallari.agrawal@ibm.com>
2 weeks agoqa/tasks/nvmeof.py: retry do_check if gw in CREATED
Vallari Agrawal [Fri, 13 Mar 2026 08:32:06 +0000 (14:02 +0530)]
qa/tasks/nvmeof.py: retry do_check if gw in CREATED

In do_check(), ensure all the namespaces+listeners are
added in gateway (i.e. gateway not in CREATED state)
after gateway is restarted. This is to prevent going into
next iteration of tharshing while gateways are still being
updated.

Fixes: https://tracker.ceph.com/issues/75382
Signed-off-by: Vallari Agrawal <vallari.agrawal@ibm.com>
2 weeks agorgw/test/multisite: revise test_period_update_commit zone selection for clarity
Oguzhan Ozmen [Fri, 13 Mar 2026 22:35:19 +0000 (22:35 +0000)]
rgw/test/multisite: revise test_period_update_commit zone selection for clarity

Signed-off-by: Oguzhan Ozmen <oozmen@bloomberg.net>
2 weeks agorgw/test/multisite: revise test_period_update_commit testcase client wkld settings
Oguzhan Ozmen [Fri, 13 Mar 2026 22:33:45 +0000 (22:33 +0000)]
rgw/test/multisite: revise test_period_update_commit testcase client wkld settings

- set wkld concurrency level to default urllib pool size

    Set wkld_concurrency to 10 which is the default urllib pool size
    to avoid the event:

    WARNING:urllib3.connectionpool:Connection pool is full,
    discarding connection: ... Connection pool size: 10

- make the client wkld less aggresive

Signed-off-by: Oguzhan Ozmen <oozmen@bloomberg.net>
2 weeks agorgw/test/multisite: get_oldest_incremental_change_not_applied_epoch - handle sync...
Oguzhan Ozmen [Fri, 13 Mar 2026 22:30:50 +0000 (22:30 +0000)]
rgw/test/multisite: get_oldest_incremental_change_not_applied_epoch - handle sync-status failure gracefully

Signed-off-by: Oguzhan Ozmen <oozmen@bloomberg.net>
2 weeks agorgw/test/multisite: run sync status on the intended zone
Oguzhan Ozmen [Fri, 13 Mar 2026 22:30:11 +0000 (22:30 +0000)]
rgw/test/multisite: run sync status on the intended zone

Signed-off-by: Oguzhan Ozmen <oozmen@bloomberg.net>
2 weeks agorgw/test/multisite: test_period_update_commit use a custom retry setting
Oguzhan Ozmen [Fri, 13 Mar 2026 22:36:22 +0000 (22:36 +0000)]
rgw/test/multisite: test_period_update_commit use a custom retry setting

Signed-off-by: Oguzhan Ozmen <oozmen@bloomberg.net>
2 weeks agorgw/test/multisite: use config's retry settings
Oguzhan Ozmen [Fri, 13 Mar 2026 22:31:40 +0000 (22:31 +0000)]
rgw/test/multisite: use config's retry settings

Signed-off-by: Oguzhan Ozmen <oozmen@bloomberg.net>
2 weeks agorgw/test/multisite: add a mechanism to use custom config temporarily
Oguzhan Ozmen [Fri, 13 Mar 2026 22:29:05 +0000 (22:29 +0000)]
rgw/test/multisite: add a mechanism to use custom config temporarily

Signed-off-by: Oguzhan Ozmen <oozmen@bloomberg.net>
2 weeks agoMerge pull request #66580 from kamoltat/wip-ksirivad-fix-72994
Kamoltat (Junior) Sirivadhna [Fri, 13 Mar 2026 21:34:34 +0000 (17:34 -0400)]
Merge pull request #66580 from kamoltat/wip-ksirivad-fix-72994

mon [stretch-mode]: Allow a max bucket weight diff threshold
Reviewed-by: Ronen Friedman <rfriedma@ibm.com>
2 weeks agoMerge PR #67780 into main
Patrick Donnelly [Fri, 13 Mar 2026 19:54:51 +0000 (01:24 +0530)]
Merge PR #67780 into main

* refs/pull/67780/head:
Revert "Merge PR #67630 into main"

Reviewed-by: Shraddha Agrawal <shraddhaag@ibm.com>
2 weeks agomgr/nvmeof: Adding missing CLICommand file to nvmeof mgr module 67782/head
Redouane Kachach [Fri, 13 Mar 2026 15:40:01 +0000 (16:40 +0100)]
mgr/nvmeof: Adding missing CLICommand file to nvmeof mgr module

Fixes: https://tracker.ceph.com/issues/75492
Signed-off-by: Redouane Kachach <rkachach@ibm.com>
2 weeks agoos/bluestore: add health warning for oversized BlueFS usage 67192/head
Indira Sawant [Tue, 11 Nov 2025 17:51:43 +0000 (11:51 -0600)]
os/bluestore: add health warning for oversized BlueFS usage

Add a BLUESTORE_BLUEFS_OVERSIZED health warning when total BlueFS usage
(DB, WAL, and spillover on the slow device) exceeds a configurable ratio
of the main device size.

The threshold is controlled by the new configuration option
`bluestore_bluefs_warn_ratio` (default 0.06).

Fixes: https://tracker.ceph.com/issues/73826
Signed-off-by: Indira Sawant <indira.sawant@ibm.com>
2 weeks agoscript/build-with-container: add CONFIGURE_ARGS env var to configure step 67783/head
John Mulligan [Fri, 13 Mar 2026 17:42:09 +0000 (13:42 -0400)]
script/build-with-container: add CONFIGURE_ARGS env var to configure step

Add a new optional CONFIGURE_ARGS environment variable to the configure
step so that there's a mechanism to pass custom cmake options that
aren't handled elsewhere in the run-make.sh script.

Because configure is a rather fundamental build step it's probably
preferable to set this via an env file so that it persists across
rebuilds. Using an environment var here also avoids both needing to
change run-make.sh or add another CLI option to BWC which already has
too many.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
2 weeks agoRevert "Merge PR #67630 into main" 67780/head
Patrick Donnelly [Fri, 13 Mar 2026 14:18:06 +0000 (19:48 +0530)]
Revert "Merge PR #67630 into main"

This reverts commit 3a5e4524aa56de4c26400ccf994baa6ba8e16d9e, reversing
changes made to d334ff531c563bb7d0e37777f606322ec91b7453.

To everyone's surprise, skipping a workflow does not make it less
required. Well done Github!

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2 weeks agoMerge pull request #67275 from ifed01/wip-ifed-fix-bluefs-expand-test
Igor Fedotov [Fri, 13 Mar 2026 14:05:44 +0000 (17:05 +0300)]
Merge pull request #67275 from ifed01/wip-ifed-fix-bluefs-expand-test

qa/standalone: fix/improve bluefs tests

Reviewed-by: Adam Kupczyk <akupczyk@ibm.com>
2 weeks agoMerge pull request #67609 from ifed01/wip-ifed-bluefs-stats-reset
Igor Fedotov [Fri, 13 Mar 2026 14:01:39 +0000 (17:01 +0300)]
Merge pull request #67609 from ifed01/wip-ifed-bluefs-stats-reset

os/bluestore: add 'bluefs stats reset' admin socket command.

Reviewed-by: Adam Kupczyk <akupczyk@ibm.com>
2 weeks agoMerge pull request #67770 from bluikko/wip-doc-cephadm-spelling
bluikko [Fri, 13 Mar 2026 12:40:57 +0000 (19:40 +0700)]
Merge pull request #67770 from bluikko/wip-doc-cephadm-spelling

doc/cephadm: Fix spelling errors

2 weeks agoMerge pull request #67718 from rhcs-dashboard/fix-subsystem-create-layout-issue
Afreen Misbah [Fri, 13 Mar 2026 09:52:18 +0000 (15:22 +0530)]
Merge pull request #67718 from rhcs-dashboard/fix-subsystem-create-layout-issue

mgr/dashboard: Footer actions shift upward instead of staying pinned at modal bottom in NVMe/TCP subsystem create wizard

Reviewed-by: Afreen Misbah <afreen@ibm.com>
Reviewed-by: Devika Babrekar <devika.babrekar@ibm.com>
2 weeks agoqa/tasks/nvmeof.py: Fix tharsher daemon_rm revival
Vallari Agrawal [Fri, 13 Mar 2026 08:24:31 +0000 (13:54 +0530)]
qa/tasks/nvmeof.py: Fix tharsher daemon_rm revival

Instead of "ceph orch daemon restart",
wait for daemon to come backup on it's own
during revival.
Also improve do_check retry logic.
And some logging improvements in nvmeof.thrasher task.

Fixes: https://tracker.ceph.com/issues/75383
Signed-off-by: Vallari Agrawal <vallari.agrawal@ibm.com>