Kotresh HR [Sun, 15 Feb 2026 18:41:51 +0000 (00:11 +0530)]
tools/cephfs_mirror: Fix lock order issue
Lock order 1:
InstanceWatcher::m_lock ----> FSMirror::m_lock
Lock order 2:
FSMirror::m_lock -----> InstanceWatcher::m_lock
Lock order 1 is the one on which it aborted, and it happens
during blocklisting. InstanceWatcher::handle_rewatch_complete()
acquires InstanceWatcher::m_lock and calls
m_elistener.set_blocklisted_ts(), which tries to acquire
FSMirror::m_lock.
Lock order 2 exists in the mirror peer status command.
FSMirror::mirror_status(Formatter *f) takes FSMirror::m_lock
and calls is_blocklisted(), which takes InstanceWatcher::m_lock.
Fix:
FSMirror::m_blocklisted_ts and FSMirror::m_failed_ts are converted
to std::atomic, and the scope of m_lock is fixed in
InstanceWatcher::handle_rewatch_complete() and
MirrorWatcher::handle_rewatch_complete().
Look at the tracker for traceback and further details.
Kotresh HR [Sun, 15 Feb 2026 19:38:00 +0000 (01:08 +0530)]
tools/cephfs_mirror: Do remote fs sync once instead of fsync on each fd
Doing a remote fs sync once, just before taking the snapshot, is
faster than doing fsync on each fd. Moreover, all the datasync
threads use the same single libcephfs connection, and doing
ceph_fsync concurrently on different fds over a single libcephfs
connection could cause a hang, as observed in testing below.
-----
Thread 2 (Thread 0xffff644cc400 (LWP 74020) "d_replayer-0"):
0 0x0000ffff8e82656c in __futex_abstimed_wait_cancelable64 () from /lib64/libc.so.6
1 0x0000ffff8e828ff0 [PAC] in pthread_cond_wait@@GLIBC_2.17 () from /lib64/libc.so.6
2 0x0000ffff8fc90fd4 [PAC] in ceph::condition_variable_debug::wait ...
3 0x0000ffff9080fc9c in ceph::condition_variable_debug::wait<Client::wait_on_context_list ...
4 Client::wait_on_context_list ... at /lsandbox/upstream/ceph/src/client/Client.cc:4540
5 0x0000ffff9083fae8 in Client::_fsync ... at /lsandbox/upstream/ceph/src/client/Client.cc:13299
6 0x0000ffff90840278 in Client::_fsync ...
7 0x0000ffff90840514 in Client::fsync ... at /lsandbox/upstream/ceph/src/client/Client.cc:13042
8 0x0000ffff907f06e0 in ceph_fsync ... at /lsandbox/upstream/ceph/src/libcephfs.cc:316
9 0x0000aaaaad5b2f88 in cephfs::mirror::PeerReplayer::copy_to_remote ...
----
Kotresh HR [Sun, 15 Feb 2026 09:19:59 +0000 (14:49 +0530)]
tools/cephfs_mirror: Handle shutdown/blocklist/cancel at syncm dataq wait
1. Add is_stopping() predicate at sdq_cv wait
2. Use the existing should_backoff() routine to check for
shutdown/blocklist/cancel and set the corresponding errors.
3. Handle notify logic at the end
4. In shutdown(), notify all syncm's sdq_cv wait
Kotresh HR [Sun, 15 Feb 2026 08:30:07 +0000 (14:00 +0530)]
tools/cephfs_mirror: Handle shutdown/blocklist at syncm_q wait
1. Convert the smq_cv.wait to a timed wait, since blocklisting has
   no predicate to evaluate; is_shutdown() is still evaluated as a
   predicate. When either of the two is true, set the corresponding
   error and backoff flag in all the syncm objects. The last data
   sync thread wakes up all the crawler threads. This is necessary
   to wake up the crawler threads whose data queue was not picked
   up by any datasync thread.
2. In shutdown(), change the join order: join the datasync threads
   first. The idea is to kill the datasync threads before the
   crawler threads, since the datasync threads are an extension of
   the crawler threads and the reverse order might cause issues.
   Also wake up the smq_cv wait on shutdown.
Kotresh HR [Sun, 15 Feb 2026 06:39:45 +0000 (12:09 +0530)]
tools/cephfs_mirror: Monitor num of active datasync threads
Introduce an atomic counter in PeerReplayer to track the number of
active SnapshotDataSyncThread instances.
The counter is incremented when a datasync thread enters its entry()
function and decremented automatically on exit via a small RAII guard
(DataSyncThreadGuard). This ensures accurate accounting even in the
presence of early returns or future refactoring.
This change helps in handling of shutdown and blocklist scenarios.
At the time of shutdown or blocklisting, datasync threads may still
be processing multiple jobs across different SyncMechanism instances.
It is therefore essential that only the final exiting datasync thread
performs the notifications for all relevant waiters, including the
syncm data queue, syncm queue, and m_cond.
This approach ensures orderly teardown by keeping crawler threads
active until all datasync threads have completed execution.
Terminating crawler threads prematurely—before datasync threads have
exited—can lead to inconsistencies, as crawler threads deregister the
mirroring directory while datasync threads may still be accessing it.
Kotresh HR [Sun, 15 Feb 2026 04:00:25 +0000 (09:30 +0530)]
tools/cephfs_mirror: Store a reference of PeerReplayer object in SyncMechanism
Store a reference to the PeerReplayer object in SyncMechanism.
This allows the SyncMechanism object to call functions of
PeerReplayer. This is required in multiple places, e.g. handling
shutdown/blocklist/cancel, where should_backoff() needs to be
called by the syncm object while data sync threads pop the dataq.
Kotresh HR [Sun, 15 Feb 2026 03:09:54 +0000 (08:39 +0530)]
tools/cephfs_mirror: Make PeerReplayer::m_stopping atomic
Make PeerReplayer::m_stopping a std::atomic and make it
independent of m_lock. This allows 'm_stopping' to be used
as a predicate in any conditional wait that doesn't use
m_lock.
Kotresh HR [Wed, 14 Jan 2026 20:06:31 +0000 (01:36 +0530)]
tools/cephfs_mirror: Fix assert while opening handles
When the crawler or a datasync thread encounters an error,
it's possible that the crawler gets notified by a datasync
thread and bails out, resulting in the unregistration of the
particular dir_root. The other datasync threads might still
hold the same syncm object and try to open the handles,
during which the following assert is hit.
ceph_assert(it != m_registered.end());
The above assert is removed and the error is handled instead.
Kotresh HR [Wed, 14 Jan 2026 19:59:36 +0000 (01:29 +0530)]
tools/cephfs_mirror: Fix dequeue of syncm on error
When an error is encountered in a crawler thread or a datasync
thread while processing a syncm object, it's possible that
multiple datasync threads attempt to dequeue the syncm object.
Though this is safe, add a condition to avoid it.
Kotresh HR [Wed, 14 Jan 2026 19:53:34 +0000 (01:23 +0530)]
tools/cephfs_mirror: Handle errors in crawler thread
Any error encountered in a crawler thread should be
communicated to the data sync threads by marking the
crawl error in the corresponding syncm object. The
data sync threads then finish the pending jobs, dequeue
the syncm object, and notify the crawler to bail out.
Kotresh HR [Wed, 14 Jan 2026 19:35:29 +0000 (01:05 +0530)]
tools/cephfs_mirror: Handle error in datasync thread
On any error encountered in a datasync thread while syncing
a particular syncm dataq, mark the datasync error and
communicate it to the corresponding syncm's crawler, which
is waiting to take a snapshot. The crawler will log the
error and bail out.
There is a global queue of SyncMechanism objects (syncm). Each syncm
object represents a single snapshot being synced, and each syncm
object owns m_sync_dataq, the list of files in the snapshot to be
synced.
The data sync threads should consume the next syncm job if the
present syncm has no pending work. This can evidently happen when
the last file being synced from the present syncm job's syncm_dataq
is a large file. In that case, one data sync thread is busy syncing
the large file while the rest of the data sync threads just wait for
it to finish to avoid a busy loop. Instead, the idle data sync
threads could start consuming the next syncm job.
This brings in a change to the data structure:
- syncm_q has to be a std::deque instead of a std::queue, since a
  syncm in the middle can finish syncing first and needs to be
  removed before the front.
Kotresh HR [Wed, 14 Jan 2026 12:30:43 +0000 (18:00 +0530)]
tools/cephfs_mirror: Synchronize taking snapshot
The crawler/entry creation thread needs to wait until
all the data is synced by the datasync threads before
taking the snapshot. This patch adds the necessary
conditions for the same.
It is important that the conditional flag be part
of SyncMechanism and not of the PeerReplayer class.
The following bug would be hit if it were part of the
PeerReplayer class.
When multiple directories are configured for mirroring, as below:
/d0              /d1              /d2
Crawler1         Crawler2         Crawler3
DoneEntryOps     DoneEntryOps     DoneEntryOps
WaitForSafeSnap  WaitForSafeSnap  WaitForSafeSnap
When all crawler threads are waiting as above, the data sync threads
that are done processing /d1 would notify, waking up all the crawlers
and causing spurious/unwanted wakeups and half-baked snapshots.
Kotresh HR [Wed, 14 Jan 2026 12:05:33 +0000 (17:35 +0530)]
tools/cephfs_mirror: Fix data sync threads completion logic
We need to know exactly when all data sync threads complete
the processing of a syncm. If a few threads finish the
job, they need to wait for the in-progress threads
of that syncm to complete; otherwise the finished threads
would busy-loop until the in-progress threads finish.
Only after all threads finish processing can the crawler
thread be notified to take the snapshot.
Kotresh HR [Tue, 9 Dec 2025 10:05:08 +0000 (15:35 +0530)]
tools/cephfs_mirror: Mark crawl finished
After entry operations are synced and the stack is empty,
mark the crawl as finished so the data sync threads'
wait logic works correctly and doesn't wait indefinitely.
Kotresh HR [Wed, 14 Jan 2026 08:47:07 +0000 (14:17 +0530)]
tools/cephfs_mirror: Add SyncMechanism Queue
Add a queue of shared_ptr of type SyncMechanism.
Since it holds shared_ptr, the queue can hold both
RemoteSync and SnapDiffSync objects.
Each SyncMechanism holds the queue of SyncEntry
items to be synced by the data sync threads.
The queue needs to hold shared_ptr because all
the data sync threads need to access the
SyncMechanism object to process the SyncEntry queue.
This patch sets up the building blocks for the same.
Kotresh HR [Wed, 14 Jan 2026 08:27:34 +0000 (13:57 +0530)]
tools/cephfs_mirror: Use the existing m_lock and m_cond
The entire snapshot is synced outside the lock.
The m_lock and m_cond pair is used by the data sync
threads along with the crawler threads to work well
with all terminal conditions, like shutdown, and the
existing data structures.
Kotresh HR [Sat, 7 Feb 2026 14:26:36 +0000 (19:56 +0530)]
qa: Add retry logic to remove most sleeps in mirroring tests
The mirroring tests contain a lot of sleeps, adding up to ~1hr.
This patch adds retry logic and removes most of them. This
is cleaner and saves considerable test time for mirroring.
Patrick Donnelly [Wed, 11 Feb 2026 19:05:07 +0000 (14:05 -0500)]
Merge PR #67011 into main
* refs/pull/67011/head:
qa/multisite: use boto3's ClientError in place of assert_raises from tools.py.
qa/multisite: test fixes
qa/multisite: boto3 in tests.py
qa/multisite: zone files use boto3 resource api
qa/multisite: switch to boto3 in multisite test libraries
Xavi Hernandez [Wed, 5 Nov 2025 09:05:45 +0000 (10:05 +0100)]
libcephfs_proxy: add the number of supported operations to negotiation
The v0 negotiation structure has been modified to hold the total number
of operations and callbacks supported by the peer. The changes are done
in a way that is completely transparent and harmless to a peer expecting
the previous definition.
This will be useful to quickly check whether the daemon supports some
operation, or whether the client supports some callback, before sending
them.