git-server-git.apps.pok.os.sepia.ceph.com Git - ceph.git/log
ceph.git
12 days agosrc/test/crimson/CMakeLists: include dmclock 62002/head
Mohit Agrawal [Thu, 6 Nov 2025 12:52:23 +0000 (18:22 +0530)]
src/test/crimson/CMakeLists: include dmclock

Signed-off-by: Mohit Agrawal <moagrawa@redhat.com>
12 days agocrimson/CMakeLists.txt: include dmclock
Mohit Agrawal [Mon, 28 Jul 2025 14:44:47 +0000 (20:14 +0530)]
crimson/CMakeLists.txt: include dmclock

Signed-off-by: Mohit Agrawal <moagrawa@redhat.com>
12 days agocrimson/mclock_scheduler: Support mclock for crimson
Mohit Agrawal [Mon, 28 Jul 2025 13:25:41 +0000 (18:55 +0530)]
crimson/mclock_scheduler: Support mclock for crimson

This patch brings crimson's mclock source in line with that of the
classic OSDs. Currently crimson uses the feature only for
background recovery operations, but later it will be used for other
OSD operations as well. To use it, the user needs to configure the
crimson_osd_scheduler_concurrency parameter for the OSD.

Replace item_t with WorkItem variant to maintain similarity
with classic OSD.

Introduce cost and priority as part of item_t.

Fixes: https://tracker.ceph.com/issues/67367
Signed-off-by: Mohit Agrawal <moagrawa@redhat.com>
12 days agocrimson/background_recovery: switch to unified SchedulerClass and introduce get_avera...
Mohit Agrawal [Mon, 28 Jul 2025 13:19:14 +0000 (18:49 +0530)]
crimson/background_recovery: switch to unified SchedulerClass and introduce get_average_object_size for pg

1)  Replace usage of crimson::osd::scheduler::scheduler_class_t
    with unified SchedulerClass
2) Add priority to scheduler params structure
3) Introduce get_average_object_size for pg

Signed-off-by: Mohit Agrawal <moagrawa@redhat.com>
12 days agocrimson/osd: Refactor crimson scheduler wrapper
Mohit Agrawal [Mon, 28 Jul 2025 13:03:38 +0000 (18:33 +0530)]
crimson/osd: Refactor crimson scheduler wrapper

Signed-off-by: Mohit Agrawal <moagrawa@redhat.com>
12 days agocrimson/osd,osd_operation: initialize mClock scheduler, detect rotational devices...
Mohit Agrawal [Mon, 28 Jul 2025 12:48:57 +0000 (18:18 +0530)]
crimson/osd,osd_operation: initialize mClock scheduler, detect rotational devices, and run OperationThrottler background task

Initialize the mClock scheduler on all shards when the device class
is non-rotational. If the device is rotational, throw an exception
to prevent unsupported configurations.

In addition, introduce a background task in OperationThrottler that
continuously dequeues and schedules client requests from the mClock
scheduler based on available credits and throttling limits.
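
The behaviour described above can be sketched roughly as follows. This is
a toy Python model, not the crimson code: ToyThrottler, max_in_flight and
RotationalDeviceError are hypothetical names, and a plain FIFO stands in
for the mClock queue, which would order requests by its own tags.

```python
from collections import deque


class RotationalDeviceError(RuntimeError):
    """Raised when the scheduler is started on a rotational device."""


class ToyThrottler:
    """Hypothetical stand-in for a throttler that drains a scheduler queue."""

    def __init__(self, rotational, max_in_flight):
        if rotational:
            # Mirrors the commit message: refuse unsupported configurations.
            raise RotationalDeviceError("scheduler needs a non-rotational device")
        self.max_in_flight = max_in_flight  # assumed throttling limit
        self.in_flight = 0
        self.queue = deque()  # FIFO placeholder for the real mClock queue

    def enqueue(self, op):
        self.queue.append(op)

    def dispatch_ready(self):
        """One pass of the background task: start ops while credits remain."""
        started = []
        while self.queue and self.in_flight < self.max_in_flight:
            started.append(self.queue.popleft())
            self.in_flight += 1
        return started

    def complete(self):
        """Return one credit when an op finishes."""
        self.in_flight -= 1
```

A background task would call dispatch_ready() repeatedly; each completed
op calls complete() so that the next pass can start more work.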

Signed-off-by: Mohit Agrawal <moagrawa@redhat.com>
12 days agocommon/mclock_common: Declared MonClient for crimson in mclock_common
Mohit Agrawal [Mon, 28 Jul 2025 12:08:07 +0000 (17:38 +0530)]
common/mclock_common: Declared MonClient for crimson in mclock_common

Signed-off-by: Mohit Agrawal <moagrawa@redhat.com>
12 days agoMerge pull request #66453 from aainscow/lrc_fix
NitzanMordhai [Wed, 3 Dec 2025 09:55:59 +0000 (11:55 +0200)]
Merge pull request #66453 from aainscow/lrc_fix

osd: Perform shard look up correctly in partial EC writes

12 days agoMerge pull request #66384 from bill-scales/issue72879
NitzanMordhai [Wed, 3 Dec 2025 09:54:19 +0000 (11:54 +0200)]
Merge pull request #66384 from bill-scales/issue72879

Fix teuthology timeout issues with bluestore software compression and improve thread heartbeat timeout code

12 days agoMerge PR #66093 into main
Venky Shankar [Wed, 3 Dec 2025 06:40:18 +0000 (12:10 +0530)]
Merge PR #66093 into main

* refs/pull/66093/head:

Reviewed-by: John Mulligan <jmulligan@redhat.com>
Reviewed-by: Anoop C S <anoopcs@cryptolab.net>
Reviewed-by: Shwetha Acharya <sacharya@redhat.com>
2 weeks agoosd: Perform shard look up correctly in partial EC writes 66453/head
Alex Ainscow [Fri, 28 Nov 2025 14:33:13 +0000 (14:33 +0000)]
osd: Perform shard look up correctly in partial EC writes

Plugins are permitted to provide a mapping to change the order in which OSDs
are used. In practice only LRC does this and it is not currently enabled
with optimisations, so this is a theoretical bug.

The bug here was that the "first" shard was assumed to be shard_id_t(0).  However,
this is not true for LRC.
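
A minimal sketch of the idea, assuming a plugin exposes its ordering as a
list in which entry i names the shard used for position i (an empty list
meaning "no remapping"). The function name and the sample mapping are
hypothetical and are not taken from the LRC plugin.

```python
def first_shard(chunk_mapping):
    """Return the shard that comes first once the plugin mapping is applied.

    chunk_mapping[i] is assumed to be the shard id used for position i.
    An empty mapping means the identity order, so the first shard is 0.
    """
    if not chunk_mapping:
        return 0
    return chunk_mapping[0]


# Without a mapping, assuming shard 0 happens to be correct.
identity_first = first_shard([])
# With a remapping plugin, the first shard must be looked up instead.
remapped_first = first_shard([1, 2, 0, 3])
```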

Fixes: https://tracker.ceph.com/issues/74016
Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
2 weeks agoMerge pull request #65771 from aainscow/ec_direct_reads_pr_1
Alex Ainscow [Thu, 27 Nov 2025 23:17:37 +0000 (23:17 +0000)]
Merge pull request #65771 from aainscow/ec_direct_reads_pr_1

EC Direct Reads: First PR, background work

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
Reviewed-by: Bill Scales <bill_scales@uk.ibm.com>
2 weeks agoMerge pull request #66377 from baum/rbd_aio_write_with_crc32c_initial_fix
Ilya Dryomov [Thu, 27 Nov 2025 22:58:38 +0000 (23:58 +0100)]
Merge pull request #66377 from baum/rbd_aio_write_with_crc32c_initial_fix

librbd: rbd_aio_write_with_crc32c store CRC32C with initial value -1 to match msgr2 validation

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
2 weeks agoqa: Reduce number of osd threads when using compression 66384/head
Bill Scales [Fri, 21 Nov 2025 10:06:22 +0000 (10:06 +0000)]
qa: Reduce number of osd threads when using compression

Smithi nodes used by teuthology tests have 8 CPU cores and typically run
4 OSD processes. When bluestore software compression is enabled the size
of the OSD thread pool needs to be reduced to 2 threads per OSD because
these threads can easily use 100% of a core. This avoids excessive
amounts of context switches, which leads to OSD threads timing out,
which causes the OSD to drop heartbeat pings and for the monitor to
temporarily mark it down. In extreme cases this can lead to PGs getting
stuck in repeated loops of peering until the teuthology test times out.

Context switches happen opportunistically at the end of system calls
so functions with lots of logging are some of the worst affected.

Fixes: https://tracker.ceph.com/issues/72879
Signed-off-by: Bill Scales <bill_scales@uk.ibm.com>
2 weeks agoosd: Restrict logging in MissingLoc::add_source_info
Bill Scales [Fri, 21 Nov 2025 10:38:44 +0000 (10:38 +0000)]
osd: Restrict logging in MissingLoc::add_source_info

add_source_info can generate an excessive amount of logging
if a PG has thousands of missing objects. When a system is
under load and threads are repeatedly context switching this
can lead to timeouts (tests showed this function taking up
to 10 seconds to execute with 99% of that time being in
logging calls where the thread was being pre-empted).
Stopping logging after the function has been running for
more than 0.5 seconds strikes a balance between providing
sufficient information to debug problems while providing
more stability when a system is heavily loaded.
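
The time-bounded logging described above might look like the sketch
below. It is an illustration in Python rather than the OSD code:
add_sources, the injectable clock and the final summary line are
assumptions made for this example; only the 0.5 second budget comes from
the commit message.

```python
import time

LOG_BUDGET_SECONDS = 0.5  # threshold quoted in the commit message


def add_sources(objects, log, clock=time.monotonic):
    """Process every object, but only log while within the time budget."""
    start = clock()
    suppressed = 0
    for obj in objects:
        # ... the real per-object work would happen here ...
        if clock() - start <= LOG_BUDGET_SECONDS:
            log(f"added source for {obj}")
        else:
            suppressed += 1  # keep working, just stop logging
    if suppressed:
        # Assumed extra: one summary line so the gap is visible in the log.
        log(f"suppressed {suppressed} log lines")
    return suppressed
```

The work itself is never skipped; only the per-object log calls stop once
the function has been running for longer than the budget.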

Signed-off-by: Bill Scales <bill_scales@uk.ibm.com>
2 weeks agoosd: Increase log level for listing missing list
Bill Scales [Fri, 21 Nov 2025 10:25:48 +0000 (10:25 +0000)]
osd: Increase log level for listing missing list

Logging the entire contents of a missing list can generate a
1M character log line when there are 8000 missing objects in a
PG. Other places in the code logging the missing list use debug
level 25 which is not enabled by default in teuthology tests.

Signed-off-by: Bill Scales <bill_scales@uk.ibm.com>
2 weeks agoosd: reset_tp_timeout should reset timeout for all shards
Bill Scales [Mon, 24 Nov 2025 09:18:21 +0000 (09:18 +0000)]
osd: reset_tp_timeout should reset timeout for all shards

ShardedThreadPools are only used by the classic OSD process
which can have more than one thread for the same shard. Each
thread has a heartbeat timeout used to detect stalled threads.
Some code that is known to take a long time makes calls to
reset_tp_timeout to reset this timeout. However for sharded
pools this can be ineffective because it is common for threads
for the same shard to use the same locks (e.g. PG Lock) and
therefore if thread A is taking a long time and resetting
its timeout while holding a lock, thread B for the same shard
is liable to be waiting for the same lock, will not be
resetting its timeout and can be timed out.

Debug for issue 72879 showed heartbeat timeouts occurring at
the same time for both shards. An attempt to fix the problem
by calling reset_tp_timeout for the slow thread still showed
the other threads for the shard timing out waiting for the PG
lock that was held by the slow thread. Looking at the OSD code,
in most places where reset_tp_timeout is called the thread is
holding the PG lock.

This commit moves the concept of shard_index from OSD into
ShardedThreadPool and modifies reset_tp_timeout so that it resets
the timeout for all threads for the same shard.

Some code calls reset_tp_timeout from inside loops that can take
a long time without consideration for how long the thread has
actually been running for. There is a risk that this type of
call could repeatedly reset the timeout for another shard which
is genuinely stuck and hence defeat the heartbeat checks. To
prevent this reset_tp_timeout is modified to be a NOP unless
the thread has been processing the current workitem for more
than 0.5 seconds. Therefore threads have to be slow but making
forward progress to be able to reset the timeout.
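
The two rules above (reset every thread of the same shard, and do nothing
until the caller has spent more than 0.5 seconds on its work item) can be
sketched as below. ToyShardedPool and its fields are hypothetical Python
names for illustration and do not reflect the ShardedThreadPool
interface.

```python
import time

MIN_RUNTIME_SECONDS = 0.5  # threshold quoted in the commit message


class ToyShardedPool:
    """Hypothetical model of per-thread heartbeat deadlines grouped by shard."""

    def __init__(self, thread_to_shard, grace, clock=time.monotonic):
        self.thread_to_shard = dict(thread_to_shard)
        self.grace = grace
        self.clock = clock
        self.deadline = {}
        self.work_started = {}

    def start_work(self, thread):
        now = self.clock()
        self.work_started[thread] = now
        self.deadline[thread] = now + self.grace

    def reset_tp_timeout(self, thread):
        now = self.clock()
        if now - self.work_started[thread] <= MIN_RUNTIME_SECONDS:
            return False  # no-op: the caller has not been running long enough
        shard = self.thread_to_shard[thread]
        for other, other_shard in self.thread_to_shard.items():
            # Extend the deadline of every active thread on the same shard.
            if other_shard == shard and other in self.deadline:
                self.deadline[other] = now + self.grace
        return True
```

Threads on other shards keep their original deadlines, so a shard that is
genuinely stuck is still detected.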

Fixes: https://tracker.ceph.com/issues/72879
Signed-off-by: Bill Scales <bill_scales@uk.ibm.com>
2 weeks agoMerge pull request #65739 from tchaikov/rgw-gap-list-manpage
Kefu Chai [Thu, 27 Nov 2025 04:12:08 +0000 (12:12 +0800)]
Merge pull request #65739 from tchaikov/rgw-gap-list-manpage

debian: include rgw-gap-list manpage and rgw-policy-check in ceph-common

Reviewed-by: J. Eric Ivancich <ivancich@redhat.com>
Reviewed-by: Matan Breizman <mbreizma@ibm.com>
2 weeks agoosdc: Add SplitOp capability to Objecter 65771/head
Alex Ainscow [Tue, 14 Oct 2025 08:24:56 +0000 (09:24 +0100)]
osdc: Add SplitOp capability to Objecter

This will provide the ability for Objecter to split up
certain ops and distribute them to the OSDs directly if
that provides a performance advantage.

This is experimental code and is switched off unless the
magic pool flags are enabled. These magic pool flags were
pushed in an earlier commit in the same PR.

Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
2 weeks agoosdc: Fix minor typo
Alex Ainscow [Mon, 13 Oct 2025 11:50:11 +0000 (12:50 +0100)]
osdc: Fix minor typo

Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
2 weeks agoosdc: Interface to allow split reads to copy op from client op to split op
Alex Ainscow [Fri, 3 Oct 2025 14:34:55 +0000 (15:34 +0100)]
osdc: Interface to allow split reads to copy op from client op to split op

When splitting ops, certain additional sub-ops (e.g. get xattr) can simply be
passed through to the child op.

Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
2 weeks agoosdc: Add stub for ability to force an op to always go to a particular shard
Alex Ainscow [Fri, 3 Oct 2025 14:32:22 +0000 (15:32 +0100)]
osdc: Add stub for ability to force an op to always go to a particular shard

This will eventually be used by SplitIo to direct ops to the correct OSD.

Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
2 weeks agoosdc: Split handle_osd_op_reply into two functions
Alex Ainscow [Fri, 3 Oct 2025 14:15:29 +0000 (15:15 +0100)]
osdc: Split handle_osd_op_reply into two functions

The functionality is not altered by this commit.

In the future we want to post-process split-ios after
recombining the read data.

Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
2 weeks agoosdc: Remove unused con parameter from Objecter::_calc_target()
Alex Ainscow [Fri, 3 Oct 2025 14:11:00 +0000 (15:11 +0100)]
osdc: Remove unused con parameter from Objecter::_calc_target()

This parameter is not used by the _calc_target code.  It is being
removed just to clean up the code, as we are making some changes
to _calc_target in later stages of the split io PR.

Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
2 weeks agoosdc: Interface to submit IO with ASIO Post.
Alex Ainscow [Fri, 3 Oct 2025 13:55:56 +0000 (14:55 +0100)]
osdc: Interface to submit IO with ASIO Post.

For direct read failures, the locking is such that we cannot
immediately send a new IO without deadlocking. This new interface
allows an op to be sent as an asio post.

Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
2 weeks agoosd: Call clear_repop_obc for EC as well as Replica.
Alex Ainscow [Fri, 3 Oct 2025 13:51:23 +0000 (14:51 +0100)]
osd: Call clear_repop_obc for EC as well as Replica.

This function is necessary for balanced reads and as such is required for EC too.

Rename the function to make sense, given this change of purpose, but the
functionality does not change.

Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
2 weeks agoosd: Implement sync reads and sparse reads for EC for direct reads
Alex Ainscow [Fri, 3 Oct 2025 13:39:03 +0000 (14:39 +0100)]
osd: Implement sync reads and sparse reads for EC for direct reads

Sparse reads for EC are simple to implement, as the code is essentially
identical to that of replica, with some address translation.

When doing a direct read in EC, only a single OSD is involved. As such,
we can do the more performant sync read rather than an async read.

Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
2 weeks agoosd: Add extent_to_shard_extent interface to PGBackend.
Alex Ainscow [Fri, 3 Oct 2025 13:24:49 +0000 (14:24 +0100)]
osd: Add extent_to_shard_extent interface to PGBackend.

This allows a backend to expose how an object offset/length translates to
an offset/length on a particular shard.

For Replica, this is trivial.

For EC, this means looking up the start and end offsets, then translating
this to shard address space.
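
A rough sketch of the translation, assuming a simple striped layout with
k data shards and a fixed chunk size, where consecutive chunks of the
object go to shards 0..k-1 in turn. The function names and the layout
are assumptions for illustration; the real interface and any plugin
remapping are not shown.

```python
def replica_extent_to_shard_extent(offset, length):
    """For a replica, the shard address space is the object address space."""
    return (offset, length)


def ec_extent_to_shard_extent(offset, length, shard, k, chunk_size):
    """Translate an object extent into (offset, length) on one data shard.

    Returns None when no byte of the extent lands on that shard.
    """
    if length == 0:
        return None
    end = offset + length
    first_chunk = offset // chunk_size
    last_chunk = (end - 1) // chunk_size
    shard_start = None
    shard_end = None
    for chunk in range(first_chunk, last_chunk + 1):
        if chunk % k != shard:
            continue  # this chunk is stored on a different shard
        chunk_begin = chunk * chunk_size
        lo = max(offset, chunk_begin) - chunk_begin
        hi = min(end, chunk_begin + chunk_size) - chunk_begin
        base = (chunk // k) * chunk_size  # where the stripe sits on the shard
        if shard_start is None:
            shard_start = base + lo
        shard_end = base + hi
    if shard_start is None:
        return None
    return (shard_start, shard_end - shard_start)
```

With k=2 and chunk_size=4, the object extent starting at offset 2 with
length 8 touches shard 0 at offset 2 for 4 bytes and shard 1 at offset 0
for 4 bytes.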

Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
2 weeks agoosd: Set the from shard in the EC read reply.
Alex Ainscow [Fri, 3 Oct 2025 13:17:48 +0000 (14:17 +0100)]
osd: Set the from shard in the EC read reply.

This was not necessary prior to direct reads, but is essential when the
client needs to know which shard the read came from.

Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
2 weeks agoosd: Generalise can_serve_replica_read for consumption by EC.
Alex Ainscow [Fri, 3 Oct 2025 13:15:32 +0000 (14:15 +0100)]
osd: Generalise can_serve_replica_read for consumption by EC.

The can_serve_replica_read() function is called by replica to determine whether there are
any uncommitted writes.  If such writes exist, then the system will reject the IO to avoid
the risk of reading data from a write which may yet be rolled back.

The same code is going to be useful for EC direct reads.

The string_view code is not expensive.

Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
2 weeks agoosd: Create EC Direct Read flag and pass through to EC.
Alex Ainscow [Fri, 3 Oct 2025 13:00:10 +0000 (14:00 +0100)]
osd: Create EC Direct Read flag and pass through to EC.

This is in preparation for supporting sparse and sync reads in EC.
Such ops will only be supported for "balance reads".

Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
2 weeks agoosd: Replace unused EC offset translation function with useful one.
Alex Ainscow [Fri, 3 Oct 2025 12:53:33 +0000 (13:53 +0100)]
osd: Replace unused EC offset translation function with useful one.

The old chunk_aligned_shard_offset_to_ro_offset was not only unused, it
didn't actually have the correct logic. We replace it here with a similar
but more useful function that will be used in sparse reads for EC.

Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
2 weeks agoosd: Introduce pool flag for "split IO" and Plugin flag for "direct read"
Alex Ainscow [Fri, 3 Oct 2025 12:49:58 +0000 (13:49 +0100)]
osd: Introduce pool flag for "split IO" and Plugin flag for "direct read"

These flags will currently behave as follows:

1. The pool flag is never set, unless by a user with the osd_pool_default_flags
   config option.
2. The pool flag will be removed for EC pools where the plugin does not support
   direct reads.
3. Replica pools will never remove the flag.

The intention is to eventually invert this logic and allow split IOs upon
upgrade to Umbrella in this same function.
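
The three rules listed above amount to the following decision, shown here
as a small Python sketch. The function and parameter names are
hypothetical; only the rules themselves come from this commit message.

```python
def effective_split_io_flag(flag_set_by_user, pool_is_ec, plugin_supports_direct_read):
    """Return whether the pool keeps the "split IO" flag under the current rules."""
    if not flag_set_by_user:
        return False  # rule 1: the flag is never set automatically
    if pool_is_ec and not plugin_supports_direct_read:
        return False  # rule 2: removed when the EC plugin cannot do direct reads
    return True       # rule 3: replica pools never remove the flag
```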

Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
2 weeks agolibrbd: store CRC32C with initial value -1 to match msgr2 validation 66377/head
Alexander Indenbaum [Sun, 23 Nov 2025 12:21:39 +0000 (14:21 +0200)]
librbd: store CRC32C with initial value -1 to match msgr2 validation

Fix runtime error, using test command:
   sudo dd if=/dev/zero bs=32k of=/dev/nvme0n1 count=1

The error log:
   2025-11-23T11:24:10.512+0000 7f30f4ec0640  1 --2- [v2:192.168.13.2:6802/3444906816,v1:192.168.13.2:6803/3444906816] >> 192.168.13.3:0/3916714748 conn(0x527d400 0x728f700 crc :-1 s=THROTTLE_DONE pgs=2038703 gs=2038723 cs=0 l=1 c_cookie=0 s_cookie=0 reconnecting=0 rev1=1 crypto rx=0 tx=0 comp rx=0 tx=0)._handle_read_frame_epilogue_main bad segment crc calculated=1136411986 expected=4294967295

Ceph msgr2 validation (ceph/src/msg/async/frames_v2.cc:47):
   uint32_t crc = segment_bl.crc32c(-1);  // Uses initial value -1
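
Why the initial value matters can be seen with a small bitwise CRC32C
(Castagnoli polynomial) written in Python. This is a sketch, not Ceph's
implementation: crc32c_raw is a hypothetical helper, and it assumes the
checksum is the raw register value with no final inversion.

```python
def crc32c_raw(data, seed):
    """Bitwise CRC32C update with a caller-supplied initial value.

    Uses the reflected Castagnoli polynomial 0x82F63B78 and applies no
    final XOR (an assumption made for this sketch).
    """
    crc = seed & 0xFFFFFFFF  # -1 becomes 0xFFFFFFFF
    for byte in data:
        crc ^= byte
        for _ in range(8):
            crc = (crc >> 1) ^ (0x82F63B78 if crc & 1 else 0)
    return crc


zeros = bytes(32 * 1024)                  # same shape as the dd test payload
crc_seed_zero = crc32c_raw(zeros, 0)      # all-zero data with seed 0
crc_seed_minus_one = crc32c_raw(zeros, -1)
```

The same data therefore yields different checksums for different initial
values, so the writer and the validator have to agree on the seed.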

Co-authored-by: Ilya Dryomov <idryomov@gmail.com>
Signed-off-by: Alexander Indenbaum <aindenba@redhat.com>
2 weeks agoMerge pull request #66336 from Matan-B/wip-matanb-crimson-snapmapper-osddriver
Matan Breizman [Tue, 25 Nov 2025 13:59:03 +0000 (15:59 +0200)]
Merge pull request #66336 from Matan-B/wip-matanb-crimson-snapmapper-osddriver

osd/SnapMapper: fix Crimson logs

Reviewed-by: Aishwarya Mathuria <amathuri@redhat.com>
2 weeks agoMerge pull request #66332 from rhcs-dashboard/73854-CephFS-Authorize-modal-Update...
afreen23 [Tue, 25 Nov 2025 11:00:32 +0000 (16:30 +0530)]
Merge pull request #66332 from rhcs-dashboard/73854-CephFS-Authorize-modal-Update-issues

mgr/dashboard : fix - CephFS Authorize Modal Update issue

Reviewed-by: Dnyaneshwari Talwekar dtalweka@redhat.com
2 weeks agoMerge pull request #66382 from bluikko/doc-mgmt-gateway-improvements-cephadm
bluikko [Tue, 25 Nov 2025 05:22:05 +0000 (12:22 +0700)]
Merge pull request #66382 from bluikko/doc-mgmt-gateway-improvements-cephadm

doc/cephadm: Fix command plus improvements in services/mgmt-gateway.rst

3 weeks agoMerge pull request #66006 from afreen23/carbonize-chnage-password
afreen23 [Mon, 24 Nov 2025 12:22:40 +0000 (17:52 +0530)]
Merge pull request #66006 from afreen23/carbonize-chnage-password

mgr/dashboard: Carbonize the Change Password Form

Reviewed-by: Nizamudeen A <nia@redhat.com>
Reviewed-by: Aashish Sharma <aasharma@redhat.com>
Reviewed-by: Dnyaneshwari Talwekar dtalweka@redhat.com
3 weeks agoMerge pull request #66326 from afreen23/fixes-mixins
afreen23 [Mon, 24 Nov 2025 12:17:33 +0000 (17:47 +0530)]
Merge pull request #66326 from afreen23/fixes-mixins

monitoring: Fixes for development

Reviewed-by: Nizamudeen A <nia@redhat.com>
Reviewed-by: Anthony D Atri <anthony.datri@gmail.com>
3 weeks agomgr/dashboard : fix - CephFS Authorize Modal Update issue 66332/head
Devika Babrekar [Thu, 20 Nov 2025 11:33:56 +0000 (17:03 +0530)]
mgr/dashboard : fix - CephFS Authorize Modal Update issue
fixes : https://tracker.ceph.com/issues/73854
Signed-off-by: Devika Babrekar <devika.babrekar@ibm.com>
3 weeks agodoc/cephadm: Fix command plus improvements in service/mgmt-gateway.rst 66382/head
Ville Ojamo [Mon, 24 Nov 2025 09:34:19 +0000 (16:34 +0700)]
doc/cephadm: Fix command plus improvements in service/mgmt-gateway.rst

Remove double backticks from a CLI command.

Use bash prompt consistently for CLI command blocks.

Don't capitalize word in middle of sentence.

Talk about "admin" instead of "user", similarly to the last text
paragraph in the doc.

Signed-off-by: Ville Ojamo <14869000+bluikko@users.noreply.github.com>
3 weeks agomgr/dashboard: Carbonize the Change Password Form 66006/head
Afreen Misbah [Tue, 21 Oct 2025 16:37:46 +0000 (22:07 +0530)]
mgr/dashboard: Carbonize the Change Password Form

Fixes https://tracker.ceph.com/issues/73193

-  using carbon based stylings, typography and components
-  used grid layout for form arrangement
-  breadcrumb is slightly off, which needs to be fixed by applying grid layout to the app shell

Signed-off-by: Afreen Misbah <afreen@ibm.com>
3 weeks agoMerge pull request #66372 from tchaikov/wip-qa-encoder-exclude
Kefu Chai [Mon, 24 Nov 2025 08:27:14 +0000 (16:27 +0800)]
Merge pull request #66372 from tchaikov/wip-qa-encoder-exclude

qa/suites/rados/encoder: exclude ceph-osd-classic when installing LTS…

Reviewed-by: Matan Breizman <mbreizma@ibm.com>
3 weeks agoqa/suites/rados/encoder: exclude ceph-osd-* when installing LTS releases 66372/head
Kefu Chai [Sat, 22 Nov 2025 00:24:36 +0000 (08:24 +0800)]
qa/suites/rados/encoder: exclude ceph-osd-* when installing LTS releases

In a37b5b5, the ceph-osd-classic and ceph-osd-crimson packages were
added to qa/packages/packages.yaml. The "install" task uses this file as
the default package list for all branches, including LTS releases like
Reef.

However, a37b5b5 only exists in the main branch and won't be backported
to LTS branches. This causes installation failures in the rados/encoder
test suite, which verifies forward compatibility by installing LTS
releases and testing whether they can decode the latest corpus.

Exclude ceph-osd-classic and ceph-osd-crimson from LTS installations to
ensure the test suite can successfully install ceph-dencoder, which is
required for the interoperability tests.

Fixes: https://tracker.ceph.com/issues/73957
Signed-off-by: Kefu Chai <k.chai@proxmox.com>
3 weeks agoMerge pull request #66293 from anthonyeleven/instore.dbnoonecanhearyouscream
Anthony D'Atri [Mon, 24 Nov 2025 06:07:04 +0000 (01:07 -0500)]
Merge pull request #66293 from anthonyeleven/instore.dbnoonecanhearyouscream

doc: Improve start/hardware-recommendations.rst

3 weeks agoMerge pull request #65995 from pcuzner/rocksdb_compaction_metric
Laura Flores [Sat, 22 Nov 2025 00:04:21 +0000 (18:04 -0600)]
Merge pull request #65995 from pcuzner/rocksdb_compaction_metric

rados/osd: enable compact_running perfcounter at PRIO=5

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
Reviewed-by: Neha Ojha <nojha@ibm.com>
3 weeks agodoc: Improve start/hardware-recommendations.rst 66293/head
Anthony D'Atri [Mon, 17 Nov 2025 17:57:29 +0000 (12:57 -0500)]
doc: Improve start/hardware-recommendations.rst

Signed-off-by: Anthony D'Atri <anthonyeleven@users.noreply.github.com>
3 weeks agoosd/SnapMapper: fix Crimson logs 66336/head
Matan Breizman [Thu, 20 Nov 2025 13:46:49 +0000 (13:46 +0000)]
osd/SnapMapper: fix Crimson logs

Switch to crimson's debugging macro and fix the faulty
subsystem defined of ceph_subsys_

Might help with https://tracker.ceph.com/issues/73790

Signed-off-by: Matan Breizman <mbreizma@redhat.com>
3 weeks agomonitoring: Fixes for development 66326/head
Afreen Misbah [Wed, 19 Nov 2025 20:03:26 +0000 (01:33 +0530)]
monitoring: Fixes for development

- fixes tox.ini using an undefined env, `grafonnet-check` (instead of `jsonnet-check`)
- adds steps for local development of mixins and building jsonnet
- added help command in Makefile
- added comments and descriptions for Makefile and tox.ini

Signed-off-by: Afreen Misbah <afreen@ibm.com>
3 weeks agoMerge pull request #66268 from Matan-B/wip-matanb-old-clients-installs
Matan Breizman [Wed, 19 Nov 2025 13:07:59 +0000 (15:07 +0200)]
Merge pull request #66268 from Matan-B/wip-matanb-old-clients-installs

qa/suites/thrash-old-clients/1-install: exclude ceph-osd-classic

Reviewed-by: Radosław Zarzyński <rzarzyns@redhat.com>
Reviewed-by: Laura Flores <lflores@redhat.com>
3 weeks agoMerge pull request #62765 from bobham-bloomberg/unittest-ceph-assert
Joseph Mundackal [Wed, 19 Nov 2025 11:28:21 +0000 (06:28 -0500)]
Merge pull request #62765 from bobham-bloomberg/unittest-ceph-assert

test/ceph_assert.cc: Disable core files

3 weeks agoMerge pull request #66307 from shraddhaag/wip-shraddhaag-fix-slow-ops
Shraddha Agrawal [Wed, 19 Nov 2025 07:21:19 +0000 (12:51 +0530)]
Merge pull request #66307 from shraddhaag/wip-shraddhaag-fix-slow-ops

qa/clusters/crimson: increase reactors in fixed-1 cluster

Reviewed-by: Matan Breizman <mbreizma@ibm.com>
3 weeks agoMerge pull request #65971 from ceph/wip-20.2.0-documentation
Dan Mick [Tue, 18 Nov 2025 18:55:39 +0000 (10:55 -0800)]
Merge pull request #65971 from ceph/wip-20.2.0-documentation

doc: add Tentacle v20.2.0 release notes

3 weeks agoMerge pull request #65374 from dang/wip-dang-standalone
Daniel Gryniewicz [Tue, 18 Nov 2025 18:03:48 +0000 (13:03 -0500)]
Merge pull request #65374 from dang/wip-dang-standalone

RGW - Standalone - Enable building without librados

3 weeks agodoc/releases: add cephadm notes 65971/head
Laura Flores [Tue, 18 Nov 2025 17:57:39 +0000 (11:57 -0600)]
doc/releases: add cephadm notes

Signed-off-by: Laura Flores <lflores@ibm.com>
3 weeks agodoc/releases: change final release date
Laura Flores [Tue, 18 Nov 2025 17:36:11 +0000 (11:36 -0600)]
doc/releases: change final release date

Signed-off-by: Laura Flores <lflores@ibm.com>
3 weeks agodoc/releases/tentacle: add missing rgw deprecation notice
Casey Bodley [Fri, 24 Oct 2025 12:57:52 +0000 (08:57 -0400)]
doc/releases/tentacle: add missing rgw deprecation notice

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 weeks agodoc/releases/tentacle: fix Crimson highlight
Matan Breizman [Wed, 22 Oct 2025 11:36:04 +0000 (11:36 +0000)]
doc/releases/tentacle: fix Crimson highlight

Signed-off-by: Matan Breizman <mbreizma@redhat.com>
3 weeks agodoc/releases/tentacle: add Crimson highlight
Matan Breizman [Tue, 21 Oct 2025 09:03:23 +0000 (09:03 +0000)]
doc/releases/tentacle: add Crimson highlight

Signed-off-by: Matan Breizman <mbreizma@redhat.com>
3 weeks agodoc/releases: arrange sections in alphabetical order
Laura Flores [Mon, 20 Oct 2025 20:43:04 +0000 (15:43 -0500)]
doc/releases: arrange sections in alphabetical order

Signed-off-by: Laura Flores <lflores@ibm.com>
3 weeks agodoc/releases/tentacle.rst: more RBD items
Ilya Dryomov [Mon, 20 Oct 2025 16:29:07 +0000 (18:29 +0200)]
doc/releases/tentacle.rst: more RBD items

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
3 weeks agodoc/releases/tentacle.rst: fix seastore typo
Matan Breizman [Mon, 20 Oct 2025 16:21:37 +0000 (16:21 +0000)]
doc/releases/tentacle.rst: fix seastore typo

Signed-off-by: Matan Breizman <mbreizma@redhat.com>
3 weeks agodoc/releases: remove extra word
Laura Flores [Mon, 20 Oct 2025 16:08:13 +0000 (11:08 -0500)]
doc/releases: remove extra word

Signed-off-by: Laura Flores <lflores@ibm.com>
3 weeks agodoc/releases/tentacle.rst: Update Crimson section
Matan Breizman [Sun, 19 Oct 2025 08:52:03 +0000 (08:52 +0000)]
doc/releases/tentacle.rst: Update Crimson section

Refer to Tentacle Crimson updates blog post

Signed-off-by: Matan Breizman <mbreizma@redhat.com>
3 weeks agodoc/releases: remove note that is not specific to tentacle
Laura Flores [Fri, 17 Oct 2025 23:15:05 +0000 (18:15 -0500)]
doc/releases: remove note that is not specific to tentacle

Signed-off-by: Laura Flores <lflores@ibm.com>
3 weeks agodoc/releases: move smb support update into highlights
Laura Flores [Fri, 17 Oct 2025 22:54:06 +0000 (17:54 -0500)]
doc/releases: move smb support update into highlights

Signed-off-by: Laura Flores <lflores@ibm.com>
3 weeks agodoc/releases: give smb update a better category title
Laura Flores [Fri, 17 Oct 2025 20:32:25 +0000 (15:32 -0500)]
doc/releases: give smb update a better category title

Signed-off-by: Laura Flores <lflores@ibm.com>
3 weeks agodoc/releases: add Crimson section and clean up references
Laura Flores [Fri, 17 Oct 2025 16:13:42 +0000 (11:13 -0500)]
doc/releases: add Crimson section and clean up references

Signed-off-by: Laura Flores <lflores@ibm.com>
3 weeks agodoc/releases: edit formatting and other small changes
Ville Ojamo [Fri, 17 Oct 2025 15:32:58 +0000 (22:32 +0700)]
doc/releases: edit formatting and other small changes

Formatting with double backticks, indentation etc.

Fix some typos and add missing hyphenation.

Add review comment suggestions by anthonyeleven after OOB discussion.

Signed-off-by: Ville Ojamo <14869000+bluikko@users.noreply.github.com>
3 weeks agodoc/releases: add one more item to RADOS
Laura Flores [Thu, 16 Oct 2025 19:14:44 +0000 (14:14 -0500)]
doc/releases: add one more item to RADOS

Signed-off-by: Laura Flores <lflores@ibm.com>
3 weeks agodoc/releases: add to CephFS section
Laura Flores [Thu, 16 Oct 2025 19:06:24 +0000 (14:06 -0500)]
doc/releases: add to CephFS section

And a bit of RGW

Signed-off-by: Laura Flores <lflores@ibm.com>
3 weeks agodoc/releases: add more to RBD section
Laura Flores [Thu, 16 Oct 2025 18:48:00 +0000 (13:48 -0500)]
doc/releases: add more to RBD section

Signed-off-by: Laura Flores <lflores@ibm.com>
3 weeks agodoc/releases: clean up RGW formatting
Laura Flores [Thu, 16 Oct 2025 18:40:01 +0000 (13:40 -0500)]
doc/releases: clean up RGW formatting

Signed-off-by: Laura Flores <lflores@ibm.com>
3 weeks agodoc/releases: take quincy out of timeline since it is EOL
Laura Flores [Thu, 16 Oct 2025 18:34:07 +0000 (13:34 -0500)]
doc/releases: take quincy out of timeline since it is EOL

Signed-off-by: Laura Flores <lflores@ibm.com>
3 weeks agodoc/releases: add more to mgr section and fix rados formatting
Laura Flores [Thu, 16 Oct 2025 18:32:10 +0000 (13:32 -0500)]
doc/releases: add more to mgr section and fix rados formatting

Signed-off-by: Laura Flores <lflores@ibm.com>
3 weeks agodoc: improve rados section for tentacle release notes
Laura Flores [Thu, 16 Oct 2025 18:09:29 +0000 (13:09 -0500)]
doc: improve rados section for tentacle release notes

Signed-off-by: Laura Flores <lflores@ibm.com>
3 weeks agodoc/tentacle: add more RGW release notes
Casey Bodley [Thu, 16 Oct 2025 13:20:54 +0000 (09:20 -0400)]
doc/tentacle: add more RGW release notes

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 weeks agodoc: add Tentacle v20.2.0 release notes
Laura Flores [Wed, 15 Oct 2025 22:30:08 +0000 (17:30 -0500)]
doc: add Tentacle v20.2.0 release notes

Signed-off-by: Laura Flores <lflores@ibm.com>
3 weeks agoqa/clusters/crimson: increase reactors in fixed-1 cluster 66307/head
Shraddha Agrawal [Mon, 17 Nov 2025 19:50:44 +0000 (01:20 +0530)]
qa/clusters/crimson: increase reactors in fixed-1 cluster

Issue: Various tests were failing randomly due to slow ops. There
was no common ground between them: the failures occurred across
different object stores (seastore and bluestore) and across
different tests.

Cause: Since the failures are quite random, they are likely
caused by a low reactor count.

Solution: Increase the number of reactors used for testing.
I've increased them from the initial value of 2 to 3.

Fixes: https://tracker.ceph.com/issues/72778
Signed-off-by: Shraddha Agrawal <shraddha.agrawal000@gmail.com>
3 weeks agoMerge pull request #66207 from steliaio/fix-rados-fsid-return-doc
Casey Bodley [Tue, 18 Nov 2025 14:25:12 +0000 (09:25 -0500)]
Merge pull request #66207 from steliaio/fix-rados-fsid-return-doc

include/rados/librados.h: fix documented rados_cluster_fsid return value

Reviewed-by: Casey Bodley <cbodley@redhat.com>
4 weeks agotest/ceph_assert.cc: Disable core files 62765/head
Bob Ham [Thu, 10 Apr 2025 12:24:22 +0000 (12:24 +0000)]
test/ceph_assert.cc: Disable core files

Without this, core files are produced which subsequently cause
failures in other tests, specifically smoke.sh and safe-to-destroy.sh.
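
One common way to stop a process from producing core files on POSIX
systems is to set the core size limit to zero before the crash is
triggered. The Python sketch below shows that technique; it is not the
change made to test/ceph_assert.cc, and disable_core_files is a
hypothetical helper name.

```python
import resource


def disable_core_files():
    """Set the soft and hard core-size limits to 0 for this process."""
    resource.setrlimit(resource.RLIMIT_CORE, (0, 0))
    return resource.getrlimit(resource.RLIMIT_CORE)
```

Lowering the hard limit cannot be undone by an unprivileged process, so
this is best done only in the process that is expected to abort.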

Fixes: bfa83df6d33ee2238f1389ca4518592b5c4fb267
Signed-off-by: Bob Ham <bham12@bloomberg.net>
test/ceph_assert.cc: Update line numbers

Signed-off-by: Bob Ham <bham12@bloomberg.net>
4 weeks agoMerge pull request #66105 from rhcs-dashboard/carbonize-sync-policy-form
Pedro Gonzalez Gomez [Mon, 17 Nov 2025 17:57:49 +0000 (18:57 +0100)]
Merge pull request #66105 from rhcs-dashboard/carbonize-sync-policy-form

mgr/dashboard: Carbonize multisite sync policy forms

Reviewed-by: Pedro Gonzalez Gomez <pegonzal@ibm.com>
4 weeks agoqa/suites: exclude ceph-osd-classic 66268/head
Matan Breizman [Sun, 16 Nov 2025 12:52:05 +0000 (12:52 +0000)]
qa/suites: exclude ceph-osd-classic

a37b5b5bde8c2e8d6890f16b31046119ed55f25d added ceph-osd-classic
package.
old-clients and upgrade tests should not try to install the new package
as it is not available in older releases.

Fixes: https://tracker.ceph.com/issues/73848
Signed-off-by: Matan Breizman <mbreizma@redhat.com>
4 weeks agoMerge pull request #65999 from anoopcs9/smb-disable-posix-locking
Adam King [Mon, 17 Nov 2025 16:29:30 +0000 (11:29 -0500)]
Merge pull request #65999 from anoopcs9/smb-disable-posix-locking

mgr/smb: Disable posix locking in share definition

Reviewed-by: Adam King <adking@redhat.com>
4 weeks agoMerge pull request #65931 from anoopcs9/update-smbd-ports-config
Adam King [Mon, 17 Nov 2025 16:28:35 +0000 (11:28 -0500)]
Merge pull request #65931 from anoopcs9/update-smbd-ports-config

smb: Update the configuration logic for smbd ports

Reviewed-by: Adam King <adking@redhat.com>
4 weeks agoMerge pull request #66179 from rhcs-dashboard/73766-remove-subalerts-detail
afreen23 [Mon, 17 Nov 2025 09:52:17 +0000 (15:22 +0530)]
Merge pull request #66179 from rhcs-dashboard/73766-remove-subalerts-detail

mgr/dashboard : Remove subalerts details for multiple subalerts

Reviewed-by: Afreen Misbah <afreen@ibm.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>
Reviewed-by: Ankush Behl <cloudbehl@gmail.com>
4 weeks agomgr/dashboard: Carbonize multisite sync policy forms 66105/head
Naman Munet [Fri, 31 Oct 2025 17:38:49 +0000 (23:08 +0530)]
mgr/dashboard: Carbonize multisite sync policy forms

Fixes: https://tracker.ceph.com/issues/73164
Signed-off-by: Naman Munet <naman.munet@ibm.com>
4 weeks agoMerge pull request #66149 from amathuria/wip-amat-update-seastar-nov25
Matan Breizman [Sun, 16 Nov 2025 09:50:07 +0000 (11:50 +0200)]
Merge pull request #66149 from amathuria/wip-amat-update-seastar-nov25

seastar: bump up seastar submodule

Reviewed-by: Matan Breizman <mbreizma@redhat.com>
4 weeks agomgr/smb: Disable posix locking in share definition 65999/head
Anoop C S [Tue, 21 Oct 2025 08:53:50 +0000 (14:23 +0530)]
mgr/smb: Disable posix locking in share definition

The prerequisites for supporting durable handles[1] in Samba include
disabling the mapping of POSIX locks, as well as setting the `kernel
oplocks` and `kernel sharemodes` parameters to disabled. Currently
this configuration is hard‑coded, but in the future it could be made
conditional and combined with other settings to enable persistent
handles on continuously available shares.

[1] https://www.samba.org/samba/docs/current/man-html/smb.conf.5.html#DURABLEHANDLES

Signed-off-by: Anoop C S <anoopcs@cryptolab.net>
4 weeks agoMerge pull request #65962 from NitzanMordhai/wip-nitzan-MOSDOpReply-pgid64-encode...
SrinivasaBharathKanta [Thu, 13 Nov 2025 23:45:18 +0000 (05:15 +0530)]
Merge pull request #65962 from NitzanMordhai/wip-nitzan-MOSDOpReply-pgid64-encode-decode-fix

messages: MOSDOpReply encode and decode errorcode32_t with PGID64 fea…