]> git.apps.os.sepia.ceph.com Git - ceph-ci.git/log
ceph-ci.git
2 months agoosd: Make projected_size in ECTransaction const
Alex Ainscow [Thu, 24 Apr 2025 12:57:45 +0000 (13:57 +0100)]
osd: Make projected_size in ECTransaction const

This does not need to change once set, so adapt constructor to
allow it to be const.

Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
(cherry picked from commit 350f3a7a7a8511899adb52b499bc537e178acb0e)

2 months agoosd: Fix parity updates in truncates.
Alex Ainscow [Wed, 23 Apr 2025 14:41:11 +0000 (15:41 +0100)]
osd: Fix parity updates in truncates.

Previously in optimised EC, when truncating to a partial
stripe, the parity was not being updated.  This fix reads
the non-truncated data from the final stripe and calculates
parity updates, which are written to the parity shards.

Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
(cherry picked from commit 54a06c97a279d842db0f8059ef991245c3350171)

2 months agoosd: Fix EC cache invalidation bug
Alex Ainscow [Tue, 22 Apr 2025 12:41:19 +0000 (13:41 +0100)]
osd: Fix EC cache invalidation bug

With optimised EC, there were two bugs with cache invalidation:
1. If two invalidates were in the queue, its possible the second
invalidate might be cleared by the first.

2. Reads were being requested if size was being reduced.

Also, added a few debug improvements and some new asserts.

Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
(cherry picked from commit 7fad8b94a7509d052e4f6f773fe718b18964c282)

2 months agoosd: Fix panic on fast_read completion in optimised EC
Alex Ainscow [Thu, 17 Apr 2025 21:53:31 +0000 (22:53 +0100)]
osd: Fix panic on fast_read completion in optimised EC

The completion of sub reads was incorrectly marking all processed reads complete on the first read.

This was causing an early attempt at reconstruct, which panics.

Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
(cherry picked from commit b72a13ef1fcde577edd5ed397e924178ca083f9b)

2 months agoosd: Use partial read path for fast_reads
Alex Ainscow [Thu, 17 Apr 2025 21:51:51 +0000 (22:51 +0100)]
osd: Use partial read path for fast_reads

Previously fast reads had attempted to read entire stripes.  This is not necessary or desirable.

Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
(cherry picked from commit 9ff7589c873d368cfa797acdd970278232a0c906)

2 months agoosd: Fix access-freed-memory issue in EC extent cache.
Alex Ainscow [Thu, 17 Apr 2025 16:23:04 +0000 (17:23 +0100)]
osd: Fix access-freed-memory issue in EC extent cache.

A very similar issue has been in product code, but this was found using valgrind.

Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
(cherry picked from commit 1d7425ef1621816e061c4a8d22d9e4f7617912e8)

2 months agoosd: Make EC alignment independent of page size.
Alex Ainscow [Wed, 9 Apr 2025 12:49:49 +0000 (13:49 +0100)]
osd: Make EC alignment independent of page size.

Code which manipulates full pages is often faster. To exploit this
optimised EC was written to deal with 4k alignment wherever possible.
When inputs are not aligned, they are quickly aligned to 4k.

Not all architectures use 4k page sizes. Some power architectures for
example have a 64k page size.  In such situations, it is unlikely that
using 64k page alignment will provide any performance boost, indeed it
is likely to hurt performance significantly.  As such, EC has been
moved to maintain its internal alignment (4k), whcih can be configured.

This has the added advantage, that we can can potentially tweak this
value in the future.

Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
(cherry picked from commit 6bbc3f9b9947575d91748bba849d9e7a1e7d27e5)

2 months agoosd: Fix Truncates in Optimised EC
Alex Ainscow [Wed, 16 Apr 2025 09:41:48 +0000 (10:41 +0100)]
osd: Fix Truncates in Optimised EC

The previous truncate code attempted to perform a non-aligned truncate by
creating a zero buffer at the end of the object, which was written.

The new code initially truncates to the exact size of the user object before
growing the object to the required 4k alignment. This simpler arrangement
also simplifies the rollback.

Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
(cherry picked from commit d17f06b5fcb2fc749c4d0cbae9beb963bd06c145)

2 months agoosd: Fix written shards policing for multiple loops through generate.
Alex Ainscow [Wed, 16 Apr 2025 06:44:25 +0000 (07:44 +0100)]
osd: Fix written shards policing for multiple loops through generate.

Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
(cherry picked from commit c5f67d3771384e7a780ed6488019fca2509e31c2)

2 months agoosd: Fix shard ordering bug
Alex Ainscow [Thu, 24 Apr 2025 14:14:08 +0000 (15:14 +0100)]
osd: Fix shard ordering bug

Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
(cherry picked from commit 2ead42b8bdf836acd662537a4f2ec3e0b3b61a34)

2 months agoosd: Remove EC-optimized only flag for not reset_complete_to
Alex Ainscow [Tue, 10 Jun 2025 14:58:28 +0000 (15:58 +0100)]
osd: Remove EC-optimized only flag for not reset_complete_to

The protection here applies to non-optimized EC and replica shards, but will
not be exercised as much. So this is essentially a clean up

Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
(cherry picked from commit ca7eebcce7a7f7879deac5b917c3fcaf2249da14)

2 months agoosd: EC optimizations fix bug when recovering only partial write objects
Bill Scales [Fri, 6 Jun 2025 12:28:14 +0000 (13:28 +0100)]
osd: EC optimizations fix bug when recovering only partial write objects

PGLog::reset_complete_to is not handling the scenario where all the
missing objects have a partial write that excludes updating the shard being
recovered as their most recent update. In this scenario the oldest need
is newer than newest log entry. Setting last_compelte to the head of the
log confuses code and makes it think that recovery has completed.

The fix is to hold last_complete one entry behind the head of the log
until all missing objects have been recovered.

PGLog::recover_got already does this when an object is recovered and the
remaining objects to recover match this scenario, so this fix just makes
reset_complete_to behave the same way as recover_got.

Signed-off-by: Bill Scales <bill_scales@uk.ibm.com>
(cherry picked from commit 8ca209e33709b1915858a4cd9747d6c580797a4c)

2 months agoosd: EC optimizations correct pwlc after PG split
Bill Scales [Thu, 5 Jun 2025 10:17:06 +0000 (11:17 +0100)]
osd: EC optimizations correct pwlc after PG split

When a PG splits the log entries are divided between the two PGs,
this can result in PWLC refering to log entries in the other PG.
Rollback PWLC after the split so it is not further advanced that
the most recently completed log entry.

Non-primary shards can be missing log entries and may rollback
PWLC too far because of this, however this does not matter
because a split occurs at the start of a peering cycle and these
shards will be updated with the correct PWLC from the primary
shard later in the peering cycle when they are activated.

Signed-off-by: Bill Scales <bill_scales@uk.ibm.com>
(cherry picked from commit 6c04e4cf2b81c768eb313c3d0e6ac37d8e69b150)

2 months agoosd: EC optimizations overaggresive check for missing objects
Bill Scales [Mon, 26 May 2025 13:33:12 +0000 (14:33 +0100)]
osd: EC optimizations overaggresive check for missing objects

Relax an assert in read_log_and_missing for optimized EC
pools. Because the log may not have entries for partial
writes but the missing list is calculated from the full
log the need version for a missing item may be newer than
the lastest log entry for that object.

ceph_objectstore_tool needs care because we don't want to add
extra dependencies. To minimise the dependencies, we always
relax the asserts when using this tool.

Signed-off-by: Bill Scales <bill_scales@uk.ibm.com>
Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
(cherry picked from commit 20e883fedaf19293e939c4cac44de196bd6c9c19)

2 months agoosd: EC Optimizations fix proc_master_log handling of splits
Bill Scales [Wed, 21 May 2025 17:16:50 +0000 (18:16 +0100)]
osd: EC Optimizations fix proc_master_log handling of splits

For optimized EC pools proc_master_log needs to deal with
the other log being merged being behind the local log because
it is missing partial writes. This is done by finding the
point where the logs diverge and then checking whether local
log entries have been committed on all the shards.

A bug in this code meant that after a PG split (where there
may be gaps in the log due to entries moving to the other PG)
that the divergence point was not found and committed
partial writes ended up being discarded which creates
unfound objects.

Signed-off-by: Bill Scales <bill_scales@uk.ibm.com>
(cherry picked from commit 5f687c4a182b18cab31476854a6b04a46e8c8464)

2 months agoosd: EC Optimizations fix missing call to partial_write
Bill Scales [Tue, 20 May 2025 10:37:05 +0000 (11:37 +0100)]
osd: EC Optimizations fix missing call to partial_write

When a shard is backfilling and it receives a log entry where the
transaction is not applied it can skip the roll forward by
immediately advancing crt. However it is still necessary to
call partial_write in this scenario to keep the pwlc information
up to date.

Signed-off-by: Bill Scales <bill_scales@uk.ibm.com>
(cherry picked from commit ddc306255868f26ae0a3951710ef18207fff9b30)

2 months agoosd: Do not complete log on non primary until missing recovered.
Alex Ainscow [Fri, 16 May 2025 13:52:22 +0000 (14:52 +0100)]
osd: Do not complete log on non primary until missing recovered.

Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
(cherry picked from commit 7f134a32b38f16555984b06f87a8ce581b492cf9)

2 months agoosd: EC Optimizations bug fix for flip/flop acting set
Bill Scales [Wed, 14 May 2025 07:39:40 +0000 (08:39 +0100)]
osd: EC Optimizations bug fix for flip/flop acting set

EC optimizations pools have a set of non-primary shards which
cannot become the primary because they do not have all the
metadata updates. If one of these shards is chosen as the
primary it will set the acting set to force another shard to
be chosen.

It is important that the selected acting set is the same
acting set that will be chosen by the next primary (assuming
nothing else changes) otherwise a PG can get into a state where
the acting set flip/flops between two different states causing
the PG to get stuck in peering and hanging I/O.

A bug in update_peer_info meant that non-primary shards did not
present the same info to choose_acting_set as primary shards
because they were not updating their pg_info_t based on pwlc
information from other shards.

Signed-off-by: Bill Scales <bill_scales@uk.ibm.com>
(cherry picked from commit 54b265f811e545885916367d7d63c7f4d734fae0)

2 months agoosd: Refuse to commit/rollforward beyond end of log.
Alex Ainscow [Tue, 13 May 2025 11:55:14 +0000 (12:55 +0100)]
osd: Refuse to commit/rollforward beyond end of log.

In optimised EC, if transaction is applied to all shards, followed by a
partial transaction AND these two transactions overlap, then it is
possible for the non-primary shards to commit a version which is after
then end of the log.

This commit changes the apply_log such that the commit version will be
changed to the head of the log in such situations.

Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
(cherry picked from commit 912437d47053f92086261e285462ac5b4d8d749a)

2 months agoosd: Refactor partial_write to address multiple issues.
Alex Ainscow [Tue, 29 Apr 2025 11:02:07 +0000 (12:02 +0100)]
osd: Refactor partial_write to address multiple issues.

We fix a number of issues with partial_write here.

Fix an issue where it is unclear whether the empty PWLC state is
newer or older than a populated PWLC on another shard by always
updating the pwlc with an empty range, rather than blank.

This is an unfortunate small increase in metadata, so we should
come back to this in a later commit (or possibly later PR).

Normally a PG log consists of a set of log entries with each
log entry have a version number one greater than the previous
entry. When a PG splits the PG log is split so that each of the
new PGs only has log entries for objects in that PG, which
means there can be gaps between version numbers.

PGBackend::partial_write is trying to keep track of adjacent
log updates than do not update a particular shard storing
these as a range in partial_writes_last_complete. To do this
it must compare with the version number of the previous log
entry rather than testing for a version number increment of one.

Also simplify partial_writes to make it more readable.

Signed-off-by: Bill Scales <bill_scales@uk.ibm.com>
Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
(cherry picked from commit 2467406f3e22c0746ce20cd04b838dccedadf055)

2 months agoosd: nonprimary shards are permitted to have a crt newer than head
Alex Ainscow [Tue, 29 Apr 2025 10:59:24 +0000 (11:59 +0100)]
osd: nonprimary shards are permitted to have a crt newer than head

Non-primary shards do not get updates for some transactions.  It is possible
however for other transactions to increase the can_rollback_to to a later
version.  This causes an assert for some operations.

Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
(cherry picked from commit 6209c8cdf980f261c60073d4535d745f24538a7d)

2 months agoosd: overaggressive assert in read_log_and_missing with optimized EC pool
Bill Scales [Fri, 25 Apr 2025 14:03:02 +0000 (15:03 +0100)]
osd: overaggressive assert in read_log_and_missing with optimized EC pool

read_log_and_missing is called during OSD initializaiton to sanity check
the PG log. One of its checks is too agressive for an optimized EC pool
where because of a partial write there can be a log entry but no update
to the object on this shard (other shards will have been updated). The
fix is to skip the checks when the log entry indicates this shard was
not updated.

Only affects pool with allow_ec_optimizations flag on.

Signed-off-by: Bill Scales <bill_scales@uk.ibm.com>
(cherry picked from commit c8739a2bdb0bf523402d85517a7fce39d445eef5)

2 months agoosd: EC optimizations rework for pg_temp
Bill Scales [Thu, 29 May 2025 11:53:27 +0000 (12:53 +0100)]
osd: EC optimizations rework for pg_temp

Bug fixes for how pg_temp is used with optimized EC pools. For these
pools pg_temp is re-ordered with non-primary shards last. The acting
set was undoing this re-ordering in PeeringState, but this is too
late and results code getting the shard id wrong. One consequence
iof this was an OSD refusing to create a PG because of an incorrect
shard id.

This commit moves the re-ordering earlier into OSDMap::_get_temp_osds,
some changes are then required to OSDMap::clean_temps.

Signed-off-by: Bill Scales <bill_scales@uk.ibm.com>
(cherry picked from commit 6c8b0297aaafeb0cff7350e52212140c85435afe)

2 months agoosd: EC Optimizations OSDMap::clean_temps preventing change of primary
Bill Scales [Fri, 23 May 2025 09:45:46 +0000 (10:45 +0100)]
osd: EC Optimizations OSDMap::clean_temps preventing change of primary

clean_temps is clearing pg_temp if the acting set will be the same
as the up set. For optimized EC pools this is overaggressive because
there are scenarios where it is setting acting set to be the same as
up set to force an alternative shard to be chosen as primary - this
happens because the acting set is transformed to place non-primary
shards at the end of the pg_temp vector.

Detect this scenario and stop clean_temps from undoing the acting
set which is being set by PeeringState::choose_acting.

Signed-off-by: Bill Scales <bill_scales@uk.ibm.com>
(cherry picked from commit 9d9265337a43b3edab8a3c41752baaca835be92a)

2 months agoosd: EC optimizations bug in OSDMap::clean_temps
Bill Scales [Thu, 22 May 2025 12:12:57 +0000 (13:12 +0100)]
osd: EC optimizations bug in OSDMap::clean_temps

OSDMap clean_temps clears pg_temp for a PG when the up set
matches the acting_set. For optimized EC pools the pg_temp
is reordered to place primary shards first, this function
was not calling pgtemp_undo_primaryfirst to revert the
reordering.

This meant that a 2+1 EC PG with up set [1,2,3] and
a desired acting set [1,3,2] re-ordered the acting
set to produce pg_temp as [1,2,3] and then deleted this
because it equals the up set.

Calling pgtemp_undo_primaryfirst makes this code work
as intended.

Signed-off-by: Bill Scales <bill_scales@uk.ibm.com>
(cherry picked from commit ef0025ab168e6dd604465921dbecb7fa3b0331bd)

2 months agomon: prime_pg_temp needs to call pgtemp_primaryfirst when encoding pg_temp
Bill Scales [Wed, 21 May 2025 12:23:33 +0000 (13:23 +0100)]
mon: prime_pg_temp needs to call pgtemp_primaryfirst when encoding pg_temp

Optimized EC pools encode pgtemp in a different order putting the shards
which cannot become the primary (nonprimary shards) last.

prime_pg_temp is missing a call to pgtemp_primaryfirst to reorder the
acting set it is trying to set as pg_temp.

Signed-off-by: Bill Scales <bill_scales@uk.ibm.com>
(cherry picked from commit 95da0cdf96da96ba14cfbc5ee9489aabf9e10fbf)

2 months agoosd: EC Optimizations fix routing of requests to non-zero shard id
Bill Scales [Mon, 19 May 2025 16:01:39 +0000 (17:01 +0100)]
osd: EC Optimizations fix routing of requests to non-zero shard id

Pools with EC optimizations can use pg_temp to modify the selection
of the primary. When this happens the clients route request to the
correct OSD, but wrong shard which causes hung I/Os or misdirected
I/Os.

Fix Objecter to select the correct shard when sending requests to
EC optimized pools. Fix OSD to modify the shard when receving
requests from legacy clients.

Add new unittests to test new functions for remapping the shard id.

Signed-off-by: Bill Scales <bill_scales@uk.ibm.com>
(cherry picked from commit f66cd3e860093380a7f97023bfd1f06312779383)

2 months agoosd: fmt::format shard_versions
Alex Ainscow [Thu, 24 Apr 2025 20:44:15 +0000 (21:44 +0100)]
osd: fmt::format shard_versions

Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
(cherry picked from commit 291bece2ff805d41fd4445ea7ddd78ef86f45b73)

2 months agoosd: Cosmetic code cleanup and improve debug
Alex Ainscow [Thu, 24 Apr 2025 14:13:31 +0000 (15:13 +0100)]
osd: Cosmetic code cleanup and improve debug

All these changes are one of:
* Whitespace changes
* Addition/removal of debug
* Typos
* Make CPU-intensive debug statements deb ug level 30
* Remove unnecessary counter which did not really help with debug
* Extra comments

Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
(cherry picked from commit 8ba4c0e6958f3aa277a194b806154c0928cbf119)

2 months agoMerge pull request #64189 from Matan-B/wip-71823-tentacle
Matan Breizman [Wed, 9 Jul 2025 15:16:16 +0000 (18:16 +0300)]
Merge pull request #64189 from Matan-B/wip-71823-tentacle

tentacle: crimson/os/seastore/omap_manager: only mutate the parent when merge/balance can proceed

Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
2 months agoMerge pull request #64187 from Matan-B/wip-71843-tentacle
Matan Breizman [Wed, 9 Jul 2025 15:11:40 +0000 (18:11 +0300)]
Merge pull request #64187 from Matan-B/wip-71843-tentacle

tentacle: crimson/osd: Admin Socket fixes

Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
2 months agoMerge pull request #64407 from Matan-B/wip-72029-tentacle
Matan Breizman [Wed, 9 Jul 2025 15:10:58 +0000 (18:10 +0300)]
Merge pull request #64407 from Matan-B/wip-72029-tentacle

tentacle: qa/suites/crimson-rados/singleton: add install task

Reviewed-by: Aishwarya Mathuria <amathuri@redhat.com>
2 months agoMerge pull request #64367 from VallariAg/wip-71984-tentacle
afreen23 [Wed, 9 Jul 2025 12:56:28 +0000 (18:26 +0530)]
Merge pull request #64367 from VallariAg/wip-71984-tentacle

tentacle: monitoring: Add alert NVMeoFHostKeepAliveTimeout

Reviewed-by: Afreen Misbah <afreen@ibm.com>
2 months agoqa/suites/crimson-rados/singleton: add install task
Matan Breizman [Mon, 7 Jul 2025 12:10:45 +0000 (12:10 +0000)]
qa/suites/crimson-rados/singleton: add install task

This should fix the chmod 777 /var/log/ceph failures.

We were missing the install task which resulted in no /var/log/ceph:
```
2025-07-07T08:55:44.586 INFO:teuthology.run_tasks:Running task ceph...
2025-07-07T08:55:44.679 INFO:tasks.ceph:Making ceph log dir writeable by
non-root...
2025-07-07T08:55:44.679 DEBUG:teuthology.orchestra.run.smithi144:> sudo
chmod 777 /var/log/ceph
2025-07-07T08:55:44.711
INFO:teuthology.orchestra.run.smithi144.stderr:chmod: cannot access
'/var/log/ceph': No such file or directory
```

Signed-off-by: Matan Breizman <mbreizma@redhat.com>
(cherry picked from commit ecbc607d2e715ba05c113a4d9c9dd3f04c6e2b68)

2 months agoMerge pull request #64165 from zdover23/wip-doc-2025-06-25-backport-64160-to-tentacle
Anthony D'Atri [Wed, 9 Jul 2025 04:23:38 +0000 (00:23 -0400)]
Merge pull request #64165 from zdover23/wip-doc-2025-06-25-backport-64160-to-tentacle

tentacle: doc/dev/config: Document how to use :confval: directive for config op…

2 months agoMerge pull request #64395 from zdover23/wip-doc-2025-07-09-backport-64161-to-tentacle
Anthony D'Atri [Wed, 9 Jul 2025 04:04:04 +0000 (00:04 -0400)]
Merge pull request #64395 from zdover23/wip-doc-2025-07-09-backport-64161-to-tentacle

tentacle: doc: Document ceph-mgr module configuration options

2 months agodoc: Document ceph-mgr module configuration options
Kefu Chai [Wed, 25 Jun 2025 02:26:58 +0000 (10:26 +0800)]
doc: Document ceph-mgr module configuration options

Add comprehensive documentation for defining configuration options in
ceph-mgr modules, including all supported properties and their usage.

Previously, the documentation did not explain how to define ceph-mgr
module configuration options, despite subtle differences from other Ceph
components. This change documents all supported Option properties, their
types, and provides clear examples to help module developers properly
configure their options.

Signed-off-by: Kefu Chai <tchaikov@gmail.com>
(cherry picked from commit 121192f4c95df0ee282cecc7237c8ca788e9cfba)

2 months agoMerge pull request #64392 from zdover23/wip-doc-2025-07-09-backport-64334-to-tentacle
Zac Dover [Wed, 9 Jul 2025 02:01:22 +0000 (12:01 +1000)]
Merge pull request #64392 from zdover23/wip-doc-2025-07-09-backport-64334-to-tentacle

tentacle: doc/rados: remove clonedata command

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
2 months agoMerge pull request #64216 from zdover23/wip-doc-2025-06-27-backport-62944-to-tentacle
Anthony D'Atri [Wed, 9 Jul 2025 01:58:56 +0000 (21:58 -0400)]
Merge pull request #64216 from zdover23/wip-doc-2025-06-27-backport-62944-to-tentacle

tentacle: common/options: fix typo in description

2 months agodoc/rados: remove clonedata command
Zac Dover [Thu, 3 Jul 2025 12:32:48 +0000 (22:32 +1000)]
doc/rados: remove clonedata command

Remove the "clonedata" command from the rados manpage.

The "clonedata" command was removed in
https://github.com/ceph/ceph/pull/13008.

Fixes: https://tracker.ceph.com/issues/71943
Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 1eff1c1147bdb683701a55fbe3ea846fde6937f0)

2 months agoMerge pull request #64011 from cbodley/wip-71673-tentacle
Yuri Weinstein [Tue, 8 Jul 2025 21:25:22 +0000 (14:25 -0700)]
Merge pull request #64011 from cbodley/wip-71673-tentacle

tentacle: build: Fix opentelemetry-cpp build failure on Noble

Reviewed-by: Adam Emerson <aemerson@redhat.com>
Reviewed-by: David Galloway <dgallowa@redhat.com>
2 months agoMerge pull request #64241 from Hezko/wip-71888-tentacle
afreen23 [Tue, 8 Jul 2025 17:45:42 +0000 (23:15 +0530)]
Merge pull request #64241 from Hezko/wip-71888-tentacle

tentacle: mgr/dashboard: add missing namespace CLI commands

Reviewed-by: Afreen Misbah <afreen@ibm.com>
2 months agoMerge pull request #63944 from thotz/bp-tentacle-cloud-restore-allow-copy-cloud-resto...
Casey Bodley [Tue, 8 Jul 2025 11:58:02 +0000 (07:58 -0400)]
Merge pull request #63944 from thotz/bp-tentacle-cloud-restore-allow-copy-cloud-restored-temp-obj

tentacle: rgw/cloud-restore: allow copy of temporary object

Reviewed-by: Casey Bodley <cbodley@redhat.com>
2 months agomgr/dashboard: add missing ns commands
Tomer Haskalovitch [Thu, 22 May 2025 10:35:26 +0000 (13:35 +0300)]
mgr/dashboard: add missing ns commands

Signed-off-by: Tomer Haskalovitch <tomer.haska@ibm.com>
(cherry picked from commit f682d62a9d897ae82d48e155b7b8ee4d07eaaecb)

2 months agoMerge pull request #63982 from cbodley/wip-71699-tentacle
anrao19 [Tue, 8 Jul 2025 05:01:31 +0000 (10:31 +0530)]
Merge pull request #63982 from cbodley/wip-71699-tentacle

tentacle: rgw/multisite: change HTTP error handling from EIO to ERR_INTERNAL_ERROR

2 months agoMerge pull request #64304 from Hezko/wip-71934-tentacle
afreen23 [Tue, 8 Jul 2025 03:14:30 +0000 (08:44 +0530)]
Merge pull request #64304 from Hezko/wip-71934-tentacle

tentacle: mgr/dashboard: NVMeoF CLI migrate prs

Reviewed-by: Afreen Misbah <afreen@ibm.com>
2 months agoMerge pull request #64237 from Hezko/wip-71884-tentacle
afreen23 [Tue, 8 Jul 2025 03:14:10 +0000 (08:44 +0530)]
Merge pull request #64237 from Hezko/wip-71884-tentacle

tentacle: mgr/dashboard: align response of subsystem add and ns add with old cli

Reviewed-by: Afreen Misbah <afreen@ibm.com>
2 months agoMerge pull request #64080 from adk3798/wip-71665-tentacle
Adam King [Mon, 7 Jul 2025 17:18:16 +0000 (13:18 -0400)]
Merge pull request #64080 from adk3798/wip-71665-tentacle

tentacle: src/pybind/mgr/cephadm/service_discovery: fixed HAProxy labels

Reviewed-by: Redouane Kachach <rkachach@redhat.com>
2 months agoMerge pull request #64078 from adk3798/wip-71196-tentacle
Adam King [Mon, 7 Jul 2025 17:06:54 +0000 (13:06 -0400)]
Merge pull request #64078 from adk3798/wip-71196-tentacle

tentacle: mgr/cephadm: Add command to stop host drain

Reviewed-by: Redouane Kachach <rkachach@redhat.com>
2 months agoMerge pull request #64209 from shraddhaag/wip-71858-tentacle
Shraddha Agrawal [Mon, 7 Jul 2025 15:48:56 +0000 (21:18 +0530)]
Merge pull request #64209 from shraddhaag/wip-71858-tentacle

tentacle: mon: add command osd pool clear-availability-status

2 months agoMerge pull request #64129 from NitzanMordhai/wip-71799-tentacle
Shraddha Agrawal [Mon, 7 Jul 2025 15:48:42 +0000 (21:18 +0530)]
Merge pull request #64129 from NitzanMordhai/wip-71799-tentacle

tentacle: src/mon/MgrStatMonitor: fix invalid iterator increment in calc_pool_availability()

2 months agoMerge pull request #64064 from aainscow/wip-71717-tentacle
Laura Flores [Mon, 7 Jul 2025 14:55:51 +0000 (10:55 -0400)]
Merge pull request #64064 from aainscow/wip-71717-tentacle

tentacle: osd: Correct scrub analysis for optimised EC

2 months agoMerge pull request #64077 from adk3798/wip-71195-tentacle
Adam King [Mon, 7 Jul 2025 14:31:37 +0000 (10:31 -0400)]
Merge pull request #64077 from adk3798/wip-71195-tentacle

tentacle: mgr/nfs: Add VIP to HAProxy_Hosts list for ingress type haproxy-protocol

Reviewed-by: Redouane Kachach <rkachach@redhat.com>
2 months agomonitoring: Add alert NVMeoFHostKeepAliveTimeout
Vallari Agrawal [Fri, 20 Jun 2025 14:37:00 +0000 (20:07 +0530)]
monitoring: Add alert NVMeoFHostKeepAliveTimeout

Fixes: https://tracker.ceph.com/issues/71772
Signed-off-by: Vallari Agrawal <vallari.agrawal@ibm.com>
(cherry picked from commit 9977e5c963d6d9c303cf50335f2b6a52e60fba6e)

2 months agoMerge pull request #64197 from idryomov/wip-71838-tentacle
Sunil Angadi [Mon, 7 Jul 2025 06:24:40 +0000 (11:54 +0530)]
Merge pull request #64197 from idryomov/wip-71838-tentacle

tentacle: librbd/migration/QCOWFormat: don't complete read_clusters() inline

2 months agoMerge pull request #64242 from NitzanMordhai/wip-71867-tentacle
NitzanMordhai [Sun, 6 Jul 2025 11:00:37 +0000 (14:00 +0300)]
Merge pull request #64242 from NitzanMordhai/wip-71867-tentacle

tentacle: test/librados/aio_cxx: skip EIO boundary assertion when no in-flight I/Os

2 months agomgr/dashboard: migrate nvmeof pr #1346 to new cli
Tomer Haskalovitch [Thu, 26 Jun 2025 10:25:53 +0000 (13:25 +0300)]
mgr/dashboard: migrate nvmeof pr #1346 to new cli

Signed-off-by: Tomer Haskalovitch <tomer.haska@ibm.com>
(cherry picked from commit 2e82486884b67ac6b7fd553a2ab7283f8b8cb096)

2 months agoMerge pull request #64188 from Matan-B/wip-71821-tentacle
NitzanMordhai [Sun, 6 Jul 2025 08:01:41 +0000 (11:01 +0300)]
Merge pull request #64188 from Matan-B/wip-71821-tentacle

tentacle: src: fix NDEBUG typo

2 months agoMerge pull request #64134 from JonBailey1993/wip-71798-tentacle
NitzanMordhai [Sun, 6 Jul 2025 08:01:26 +0000 (11:01 +0300)]
Merge pull request #64134 from JonBailey1993/wip-71798-tentacle

tentacle: test/osd: Improve readability of errors in ceph_test_rados_io_sequence

2 months agoMerge pull request #63014 from jamiepryde/tentacle-ec-plugins-tidying-nits-and-bits
NitzanMordhai [Sun, 6 Jul 2025 08:01:01 +0000 (11:01 +0300)]
Merge pull request #63014 from jamiepryde/tentacle-ec-plugins-tidying-nits-and-bits

tentacle: erasure-code: reformat EC plugins

2 months agomgr/dashboard: migrate nvmeof pr #1335 to new cli
Tomer Haskalovitch [Thu, 26 Jun 2025 10:00:22 +0000 (13:00 +0300)]
mgr/dashboard: migrate nvmeof pr #1335 to new cli

Signed-off-by: Tomer Haskalovitch <tomer.haska@ibm.com>
(cherry picked from commit 652d93acd96660fa1359377b2ed32aef0145293d)

2 months agomgr/dashboard: migrate nvmeof pr #1327 to new cli
Tomer Haskalovitch [Thu, 26 Jun 2025 09:53:05 +0000 (12:53 +0300)]
mgr/dashboard: migrate nvmeof pr #1327 to new cli

Signed-off-by: Tomer Haskalovitch <tomer.haska@ibm.com>
(cherry picked from commit 4713b19f2c418af340c0d58d46a8f0eeb0e1643f)

2 months agomgr/dashboard: migrate nvmeof pr #1233 to new cli
Tomer Haskalovitch [Wed, 25 Jun 2025 19:08:16 +0000 (22:08 +0300)]
mgr/dashboard: migrate nvmeof pr #1233 to new cli

Signed-off-by: Tomer Haskalovitch <tomer.haska@ibm.com>
(cherry picked from commit 9b1a24bf502267237b3aec594ce5521be7df3ba5)

2 months agomgr/dashboard: migrate nvmeof pr #1277 to new cli
Tomer Haskalovitch [Wed, 25 Jun 2025 09:05:53 +0000 (12:05 +0300)]
mgr/dashboard: migrate nvmeof pr #1277 to new cli

Signed-off-by: Tomer Haskalovitch <tomer.haska@ibm.com>
(cherry picked from commit a3165e081e783d24fa4e5987b01546108cc2f270)

2 months agomgr/dashboard: migrate nvmeof pr #1308 to new cli
Tomer Haskalovitch [Mon, 23 Jun 2025 21:00:58 +0000 (00:00 +0300)]
mgr/dashboard: migrate nvmeof pr #1308 to new cli

Signed-off-by: Tomer Haskalovitch <tomer.haska@ibm.com>
(cherry picked from commit 9a85c27c3749d14693cdbd90b682af8ebb6ab667)

2 months agomgr/dashboard: migrate nvmeof pr #1238 to new cli
Tomer Haskalovitch [Mon, 23 Jun 2025 20:23:05 +0000 (23:23 +0300)]
mgr/dashboard: migrate nvmeof pr #1238 to new cli

Signed-off-by: Tomer Haskalovitch <tomer.haska@ibm.com>
(cherry picked from commit 4c6304e7ff1e2df469b7f6e26337e5cfa13bc4b2)

2 months agomgr/dashboard: align response of subsystem add and ns add with old cli
Tomer Haskalovitch [Tue, 20 May 2025 20:27:43 +0000 (23:27 +0300)]
mgr/dashboard: align response of subsystem add and ns add with old cli

Signed-off-by: Tomer Haskalovitch <il033030@Tomers-MBP.lan>
(cherry picked from commit 512f2133bd746094bd1cecd9d1e897b79ec3dd6d)
Signed-off-by: Tomer Haskalovitch <tomer.haska@ibm.com>
2 months agoMerge pull request #64234 from Hezko/wip-71882-tentacle
Hezko [Sun, 6 Jul 2025 07:41:11 +0000 (10:41 +0300)]
Merge pull request #64234 from Hezko/wip-71882-tentacle

tentacle: mgr/dashboard: Nvmeof CLI plain format support

2 months agoMerge pull request #64257 from leonidc/wip-71893-tentacle
leonidc [Sun, 6 Jul 2025 05:33:34 +0000 (08:33 +0300)]
Merge pull request #64257 from leonidc/wip-71893-tentacle

tentacle: fix duplicated entity addr in the map during reboot of several GWs

2 months agoMerge pull request #64258 from leonidc/wip-71894-tentacle
leonidc [Sun, 6 Jul 2025 05:33:07 +0000 (08:33 +0300)]
Merge pull request #64258 from leonidc/wip-71894-tentacle

tentacle: nvmeofgw: fix sending acks during upgrade

2 months agomgr/dashboard: add plain output type support to nvme cli
Tomer Haskalovitch [Thu, 8 May 2025 08:54:39 +0000 (11:54 +0300)]
mgr/dashboard: add plain output type support to nvme cli

Signed-off-by: Tomer Haskalovitch <tomer.haska@ibm.com>
(cherry picked from commit 27698aec2b1aedf9124725709e85803213633232)

2 months agoMerge pull request #64295 from rhcs-dashboard/sys-packages-dashboard-tentacle
afreen23 [Fri, 4 Jul 2025 08:00:20 +0000 (13:30 +0530)]
Merge pull request #64295 from rhcs-dashboard/sys-packages-dashboard-tentacle

tentacle: mgr/dashboard: use system packages when running tox

Reviewed-by: Afreen Misbah <afreen@ibm.com>
2 months agoMerge pull request #64240 from Hezko/wip-71887-tentacle
afreen23 [Fri, 4 Jul 2025 07:52:25 +0000 (13:22 +0530)]
Merge pull request #64240 from Hezko/wip-71887-tentacle

tentacle: mgr/dashboard: update ceph nvmeof proto files

Reviewed-by: Afreen Misbah <afreen@ibm.com>
2 months agoMerge pull request #64236 from Hezko/wip-71885-tentacle
afreen23 [Fri, 4 Jul 2025 07:52:10 +0000 (13:22 +0530)]
Merge pull request #64236 from Hezko/wip-71885-tentacle

tentacle: mgr/dashboard: align cli commands listener list, ns list, host list

Reviewed-by: Afreen Misbah <afreen@ibm.com>
2 months agoMerge pull request #64342 from zdover23/wip-doc-2025-07-04-backport-63779-to-tentacle
Anthony D'Atri [Thu, 3 Jul 2025 22:36:14 +0000 (18:36 -0400)]
Merge pull request #64342 from zdover23/wip-doc-2025-07-04-backport-63779-to-tentacle

tentacle: doc/mgr: edit telemetry.rst

2 months agoMerge pull request #63980 from cbodley/wip-71690-tentacle
Casey Bodley [Thu, 3 Jul 2025 22:16:29 +0000 (18:16 -0400)]
Merge pull request #63980 from cbodley/wip-71690-tentacle

tentacle: test/common: unittest_fault_injector omits unit-main target

Reviewed-by: Adam C. Emerson <aemerson@redhat.com>
2 months agoMerge pull request #64337 from zdover23/wip-doc-2025-07-03-backport-64319-to-tentacle
Zac Dover [Thu, 3 Jul 2025 22:14:37 +0000 (08:14 +1000)]
Merge pull request #64337 from zdover23/wip-doc-2025-07-03-backport-64319-to-tentacle

tentacle: doc/radosgw: add rgw_enable_lc_threads & rgw_enable_gc_threads

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
2 months agodoc/mgr: edit telemetry.rst
Zac Dover [Fri, 6 Jun 2025 04:57:19 +0000 (14:57 +1000)]
doc/mgr: edit telemetry.rst

Edit doc/mgr/telemetry.rst.

Incorporate the suggestions made by Anthony D'Atri in
https://github.com/ceph/ceph/pull/63735.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 97ef126e4b6a9dcdf9872cf37cdebec289e84acd)

2 months agoMerge pull request #64279 from mkogan1/wip-71906-tentacle
Yuri Weinstein [Thu, 3 Jul 2025 16:54:45 +0000 (09:54 -0700)]
Merge pull request #64279 from mkogan1/wip-71906-tentacle

tentacle: qa/rgw: fix perl tests missing Amazon::S3 module

Reviewed-by: Adam Emerson <aemerson@redhat.com>
Reviewed-by: Casey Bodley <cbodley@redhat.com>
2 months agoMerge pull request #63929 from cbodley/wip-71672-tentacle
Yuri Weinstein [Thu, 3 Jul 2025 16:53:57 +0000 (09:53 -0700)]
Merge pull request #63929 from cbodley/wip-71672-tentacle

tentacle: qa/rgw: bump maven version in hadoop task to resolve 404 Not Found

Reviewed-by: Adam Emerson <aemerson@redhat.com>
2 months agoMerge pull request #63724 from cbodley/wip-71564-tentacle
Yuri Weinstein [Thu, 3 Jul 2025 16:53:04 +0000 (09:53 -0700)]
Merge pull request #63724 from cbodley/wip-71564-tentacle

tentacle: debian: radosgw: add media-types packages as alternative for mime-support

Reviewed-by: Adam Emerson <aemerson@redhat.com>
2 months agoMerge pull request #63436 from mkogan1/wip-71420-tentacle
Yuri Weinstein [Thu, 3 Jul 2025 16:52:21 +0000 (09:52 -0700)]
Merge pull request #63436 from mkogan1/wip-71420-tentacle

tentacle: rgw/d3n: fix valgrind invalid read during exit

Reviewed-by: Casey Bodley <cbodley@redhat.com>
2 months agoMerge pull request #63362 from cbodley/wip-71376-tentacle
Yuri Weinstein [Thu, 3 Jul 2025 16:50:46 +0000 (09:50 -0700)]
Merge pull request #63362 from cbodley/wip-71376-tentacle

tentacle: rgw/rgw_rest: determine the domain uri prefix by rgw_transport_is_secure

Reviewed-by: Adam Emerson <aemerson@redhat.com>
2 months agoMerge pull request #64320 from zdover23/wip-doc-2025-07-03-backport-64306-to-tentacle
Anthony D'Atri [Thu, 3 Jul 2025 14:37:03 +0000 (10:37 -0400)]
Merge pull request #64320 from zdover23/wip-doc-2025-07-03-backport-64306-to-tentacle

tentacle: doc/rados/ops: add caps restore command

2 months agodoc/radosgw: add rgw_enable_lc_threads & rgw_enable_gc_threads
Zac Dover [Thu, 3 Jul 2025 06:45:31 +0000 (16:45 +1000)]
doc/radosgw: add rgw_enable_lc_threads & rgw_enable_gc_threads

Add rgw_enable_lc_threads and rgw_enable_gc_threads to
doc/radosgw/config-ref.rst.

Fixes: https://tracker.ceph.com/issues/58092
Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 45e5468ea7fc57b1cb60f1936558dc2678adc6fd)

2 months agoMerge pull request #64079 from adk3798/wip-71571-tentacle
Adam King [Thu, 3 Jul 2025 12:43:47 +0000 (08:43 -0400)]
Merge pull request #64079 from adk3798/wip-71571-tentacle

tentacle: cephadm: don't collect image ids for daemons with no container info

Reviewed-by: John Mulligan <jmulligan@redhat.com>
2 months agoMerge pull request #63444 from JoshuaGabriel/wip-71407-tentacle
Adam King [Thu, 3 Jul 2025 12:39:56 +0000 (08:39 -0400)]
Merge pull request #63444 from JoshuaGabriel/wip-71407-tentacle

tentacle: orch/module: Make orch unpause an alias for orch resume

Reviewed-by: Redouane Kachach <rkachach@ibm.com>
2 months agomgr/dashboard: catch protobuf error due to mismatch in version
Nizamudeen A [Thu, 26 Jun 2025 07:25:22 +0000 (12:55 +0530)]
mgr/dashboard: catch protobuf error due to mismatch in version

Signed-off-by: Nizamudeen A <nia@redhat.com>
2 months agomgr/dashboard: unpin grpcio deps
Nizamudeen A [Thu, 26 Jun 2025 04:51:34 +0000 (10:21 +0530)]
mgr/dashboard: unpin grpcio deps

Fixes: https://tracker.ceph.com/issues/70411
Signed-off-by: Nizamudeen A <nia@redhat.com>
(cherry picked from commit 84922d1ff0f3aae39ecd998b2b94469ce9989c08)

2 months agomgr/dashboard: sitepackages only for py3
Nizamudeen A [Mon, 17 Mar 2025 05:10:21 +0000 (10:40 +0530)]
mgr/dashboard: sitepackages only for py3

Signed-off-by: Nizamudeen A <nia@redhat.com>
(cherry picked from commit 6985ed15ce20c24f7fa87c23ac65b28cf544c9b7)

2 months agoRevert "mgr/dashboard: pin lxml to fix run-dashboard-tox-make-check failure"
Nizamudeen A [Thu, 13 Mar 2025 06:47:41 +0000 (12:17 +0530)]
Revert "mgr/dashboard: pin lxml to fix run-dashboard-tox-make-check failure"

This reverts commit 1f84505f1232dd8445df1a2a819fa000062d3934.

Signed-off-by: Nizamudeen A <nia@redhat.com>
(cherry picked from commit 8e6b6cdd5c302577c81aa041ef4b3f494ed717a7)

2 months agomgr/dashboard: disable saml2 tests when the dep is not there
Nizamudeen A [Wed, 12 Mar 2025 08:32:10 +0000 (14:02 +0530)]
mgr/dashboard: disable saml2 tests when the dep is not there

Signed-off-by: Nizamudeen A <nia@redhat.com>
(cherry picked from commit 8acf8f7e4fe6d283212154857abd97b6e641de4d)

2 months agomgr/dashboard: use system packages when running tox
Nizamudeen A [Wed, 12 Mar 2025 05:27:29 +0000 (10:57 +0530)]
mgr/dashboard: use system packages when running tox

Fixes: https://tracker.ceph.com/issues/70411
Signed-off-by: Nizamudeen A <nia@redhat.com>
(cherry picked from commit ffde8d044a207d304ec8f916e45c98289f740bec)

2 months agodoc/rados/ops: add caps restore command
Zac Dover [Wed, 2 Jul 2025 12:29:48 +0000 (22:29 +1000)]
doc/rados/ops: add caps restore command

Add a command that restores caps to client.admin after they have been
accidentally destroyed or removed.

Fixes: https://tracker.ceph.com/issues/23594
Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit de8c148a2f99db3a1dc4eb70a19aca19a0e9bacf)

2 months agoMerge pull request #64180 from zdover23/wip-doc-2025-06-25-backport-64172-to-tentacle
Zac Dover [Thu, 3 Jul 2025 05:41:13 +0000 (15:41 +1000)]
Merge pull request #64180 from zdover23/wip-doc-2025-06-25-backport-64172-to-tentacle

tentacle: doc/radosgw: line edit bucket_logging.rst

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
Reviewed-by: Yuval Lifshitz <ylifshit@ibm.com>
2 months agoMerge pull request #64314 from zdover23/wip-doc-2025-07-03-backport-63359-to-tentacle
Anthony D'Atri [Thu, 3 Jul 2025 02:02:18 +0000 (22:02 -0400)]
Merge pull request #64314 from zdover23/wip-doc-2025-07-03-backport-63359-to-tentacle

tentacle: doc/radosgw/s3: Document delete-if-unmodified-since

2 months agodoc/radosgw/s3: Document delete-if-unmodified-since
Anthony D'Atri [Mon, 19 May 2025 14:14:00 +0000 (10:14 -0400)]
doc/radosgw/s3: Document delete-if-unmodified-since

Signed-off-by: Anthony D'Atri <anthonyeleven@users.noreply.github.com>
(cherry picked from commit 3381041d190fd48cf9f6d53f679f1737028cccf4)
(cherry picked from commit 7e2bd1595d64c12e1959cc29acfaf906a9027fc6)

2 months agoMerge pull request #64232 from Hezko/wip-71880-tentacle
afreen23 [Wed, 2 Jul 2025 14:38:21 +0000 (20:08 +0530)]
Merge pull request #64232 from Hezko/wip-71880-tentacle

tentacle: mgr/dashboard: Add missing fields to subsytem list API/CLI

Reviewed-by: Afreen Misbah <afreen@ibm.com>
2 months agoMerge pull request #64233 from Hezko/wip-71881-tentacle
afreen23 [Wed, 2 Jul 2025 14:37:03 +0000 (20:07 +0530)]
Merge pull request #64233 from Hezko/wip-71881-tentacle

tentacle: mgr/dashboard: fix set spdk log level cli command

Reviewed-by: Afreen Misbah <afreen@ibm.com>
2 months agoMerge pull request #64146 from zdover23/wip-doc-2025-06-25-backport-64088-to-tentacle
Zac Dover [Wed, 2 Jul 2025 13:19:34 +0000 (23:19 +1000)]
Merge pull request #64146 from zdover23/wip-doc-2025-06-25-backport-64088-to-tentacle

tentacle: doc: Fixed a spelling error.

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>