git-server-git.apps.pok.os.sepia.ceph.com Git

doc/mgr: edit debug.inc.rst

Edit doc/mgr/dashboard_plugins/debug.inc.rst.

This commit is part of a project to separate out the twenty-five files
that were committed to https://github.com/ceph/ceph/pull/63255.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 1e87aa66b9de0a4e74bb4b1fbd806d974e0e8dcf)

Merge pull request #63336 from zdover23/wip-doc-2025-05-19-backport-63329-to-tentacle

tentacle: doc: Fix missing blank line Sphinx warnings

Merge pull request #63332 from Matan-B/wip-matanb-crimson-tentacle-63166

tentacle: qa/suites/crimson-rados: Seastore recovery tests

Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>

doc: Fix missing blank line Sphinx warnings

Fix four warnings from Sphinx about missing blank line after explicit
markup.

Indent content in list items correctly, fixing formatting errors.

Signed-off-by: Ville Ojamo <14869000+bluikko@users.noreply.github.com>
(cherry picked from commit 5e1b3cd566fd700b36001551d1c7e092fc850def)

qa/suites/crimson-rados: Seastore (recovery) thrash tests

Seastore is currently only being tested with thrash_simple without recovery.
This commit adds recovery thrash tests with radosbench only for now.
Other workloads, mainly `ceph_test_rados` (rados) are not yet supported.
See: https://tracker.ceph.com/issues/71237

Signed-off-by: Matan Breizman <mbreizma@redhat.com>
(cherry picked from commit 88f2269ca7fb2550e6b5f4fc2262594e0de4bbc6)

Merge pull request #63225 from Matan-B/wip-71273-tentacle

tentacle: crimson/os/seastore/cache/LRU: account the empty extent when adding it to LRU

Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>

Merge pull request #63278 from Matan-B/wip-matanb-crimson-tentacle-63226

tentacle: qa/suites/crimson-rados: disable chance_trim_stale_osdmaps

Reviewed-by: Aishwarya Mathuria <amathuri@redhat.com>

Merge pull request #63310 from zdover23/wip-doc-2025-05-16-backport-63293-to-tentacle

tentacle: doc/radosgw: Use ref for hyperlinking to multisite

Merge pull request #63285 from afreen23/wip-71293-tentacle

tentacle: mgr/dashboard: deprecate `transifex-i18ntool` and support transifex cli

Reviewed-by: Afreen Misbah <afreen@ibm.com>
Reviewed-by: Anthony D Atri <anthony.datri@gmail.com>

Merge pull request #63314 from zdover23/wip-doc-2025-05-16-backport-63301-to-tentacle

tentacle: doc/mgr: edit dashboard.rst

Merge pull request #63265 from Matan-B/wip-71309-tentacle

tentacle: crimson/osd/pg: set log_entry_update_waiting_on prior to sending requests

doc/mgr: edit dashboard.rst

Edit doc/mgr/crash.rst. Add prompts.

This changes eighty-nine prompts. Because this makes so many changes,
all other edits included in https://github.com/ceph/ceph/pull/63255 will
be made in a separate commit. This is done for the sake of the patience of
the reviewers (probably Anthony, if history is any guide).

This commit is part of a project to separate out the twenty-five files
that were committed to https://github.com/ceph/ceph/pull/63255.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 1f587883a671775b44b656cf6b7c641d02319b5f)

doc/radosgw: Use ref for hyperlinking to multisite

Use validated ":ref:" hyperlinks instead of "external links" in "target
definitions" when linking within the Ceph docs:
- Update to use existing label in multisite.rst.
- Remove unused "target definitions".

Also use existing label for linking from multisite.rst.
Fix a broken link within multisite.rst.

The rendered PR should look the same as the old docs, only differing in
the source RST.

Signed-off-by: Ville Ojamo <14869000+bluikko@users.noreply.github.com>
(cherry picked from commit e296c756ca191753080cb65e3df4bf3bb160398b)

Merge pull request #63297 from zdover23/wip-doc-2025-05-15-backport-63256-to-tentacle

tentacle: doc/dev/cephfs-mirroring: edit file 1 of x

Merge pull request #63182 from Matan-B/wip-matanb-crimson-tentacle-cmake-rel

tentacle: CMakeLists: Fallback to RelWithDebInfo

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@ibm.com>

Merge pull request #63258 from Matan-B/wip-matanb-crimson-tentacle-63138

tentacle: crimson: bump defaults to be more appropriate

Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>

Merge pull request #63257 from Matan-B/wip-matanb-crimson-tentacle-63137

tentacle: crimson: simplify vstart crimson-osd reactor pinning

Reviewed-by: Samuel Just <sjust@redhat.com>

Merge pull request #63125 from Matan-B/wip-matanb-crimson-tentacle-62847

tentacle: crimson/osd/pg_recovery: rework start_recovery_ops

Reviewed-by: Aishwarya Mathuria <amathuri@redhat.com>

doc/dev/cephfs-mirroring: edit file 1 of x

Add prompts (and perform necessary corrections to glaring grammatical
errors) to doc/dev/cephfs-mirroring.rst, as requested by Jos Collin in https://github.com/ceph/ceph/pull/63237/files#r2085886075.

This commit edits the first quarter of the doc/dev/cephfs-mirroring.rst
file. This commit encompasses about one hundred lines of RST.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit e9a27051a7c0f4884e685a4105e45273aef0abe0)

mgr/dashboard: Update translations for tentacle

Fixes https://tracker.ceph.com/issues/71293

- pulled and added the latest translations for tentacle

Signed-off-by: Afreen Misbah <afreen@ibm.com>

mgr/dashboard: Edit .tx config as per tentacle file

Fixes https://tracker.ceph.com/issues/71293
Updates tentacle as the source

Signed-off-by: Afreen Misbah <afreen@ibm.com>

mgr/dashboard: deprecate `transifex-i18ntool` and support transifex cli

- transifex-i18ntool is no longer maintained
- transifex now officially supports a tool: https://github.com/transifex/cli
- doc update

Fixes https://tracker.ceph.com/issues/71127

Signed-off-by: Afreen Misbah <afreen@ibm.com>
(cherry picked from commit 25fa4218f6263d5ea8130f75f264761c14cc5a36)

Merge pull request #63272 from zdover23/wip-doc-2025-05-14-backport-63259-to-tentacle

tentacle: doc/dev/cephfs-mirroring: edit file 2 of x

qa/suites/crimson-rados: disable chance_trim_stale_osdmaps

Avoid "no valid command" in teuthology logs.
Trimming stale osdmaps is not planned to be introduced soon.
See: https://tracker.ceph.com/issues/65288

Signed-off-by: Matan Breizman <mbreizma@redhat.com>
(cherry picked from commit aa3ff63b6fa9c51e540d1768a2dbb8c267b9d2c4)

Merge pull request #63245 from afreen23/wip-71290-tentacle

tentacle: mgr/dashboard: Add default state when gateway groups are empty

Reviewed-by: Nizamudeen A <nia@redhat.com>

Merge pull request #63066 from mohit84/wip-71145-tentacle

tentacle: crimson: PG backfill is not showing any progress

Reviewed-by: Matan Breizman <mbreizma@redhat.com>

doc/dev/cephfs-mirroring: edit file 2 of x

Add prompts (and perform necessary corrections to glaring grammatical
errors) to doc/dev/cephfs-mirroring.rst, as requested by Jos Collin in
https://github.com/ceph/ceph/pull/63237/files#r2085886075.

This commit edits the second quarter of the doc/dev/cephfs-mirroring.rst
file. This commit encompasses about one hundred lines of RST.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 0768ef3fcab69392635d32ed6c4c7051b27afd37)

Merge pull request #63167 from aclamk/aclamk-bs-recomp-improve-is-worth-tentacle

[tentacle] os/bluestore: Do not recompress large compressed blobs

Reviewed-by: Ronen Friedman <rfriedma@redhat.com>

Merge pull request #63263 from ronen-fr/wip-rf-63183-tentacle

tentacle: osd/scrub: remove (was: fix) deadline calculations

Reviewed-by: Samuel Just <sjust@redhat.com>

Merge pull request #63210 from ceph/tentacle-rtd

tentacle: .github: Fix RTD build retrigger

Merge pull request #63249 from zdover23/wip-doc-2025-05-13-backport-63237-to-tentacle

tentacle: doc: fix formatting in cephfs_mirror dev doc

crimson/osd/../client_request: add logs around get_obc stage

If this stage is in use by another operation we would keep waiting for it
to finish. Add logs before entering the stage and after to keep track of
stuck requests.

Signed-off-by: Matan Breizman <mbreizma@redhat.com>
(cherry picked from commit 548ee41a0ab12b70362f264abc5d31ff947a95bd)

crimson/osd/pg: set log_entry_update_waiting_on prior to sending requests

Before this patch, we would first send the MOSDPGUpdateLogMissing to
all peers and only then insert this rep_tid to log_entry_update_waiting_on.

This could have resulted in a race where we receive the reply prior to
actually inserting the rep_tid.
The reply would have been discarded with "reply on unknown tid" (which
now aborts).
The unhandled reply would have prevented submit_error from returning,
which would keep holding the lock on this obc.
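
The ordering problem described above can be sketched in a few lines of
Python. This is only an illustration under stated assumptions, not the
crimson code: the Primary class, instant_reply transport and method names
are hypothetical, and the transport is a worst case in which the reply
arrives before send() returns.

```python
class Primary:
    """Toy stand-in for the PG; every name here is hypothetical."""

    def __init__(self, send):
        self.send = send              # callable(primary, tid, peer)
        self.waiting_on = {}          # rep_tid -> peers that still owe a reply
        self.unknown_replies = []     # replies that hit an unregistered tid

    def handle_reply(self, tid, peer):
        pending = self.waiting_on.get(tid)
        if pending is None:
            # Corresponds to the "reply on unknown tid" case.
            self.unknown_replies.append((tid, peer))
            return
        pending.discard(peer)
        if not pending:
            del self.waiting_on[tid]  # all peers answered

    def submit_send_first(self, tid, peers):
        # Old ordering: send, then register the tid.
        for peer in peers:
            self.send(self, tid, peer)
        self.waiting_on[tid] = set(peers)

    def submit_register_first(self, tid, peers):
        # New ordering: register the tid, then send.
        self.waiting_on[tid] = set(peers)
        for peer in peers:
            self.send(self, tid, peer)


def instant_reply(primary, tid, peer):
    """Transport whose reply is delivered before send() returns."""
    primary.handle_reply(tid, peer)


buggy = Primary(instant_reply)
buggy.submit_send_first(1, ["osd.1", "osd.2"])

fixed = Primary(instant_reply)
fixed.submit_register_first(1, ["osd.1", "osd.2"])
```

With the old ordering both replies land in unknown_replies and the tid
stays in waiting_on forever; with the new ordering the tid is cleared.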

Fixes: https://tracker.ceph.com/issues/71204
Signed-off-by: Matan Breizman <mbreizma@redhat.com>
(cherry picked from commit a2121ebbc8ab7b41f0ffcd724b46ab778aa54efa)

Merge pull request #63246 from afreen23/wip-71289-tentacle

tentacle: mgr/dashboard: Fix delete listener

Reviewed-by: Nizamudeen A <nia@redhat.com>

qa/standalone/scrub: no 'deadline' checks in osd-scrub-test.sh

As the 'deadline' attribute was removed from the scrub jobs.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
(cherry picked from commit 8d99563fffd111d0c82b338524a5f304f8b71d5f)

osd/scrub: remove the 'deadline' attribute from the scrub job

The scrub job's 'overdue' attribute is no longer calculated -
the only 'scrub is overdue' status remaining after latest
scheduling refactor, is the one performed in PGMap.cc (the
one affecting the 'health warning' status of the cluster).
Thus - there is no longer any reason to maintain any 'deadline'
attribute for the scrub scheduler.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
(cherry picked from commit 5f83bde31ebf51ab521a488444392d5ca05a09a0)

osd/scrub: remove the deep-scrubs deadline attribute

As it is no longer meaningful in the context of the new
scrub scheduling design.

The change mandates fixes to the way 'schedule-[deeps]crub'
commands are implemented. The offset to use when forcing the
last-scrub timestamp to a new value is now calculated in
ScrubJob::guaranteed_offset(), as ScrubJob is where all
schedule adjustments (which employ the same logic) are
implemented.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
(cherry picked from commit 4e6323f894fcd96a3d6235451a7d54913e6a1d40)

doc/rados/operations/pools.rst: changes to scrub parameters

Following Tentacle changes to the operation of the cluster/pool
options that affect the determination of the 'max allowed time
to a scrub'.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
(cherry picked from commit 61673450ac68209f98dd91ef5667dd3f6dcaa481)

osd/scrub: fix deadline calculations

The scrub scheduling deadlines are calculated based on pool and OSD
configuration parameters. The specifics of the calculations are
modified to match the new scrub scheduling design.

Comments and documentation are updated to reflect the fact that
the deadlines no longer have any meaningful effect on scrub
scheduling.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
(cherry picked from commit 170e9f75fd7bcfe2ab93a5ad2f28b2ea5955db48)

Merge pull request #63104 from ronen-fr/wip-rf-63093-tentacle

tentacle: osd/scrub: check all(*) conditions in restrictions_on_scrubbing()

Reviewed-by: Nitzan Mordechai <nmordech@redhat.com>

crimson: bump seastore_max_concurrent_transactions size from 8->128, clarify description

Retain old value for qa coverage.

Signed-off-by: Samuel Just <sjust@redhat.com>
(cherry picked from commit 988c668775e5a0afe83940e4bb1de6d57794330f)

crimson: bump lru size from 64M->1G, clarify description

Retain old value for qa coverage.

Signed-off-by: Samuel Just <sjust@redhat.com>
(cherry picked from commit 1824937803aad26b4e5ff16ded73ed521afe5517)

tools/contrib: remove balance_cpu and helpers

Replaced by assign_crimson_cores.

Signed-off-by: Samuel Just <sjust@redhat.com>
(cherry picked from commit 7eaa56b0657e8d057ea47b1b8e0078e6591f1779)

vstart.sh: simplify crimson core assignment, use assign_crimson_cores.py

This commit simplifies the internal flow in a few ways:
- core assignment is entirely handled by prep_balance_cpu and
  do_balance_cpu.  The latter simply does as the cpu_table
  instructs.
- assign_crimson_cores calls lscpu and taskset internally, no
  need for temp files.

It also changes some defaults:
- if crimson-balance-cpu is unset or set to none, crimson-osd will not
  pin cpus at all rather than using the simple sequential allocation
  scheme, which could be much less efficient on platforms where
  cpuids 0,1,2,3,... are on socket 0,1,2,3,...  "osd" and "socket"
  options provide numa aware assignments when requested.

New features:
- Alienstore cores are now assigned with assign_crimson_cores
  using the same balance strategy using
  --crimson-alien-num-cores.
- --crimson-reactor-physical-only and
  --crimson-alienstore-physical-only will cause reactor or
  alienstore cpus respectively to be allocated with one
  cpu per physical core rather than including smt siblings.

Fixes: https://tracker.ceph.com/issues/71096
Signed-off-by: Samuel Just <sjust@redhat.com>
(cherry picked from commit 1795f46ebbc2f061e26f0298815d891fa12c1b96)

tools/contrib: add assign_crimson_cores as a more general replacement for balance_cpu

Improvements:
- shorter
- has tests
- uses lscpu -e --json to get logical<->physical mappings and avoid
needing to parse cpu ranges in lscpu --json
- supports allocating alienstore threads
- supports requiring physical cores only independently for alienstore
and seastar reactors
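
As a rough illustration of the "physical cores only" idea, the sketch
below picks CPUs from data shaped like `lscpu -e --json` output. It is not
the code of assign_crimson_cores: pick_cpus and SAMPLE are hypothetical,
and the JSON shape (a top-level "cpus" list whose entries carry "cpu",
"socket" and "core") is an assumption; real output has more columns and
some versions emit the numbers as strings, hence the int() calls.

```python
import json

# Assumed shape of `lscpu -e --json`; two physical cores with SMT siblings.
SAMPLE = json.dumps({"cpus": [
    {"cpu": 0, "socket": 0, "core": 0},
    {"cpu": 1, "socket": 0, "core": 1},
    {"cpu": 2, "socket": 0, "core": 0},
    {"cpu": 3, "socket": 0, "core": 1},
]})


def pick_cpus(lscpu_json, count, physical_only):
    """Return up to `count` logical CPU ids, lowest id first."""
    cpus = json.loads(lscpu_json)["cpus"]
    seen_cores = set()
    chosen = []
    for entry in sorted(cpus, key=lambda e: int(e["cpu"])):
        core = (int(entry["socket"]), int(entry["core"]))
        if physical_only and core in seen_cores:
            continue  # SMT sibling of a core that is already used
        seen_cores.add(core)
        chosen.append(int(entry["cpu"]))
        if len(chosen) == count:
            break
    return chosen
```

For the sample above, asking for three CPUs with physical_only=True
yields only [0, 1], because CPUs 2 and 3 are siblings of cores already
taken.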

Signed-off-by: Samuel Just <sjust@redhat.com>
(cherry picked from commit efd4aa955c37fbf94b9f72ad98f36f65d3bd3494)

doc: fix formatting in cephfs_mirror dev doc

Signed-off-by: Jos Collin <jcollin@redhat.com>
(cherry picked from commit 8280d334b9b3ee98cb1372d91991fa476a89424b)

mgr/dashboard: Fix delete listener

- pass gw_group to delete API in frontend
- when more than one gw group is present, deleting a listener fails with
  the error message: Multiple NVMe-oF gateway groups are configured.
  Please specify the 'gw_group' parameter in the request.
- added missing types, i18n

Fixes https://tracker.ceph.com/issues/71236

Signed-off-by: Afreen Misbah <afreen@ibm.com>
(cherry picked from commit 87ff77183d84066378618939a0a357a80d773e34)

mgr/dashboard: Add default state when gateway groups are empty

Fixes https://tracker.ceph.com/issues/71247

- after upgrades the nvmeof service spec does not contain `group` field
- this causes UI combobox internal errors
- checking for `group` in spec and disabling the selector

Signed-off-by: Afreen Misbah <afreen@ibm.com>
(cherry picked from commit 9a7c907bfc2345a79ecc1f850cc851f21f74fca5)

Merge pull request #63235 from nbalacha/wip-71241-tentacle

tentacle: cls/rbd: write image mirror status if state is CREATING

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>

Merge pull request #63228 from zdover23/wip-doc-2025-05-12-backport-63209-to-tentacle

tentacle: doc/radosgw: Cosmetic and formatting improvements in vault.rst

rbd: write image mirror status if state is CREATING

It can take up to 30s for the image mirror status to be written
to rbd_mirroring on the secondary for a newly created image. This fix
attempts to reduce the time by writing the status to rbd_mirroring even
if the image state is set to CREATING.

Fixes: https://tracker.ceph.com/issues/71138
Signed-off-by: N Balachandran <nithya.balachandran@ibm.com>
(cherry picked from commit 25a8de9c3db8309387eed3502e781872bc1e035e)

doc/radosgw: Cosmetic and formatting improvements in vault.rst

Use generated "contents" instead of repeating section titles in an
ordered list.

Use Title Case for section titles.

Capitalize consistently "Ceph" and "Object Gateway", "Vault".

Use consistently all caps for acronyms.

Change one entry of "kms" to "SSE-KMS", seems obvious from the context.
Use "certificates" instead of "certs".

Use double backticks for file paths, commands, config options etc.

Improve language in one sentence, several words missing.

Call it consistently "the Object Gateway" and not sometimes "the
Gateway".

Add pre-formatted block for CLI command examples where missing (should
be promptified in the future).

Signed-off-by: Ville Ojamo <14869000+bluikko@users.noreply.github.com>
(cherry picked from commit aa2e91ff70ea37a56f4013e6087f7e790f527329)

crimson/os/seastore/cache/LRU: account the empty extent when adding it to LRU

Don't update num_extents when a partial extent is already in LRU.

Fixes: https://tracker.ceph.com/issues/69986
Signed-off-by: Zhang Song <zhangsong02@qianxin.com>
(cherry picked from commit ee388c3e4268a55bb1f9ddfba08321ff0fb4f4fe)

Merge pull request #63153 from NitzanMordhai/wip-71223-tentacle

tentacle: osd_types: Restore new_object marking for delete missing entries

crimson/osd/pg_recovery: only reset_pglog_based_recovery_op if complete

ce4e9aaad, as part of the start_recovery_ops rewrite, changed the call to
reset_pglog_based_recovery_op to occur unconditionally rather than only
if recovery has completed.

Note, this fix only restores the prior behavior. There's actually still
a race here where a DeferRecovery could be processed between the call to
reset_pglog_based_recovery_op and the RequestBackfill or
AllReplicasRecovered being processed.

Introduced: ce4e9aaad8f2cafae24511fe1687c61dc41affc1
Related: https://tracker.ceph.com/issues/71267
Fixes: https://tracker.ceph.com/issues/70337
Signed-off-by: Samuel Just <sjust@redhat.com>
(cherry picked from commit c95509edcdf387f8e57416f557bd154049ef34df)

Merge pull request #63213 from samarahu/remove-d4n-subsuite-temp

tentacle: qa/d4n: Temporarily remove subsuite

Reviewed-by: Pritha Srivastava <prsrivas@redhat.com>

qa/d4n: Temporarily remove subsuite

Signed-off-by: Samarah <samarah.uriarte@ibm.com>

.github: Fix RTD build retrigger

Fixes: https://tracker.ceph.com/issues/70430
Signed-off-by: David Galloway <david.galloway@ibm.com>
(cherry picked from commit 27521bf5580d4f878fbf7cafbd854bc6ef94aed4)

Merge pull request #63196 from zdover23/wip-doc-2025-05-09-backport-63172-to-tentacle

tentacle: doc/mgr/ceph_api: edit index.rst

Merge pull request #63199 from zdover23/wip-doc-2025-05-08-backport-63173-to-tentacle

tentacle: doc/mgr: edit alerts.rst

Merge pull request #63206 from zdover23/wip-doc-2025-05-09-backport-63084-to-tentacle

tentacle: doc/mgr: edit administrator.rst

doc/mgr: edit administrator.rst

Edit doc/mgr/administrator.rst. Add prompts and correct some of the
inelegant English present in the file.

This commit is part of a project to separate out the twenty-five files
that were committed to https://github.com/ceph/ceph/pull/62782.

Credit for this initiative should go to Ville Ojamo.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 859950180f0b7314b097ee9b2f9f192f4cfb3ed4)

Merge pull request #63185 from rhcs-dashboard/wip-71256-tentacle

tentacle: mgr/dashboard: fix make check tests

doc/mgr: edit alerts.rst

Edit doc/mgr/alerts.rst as part of the project to determine where the
error is in https://github.com/ceph/ceph/pull/62782 that prevents the
Jenkins tests from passing.

This commit adds to the work done in
https://github.com/ceph/ceph/pull/62782 by correcting some of the
English that was present in that PR.

This is a change to one of twenty-five files in
https://github.com/ceph/ceph/pull/62782, and this commit represents one
of what will be at least twenty-five other commits made to track this
error down.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit db28c1f32ff67bd1087866c7976233e8ab23fa2c)

doc/mgr/ceph_api: edit index.rst

Edit doc/mgr/ceph_api/index.rst as part of the project to determine
where the error is in https://github.com/ceph/ceph/pull/62782 that
prevents the Jenkins tests from passing.

This is a change to one of twenty-five files in
https://github.com/ceph/ceph/pull/62782, and this commit represents one
of what will be at least twenty-five other commits made to track this
error down.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 5c17f2242318d57e9faaafc83774a8f8a36f6e0e)

mgr/dashboard: fix make check tests

Fixes https://tracker.ceph.com/issues/71246

PRs are failing with import errors when running pylint

Signed-off-by: Afreen Misbah <afreen@ibm.com>
(cherry picked from commit 29f0f2914efacd2e6b9bc7115f5c2d3790bb12cc)

Merge pull request #63094 from afreen23/wip-71158-tentacle

tentacle: mgr/dashboard: Bump grafana version to 11.6.0

Reviewed-by: Adam King <adking@redhat.com>

common/version: --version to output BUILD_TYPE

```
$ ceph-osd --version
ceph version 20.0.0-514-g7a036c6b017 (7a036c6b01769b300ead54db2a3b42182cc59442) tentacle (dev - Debug)
```

Signed-off-by: Matan Breizman <mbreizma@redhat.com>
(cherry picked from commit 2e6ef8740269de40315f9d593d6eacddfdf28f53)

do_cmake.sh: update build type warning

also, use printf instead of cat<<

Signed-off-by: Matan Breizman <mbreizma@redhat.com>
(cherry picked from commit fc7597df316b536e32379a6a67629e2f8293467a)

CMakeLists: Fallback to RelWithDebInfo

Currently, if .git exists, we set CMAKE_BUILD_TYPE=Debug.
Otherwise, we leave it empty and no optimization flags will
be used.
With this change, the fallback CMAKE_BUILD_TYPE is set
to RelWithDebInfo instead.

From CMAKE_BUILD_TYPE manual:
The default value is often an empty string, but this is usually not
desirable and one of the other standard build types is usually more appropriate.

Note: One notable change is that -DNDEBUG will now be defined.

Signed-off-by: Matan Breizman <mbreizma@redhat.com>
(cherry picked from commit c5766bb2c48634925c8685b779ec814edc43d17b)

Merge pull request #63082 from cbodley/wip-qa-rgw-tentacle

tentacle: qa/rgw: point tentacle qa suites at ceph-tentacle test branches

Reviewed-by: Adam C. Emerson <aemerson@redhat.com>

os/bluestore/recompression: Estimator omits large compressed blobs

The problem was that Estimator accepted large compressed blobs for
recompression. The fix is to discourage such actions by penalizing
compressed blobs based on their size. In effect, a small compressed
blob is likely to be recompressed, and a large compressed blob is not.

Fixes: https://tracker.ceph.com/issues/71244
Signed-off-by: Adam Kupczyk <akupczyk@ibm.com>
(cherry picked from commit bbc9e961e9046949138bb3d70e8dd91761fcb088)

os/bluestore/recompression: Now able to reach left boundary

A bad comparison caused the recompression range to exclude the left
boundary point. In most cases this makes little difference, but it prevents:
1) including an extent starting at 0
2) including an extent at the beginning of an onode segment
Now fixed.
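
The class of bug described here (a strict comparison where an inclusive
one is needed) can be shown with a small sketch. This is not BlueStore's
code; extents_in_range, its parameters and EXTENTS are hypothetical and
only illustrate why an extent starting exactly on the left boundary is
lost.

```python
def extents_in_range(extents, left, right, inclusive_left):
    """Return start offsets of (start, length) extents inside [left, right)."""
    picked = []
    for start, length in extents:
        # The strict form (start > left) drops an extent that begins
        # exactly on the left boundary, e.g. one starting at offset 0.
        starts_ok = start >= left if inclusive_left else start > left
        if starts_ok and start + length <= right:
            picked.append(start)
    return picked


EXTENTS = [(0x0000, 0x1000), (0x1000, 0x1000), (0x2000, 0x1000)]
```

With left=0 and right=0x3000, the strict comparison returns only the
extents at 0x1000 and 0x2000, while the inclusive one also returns the
extent at 0.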

Fixes: https://tracker.ceph.com/issues/71244
Signed-off-by: Adam Kupczyk <akupczyk@ibm.com>
(cherry picked from commit acfe527d9bbe3364f9e321ce6e790f93eafe41df)

osd_types: Restore new_object marking for delete missing entries

Recent changes (PR #29893) removed the “new_object” parameter from missing.add() and the
pg_missing_item constructor. As a result, when processing delete log entries,
if an object is found on disk, its on‑disk version is stored as “have” instead
of the default eversion_t() (0'0). The invariant in read_log_and_missing() then
fails because delete entries are expected to have “have” set to eversion_t().

This patch reintroduces the following check:
  if (have == eversion_t())
    clean_regions.mark_object_new();

By doing so, we ensure that when the on‑disk “have” is default, the missing record
is marked as new—restoring the previous behavior and satisfying the invariant for
delete operations.
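
A minimal Python model of that check, for readers who want to see the
rule in isolation. Eversion and MissingItem are hypothetical stand-ins
for eversion_t and pg_missing_item, and the is_new flag stands in for
clean_regions.mark_object_new(); none of this is the actual OSD code.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Eversion:
    epoch: int = 0
    version: int = 0          # the default value models 0'0


@dataclass
class MissingItem:
    need: Eversion
    have: Eversion = field(default_factory=Eversion)
    is_new: bool = False

    def __post_init__(self):
        # Mirrors: if (have == eversion_t()) mark the object as new.
        if self.have == Eversion():
            self.is_new = True


no_local_copy = MissingItem(need=Eversion(5, 10))
has_local_copy = MissingItem(need=Eversion(5, 10), have=Eversion(4, 7))
```

Here no_local_copy.is_new is True because "have" is the default version,
while has_local_copy.is_new stays False.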

Fixes: https://tracker.ceph.com/issues/45702
Signed-off-by: Nitzan Mordechai <nmordech@redhat.com>
(cherry picked from commit 4289371b20bb437ad38ded92bc53f5b192a8a16e)

Merge pull request #63146 from zdover23/wip-doc-2025-05-07-backport-63139-to-tentacle

tentacle: doc/cephadm: Correct formatting in upgrade.rst

doc/cephadm: Correct formatting in upgrade.rst

Signed-off-by: Anthony D'Atri <anthonyeleven@users.noreply.github.com>
(cherry picked from commit 0e1adc4ae69b9fdc8f4a8480f3e498efa06520f4)

crimson/osd: Logging fixes

* Fix "failed to log message"
* PGRecovery move to new logging macro
* PGRecovery to print pg prefix as it's impossible to debug specific pg
recovery ops without it.

Signed-off-by: Matan Breizman <mbreizma@redhat.com>
(cherry picked from commit 314d2e7dddcd01b3e239561ecc3281f274d7012f)

crimson/osd/pg: Let PGListener use start_peering_event_operation

PG::start_peering_event_operation is a template function while
PGRecovery::pg is of PGRecoveryListener* type. We can't expose a template
function through the PGRecoveryListener interface since it must be
also virtual.
Instead, introduce start_peering_event_operation_listener which will act
as a wrapper to PG::start_peering_event_operation for PGRecovery to use
freely.

Signed-off-by: Matan Breizman <mbreizma@redhat.com>
(cherry picked from commit c94e604bbb6550109ad5945ab86f82d6d5aeb1a1)

crimson/osd: Convert do_recovery() from bool to stop_iteration

Signed-off-by: Matan Breizman <mbreizma@redhat.com>
(cherry picked from commit 798a14a0f7c48daf203b31fd64de0c4a502053e5)

crimson/osd/pg_recovery: rewrite start_recovery_ops

We had some confusion around the return value from start_recovery_ops.
This commit lays the groundwork for the return type change.

* Move to coroutines
* Update logging macro

Signed-off-by: Matan Breizman <mbreizma@redhat.com>
(cherry picked from commit ce4e9aaad8f2cafae24511fe1687c61dc41affc1)

Merge pull request #63061 from Matan-B/wip-71137-tentacle

tentacle: crimson: osd_operation cleanups and fix for MOSDRepOpReply ordering

Reviewed-by: Aishwarya Mathuria <amathuri@redhat.com>

Merge pull request #63059 from Matan-B/wip-matanb-crimson-tentacle-62894

crimson: fix unused variable warnings due to assert() and NDEBUG builds

Reviewed-by: Aishwarya Mathuria <amathuri@redhat.com>

Merge pull request #63058 from Matan-B/wip-matanb-crimson-tentacle-63035

tentacle: crimson/osd/recovery_backend: scan_for_backfill_primary correctly handle

Reviewed-by: Aishwarya Mathuria <amathuri@redhat.com>

Merge pull request #63107 from zdover23/wip-doc-2025-05-05-backport-62860-to-tentacle

tentacle: doc: update cephfs-journal-tool docs

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

doc: update cephfs-journal-tool docs

Fixes: https://tracker.ceph.com/issues/70953
Signed-off-by: Jos Collin <jcollin@redhat.com>
(cherry picked from commit 7ae0c22dfd63d14773966ab3796c68f1be913534)

osd/scrub: check all(*) conditions in restrictions_on_scrubbing()

Modified OsdScrub::restrictions_on_scrubbing() to check all(*)
conditions, instead of stopping at the first one that is true.
The "new" (since Tentacle) scrub-type-to-conditions mapping is no
longer a simple one (is not "monotonic" in the sense of restrictions
always being removed as the scrub type is more important),
and the caller may want to know them all.

(*) The somewhat costly check for the random backoff is still only
performed if the OSD is not already running too many scrubs.
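
The change from "stop at the first restriction" to "report every
restriction" can be sketched as follows. The field names, parameters and
the rng hook are hypothetical and chosen for illustration; they are not
the members of the real OsdScrub code.

```python
import random
from dataclasses import dataclass


@dataclass
class Restrictions:
    max_concurrency_reached: bool = False
    random_backoff_active: bool = False
    time_not_permitted: bool = False
    load_too_high: bool = False


def restrictions_on_scrubbing(active, max_scrubs, hour_ok, load_ok,
                              backoff_ratio, rng=random.random):
    """Collect every restriction that applies instead of returning early."""
    r = Restrictions()
    if active >= max_scrubs:
        r.max_concurrency_reached = True
    elif rng() < backoff_ratio:
        # (*) The random backoff is only evaluated when the OSD is not
        # already running too many scrubs.
        r.random_backoff_active = True
    # No early return: the remaining conditions are always evaluated.
    r.time_not_permitted = not hour_ok
    r.load_too_high = not load_ok
    return r
```

A caller can then decide per scrub type which of the reported
restrictions actually matter, rather than learning about only the first
one.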

Fixes: https://tracker.ceph.com/issues/71169
Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
(cherry picked from commit 5a3a4f73476580b44442a30c8404a62b5055c96d)

mgr/dashboard: Bump grafana version to 11.6.0

Fixes https://tracker.ceph.com/issues/70929

Signed-off-by: Afreen Misbah <afreen@ibm.com>
(cherry picked from commit 8fcb0b67409d4101de7d5a53c438df7c253db898)

Merge pull request #63043 from cbodley/wip-71115-tentacle

tentacle: rgw/sts: correcting authentication in case s3 ops are directed to a primary from secondary after assumerole.

Reviewed-by: Seena Fallah <seenafallah@gmail.com>

Merge pull request #63075 from cbodley/wip-71151-tentacle

tentacle: rgw/lc: stop using merge_and_store_attrs in remove_bucket_config

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>

Merge pull request #63051 from pritha-srivastava/wip-71108-tentacle

tentacle: rgw/sts: Implementation of validating JWT using modulus and exponent

Reviewed-by: Casey Bodley <cbodley@redhat.com>

qa/rgw: point tentacle qa suites at ceph-tentacle test branches

Signed-off-by: Casey Bodley <cbodley@redhat.com>

qa/rgw/upgrade: add upgrade from squid

Signed-off-by: Casey Bodley <cbodley@redhat.com>
(cherry picked from commit f9eced0ba754a4d2fe586b14a6c809292357ab6e)

qa/rgw/upgrade: remove upgrade from quincy

Signed-off-by: Casey Bodley <cbodley@redhat.com>
(cherry picked from commit 01d2ff3fb0df1381cbfc8c8ff6af623877d4e5ff)

rgw/lc: stop using merge_and_store_attrs in remove_bucket_config

merge_and_store_attrs() is no longer able to remove xattrs. change the
sal::Bucket's attrs manually and call put_info() to write them back
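
Why a merge-style update cannot delete an attribute is easy to see in a
toy model. merge_attrs, remove_lc_config, the put_info callback and the
attribute keys below are all hypothetical illustrations, not the RGW SAL
API.

```python
def merge_attrs(stored, updates):
    """Merge semantics: keys in `updates` overwrite, nothing is removed."""
    merged = dict(stored)
    merged.update(updates)
    return merged


def remove_lc_config(bucket_attrs, put_info, key="user.rgw.lc"):
    """Edit the attribute map directly, then write the whole map back."""
    attrs = dict(bucket_attrs)
    attrs.pop(key, None)
    put_info(attrs)           # stand-in for persisting the bucket info
    return attrs


stored = {"user.rgw.lc": b"rules", "user.rgw.acl": b"acl"}
written = []
after_merge = merge_attrs(stored, {"user.rgw.acl": b"acl"})
after_remove = remove_lc_config(stored, written.append)
```

After the merge the lifecycle key is still present; only the direct edit
followed by a write-back removes it.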

Fixes: https://tracker.ceph.com/issues/71083
Signed-off-by: Casey Bodley <cbodley@redhat.com>
(cherry picked from commit f3cc52124c650d32be2adf3cf540167142423c42)

Merge pull request #63070 from sseshasa/wip-71143-tentacle

tentacle: doc/rados: Update mClock doc on steps to override OSD IOPS capacity config

doc/rados: Update mClock doc on steps to override OSD IOPS capacity config

Describe the steps involved to
- Specify a global value for osd_mclock_max_capacity_iops_{ssd,hdd}, and
- Override existing individually scoped values for OSDs determined during
   start-up for osd_mclock_max_capacity_iops_{ssd,hdd}.

The above is to help with the following:
- Steps to override existing setting with a global value.
- reduce the number of entries in the mon store and instead use a single
   global specification for all OSDs in the cluster in case the underlying
   hardware is the same for all OSDs.

Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>
Fixes: https://tracker.ceph.com/issues/70774
(cherry picked from commit 899da82ae2e857f2a769c08d6623e6da7678d16b)

crimson: Create the shared promise before waited upon

RecoveryBackend::pushes map creates each shared_promise
in the wait_for_pushes call. There can be a situation where
set_pushed is called for a handled push reply (handle_push_reply)
before the shared_promise was even constructed, which leaves
backfill progress stuck.
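
One way to avoid this kind of ordering problem is to create the promise
when the push is issued rather than when someone first waits on it. The
asyncio sketch below shows that idea only; Pushes, start_push and the
use of asyncio futures are assumptions made for illustration, not the
seastar shared_promise code in RecoveryBackend.

```python
import asyncio


class Pushes:
    def __init__(self):
        self.done = {}        # object id -> Future (stand-in for shared_promise)

    def start_push(self, oid):
        # Create the future up front, before any reply can arrive.
        self.done[oid] = asyncio.get_running_loop().create_future()

    def set_pushed(self, oid):
        # Safe even if nobody is waiting yet: the future already exists.
        self.done[oid].set_result(True)

    async def wait_for_pushes(self, oid):
        return await self.done[oid]


async def demo():
    pushes = Pushes()
    pushes.start_push("obj1")
    pushes.set_pushed("obj1")     # reply handled before anyone waits
    return await asyncio.wait_for(pushes.wait_for_pushes("obj1"), timeout=1)


result = asyncio.run(demo())
```

Because the future exists before the reply is handled, the later waiter
sees the completed result instead of waiting on a promise that is never
fulfilled.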

Fixes: https://tracker.ceph.com/issues/70502
Signed-off-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
Signed-off-by: Mohit Agrawal <moagrawa@redhat.com>
(cherry picked from commit 435f065a13c661c4f2439dee340a786ca4b5f31e)

crimson: add operation wrapper for MOSDRepOpReply

This should avoid reordering between cores.

Fixes: https://tracker.ceph.com/issues/69439
Signed-off-by: Samuel Just <sjust@redhat.com>
(cherry picked from commit 0c15eb5ca59c98d776091c5602888b0895df0a72)

crimson: convert cross-core operations to use RemoteOperation

Signed-off-by: Samuel Just <sjust@redhat.com>
(cherry picked from commit b031373de33894e29fa4bf4d207b480a31431c08)

crimson: fix DynamicPerfStats usage in ClientRequest

ClientRequest::get_connection() returns l_conn, which will be
null by the time PG::add_client_request_lat is called in
ClientRequest::do_process. Modify get_connection() to
return a Connection& from whichever of l_conn or r_conn
isn't null.

Signed-off-by: Samuel Just <sjust@redhat.com>
(cherry picked from commit 794bb024efa94cb2bb2a90df226bab05c8b9449b)