git-server-git.apps.pok.os.sepia.ceph.com Git - ceph.git/log

commit | commitdiff | tree

Kefu Chai [Tue, 26 May 2026 14:01:41 +0000 (22:01 +0800)]

crimson/seastore: make RecordSubmitter::wait_available() idempotent

Under sustained 4K randwrite workloads that roll journal segments
frequently, crimson-osd hits
```
    crimson/os/seastore/journal/record_submitter.cc:198:
    FAILED ceph_assert(!is_available())
```
and, in release builds without assertions, a downstream
`boost::throw_exception<std::length_error>` from
`seastar::shared_promise::get_shared_future()` called on a
disengaged `std::optional` in the same code path.

`RecordSubmitter::roll_segment()` arms wait_available_promise on entry,
then chains `journal_allocator.roll().safe_then(...)` whose continuation
sets the promise's value and resets the optional. The background
continuation can resolve before the subsequent `wait_available()` call
is entered -- the optional gets reset, `is_available()` becomes true
again, and `wait_available()`'s `assert(!is_available())` fires. The
brittle invariant being assumed

> .safe_then's continuation will not run before its outer call returns

is not part of seastar's contract.

Honour the documented contract instead.  record_submitter.h
says:

> wait for available if cannot submit, should check
> is_available() again when the future is resolved.

The postcondition is "available when resolved"; the precondition
"unavailable when called" was incidental.  Make `wait_available()`
idempotent: if `is_available()` is already true on entry, return a
ready future immediately. All three external callers
- `RecordSubmitter::roll_segment`
- `CircularBoundedJournal::submit_record`
- `SegmentedOolWriter::do_write`

re-check `is_available()` on the next iteration or in the chained
continuation and dispatch correctly.

Validated by running a 96-job fio randwrite bench to confirm
the fix in operation; pre-patch, the assert fires within ~30 min
and kills the OSD.

Signed-off-by: Kefu Chai <k.chai@proxmox.com>
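
The race and the fix can be modelled outside Seastar. The sketch below is a Python asyncio analogue, not the SeaStore code: `Submitter`, `arm` and `resolve` are hypothetical names, and an `asyncio.Future` held in an optional attribute stands in for the optional shared promise.

```python
import asyncio
from typing import Optional


class Submitter:
    """Toy stand-in for RecordSubmitter; all names here are illustrative."""

    def __init__(self) -> None:
        # None plays the role of the disengaged optional promise.
        self._pending: Optional[asyncio.Future] = None

    def is_available(self) -> bool:
        return self._pending is None

    def arm(self) -> None:
        # A roll starts: the submitter becomes unavailable.
        self._pending = asyncio.get_running_loop().create_future()

    def resolve(self) -> None:
        # Background continuation: reset the optional, then fulfil the promise.
        pending, self._pending = self._pending, None
        if pending is not None:
            pending.set_result(None)

    async def wait_available(self) -> None:
        # Idempotent: being available on entry is not an error.
        if self.is_available():
            return
        await self._pending


async def demo() -> bool:
    s = Submitter()
    s.arm()
    s.resolve()               # the continuation wins the race
    await s.wait_available()  # returns immediately instead of asserting
    s.arm()
    asyncio.get_running_loop().call_soon(s.resolve)
    await s.wait_available()  # normal path: waits until the roll completes
    return s.is_available()


result = asyncio.run(demo())
```

In both orderings the caller observes an available submitter once `wait_available()` completes, which is the postcondition the commit message relies on.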

commit | commitdiff | tree

Nizamudeen A [Wed, 27 May 2026 05:52:17 +0000 (11:22 +0530)]

Merge pull request #69116 from rhcs-dashboard/fix-cephadm-e2e-quoting

mgr/dashboard: fix nested shell quoting in cephadm e2e start-cluster

Reviewed-by: Nizamudeen A <nia@redhat.com>

commit | commitdiff | tree

Afreen Misbah [Wed, 27 May 2026 00:07:38 +0000 (05:37 +0530)]

mgr/dashboard: fix nested shell quoting in cephadm e2e start-cluster

with_libvirt wraps commands in sg libvirt -c "$1", adding an extra
shell layer. Nested double quotes inside the outer double-quoted
string caused the argument to be split — with_libvirt received a
truncated $1, producing "Unterminated quoted string" on the remote
shell.

Drop the unnecessary inner double quotes around cephadm shell
arguments since cephadm shell accepts the command as separate args.
Use single quotes for the grep pattern inside the double-quoted
string so it survives the sg subshell.

Signed-off-by: Afreen Misbah <afreen@ibm.com>
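
The splitting described above can be reproduced with Python's `shlex`, which follows POSIX shell word-splitting rules. The command lines below are hypothetical examples, not the ones from the e2e script; they only show how nested double quotes truncate the first argument and how single quotes survive inside a double-quoted string.

```python
import shlex

# Hypothetical command lines; the real script's arguments are not shown in the commit.
broken = 'with_libvirt "cephadm shell "ceph -s""'
fixed = 'with_libvirt "cephadm shell ceph -s"'
with_grep = "with_libvirt \"ceph -s | grep 'HEALTH_OK'\""

# The inner double quotes close the outer string early, so $1 is cut short.
broken_args = shlex.split(broken)

# Without inner quotes the whole command arrives as a single argument.
fixed_args = shlex.split(fixed)

# Single quotes are literal characters inside a double-quoted string.
grep_args = shlex.split(with_grep)
```

Here `broken_args[1]` is `cephadm shell ceph`, with `-s` split off as a separate word, while `fixed_args[1]` carries the full command.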

commit | commitdiff | tree

Kefu Chai [Wed, 27 May 2026 00:05:25 +0000 (08:05 +0800)]

Merge pull request #69068 from tchaikov/wip-bump-arrow-submodule

rgw: bump Apache Arrow submodule from 17.0.0 to 19.0.1

Reviewed-by: Justin Caratzas <jcaratza@ibm.com>
Reviewed-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

Kamoltat (Junior) Sirivadhna [Tue, 26 May 2026 22:04:43 +0000 (18:04 -0400)]

Merge pull request #67551 from Ericmzhang/wip-improve-pg-autoscale

mgr: Fix autoscaling PG distribution

Reviewed-by: Kamoltat (Junior) Sirivadhna <ksirivad@redhat.com>

commit | commitdiff | tree

Brad Hubbard [Tue, 26 May 2026 21:59:37 +0000 (07:59 +1000)]

Merge pull request #67337 from badone/wip-tracker-74919-ceph-dump-log-new-global-access

scripts: ceph_dump_log.py change global context access

Reviewed-by: Aishwarya Mathuria <amathuri@redhat.com>

commit | commitdiff | tree

Afreen Misbah [Tue, 26 May 2026 21:35:55 +0000 (03:05 +0530)]

Merge pull request #67857 from yaelazulay-redhat/issues_74393_dashboard_fail_to_access_object_when_rgw_use_cephadm_certificate

Issues 74393 dashboard fail to access object when rgw use cephadm certificate

Reviewed-by: Afreen Misbah <afreen@ibm.com>
Reviewed-by: Redouane Kachach <rkachach@ibm.com>

commit | commitdiff | tree

Adam Emerson [Tue, 26 May 2026 18:49:10 +0000 (14:49 -0400)]

Merge pull request #68874 from BBoozmen/wip-oozmen-76563

neorados/cls/log: fix infinite trim loop on empty data log shards

Reviewed-by: Adam C. Emerson <aemerson@redhat.com>

commit | commitdiff | tree

Radoslaw Zarzynski [Tue, 26 May 2026 16:31:12 +0000 (18:31 +0200)]

Merge pull request #67079 from MattyWilliams22/ec-sync-reads

osd: Support for Synchronous Reads in EC

Reviewed-by: Alex Ainscow <aainscow@uk.ibm.com>
Reviewed-by: Bill Scales <bill_scales@uk.ibm.com>
Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>

commit | commitdiff | tree

Pedro Gonzalez Gomez [Tue, 26 May 2026 10:16:32 +0000 (12:16 +0200)]

Merge pull request #67950 from rhcs-dashboard/add-telemetry-status

mgr/dashboard: add telemetry status to overview-health-card

Reviewed-by: Abhishek Desai <abhishek.desai1@ibm.com>

commit | commitdiff | tree

Kefu Chai [Tue, 26 May 2026 09:53:02 +0000 (17:53 +0800)]

Merge pull request #68258 from tchaikov/wip-with-system-jerasure

cmake: support building with system jerasure and gf-complete

Reviewed-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

NitzanMordhai [Tue, 26 May 2026 09:31:45 +0000 (12:31 +0300)]

Merge pull request #61131 from NitzanMordhai/wip-nitzan-mgr-modules-perf-counts

mgr: Add per-module performance counters to mgr

Reviewed-by: Sridhar Seshasayee <sridhar.seshasayee@ibm.com>

commit | commitdiff | tree

Guillaume Abrioux [Tue, 26 May 2026 09:19:03 +0000 (11:19 +0200)]

Merge pull request #68858 from rsacherer/wip-fix-limit-break-existing-devices

ceph-volume: fix re-deployment of OSD issues with disk selection filters and DB Devices

commit | commitdiff | tree

Pedro Gonzalez Gomez [Tue, 26 May 2026 09:06:14 +0000 (11:06 +0200)]

Merge pull request #67935 from rhcs-dashboard/add-csv

mgr/dashboard: Add Hosts via CSV Upload

Reviewed-by: Devika Babrekar <devika.babrekar@ibm.com>
Reviewed-by: Puja Shahu <pshahu@redhat.com>
Reviewed-by: Pedro Gonzalez Gomez <pegonzal@ibm.com>

commit | commitdiff | tree

Guillaume Abrioux [Tue, 26 May 2026 09:04:53 +0000 (11:04 +0200)]

Merge pull request #68894 from guits/cv-dm-mgmt

ceph-volume: OSD mapper lifecycle (LVM + raw) for activate

commit | commitdiff | tree

Matan Breizman [Tue, 26 May 2026 08:02:54 +0000 (11:02 +0300)]

Merge pull request #69064 from tchaikov/wip-crimson-scrub-blocked

crimson/scrub: fix assert in PGScrubber::release_range() on interval change

Reviewed-by: Matan Breizman <mbreizma@redhat.com>

commit | commitdiff | tree

Matan Breizman [Tue, 26 May 2026 08:01:14 +0000 (11:01 +0300)]

Merge pull request #69020 from tchaikov/wip-level-triggered-unblock

crimson/osd: only unblock wait_for_active_blocker on replica when ACTIVE

Reviewed-by: Matan Breizman <mbreizma@redhat.com>

commit | commitdiff | tree

Matan Breizman [Tue, 26 May 2026 08:00:37 +0000 (11:00 +0300)]

Merge pull request #69018 from tchaikov/wip-large-object-size

crimson/seastore: reject oversized writes and zeros instead of aborting

Reviewed-by: Matan Breizman <mbreizma@redhat.com>

commit | commitdiff | tree

Xuehan Xu [Tue, 26 May 2026 05:40:09 +0000 (13:40 +0800)]

Merge pull request #59476 from zhscn/wip-new-128

crimson/os/seastore: introduce static layout of laddr_t

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
Reviewed-by: Matan Breizman <mbreizma@redhat.com>
Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

commit | commitdiff | tree

Yuval Lifshitz [Tue, 26 May 2026 04:58:58 +0000 (07:58 +0300)]

Merge pull request #68887 from ShreeJejurikar/wip-bucket-logging-requester-assumed-role

rgw/logging: use assumed-role ARN as Requester for STS requests

commit | commitdiff | tree

Xuehan Xu [Tue, 26 May 2026 02:54:07 +0000 (10:54 +0800)]

Merge pull request #69067 from xxhdx1985126/wip-seastore-lba-wrong-asserts

crimson/os/seastore/lba: fix wrong asserts and "if" conditions

Reviewed-by: Matan Breizman <mbreizma@redhat.com>

commit | commitdiff | tree

Ronen Friedman [Mon, 25 May 2026 18:27:42 +0000 (21:27 +0300)]

Merge pull request #69082 from ronen-fr/wip-rf-trimlmt-rst

doc/PendingReleaseNotes: document osd_scrub_queued_snaptrims_limit

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

commit | commitdiff | tree

Ronen Friedman [Mon, 25 May 2026 13:13:03 +0000 (13:13 +0000)]

doc/PendingReleaseNotes: document osd_scrub_queued_snaptrims_limit

osd_scrub_queued_snaptrims_limit, introduced in PR#68737,
blocks the initiation of non-urgent scrubs on OSDs that
are overloaded with snap-trim operations.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>

commit | commitdiff | tree

ShreeJejurikar [Wed, 20 May 2026 07:18:03 +0000 (12:48 +0530)]

qa/rgw/bucket-logging: configure STS for assume-role test

Set rgw sts key and enable rgw s3 auth use sts, both needed by
test_bucket_logging_requester_assumed_role. Mirrors the existing
settings in qa/suites/rgw/verify/overrides.yaml.

Signed-off-by: ShreeJejurikar <shreemj8@gmail.com>

commit | commitdiff | tree

Matan Breizman [Mon, 25 May 2026 10:39:18 +0000 (13:39 +0300)]

Merge pull request #69006 from tchaikov/wip-seastore-clamp-block-size-on-small-lba

crimson/seastore: clamp block_size to laddr_t::UNIT_SIZE on small-LBA devices

Reviewed-by: Matan Breizman <mbreizma@redhat.com>

commit | commitdiff | tree

Matan Breizman [Mon, 25 May 2026 10:24:59 +0000 (13:24 +0300)]

Merge pull request #68961 from fultheim/fix-cleaner-stall-projected-ratio

crimson/os/seastore: fix cleaner stall under IO-block pressure

Reviewed-by: Matan Breizman <mbreizma@redhat.com>

commit | commitdiff | tree

Matan Breizman [Mon, 25 May 2026 09:27:01 +0000 (12:27 +0300)]

Merge pull request #68884 from tchaikov/wip-crimson-advance-osdmap

crimson/osd: fix mark-down crash for removed OSDs

Reviewed-by: Matan Breizman <mbreizma@redhat.com>

commit | commitdiff | tree

Matan Breizman [Mon, 25 May 2026 09:26:06 +0000 (12:26 +0300)]

Merge pull request #68861 from tchaikov/wip-crimson-reset-logger

crimson/osd: inline log file stream setup to fix dangling pointer

Reviewed-by: Matan Breizman <mbreizma@redhat.com>

commit | commitdiff | tree

Redouane Kachach [Mon, 25 May 2026 09:14:57 +0000 (11:14 +0200)]

Merge pull request #69042 from Shubhaj1810/revert-67999

Revert "mgr/cephadm: align nodeid and add register_service for NFS Ganesha service visibility"

Reviewed-by: Shweta Bhosale <Shweta.Bhosale1@ibm.com>

commit | commitdiff | tree

Xuehan Xu [Fri, 15 May 2026 09:10:04 +0000 (17:10 +0800)]

crimson/os/seastore: also update the mappings copied by client
transactions when committing background rewriting transactions

With the 128-bit laddr key layout in place, SeaStore::rename would
involve copying mappings. These mappings must also be updated when
the logical extents they point to are rewritten.

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>

commit | commitdiff | tree

Xuehan Xu [Tue, 28 Apr 2026 07:00:24 +0000 (15:00 +0800)]

crimson/os/seastore/omap_manager/log: better output

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>

commit | commitdiff | tree

Xuehan Xu [Sun, 10 May 2026 07:36:22 +0000 (15:36 +0800)]

doc/dev/crimson/seastore_laddr.rst: add descriptions about temp
recovering objects

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>

commit | commitdiff | tree

Xuehan Xu [Thu, 16 Apr 2026 05:47:18 +0000 (13:47 +0800)]

crimson/osd: treat OI-not-existing cases as enoent

This is consistent with classic osds

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>

commit | commitdiff | tree

Xuehan Xu [Tue, 14 Apr 2026 06:00:52 +0000 (14:00 +0800)]

crimson/os/seastore/object_data_handler: new debug logs

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>

commit | commitdiff | tree

Xuehan Xu [Wed, 22 Apr 2026 05:37:46 +0000 (13:37 +0800)]

crimson/osd: create temp recovering objects through touch_temp

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>

commit | commitdiff | tree

Xuehan Xu [Sun, 29 Mar 2026 03:20:52 +0000 (11:20 +0800)]

crimson/os/seastore: handle OP_TOUCH_TEMP

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>

commit | commitdiff | tree

Xuehan Xu [Thu, 26 Mar 2026 08:08:41 +0000 (16:08 +0800)]

os/Transaction: add the interface dedicated to touching temp objects

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>

commit | commitdiff | tree

Xuehan Xu [Tue, 3 Feb 2026 03:11:33 +0000 (11:11 +0800)]

crimson/os/seastore/lba: fix possible namespace lookup error

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>

commit | commitdiff | tree

Zhang Song [Thu, 8 Jan 2026 04:14:20 +0000 (12:14 +0800)]

dev/doc/crimson: clarify dynamic PG and object bits for static laddr design

Signed-off-by: Zhang Song <zhangsong02@qianxin.com>
Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>

commit | commitdiff | tree

Zhang Song [Wed, 3 Sep 2025 07:54:40 +0000 (15:54 +0800)]

crimson/os/seastore: adapt copy on write for static onode prefix

Signed-off-by: Zhang Song <zhangsong02@qianxin.com>
Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>

commit | commitdiff | tree

Zhang Song [Tue, 26 Aug 2025 03:42:49 +0000 (11:42 +0800)]

crimson/os/seastore: support rename for static layout of laddr

Signed-off-by: Zhang Song <zhangsong02@qianxin.com>
Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>

commit | commitdiff | tree

Xuehan Xu [Tue, 26 Aug 2025 06:28:55 +0000 (14:28 +0800)]

crimson/os/seastore: add "move_mapping" to TransactionManager and LBAManager

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>

commit | commitdiff | tree

Xuehan Xu [Mon, 2 Feb 2026 07:42:49 +0000 (15:42 +0800)]

crimson/os/seastore/lba: set extent type for ZERO lba mappings

Signed-off-by: Zhang Song <zhangsong02@qianxin.com>
Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>

commit | commitdiff | tree

Sagar Gopale [Mon, 23 Mar 2026 06:08:44 +0000 (11:38 +0530)]

mgr/dashboard: Add Hosts via CSV Upload

Fixes: https://tracker.ceph.com/issues/75578
Signed-off-by: Sagar Gopale <sagar.gopale@ibm.com>

commit | commitdiff | tree

Redouane Kachach [Mon, 25 May 2026 08:32:23 +0000 (10:32 +0200)]

Merge pull request #68667 from rhcs-dashboard/fix-76316-main

mgr/dashboard: add remote write section to prometheus configuration

Reviewed-by: Redouane Kachach <rkachach@ibm.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>

commit | commitdiff | tree

Nitzan Mordechai [Tue, 17 Dec 2024 13:49:00 +0000 (13:49 +0000)]

workunits/mgr/test_mgr_modules_perf_counters: new test for enable\disable\perf counts

Simple test to enable/disable and get a counters dump
for checking perf counters.

Signed-off-by: Nitzan Mordechai <nmordech@redhat.com>

commit | commitdiff | tree

Nitzan Mordechai [Sun, 8 Dec 2024 18:08:39 +0000 (18:08 +0000)]

mgr: Add per-module performance counters to mgr

This commit introduces performance counters for individual Ceph mgr modules.
These counters allow monitoring module behavior, debugging latency issues,
and identifying performance bottlenecks, all without modifying the modules themselves.

The following counters are now exposed under:
  > ceph daemon mgr.<id> perf dump

Example structure:
"mgr_module_<module_name>": {
    "notify_avg_usec": {     <- Average time spent handling notify events
        "avgcount": 0,
        "sum": 0
    },
    "cmd_avg_usec": {        <- Average time spent processing CLI/admin commands
        "avgcount": 0,
        "sum": 0
    },
    "serve_avg_usec": {      <- Average time spent in module serve loop (if applicable)
        "avgcount": 0,
        "sum": 0
    },
    "alive": 1,              <- Module is alive (1 = running, 0 = exited)
    "cpu_usage": 0,          <- CPU usage in percent
    "mem_rss_change": 0,     <- Memory RSS change in bytes
    "mem_rss_current": 490737664 <- Memory RSS current in bytes
}

Signed-off-by: Nitzan Mordechai <nmordech@ibm.com>
Conflicts:
  src/mgr/ActivePyModules.cc - finisher.queue changed by 63859, adding py_module to the parameter list
  src/mgr/PyModuleRegistry.cc - check_all_modules_started added by 63859
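
A consumer of this dump would typically turn each `avgcount`/`sum` pair into a mean. The sketch below assumes that `sum` is the accumulated time in microseconds and `avgcount` the number of samples, which is an interpretation and is not stated in the commit. The module name and values are invented for illustration.

```python
import json

# Hypothetical sample shaped like the structure above; the values are made up.
sample = json.loads("""
{
  "mgr_module_example": {
    "notify_avg_usec": {"avgcount": 4, "sum": 1000},
    "cmd_avg_usec": {"avgcount": 0, "sum": 0},
    "alive": 1
  }
}
""")


def mean_usec(counter: dict) -> float:
    """Return sum / avgcount, or 0.0 when no samples were recorded."""
    count = counter["avgcount"]
    return counter["sum"] / count if count else 0.0


module = sample["mgr_module_example"]
notify_mean = mean_usec(module["notify_avg_usec"])
cmd_mean = mean_usec(module["cmd_avg_usec"])
```

Guarding against a zero `avgcount` matters because a freshly started module reports zeros for every counter, as in the example structure.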

commit | commitdiff | tree

Guillaume Abrioux [Wed, 13 May 2026 12:57:03 +0000 (14:57 +0200)]

ceph-volume: OSD mapper lifecycle (LVM + raw) for activate

This adds small helpers so activate can consistently bring the OSD device
stack online (LVM lvchange, optional mapper open) and tear it down again,
with refresh in between. Same idea for the raw path. Crypto is handled
inside that flow when the OSD is encrypted.

Fixes: https://tracker.ceph.com/issues/76591
Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>

commit | commitdiff | tree

Yuval Lifshitz [Sun, 24 May 2026 19:29:38 +0000 (22:29 +0300)]

Merge pull request #68771 from jrse/rgw-kafka-mtls-rebased

rgw/kafka: add mTLS support (extends #61572)

commit | commitdiff | tree

Kefu Chai [Sun, 24 May 2026 08:25:46 +0000 (16:25 +0800)]

rgw: bump Apache Arrow submodule from 17.0.0 to 19.0.1

When WITH_SYSTEM_ARROW is false, Ceph builds Arrow from the bundled
src/apache submodule. Our CI uses ubuntu:jammy as the base image, which
does not package libarrow-dev, so the bundled path is always taken there.

Arrow 17.0.0 vendors a copy of Thrift whose download URLs are no longer
reachable, breaking CI builds that try to fetch them at configure time.

Bump arrow submodule to 19.0.1, the latest Arrow release that:
- builds successfully on ubuntu:jammy, and
- requires only CMake 3.22 (the version shipped by ubuntu:jammy)

See also

CMake version shipped by ubuntu:jammy
- https://packages.ubuntu.com/jammy/cmake

arrow releases' CMake support
- maint-19.0.1: https://github.com/apache/arrow/blob/272715f6df2a042d69881ffa03d5078c58e4b345/cpp/CMakeLists.txt#L18
- maint-20.0.0: https://github.com/apache/arrow/blob/3ad0370a04ccdae638755b94c3c31c8760a11193/cpp/CMakeLists.txt#L18

arrow enabled mimalloc by default
- https://github.com/apache/arrow/commit/b907c5dadb516b525c8fafbf34b0116d44044733

Because arrow uses the bundled mimalloc library by default, we need
to disable it in the same commit that bumps the submodule.

Signed-off-by: Kefu Chai <k.chai@proxmox.com>

commit | commitdiff | tree

Kefu Chai [Sun, 24 May 2026 09:55:45 +0000 (17:55 +0800)]

Merge pull request #66150 from MaodiMa/AVX512_crc32c

common: enable AVX512+VPCLMULQDQ for crc32c performance on x86

Reviewed-by: Kefu Chai <k.chai@proxmox.com>

commit | commitdiff | tree

Xuehan Xu [Sat, 23 May 2026 09:23:02 +0000 (17:23 +0800)]

crimson/os/seastore/lba: fix wrong asserts and "if" conditions

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>

commit | commitdiff | tree

Zhang Song [Fri, 30 May 2025 09:45:39 +0000 (17:45 +0800)]

crimson/os/seastore/OMapManager: only store the relative block offset to omap root in OMapInnerNode

Signed-off-by: Zhang Song <zhangsong02@qianxin.com>
Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>

commit | commitdiff | tree

Zhang Song [Tue, 27 May 2025 07:31:13 +0000 (15:31 +0800)]

test/crimson/seastore/test_btree_lba_manager: add test cases for conflict policy

Signed-off-by: Zhang Song <zhangsong02@qianxin.com>
Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>

commit | commitdiff | tree

Zhang Song [Tue, 26 Aug 2025 03:38:49 +0000 (11:38 +0800)]

crimson/os/seastore/lba_manager: implement conflict policy

Signed-off-by: Zhang Song <zhangsong02@qianxin.com>
Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>

commit | commitdiff | tree

Zhang Song [Wed, 11 Jun 2025 04:04:25 +0000 (12:04 +0800)]

crimson/os/seastore: reserve region in LBABtree when touching onode

Signed-off-by: Zhang Song <zhangsong02@qianxin.com>
Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>

commit | commitdiff | tree

Zhang Song [Wed, 11 Jun 2025 04:04:03 +0000 (12:04 +0800)]

crimson/os/seastore/OnodeManager: adapt laddr_hint_t approach

Signed-off-by: Zhang Song <zhangsong02@qianxin.com>
Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>

commit | commitdiff | tree

Zhang Song [Mon, 26 May 2025 07:23:25 +0000 (15:23 +0800)]

crimson/os/seastore/OMapManager: adapt laddr_hint_t approach

Signed-off-by: Zhang Song <zhangsong02@qianxin.com>
Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>

commit | commitdiff | tree

Zhang Song [Tue, 26 Aug 2025 03:36:07 +0000 (11:36 +0800)]

crimson/os/seastore: use laddr_hint_t to allocate the laddr

Signed-off-by: Zhang Song <zhangsong02@qianxin.com>
Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>

commit | commitdiff | tree

Zhang Song [Wed, 11 Jun 2025 03:50:12 +0000 (11:50 +0800)]

crimson/os/seastore/Onode: get sibling's object id when creating new onode

Signed-off-by: Zhang Song <zhangsong02@qianxin.com>
Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>

commit | commitdiff | tree

Zhang Song [Tue, 26 Aug 2025 03:34:37 +0000 (11:34 +0800)]

crimson/os/seastore/Onode: adapt new get hint approach

Signed-off-by: Zhang Song <zhangsong02@qianxin.com>
Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>

commit | commitdiff | tree

Zhang Song [Thu, 22 May 2025 08:58:14 +0000 (16:58 +0800)]

crimson/os/seastore/Onode: support get object/clone prefix

Signed-off-by: Zhang Song <zhangsong02@qianxin.com>
Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>

commit | commitdiff | tree

Zhang Song [Tue, 26 Aug 2025 03:31:03 +0000 (11:31 +0800)]

crimson/os/seastore/Onode: remove default metadata offset/range

Signed-off-by: Zhang Song <zhangsong02@qianxin.com>
Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>

commit | commitdiff | tree

Zhang Song [Wed, 14 May 2025 08:34:00 +0000 (16:34 +0800)]

crimson/os/seastore: introduce laddr_hint_t and associated factory methods

Signed-off-by: Zhang Song <zhangsong02@qianxin.com>
Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>

commit | commitdiff | tree

Zhang Song [Tue, 26 Aug 2025 02:35:55 +0000 (10:35 +0800)]

crimson/os/seastore: make pladdr_t only store the local clone id instead of full laddr_t

Signed-off-by: Zhang Song <zhangsong02@qianxin.com>
Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>

commit | commitdiff | tree

Zhang Song [Wed, 14 May 2025 08:26:26 +0000 (16:26 +0800)]

crimson/os/seastore: introduce static layout of laddr_t

Signed-off-by: Zhang Song <zhangsong02@qianxin.com>
Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>

commit | commitdiff | tree

Zhang Song [Wed, 14 May 2025 07:22:15 +0000 (15:22 +0800)]

crimson/os/seastore: extend the size of laddr_t from 64 bits to 128 bits

Signed-off-by: Zhang Song <zhangsong02@qianxin.com>
Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>

commit | commitdiff | tree

Kefu Chai [Sat, 23 May 2026 13:56:32 +0000 (21:56 +0800)]

Merge pull request #69045 from xxhdx1985126/wip-seastore-drop-retired-placeholder

crimson/os/seastore: remove RetiredExtentPlaceholder

Reviewed-by: Kefu Chai <k.chai@proxmox.com>

commit | commitdiff | tree

Kefu Chai [Sat, 23 May 2026 13:33:14 +0000 (21:33 +0800)]

Merge pull request #68823 from tchaikov/wip-crimson-remove-from

crimson/osd: make PGAdvanceMap idempotent

Reviewed-by: Matan Breizman <mbreizma@redhat.com>

commit | commitdiff | tree

Kefu Chai [Fri, 22 May 2026 11:01:17 +0000 (19:01 +0800)]

crimson/scrub: fix assert in PGScrubber::release_range() on interval change

when an interval change occurs while ScrubReserveRange is still
waiting to acquire background_process_lock, ChunkState::exit()
calls release_range() but blocked is not yet set. this triggers
ceph_assert(blocked) in release_range().

fix by checking if blocked is set before asserting. if blocked is
not set, the range was never reserved, so release_range() is a
no-op. ScrubReserveRange's finally block handles lock cleanup in
this case.

Fixes: https://tracker.ceph.com/issues/76752
Signed-off-by: Kefu Chai <k.chai@proxmox.com>
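
The guard described above makes the release a no-op when nothing was reserved. A minimal Python sketch of that idea follows; the `RangeGuard` class is a hypothetical stand-in for the scrubber's bookkeeping and is not the Crimson code.

```python
from typing import Optional, Tuple


class RangeGuard:
    """Illustrative stand-in for the scrubber's range bookkeeping."""

    def __init__(self) -> None:
        # None means no range has been reserved yet.
        self.blocked: Optional[Tuple[int, int]] = None

    def reserve_range(self, start: int, end: int) -> None:
        self.blocked = (start, end)

    def release_range(self) -> bool:
        # Before the fix, the equivalent of `assert self.blocked` sat here.
        if self.blocked is None:
            return False  # nothing was reserved, so there is nothing to release
        self.blocked = None
        return True


guard = RangeGuard()
early_release = guard.release_range()   # interval change before the reservation
guard.reserve_range(0, 16)
normal_release = guard.release_range()  # regular path
```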

commit | commitdiff | tree

Ronen Friedman [Sat, 23 May 2026 08:04:45 +0000 (11:04 +0300)]

Merge pull request #68684 from ronen-fr/wip-rf-statfx

osd/scrub: auto-correct accounting-only stat mismatches

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>

commit | commitdiff | tree

Redouane Kachach [Sat, 23 May 2026 08:04:37 +0000 (10:04 +0200)]

Merge pull request #68292 from Kushal-deb/fix-nvme-gw-crash

mgr/cephadm: fix nvmeof reconfig loop by preserving daemon deps

Reviewed-by: Redouane Kachach <rkachach@ibm.com>

commit | commitdiff | tree

Redouane Kachach [Sat, 23 May 2026 08:04:03 +0000 (10:04 +0200)]

Merge pull request #67308 from rkachach/fix_issue_ssl_cert_deps

mgr/cephadm: track TLS spec changes in deps and cleanup stale certmgr entries on cert source transitions

Reviewed-by: Shweta Bhosale <Shweta.Bhosale1@ibm.com>

commit | commitdiff | tree

Ronen Friedman [Sat, 23 May 2026 07:50:42 +0000 (10:50 +0300)]

Merge pull request #68737 from ronen-fr/wip-rf-stqlength

crimson+classic/osd/scrub: limit scrubbing under snap-trimming overload

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>

commit | commitdiff | tree

Jan Radon [Fri, 15 May 2026 13:42:08 +0000 (15:42 +0200)]

feat(rgw/kafka): add mTLS client certificate authentication for Kafka notifications

Add support for mutual TLS (mTLS) client certificate authentication
when publishing bucket notifications to Kafka brokers. RGW can now
present a client certificate and private key to authenticate with
brokers that require ssl.client.auth=required.

Changes:
- Add ssl-certificate-location, ssl-key-location, and ssl-key-password
  topic attributes for configuring client certificates
- Validate that ssl_certificate and ssl_key are provided together
- Include ssl_key_password in connection identity (hash/equality)
- Add kafka-security.sh script for generating broker and client TLS certs
- Add mTLS test (test_notification_kafka_security_ssl_mtls) using
  use_mtls=True flag on the existing SSL security path
- Update RGW notifications documentation with mTLS parameters

Fixes: http://tracker.ceph.com/issues/67427
Signed-off-by: Jan Radon <jan.fabian.radon@sap.com>
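
The rule that the certificate and key must be provided together can be sketched as a small check. This is an illustration in Python, not the RGW implementation: the function name `mtls_enabled` and the file paths are hypothetical.

```python
from typing import Optional


def mtls_enabled(cert: Optional[str], key: Optional[str]) -> bool:
    """Return True if both files are given, False if neither is.

    Raises ValueError when only one of the two is configured.
    """
    if bool(cert) != bool(key):
        raise ValueError("ssl certificate and ssl key must be provided together")
    return bool(cert)


# Hypothetical paths, used only to exercise the check.
both = mtls_enabled("/etc/ceph/client.crt", "/etc/ceph/client.key")
neither = mtls_enabled(None, None)
try:
    mtls_enabled("/etc/ceph/client.crt", None)
    half_rejected = False
except ValueError:
    half_rejected = True
```

Rejecting a half-configured pair up front avoids a connection attempt that could only fail later during the TLS handshake.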

commit | commitdiff | tree

Redouane Kachach [Fri, 22 May 2026 19:28:33 +0000 (21:28 +0200)]

Merge pull request #67315 from timqn22/misreporting_count_osd_services

mgr/cephadm: verify spec service_id before applying

Reviewed-by: Kefu Chai <k.chai@proxmox.com>
Reviewed-by: Shweta Bhosale <Shweta.Bhosale1@ibm.com>
Reviewed-by: Adam King <adking@redhat.com>

commit | commitdiff | tree

Redouane Kachach [Fri, 22 May 2026 19:27:37 +0000 (21:27 +0200)]

Merge pull request #66477 from xelexin/fix_cephadm_agent_volume_gatherer

orch/cephadm: Fixes an unlimited env append in cephadm agent

Reviewed-by: Adam King <adking@redhat.com>

commit | commitdiff | tree

Redouane Kachach [Fri, 22 May 2026 19:26:44 +0000 (21:26 +0200)]

Merge pull request #68902 from timqn22/logrotate-list

src/cephadm: added ceph-exporter to post-rotate signal list

Reviewed-by: Redouane Kachach <rkachach@ibm.com>
Reviewed-by: Kefu Chai <k.chai@proxmox.com>

commit | commitdiff | tree

Redouane Kachach [Fri, 22 May 2026 19:25:50 +0000 (21:25 +0200)]

Merge pull request #68915 from kginonredhat/issue-76564-mgr-daemon-ports-list-grows-unbounded-across-redeploys

mgr daemon ports list grows unbounded across redeploys

Reviewed-by: Redouane Kachach <rkachach@ibm.com>

commit | commitdiff | tree

Redouane Kachach [Fri, 22 May 2026 19:24:54 +0000 (21:24 +0200)]

Merge pull request #68976 from kginonredhat/issue-76295-nfs-sample-enable-udp-false

cephadm: disable UDP in samples/nfs.json for test_cephadm Ganesha

Reviewed-by: Redouane Kachach <rkachach@ibm.com>
Reviewed-by: Shweta Bhosale <Shweta.Bhosale1@ibm.com>

commit | commitdiff | tree

Redouane Kachach [Mon, 9 Mar 2026 15:11:50 +0000 (16:11 +0100)]

mgr/cephadm: adding UT for the new functionality

Fixes: https://tracker.ceph.com/issues/75009
Signed-off-by: Redouane Kachach <rkachach@ibm.com>

commit | commitdiff | tree

Redouane Kachach [Mon, 23 Feb 2026 15:15:12 +0000 (16:15 +0100)]

mgr/cephadm: moving certificates reconciliation code to a new method

This way we ensure it's called every time there's a switch in the
certificate.

Signed-off-by: Redouane Kachach <rkachach@ibm.com>

commit | commitdiff | tree

Xuehan Xu [Thu, 21 May 2026 07:10:59 +0000 (15:10 +0800)]

crimson/os/seastore: drop RetiredExtentPlaceholder

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>

commit | commitdiff | tree

Xuehan Xu [Thu, 21 May 2026 06:50:42 +0000 (14:50 +0800)]

crimson/os/seastore/cache: remove retire_extent_addr

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>

commit | commitdiff | tree

Xuehan Xu [Wed, 20 May 2026 08:31:29 +0000 (16:31 +0800)]

crimson/os/seastore/cache: re-implement Cache::retire_absent_extent_addr

The new implementation retires an absent extent by constructing a real
empty extent and adding it to the transaction's retired_set, instead of
creating a retired placeholder.

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>

commit | commitdiff | tree

Ronen Friedman [Fri, 22 May 2026 05:49:42 +0000 (08:49 +0300)]

Merge pull request #68358 from ronen-fr/wip-rf-notazns

crimson/os/seastore: do not treat non-ZNS devices as errors

Reviewed-by: Matan Breizman <mbreizma@redhat.com>
Reviewed-by: Kefu Chai <k.chai@proxmox.com>

commit | commitdiff | tree

Ronen Friedman [Thu, 21 May 2026 19:36:06 +0000 (22:36 +0300)]

Merge pull request #68948 from ronen-fr/wip-rf-fix-trimsnap

crimson/osd: decouple snap trim initiation from scrub completion

Reviewed-by: Matan Breizman <mbreizma@redhat.com>

commit | commitdiff | tree

Ronen Friedman [Wed, 29 Apr 2026 04:55:02 +0000 (04:55 +0000)]

osd/scrub: limit scrubbing under snap-trimming overload

When the snap-trim queues are long, scrubbing is likely to
make things worse. This change adds a new scrubbing restriction
for that case, and prevents periodic scrubs from starting when
the total snap-trim queue length across all PGs exceeds a
configurable threshold.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
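
The restriction compares the summed snap-trim queue length against the threshold. A minimal Python sketch, assuming the per-PG queue lengths are already collected; `may_start_periodic_scrub` is a hypothetical name, and any special meaning of particular threshold values is not modelled.

```python
from typing import Iterable


def may_start_periodic_scrub(snap_trim_queue_lengths: Iterable[int], limit: int) -> bool:
    """Allow a periodic scrub only while the summed queue length is within the limit."""
    return sum(snap_trim_queue_lengths) <= limit


light_load = may_start_periodic_scrub([3, 0, 5], limit=10)  # total 8
overloaded = may_start_periodic_scrub([8, 4, 2], limit=10)  # total 14
```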

commit | commitdiff | tree

Ronen Friedman [Wed, 29 Apr 2026 04:14:23 +0000 (04:14 +0000)]

crimson/osd: collect total snap-trim queues length

Periodically collect the total snap-trim
queue length across all PGs. Expose it through
OSDService::get_snap_trim_queue_total().

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>

commit | commitdiff | tree

Ronen Friedman [Wed, 29 Apr 2026 03:45:34 +0000 (03:45 +0000)]

osd: collect total snap-trim queues length

Periodically collect the total snap-trim
queue length across all PGs. Expose it through
OSDService::get_snap_trim_queue_total().

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>

commit | commitdiff | tree

Casey Bodley [Thu, 21 May 2026 14:41:52 +0000 (10:41 -0400)]

Merge pull request #68873 from cbodley/wip-73475

librados/asio: clear cancellation slot in associated executor

Reviewed-by: Adam Emerson <aemerson@redhat.com>
Reviewed-by: Shilpa Jagannath <smanjara@redhat.com>

commit | commitdiff | tree

David Galloway [Thu, 21 May 2026 14:21:53 +0000 (10:21 -0400)]

Merge pull request #68985 from djgalloway/nfs-ganesha-selinux

Revert "Use GANESHA_REPO_BASEURL for NFS-Ganesha on all distros"

commit | commitdiff | tree

Pedro Gonzalez Gomez [Mon, 23 Mar 2026 11:02:29 +0000 (12:02 +0100)]

mgr/dashboard: add telemetry status to overview-health-card

Fixes: https://tracker.ceph.com/issues/75666
Signed-off-by: Pedro Gonzalez Gomez <pegonzal@ibm.com>

commit | commitdiff | tree

Maodi Ma [Wed, 5 Nov 2025 02:35:46 +0000 (02:35 +0000)]

common: enable AVX512+VPCLMULQDQ for crc32c performance on x86

- Add crc32_iscsi_by16_10 in src/isa-l into candidates for ceph_crc32c
- Add hardware capability check for AVX512 instr before register
- Add NASM feature check to ensure compatibility and to enable
AS_FEATURE_LEVEL in crc32_iscsi_by16_10.asm

Signed-off-by: Maodi Ma <mamaodi@hygon.cn>

commit | commitdiff | tree

Shubha Jain [Thu, 21 May 2026 08:51:08 +0000 (14:21 +0530)]

Revert "Merge pull request #67999 from Shubhaj1810/nfs-ganesha-servicemap-fix"

This reverts commit d44d4fd402a0c23ab98056368d12cb83afd7bb32, reversing
changes made to 0e05a6054c822e36dcdf7b25d8d031fc937ac278.

Signed-off-by: Shubha Jain <SHUBHA.JAIN1@ibm.com>

commit | commitdiff | tree

ShreeJejurikar [Wed, 13 May 2026 13:05:39 +0000 (18:35 +0530)]

rgw/logging: use assumed-role ARN as Requester for STS requests

When a request is made with STS temporary credentials, the bucket logging
Requester field was being set to the underlying user ID instead of the
assumed-role ARN. Per the AWS S3 server-access-log spec, the Requester
field should contain the assumed-role ARN (e.g.
arn:aws:sts::<account>:assumed-role/<role>/<session>) for STS-credentialed
requests.

Detect TYPE_ROLE identities via s->auth.identity->get_identity_type() and
use the ARN returned by Identity::get_caller_identity() (already
implemented by RoleApplier in the expected AWS format) instead of falling
straight through to s->user->get_id(). Existing behavior for account- and
user-scoped requests is unchanged.

Fixes: https://tracker.ceph.com/issues/71742
Signed-off-by: Shree Jejurikar <shree.jejurikar@gmail.com>
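
The selection logic can be pictured as a two-way choice. The sketch below is Python for illustration only: `RequestIdentity` and `requester` are hypothetical simplifications of the RGW identity types, and the account, role and session values are invented.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class RequestIdentity:
    """Hypothetical, simplified view of the authenticated identity."""
    user_id: str
    assumed_role_arn: Optional[str] = None  # set only for STS role sessions


def requester(identity: RequestIdentity) -> str:
    # Role sessions log the assumed-role ARN; everything else keeps the user ID.
    if identity.assumed_role_arn is not None:
        return identity.assumed_role_arn
    return identity.user_id


sts = RequestIdentity(
    user_id="alice",
    assumed_role_arn="arn:aws:sts::123456789012:assumed-role/logs-reader/session1",
)
plain = RequestIdentity(user_id="bob")
sts_requester = requester(sts)
plain_requester = requester(plain)
```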

commit | commitdiff | tree

bluikko [Thu, 21 May 2026 02:46:54 +0000 (09:46 +0700)]

Merge pull request #69013 from bluikko/wip-doc-rados-ops-pool-fix-label

doc/rados: move label to right place in pools.rst

commit | commitdiff | tree

bluikko [Thu, 21 May 2026 02:46:43 +0000 (09:46 +0700)]

Merge pull request #69014 from bluikko/wip-doc-man-cephadm-fix-markup

doc/man: fix broken markup in cephadm.rst

commit | commitdiff | tree

David Galloway [Wed, 20 May 2026 20:38:52 +0000 (16:38 -0400)]

Revert "Use GANESHA_REPO_BASEURL for NFS-Ganesha on all distros"

The ganesha spec file is calling in a system package that is in CentOS 10 Stream but not yet in Rocky/Alma/RHEL/whatever.

This reverts commit 1163bd6b01560bb435821d1ec14b69a5a4f3b0cc.

Fixes: https://tracker.ceph.com/issues/76681
Signed-off-by: David Galloway <david.galloway@ibm.com>

commit | commitdiff | tree

Patrick Donnelly [Wed, 20 May 2026 20:16:20 +0000 (16:16 -0400)]

Merge PR #68907 into main

* refs/pull/68907/head:
	qa: ignore pg stuck peering

Reviewed-by: Yuri Weinstein <yweins@redhat.com>
