]> git-server-git.apps.pok.os.sepia.ceph.com Git - ceph.git/log
ceph.git
7 days agocephadm: in cephadm shell mount /var/lib/ceph under /srv 68756/head
John Mulligan [Tue, 5 May 2026 17:47:44 +0000 (13:47 -0400)]
cephadm: in cephadm shell mount /var/lib/ceph under /srv

When running cephadm shell mount /var/lib/ceph at /srv/ceph unless
/var/lib/ceph is already being passed to cephadm shell -v option.
The default mount is read only. Passing it manually allows the user
to mount it in a custom location read/write.

The mount location at /srv/ceph is chosen because /var/lib/ceph is
already in use for compatibility with various ceph. The /srv tree
is currently unused by the container and serves a similar purpose
to /var/lib if you turn your head in squint a little.

Making this change enables the use of tools that want to read
files or connect to sockets in that file heirarchy. Specifically,
in this case, the ceph smb ctl tool.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
7 days agopython-common/ceph/smb: add frontend entry point to library
John Mulligan [Fri, 3 Apr 2026 19:13:47 +0000 (15:13 -0400)]
python-common/ceph/smb: add frontend entry point to library

Add a __main__.py file with a frontend for interacting with the
remote-control (grpc) feature for SMB. This can be invoked
on the command line using `python -m ceph.smb.ctl` assuming that
the ceph module is importable.

This command line makes it easier to interact with the remote-control
server without knowing a lot about how it is implemented.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
7 days agopython-common/ceph/smb: add client.py for remote-control grpc client
John Mulligan [Wed, 1 Apr 2026 22:22:53 +0000 (18:22 -0400)]
python-common/ceph/smb: add client.py for remote-control grpc client

Add a new client.py that contains the main library for acting as a
client of the remote-control grpc service for SMB. This is based on grpc
reflection rather than rigidly following an api generated from protobuf.
As this system is rapidly evolving this avoids having to keep generated
files in sync and more closely matches the grpcurl tool people are
already using with this feature.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
8 days agopython-common/ceph/smb: add file with basic config classes
John Mulligan [Wed, 1 Apr 2026 22:22:54 +0000 (18:22 -0400)]
python-common/ceph/smb: add file with basic config classes

Add a file with the basic configuration classes and functions needed to
set up a connection with the remote-control grpc server for SMB.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
8 days agopython-common/ceph/smb: add a shim file for configuring grpc imports
John Mulligan [Sat, 4 Apr 2026 20:34:32 +0000 (16:34 -0400)]
python-common/ceph/smb: add a shim file for configuring grpc imports

Add a private shim file that can be used by ceph.smb.ctl to choose
the grpc implementation in order to work around issues present in some
grpc versions. These are unfortunately the versions packaged for EL
distros.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
8 days agopython-common/ceph/smb: add typing shim file
John Mulligan [Wed, 1 Apr 2026 22:22:51 +0000 (18:22 -0400)]
python-common/ceph/smb: add typing shim file

Avoid having to put this in a bunch of files in our new ceph.smb.ctl files.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
8 days agopython-common: add smb module to black formatting in tox.ini
John Mulligan [Thu, 2 Apr 2026 20:52:19 +0000 (16:52 -0400)]
python-common: add smb module to black formatting in tox.ini

Signed-off-by: John Mulligan <jmulligan@redhat.com>
8 days agopython-common: work around some wonky grpc type checking behavior
John Mulligan [Fri, 3 Apr 2026 15:37:30 +0000 (11:37 -0400)]
python-common: work around some wonky grpc type checking behavior

Enable mypy type checking for gprc related packages that will be
consumed within the ceph.smb.ctl package. The google[.protobuf] package
causes problems and gets an exception instead of a working stubs
dependency - see the extensive comment in the file for more details.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
8 days agopython-common/ceph/smb: create a new ctl submodule
John Mulligan [Wed, 1 Apr 2026 22:22:48 +0000 (18:22 -0400)]
python-common/ceph/smb: create a new ctl submodule

This new submodule of smb will be used for interacting with the (gRPC
based) remote control api provided by smb services.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
8 days agopython-common: copy mgr isort tox config into this tox.ini
John Mulligan [Fri, 3 Apr 2026 15:23:23 +0000 (11:23 -0400)]
python-common: copy mgr isort tox config into this tox.ini

For consistency, add tox env for running isort in python common.
Basically copies the tox.ini config.
Like pybind/mgr this is 'opt in' so we only initially have the smb
module listed here.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
8 days agopython-common/ceph/smb: fix formatting in preparation for isort
John Mulligan [Fri, 3 Apr 2026 15:40:38 +0000 (11:40 -0400)]
python-common/ceph/smb: fix formatting in preparation for isort

Make check-isort happier.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
8 days agoMerge pull request #69126 from gbregman/main
Gil Bregman [Wed, 27 May 2026 18:22:28 +0000 (21:22 +0300)]
Merge pull request #69126 from gbregman/main

mgr/dashboard: Add EC pools support to NVMEoF CLI

8 days agoMerge pull request #68972 from bluikko/wip-common-options-rgw-separator-note
Casey Bodley [Wed, 27 May 2026 17:23:56 +0000 (13:23 -0400)]
Merge pull request #68972 from bluikko/wip-common-options-rgw-separator-note

common/options: improve rgw_dns_name and clarify separator

Reviewed-by: Anthony D Atri <anthony.datri@gmail.com>
Reviewed-by: Casey Bodley <cbodley@redhat.com>
8 days agoMerge pull request #68964 from fultheim/fix-cleaner-gc-autotune
Matan Breizman [Wed, 27 May 2026 16:54:10 +0000 (19:54 +0300)]
Merge pull request #68964 from fultheim/fix-cleaner-gc-autotune

  crimson/os/seastore: auto-tune cleaner gc segment pick under random-write

Reviewed-by: Matan Breizman <mbreizma@redhat.com>
8 days agoMerge pull request #69125 from NitzanMordhai/wip-nitzan-perf-count-high-cpu
NitzanMordhai [Wed, 27 May 2026 15:44:48 +0000 (18:44 +0300)]
Merge pull request #69125 from NitzanMordhai/wip-nitzan-perf-count-high-cpu

mgr/ThreadMonitor: monitor interval running in seconds and not nanose…

Reviewed-by: Sridhar Seshasayee <sridhar.seshasayee@ibm.com>
8 days agoMerge pull request #68727 from aainscow/wip_75962
Laura Flores [Wed, 27 May 2026 15:26:43 +0000 (10:26 -0500)]
Merge pull request #68727 from aainscow/wip_75962

osd: Correct missing list on divergent merge of partial writes

Reviewed-by: Bill Scales <bill_scales@uk.ibm.com>
8 days agoMerge pull request #69122 from tchaikov/wip-crimson-silence-unused-warning
Matan Breizman [Wed, 27 May 2026 14:08:45 +0000 (17:08 +0300)]
Merge pull request #69122 from tchaikov/wip-crimson-silence-unused-warning

crimson/seastore: segment_manager: fix -Wunused warnings

Reviewed-by: Matan Breizman <mbreizma@redhat.com>
8 days agoMerge pull request #69019 from tchaikov/wip-crimson-wake-in-loaded
Matan Breizman [Wed, 27 May 2026 13:23:16 +0000 (16:23 +0300)]
Merge pull request #69019 from tchaikov/wip-crimson-wake-in-loaded

crimson/osd: wake pgs_creating waiters in PGMap::pg_loaded()

Reviewed-by: Matan Breizman <mbreizma@redhat.com>
8 days agomgr/dashboard: Add EC pools support to NVMEoF CLI 69126/head
Gil Bregman [Wed, 27 May 2026 13:14:36 +0000 (16:14 +0300)]
mgr/dashboard: Add EC pools support to NVMEoF CLI
Fixes: https://tracker.ceph.com/issues/76937
Signed-off-by: Gil Bregman <gbregman@il.ibm.com>
8 days agoMerge pull request #64293 from cbodley/wip-71265
Casey Bodley [Wed, 27 May 2026 13:08:45 +0000 (09:08 -0400)]
Merge pull request #64293 from cbodley/wip-71265

rgw: add s3control apis for account-wide PublicAccessBlock

Reviewed-by: Adam Emerson <aemerson@redhat.com>
9 days agomgr/ThreadMonitor: monitor interval running in seconds and not nanoseconds 69125/head
Nitzan Mordechai [Wed, 27 May 2026 11:48:14 +0000 (11:48 +0000)]
mgr/ThreadMonitor: monitor interval running in seconds and not nanoseconds

The ctor accidently use the mgr_module_monitor_interval as nanoseconds
we need to use it as seconds.
Also, prevent high cpu loop in case read_process_statm failed during
while loop

Fixes: https://tracker.ceph.com/issues/76938
Signed-off-by: Nitzan Mordechai <nmordec@ibm.com>
9 days agocrimson/seastore: segment_manager: fix -Wunused warnings 69122/head
Kefu Chai [Wed, 27 May 2026 08:52:31 +0000 (16:52 +0800)]
crimson/seastore: segment_manager: fix -Wunused warnings

7f739adae2 dropped the last log call from get_segment_manager(), after
which `LOG_PREFIX(SegmentManager::get_segment_manager)` and
`SET_SUBSYS(seastore_device)` had no remaining users under `HAVE_ZNS`,
generating:

```
src/crimson/os/seastore/segment_manager.cc:38:3: warning: unused variable 'FNAME' [-Wunused-variable]
   38 |   LOG_PREFIX(SegmentManager::get_segment_manager);
      |   ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
src/crimson/common/log.h:49:38: note: expanded from macro 'LOG_PREFIX'
   49 | #define LOG_PREFIX(x) constexpr auto FNAME = #x
      |                                      ^~~~~
src/crimson/os/seastore/segment_manager.cc:10:1: warning: unused variable 'SOURCE_SUBSYS' [-Wunused-const-variable]
   10 | SET_SUBSYS(seastore_device);
      | ^~~~~~~~~~~~~~~~~~~~~~~~~~~
src/crimson/common/log.h:46:52: note: expanded from macro 'SET_SUBSYS'
   46 | #define SET_SUBSYS(subname_) static constexpr auto SOURCE_SUBSYS = ceph_subsys_##subname_
      |                                                    ^~~~~~~~~~~~~
2 warnings generated.
```

drop both to silence them.

Signed-off-by: Kefu Chai <k.chai@proxmox.com>
9 days agoMerge pull request #69116 from rhcs-dashboard/fix-cephadm-e2e-quoting
Nizamudeen A [Wed, 27 May 2026 05:52:17 +0000 (11:22 +0530)]
Merge pull request #69116 from rhcs-dashboard/fix-cephadm-e2e-quoting

mgr/dashboard: fix nested shell quoting in cephadm e2e start-cluster

Reviewed-by: Nizamudeen A <nia@redhat.com>
9 days agomgr/dashboard: fix nested shell quoting in cephadm e2e start-cluster 69116/head
Afreen Misbah [Wed, 27 May 2026 00:07:38 +0000 (05:37 +0530)]
mgr/dashboard: fix nested shell quoting in cephadm e2e start-cluster

with_libvirt wraps commands in sg libvirt -c "$1", adding an extra
shell layer. Nested double quotes inside the outer double-quoted
string caused the argument to be split — with_libvirt received a
truncated $1, producing "Unterminated quoted string" on the remote
shell.

Drop the unnecessary inner double quotes around cephadm shell
arguments since cephadm shell accepts the command as separate args.
Use single quotes for the grep pattern inside the double-quoted
string so it survives the sg subshell.

Signed-off-by: Afreen Misbah <afreen@ibm.com>
9 days agoMerge pull request #69068 from tchaikov/wip-bump-arrow-submodule
Kefu Chai [Wed, 27 May 2026 00:05:25 +0000 (08:05 +0800)]
Merge pull request #69068 from tchaikov/wip-bump-arrow-submodule

rgw: bump Apache Arrow submodule from 17.0.0 to 19.0.1

Reviewed-by: Justin Caratzas <jcaratza@ibm.com>
Reviewed-by: Casey Bodley <cbodley@redhat.com>
9 days agoMerge pull request #67551 from Ericmzhang/wip-improve-pg-autoscale
Kamoltat (Junior) Sirivadhna [Tue, 26 May 2026 22:04:43 +0000 (18:04 -0400)]
Merge pull request #67551 from Ericmzhang/wip-improve-pg-autoscale

mgr: Fix autoscaling PG distribution
Reviewed-by: Kamoltat (Junior) Sirivadhna <ksirivad@redhat.com>
9 days agoMerge pull request #67337 from badone/wip-tracker-74919-ceph-dump-log-new-global...
Brad Hubbard [Tue, 26 May 2026 21:59:37 +0000 (07:59 +1000)]
Merge pull request #67337 from badone/wip-tracker-74919-ceph-dump-log-new-global-access

scripts: ceph_dump_log.py change global context access

Reviewed-by: Aishwarya Mathuria <amathuri@redhat.com>
9 days agoMerge pull request #67857 from yaelazulay-redhat/issues_74393_dashboard_fail_to_acces...
Afreen Misbah [Tue, 26 May 2026 21:35:55 +0000 (03:05 +0530)]
Merge pull request #67857 from yaelazulay-redhat/issues_74393_dashboard_fail_to_access_object_when_rgw_use_cephadm_certificate

Issues 74393 dashboard fail to access object when rgw use cephadm certificate

Reviewed-by: Afreen Misbah <afreen@ibm.com>
Reviewed-by: Redouane Kachach <rkachach@ibm.com>
9 days agoMerge pull request #68874 from BBoozmen/wip-oozmen-76563
Adam Emerson [Tue, 26 May 2026 18:49:10 +0000 (14:49 -0400)]
Merge pull request #68874 from BBoozmen/wip-oozmen-76563

neorados/cls/log: fix infinite trim loop on empty data log shards

Reviewed-by: Adam C. Emerson <aemerson@redhat.com>
9 days agoMerge pull request #67079 from MattyWilliams22/ec-sync-reads
Radoslaw Zarzynski [Tue, 26 May 2026 16:31:12 +0000 (18:31 +0200)]
Merge pull request #67079 from MattyWilliams22/ec-sync-reads

osd: Support for Synchronous Reads in EC

Reviewed-by: Alex Ainscow <aainscow@uk.ibm.com>
Reviewed-by: Bill Scales <bill_scales@uk.ibm.com>
Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
9 days agorgw/s3control: skip account id check for admin users 64293/head
Casey Bodley [Tue, 26 May 2026 16:03:48 +0000 (12:03 -0400)]
rgw/s3control: skip account id check for admin users

allow access to admin users that don't belong to the requested account.
this is also necessary for multisite, where requests are forwarded to
the metadata master as the multisite system user instead of the original
requester

Signed-off-by: Casey Bodley <cbodley@redhat.com>
10 days agoMerge pull request #67950 from rhcs-dashboard/add-telemetry-status
Pedro Gonzalez Gomez [Tue, 26 May 2026 10:16:32 +0000 (12:16 +0200)]
Merge pull request #67950 from rhcs-dashboard/add-telemetry-status

mgr/dashboard: add telemetry status to overview-health-card

Reviewed-by: Abhishek Desai <abhishek.desai1@ibm.com>
10 days agoMerge pull request #68258 from tchaikov/wip-with-system-jerasure
Kefu Chai [Tue, 26 May 2026 09:53:02 +0000 (17:53 +0800)]
Merge pull request #68258 from tchaikov/wip-with-system-jerasure

cmake: support building with system jerasure and gf-complete

Reviewed-by: Casey Bodley <cbodley@redhat.com>
10 days agoMerge pull request #61131 from NitzanMordhai/wip-nitzan-mgr-modules-perf-counts
NitzanMordhai [Tue, 26 May 2026 09:31:45 +0000 (12:31 +0300)]
Merge pull request #61131 from NitzanMordhai/wip-nitzan-mgr-modules-perf-counts

mgr: Add per-module performance counters to mgr

Reviewed-by: Sridhar Seshasayee sridhar.seshasayee@ibm.com
10 days agoMerge pull request #68858 from rsacherer/wip-fix-limit-break-existing-devices
Guillaume Abrioux [Tue, 26 May 2026 09:19:03 +0000 (11:19 +0200)]
Merge pull request #68858 from rsacherer/wip-fix-limit-break-existing-devices

ceph-volume: fix re-deployment of OSD issues with disk selection filters and DB Devices

10 days agoMerge pull request #67935 from rhcs-dashboard/add-csv
Pedro Gonzalez Gomez [Tue, 26 May 2026 09:06:14 +0000 (11:06 +0200)]
Merge pull request #67935 from rhcs-dashboard/add-csv

mgr/dashboard: Add Hosts via CSV Upload

Reviewed-by: Devika Babrekar <devika.babrekar@ibm.com>
Reviewed-by: Puja Shahu <pshahu@redhat.com>
Reviewed-by: Pedro Gonzalez Gomez <pegonzal@ibm.com>
10 days agoMerge pull request #68894 from guits/cv-dm-mgmt
Guillaume Abrioux [Tue, 26 May 2026 09:04:53 +0000 (11:04 +0200)]
Merge pull request #68894 from guits/cv-dm-mgmt

ceph-volume: OSD mapper lifecycle (LVM + raw) for activate

10 days agoMerge pull request #69064 from tchaikov/wip-crimson-scrub-blocked
Matan Breizman [Tue, 26 May 2026 08:02:54 +0000 (11:02 +0300)]
Merge pull request #69064 from tchaikov/wip-crimson-scrub-blocked

crimson/scrub: fix assert in PGScrubber::release_range() on interval change

Reviewed-by: Matan Breizman <mbreizma@redhat.com>
10 days agoMerge pull request #69020 from tchaikov/wip-level-triggered-unblock
Matan Breizman [Tue, 26 May 2026 08:01:14 +0000 (11:01 +0300)]
Merge pull request #69020 from tchaikov/wip-level-triggered-unblock

crimson/osd: only unblock wait_for_active_blocker on replica when ACTIVE

Reviewed-by: Matan Breizman <mbreizma@redhat.com>
10 days agoMerge pull request #69018 from tchaikov/wip-large-object-size
Matan Breizman [Tue, 26 May 2026 08:00:37 +0000 (11:00 +0300)]
Merge pull request #69018 from tchaikov/wip-large-object-size

crimson/seastore: reject oversized writes and zeros instead of aborting

Reviewed-by: Matan Breizman <mbreizma@redhat.com>
10 days agoMerge pull request #59476 from zhscn/wip-new-128
Xuehan Xu [Tue, 26 May 2026 05:40:09 +0000 (13:40 +0800)]
Merge pull request #59476 from zhscn/wip-new-128

crimson/os/seastore: introduce static layout of laddr_t

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
Reviewed-by: Matan Breizman <mbreizma@redhat.com>
Reviewed-by: Anthony D Atri <anthony.datri@gmail.com>
10 days agoMerge pull request #68887 from ShreeJejurikar/wip-bucket-logging-requester-assumed...
Yuval Lifshitz [Tue, 26 May 2026 04:58:58 +0000 (07:58 +0300)]
Merge pull request #68887 from ShreeJejurikar/wip-bucket-logging-requester-assumed-role

rgw/logging: use assumed-role ARN as Requester for STS requests

10 days agoMerge pull request #69067 from xxhdx1985126/wip-seastore-lba-wrong-asserts
Xuehan Xu [Tue, 26 May 2026 02:54:07 +0000 (10:54 +0800)]
Merge pull request #69067 from xxhdx1985126/wip-seastore-lba-wrong-asserts

crimson/os/seastore/lba: fix wrong asserts and "if" conditions

Reviewed-by: Matan Breizman <mbreizma@redhat.com>
10 days agoMerge pull request #69082 from ronen-fr/wip-rf-trimlmt-rst
Ronen Friedman [Mon, 25 May 2026 18:27:42 +0000 (21:27 +0300)]
Merge pull request #69082 from ronen-fr/wip-rf-trimlmt-rst

doc/PendingReleaseNotes: document osd_scrub_queued_snaptrims_limit

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
10 days agodoc/PendingReleaseNotes: document osd_scrub_queued_snaptrims_limit 69082/head
Ronen Friedman [Mon, 25 May 2026 13:13:03 +0000 (13:13 +0000)]
doc/PendingReleaseNotes: document osd_scrub_queued_snaptrims_limit

osd_scrub_queued_snaptrims_limit, introduced in PR#68737,
blocks the initiation of non-urgent scrubs on OSDs that
are overloaded with snap-trim operations.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
10 days agocrimson/os/seastore: auto-tune cleaner gc segment pick under random-write 68964/head
Shai Fultheim [Sun, 17 May 2026 09:40:44 +0000 (12:40 +0300)]
crimson/os/seastore: auto-tune cleaner gc segment pick under random-write

SegmentCleaner uses one of three configurable gc formulas to select
the next segment to reclaim: GREEDY (lowest util wins), COST_BENEFIT
((1-u)*age/(2u)), or BENEFIT (an age-weighted quadratic in util).
COST_BENEFIT is the default and the right choice for journaling /
LIFO workloads, where old segments accumulate more dead bytes than
young ones — age predicts deadness, so an old high-util segment is
worth reclaiming because its util will keep rising as long as we wait.

That assumption breaks under random-write at high cluster fill. Dead
bytes spread uniformly across segments regardless of age, so age stops
predicting future deadness, and (1-u)/(2u) becomes the only term that
distinguishes candidates. With every segment in the 0.7-0.94 util
band, (1-u)/(2u) ranges from 0.227 to 0.032 — a 7x spread the formula
can easily lose to a 7x age difference. Result: a 0.94-util old
segment scores higher than a 0.68-util young one, even though
reclaiming the 0.68 segment would free 5x more space (32% of a
64 MB segment vs 6%).

Observed in qa/standalone/crimson randwrite at ~70% full: with the
unmodified formula, cleaner picks settled on 0.92-0.94 util segments
freeing ~4 MB net each; net free rate collapsed to single-digit KB/s
even though the cleaner was running cycles at ~30 µs each. fio's
stall watchdog killed the bench after 535 GB user written (target
1280 GB). Switching gc_formula = greedy by hand let the bench
complete the target.

This patch detects the mis-selection at runtime and overrides the
formula's pick with the greedy choice only when the difference is
significant. In get_next_reclaim_segment() we already iterate all
closed reclaimable segments to find the formula's max-score candidate;
in the same pass we now also track the lowest-util candidate (what
GREEDY would have picked). After the loop, if greedy's free-fraction
(1 - greedy_util) is at least seastore_segment_cleaner_gc_autotune_ratio
times the formula's pick's free-fraction (default 2.0), we swap to
greedy. Since all segments share the same size, comparing free-
fractions is equivalent to comparing freed bytes; the fraction form
avoids an unnecessary multiplication.

The full design rationale (regime-by-regime behaviour, safety guard
against picked_free near zero, score-recompute on override, threshold
calibration) lives in doc/dev/crimson/seastore.rst under the new
"Cleaner GC autotune" section. The code references it from short
inline comments.

Configurable knobs:

  * seastore_segment_cleaner_gc_autotune (bool, default true) —
    operators can disable the override entirely to honor the
    configured formula unconditionally. Ignored when gc_formula =
    greedy.

  * seastore_segment_cleaner_gc_autotune_ratio (float, default 2.0,
    min 1.0) — operators can tune the override threshold. Higher is
    more conservative (preserves age weighting more aggressively);
    lower is more aggressive (behaviour converges toward pure greedy).

The override predicate is factored into a static helper
`SegmentCleaner::should_override_to_greedy(picked_free, greedy_free,
ratio)` so the call site stays readable and the predicate is
independently testable.

With this change the qa/standalone/crimson randwrite bench at 70%
fill completes the target run rather than stalling at the 500-600 GB
mark, with the override firing reliably under high uniform alive_
ratio and not firing under low or non-uniform alive_ratio. Override
behaviour can be observed with debug_seastore_cleaner=20.

Signed-off-by: Shai Fultheim <shai.fultheim@gmail.com>
10 days agoqa/rgw/bucket-logging: configure STS for assume-role test 68887/head
ShreeJejurikar [Wed, 20 May 2026 07:18:03 +0000 (12:48 +0530)]
qa/rgw/bucket-logging: configure STS for assume-role test

Set rgw sts key and enable rgw s3 auth use sts, both needed by
test_bucket_logging_requester_assumed_role. Mirrors the existing
settings in qa/suites/rgw/verify/overrides.yaml.

Signed-off-by: ShreeJejurikar <shreemj8@gmail.com>
11 days agoMerge pull request #69006 from tchaikov/wip-seastore-clamp-block-size-on-small-lba
Matan Breizman [Mon, 25 May 2026 10:39:18 +0000 (13:39 +0300)]
Merge pull request #69006 from tchaikov/wip-seastore-clamp-block-size-on-small-lba

crimson/seastore: clamp block_size to laddr_t::UNIT_SIZE on small-LBA devices

Reviewed-by: Matan Breizman <mbreizma@redhat.com>
11 days agoMerge pull request #68961 from fultheim/fix-cleaner-stall-projected-ratio
Matan Breizman [Mon, 25 May 2026 10:24:59 +0000 (13:24 +0300)]
Merge pull request #68961 from fultheim/fix-cleaner-stall-projected-ratio

crimson/os/seastore: fix cleaner stall under IO-block pressure

Reviewed-by: Matan Breizman <mbreizma@redhat.com>
11 days agoMerge pull request #68884 from tchaikov/wip-crimson-advance-osdmap
Matan Breizman [Mon, 25 May 2026 09:27:01 +0000 (12:27 +0300)]
Merge pull request #68884 from tchaikov/wip-crimson-advance-osdmap

crimson/osd: fix mark-down crash for removed OSDs

Reviewed-by: Matan Breizman <mbreizma@redhat.com>
11 days agoMerge pull request #68861 from tchaikov/wip-crimson-reset-logger
Matan Breizman [Mon, 25 May 2026 09:26:06 +0000 (12:26 +0300)]
Merge pull request #68861 from tchaikov/wip-crimson-reset-logger

crimson/osd: inline log file stream setup to fix dangling pointer

Reviewed-by: Matan Breizman <mbreizma@redhat.com>
11 days agoMerge pull request #69042 from Shubhaj1810/revert-67999
Redouane Kachach [Mon, 25 May 2026 09:14:57 +0000 (11:14 +0200)]
Merge pull request #69042 from Shubhaj1810/revert-67999

Revert "mgr/cephadm: align nodeid and add register_service for NFS Ganesha service visibility"

Reviewed-by: Shweta Bhosale <Shweta.Bhosale1@ibm.com>
11 days agocrimson/os/seastore: also update the mappings copied by client 59476/head
Xuehan Xu [Fri, 15 May 2026 09:10:04 +0000 (17:10 +0800)]
crimson/os/seastore: also update the mappings copied by client
transactions when committing background rewriting transactions

With the 128-bit laddr key layout in place, SeaStore::rename would
involve copying mappings. These mappings must also be updated when
the logical extents they point to are rewritten.

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
11 days agocrimson/os/seastore/omap_manager/log: better output
Xuehan Xu [Tue, 28 Apr 2026 07:00:24 +0000 (15:00 +0800)]
crimson/os/seastore/omap_manager/log: better output

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
11 days agodoc/dev/crimson/seastore_laddr.rst: add descriptions about temp
Xuehan Xu [Sun, 10 May 2026 07:36:22 +0000 (15:36 +0800)]
doc/dev/crimson/seastore_laddr.rst: add descriptions about temp
recovering objects

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
11 days agocrimson/osd: treat OI-not-existing cases as enoent
Xuehan Xu [Thu, 16 Apr 2026 05:47:18 +0000 (13:47 +0800)]
crimson/osd: treat OI-not-existing cases as enoent

This is consistent with classic osds

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
11 days agocrimson/os/seastore/object_data_handler: new debug logs
Xuehan Xu [Tue, 14 Apr 2026 06:00:52 +0000 (14:00 +0800)]
crimson/os/seastore/object_data_handler: new debug logs

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
11 days agocrimson/osd: create temp recovering objects through touch_temp
Xuehan Xu [Wed, 22 Apr 2026 05:37:46 +0000 (13:37 +0800)]
crimson/osd: create temp recovering objects through touch_temp

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
11 days agocrimson/os/seastore: handle OP_TOUCH_TEMP
Xuehan Xu [Sun, 29 Mar 2026 03:20:52 +0000 (11:20 +0800)]
crimson/os/seastore: handle OP_TOUCH_TEMP

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
11 days agoos/Transaction: add the interface dedicated to touching temp objects
Xuehan Xu [Thu, 26 Mar 2026 08:08:41 +0000 (16:08 +0800)]
os/Transaction: add the interface dedicated to touching temp objects

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
11 days agocrimson/os/seastore/lba: fix possible namespace lookup error
Xuehan Xu [Tue, 3 Feb 2026 03:11:33 +0000 (11:11 +0800)]
crimson/os/seastore/lba: fix possible namespace lookup error

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
11 days agodev/doc/crimson: clarify dynamic PG and object bits for static laddr design
Zhang Song [Thu, 8 Jan 2026 04:14:20 +0000 (12:14 +0800)]
dev/doc/crimson: clarify dynamic PG and object bits for static laddr design

Signed-off-by: Zhang Song <zhangsong02@qianxin.com>
Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
11 days agocrimson/os/seastore: adapt copy on write for static onode prefix
Zhang Song [Wed, 3 Sep 2025 07:54:40 +0000 (15:54 +0800)]
crimson/os/seastore: adapt copy on write for static onode prefix

Signed-off-by: Zhang Song <zhangsong02@qianxin.com>
Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
11 days agocrimson/os/seastore: support rename for static layout of laddr
Zhang Song [Tue, 26 Aug 2025 03:42:49 +0000 (11:42 +0800)]
crimson/os/seastore: support rename for static layout of laddr

Signed-off-by: Zhang Song <zhangsong02@qianxin.com>
Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
11 days agocrimson/os/seastore: add "move_mapping" to TransactionManager and LBAManager
Xuehan Xu [Tue, 26 Aug 2025 06:28:55 +0000 (14:28 +0800)]
crimson/os/seastore: add "move_mapping" to TransactionManager and LBAManager

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
11 days agocrimson/os/seastore/lba: set extent type for ZERO lba mappings
Xuehan Xu [Mon, 2 Feb 2026 07:42:49 +0000 (15:42 +0800)]
crimson/os/seastore/lba: set extent type for ZERO lba mappings

Signed-off-by: Zhang Song <zhangsong02@qianxin.com>
Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
11 days agomgr/dashboard: Add Hosts via CSV Upload 67935/head
Sagar Gopale [Mon, 23 Mar 2026 06:08:44 +0000 (11:38 +0530)]
mgr/dashboard: Add Hosts via CSV Upload

Fixes: https://tracker.ceph.com/issues/75578
Signed-off-by: Sagar Gopale <sagar.gopale@ibm.com>
11 days agoMerge pull request #68667 from rhcs-dashboard/fix-76316-main
Redouane Kachach [Mon, 25 May 2026 08:32:23 +0000 (10:32 +0200)]
Merge pull request #68667 from rhcs-dashboard/fix-76316-main

mgr/dashboard: add remote write section to prometheus configuration

Reviewed-by: Redouane Kachach <rkachach@ibm.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>
11 days agoworkunits/mgr/test_mgr_modules_perf_counters: new test for enable\disable\perf counts 61131/head
Nitzan Mordechai [Tue, 17 Dec 2024 13:49:00 +0000 (13:49 +0000)]
workunits/mgr/test_mgr_modules_perf_counters: new test for enable\disable\perf counts

Simple test to enable \ disable and get counters dump
for checking perf counters.

Signed-off-by: Nitzan Mordechai <nmordech@redhat.com>
11 days agomgr: Add per-module performance counters to mgr
Nitzan Mordechai [Sun, 8 Dec 2024 18:08:39 +0000 (18:08 +0000)]
mgr: Add per-module performance counters to mgr

This commit introduces performance counters for individual Ceph mgr modules.
These counters allow monitoring module behavior, debugging latency issues,
and identifying performance bottlenecks, all without modifying the modules themselves.

The following counters are now exposed under:
  > ceph daemon mgr.<id> perf dump

Example structure:
"mgr_module_<module_name>": {
    "notify_avg_usec": {     <- Average time spent handling notify events
        "avgcount": 0,
        "sum": 0
    },
    "cmd_avg_usec": {        <- Average time spent processing CLI/admin commands
        "avgcount": 0,
        "sum": 0
    },
    "serve_avg_usec": {      <- Average time spent in module serve loop (if applicable)
        "avgcount": 0,
        "sum": 0
    },
    "alive": 1               <- Module is alive (1 = running, 0 = exited)
    "cpu_usage": 0,          <- CPU usage in percent
    "mem_rss_change": 0,     <- Memory RSS change in bytes
    "mem_rss_current": 490737664 <- Memory RSS current in bytes

}

Signed-off-by: Nitzan Mordechai <nmordech@ibm.com>
Conflicts:
  src/mgr/ActivePyModules.cc - finisher.queue changed by 63859, adding py_module to the parameter list
  src/mgr/PyModuleRegistry.cc - check_all_modules_started added by 63859

11 days agoceph-volume: OSD mapper lifecycle (LVM + raw) for activate 68894/head
Guillaume Abrioux [Wed, 13 May 2026 12:57:03 +0000 (14:57 +0200)]
ceph-volume: OSD mapper lifecycle (LVM + raw) for activate

This adds small helpers so activate can consistently bring the OSD device
stack online (LVM lvchange, optional mapper open) and tear it down again,
with refresh in between. Same idea for the raw path. Crypto is handled
inside that flow when the OSD is encrypted.

Fixes: https://tracker.ceph.com/issues/76591
Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>
11 days agoMerge pull request #68771 from jrse/rgw-kafka-mtls-rebased
Yuval Lifshitz [Sun, 24 May 2026 19:29:38 +0000 (22:29 +0300)]
Merge pull request #68771 from jrse/rgw-kafka-mtls-rebased

rgw/kafka: add mTLS support (extends #61572)

12 days agorgw: bump Apache Arrow submodule from 17.0.0 to 19.0.1 69068/head
Kefu Chai [Sun, 24 May 2026 08:25:46 +0000 (16:25 +0800)]
rgw: bump Apache Arrow submodule from 17.0.0 to 19.0.1

When WITH_SYSTEM_ARROW is false, Ceph builds Arrow from the bundled
src/apache submodule. Our CI uses ubuntu:jammy as the base image, which
does not package libarrow-dev, so the bundled path is always taken there.

Arrow 17.0.0 vendors a copy of Thrift whose download URLs are no longer
reachable, breaking CI builds that try to fetch them at configure time.

Bump arrow submodule to 19.0.1, the latest Arrow release that:
- builds successfully on ubuntu:jammy, and
- requires only CMake 3.22 (the version shipped by ubuntu:jammy)

See also

CMake version shipped by ubuntu:jammy
- https://packages.ubuntu.com/jammy/cmake

arrow releases' CMake support
- maint-19.0.1: https://github.com/apache/arrow/blob/272715f6df2a042d69881ffa03d5078c58e4b345/cpp/CMakeLists.txt#L18
- maint-20.0.0: https://github.com/apache/arrow/blob/3ad0370a04ccdae638755b94c3c31c8760a11193/cpp/CMakeLists.txt#L18

arrow enabled minmalloc by default
-
https://github.com/apache/arrow/commit/b907c5dadb516b525c8fafbf34b0116d44044733

Because arrow uses the bundled mialloc library be default, we need
to disable it in the same commit bumping up the submodule.

Signed-off-by: Kefu Chai <k.chai@proxmox.com>
12 days agoMerge pull request #66150 from MaodiMa/AVX512_crc32c
Kefu Chai [Sun, 24 May 2026 09:55:45 +0000 (17:55 +0800)]
Merge pull request #66150 from MaodiMa/AVX512_crc32c

common: enable AVX512+VPCLMULQDQ for crc32c performance on x86

Reviewed-by: Kefu Chai <k.chai@proxmox.com>
12 days agocrimson/os/seastore/lba: fix wrong asserts and "if" conditions 69067/head
Xuehan Xu [Sat, 23 May 2026 09:23:02 +0000 (17:23 +0800)]
crimson/os/seastore/lba: fix wrong asserts and "if" conditions

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
12 days agocrimson/os/seastore/OMapManager: only store the relative block offset to omap root...
Zhang Song [Fri, 30 May 2025 09:45:39 +0000 (17:45 +0800)]
crimson/os/seastore/OMapManager: only store the relative block offset to omap root in OMapInnerNode

Signed-off-by: Zhang Song <zhangsong02@qianxin.com>
Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
12 days agotest/crimson/seastore/test_btree_lba_manager: add test cases for conflict policy
Zhang Song [Tue, 27 May 2025 07:31:13 +0000 (15:31 +0800)]
test/crimson/seastore/test_btree_lba_manager: add test cases for conflict policy

Signed-off-by: Zhang Song <zhangsong02@qianxin.com>
Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
12 days agocrimson/os/seastore/lba_manager: implement conflict policy
Zhang Song [Tue, 26 Aug 2025 03:38:49 +0000 (11:38 +0800)]
crimson/os/seastore/lba_manager: implement conflict policy

Signed-off-by: Zhang Song <zhangsong02@qianxin.com>
Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
12 days agocrimson/os/seastore: reserve region in LBABtree when touching onode
Zhang Song [Wed, 11 Jun 2025 04:04:25 +0000 (12:04 +0800)]
crimson/os/seastore: reserve region in LBABtree when touching onode

Signed-off-by: Zhang Song <zhangsong02@qianxin.com>
Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
12 days agocrimson/os/seastore/OnodeManager: adapt laddr_hint_t approach
Zhang Song [Wed, 11 Jun 2025 04:04:03 +0000 (12:04 +0800)]
crimson/os/seastore/OnodeManager: adapt laddr_hint_t approach

Signed-off-by: Zhang Song <zhangsong02@qianxin.com>
Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
12 days agocrimson/os/seastore/OMapManager: adapt laddr_hint_t approach
Zhang Song [Mon, 26 May 2025 07:23:25 +0000 (15:23 +0800)]
crimson/os/seastore/OMapManager: adapt laddr_hint_t approach

Signed-off-by: Zhang Song <zhangsong02@qianxin.com>
Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
12 days agocrimson/os/seastore: use laddr_hint_t to allocate the laddr
Zhang Song [Tue, 26 Aug 2025 03:36:07 +0000 (11:36 +0800)]
crimson/os/seastore: use laddr_hint_t to allocate the laddr

Signed-off-by: Zhang Song <zhangsong02@qianxin.com>
Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
12 days agocrimson/os/seastore/Onode: get sibling's object id when creating new onode
Zhang Song [Wed, 11 Jun 2025 03:50:12 +0000 (11:50 +0800)]
crimson/os/seastore/Onode: get sibling's object id when creating new onode

Signed-off-by: Zhang Song <zhangsong02@qianxin.com>
Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
12 days agocrimson/os/seastore/Onode: adapt new get hint approach
Zhang Song [Tue, 26 Aug 2025 03:34:37 +0000 (11:34 +0800)]
crimson/os/seastore/Onode: adapt new get hint approach

Signed-off-by: Zhang Song <zhangsong02@qianxin.com>
Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
12 days agocrimson/os/seastore/Onode: support get object/clone prefix
Zhang Song [Thu, 22 May 2025 08:58:14 +0000 (16:58 +0800)]
crimson/os/seastore/Onode: support get object/clone prefix

Signed-off-by: Zhang Song <zhangsong02@qianxin.com>
Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
12 days agocrimson/os/seastore/Onode: remove default metadata offset/range
Zhang Song [Tue, 26 Aug 2025 03:31:03 +0000 (11:31 +0800)]
crimson/os/seastore/Onode: remove default metadata offset/range

Signed-off-by: Zhang Song <zhangsong02@qianxin.com>
Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
12 days agocrimson/os/seastore: introduce laddr_hint_t and associated factory methods
Zhang Song [Wed, 14 May 2025 08:34:00 +0000 (16:34 +0800)]
crimson/os/seastore: introduce laddr_hint_t and associated factory methods

Signed-off-by: Zhang Song <zhangsong02@qianxin.com>
Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
12 days agocrimson/os/seastore: make pladdr_t only store the local clone id instead of full...
Zhang Song [Tue, 26 Aug 2025 02:35:55 +0000 (10:35 +0800)]
crimson/os/seastore: make pladdr_t only store the local clone id instead of full laddr_t

Signed-off-by: Zhang Song <zhangsong02@qianxin.com>
Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
12 days agocrimson/os/seastore: introduce static layout of laddr_t
Zhang Song [Wed, 14 May 2025 08:26:26 +0000 (16:26 +0800)]
crimson/os/seastore: introduce static layout of laddr_t

Signed-off-by: Zhang Song <zhangsong02@qianxin.com>
Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
12 days agocrimson/os/seastore: extend the size of laddr_t from 64 bits to 128 bits
Zhang Song [Wed, 14 May 2025 07:22:15 +0000 (15:22 +0800)]
crimson/os/seastore: extend the size of laddr_t from 64 bits to 128 bits

Signed-off-by: Zhang Song <zhangsong02@qianxin.com>
Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
12 days agoMerge pull request #69045 from xxhdx1985126/wip-seastore-drop-retired-placeholder
Kefu Chai [Sat, 23 May 2026 13:56:32 +0000 (21:56 +0800)]
Merge pull request #69045 from xxhdx1985126/wip-seastore-drop-retired-placeholder

crimson/os/seastore: remove RetiredExtentPlaceholder

Reviewed-by: Kefu Chai <k.chai@proxmox.com>
12 days agoMerge pull request #68823 from tchaikov/wip-crimson-remove-from
Kefu Chai [Sat, 23 May 2026 13:33:14 +0000 (21:33 +0800)]
Merge pull request #68823 from tchaikov/wip-crimson-remove-from

crimson/osd: make PGAdvanceMap idempotent

Reviewed-by: Matan Breizman <mbreizma@redhat.com>
12 days agocrimson/scrub: fix assert in PGScrubber::release_range() on interval change 69064/head
Kefu Chai [Fri, 22 May 2026 11:01:17 +0000 (19:01 +0800)]
crimson/scrub: fix assert in PGScrubber::release_range() on interval change

when an interval change occurs while ScrubReserveRange is still
waiting to acquire background_process_lock, ChunkState::exit()
calls release_range() but blocked is not yet set. this triggers
ceph_assert(blocked) in release_range().

fix by checking if blocked is set before asserting. if blocked is
not set, the range was never reserved, so release_range() is a
no-op. ScrubReserveRange's finally block handles lock cleanup in
this case.

Fixes: https://tracker.ceph.com/issues/76752
Signed-off-by: Kefu Chai <k.chai@proxmox.com>
13 days agoMerge pull request #68684 from ronen-fr/wip-rf-statfx
Ronen Friedman [Sat, 23 May 2026 08:04:45 +0000 (11:04 +0300)]
Merge pull request #68684 from ronen-fr/wip-rf-statfx

osd/scrub: auto-correct accounting-only stat mismatches

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
13 days agoMerge pull request #68292 from Kushal-deb/fix-nvme-gw-crash
Redouane Kachach [Sat, 23 May 2026 08:04:37 +0000 (10:04 +0200)]
Merge pull request #68292 from Kushal-deb/fix-nvme-gw-crash

mgr/cephadm: fix nvmeof reconfig loop by preserving daemon deps

Reviewed-by: Redouane Kachach <rkachach@ibm.com>
13 days agoMerge pull request #67308 from rkachach/fix_issue_ssl_cert_deps
Redouane Kachach [Sat, 23 May 2026 08:04:03 +0000 (10:04 +0200)]
Merge pull request #67308 from rkachach/fix_issue_ssl_cert_deps

mgr/cephadm: track TLS spec changes in deps and cleanup stale certmgr entries on cert source transitions

Reviewed-by: Shweta Bhosale <Shweta.Bhosale1@ibm.com>
13 days agoMerge pull request #68737 from ronen-fr/wip-rf-stqlength
Ronen Friedman [Sat, 23 May 2026 07:50:42 +0000 (10:50 +0300)]
Merge pull request #68737 from ronen-fr/wip-rf-stqlength

crimson+classic/osd/scrub: limit scrubbing under snap-trimming overload

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
13 days agofeat(rgw/kafka): add mTLS client certificate authentication for Kafka notifications 68771/head
Jan Radon [Fri, 15 May 2026 13:42:08 +0000 (15:42 +0200)]
feat(rgw/kafka): add mTLS client certificate authentication for Kafka notifications
Add support for mutual TLS (mTLS) client certificate authentication
when publishing bucket notifications to Kafka brokers. RGW can now
present a client certificate and private key to authenticate with
brokers that require ssl.client.auth=required.
Changes:
- Add ssl-certificate-location, ssl-key-location, and ssl-key-password
  topic attributes for configuring client certificates
- Validate that ssl_certificate and ssl_key are provided together
- Include ssl_key_password in connection identity (hash/equality)
- Add kafka-security.sh script for generating broker and client TLS certs
- Add mTLS test (test_notification_kafka_security_ssl_mtls) using
  use_mtls=True flag on the existing SSL security path
- Update RGW notifications documentation with mTLS parameters

Fixes: http://tracker.ceph.com/issues/67427
Signed-off-by: Jan Radon <jan.fabian.radon@sap.com>
13 days agoMerge pull request #67315 from timqn22/misreporting_count_osd_services
Redouane Kachach [Fri, 22 May 2026 19:28:33 +0000 (21:28 +0200)]
Merge pull request #67315 from timqn22/misreporting_count_osd_services

mgr/cephadm: verify spec service_id before applying

Reviewed-by: Kefu Chai <k.chai@proxmox.com>
Reviewed-by: Shweta Bhosale <Shweta.Bhosale1@ibm.com>
Reviewed-by: Adam King <adking@redhat.com>
13 days agoMerge pull request #66477 from xelexin/fix_cephadm_agent_volume_gatherer
Redouane Kachach [Fri, 22 May 2026 19:27:37 +0000 (21:27 +0200)]
Merge pull request #66477 from xelexin/fix_cephadm_agent_volume_gatherer

orch/cephadm: Fixes an unlimited env append in cephadm agent

Reviewed-by: Adam King <adking@redhat.com>