]> git.apps.os.sepia.ceph.com Git - ceph.git/log
ceph.git
4 months agoMerge pull request #63159 from shraddhaag/wip-shraddhaag-availability-2
Shraddha Agrawal [Thu, 19 Jun 2025 02:22:51 +0000 (07:52 +0530)]
Merge pull request #63159 from shraddhaag/wip-shraddhaag-availability-2

mon: add config option to toggle availability score feature

4 months agoMerge pull request #63416 from VallariAg/wip-revert-nvmeof-ceph-status
SrinivasaBharathKanta [Thu, 19 Jun 2025 00:04:49 +0000 (05:34 +0530)]
Merge pull request #63416 from VallariAg/wip-revert-nvmeof-ceph-status

mon: Revert "mon: Add nvmeof group/gateway name in "ceph -s""

4 months agoMerge pull request #63296 from NitzanMordhai/wip-nitzan-peering-state-toofull-race
SrinivasaBharathKanta [Thu, 19 Jun 2025 00:03:16 +0000 (05:33 +0530)]
Merge pull request #63296 from NitzanMordhai/wip-nitzan-peering-state-toofull-race

osd/PeeringState: re-evaluate full OSDs while waiting for recovery re…

4 months agoMerge pull request #62952 from aainscow/scrub_improvements
Laura Flores [Wed, 18 Jun 2025 21:15:55 +0000 (16:15 -0500)]
Merge pull request #62952 from aainscow/scrub_improvements

osd: Correct scrub analysis for optimised EC

4 months agoMerge pull request #61850 from AliMasarweh/wip-alimasa-rgw-standalone
Daniel Gryniewicz [Wed, 18 Jun 2025 17:03:12 +0000 (13:03 -0400)]
Merge pull request #61850 from AliMasarweh/wip-alimasa-rgw-standalone

RGW/standalone: Build without RadosStore

4 months agoMerge pull request #63572 from Naveenaidu/wip-naveen-fix-DeferBackfill-race-condition
Naveen Naidu [Wed, 18 Jun 2025 16:25:54 +0000 (21:55 +0530)]
Merge pull request #63572 from Naveenaidu/wip-naveen-fix-DeferBackfill-race-condition

osd/PeeringState: handle race condition of DeferBackfill event for Backfilling state

4 months agoMerge pull request #64001 from tchaikov/wip-debian-systemd-install
Kefu Chai [Wed, 18 Jun 2025 13:27:54 +0000 (21:27 +0800)]
Merge pull request #64001 from tchaikov/wip-debian-systemd-install

deb: use glob match to support systemd unit dir changes

Reviewed-by: Casey Bodley <cbodley@redhat.com>
4 months agodeb: use variable expansion to support systemd unit dir changes 64001/head
Kefu Chai [Wed, 18 Jun 2025 09:22:36 +0000 (17:22 +0800)]
deb: use variable expansion to support systemd unit dir changes

Ubuntu changed the systemd unit directory location between releases:
- Jammy (22.04): /lib/systemd/system
- Noble (24.04): /usr/lib/systemd/system

To maintain compatibility across both versions, update .install files
to use brace expansion pattern {usr/,}lib/systemd/system/<service>.

This pattern works because dh_install uses bsd_glob() with GLOB_CSH
flags, which expands braces and matches files in both locations
depending on where CMakeLists.txt actually installed them.

Fixes installation issues when building packages on Noble while
maintaining backward compatibility with Jammy builds.

Signed-off-by: Kefu Chai <tchaikov@gmail.com>
4 months agoMerge PR #63619 into main 18.2.1-334.el9cp
Venky Shankar [Wed, 18 Jun 2025 05:29:58 +0000 (10:59 +0530)]
Merge PR #63619 into main

* refs/pull/63619/head:
client: asynchronous fsync can decrement request ref twice

Reviewed-by: Greg Farnum <gfarnum@redhat.com>
Reviewed-by: Christopher Hoffman <choffman@redhat.com>
4 months agoMerge pull request #60507 from synarete/ss-cephfs-asyncio-use-after-free-bugfix
Venky Shankar [Wed, 18 Jun 2025 05:06:01 +0000 (10:36 +0530)]
Merge pull request #60507 from synarete/ss-cephfs-asyncio-use-after-free-bugfix

client: fix memory leak in Client::CRF_iofinish::complete

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Dhairya Parmar <dparmar@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@ibm.com>
4 months agoMerge pull request #63937 from tchaikov/wip-mds-inode-valid-test-instances
Kefu Chai [Wed, 18 Jun 2025 03:28:52 +0000 (11:28 +0800)]
Merge pull request #63937 from tchaikov/wip-mds-inode-valid-test-instances

mds: generate symlink inode with correct mode

Reviewed-by: Patrick Donnelly <pdonnell@ibm.com>
4 months agoMerge pull request #63933 from tchaikov/wip-rbd-cls_rbd_snap
Ilya Dryomov [Tue, 17 Jun 2025 20:41:52 +0000 (22:41 +0200)]
Merge pull request #63933 from tchaikov/wip-rbd-cls_rbd_snap

cls/rbd: use default values for non-decoded fields in test instances

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
4 months agoMerge pull request #63975 from tchaikov/wip-cmake-find_program
Casey Bodley [Tue, 17 Jun 2025 16:13:13 +0000 (12:13 -0400)]
Merge pull request #63975 from tchaikov/wip-cmake-find_program

cmake: use find_program(REQUIRED) to fail early on missing programs

Reviewed-by: Casey Bodley <cbodley@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@ibm.com>
4 months agoMerge pull request #63399 from Shwetha-Acharya/share_config_update
Adam King [Tue, 17 Jun 2025 15:52:27 +0000 (11:52 -0400)]
Merge pull request #63399 from Shwetha-Acharya/share_config_update

mgr/smb: Add new configs to share definition

Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: John Mulligan <jmulligan@redhat.com>
4 months agocls/rbd: use default values for non-decoded fields in test instances 63933/head
Kefu Chai [Sat, 14 Jun 2025 13:44:05 +0000 (21:44 +0800)]
cls/rbd: use default values for non-decoded fields in test instances

Previously, test instances for cls_rbd_snap used non-default values
for the "parent" field, which is ignored during decoding. The
check-generated.sh test passed because they reused the same instance
for re-encoding, preserving undecoded fields.

An upcoming change will allocate new instances for each encode/decode
verification instead of reusing instances. This will expose
discrepancies between original test instances and re-encoded values
when fields contain non-default values but aren't decoded.

This change sets ignored fields to their default values in test
instances, ensuring consistency between encoding and decoding
operations regardless of the verification approach used.

Since the incompatibility of cls_rbd_snap's on-disk format was
introduced in 32b14ed1, which was introduced Ceph v14, we will
mark this version the first incompatible version in ceph-object-corpus
in the sense that the re-encoded cls_rbd_snap with v8 struct version
is different from the original copy if its parent field is set with
< v8 struct version.

Signed-off-by: Kefu Chai <tchaikov@gmail.com>
4 months agoMerge pull request #63951 from zdover23/wip-doc-2025-06-16-tracker-71628-cephfs-disas...
Zac Dover [Tue, 17 Jun 2025 10:27:32 +0000 (20:27 +1000)]
Merge pull request #63951 from zdover23/wip-doc-2025-06-16-tracker-71628-cephfs-disaster-recovery-first-damage

doc/cephfs: document first-damage.py

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
4 months agocmake: use find_program(REQUIRED) to fail early on missing programs 63975/head
Kefu Chai [Tue, 17 Jun 2025 09:22:16 +0000 (17:22 +0800)]
cmake: use find_program(REQUIRED) to fail early on missing programs

Since upgrading minimum CMake version to 3.22.1 (commit 469d82a1), we can
now use find_program(REQUIRED) which was introduced in CMake 3.18.

This change replaces manual FATAL_ERROR checks with the REQUIRED option
and adds it to programs that are actually needed during the build. This
ensures the build fails early during configuration rather than later
during compilation when missing programs are invoked.

Changes:
- Replace find_program() + message(FATAL_ERROR) patterns with REQUIRED
- Add REQUIRED to programs that are used during build but previously
  had no error checking

Reference: https://cmake.org/cmake/help/latest/command/find_program.html

Signed-off-by: Kefu Chai <tchaikov@gmail.com>
4 months agodoc/cephfs: document first-damage.py 63951/head
Zac Dover [Mon, 16 Jun 2025 08:02:49 +0000 (18:02 +1000)]
doc/cephfs: document first-damage.py

Document the use of first-damage.py.

Fixes: https://tracker.ceph.com/issues/71628
Signed-off-by: Zac Dover <zac.dover@proton.me>
4 months agoMerge pull request #55875 from rhcs-dashboard/update-npm-deps
Nizamudeen A [Tue, 17 Jun 2025 04:52:58 +0000 (10:22 +0530)]
Merge pull request #55875 from rhcs-dashboard/update-npm-deps

mgr/dashboard: update npm dependencies

4 months agoMerge pull request #63872 from rhcs-dashboard/side-panel-carbon
Nizamudeen A [Tue, 17 Jun 2025 04:52:00 +0000 (10:22 +0530)]
Merge pull request #63872 from rhcs-dashboard/side-panel-carbon

mgr/dashboard: introduce side panel as a reusable component

4 months agoMerge pull request #62806 from NitzanMordhai/wip-nitzan-mixed-balance-read-and-rworde...
SrinivasaBharathKanta [Tue, 17 Jun 2025 04:02:05 +0000 (09:32 +0530)]
Merge pull request #62806 from NitzanMordhai/wip-nitzan-mixed-balance-read-and-rwordered-fix

mixed balance read and rwordered in read ops

4 months agoMerge pull request #63515 from xxhdx1985126/wip-seastore-start-stop-trim_dirty
Yingxin Cheng [Tue, 17 Jun 2025 03:19:48 +0000 (11:19 +0800)]
Merge pull request #63515 from xxhdx1985126/wip-seastore-start-stop-trim_dirty

crimson/os/seastore/async_cleaner: set different threshold for starting and stopping trimming dirty

Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
4 months agoMerge pull request #63822 from cbodley/wip-71600
Casey Bodley [Mon, 16 Jun 2025 22:35:28 +0000 (18:35 -0400)]
Merge pull request #63822 from cbodley/wip-71600

test/common: unittest_fault_injector omits unit-main target

Reviewed-by: Adam Emerson <aemerson@redhat.com>
4 months agoMerge pull request #63303 from mkogan1/wip-eio-to-err_internal
Adam Emerson [Mon, 16 Jun 2025 21:34:10 +0000 (17:34 -0400)]
Merge pull request #63303 from mkogan1/wip-eio-to-err_internal

rgw/multisite: change HTTP error handling from EIO to ERR_INTERNAL_ERROR

Reviewed-by: Casey Bodley <cbodley@redhat.com>
4 months agoMerge pull request #63177 from NitzanMordhai/wip-nitzan-aio-cxx-test-inflight-checks
SrinivasaBharathKanta [Mon, 16 Jun 2025 14:05:49 +0000 (19:35 +0530)]
Merge pull request #63177 from NitzanMordhai/wip-nitzan-aio-cxx-test-inflight-checks

test/librados/aio_cxx: skip EIO boundary assertion when no in-flight I/Os

4 months agoMerge pull request #62197 from NitzanMordhai/wip-nitzan-mgr-pymodule-mem-leaks
SrinivasaBharathKanta [Mon, 16 Jun 2025 14:04:09 +0000 (19:34 +0530)]
Merge pull request #62197 from NitzanMordhai/wip-nitzan-mgr-pymodule-mem-leaks

mgr: PyModule memory leaks fix

4 months agoMerge pull request #63326 from linuxbox2/wip-cksum-golang
Matt Benjamin [Mon, 16 Jun 2025 13:27:12 +0000 (09:27 -0400)]
Merge pull request #63326 from linuxbox2/wip-cksum-golang

rgw: recognize checksum from x-amz-checksum-{type} alone

4 months agoMerge pull request #63821 from zdover23/wip-doc-2025-06-09-pr-63490-followup 18.2.1-327.el9c
Anthony D'Atri [Mon, 16 Jun 2025 12:30:54 +0000 (08:30 -0400)]
Merge pull request #63821 from zdover23/wip-doc-2025-06-09-pr-63490-followup

doc/src: edit osd.yaml.in (osd_deep_scrub_interval_cv)

4 months agomgr/dashboard: bump node to 20.18.1 55875/head
Nizamudeen A [Mon, 16 Jun 2025 04:50:45 +0000 (10:20 +0530)]
mgr/dashboard: bump node to 20.18.1

one of the inner dependency expects it
```
npm WARN EBADENGINE Unsupported engine {
npm WARN EBADENGINE   package: 'undici@7.10.0',
npm WARN EBADENGINE   required: { node: '>=20.18.1' },
npm WARN EBADENGINE   current: { node: 'v20.13.1', npm: '10.5.2' }
npm WARN EBADENGINE }

```

Fixes: https://tracker.ceph.com/issues/70934
Signed-off-by: Nizamudeen A <nia@redhat.com>
4 months agomgr/dashboard: update npm dependencies
Nizamudeen A [Sat, 5 Aug 2023 18:23:29 +0000 (23:53 +0530)]
mgr/dashboard: update npm dependencies

Fixes: https://tracker.ceph.com/issues/70934
Signed-off-by: Nizamudeen A <nia@redhat.com>
4 months agomgr/dashboard: introduce side panel as a reusable component 63872/head
Nizamudeen A [Wed, 11 Jun 2025 07:59:58 +0000 (13:29 +0530)]
mgr/dashboard: introduce side panel as a reusable component

Fixes: https://tracker.ceph.com/issues/71653
Signed-off-by: Nizamudeen A <nia@redhat.com>
4 months agoMerge pull request #63291 from VallariAg/nvmeof-teuthology-reduce-devices
Vallari Agrawal [Mon, 16 Jun 2025 07:39:06 +0000 (13:09 +0530)]
Merge pull request #63291 from VallariAg/nvmeof-teuthology-reduce-devices

qa: reduce nvmeof thrasher fio to 32 devices from 200

4 months agoMerge pull request #63034 from thotz/cloud-restore-allow-copy-cloud-restored-temp-obj
Soumya Koduri [Mon, 16 Jun 2025 05:44:53 +0000 (11:14 +0530)]
Merge pull request #63034 from thotz/cloud-restore-allow-copy-cloud-restored-temp-obj

rgw/cloud-restore: allow copy of temporary object

4 months agoqa/suites/nvmeof: remove "--refresh" from "ceph orch ps/ls" 63291/head
Vallari Agrawal [Tue, 10 Jun 2025 14:47:13 +0000 (20:17 +0530)]
qa/suites/nvmeof: remove "--refresh" from "ceph orch ps/ls"

Also in scalability test, set SCALING_DELAYS: '400'
instead of '300' because we might need more time for
namespace load balancing.

Fixes: https://tracker.ceph.com/issues/70927
Signed-off-by: Vallari Agrawal <vallari.agrawal@ibm.com>
4 months agoqa: archive dmesg logs if nvmeof basic_tests fails
Vallari Agrawal [Thu, 29 May 2025 03:30:22 +0000 (09:00 +0530)]
qa: archive dmesg logs if nvmeof basic_tests fails

Signed-off-by: Vallari Agrawal <vallari.agrawal@ibm.com>
4 months agoqa: change processing of "nvme list" json output
Vallari Agrawal [Wed, 28 May 2025 02:36:36 +0000 (08:06 +0530)]
qa: change processing of "nvme list" json output

Revert changes made in 22f91cdc65246a0f9255e9348c15230670574e64
to processing of "nvme list -o json".

nvme-cli has changed json output in 2.11
and then reverted the change in 2.13 release.
So this commit goes back to 2.13 (or pre-2.11)
processing of "nvme list" json output.

Signed-off-by: Vallari Agrawal <vallari.agrawal@ibm.com>
4 months agoqa: run nvmeof tests against "devel" tag
Vallari Agrawal [Tue, 20 May 2025 11:40:21 +0000 (17:10 +0530)]
qa: run nvmeof tests against "devel" tag

Signed-off-by: Vallari Agrawal <vallari.agrawal@ibm.com>
4 months agoqa: reduce nvmeof thrasher fio to 32 devices from 200
Vallari Agrawal [Mon, 19 May 2025 06:08:21 +0000 (11:38 +0530)]
qa: reduce nvmeof thrasher fio to 32 devices from 200

This is because teuthology machines are not
powerful enough to support fio on huge amount of
devices.

This commit also adds CEPHADM_STRAY_DAEMON and
CEPHADM_DAEMON_PLACE_FAIL to thrasher ignorelist.

Signed-off-by: Vallari Agrawal <vallari.agrawal@ibm.com>
4 months agoMerge pull request #63934 from anthonyeleven/ceph-conf
Zac Dover [Mon, 16 Jun 2025 02:48:59 +0000 (12:48 +1000)]
Merge pull request #63934 from anthonyeleven/ceph-conf

doc/rados/configuration: Improve ceph-conf.rst

Reviewed-by: Zac Dover <zac.dover@proton.me>
4 months agodoc/rados/configuration: Improve ceph-conf.rst 63934/head
Anthony D'Atri [Sun, 15 Jun 2025 02:09:11 +0000 (22:09 -0400)]
doc/rados/configuration: Improve ceph-conf.rst

Signed-off-by: Anthony D'Atri <anthonyeleven@users.noreply.github.com>
4 months agoMerge pull request #62671 from adamemerson/wip-test-objectstore-signed-comparison
Matan Breizman [Sun, 15 Jun 2025 08:07:53 +0000 (11:07 +0300)]
Merge pull request #62671 from adamemerson/wip-test-objectstore-signed-comparison

test/objectstore: Fix signed comparison warning

Reviewed-by: Kefu Chai <kefu.chai@scylladb.com>
Reviewed-by: Matan Breizman <mbreizma@redhat.com>
4 months agoMerge pull request #63833 from Matan-B/wip-matanb-crimson-flavor
Matan Breizman [Sun, 15 Jun 2025 07:57:32 +0000 (10:57 +0300)]
Merge pull request #63833 from Matan-B/wip-matanb-crimson-flavor

qa/suites/crimson-rados: Accommodate to new crimson flavors

Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
4 months agoMerge pull request #63834 from tchaikov/wip-install-deps
Kefu Chai [Sun, 15 Jun 2025 07:37:07 +0000 (15:37 +0800)]
Merge pull request #63834 from tchaikov/wip-install-deps

install-deps.sh: refresh package index before installing dependencies

Reviewed-by: Ronen Friedman <rfriedma@redhat.com>
4 months agomds: generate symlink inode with correct mode 63937/head
Kefu Chai [Fri, 13 Jun 2025 12:22:20 +0000 (20:22 +0800)]
mds: generate symlink inode with correct mode

Fix test instance generation for InodeStoreBare and InodeStore to
properly set the mode field to S_IFLNK for symlink inodes.

Previously, generated test instances with symlink inodes had unset
mode fields, creating inconsistent data. This issue was masked because
ceph-dencoder reused existing instances during encode/decode consistency
tests, leaving stale values intact.

The problem would surface when check-generated.sh and readable.sh
allocate fresh instances for decoding tests, as the missing mode field
would cause decode/encode inconsistencies.

This change fixes generate_test_instances() to set the mode field to
S_IFLNK for symlink inodes, creating valid InodeStore and InodeStoreBare
instances with consistent field values for proper encode/decode testing.

Signed-off-by: Kefu Chai <tchaikov@gmail.com>
4 months agoMerge pull request #63675 from afreen23/main 63932/head
afreen23 [Sat, 14 Jun 2025 13:33:23 +0000 (19:03 +0530)]
Merge pull request #63675 from afreen23/main

mgr/dashboard: Fix capitalization in SMB

Reviewed-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>
4 months agoMerge pull request #62616 from adamemerson/wip-noble
Casey Bodley [Fri, 13 Jun 2025 20:48:04 +0000 (16:48 -0400)]
Merge pull request #62616 from adamemerson/wip-noble

build: Fix opentelemetry-cpp build failure on Noble

Reviewed-by: Dan Mick <dmick@redhat.com>
Reviewed-by: Casey Bodley <cbodley@redhat.com>
4 months agoMerge pull request #63793 from cbodley/wip-71584
Casey Bodley [Fri, 13 Jun 2025 20:42:48 +0000 (16:42 -0400)]
Merge pull request #63793 from cbodley/wip-71584

qa/rgw: bump maven version in hadoop task to resolve 404 Not Found

Reviewed-by: Adam Emerson <aemerson@redhat.com>
4 months agoMerge pull request #63918 from bluikko/doc-update-deprecation-cephfs
Anthony D'Atri [Fri, 13 Jun 2025 18:41:27 +0000 (14:41 -0400)]
Merge pull request #63918 from bluikko/doc-update-deprecation-cephfs

doc/cephfs: Update deprecation notice in experimental-features.rst

4 months agoMerge pull request #63919 from bluikko/doc-mantle-formatting-cephfs
Anthony D'Atri [Fri, 13 Jun 2025 18:35:36 +0000 (14:35 -0400)]
Merge pull request #63919 from bluikko/doc-mantle-formatting-cephfs

doc/cephfs: Improve formatting in mantle.rst

4 months agoadd explicit checksum matrix 63326/head
Matt Benjamin [Tue, 3 Jun 2025 16:54:38 +0000 (12:54 -0400)]
add explicit checksum matrix

Signed-off-by: Matt Benjamin <mbenjamin@redhat.com>
4 months agorgw: framework shell of gosdk tests
Matt Benjamin [Fri, 30 May 2025 21:56:10 +0000 (17:56 -0400)]
rgw: framework shell of gosdk tests

Contains two golang functions based on the checksum failure reproducer
provided by Fred Heinecke.

Signed-off-by: Matt Benjamin <mbenjamin@redhat.com>
4 months agoMerge pull request #63795 from tchaikov/wip-dmclock-cmake-required
Casey Bodley [Fri, 13 Jun 2025 17:22:40 +0000 (13:22 -0400)]
Merge pull request #63795 from tchaikov/wip-dmclock-cmake-required

cmake: bump minimum required CMake version to 3.22.1

Reviewed-by: Casey Bodley <cbodley@redhat.com>
4 months agoMerge pull request #63920 from bluikko/doc-sphinx-warnings-new
Anthony D'Atri [Fri, 13 Jun 2025 16:50:23 +0000 (12:50 -0400)]
Merge pull request #63920 from bluikko/doc-sphinx-warnings-new

doc: Fix sphinx warnings and one error

4 months agoMerge pull request #63913 from phlogistonjohn/jjm-bwc-pyold
Zack Cerza [Fri, 13 Jun 2025 16:47:58 +0000 (10:47 -0600)]
Merge pull request #63913 from phlogistonjohn/jjm-bwc-pyold

script/build-with-container: add workarounds for older python versions

4 months agoMerge pull request #63922 from thmour/patch-1
Casey Bodley [Fri, 13 Jun 2025 15:57:51 +0000 (11:57 -0400)]
Merge pull request #63922 from thmour/patch-1

src: typo in radosgw-admin.cc

Reviewed-by: Casey Bodley <cbodley@redhat.com>
4 months agodoc: Fix sphinx warnings 63920/head
Ville Ojamo [Fri, 13 Jun 2025 10:28:23 +0000 (17:28 +0700)]
doc: Fix sphinx warnings

doc/cephadm/services/snmp-gateway.rst: Don't use double backticks for
links. Makes it a link instead of rendering syntax verbatim.
Also for consistency use single backticks for links instead of a plain
trailing underscore.
Improve language of opening sentence.

doc/dev/cephfs-mirroring.rst: Add missing empty line before preformatted
blocks. No change in rendered docs.

doc/mgr/telemetry.rst: Fix external link syntax. Makes it a link instead
of rendering syntax and pointing to non-existing link.

Signed-off-by: Ville Ojamo <14869000+bluikko@users.noreply.github.com>
4 months agosrc: typo in radosgw-admin.cc 63922/head
Mouratidis Theofilos [Fri, 13 Jun 2025 13:29:54 +0000 (15:29 +0200)]
src: typo in radosgw-admin.cc

Probably a result of a refactor

Signed-off-by: Mouratidis Theofilos <mtheofilos@gmail.com>
4 months agoMerge pull request #62299 from thegreenbear/cephadm-sd-fix-haproxy-labels
Adam King [Fri, 13 Jun 2025 13:04:42 +0000 (09:04 -0400)]
Merge pull request #62299 from thegreenbear/cephadm-sd-fix-haproxy-labels

src/pybind/mgr/cephadm/service_discovery: fixed HAProxy labels

Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: Afreen Misbah <afreen@ibm.com>
4 months agoclient: fix memory leak in Client::CRF_iofinish::complete 60507/head
Shachar Sharon [Tue, 22 Oct 2024 12:06:54 +0000 (15:06 +0300)]
client: fix memory leak in Client::CRF_iofinish::complete

Commit 1210ddf7a ("Client: Add non-blocking helper classes") introduced
Client::C_Read_Finisher Context object for async READ operations, but
it has a read-after-free bug which may cause memory leak when calling
libcephf's non-blocking ceph_ll_nonblocking_readv_writev API with async
READ:

ceph_ll_nonblocking_readv_writev (READ)
  Client::ll_preadv_pwritev
  ...
    Client::_read_async
      Context::complete
        Client::CRF_iofinish::complete
          Client::CRF_iofinish::finish
          CRF->finish_io()
            Client::C_Read_Finisher::finish_io
            ...
            delete this; // frees CRF_iofinish->CRF
          if (CRF->iofinished) // use-after-free of CRF
            delete this; // may not get here

A possible memory leak depends on timing and race with other thread
allocation which alters the memory address of CRF->iofinished to
false, thus skipping the last delete operation.

The check of `if (CRF->iofinished)` is unnecessary: it is always set to
true upon calling CRF->finish_io(). Thus, there is no need to have the
override function Client::CRF_iofinish::complete() as it now has the
same logic as Context::complete(). Removed.

Signed-off-by: Shachar Sharon <ssharon@redhat.com>
4 months agodoc/cephfs: Improve formatting in mantle.rst 63919/head
Ville Ojamo [Fri, 13 Jun 2025 10:02:33 +0000 (17:02 +0700)]
doc/cephfs: Improve formatting in mantle.rst

Use ordered lists instead of hardcoded list item number paragraphs.

Indent list item contents correctly so that a text block is not
rendered inside a previous preformatted block.
Also fix indentation of one preformatted block inside a list item to be
at the same amount of indent as other such blocks.

Use inline preformatted for commands, method/function names etc. instead
of italic/MD-style inline preformatted.

Signed-off-by: Ville Ojamo <14869000+bluikko@users.noreply.github.com>
4 months agodoc/cephfs: Update deprecation notice in experimental-features.rst 63918/head
Ville Ojamo [Fri, 13 Jun 2025 09:39:35 +0000 (16:39 +0700)]
doc/cephfs: Update deprecation notice in experimental-features.rst

The "inline data" feature was not removed on Quincy so update
deprecation notice to refer to "a future release" instead.

Signed-off-by: Ville Ojamo <14869000+bluikko@users.noreply.github.com>
4 months agocmake: bump minimum required CMake version to 3.22.1 63795/head
Kefu Chai [Sun, 8 Jun 2025 08:24:51 +0000 (16:24 +0800)]
cmake: bump minimum required CMake version to 3.22.1

Bump the minimum required CMake version from 3.5 to 3.22.1 in both
Ceph and embedded dmclock to address deprecation warnings.

CMake 4.0.2 generates deprecation warnings for projects requiring
versions below 3.10:

```
CMake Deprecation Warning at src/dmclock/CMakeLists.txt:1 (cmake_minimum_required):
  Compatibility with CMake < 3.10 will be removed from a future version of
  CMake.

  Update the VERSION argument <min> value.  Or, use the <min>...<max> syntax
  to tell CMake that the project requires at least <min> but has been updated
  to work with policies introduced by <max> or earlier.
```

The upstream CI workflow uses Ubuntu 22.04 (CMake 3.22.1) and CentOS 9
(CMake 3.26.5), so bumping to 3.22.1 maintains compatibility with our
supported build environments while enabling access to newer CMake
features.

Signed-off-by: Kefu Chai <tchaikov@gmail.com>
4 months agocmake: drop c-ares::c-ares alias
Kefu Chai [Fri, 13 Jun 2025 08:20:10 +0000 (16:20 +0800)]
cmake: drop c-ares::c-ares alias

Remove the c-ares::c-ares alias that was causing build failures after
bumping the minimum CMake version:

```
CMake Error at cmake/modules/Findc-ares.cmake:34 (add_library):
  add_library cannot create ALIAS target "c-ares::c-ares" because another
  target with the same name already exists.
Call Stack (most recent call first):
  src/CMakeLists.txt:463 (_find_package)
  src/seastar/cmake/SeastarDependencies.cmake:136 (find_package)
  src/seastar/CMakeLists.txt:395 (seastar_find_dependencies)`
```

The alias was originally added for backward compatibility with Seastar,
but is no longer needed since the updated Seastar submodule no longer
references the c-ares::c-ares target.

Signed-off-by: Kefu Chai <tchaikov@gmail.com>
4 months agoscript/build-with-container: add workarounds for older python versions 63913/head
John Mulligan [Fri, 13 Jun 2025 00:37:56 +0000 (20:37 -0400)]
script/build-with-container: add workarounds for older python versions

Signed-off-by: John Mulligan <jmulligan@redhat.com>
4 months agoMerge pull request #63887 from phlogistonjohn/jjm-bwc-improvements
Dan Mick [Thu, 12 Jun 2025 23:32:26 +0000 (16:32 -0700)]
Merge pull request #63887 from phlogistonjohn/jjm-bwc-improvements

Assorted build-with-container improvements

4 months agomissed internal, apparently invalid no-length exception case
Matt Benjamin [Tue, 3 Jun 2025 16:16:28 +0000 (12:16 -0400)]
missed internal, apparently invalid no-length exception case

Signed-off-by: Matt Benjamin <mbenjamin@redhat.com>
4 months agorgw: aws-chunked need not supply any content-length
Matt Benjamin [Sun, 18 May 2025 01:02:34 +0000 (21:02 -0400)]
rgw: aws-chunked need not supply any content-length

The updated logic for aws chunked handling (2024) appears sufficient
to handle the cases produced by aws-sdk-go-v2.

Note that https://docs.aws.amazon.com/AmazonS3/latest/API/sigv4-streaming.html
states that "For all requests, you must include the
x-amz-decoded-content-length header, specifying the size of the object in
bytes." (accessed 5/17/2025) (but now we do not enforce it).

Reported (with reproducer!) by: Fred Heinecke.

Fixes: https://tracker.ceph.com/issues/71183
Signed-off-by: Matt Benjamin <mbenjamin@redhat.com>
4 months agorgw_cksum: select checksum algo from only a checksum trailer header
Matt Benjamin [Sat, 17 May 2025 23:42:09 +0000 (19:42 -0400)]
rgw_cksum: select checksum algo from only a checksum trailer header

When the checksum payload will be sent in trailer section, a typed
checksum header name will be one of the values of x-amz-trailer.

Signed-off-by: Matt Benjamin <mbenjamin@redhat.com>
4 months agorgw: recognize checksum from x-amz-checksum-{type} alone
Matt Benjamin [Sat, 17 May 2025 19:52:20 +0000 (15:52 -0400)]
rgw: recognize checksum from x-amz-checksum-{type} alone

Some SDKs may send x-amz-checksum-algorithm or
x-amz-sdk-checksum-algorithm regardless as well, but those are
only required if the checksum header is in the trailer section.

Fixes: https://tracker.ceph.com/issues/71350
Signed-off-by: Matt Benjamin <mbenjamin@redhat.com>
4 months agoMerge pull request #63557 from Matan-B/wip-matanb-crimson-nightly-2-flavors
Matan Breizman [Thu, 12 Jun 2025 14:28:43 +0000 (17:28 +0300)]
Merge pull request #63557 from Matan-B/wip-matanb-crimson-nightly-2-flavors

qa/crontab/teuthology-cronjobs: Use Crimson flavors

Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
4 months agosrc/pybind/mgr/cephadm/service_discovery: fixed HAProxy labels 62299/head
Bernard Landon [Thu, 13 Mar 2025 11:04:42 +0000 (11:04 +0000)]
src/pybind/mgr/cephadm/service_discovery: fixed HAProxy labels

Fixes: https://tracker.ceph.com/issues/70477
Signed-off-by: Bernard Landon <bernard@lndn.ch>
4 months agoMerge pull request #63605 from tchaikov/wip-rgw-sal-posix-fix-leak
Casey Bodley [Thu, 12 Jun 2025 13:47:18 +0000 (09:47 -0400)]
Merge pull request #63605 from tchaikov/wip-rgw-sal-posix-fix-leak

rgw/driver/posix: closedir() to free dir handle

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>
4 months agoscript/buildcontainer-setup: set WITH_CRIMSON to support crimson builds 63887/head
John Mulligan [Wed, 11 Jun 2025 00:15:28 +0000 (20:15 -0400)]
script/buildcontainer-setup: set WITH_CRIMSON to support crimson builds

Set the WITH_CRIMSON env var so that crimson related dependencies are
installed and the build image can build crimson flavored ceph.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
4 months agoscript/build-with-container: handle failure to find source rpm better
John Mulligan [Thu, 8 May 2025 17:22:52 +0000 (13:22 -0400)]
script/build-with-container: handle failure to find source rpm better

Don't use an obscure assert, raise an exception with a helpful error
message.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
4 months agoscript/build-with-container: fix rpm build step when using an overlay
John Mulligan [Thu, 8 May 2025 15:21:32 +0000 (11:21 -0400)]
script/build-with-container: fix rpm build step when using an overlay

Signed-off-by: John Mulligan <jmulligan@redhat.com>
4 months agoscript/build-with-container: fix rpmbuild when no rpm build args are set
John Mulligan [Thu, 8 May 2025 15:21:11 +0000 (11:21 -0400)]
script/build-with-container: fix rpmbuild when no rpm build args are set

Signed-off-by: John Mulligan <jmulligan@redhat.com>
4 months agoscript/build-with-container: cache git branch result
John Mulligan [Fri, 21 Mar 2025 18:28:25 +0000 (14:28 -0400)]
script/build-with-container: cache git branch result

Cache the branch we got from the git command as it is highly unlikely
to change during the script execution and if it does -- we mostly don't
care anyway.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
4 months agoscript/build-with-container: detect when cached build images need rebuilding
John Mulligan [Fri, 21 Mar 2025 17:53:16 +0000 (13:53 -0400)]
script/build-with-container: detect when cached build images need rebuilding

Signed-off-by: John Mulligan <jmulligan@redhat.com>
4 months agoscript/build-with-container: add log line with time taken
John Mulligan [Thu, 20 Mar 2025 21:03:43 +0000 (17:03 -0400)]
script/build-with-container: add log line with time taken

Add a log line with time taken to the output of the script, this can
help to see at a glance how long a particular bwc step took.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
4 months agoMerge pull request #63005 from bluikko/doc-layout-formatting-radosgw
Anthony D'Atri [Thu, 12 Jun 2025 12:40:35 +0000 (08:40 -0400)]
Merge pull request #63005 from bluikko/doc-layout-formatting-radosgw

doc/radosgw: Promptify CLI commands and fix formatting in layout.rst

4 months agoMerge pull request #63160 from yuvalif/wip-yuval-71219
Yuval Lifshitz [Thu, 12 Jun 2025 12:26:46 +0000 (15:26 +0300)]
Merge pull request #63160 from yuvalif/wip-yuval-71219

rgw/logging: send flushed object name in API reply

4 months agomgr/dashboard: Fix capitalization in SMB 63675/head
Afreen Misbah [Tue, 3 Jun 2025 15:03:50 +0000 (20:33 +0530)]
mgr/dashboard: Fix capitalization in SMB

- changes from 'Smb' to 'SMB

Fixes https://tracker.ceph.com/issues/71538

Signed-off-by: Afreen Misbah <afreen@ibm.com>
4 months agoqa/standalone/mon/availability.sh: add test for config option 63159/head
Shraddha Agrawal [Thu, 29 May 2025 10:10:01 +0000 (15:40 +0530)]
qa/standalone/mon/availability.sh: add test for config option

This commit adds two tests, first, to ensure we get an error
message when the feature is disabled. It checks if the config
option, enable_availability_tracking is working properly.
Second test ensures that we actually do stop calculating the
score when the feature is disabled.

Fixes: https://tracker.ceph.com/issues/71494
Signed-off-by: Shraddha Agrawal <shraddhaag@ibm.com>
4 months agodoc: add docs and update release notes for the new config option
Shraddha Agrawal [Thu, 29 May 2025 08:05:40 +0000 (13:35 +0530)]
doc: add docs and update release notes for the new config option

Fixes: https://tracker.ceph.com/issues/71494
Signed-off-by: Shraddha Agrawal <shraddhaag@ibm.com>
4 months agomon/MgrStatMonitor: ignore duration for which feature is off
Shraddha Agrawal [Thu, 22 May 2025 10:26:41 +0000 (15:56 +0530)]
mon/MgrStatMonitor: ignore duration for which feature is off

When the availability tracking feature is disabled, we should not
be updating the score. We should start recalculating the score
when the user enables the features again. Essentially, for the
purpose of calculating the score, we need to ignore the duration
for which the feature was turned off.

The score is calculated from the uptime and downtime durations
recorded in `pool_availability` object. These durations are updated
in `calc_pool_availability` by adding the diff between last_uptime/
last_downtime and now.

To discard the duration for which the feature was turned off, we
need to offset the uptime/downtime by this duration. A simple way
to do this is to update the last_uptime and last_downtime to the
timestamp when the feature is toggled on again. To implement the
same, we record the time at which the feature is toggled from off
to on. When `calc_pool_availability` is invoked, if a reset is
required, it resets last_uptime and last_downtime before proceeding
with availability calculations.

We only care about the state when the feature is toggled from off to
on. All other toggle states for the config option will not have any
effect on the score.

Fixes: https://tracker.ceph.com/issues/71494
Signed-off-by: Shraddha Agrawal <shraddhaag@ibm.com>
4 months agoMgrStatMonitor: add config observer
Shraddha Agrawal [Thu, 22 May 2025 09:16:50 +0000 (14:46 +0530)]
MgrStatMonitor: add config observer

This commit adds a config observer to MgrStatMonitor so we
can track when a user enables/disables enable_availability_tracking
config option. The time difference between disabling and then
enabling the config option will be used to offset the uptime
and/or downtime from the availability score feature.

Fixes: https://tracker.ceph.com/issues/71494
Signed-off-by: Shraddha Agrawal <shraddhaag@ibm.com>
4 months agomon/MgrStatMonitor.cc: do not update score when disabled
Shraddha Agrawal [Thu, 22 May 2025 08:20:57 +0000 (13:50 +0530)]
mon/MgrStatMonitor.cc: do not update score when disabled

This commit adds changes to ensure the availability score
tracking is not updated when the feature is disabled. We
will preserve the score calculated before the feature is
turned off and start updating it again when the feature
is enabled.

Fixes: https://tracker.ceph.com/issues/71494
Signed-off-by: Shraddha Agrawal <shraddhaag@ibm.com>
4 months agosrc/common/options: add config option for availability score
Shraddha Agrawal [Tue, 6 May 2025 06:20:59 +0000 (11:50 +0530)]
src/common/options: add config option for availability score

This commit modifies src/common/options/mon.yaml.in to add a
new config option to enable/disable tracking availability
score. This config option can be modified dynamically at
runtime as well.

To enable tracking availability score, we can run the
following command:

  ceph config set mon enable_availability_tracking true

By default, tracking availability score is enabled.

To disable tracking availability score:

  ceph config set mon enable_availablity_tracking false

When the feature is turned off, invoking the
`availability-status` command will display an error, prompting
the user to turn on the feature using the config option.

Fixes: https://tracker.ceph.com/issues/71494
Signed-off-by: Shraddha Agrawal <shraddhaag@ibm.com>
4 months agocontainer: update crimson flavors 63833/head
Matan Breizman [Thu, 12 Jun 2025 08:29:29 +0000 (08:29 +0000)]
container: update crimson flavors

Signed-off-by: Matan Breizman <mbreizma@redhat.com>
4 months agoqa/tasks: update crimson flavor types
Matan Breizman [Thu, 12 Jun 2025 08:32:35 +0000 (08:32 +0000)]
qa/tasks: update crimson flavor types

Signed-off-by: Matan Breizman <mbreizma@redhat.com>
4 months agoMerge pull request #63714 from Naveenaidu/wip-naveen-update-config-diff-script
Naveen Naidu [Thu, 12 Jun 2025 02:36:22 +0000 (08:06 +0530)]
Merge pull request #63714 from Naveenaidu/wip-naveen-update-config-diff-script

.github/workflow/diff-ceph-config.yml: only detect the configuration changes that were made in the PR

4 months agoMerge pull request #63840 from tchaikov/wip-test-erasure-code-fix-leak
Kefu Chai [Thu, 12 Jun 2025 00:10:07 +0000 (08:10 +0800)]
Merge pull request #63840 from tchaikov/wip-test-erasure-code-fix-leak

test/erasure-code: fix memory leak in ErasureCodePlugin.parity_delta_…

Reviewed-by: Alex Ainscow <aainscow@uk.ibm.com>
4 months agoMerge pull request #63876 from zdover23/wip-doc-2025-06-11-radosgw-cloud-transition...
Anthony D'Atri [Wed, 11 Jun 2025 16:31:01 +0000 (12:31 -0400)]
Merge pull request #63876 from zdover23/wip-doc-2025-06-11-radosgw-cloud-transition-63447-followup

doc/radosgw: edit cloud-transition.rst

4 months agoMerge pull request #63877 from zdover23/wip-doc-2025-06-11-rados-ops-cache-tiering...
Anthony D'Atri [Wed, 11 Jun 2025 16:30:45 +0000 (12:30 -0400)]
Merge pull request #63877 from zdover23/wip-doc-2025-06-11-rados-ops-cache-tiering-63745-followup

doc/rados/ops: edit cache-tiering.rst

4 months agoMerge pull request #61543 from Naveenaidu/naveen/add-telemetry-upgrade-docs
Naveen Naidu [Wed, 11 Jun 2025 16:10:02 +0000 (21:40 +0530)]
Merge pull request #61543 from Naveenaidu/naveen/add-telemetry-upgrade-docs

doc/mgr/telemetry: add doc for telemetry upgrade tests

4 months agoMerge pull request #63003 from leonidc/fix_duplicate_entity_addr
Samuel Just [Wed, 11 Jun 2025 15:30:29 +0000 (08:30 -0700)]
Merge pull request #63003 from leonidc/fix_duplicate_entity_addr

fix duplicated entity addr in the map during reboot of several GWs

Reviewed-by: Samuel Just <sjust@redhat.com>
4 months agoMerge pull request #62880 from kshtsk/wip-random-sample-population
kyr [Wed, 11 Jun 2025 15:21:33 +0000 (17:21 +0200)]
Merge pull request #62880 from kshtsk/wip-random-sample-population

qa/tasks/ceph_manager: population must be a sequence

4 months agoMerge pull request #62148 from Jayaprakash-ibm/wip-create-bdev-label
Jaya Prakash [Wed, 11 Jun 2025 14:10:07 +0000 (19:40 +0530)]
Merge pull request #62148 from Jayaprakash-ibm/wip-create-bdev-label

os/bluestore: Implemented create-bdev-label

Reviewed-by: Adam Kupczyk <akupczyk@ibm.com>
4 months agorgw/cloud-restore: allow copy of temporary object 63034/head
Jiffin Tony Thottan [Tue, 29 Apr 2025 07:14:42 +0000 (12:44 +0530)]
rgw/cloud-restore: allow copy of temporary object

Since storage class is cloud tier s3 cp of temporary object was failing.

Fixes: https://tracker.ceph.com/issues/71121
Signed-off-by: Jiffin Tony Thottan <thottanjiffin@gmail.com>
4 months agodoc/rados/ops: edit cache-tiering.rst 63877/head
Zac Dover [Wed, 11 Jun 2025 12:44:32 +0000 (22:44 +1000)]
doc/rados/ops: edit cache-tiering.rst

Add material to doc/rados/operations/cache-tiering.rst, as suggested by
Anthony D'Atri in
https://github.com/ceph/ceph/pull/63745#discussion_r2127887785.

Signed-off-by: Zac Dover <zac.dover@proton.me>