git.apps.os.sepia.ceph.com Git - ceph-ci.git/log
ceph-ci.git
8 days agoMerge pull request #65388 from jamiepryde/main-choose-pdw-over-rmw
SrinivasaBharathKanta [Sun, 14 Sep 2025 10:50:21 +0000 (16:20 +0530)]
Merge pull request #65388 from jamiepryde/main-choose-pdw-over-rmw

osd: Choose PDW over traditional RMW if number of reads required is equal

8 days agoMerge pull request #65181 from qiuxinyidian/lttng
SrinivasaBharathKanta [Sun, 14 Sep 2025 10:49:14 +0000 (16:19 +0530)]
Merge pull request #65181 from qiuxinyidian/lttng

rados/lttng: add lttng open *tp.so error message

8 days agoMerge pull request #65172 from shraddhaag/wip-shraddhaag-availability-frequency-config
SrinivasaBharathKanta [Sun, 14 Sep 2025 10:48:59 +0000 (16:18 +0530)]
Merge pull request #65172 from shraddhaag/wip-shraddhaag-availability-frequency-config

mon: add config option to change availability score update interval

10 days agoMerge PR #65499 into main wip-vshankar-testing-sanity-1
Patrick Donnelly [Fri, 12 Sep 2025 19:57:39 +0000 (15:57 -0400)]
Merge PR #65499 into main

* refs/pull/65499/head:
script/redmine-upkeep: reset merge state after PR change

Reviewed-by: Adam C. Emerson <aemerson@redhat.com>
10 days agoMerge pull request #65350 from adamemerson/wip-perfcounters-unique-string
Adam Emerson [Fri, 12 Sep 2025 15:59:41 +0000 (11:59 -0400)]
Merge pull request #65350 from adamemerson/wip-perfcounters-unique-string

common: Allow PerfCounters to return a provided service ID

Reviewed-by: Matt Benjamin <mbenjamin@redhat.com>
Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
10 days agoMerge pull request #65507 from MaxKellermann/log__includes
Casey Bodley [Fri, 12 Sep 2025 15:55:37 +0000 (11:55 -0400)]
Merge pull request #65507 from MaxKellermann/log__includes

log: add missing includes

Reviewed-by: Casey Bodley <cbodley@redhat.com>
10 days agoMerge pull request #64829 from aclamk/aclamk-jenkins-fix-make-check
Adam Kupczyk [Fri, 12 Sep 2025 12:07:30 +0000 (14:07 +0200)]
Merge pull request #64829 from aclamk/aclamk-jenkins-fix-make-check

common/ceph_context.h: reserve space for breakpad in CephContext

10 days agolog: add missing includes
Max Kellermann [Fri, 12 Sep 2025 06:46:24 +0000 (08:46 +0200)]
log: add missing includes

Signed-off-by: Max Kellermann <max.kellermann@ionos.com>
10 days agoMerge pull request #64271 from benhanokh/full_dedup
Adam Emerson [Fri, 12 Sep 2025 06:11:52 +0000 (02:11 -0400)]
Merge pull request #64271 from benhanokh/full_dedup

rgw/dedup: Tech preview of full dedup support which means duplicate objects will be removed

Reviewed-by: Yuval Lifshitz <ylifshit@ibm.com>
10 days agoMerge pull request #64341 from mertsunacoglu/wip-swift-conditional-write
anrao19 [Fri, 12 Sep 2025 05:56:02 +0000 (11:26 +0530)]
Merge pull request #64341 from mertsunacoglu/wip-swift-conditional-write

fix: Swift API conditional write if_match, if_none_match

11 days agoMerge PR #64434 into main
Venky Shankar [Fri, 12 Sep 2025 03:52:21 +0000 (09:22 +0530)]
Merge PR #64434 into main

* refs/pull/64434/head:

Reviewed-by: Juan Miguel Olmo <jolmomar@redhat.com>
Reviewed-by: Venky Shankar <vshankar@redhat.com>
11 days agoMerge pull request #65010 from irq0/pr/verify-barbican-keyid-before-sending
Adam Emerson [Fri, 12 Sep 2025 02:18:02 +0000 (22:18 -0400)]
Merge pull request #65010 from irq0/pr/verify-barbican-keyid-before-sending

rgw: Verify key id before sending to Barbican

Reviewed-by: Tobias Urdin <tobias.urdin@binero.com>
11 days agoMerge pull request #64569 from cbodley/wip-71607
Adam Emerson [Fri, 12 Sep 2025 02:15:59 +0000 (22:15 -0400)]
Merge pull request #64569 from cbodley/wip-71607

rgw: DeleteObject supports sigv4 with Transfer-Encoding: chunked

Reviewed-by: Robin H. Johnson <robbat2@orbis-terrarum.net>
Reviewed-by: Casey Bodley <cbodley@redhat.com>
11 days agoscript/redmine-upkeep: reset merge state after PR change
Patrick Donnelly [Thu, 11 Sep 2025 17:44:12 +0000 (13:44 -0400)]
script/redmine-upkeep: reset merge state after PR change

This prevents mischief like: https://tracker.ceph.com/issues/70916?tab=history#note-34

Signed-off-by: Patrick Donnelly <pdonnell@ibm.com>
11 days agoMerge pull request #64504 from NitzanMordhai/wip-nitzan-rados-suites-remove-cache...
Yuri Weinstein [Thu, 11 Sep 2025 17:48:06 +0000 (10:48 -0700)]
Merge pull request #64504 from NitzanMordhai/wip-nitzan-rados-suites-remove-cache-tier-test

qa/workunits/rados: remove cache tier test

Reviewed-by: Kefu Chai <tchaikov@gmail.com>
Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
11 days agoMerge PR #64324 into main
Venky Shankar [Thu, 11 Sep 2025 16:06:35 +0000 (21:36 +0530)]
Merge PR #64324 into main

* refs/pull/64324/head:
qa/cephfs: fix test_subvolume_group_charmap_inheritance test
doc: add name mangling documentation for subvolume group creation
qa: add tests for name mangling in subvolume group creation
pybind/mgr: add name mangling options to subvolume group creation

Reviewed-by: Anthony D Atri <anthony.datri@gmail.com>
Reviewed-by: Patrick Donnelly <pdonnell@ibm.com>
Reviewed-by: Venky Shankar <vshankar@redhat.com>
11 days agoMerge pull request #64992 from rhcs-dashboard/tiering-enhancement
afreen23 [Thu, 11 Sep 2025 14:23:40 +0000 (19:53 +0530)]
Merge pull request #64992 from rhcs-dashboard/tiering-enhancement

mgr/dashboard: Tiering form - Placement Target in Advanced Section

Reviewed-by: Afreen Misbah <afreen@ibm.com>
Reviewed-by: pujashahu <pshahu@redhat.com>
11 days agoMerge pull request #65115 from rhcs-dashboard/72628-fix-inputfield-css-carbon
afreen23 [Thu, 11 Sep 2025 12:58:51 +0000 (18:28 +0530)]
Merge pull request #65115 from rhcs-dashboard/72628-fix-inputfield-css-carbon

mgr/dashboard : fix css for carbon input fields

Reviewed-by: Afreen Misbah <afreen@ibm.com>
11 days agoMerge pull request #64683 from tobias-urdin/rgwadmin-max-entries-fix
Casey Bodley [Thu, 11 Sep 2025 12:45:31 +0000 (08:45 -0400)]
Merge pull request #64683 from tobias-urdin/rgwadmin-max-entries-fix

radosgw-admin: Pass max_entries for bucket list

Reviewed-by: Adam Emerson <aemerson@redhat.com>
11 days agoMerge pull request #65321 from guits/support-extra-dmcrypt-options
Guillaume Abrioux [Thu, 11 Sep 2025 08:21:32 +0000 (10:21 +0200)]
Merge pull request #65321 from guits/support-extra-dmcrypt-options

ceph-volume: support additional dmcrypt params

11 days agoMerge pull request #64855 from rhcs-dashboard/72431-mirrored-images-have-incorrect...
afreen23 [Thu, 11 Sep 2025 07:43:17 +0000 (13:13 +0530)]
Merge pull request #64855 from rhcs-dashboard/72431-mirrored-images-have-incorrect-usage-info

mgr/dashboard : Fixed mirrored image usage info bar

Reviewed-by: Nizamudeen A <nia@redhat.com>
11 days agoMerge pull request #65312 from rhcs-dashboard/72788-group-similar-alerts
afreen23 [Thu, 11 Sep 2025 07:38:19 +0000 (13:08 +0530)]
Merge pull request #65312 from rhcs-dashboard/72788-group-similar-alerts

mgr/dashboard: Group similar alerts

Reviewed-by: Afreen Misbah <afreen@ibm.com>
Reviewed-by: Aashish Sharma <aasharma@redhat.com>
11 days agoMerge pull request #64190 from SundownRises/carbon-toast
afreen23 [Thu, 11 Sep 2025 07:34:33 +0000 (13:04 +0530)]
Merge pull request #64190 from SundownRises/carbon-toast

mgr/dashboard: Carbonised Toast Notification

Reviewed-by: Afreen Misbah <afreen@ibm.com>
Reviewed-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>
12 days agoqa/cephfs: fix test_subvolume_group_charmap_inheritance test
Venky Shankar [Thu, 11 Sep 2025 03:35:19 +0000 (03:35 +0000)]
qa/cephfs: fix test_subvolume_group_charmap_inheritance test

Signed-off-by: Venky Shankar <vshankar@redhat.com>
12 days agoMerge pull request #65158 from jzhu116-bloomberg/wip-72667
Casey Bodley [Wed, 10 Sep 2025 20:39:16 +0000 (16:39 -0400)]
Merge pull request #65158 from jzhu116-bloomberg/wip-72667

rgw: discard olh_ attributes when copying object from a versioning-suspended bucket to a versioning-disabled bucket

Reviewed-by: Casey Bodley <cbodley@redhat.com>
12 days agoMerge pull request #65393 from MaxKellermann/rgw_includes
Casey Bodley [Wed, 10 Sep 2025 20:00:37 +0000 (16:00 -0400)]
Merge pull request #65393 from MaxKellermann/rgw_includes

rgw: include cleanup

Reviewed-by: Casey Bodley <cbodley@redhat.com>
12 days agoMerge pull request #64800 from BBoozmen/wip-72375-bulk_delete_olh
Casey Bodley [Wed, 10 Sep 2025 19:35:49 +0000 (15:35 -0400)]
Merge pull request #64800 from BBoozmen/wip-72375-bulk_delete_olh

RGW: multi object delete op; skip olh update for all deletes but the last one

Reviewed-by: Casey Bodley <cbodley@redhat.com>
12 days agodoc: update documentation igolikov-subvolume-68929-3
Igor Golikov [Sun, 13 Jul 2025 11:14:21 +0000 (11:14 +0000)]
doc: update documentation

Fixes: https://tracker.ceph.com/issues/68931
Signed-off-by: Igor Golikov <igolikov@ibm.com>
12 days agotest: add subvolume metrics sanity test
Igor Golikov [Thu, 7 Aug 2025 16:35:47 +0000 (16:35 +0000)]
test: add subvolume metrics sanity test

Signed-off-by: Igor Golikov <igolikov@ibm.com>
Fixes: https://tracker.ceph.com/issues/68929
12 days agoqa/cephfs: run selective test classes from basic volumes test
Venky Shankar [Mon, 8 Sep 2025 06:40:35 +0000 (06:40 +0000)]
qa/cephfs: run selective test classes from basic volumes test

Signed-off-by: Venky Shankar <vshankar@redhat.com>
12 days agoqa/cephfs: use fuse mount for volumes/subvolume tests
Venky Shankar [Fri, 29 Aug 2025 07:15:09 +0000 (07:15 +0000)]
qa/cephfs: use fuse mount for volumes/subvolume tests

Using the kernel client is a) not really required for the existing
volume/subvolume tests and b) not an option for per-subvolume metrics,
which are only supported by the user-space client library.

Signed-off-by: Venky Shankar <vshankar@redhat.com>
12 days agomds, messages: include subvolume metric count in log dumps and message exchanges
Venky Shankar [Fri, 29 Aug 2025 17:59:05 +0000 (17:59 +0000)]
mds, messages: include subvolume metric count in log dumps and message exchanges

Signed-off-by: Venky Shankar <vshankar@redhat.com>
12 days agomds: remove unneeded SubvolumeMetric field from `struct Metric`
Venky Shankar [Fri, 29 Aug 2025 17:56:21 +0000 (17:56 +0000)]
mds: remove unneeded SubvolumeMetric field from `struct Metric`

Signed-off-by: Venky Shankar <vshankar@redhat.com>
12 days agomds: add metric debug log in refresh_subvolume_metrics_for_rank()
Venky Shankar [Fri, 29 Aug 2025 17:55:03 +0000 (17:55 +0000)]
mds: add metric debug log in refresh_subvolume_metrics_for_rank()

Signed-off-by: Venky Shankar <vshankar@redhat.com>
12 days agomgr,stats: integrate subvolume metrics
Igor Golikov [Thu, 10 Jul 2025 10:21:56 +0000 (10:21 +0000)]
mgr,stats: integrate subvolume metrics

mgr and stats support for the new subvolume metrics via existing perf
queries mechanism

Fixes: https://tracker.ceph.com/issues/68932
Signed-off-by: Igor Golikov <igolikov@ibm.com>
12 days agomds: aggregate and expose subvolume metrics
Igor Golikov [Thu, 10 Jul 2025 10:18:57 +0000 (10:18 +0000)]
mds: aggregate and expose subvolume metrics

rank0 periodically receives subvolume metrics from the other MDS instances
and aggregates them using a sliding window.
The MetricsAggregator exposes PerfCounters and PerfQueries for these
metrics.
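
The sliding-window aggregation mentioned above can be pictured with a minimal Python sketch. The class name `SlidingWindow`, the window length, and the `(timestamp, value)` sample layout are assumptions made for illustration; the actual MDS code is C++ and is not shown in this log.

```python
from collections import deque

class SlidingWindow:
    """Keep samples newer than `window_sec` and report their sum (hypothetical helper)."""

    def __init__(self, window_sec):
        self.window_sec = window_sec
        self.samples = deque()  # (timestamp, value) pairs, oldest first

    def add(self, timestamp, value):
        self.samples.append((timestamp, value))
        self._expire(timestamp)

    def _expire(self, now):
        # Drop samples that have fallen out of the window.
        while self.samples and now - self.samples[0][0] >= self.window_sec:
            self.samples.popleft()

    def total(self, now):
        self._expire(now)
        return sum(value for _, value in self.samples)
```

Old samples age out as time advances, so the reported total reflects only recent activity rather than a lifetime counter.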

Fixes: https://tracker.ceph.com/issues/68931
Signed-off-by: Igor Golikov <igolikov@ibm.com>
12 days agoclient,mds: add support for subvolume level metrics
Igor Golikov [Thu, 10 Jul 2025 10:17:36 +0000 (10:17 +0000)]
client,mds: add support for subvolume level metrics

Add support for client-side metrics collection using the SimpleIOMetric
struct and aggregation using the AggregatedIOMetrics struct.
The client holds a SimpleIOMetrics vector for each subvolume it recognizes
(via caps/metadata messages), aggregates them into the
AggregatedIOMetric struct, and sends it periodically to the MDS, along
with the regular client metrics.
The MDS holds a map of subvolume_path -> vector<AggregatedIOMetrics> and sends
it periodically to rank0 for further aggregation and exposure.
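
As a rough illustration of the client-side step, the Python sketch below folds raw per-subvolume samples into one aggregate record per subvolume. Only the names `SimpleIOMetric` and `AggregatedIOMetrics` come from the commit message; every field and the `aggregate` helper are assumptions, and the real implementation is C++.

```python
from dataclasses import dataclass

@dataclass
class SimpleIOMetric:
    # Field names are assumed for illustration.
    is_write: bool
    latency_us: int
    size_bytes: int

@dataclass
class AggregatedIOMetrics:
    # Field names are assumed for illustration.
    read_count: int = 0
    write_count: int = 0
    read_bytes: int = 0
    write_bytes: int = 0
    total_latency_us: int = 0

    def add(self, sample):
        if sample.is_write:
            self.write_count += 1
            self.write_bytes += sample.size_bytes
        else:
            self.read_count += 1
            self.read_bytes += sample.size_bytes
        self.total_latency_us += sample.latency_us

def aggregate(samples_by_subvolume):
    """Fold each subvolume's raw samples into one aggregate record."""
    result = {}
    for subvolume, samples in samples_by_subvolume.items():
        agg = AggregatedIOMetrics()
        for sample in samples:
            agg.add(sample)
        result[subvolume] = agg
    return result
```

Sending one small aggregate per subvolume instead of every raw sample keeps the periodic metrics message compact.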

Fixes: https://tracker.ceph.com/issues/68929, https://tracker.ceph.com/issues/68930
Signed-off-by: Igor Golikov <igolikov@ibm.com>
12 days agoMerge pull request #65470 from rhcs-dashboard/add-user-agent
Nizamudeen A [Wed, 10 Sep 2025 15:55:26 +0000 (21:25 +0530)]
Merge pull request #65470 from rhcs-dashboard/add-user-agent

monitoring: add user-agent headers to the urllib

12 days agoMerge pull request #65428 from rkachach/fix_rgw_docs_certmgr
Redouane Kachach [Wed, 10 Sep 2025 14:26:57 +0000 (16:26 +0200)]
Merge pull request #65428 from rkachach/fix_rgw_docs_certmgr

doc: update RGW HTTPS configuration to use certmgr and new fields

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
Reviewed-by: Adam King <adking@redhat.com>
12 days agoMerge pull request #65406 from rkachach/fix_grafana_docs
Redouane Kachach [Wed, 10 Sep 2025 14:23:48 +0000 (16:23 +0200)]
Merge pull request #65406 from rkachach/fix_grafana_docs

doc: update Grafana certificate configuration to use certmgr

Reviewed-by: Adam King <adking@redhat.com>
12 days agomonitoring: add user-agent headers to the urllib
Nizamudeen A [Wed, 10 Sep 2025 13:00:36 +0000 (18:30 +0530)]
monitoring: add user-agent headers to the urllib

Requests for the documentation suddenly started returning 403. Add a
User-Agent header to the request.
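
A minimal sketch of the idea using the standard library: `urllib.request.Request` accepts a `headers` mapping, which replaces the default `Python-urllib/x.y` agent string that some servers reject. The helper name `build_request` and the agent string are assumptions, not the values used in the actual change.

```python
import urllib.request

def build_request(url, user_agent="ceph-monitoring-check/1.0"):
    """Return a Request carrying an explicit User-Agent header (hypothetical helper)."""
    return urllib.request.Request(url, headers={"User-Agent": user_agent})

# The request would then be opened with urllib.request.urlopen(req).
req = build_request("https://docs.ceph.com/")
```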

Signed-off-by: Nizamudeen A <nia@redhat.com>
12 days agoMerge PR #65320 into main
Venky Shankar [Wed, 10 Sep 2025 06:44:11 +0000 (12:14 +0530)]
Merge PR #65320 into main

* refs/pull/65320/head:

Reviewed-by: Anoop C S <anoopcs@cryptolab.net>
13 days agoMerge pull request #64463 from ljflores/wip-qa-summary-script
Laura Flores [Tue, 9 Sep 2025 22:11:41 +0000 (17:11 -0500)]
Merge pull request #64463 from ljflores/wip-qa-summary-script

script: add script to help format QA review summaries

13 days agoMerge pull request #58926 from TRYTOBE8TME/wip-shard-id-option
Daniel Gryniewicz [Tue, 9 Sep 2025 16:08:01 +0000 (12:08 -0400)]
Merge pull request #58926 from TRYTOBE8TME/wip-shard-id-option

src/rgw: Adding "sync error trim" option

13 days agomgr/dashboard: Tiering form - Placement Target in Advanced Section
Dnyaneshwari [Wed, 20 Aug 2025 04:46:21 +0000 (10:16 +0530)]
mgr/dashboard: Tiering form - Placement Target in Advanced Section

Fixes: https://tracker.ceph.com/issues/72545
Signed-off-by: Dnyaneshwari Talwekar <dtalweka@redhat.com>
13 days agomgr/dashboard: Carbonised Toast Notification
Anikait Sehwag [Wed, 25 Jun 2025 06:48:44 +0000 (12:18 +0530)]
mgr/dashboard: Carbonised Toast Notification
Used carbon toast component to carbonise toast notifications

Dashboard: Toast Notification carbonised
This PR replaces the existing ngx-toastr implementation with Carbon Design System toast notifications to maintain UI consistency across the Ceph dashboard application.

Fixes: https://tracker.ceph.com/issues/71735
Signed-off-by: Anikait Sehwag <anikaitsehwag.amg@gmail.com>
13 days agolibcephfs_proxy: fix userperm pointer decoding for older protocols
Xavi Hernandez [Mon, 1 Sep 2025 12:43:26 +0000 (14:43 +0200)]
libcephfs_proxy: fix userperm pointer decoding for older protocols

The random data used to decode pointers coming from the old protocol was
taken from the client instead of using the global_random data, which is
the correct one.

Fixes: https://tracker.ceph.com/issues/72800
Signed-off-by: Xavi Hernandez <xhernandez@gmail.com>
13 days agolibcephfs_proxy: remove unnecessary protocol references in daemon
Xavi Hernandez [Mon, 1 Sep 2025 09:58:30 +0000 (11:58 +0200)]
libcephfs_proxy: remove unnecessary protocol references in daemon

With the new protocol structure definitions, it's not necessary to
explicitly access each field inside its version substructure (v0, for
example). Now all fields of the latest version are declared inside an
anonymous substructure that can be accessed without a prefix.
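
The effect of an anonymous substructure can be mimicked in Python with `ctypes`, whose `_anonymous_` attribute promotes the fields of a nested structure to the enclosing one. This is only an analogy for the C definitions in libcephfs_proxy; the structure and field names below are hypothetical.

```python
import ctypes

class RequestV0(ctypes.Structure):
    # Hypothetical fields of the latest protocol version.
    _fields_ = [("cmount", ctypes.c_uint64), ("flags", ctypes.c_uint32)]

class Request(ctypes.Structure):
    # Listing "v0" in _anonymous_ exposes its fields directly on Request.
    _anonymous_ = ("v0",)
    _fields_ = [("v0", RequestV0)]

req = Request()
req.flags = 7          # no "v0." prefix needed
print(req.v0.flags)    # the prefixed path still reaches the same storage
```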

Fixes: https://tracker.ceph.com/issues/72800
Signed-off-by: Xavi Hernandez <xhernandez@gmail.com>
13 days agolibcephfs_proxy: remove unnecessary protocol references in client
Xavi Hernandez [Mon, 1 Sep 2025 09:41:10 +0000 (11:41 +0200)]
libcephfs_proxy: remove unnecessary protocol references in client

With the new protocol structure definitions, it's not necessary to
explicitly access each field inside its version substructure (v0, for
example). Now all fields of the latest version are declared inside an
anonymous substructure that can be accessed without a prefix.

Fixes: https://tracker.ceph.com/issues/72800
Signed-off-by: Xavi Hernandez <xhernandez@gmail.com>
13 days agolibcephfs_proxy: fix protocol structures for backward compatibility
Xavi Hernandez [Mon, 1 Sep 2025 09:22:05 +0000 (11:22 +0200)]
libcephfs_proxy: fix protocol structures for backward compatibility

The structures used for transferring data between the proxy client and
the proxy daemon had been reworked in a recent change to be able to
expand the protocol. This caused an inconsistency in the size of the
data transferred when communicating with a peer using the older version.
The result was that the peer receiving the data with an unexpected size
was closing the connection, causing unexpected errors.

The discrepancy in size is the result of how compilers pad structures
combined with the change in the structure layout introduced when
extending the protocol. With these changes, the computation of the size
of each version of the structures was not done correctly.

This change makes the layout equal to the older version, so that
computing the size of the structures becomes easier and doesn't depend
on unexpected paddings.
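
To see how padding can make a structure's size depend on its layout, the following Python sketch uses `ctypes`, which applies the platform's native C alignment rules. The two structures and their fields are hypothetical and are not the proxy protocol's real definitions.

```python
import ctypes

class HeaderFirst(ctypes.Structure):
    # Hypothetical layout: a 1-byte field ahead of an 8-byte field
    # typically forces padding before "handle" and after "flags".
    _fields_ = [
        ("op", ctypes.c_uint8),
        ("handle", ctypes.c_uint64),
        ("flags", ctypes.c_uint8),
    ]

class LargestFirst(ctypes.Structure):
    # Same fields, reordered so the small fields share one padded tail.
    _fields_ = [
        ("handle", ctypes.c_uint64),
        ("op", ctypes.c_uint8),
        ("flags", ctypes.c_uint8),
    ]

def padding_bytes(struct):
    """Bytes the native layout adds beyond the raw field sizes."""
    raw = sum(ctypes.sizeof(ftype) for _, ftype in struct._fields_)
    return ctypes.sizeof(struct) - raw

for s in (HeaderFirst, LargestFirst):
    print(s.__name__, ctypes.sizeof(s), padding_bytes(s))
```

Both structures carry the same ten bytes of data, yet their sizes can differ, which is why a size computed by summing fields may not match what a peer actually sends.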

Fixes: https://tracker.ceph.com/issues/72800
Signed-off-by: Xavi Hernandez <xhernandez@gmail.com>
13 days agoMerge pull request #63895 from Kushal-deb/rgw-qat-compression
Adam King [Tue, 9 Sep 2025 13:07:35 +0000 (09:07 -0400)]
Merge pull request #63895 from Kushal-deb/rgw-qat-compression

cephadm: improve hw qat experience with cephadm

Reviewed-by: Adam King <adking@redhat.com>
13 days agoMerge pull request #65387 from yuvalif/wip-yuval-72542
Yuval Lifshitz [Tue, 9 Sep 2025 12:45:05 +0000 (15:45 +0300)]
Merge pull request #65387 from yuvalif/wip-yuval-72542

rgw/logging: allow committing empty objects

13 days agoMerge pull request #64844 from ljflores/wip-tracker-72312
Radoslaw Zarzynski [Tue, 9 Sep 2025 12:28:21 +0000 (14:28 +0200)]
Merge pull request #64844 from ljflores/wip-tracker-72312

qa/tasks/thrashosds-health: fine tune ignorelist for degraded and undersized pgs

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
13 days agomgr/dashboard: Group similar alerts
Abhishek Desai [Fri, 29 Aug 2025 14:29:09 +0000 (19:59 +0530)]
mgr/dashboard: Group similar alerts
Fixes: https://tracker.ceph.com/issues/72788

Signed-off-by: Abhishek Desai <abhishek.desai1@ibm.com>
13 days agoMerge pull request #64375 from SundownRises/header-component
afreen23 [Tue, 9 Sep 2025 10:58:38 +0000 (16:28 +0530)]
Merge pull request #64375 from SundownRises/header-component

mgr/dashboard: Carbonised Notification Header

Reviewed-by: Afreen Misbah <afreen@ibm.com>
13 days agodoc: update RGW HTTPS configuration to use certmgr and new fields
Redouane Kachach [Mon, 8 Sep 2025 13:20:27 +0000 (15:20 +0200)]
doc: update RGW HTTPS configuration to use certmgr and new fields

With the introduction of certmgr, RGW services now support three
certificate sources: cephadm-signed (default), inline, and reference.
Docs have been updated to:

- Show how to provide inline certificates using the new ssl_cert/ssl_key
  fields instead of the deprecated rgw_frontend_ssl_certificate.
- Explain how to register and reference user-provided certs/keys
- Clarify that cephadm-signed certificates remain the default, with
  optional wildcard SANs support.

The usage of rgw_frontend_ssl_certificate is still supported for
backward compatibility, but is now documented as deprecated.

Signed-off-by: Redouane Kachach <rkachach@ibm.com>
13 days agoMerge pull request #59515 from kamoltat/wip-ksirivad-fix-67801
SrinivasaBharathKanta [Tue, 9 Sep 2025 10:38:10 +0000 (16:08 +0530)]
Merge pull request #59515 from kamoltat/wip-ksirivad-fix-67801

mon [stretch mode]: restrict changing mon election strategy post stretch mode

13 days agoMerge PR #63636 into main
Venky Shankar [Tue, 9 Sep 2025 10:24:01 +0000 (15:54 +0530)]
Merge PR #63636 into main

* refs/pull/63636/head:

Reviewed-by: Christopher Hoffman <choffman@redhat.com>
Reviewed-by: Dhairya Parmar <dparmar@redhat.com>
13 days agoMerge pull request #65015 from connorfawcett/pg-autoscale-threshold-cmd
Connor Fawcett [Tue, 9 Sep 2025 09:36:19 +0000 (10:36 +0100)]
Merge pull request #65015 from connorfawcett/pg-autoscale-threshold-cmd

mgr/pg_autoscaler: Add 'osd pool get threshold' command which returns the current threshold value

13 days agoMerge pull request #64788 from rhcs-dashboard/acl-mapping
afreen23 [Tue, 9 Sep 2025 09:16:52 +0000 (14:46 +0530)]
Merge pull request #64788 from rhcs-dashboard/acl-mapping

mgr/dashboard: RGW - Storage Class ACL Mapping

Reviewed-by: Afreen Misbah <afreen@ibm.com>
13 days agoMerge PR #64958 into main
Venky Shankar [Tue, 9 Sep 2025 06:41:58 +0000 (12:11 +0530)]
Merge PR #64958 into main

* refs/pull/64958/head:

Reviewed-by: Patrick Donnelly <pdonnell@ibm.com>
Reviewed-by: Venky Shankar <vshankar@redhat.com>
13 days agoMerge PR #64967 into main
Venky Shankar [Tue, 9 Sep 2025 06:39:40 +0000 (12:09 +0530)]
Merge PR #64967 into main

* refs/pull/64967/head:

Reviewed-by: Anthony D Atri <anthony.datri@gmail.com>
Reviewed-by: Venky Shankar <vshankar@redhat.com>
13 days agoMerge pull request #65436 from gbregman/main
Gil Bregman [Tue, 9 Sep 2025 05:38:27 +0000 (08:38 +0300)]
Merge pull request #65436 from gbregman/main

mgr/cephadm/nvmeof: Add fields for prometheus frequency to NVMEOF spec file

2 weeks agotest/libcephfs: validate asynchronous write and fsync executing concurrently
Venky Shankar [Mon, 2 Jun 2025 05:08:01 +0000 (05:08 +0000)]
test/libcephfs: validate asynchronous write and fsync executing concurrently

This synthetic reproducer does three things:

- sets up a client mount with a configuration to delay write operations and
  initiates a write operation via a thread.
- starts a thread that invokes asynchronous fsync
- starts a thread that invokes setxattr for the client to track early replies

Without the fix[0], the test reproduces the following crash:

```
/home/vshankar/ceph/src/client/Client.cc: In function 'void Client::put_request(MetaRequest*)' thread 7f7210ff9640 time 2025-06-03T09:34:45.634974+0000
/home/vshankar/ceph/src/client/Client.cc: 2290: FAILED ceph_assert(request->ref >= 1)
 ceph version 20.3.0-673-gdd152807f7e (dd152807f7e7f7a82df6cfc0159f5fc65f60ecd5) tentacle (dev - Debug)
 1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x119) [0x7f72222ebb98]
 2: (ceph::__ceph_assert_fail(ceph::assert_data const&)+0x17) [0x7f72222ebedc]
 3: /home/vshankar/ceph/build/lib/libcephfs.so.2(+0x6a075) [0x7f7222e6a075]
 4: /home/vshankar/ceph/build/lib/libcephfs.so.2(+0xb8289) [0x7f7222eb8289]
 5: /home/vshankar/ceph/build/lib/libcephfs.so.2(+0xee951) [0x7f7222eee951]
 6: /home/vshankar/ceph/build/lib/libcephfs.so.2(+0xf167c) [0x7f7222ef167c]
 7: (Context::complete(int)+0x9) [0x7f7222e5949d]
 8: /home/vshankar/ceph/build/lib/libcephfs.so.2(+0x16a853) [0x7f7222f6a853]
 9: /home/vshankar/ceph/build/lib/libcephfs.so.2(+0xa7cc5) [0x7f7222ea7cc5]
 10: /home/vshankar/ceph/build/lib/libcephfs.so.2(+0xf128d) [0x7f7222ef128d]
 11: /home/vshankar/ceph/build/lib/libcephfs.so.2(+0x16e09d) [0x7f7222f6e09d]
 12: (Context::complete(int)+0x9) [0x7f7222e5949d]
 13: /home/vshankar/ceph/build/lib/libcephfs.so.2(+0x6d108) [0x7f7222e6d108]
 14: (Context::complete(int)+0x9) [0x7f7222e5949d]
 15: (Finisher::finisher_thread_entry()+0x665) [0x7f722226fdc1]
 16: (Finisher::FinisherThread::entry()+0xd) [0x7f7222270ddf]
 17: (Thread::entry_wrapper()+0x2f) [0x7f72222b88f5]
 18: (Thread::_entry_func(void*)+0x9) [0x7f72222b8907]
 19: /lib64/libc.so.6(+0x89e92) [0x7f7221089e92]
 20: /lib64/libc.so.6(+0x10ef20) [0x7f722110ef20]
[1]    2162689 IOT instruction (core dumped)  ./bin/ceph_test_libcephfs --gtest_filter=LibCephFS.ConcurrentWriteAndFsync
```

[0]: https://github.com/ceph/ceph/pull/63619

Fixes: http://tracker.ceph.com/issues/71515
Signed-off-by: Venky Shankar <vshankar@redhat.com>
2 weeks agoclient: catch buggy reference count drop for MetaRequest
Venky Shankar [Tue, 3 Jun 2025 10:04:44 +0000 (10:04 +0000)]
client: catch buggy reference count drop for MetaRequest

The prior commit introduces a synthetic delay in the write
operation so that a test reproducer can interleave an
asynchronous fsync with an operation that makes the MDS send an early
reply to the client (so that the client tracks the early-replied
response for an inode in Inode::unsafe_ops). This is
enough to trick the client into the code path that causes a buggy
reference drop for the request (MetaRequest), but hitting the
_exact_ crash backtrace requires the request to be in various
[x]lists.

This last bit is tricky to synthetically massage in the test. So,
in order to catch the buggy reference drop, it would suffice to
assert on the reference count dropping to less than zero (0).
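
The guard can be pictured with a small Python sketch modeled on the `ceph_assert(request->ref >= 1)` check that appears in the crash output of the accompanying test commit. The class `MetaRequestSketch` and its methods are hypothetical stand-ins, not the client's actual C++ code.

```python
class MetaRequestSketch:
    """Hypothetical reference-counted request."""

    def __init__(self):
        self.ref = 1  # the creator holds the first reference

    def get(self):
        self.ref += 1
        return self

    def put(self):
        # Fail loudly if a reference is dropped more often than it was taken.
        if self.ref < 1:
            raise AssertionError("buggy reference drop: ref < 1")
        self.ref -= 1
        return self.ref == 0  # True when the last reference is gone
```

An extra `put()` after the count reaches zero raises immediately, so the bug surfaces at the faulty drop instead of as a later use-after-free.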

Signed-off-by: Venky Shankar <vshankar@redhat.com>
2 weeks agoclient: synthetically delay write operation
Venky Shankar [Mon, 2 Jun 2025 05:05:44 +0000 (05:05 +0000)]
client: synthetically delay write operation

Allow the client to hold Fb caps for an extended period of
time so that an asynchronous fsync can intervene and block, in
order to hunt [0].

[0]: https://tracker.ceph.com/issues/71510

Signed-off-by: Venky Shankar <vshankar@redhat.com>
2 weeks agoclient: log unsafe operation count (for debugging)
Venky Shankar [Mon, 2 Jun 2025 05:04:46 +0000 (05:04 +0000)]
client: log unsafe operation count (for debugging)

Signed-off-by: Venky Shankar <vshankar@redhat.com>
2 weeks agolibcephfs/client: asynchronous fsync interface
Venky Shankar [Mon, 2 Jun 2025 05:03:50 +0000 (05:03 +0000)]
libcephfs/client: asynchronous fsync interface

Mostly for writing a test to hunt [0].

[0]: https://tracker.ceph.com/issues/71510

Signed-off-by: Venky Shankar <vshankar@redhat.com>
2 weeks agoMerge pull request #64999 from rishabh-d-dave/fs-pyx-chown
Rishabh Dave [Tue, 9 Sep 2025 03:54:03 +0000 (09:24 +0530)]
Merge pull request #64999 from rishabh-d-dave/fs-pyx-chown

cephfs.pyx: handle when UID/GID passed to chown() is -1

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2 weeks agoMerge pull request #64927 from NitzanMordhai/wip-nitzan-suites-ignore-still-running...
SrinivasaBharathKanta [Tue, 9 Sep 2025 03:42:39 +0000 (09:12 +0530)]
Merge pull request #64927 from NitzanMordhai/wip-nitzan-suites-ignore-still-running-cephadm-osds-suites

suites/rados/cephadm: typo in ignore list for still running message

2 weeks agoMerge pull request #65431 from afreen23/doc-release-notes
afreen23 [Mon, 8 Sep 2025 19:49:36 +0000 (01:19 +0530)]
Merge pull request #65431 from afreen23/doc-release-notes

Update dashboard Pending release notes

Reviewed-by: Anthony D Atri <anthony.datri@gmail.com>
Reviewed-by: Yuri Weinstein <yweinste@redhat.com>
2 weeks agoMerge pull request #65419 from ljflores/wip-tracker-72897
Ilya Dryomov [Mon, 8 Sep 2025 19:15:10 +0000 (21:15 +0200)]
Merge pull request #65419 from ljflores/wip-tracker-72897

doc/rados/operations: add kernel client procedure to read balancer documentation

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
2 weeks agoMerge pull request #64837 from rzarzynski/wip-bug-72412
Laura Flores [Mon, 8 Sep 2025 18:50:40 +0000 (13:50 -0500)]
Merge pull request #64837 from rzarzynski/wip-bug-72412

osd: stop scrub_purged_snaps() from ignoring osd_beacon_report_interval

2 weeks agomgr/cephadm/nvmeof: Add fields for prometheus frequency to NVMEOF spec file.
Gil Bregman [Mon, 8 Sep 2025 16:29:46 +0000 (19:29 +0300)]
mgr/cephadm/nvmeof: Add fields for prometheus frequency to NVMEOF spec file.

Fixes: https://tracker.ceph.com/issues/72805
Signed-off-by: Gil Bregman <gbregman@il.ibm.com>
2 weeks agoMerge pull request #65384 from rhcs-dashboard/fix-72868-main
afreen23 [Mon, 8 Sep 2025 18:30:22 +0000 (00:00 +0530)]
Merge pull request #65384 from rhcs-dashboard/fix-72868-main

mgr/dashboard: fix RGW Bucket Notification Dashboard units

Reviewed-by: Afreen Misbah <afreen@ibm.com>
Reviewed-by: Ankush Behl <cloudbehl@gmail.com>
2 weeks agorgw: Record the `service_unique_id`, if present, in the ServiceMap wip-perfcounters-unique-string
Adam C. Emerson [Mon, 8 Sep 2025 18:19:20 +0000 (14:19 -0400)]
rgw: Record the `service_unique_id`, if present, in the ServiceMap

For consistency and ease of associating the two.

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
2 weeks agodoc: Update dashboard Pending release notes
Afreen Misbah [Mon, 8 Sep 2025 11:18:55 +0000 (16:48 +0530)]
doc: Update dashboard Pending release notes

- added for tentacle
- moved the incorrect notes added in tentacle to umbrella

Signed-off-by: Afreen Misbah <afreen@ibm.com>
2 weeks agodoc/rados/operations: add kernel client procedure to read balancer documentation
Laura Flores [Fri, 5 Sep 2025 21:46:20 +0000 (16:46 -0500)]
doc/rados/operations: add kernel client procedure to read balancer documentation

As of now, the kernel client does not support `pg-upmap-primary`. I have
added some troubleshooting steps to help users who are unable to
mount images and filesystems with the kernel client while using `pg-upmap-primary`.

Once the feature is supported by the kernel client, users will be able
to perform mounts along with `pg-upmap-primary`.

Fixes: https://tracker.ceph.com/issues/72897
Signed-off-by: Laura Flores <lflores@ibm.com>
2 weeks agorgw/logging: allow committing empty objects
Yuval Lifshitz [Thu, 4 Sep 2025 10:53:07 +0000 (10:53 +0000)]
rgw/logging: allow committing empty objects

Fixes: https://tracker.ceph.com/issues/72542
Signed-off-by: Yuval Lifshitz <ylifshit@ibm.com>
2 weeks agoMerge pull request #62747 from bill-scales/issue70844
Jon Bailey [Mon, 8 Sep 2025 11:01:39 +0000 (12:01 +0100)]
Merge pull request #62747 from bill-scales/issue70844

test: add replica pool support to ceph_test_rados_io_sequence

Reviewed-by: Ronen Friedman <rfriedma@redhat.com>
Reviewed-by: Connor Fawcett <connorfa@uk.ibm.com>
2 weeks agoMerge pull request #65281 from nbalacha/wip-nbalacha-72740
Yuval Lifshitz [Mon, 8 Sep 2025 08:06:10 +0000 (11:06 +0300)]
Merge pull request #65281 from nbalacha/wip-nbalacha-72740

rgw/logging: fixes data loss during rollover

2 weeks agoMerge pull request #65231 from anthonyeleven/improve-osd-dot-cc
Anthony D'Atri [Mon, 8 Sep 2025 06:19:02 +0000 (01:19 -0500)]
Merge pull request #65231 from anthonyeleven/improve-osd-dot-cc

src/osd: Improve message in OSD.cc

2 weeks agorgw/logging: fixes data loss during rollover
N Balachandran [Thu, 28 Aug 2025 06:22:23 +0000 (11:52 +0530)]
rgw/logging: fixes data loss during rollover

Multiple threads attempting to roll over the same log object can result
in the creation of numerous orphan tail objects, each with a single record.
This occurs when a NULL RGWObjVersionTracker is used during the creation of
a new logging object. These records are inaccessible, leading to data loss,
which is particularly critical in Journal mode.
Furthermore, valid log tail objects may be added to the Garbage Collection (GC)
list, exacerbating data loss.
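
The race can be illustrated with a generic optimistic-concurrency sketch in Python: a rollover succeeds only if the caller's expected version still matches, so concurrent rollers cannot each create their own tail object. `VersionedStore` and its methods are hypothetical and do not reproduce the RGWObjVersionTracker API.

```python
import threading

class VersionedStore:
    """Hypothetical store that tracks the current log object and a version."""

    def __init__(self):
        self._lock = threading.Lock()
        self.version = 0
        self.current = "log.0"

    def rollover(self, expected_version, new_name):
        with self._lock:
            if self.version != expected_version:
                return False  # another thread already rolled over
            self.version += 1
            self.current = new_name
            return True
```

A caller that loses the race gets `False` and can re-read the current object instead of writing to an orphan.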

Fixes: https://tracker.ceph.com/issues/72740
Signed-off-by: N Balachandran <nithya.balachandran@ibm.com>
2 weeks agoMerge pull request #65421 from ronen-fr/wip-rf-ec72898
Ronen Friedman [Sun, 7 Sep 2025 14:44:10 +0000 (17:44 +0300)]
Merge pull request #65421 from ronen-fr/wip-rf-ec72898

osd/scrub: clear m_ec_digest_map between objects

Reviewed-by: Jon Bailey <jonathan.bailey1@ibm.com>
2 weeks agoMerge pull request #62106 from rkachach/fix_certmgr_v2
Redouane Kachach [Sun, 7 Sep 2025 10:49:23 +0000 (12:49 +0200)]
Merge pull request #62106 from rkachach/fix_certmgr_v2

Add cephadm-signed certificate support for all services

Reviewed-by: John Mulligan <jmulligan@redhat.com>
2 weeks agotest: add replica pool support to ceph_test_rados_io_sequence
Bill Scales [Wed, 9 Apr 2025 09:58:15 +0000 (10:58 +0100)]
test: add replica pool support to ceph_test_rados_io_sequence

Make 'ceph_test_rados_io_sequence --pool rbd' work. Replica
pools don't have an erasure code profile and do not have the
ec_allow_overwrites or ec_allow_optimizations flags.

Fixes: https://tracker.ceph.com/issues/70844
Signed-off-by: Bill Scales <bill_scales@uk.ibm.com>
Signed-off-by: Jon Bailey <jonathan.bailey1@ibm.com>
2 weeks agocommon: Added values to json::OSDPoolGetReply
Jon Bailey [Mon, 14 Jul 2025 12:52:28 +0000 (13:52 +0100)]
common: Added values to json::OSDPoolGetReply

OSDPoolGetReply actually returns many more values than are currently supplied. These have been added as optionals (since any of them may also be absent), so it is possible to query whether they exist and use them if they do.

Signed-off-by: Jon Bailey <jonathan.bailey1@ibm.com>
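
A minimal sketch of the optional-field idea, written in Python rather than the C++ of the actual change. The class PoolGetReply and its field names are hypothetical and do not reproduce the real json::OSDPoolGetReply definition. Fields that a reply may omit default to None, so a caller can test for presence before using a value.

```python
import json
from dataclasses import dataclass, fields
from typing import Optional


@dataclass
class PoolGetReply:
    """Hypothetical reply type: only `pool` is assumed to always be present."""

    pool: str
    erasure_code_profile: Optional[str] = None  # absent for replica pools
    allow_ec_overwrites: Optional[bool] = None  # absent for replica pools

    @classmethod
    def from_json(cls, text):
        raw = json.loads(text)
        known = {f.name for f in fields(cls)}
        # Keep only the keys this type models; missing ones stay None.
        return cls(**{k: v for k, v in raw.items() if k in known})


def describe(reply):
    """Use an optional value only when the reply actually carried it."""
    if reply.erasure_code_profile is None:
        return f"{reply.pool}: no erasure code profile"
    return f"{reply.pool}: profile {reply.erasure_code_profile}"
```

For example, describe(PoolGetReply.from_json('{"pool": "rbd"}')) takes the first branch because the profile key is missing from the reply.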
2 weeks agoosd/scrub: clear m_ec_digest_map between objects wip-rf-ec72898
Ronen Friedman [Sun, 7 Sep 2025 07:19:52 +0000 (02:19 -0500)]
osd/scrub: clear m_ec_digest_map between objects

Fixing a bug introduced by commit 4c61079e931
("caluculate EC digest map size only once").

Fixes: https://tracker.ceph.com/issues/72897
Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
2 weeks agoMerge pull request #65150 from rkachach/fix_issue_nvmeof_prometheus
Redouane Kachach [Sun, 7 Sep 2025 07:18:06 +0000 (09:18 +0200)]
Merge pull request #65150 from rkachach/fix_issue_nvmeof_prometheus

mgr/cephadm: fixing nvmeof scraping Prometheus config generation

Reviewed-by: Kushal Deb <Kushal.Deb@ibm.com>
2 weeks agoMerge pull request #65338 from ronen-fr/wip-rf-be-st3
Ronen Friedman [Sun, 7 Sep 2025 05:44:04 +0000 (08:44 +0300)]
Merge pull request #65338 from ronen-fr/wip-rf-be-st3

osd/scrub: modify OMAP stats collection

Reviewed-by: Jon Bailey <jonathan.bailey1@ibm.com>
Reviewed-by: Samuel Just <sjust@redhat.com>
2 weeks agoworkunits/rados: remove cache tier test
Nitzan Mordechai [Tue, 15 Jul 2025 10:58:40 +0000 (10:58 +0000)]
workunits/rados: remove cache tier test

Fixes: https://tracker.ceph.com/issues/71930
Signed-off-by: Nitzan Mordechai <nmordec@ibm.com>
2 weeks agoMerge pull request #65193 from tchaikov/wip-osd-scrub-fix-buffer-overflow
Kefu Chai [Sun, 7 Sep 2025 03:05:39 +0000 (11:05 +0800)]
Merge pull request #65193 from tchaikov/wip-osd-scrub-fix-buffer-overflow

osd/scrub: fix heap-buffer-overflow when checking digest emptiness

Reviewed-by: Jon Bailey <jonathan.bailey1@ibm.com>
Reviewed-by: Ronen Friedman <rfriedma@redhat.com>
2 weeks agomgr/cephadm: fixing nvmeof scraping Prometheus config generation
Redouane Kachach [Wed, 20 Aug 2025 12:01:28 +0000 (14:01 +0200)]
mgr/cephadm: fixing nvmeof scraping Prometheus config generation

Signed-off-by: Redouane Kachach <rkachach@ibm.com>
2 weeks agodoc/cephadm: updating certmgr docs to reflect new changes
Redouane Kachach [Tue, 15 Jul 2025 13:50:45 +0000 (15:50 +0200)]
doc/cephadm: updating certmgr docs to reflect new changes

Signed-off-by: Redouane Kachach <rkachach@ibm.com>
2 weeks agomgr/cephadm: Adding RGW migration for the new certmgr certs format
Redouane Kachach [Wed, 20 Aug 2025 13:54:51 +0000 (15:54 +0200)]
mgr/cephadm: Adding RGW migration for the new certmgr certs format

Signed-off-by: Redouane Kachach <rkachach@ibm.com>
2 weeks agomgr/cephadm: removing call to previous method to storing all certs
Redouane Kachach [Tue, 29 Apr 2025 11:02:47 +0000 (13:02 +0200)]
mgr/cephadm: removing call to previous method to storing all certs

Signed-off-by: Redouane Kachach <rkachach@ibm.com>
2 weeks agomgr/cephadm: fixing nvmeof section in cert_mgr UT + new UT
Redouane Kachach [Thu, 7 Aug 2025 13:57:41 +0000 (15:57 +0200)]
mgr/cephadm: fixing nvmeof section in cert_mgr UT + new UT

Signed-off-by: Redouane Kachach <rkachach@ibm.com>
2 weeks agomgr/cephadm: adapting UT after all the changes
Redouane Kachach [Tue, 12 Aug 2025 15:26:01 +0000 (17:26 +0200)]
mgr/cephadm: adapting UT after all the changes

Signed-off-by: Redouane Kachach <rkachach@ibm.com>
2 weeks agomgr/cephadm: using 5 years for service-discovery internal certs
Redouane Kachach [Wed, 20 Aug 2025 13:55:24 +0000 (15:55 +0200)]
mgr/cephadm: using 5 years for service-discovery internal certs

Signed-off-by: Redouane Kachach <rkachach@ibm.com>
2 weeks agomgr/cephadm: adding self-signed certificates support for nvmeof svc
Redouane Kachach [Tue, 12 Aug 2025 13:53:38 +0000 (15:53 +0200)]
mgr/cephadm: adding self-signed certificates support for nvmeof svc

Signed-off-by: Redouane Kachach <rkachach@ibm.com>