]> git-server-git.apps.pok.os.sepia.ceph.com Git - ceph-ci.git/log
ceph-ci.git
2 months agoMerge branch 'partial_write_with_clone_fix' of https://github.com/aainscow/ceph into... wip-bharath4-testing-2025-12-24-1209
skanta [Wed, 24 Dec 2025 06:39:37 +0000 (12:09 +0530)]
Merge branch 'partial_write_with_clone_fix' of https://github.com/aainscow/ceph into wip-bharath4-testing-2025-12-24-1209

2 months agoosd: Do not remove objects with divergent logs if only partial writes.
Alex Ainscow [Fri, 19 Dec 2025 09:04:55 +0000 (09:04 +0000)]
osd: Do not remove objects with divergent logs if only partial writes.

Fixes https://tracker.ceph.com/issues/74221

Note: An AI was used to assist generating unit tests for this commit.
      The production code was written by the author.

In the scenario we are fixing here, there is a divergent log, which needs to
be rolled back. The non-primary does not participate in the transaction to
the object, but the log exists describing the transaction.  The primary has
a different transaction and has correctly detected the divergence.

The primary correctly concludes that no recovery is needed for the object, since
only partial writes exist on the non-primary.

The non-primary observes its divergent log and incorrectly concludes that
recovery IS needed for the divergent write and prepares by removing that
object.

The consequence of this depends on the next operation:
1. A read will fail with -EIO
2. A RMW involving a read from the removed object  will detect the failure
   and reconstruct the necessary data.
3. A RMW not involve the write or an append will recreate the object, but with
   zeros, so will cause data corruption. A

It is unusual for such a log entry to exist on the non-primary because
normally those are omitted from the non-primary log. The scenario that causes
this when a partial write triggers a clone due to copy on write.  We now have
a clone operation which affects ALL shards and so the log entry is sent to
all shards.

This is unusual to see in the field. We must have all of the following:

1. A clone operation (these are infrequent)
2. A partial write.
3. A peering cycle must happen before this write is complete.

The combination of 1 and 3 make this a very unusual operation in teuthology
and will be even rarer in the field.

The fix ensures we skip divergent log entries for partial writes that the shard
did not participate in.

The following is a minimal script to recreate:

set -e -x

MDS=0 MON=1 OSD=4 MGR=1 ../src/vstart.sh --debug --new -x --localhost -o timeout=10000 -o session_timeout=10000 -o debug_osd=20

ceph osd pool set noautoscale
ceph balancer off
ceph osd set nodeep-scrub
ceph osd set noscrub
ceph osd set noout

ceph config set global bluestore_debug_inject_read_err true

dd if=/dev/random of=file_8k bs=8k count=1
dd if=/dev/random of=file_4k bs=4k count=1

ceph osd erasure-code-profile set alex k=2 m=2
ceph osd pool create mypool --pg_num=1 --pool_type=erasure alex
ceph osd pool set mypool allow_ec_overwrites true
ceph osd pool set mypool allow_ec_optimizations true
ceph osd pool set mypool min_size 2

rados put -p mypool test1 file_8k

acting_set=$(ceph osd map mypool test1 --format=json | jq -r '.acting[]')
acting_array=($acting_set)

shard_0_osd=${acting_array[0]}
shard_1_osd=${acting_array[1]}

echo "Shard 0 OSD: $shard_0_osd"
echo "Shard 1 OSD: $shard_1_osd"

ceph daemon osd.$shard_0_osd injectecwriteerr mypool "*" 2 1 0 1

rados -p mypool mksnap test1_snap
rados put -p mypool test1 file_4k --offset 0 &

ceph osd set noup
ceph osd down $shard_1_osd

wait

ceph osd unset noup

rados -p mypool mksnap test1_snap2
rados put -p mypool test1 file_4k --offset 0

Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
2 months agoMerge pull request #66681 from bluikko/wip-doc-rados-ops-health-checks-fixes3
bluikko [Fri, 19 Dec 2025 05:16:35 +0000 (12:16 +0700)]
Merge pull request #66681 from bluikko/wip-doc-rados-ops-health-checks-fixes3

doc/rados: Fix minor issues, improvements in health-checks.rst (3 of x)

2 months agoMerge pull request #66675 from bluikko/wip-doc-rados-ops-health-checks-fixes2
bluikko [Fri, 19 Dec 2025 05:15:00 +0000 (12:15 +0700)]
Merge pull request #66675 from bluikko/wip-doc-rados-ops-health-checks-fixes2

doc/rados: Fix minor issues and improvements in health-checks.rst (2 of x)

2 months agoMerge pull request #66662 from bluikko/wip-doc-rados-ops-health-checks-fixes
bluikko [Fri, 19 Dec 2025 05:14:34 +0000 (12:14 +0700)]
Merge pull request #66662 from bluikko/wip-doc-rados-ops-health-checks-fixes

doc/rados: Fix minor issues and improvements in health-checks.rst

2 months agoMerge pull request #66677 from bluikko/wip-doc-cephadm-services-osd-typo
bluikko [Fri, 19 Dec 2025 05:13:53 +0000 (12:13 +0700)]
Merge pull request #66677 from bluikko/wip-doc-cephadm-services-osd-typo

doc/cephadm: Fix formatting errors and improvements in osd.rst

2 months agoMerge pull request #66656 from ceph/apt-mirror-main
David Galloway [Fri, 19 Dec 2025 03:48:57 +0000 (22:48 -0500)]
Merge pull request #66656 from ceph/apt-mirror-main

install-deps: Replace apt-mirror

2 months agodoc/rados: Fix minor issues, improvements in health-checks.rst (3 of x)
Ville Ojamo [Thu, 18 Dec 2025 08:27:36 +0000 (15:27 +0700)]
doc/rados: Fix minor issues, improvements in health-checks.rst (3 of x)

Also changed in monitoring.rst:
- Add label for link.
- Linkify mention of health checks.

Fix "bluestore bluefs" admin-socket commands after #38437.

Use ":confval:" instead of simple inline literal text for config values.

Use a privileged prompt for CLI command examples requiring privileges.

Trim section title syntax underline.

Use admonition instead of emphasis on the whole paragraph.

Attempt to improve language in a few sentences.

Use double backticks consistently for literals.

Linkify, adding a label in the destination if necessary:
- Muting health checks
- BlueStore migration

Capitalize Ceph, BlueStore, RADOS.

Add paragraph linking to more info on the "ceph df" command.

Use DB or DB device instead of literal db.

Add unordered list formatting to what was being rendered as single paragraph.

Use ":program:" for linking to ceph-bluestore-tool.

Change few last double spaces after full stop to a single space.

Use code-block paragraphs instead of inline literals where appropriate.

Add missing word "is".

Signed-off-by: Ville Ojamo <14869000+bluikko@users.noreply.github.com>
2 months agodoc/cephadm: Fix formatting errors and improvements in osd.rst
Ville Ojamo [Thu, 18 Dec 2025 07:05:29 +0000 (14:05 +0700)]
doc/cephadm: Fix formatting errors and improvements in osd.rst

Use console code-block instead of literal text to avoid highlighting.

Promptify few last CLI commands that were missing prompts.

Remove lone single-item unordered list formatting.

Add missing full stops.

Linkify mentions of other sections.

Use title case consistently in section titles.

Use :confval: consistently instead of literal text.

Use double backticks consistently.

Use "YAML" and "JSON" instead of literal .yaml and .json.

Fix two occurrences of double backticks missing one backtick that lead
to broken rendering.

Use *italic* RST syntax instead of rendered underscores.

Try to improve language in simple sentences.

Remove unnecessary * characters that were rendered in CLI example
commands.

Signed-off-by: Ville Ojamo <14869000+bluikko@users.noreply.github.com>
2 months agodoc/rados: Fix minor issues and improvements in health-checks.rst (2 of x)
Ville Ojamo [Thu, 18 Dec 2025 05:52:55 +0000 (12:52 +0700)]
doc/rados: Fix minor issues and improvements in health-checks.rst (2 of x)

Use :confval: instead of simple inline literal text for config values.

Use RST **emphasis** instead of CAPS.

Use a privileged prompt for CLI command examples requiring privileges.

Capitalize Manager when talking about the service/dameon.

Use double backticks for literals instead of single backticks.

Remove spaces around slash in "and/or".

Add a paragraph referring OSD troubleshooting in OSD free space checks.

Modify flag lists to have a consistent style.

Reorder sentences to make more sense.

One small language improvement attempt.

Expand {a,b} glob used in text.

Signed-off-by: Ville Ojamo <14869000+bluikko@users.noreply.github.com>
2 months agodoc/rados: Fix minor issues and improvements in health-checks.rst
Ville Ojamo [Wed, 17 Dec 2025 09:36:23 +0000 (16:36 +0700)]
doc/rados: Fix minor issues and improvements in health-checks.rst

Use title case consistently in section titles.

Wrap long lines over 80 characters.

Use "Ceph Manager" instead of "ceph-mgr".
Use "Ceph Monitor" or "Monitor" consistently instead of "monitor".

Use :confval: instead of simple inline literal text for config values.

Linkify, adding a label in the destination if necessary:
- Muting health checks
- Mon election strategy

Fix a typo.

Use RST *italic* instead of _rendered underscores_.

Use a privileged prompt for CLI command examples requiring privileges.

Use path including fsid to mon data consistently.

Signed-off-by: Ville Ojamo <14869000+bluikko@users.noreply.github.com>
2 months agoMerge pull request #66643 from bluikko/wip-doc-mgr-telemetry-collection-desc
bluikko [Wed, 17 Dec 2025 03:43:39 +0000 (10:43 +0700)]
Merge pull request #66643 from bluikko/wip-doc-mgr-telemetry-collection-desc

doc/mgr: Fix collection desc in telemetry.rst

2 months agoinstall-deps: Replace apt-mirror
David Galloway [Tue, 16 Dec 2025 22:08:00 +0000 (17:08 -0500)]
install-deps: Replace apt-mirror

apt-mirror.front.sepia.ceph.com has happened to always work because we set up CNAMEs to gitbuilder.ceph.com.

That host is making its way to a new home upstate (literally and figuratively) so we'll get rid of the front subdomain since it's publicly accessible anyway and add TLS while we're at it.

Signed-off-by: David Galloway <david.galloway@ibm.com>
2 months agoMerge pull request #66576 from rhcs-dashboard/carbonized-multisite-realm-token-modal
Afreen Misbah [Tue, 16 Dec 2025 10:05:23 +0000 (15:35 +0530)]
Merge pull request #66576 from rhcs-dashboard/carbonized-multisite-realm-token-modal

mgr/dashboard: carbonized-multisite-export-realm-token-modal

Reviewed-by: Afreen Misbah <afreen@ibm.com>
Reviewed-by: Aashish Sharma <aasharma@redhat.com>
Reviewed-by: Naman Munet <nmunet@redhat.com>
Reviewed-by: pujaoshahu <pshahu@redhat.com>
Reviewed-by: Dnyaneshwari Talwekar <dtalweka@redhat.com>
2 months agoMerge pull request #66510 from rhcs-dashboard/manager-module-table-fix
Afreen Misbah [Tue, 16 Dec 2025 09:57:28 +0000 (15:27 +0530)]
Merge pull request #66510 from rhcs-dashboard/manager-module-table-fix

mgr/dashboard: Fix table width expansion on manager module dropdown selection #74089

Reviewed-by: Afreen Misbah <afreen@ibm.com>
Reviewed-by: Naman Munet <nmunet@redhat.com>
2 months agoMerge pull request #65324 from rishabh-d-dave/inode_t-fields
Rishabh Dave [Tue, 16 Dec 2025 09:51:48 +0000 (15:21 +0530)]
Merge pull request #65324 from rishabh-d-dave/inode_t-fields

mds: compare all fields of inode_t

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2 months agoMerge pull request #65496 from rishabh-d-dave/rmtree-enhancements
Rishabh Dave [Tue, 16 Dec 2025 09:50:22 +0000 (15:20 +0530)]
Merge pull request #65496 from rishabh-d-dave/rmtree-enhancements

pybind,cephfs: enhancements for rmtree()

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2 months agoMerge pull request #65805 from rishabh-d-dave/qa-returns-unbound-var
Rishabh Dave [Tue, 16 Dec 2025 09:48:58 +0000 (15:18 +0530)]
Merge pull request #65805 from rishabh-d-dave/qa-returns-unbound-var

qa/cephfs: don't return undefined variable

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Jos Collin <jcollin@redhat.com>
2 months agodoc/mgr: Fix collection desc in telemetry.rst
Ville Ojamo [Tue, 16 Dec 2025 07:29:05 +0000 (14:29 +0700)]
doc/mgr: Fix collection desc in telemetry.rst

Use correct desc taken from source file for basic channel in example CLI
output.

Also turn off syntax highlighting for the CLI outputs so that some
keywords do not get colored.

Signed-off-by: Ville Ojamo <14869000+bluikko@users.noreply.github.com>
2 months agoMerge pull request #66554 from NitzanMordhai/wip-nitzan-improve-corpus-import-script
NitzanMordhai [Tue, 16 Dec 2025 07:34:27 +0000 (09:34 +0200)]
Merge pull request #66554 from NitzanMordhai/wip-nitzan-improve-corpus-import-script

script/gen-corpus: cleanup and improve readability and performance

2 months agoMerge pull request #66232 from NitzanMordhai/wip-nitzan-add-rocky-test-rados-workunit
NitzanMordhai [Tue, 16 Dec 2025 05:49:04 +0000 (07:49 +0200)]
Merge pull request #66232 from NitzanMordhai/wip-nitzan-add-rocky-test-rados-workunit

qa/workunits: add Rocky Linux support to librados tests

2 months agoMerge pull request #66630 from rhcs-dashboard/copy2-clipboard-fix
Nizamudeen A [Tue, 16 Dec 2025 03:58:12 +0000 (09:28 +0530)]
Merge pull request #66630 from rhcs-dashboard/copy2-clipboard-fix

mgr/dashboard: emit success and error on copy2cliboard

Reviewed-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>
2 months agoMerge pull request #66631 from rhcs-dashboard/validator-js-bump
Nizamudeen A [Tue, 16 Dec 2025 03:56:47 +0000 (09:26 +0530)]
Merge pull request #66631 from rhcs-dashboard/validator-js-bump

mgr/dashboard: bump validator.js to 13.15.23

Reviewed-by: Naman Munet <nmunet@redhat.com>
Reviewed-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>
2 months agoMerge pull request #66641 from datdenkikniet/document-msgr2-crc
Anthony D'Atri [Mon, 15 Dec 2025 22:53:38 +0000 (17:53 -0500)]
Merge pull request #66641 from datdenkikniet/document-msgr2-crc

doc/dev: document exact CRC parameters for crc32c used in msgr2

2 months agodoc/dev: document exact CRC parameters for crc32c used in msgr2
datdenkikniet [Mon, 15 Dec 2025 21:01:23 +0000 (22:01 +0100)]
doc/dev: document exact CRC parameters for crc32c used in msgr2

The description of the CRC used by the msgr2 protocol as CRC32-C
can be ambiguous. Document the actual parameters that make up
the CRC that is used.

Signed-off-by: Johannes Cornelis Draaijer <jcdra1@gmail.com>
2 months agoMerge pull request #66068 from timqn22/balancer-info-log-reduction
Prashant [Mon, 15 Dec 2025 21:06:47 +0000 (16:06 -0500)]
Merge pull request #66068 from timqn22/balancer-info-log-reduction

mgr/balancer: reduce info-level logs

2 months agoMerge pull request #66129 from Tom-Sollers/fixing_blaum_roth_for_ec_profiles
Radoslaw Zarzynski [Mon, 15 Dec 2025 19:39:12 +0000 (20:39 +0100)]
Merge pull request #66129 from Tom-Sollers/fixing_blaum_roth_for_ec_profiles

Mon: Add new health warning for non prime w+1 in blaum-roth EC profiles

Reviewed-by: Bill Scales <bill_scales@uk.ibm.com>
Reviewed-by: Anthony D'Atri <anthony.datri@ibm.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
2 months agoMerge pull request #66277 from NitzanMordhai/wip-nitzan-lazyomap-wait-active-clean...
Radoslaw Zarzynski [Mon, 15 Dec 2025 19:02:10 +0000 (20:02 +0100)]
Merge pull request #66277 from NitzanMordhai/wip-nitzan-lazyomap-wait-active-clean-test

test/lazy-omap-stats: overload get_output() to accept both lvalue and…

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
2 months agoMerge pull request #66204 from apataki/73795-cephmon-cleanupmap
Radoslaw Zarzynski [Mon, 15 Dec 2025 18:41:04 +0000 (19:41 +0100)]
Merge pull request #66204 from apataki/73795-cephmon-cleanupmap

ceph-mon: Improve efficiency of upmap cleanup in osdmaps for large clusters

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
Reviewed-by: Laura Flores <lflores@redhat.com>
2 months agoMerge pull request #66542 from aainscow/ec_trunctate_bug
Laura Flores [Mon, 15 Dec 2025 17:12:19 +0000 (11:12 -0600)]
Merge pull request #66542 from aainscow/ec_trunctate_bug

osd: Fix fast EC truncate to whole stripe

Reviewed-by: Bill Scales <bill_scales@uk.ibm.com>
Reviewed-by: Radosław Zarzyński <Radoslaw.Adam.Zarzynski@ibm.com>
2 months agoMerge pull request #66579 from phlogistonjohn/jjm-smb-doc-fixes
bluikko [Mon, 15 Dec 2025 16:25:41 +0000 (23:25 +0700)]
Merge pull request #66579 from phlogistonjohn/jjm-smb-doc-fixes

doc: some small smb mgr module doc improvements

2 months agoMerge pull request #66632 from bluikko/wip-doc-very-minor-fixes
bluikko [Mon, 15 Dec 2025 16:23:38 +0000 (23:23 +0700)]
Merge pull request #66632 from bluikko/wip-doc-very-minor-fixes

doc: Fix minor formatting, typo etc issues

2 months agoMerge pull request #66322 from phlogistonjohn/jjm-qa-workunits-smb
Adam King [Mon, 15 Dec 2025 15:45:35 +0000 (10:45 -0500)]
Merge pull request #66322 from phlogistonjohn/jjm-qa-workunits-smb

smb: qa workunit for smb

Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: Anoop C S <anoopcs@cryptolab.net>
2 months agoMerge pull request #66497 from yielde/add-destractor-to-sahred_mutex_debug
Kefu Chai [Mon, 15 Dec 2025 15:05:34 +0000 (23:05 +0800)]
Merge pull request #66497 from yielde/add-destractor-to-sahred_mutex_debug

src/common: Add a destructor to shared_mutex_debug

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Kefu Chai <k.chai@proxmox.com>
2 months agoMerge pull request #66617 from imran-imtiaz/dashboard
Imran Imtiaz [Mon, 15 Dec 2025 10:04:30 +0000 (10:04 +0000)]
Merge pull request #66617 from imran-imtiaz/dashboard

mgr/dashboard: add API endpoint to delete consistency group

2 months agoMerge pull request #66441 from hasan4791/fix-install-deps
Kefu Chai [Mon, 15 Dec 2025 09:53:45 +0000 (17:53 +0800)]
Merge pull request #66441 from hasan4791/fix-install-deps

install-deps.sh: build without crimson when it is disabled

Reviewed-by: Kefu Chai <k.chai@proxmox.com>
2 months agoscript/gen-corpus: cleanup and improve readability and performance
Nitzan Mordechai [Mon, 8 Dec 2025 12:25:39 +0000 (12:25 +0000)]
script/gen-corpus: cleanup and improve readability and performance

- gen-corpus cleanup missed removing the temporary directory.
- improve it a bit for readability
- import.sh script was slow, improve performance by using less forks and
  batch processing

Signed-off-by: Nitzan Mordechai <nmordech@ibm.com>
2 months agoMerge pull request #66602 from hasan4791/issue-74183
Kefu Chai [Mon, 15 Dec 2025 09:18:15 +0000 (17:18 +0800)]
Merge pull request #66602 from hasan4791/issue-74183

erasure-code: Enable isa for ppc64le target.

Reviewed-by: Bill Scales <bill_scales@uk.ibm.com>
Reviewed-by: Jamie Pryde <jamiepry@uk.ibm.com>
Reviewed-by: Kefu Chai <k.chai@proxmox.com>
2 months agodoc: Fix minor formatting, typo etc issues
Ville Ojamo [Mon, 15 Dec 2025 08:24:22 +0000 (15:24 +0700)]
doc: Fix minor formatting, typo etc issues

Remove formatting syntax from inside literal text in
cephadm/services/rgw.rst.
Use quotation marks similarly to other placement examples with only
parameter value quoted and not the whole parameter in
cephadm/services/rgw.rst.
Capitalize "YAML" in cephadm/services/rgw.rst.

Remove double space in the middle of a sentence in
rados/operations/erasure-code.rst.

Use double backticks consistently for default values in
radosgw/frontends.rst.
Capitalize "I/O", stylize as "OpenSSL" in radosgw/frontends.rst.

Fix typo "and object" to "an object" in radosgw/s3/bucketops.rst.

Stylize as "CentOS" in start/os-recommendations.rst.

Signed-off-by: Ville Ojamo <14869000+bluikko@users.noreply.github.com>
2 months agoMerge pull request #66507 from rhcs-dashboard/74046-QAT-configuration-through-RGW...
Nizamudeen A [Mon, 15 Dec 2025 08:42:51 +0000 (14:12 +0530)]
Merge pull request #66507 from rhcs-dashboard/74046-QAT-configuration-through-RGW-service-on-Dashbard

mgr/dashboard: Adding QAT Compression dropdown on RGW Service form

Reviewed-by: Afreen Misbah <afreen@ibm.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>
Reviewed-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>
2 months agomgr/dashboard: bump validator.js to 13.15.23
Nizamudeen A [Mon, 15 Dec 2025 07:49:02 +0000 (13:19 +0530)]
mgr/dashboard: bump validator.js to 13.15.23

Fixes: https://github.com/advisories/GHSA-vghf-hv5q-vc2g
Fixes: https://tracker.ceph.com/issues/74214
Signed-off-by: Nizamudeen A <nia@redhat.com>
2 months agomgr/dashboard: emit success and error on copy2cliboard
Nizamudeen A [Mon, 15 Dec 2025 07:29:45 +0000 (12:59 +0530)]
mgr/dashboard: emit success and error on copy2cliboard

This is needed since the notification service we have right now is
tightly coupled with the dashboard so toast won't show up in the
applications where this is being consumed. So emitting an output which
the application can use to show relavant toasts.

Fixes: https://tracker.ceph.com/issues/74213
Signed-off-by: Nizamudeen A <nia@redhat.com>
2 months agoMerge pull request #66629 from tokuhirom/wip-doc-orometheus-typo
bluikko [Mon, 15 Dec 2025 06:42:35 +0000 (13:42 +0700)]
Merge pull request #66629 from tokuhirom/wip-doc-orometheus-typo

doc/monitoring: fix typo s/Orometheus/Prometheus/

2 months agoMerge pull request #66620 from bluikko/doc-notifications-formatting-radosgw
bluikko [Mon, 15 Dec 2025 06:35:32 +0000 (13:35 +0700)]
Merge pull request #66620 from bluikko/doc-notifications-formatting-radosgw

doc/radosgw: Update formatting in notifications.rst

2 months agoFixes: https://tracker.ceph.com/issues/74162
Sagar Gopale [Tue, 9 Dec 2025 14:36:39 +0000 (20:06 +0530)]
Fixes: https://tracker.ceph.com/issues/74162
Signed-off-by: Sagar Gopale <sagar.gopale@ibm.com>
2 months agodoc/monitoring: fix typo s/Orometheus/Prometheus/
Tokuhiro Matsuno [Mon, 15 Dec 2025 04:37:03 +0000 (13:37 +0900)]
doc/monitoring: fix typo s/Orometheus/Prometheus/

Signed-off-by: Tokuhiro Matsuno <tokuhirom@gmail.com>
2 months agoMerge pull request #66581 from idryomov/wip-74168
Ilya Dryomov [Sat, 13 Dec 2025 18:45:51 +0000 (19:45 +0100)]
Merge pull request #66581 from idryomov/wip-74168

librbd: fix ExclusiveLock::accept_request() when !is_state_locked()

Reviewed-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>
2 months agoMerge pull request #66605 from ceph/fixes#2414677
anmolbabu [Sat, 13 Dec 2025 08:22:55 +0000 (13:52 +0530)]
Merge pull request #66605 from ceph/fixes#2414677

Fix the prometheus module crash

2 months agodoc/radosgw: Update formatting in notifications.rst
Ville Ojamo [Fri, 12 Dec 2025 16:34:31 +0000 (23:34 +0700)]
doc/radosgw: Update formatting in notifications.rst

Change parameter list descriptions to start with capital case after
colon as discussed in #64768.

Remove a stray comma and end in full stop. Remove article from what
seems to be a plural. Hyphenation in "single-shard topics".

Change several same external links to use a single link definition
instead of repeating the destination URL each time.

Signed-off-by: Ville Ojamo <14869000+bluikko@users.noreply.github.com>
2 months agoMerge pull request #66621 from bluikko/wip-doc-start-documenting-formatting
bluikko [Sat, 13 Dec 2025 06:48:07 +0000 (13:48 +0700)]
Merge pull request #66621 from bluikko/wip-doc-start-documenting-formatting

doc/start: Fix stray RST syntax in documenting-ceph.rst

2 months agoMerge pull request #66341 from cbodley/wip-73943
Casey Bodley [Fri, 12 Dec 2025 17:45:54 +0000 (12:45 -0500)]
Merge pull request #66341 from cbodley/wip-73943

qa/rgw/upgrade: exclude ceph-osd-classic/crimson on squid and tentacle

Reviewed-by: Oguzhan Ozmen <oozmen@bloomberg.net>
2 months agodoc/start: Fix stray RST syntax in documenting-ceph.rst
Ville Ojamo [Fri, 12 Dec 2025 16:55:55 +0000 (23:55 +0700)]
doc/start: Fix stray RST syntax in documenting-ceph.rst

Remove stray single colons before literal text blocks.
Perhaps leftovers moving from :: to prompt blocks.

Remove double backticks used inside literal text.

Fix URL missing // in a link example and use HTTPS in it.

Signed-off-by: Ville Ojamo <14869000+bluikko@users.noreply.github.com>
2 months agoqa/rgw/upgrade: don't randomize distros
Casey Bodley [Fri, 12 Dec 2025 16:49:57 +0000 (11:49 -0500)]
qa/rgw/upgrade: don't randomize distros

Signed-off-by: Casey Bodley <cbodley@redhat.com>
2 months agoqa/rgw/upgrade: exclude ceph-osd-classic/crimson on squid and tentacle
Casey Bodley [Thu, 20 Nov 2025 16:57:35 +0000 (11:57 -0500)]
qa/rgw/upgrade: exclude ceph-osd-classic/crimson on squid and tentacle

split packages for ceph-osd-classic and ceph-osd-crimson were added on
main, but don't exist on squid and tentacle. exclude these packages from
their install tasks

Fixes: https://tracker.ceph.com/issues/73943
Signed-off-by: Casey Bodley <cbodley@redhat.com>
2 months agomgr/dashboard: add API endpoint to delete consistency group
Imran Imtiaz [Fri, 12 Dec 2025 10:02:59 +0000 (10:02 +0000)]
mgr/dashboard: add API endpoint to delete consistency group

Signed-off-by: Imran Imtiaz <imran.imtiaz@uk.ibm.com>
Fixes: https://tracker.ceph.com/issues/74201
Add a dashboard API endpoint to delete a consistency group.

2 months agomgr/dashboard: Adding QAT Compression dropdown on RGW Service form
Devika Babrekar [Thu, 4 Dec 2025 09:58:39 +0000 (15:28 +0530)]
mgr/dashboard: Adding QAT Compression dropdown on RGW Service form
Fixes:https://tracker.ceph.com/issues/74046
Signed-off-by: Devika Babrekar <devika.babrekar@ibm.com>
2 months agoMerge pull request #66233 from benhanokh/dedup_ver
Gabriel Benhanokh [Fri, 12 Dec 2025 06:30:58 +0000 (08:30 +0200)]
Merge pull request #66233 from benhanokh/dedup_ver

rgw/dedup: Add support for RGW versions.

2 months agoMerge pull request #66188 from harriscr/ch_wip_qa_cbt_task
SrinivasaBharathKanta [Fri, 12 Dec 2025 03:31:59 +0000 (09:01 +0530)]
Merge pull request #66188 from harriscr/ch_wip_qa_cbt_task

qa: Use a more pythonic way to install CBT python requirements

2 months agoMerge branch 'wip-update-cluster-log-warnings' of https://github.com/ljflores/ceph... wip-bharath3-testing-2025-12-12-0832
skanta [Fri, 12 Dec 2025 03:02:19 +0000 (08:32 +0530)]
Merge branch 'wip-update-cluster-log-warnings' of https://github.com/ljflores/ceph into wip-bharath3-testing-2025-12-12-0832

2 months agoMerge branch 'no_obj_ver' of https://github.com/aainscow/ceph into wip-bharath3-testi...
skanta [Fri, 12 Dec 2025 03:02:18 +0000 (08:32 +0530)]
Merge branch 'no_obj_ver' of https://github.com/aainscow/ceph into wip-bharath3-testing-2025-12-12-0832

2 months agoMerge pull request #66568 from Matan-B/wip-matanb-crimson-obselte
Kefu Chai [Fri, 12 Dec 2025 01:01:20 +0000 (09:01 +0800)]
Merge pull request #66568 from Matan-B/wip-matanb-crimson-obselte

debian,ceph.spec: fix ceph-osd upgrade conflicts

Reviewed-by: Casey Bodley <cbodley@redhat.com>
Reviewed-by: Kefu Chai <k.chai@proxmox.com>
2 months agoqa/suites/upgrade: add "OBJECT_UNFOUND" to ignorelists
Laura Flores [Mon, 24 Nov 2025 17:31:05 +0000 (11:31 -0600)]
qa/suites/upgrade: add "OBJECT_UNFOUND" to ignorelists

The thrashing in the upgrade tests has been configured to be very aggressive;
the tests are permitted to stop up to 4 of the 8 OSDs, so it is expected that
it is causing these kinds of health warnings to be generated.

This commit also cleans up some expected filesystem and pg peering warnings
in the upgrade tests.

Fixes: https://tracker.ceph.com/issues/72424
Signed-off-by: Laura Flores <lflores@ibm.com>
2 months agoqa/suites/upgrade: ignore "osd down" cluster log variations
Laura Flores [Mon, 24 Nov 2025 17:26:57 +0000 (11:26 -0600)]
qa/suites/upgrade: ignore "osd down" cluster log variations

These warnings are expected during upgrade tests. This commit
updates the list with variations of this warning that weren't
covered.

Fixes: https://tracker.ceph.com/issues/69795
Signed-off-by: Laura Flores <lflores@ibm.com>
2 months agoqa/suites/rados/thrash-old-clients: ignore warnings about peering PGs
Laura Flores [Mon, 24 Nov 2025 17:22:48 +0000 (11:22 -0600)]
qa/suites/rados/thrash-old-clients: ignore warnings about peering PGs

These warnings are expected during thrashing tasks.

https://tracker.ceph.com/issues/73360
Signed-off-by: Laura Flores <lflores@ibm.com>
2 months agoqa/suites/upgrade: add expected filesystem warnings to ignorelist
Laura Flores [Mon, 24 Nov 2025 17:18:08 +0000 (11:18 -0600)]
qa/suites/upgrade: add expected filesystem warnings to ignorelist

These warnings appear after we run ‘fs rm’, which seems expected.

Fixes: https://tracker.ceph.com/issues/73557
Signed-off-by: Laura Flores <lflores@ibm.com>
2 months agoMerge pull request #65511 from gardran/wip-gardran-optimize-snapmapper
Igor Fedotov [Thu, 11 Dec 2025 20:55:33 +0000 (23:55 +0300)]
Merge pull request #65511 from gardran/wip-gardran-optimize-snapmapper

osd/SnapMapper: more efficient object list generation in

Reviewed-by: Matan Breizman<mbreizma@redhat.com>
2 months agoMerge pull request #66281 from rhcs-dashboard/nvme-gateway-group-list
Afreen Misbah [Thu, 11 Dec 2025 18:32:13 +0000 (00:02 +0530)]
Merge pull request #66281 from rhcs-dashboard/nvme-gateway-group-list

mgr/dashboard: Carbonize Block Module > NVme-Listing Gateway group

Reviewed-by: Afreen Misbah <afreen@ibm.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>
2 months agoFix the prometheus module crash wip-ash-prom-crash-main
Anmol Babu [Thu, 11 Dec 2025 08:39:52 +0000 (14:09 +0530)]
Fix the prometheus module crash

fixes: https://bugzilla.redhat.com/show_bug.cgi?id=2414677
Signed-off-by: Anmol Babu <Anmol.Babu@ibm.com>
2 months agodebian,ceph.spec: fix ceph-osd upgrade conflicts
Matan Breizman [Tue, 9 Dec 2025 09:52:08 +0000 (09:52 +0000)]
debian,ceph.spec: fix ceph-osd upgrade conflicts

With https://github.com/ceph/ceph/pull/65782 merged, upgrading ceph-osd
would need to replace the previous ceph-osd existing on the machine.
Otherwise, we won't be able to symlink the newly installed package:
```
2025-12-05T21:09:20.472 INFO:teuthology.orchestra.run.smithi077.stdout:
Installing       : ceph-osd-classic-2:20.3.0-4434.g8611241d.el9.x86_6
24/87
2025-12-05T21:09:20.478 INFO:teuthology.orchestra.run.smithi077.stdout:
Running scriptlet: ceph-osd-classic-2:20.3.0-4434.g8611241d.el9.x86_6
24/87
2025-12-05T21:09:20.479
INFO:teuthology.orchestra.run.smithi077.stdout:failed to link
/usr/bin/ceph-osd -> /etc/alternatives/ceph-osd: /usr/bin/ceph-osd
exists and it is not a symlink
```

Note: debian/control ceph-osd-classic already had Replace and Breaks:
      - Breaks is replaced with Conflicts to not allow coexistence.
      - Release version is bumped up to be relevant for latest main

Signed-off-by: Matan Breizman <mbreizma@redhat.com>
2 months agosrc/common: Add a destructor to shared_mutex_debug
Hui Tong [Wed, 3 Dec 2025 13:55:36 +0000 (21:55 +0800)]
src/common: Add a destructor to shared_mutex_debug

Signed-off-by: Hui Tong <th937013596@gmail.com>
2 months agoerasure-code: Enable isa for ppc64le target.
T K Chandra Hasan [Thu, 11 Dec 2025 06:54:08 +0000 (12:24 +0530)]
erasure-code: Enable isa for ppc64le target.

Fixes: https://tracker.ceph.com/issues/74183
Signed-off-by: T K Chandra Hasan <t.k.chandra.hasan@ibm.com>
2 months agoMerge pull request #66588 from rhcs-dashboard/task_74176
Afreen Misbah [Thu, 11 Dec 2025 06:09:10 +0000 (11:39 +0530)]
Merge pull request #66588 from rhcs-dashboard/task_74176

mgr/dashboard: where the alert page tag colors are not being applied

Reviewed-by: Afreen Misbah <afreen@ibm.com>
2 months agoMerge pull request #66547 from Hezko/fix-nvme-cert-name
Afreen Misbah [Thu, 11 Dec 2025 03:37:12 +0000 (09:07 +0530)]
Merge pull request #66547 from Hezko/fix-nvme-cert-name

mgr/dashboard: fix nvme certificate name

Reviewed-by: Afreen Misbah <afreen@ibm.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>
2 months agoMerge pull request #66394 from elliotcourant/fix/debian-post-install-home
Kefu Chai [Thu, 11 Dec 2025 03:26:36 +0000 (11:26 +0800)]
Merge pull request #66394 from elliotcourant/fix/debian-post-install-home

deb/cephadm: Don't assume a home directory is configured

Reviewed-by: Kefu Chai <tchaikov@gmail.com>
2 months agoMerge pull request #66134 from rzarzynski/wip-pc-humanreadable-prio
Yuri Weinstein [Wed, 10 Dec 2025 23:01:12 +0000 (15:01 -0800)]
Merge pull request #66134 from rzarzynski/wip-pc-humanreadable-prio

common/perf_counters: don't use magics for add_u64_counter{,_histogram}

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
2 months agodeb/cephadm: Don't assume a home directory is configured
Elliot Courant [Mon, 24 Nov 2025 17:50:56 +0000 (11:50 -0600)]
deb/cephadm: Don't assume a home directory is configured

cephadm.postinst can fail if cephadm was originally installed using a
version that didn't configure a home directory for the user at all.
Newer versions do configure a home directory (as either `/home/cephadm`
or `/var/lib/cephadm`) so if that is configured then nothing needs to be
done. But if the user was created with no home directory then one needs
to be added for the configure step to succeed.

Fixes: https://tracker.ceph.com/issues/72083
commit 90bc0369243077c2aaf67f0de2bab5810b217f4e added home directories
for new cephadm users created, but didn't add home directories to
cephadm users that already existed.

Signed-off-by: Elliot Courant <me@elliotcourant.dev>
2 months agoMerge pull request #66574 from anoopcs9/whitespace-for-smb-login-names
Adam King [Wed, 10 Dec 2025 16:34:46 +0000 (11:34 -0500)]
Merge pull request #66574 from anoopcs9/whitespace-for-smb-login-names

mgr/smb: Allow whitespace in share login names

Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: John Mulligan <jmulligan@redhat.com>
2 months agodoc/mgr: fix some wording issues
John Mulligan [Wed, 10 Dec 2025 16:06:26 +0000 (11:06 -0500)]
doc/mgr: fix some wording issues

Be more smarticle about the article.
Also banish a stray.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
2 months agodoc/mgr: prefer the word interface over device
John Mulligan [Wed, 10 Dec 2025 16:07:36 +0000 (11:07 -0500)]
doc/mgr: prefer the word interface over device

As per the suggestion in https://github.com/ceph/ceph/pull/66579#discussion_r2606834482

Signed-off-by: John Mulligan <jmulligan@redhat.com>
2 months agorados: Add API to disable version querying with reads in librados
Alex Ainscow [Fri, 7 Nov 2025 10:44:56 +0000 (10:44 +0000)]
rados: Add API to disable version querying with reads in librados

librados will always request a "user version". Until EC direct reads are implemented
this is a cheap operation and so librados always requests the user version, even if
the client does not need it.

With EC direct reads, requesting the user version requires an extra op to the primary
in some scenarios. The non-primary OSDs do not contain an up to date user
version.

NEORADOS already allows for such optimisations, due to a how the API is organised.

librados is not heavily used by ceph-maintained clients, but this API will still be
useful for testing of EC direct reads, since the test clients will use librados, due
to it simpler nature and performance not being critical in the tests.

Signed-off-by: Alex Ainscow <aainscow@uk.ibm.com>
2 months agomgr/dashboard: where the alert page tag colors are not being applied
pujashahu [Wed, 10 Dec 2025 10:57:46 +0000 (16:27 +0530)]
mgr/dashboard: where the alert page tag colors are not being applied

Fixes: https://tracker.ceph.com/issues/74176
Signed-off-by: pujashahu <pshahu@redhat.com>
2 months agoMerge pull request #66516 from rhcs-dashboard/74078-host-labels-removed-fix
Afreen Misbah [Wed, 10 Dec 2025 10:04:10 +0000 (15:34 +0530)]
Merge pull request #66516 from rhcs-dashboard/74078-host-labels-removed-fix

mgr/dashboard :  Fixed labels issue

Reviewed-by: Afreen Misbah <afreen@ibm.com>
2 months agoFixes: https://tracker.ceph.com/issues/74089
Sagar Gopale [Thu, 4 Dec 2025 13:00:00 +0000 (18:30 +0530)]
Fixes: https://tracker.ceph.com/issues/74089
Signed-off-by: Sagar Gopale <sagar.gopale@ibm.com>
2 months agoMerge pull request #66528 from tchaikov/wip-debian-deps-c-v
Guillaume Abrioux [Wed, 10 Dec 2025 09:01:19 +0000 (10:01 +0100)]
Merge pull request #66528 from tchaikov/wip-debian-deps-c-v

debian/control: record python3-packaging dependency for ceph-volume

2 months agomgr/smb: Allow whitespace in share login names
Anoop C S [Tue, 9 Dec 2025 13:37:40 +0000 (19:07 +0530)]
mgr/smb: Allow whitespace in share login names

Whitespaces are perfectly valid in Windows AD user or group names.

Signed-off-by: Anoop C S <anoopcs@cryptolab.net>
2 months agoMerge pull request #66249 from thotz/adminops-restore-op
Jiffin Tony Thottan [Wed, 10 Dec 2025 06:41:09 +0000 (12:11 +0530)]
Merge pull request #66249 from thotz/adminops-restore-op

rgw/adminops: support for adding restore operation

2 months ago'mgr/dashboard: Carbonize Block Module > NVme-Listing Gateway group
pujashahu [Mon, 17 Nov 2025 14:12:03 +0000 (19:42 +0530)]
'mgr/dashboard: Carbonize Block Module > NVme-Listing Gateway group

Fixes: https://tracker.ceph.com/issues/73719
Signed-off-by: pujaoshahu <pshahu@redhat.com>
2 months agoMerge pull request #54435 from dparmar18/libcephfs-nonblocking-io-testcases
Venky Shankar [Wed, 10 Dec 2025 04:38:45 +0000 (10:08 +0530)]
Merge pull request #54435 from dparmar18/libcephfs-nonblocking-io-testcases

src/test: add libcephfs tests for async(nonblocking) calls

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2 months agoMerge pull request #66404 from tchaikov/wip-bwc-with-more-branch-names
Kefu Chai [Wed, 10 Dec 2025 03:21:54 +0000 (11:21 +0800)]
Merge pull request #66404 from tchaikov/wip-bwc-with-more-branch-names

script: sanitize git branch names for OCI tag compliance

Reviewed-by: John Mulligan <jmulligan@redhat.com>
2 months agoqa/tasks: disable ulmits+coverage wrapper scripts for smb workunit
John Mulligan [Tue, 2 Dec 2025 21:32:48 +0000 (16:32 -0500)]
qa/tasks: disable ulmits+coverage wrapper scripts for smb workunit

Signed-off-by: John Mulligan <jmulligan@redhat.com>
2 months agoqa/orch/cephadm/smb: add new smb.workunit task (non-AD)
John Mulligan [Mon, 1 Dec 2025 16:03:40 +0000 (11:03 -0500)]
qa/orch/cephadm/smb: add new smb.workunit task (non-AD)

Add some instances of the new smb workunit wrapper to a couple of
non-AD tests that require the user name and password to be explicitly
provided.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
2 months agoqa/orch/cephadm/smb: add new smb.workunit tasks to some AD tests
John Mulligan [Mon, 29 Sep 2025 18:34:56 +0000 (14:34 -0400)]
qa/orch/cephadm/smb: add new smb.workunit tasks to some AD tests

These tests will automatically use a user account from the known
AD environment.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
2 months agoqa/tasks: add a workunit wrapper function to smb.py
John Mulligan [Wed, 12 Nov 2025 19:49:26 +0000 (14:49 -0500)]
qa/tasks: add a workunit wrapper function to smb.py

Signed-off-by: John Mulligan <jmulligan@redhat.com>
2 months agoqa/workunits/smb: add test_simple_smb.py very simple smb tests
John Mulligan [Mon, 17 Nov 2025 21:04:10 +0000 (16:04 -0500)]
qa/workunits/smb: add test_simple_smb.py very simple smb tests

Signed-off-by: John Mulligan <jmulligan@redhat.com>
2 months agoqa/workunits/smb: add conftest.py for pytest fixtures, etc.
John Mulligan [Mon, 17 Nov 2025 18:04:07 +0000 (13:04 -0500)]
qa/workunits/smb: add conftest.py for pytest fixtures, etc.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
2 months agoqa/workunits/smb: add smbutil.py a testing utility module
John Mulligan [Mon, 17 Nov 2025 21:03:58 +0000 (16:03 -0500)]
qa/workunits/smb: add smbutil.py a testing utility module

Signed-off-by: John Mulligan <jmulligan@redhat.com>
2 months agoqa/workunits/smb: add a stub pytest config
John Mulligan [Wed, 19 Nov 2025 16:19:24 +0000 (11:19 -0500)]
qa/workunits/smb: add a stub pytest config

Signed-off-by: John Mulligan <jmulligan@redhat.com>
2 months agoqa/workunits/smb: add a stub workunit script for smb testing
John Mulligan [Wed, 19 Nov 2025 16:19:24 +0000 (11:19 -0500)]
qa/workunits/smb: add a stub workunit script for smb testing

Signed-off-by: John Mulligan <jmulligan@redhat.com>
2 months agolibrbd: fix ExclusiveLock::accept_request() when !is_state_locked()
Ilya Dryomov [Tue, 9 Dec 2025 14:22:02 +0000 (15:22 +0100)]
librbd: fix ExclusiveLock::accept_request() when !is_state_locked()

To accept an async request, two conditions must be met: a) exclusive
lock must be a firm STATE_LOCKED state and b) async requests shouldn't
be blocked or if they are blocked there should be an exception in place
for a given request_type.  If a) is met but b) isn't, ret_val is set
to m_request_blocked_ret_val, as expected -- the reason for denying
the request is that async requests are blocked.  However, if a) isn't
met, ret_val also gets set to m_request_blocked_ret_val.  This is wrong
because the reason for denying the request in this case isn't that
async requests are blocked (they may or may not be) but a much heavier
circumstance of exclusive lock being in a transient state or not held
at all.

In such scenarios, whether async requests are blocked or not isn't
relevant and ExclusiveLock::accept_request() behaving otherwise can
lead to bogus "duplicate lock owners detected" errors getting raised
during an attempt to handle any maintenance operation notification in
ImageWatcher::handle_operation_request().  This error isn't considered
retryable so the entire operation that needed the exclusive lock would
be spuriously failed with EINVAL.

Fixes: https://tracker.ceph.com/issues/74168
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2 months agolibrbd: add ExclusiveLock::accept_request() overload
Ilya Dryomov [Sat, 6 Dec 2025 08:32:08 +0000 (09:32 +0100)]
librbd: add ExclusiveLock::accept_request() overload

Make ret_val out parameter required for the existing method and
introduce an overload taking just request_type to compensate.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>