git.apps.os.sepia.ceph.com Git

]> git.apps.os.sepia.ceph.com Git - ceph.git/log

Seena Fallah [Tue, 1 Apr 2025 15:28:10 +0000 (17:28 +0200)]

rgw: make rgw_sync_pipe_params::user optional

In rgw_sync_pipe_params, the mode can be either system or user.
When in system mode, no user is involved, but the current
implementation holds an empty rgw_user, which can cause confusion
in pipe_rules::find_basic_info_without_tags().

With this change, rgw_user is now optional, ensuring that when no
user is involved, it is explicitly nullopt rather than an empty object.

Signed-off-by: Seena Fallah <seenafallah@gmail.com>
(cherry picked from commit c8aca216f7d186e4e8391a284d14948afd414957)

commit | commitdiff | tree

Seena Fallah [Fri, 28 Mar 2025 23:00:02 +0000 (00:00 +0100)]

qa/rgw: add perm check test for copy obj between zonegroups

Make sure perms are evaluated properly for the source object.

Signed-off-by: Seena Fallah <seenafallah@gmail.com>
(cherry picked from commit 9523e15fb74e09718f5cc9c0bddf2492fc8d8128)

commit | commitdiff | tree

Seena Fallah [Mon, 24 Feb 2025 15:47:50 +0000 (16:47 +0100)]

doc: add release note for new policy actions on replication

Fixes: https://tracker.ceph.com/issues/70093
Signed-off-by: Seena Fallah <seenafallah@gmail.com>
(cherry picked from commit 8c15d4674f567c7b35d5aac0a9ac4e62306f7b13)

commit | commitdiff | tree

Seena Fallah [Fri, 28 Mar 2025 20:55:20 +0000 (21:55 +0100)]

rgw: remote copy obj pass rgwx-perm-check-uid for perm evaluation

When copying object from remote source (bucket from another zonegroup)
the perms of the source is not evaluated resulting in reading from
unauthorized buckets.
passing `rgwx-perm-check-uid` will let the source zone evaluates the
perm and close this bug.

Signed-off-by: Seena Fallah <seenafallah@gmail.com>
(cherry picked from commit 3c83520d3338e85e2219e34e77d1149033533a71)

commit | commitdiff | tree

Seena Fallah [Fri, 28 Mar 2025 20:52:47 +0000 (21:52 +0100)]

rgw: RGWRadosPutObj evals source bucket perm for backward compatibility

As of a3f40b4 we no longer evaluate perms locally for source bucket,
this could cause broken permission evaluation dusring upgrade as one
zone is not respecting the perm evaluation based on the `rgwx-perm-check-uid`
arg.

This can be dropped in T+2 release.

Signed-off-by: Seena Fallah <seenafallah@gmail.com>
(cherry picked from commit 926ed16c27c0625427ae04d7298a5e47c1aba22b)

commit | commitdiff | tree

Seena Fallah [Thu, 24 Apr 2025 19:02:08 +0000 (21:02 +0200)]

rgw: make verify_bucket_permission functions const

Signed-off-by: Seena Fallah <seenafallah@gmail.com>
(cherry picked from commit b0200c627b1c8cd8ac236119bd6db7b18abc89dc)

commit | commitdiff | tree

Seena Fallah [Fri, 28 Mar 2025 20:48:34 +0000 (21:48 +0100)]

rgw: give hint via header for perm evaluation in GetObj

Return `Rgwx-Perm-Checked` header as a hint for the destination zone
to know whether the perms where considered or not.
This is just a backward compatibility for upgrade and can be dropped
in T+2 release.

Signed-off-by: Seena Fallah <seenafallah@gmail.com>
(cherry picked from commit 84a8d1ba0ed4a9a1abc80c1b839f95aaeef5f27b)

commit | commitdiff | tree

Seena Fallah [Fri, 28 Mar 2025 20:36:38 +0000 (21:36 +0100)]

rgw: rest client callback when all headers are passed

Signed-off-by: Seena Fallah <seenafallah@gmail.com>
(cherry picked from commit 0a93e74a0476e80c51ce5ec23b2a5ca1b28a3996)

commit | commitdiff | tree

Seena Fallah [Wed, 5 Mar 2025 19:52:48 +0000 (20:52 +0100)]

rgw: pass rgwx-perm-check-uid for multisite fetch object

Signed-off-by: Seena Fallah <seenafallah@gmail.com>
(cherry picked from commit 86aa6d36e24b78604fd15ac52452ab2cfcc539a9)

commit | commitdiff | tree

Seena Fallah [Fri, 28 Feb 2025 16:07:32 +0000 (17:07 +0100)]

rgw: GetObject(Version) not allowed to replicate sse-kms objects

To replicate objects encrypted via sse-kms objects,
s3:GetObjectVersionForReplication is required.

Signed-off-by: Seena Fallah <seenafallah@gmail.com>
(cherry picked from commit 3024b70ad56a7733527be7bae53d0a19a368c45c)

commit | commitdiff | tree

Seena Fallah [Thu, 27 Feb 2025 10:53:44 +0000 (11:53 +0100)]

rgw: take account GetObject(Version)Tagging when replicating

In case the uid has no permission to read tagging, the tags should
not be replicated.
Ref. https://docs.aws.amazon.com/AmazonS3/latest/userguide/setting-repl-config-perm-overview.html

Signed-off-by: Seena Fallah <seenafallah@gmail.com>
(cherry picked from commit ae8d7a97714faabe90d1e1660aacabe27e080e42)

commit | commitdiff | tree

Seena Fallah [Mon, 24 Feb 2025 22:56:13 +0000 (23:56 +0100)]

qa/rgw: add test for source object perm check in multisite

Check whether the policies are honored on source object in source
zone when replicating.

Signed-off-by: Seena Fallah <seenafallah@gmail.com>
(cherry picked from commit e4f44851b3c0b46528dea6104cf32d6898c711d4)

commit | commitdiff | tree

Seena Fallah [Fri, 28 Feb 2025 15:51:07 +0000 (16:51 +0100)]

rgw: replication require lock perm if enabled

Signed-off-by: Seena Fallah <seenafallah@gmail.com>
(cherry picked from commit 4fde9dddb8c2732ecf95fa1d508ee7c91fc53e74)

commit | commitdiff | tree

Seena Fallah [Mon, 24 Feb 2025 22:41:13 +0000 (23:41 +0100)]

rgw: check source object replication by replication actions

Check for permissions of `s3:GetObjectVersionForReplication` in
addition to `s3:GetObject` and `s3:GetObjectVersion` when fetching
the object for multisite.

Signed-off-by: Seena Fallah <seenafallah@gmail.com>
(cherry picked from commit 89d92dee29a15c5d1be71859be9a2b485236ef4b)

commit | commitdiff | tree

Seena Fallah [Sat, 1 Mar 2025 00:22:07 +0000 (01:22 +0100)]

rgw: export action_bit_string through header file

Signed-off-by: Seena Fallah <seenafallah@gmail.com>
(cherry picked from commit f2ba4db7b2e57ac0a7166a11251c662c88701805)

commit | commitdiff | tree

Seena Fallah [Mon, 24 Feb 2025 22:33:45 +0000 (23:33 +0100)]

rgw: only allow system override if identity is not impersonating

Since multisite now delegates permission checks for source objects
to the source zone (a3f40b4), we need to avoid allowing system-level
overrides when the request is impersonating another identity.

SysReqApplier should only grant override permission if the request
is truly system-authenticated and not acting on behalf of another
user or role (i.e., no rgwx-perm-check-uid or rgwx-perm-check-role
in the request).

Signed-off-by: Seena Fallah <seenafallah@gmail.com>
(cherry picked from commit 2a0cb65076fa63439a5d4b7c8876fb551d7ab8ec)

commit | commitdiff | tree

Seena Fallah [Thu, 17 Apr 2025 12:55:00 +0000 (14:55 +0200)]

rgw: SysReqApplier overrides is_admin_of based on impersonation

SysReqApplier now returns true for is_admin_of() when the requester
was a system user and was not impersonating any user/role using
rgwx-perm-check-uid or rgwx-perm-check-role.

Signed-off-by: Seena Fallah <seenafallah@gmail.com>
(cherry picked from commit 0e650ea276669c2c6bb236f27db07910754cc220)

commit | commitdiff | tree

Seena Fallah [Fri, 21 Feb 2025 00:34:27 +0000 (01:34 +0100)]

qa/rgw: add test for new replication actions

Signed-off-by: Seena Fallah <seenafallah@gmail.com>
(cherry picked from commit 3f2514f7cf2941350539da86756435808db212f9)

commit | commitdiff | tree

Seena Fallah [Thu, 20 Feb 2025 23:57:25 +0000 (00:57 +0100)]

rgw: support s3ReplicateTags perm on destination bucket for replication

Check for tag replication permission on destination bucket, so if
there was an explicit deny, donot include tags in the replicated
object.

Signed-off-by: Seena Fallah <seenafallah@gmail.com>
(cherry picked from commit 3fb1671520d62ce707ebc15e8f7874540b7e2aaa)

commit | commitdiff | tree

Seena Fallah [Thu, 20 Feb 2025 23:56:28 +0000 (00:56 +0100)]

rgw: check for s3ReplicateObject perm on destination bucket for replication

Instead of s3:PutObject rely on s3:s3ReplicateObject permission to
check whether the user can replicate to the destination bucket.

Signed-off-by: Seena Fallah <seenafallah@gmail.com>
(cherry picked from commit 97ee3287fb3b062eda0d07f07a219eafb04a5a6a)

commit | commitdiff | tree

Seena Fallah [Thu, 20 Feb 2025 21:15:31 +0000 (22:15 +0100)]

rgw: verify perm on delete replication

Check for s3:ReplicateDelete for replicating object deletes and
delete markers when pipe is set to user mode.

Signed-off-by: Seena Fallah <seenafallah@gmail.com>
(cherry picked from commit d7fe7915b452c5639b415d6457e272fe0d235ef5)

commit | commitdiff | tree

Seena Fallah [Sat, 22 Feb 2025 23:50:16 +0000 (00:50 +0100)]

rgw: move RGWUserPermHandler to header

So it can be used by others.

Signed-off-by: Seena Fallah <seenafallah@gmail.com>
(cherry picked from commit 857f7bd8e6be11d1d3453e0dc32dae0e3945f8f5)

commit | commitdiff | tree

Seena Fallah [Thu, 20 Feb 2025 20:38:50 +0000 (21:38 +0100)]

rgw: weaning off RGWUserPermHandler from RGWDataSyncEnv

So it can be called by RGWAsyncRadosRequest classes not holding
sync_env.

Signed-off-by: Seena Fallah <seenafallah@gmail.com>
(cherry picked from commit 77c9304102e8650ba1d3265ef63bfa2d0a6756d1)

commit | commitdiff | tree

Seena Fallah [Sat, 22 Feb 2025 23:47:55 +0000 (00:47 +0100)]

rgw: send bucket sync structs to bucket_sync.h

So it can be imported by headers like rgw_cr_rados.h that already
has dependency to rgw_data_sync.h.

Signed-off-by: Seena Fallah <seenafallah@gmail.com>
(cherry picked from commit e7422956988394d334043123bc87460055a9db13)

commit | commitdiff | tree

Seena Fallah [Wed, 19 Feb 2025 22:51:11 +0000 (23:51 +0100)]

rgw: drop unused params passed to RGWStatRemoteObjCR by RGWObjFetchCR

Signed-off-by: Seena Fallah <seenafallah@gmail.com>
(cherry picked from commit bb337be08467d649f17712558c5414bd64cb3d09)

commit | commitdiff | tree

Pritha Srivastava [Mon, 1 Apr 2024 15:57:06 +0000 (21:27 +0530)]

rgw/qa: added test case to assume a role after role creation
syncs, and then creating a bucket on both primary and secondary.
The test name is test_assume_role_after_sync.

Signed-off-by: Pritha Srivastava <prsrivas@redhat.com>
(cherry picked from commit 855db87f4addec8576708d56b6f6d6554caf8b37)

commit | commitdiff | tree

Pritha Srivastava [Thu, 28 Mar 2024 11:16:20 +0000 (16:46 +0530)]

rgw/sts: by-passing authentication using temp creds
in case the request is forwarded from secondary in
a multi-site setup. authenticating with the system
user creds of which are used to sign the request.
Permissions are still derived from the role.

Signed-off-by: Pritha Srivastava <prsrivas@redhat.com>
(cherry picked from commit 63bc73802ddb0ef74d66d468293e489e4d5fa58f)

commit | commitdiff | tree

Ronen Friedman [Mon, 28 Apr 2025 16:09:17 +0000 (19:09 +0300)]

Merge pull request #62998 from ronen-fr/wip-rf-62996-tentacle

tentacle: osd/scrub: always round up reported scrub duration

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
Reviewed-by: Matan Breizman <mbreizma@redhat.com>

commit | commitdiff | tree

Ronen Friedman [Sat, 26 Apr 2025 08:21:29 +0000 (03:21 -0500)]

osd/scrub: always round up reported scrub duration

as expected by some tests, and clearer for the user.

Fixes: https://tracker.ceph.com/issues/68833
Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
(cherry picked from commit b7fca3676eec20371e0735650a91add065f8faa0)

commit | commitdiff | tree

Patrick Donnelly [Fri, 25 Apr 2025 19:02:03 +0000 (15:02 -0400)]

Merge PR #62901 into main

* refs/pull/62901/head:
qa/workunits/fs/misc: remove data pool cleanup

Reviewed-by: Greg Farnum <gfarnum@redhat.com>

commit | commitdiff | tree

Patrick Donnelly [Fri, 25 Apr 2025 19:00:39 +0000 (15:00 -0400)]

Merge PR #62833 into main

* refs/pull/62833/head:
qa: test charmap changes with dir and snaps
mds: check for snapshots on parent snaprealms
mds: use strict_strtobool for parsing bools
common: take string_view for strict_tobool

Reviewed-by: Greg Farnum <gfarnum@redhat.com>

commit | commitdiff | tree

Anthony D'Atri [Fri, 25 Apr 2025 16:20:51 +0000 (12:20 -0400)]

Merge pull request #62966 from bluikko/doc-toc-sectionlevels-radosgw

doc/radosgw: Fix section header levels in multisite-sync-policy.rst

commit | commitdiff | tree

Adam King [Fri, 25 Apr 2025 15:11:31 +0000 (11:11 -0400)]

Merge pull request #62023 from Kushal-deb/user-friendly_error_handling_for_invalid_osd_device_paths

cephadm: Provide user friendly error message if osd device path is invalid

Reviewed-by: Adam King <adking@redhat.com>

commit | commitdiff | tree

Yingxin Cheng [Fri, 25 Apr 2025 14:41:02 +0000 (22:41 +0800)]

Merge pull request #62895 from cyx1231st/wip-seastore-omap-link-init

crimson/os/seastore/omap_manager: simplify maybe_init from tolerating duplicated calls

Reviewed-by: Xuehan Xu <xuxuehan@qianxin.com>

commit | commitdiff | tree

Yingxin Cheng [Fri, 25 Apr 2025 12:55:31 +0000 (20:55 +0800)]

Merge pull request #62938 from cyx1231st/wip-seastore-cleanup-paddr-types

crimson/os/seastore: improve checks to the paddr types

Reviewed-by: Xuehan Xu <xuxuehan@qianxin.com>
Reviewed-by: Myoungwon Oh <myoungwon.oh@samsung.com>

commit | commitdiff | tree

Yingxin Cheng [Fri, 25 Apr 2025 12:53:55 +0000 (20:53 +0800)]

Merge pull request #62838 from cyx1231st/wip-seastore-simplify-cache-access-metrics

crimson/os/seastore: simplify cache access metrics

Reviewed-by: Xuehan Xu <xuxuehan@qianxin.com>

commit | commitdiff | tree

Matt Benjamin [Fri, 25 Apr 2025 11:53:30 +0000 (07:53 -0400)]

Merge pull request #56336 from pritha-srivastava/wip-rgw-d4n-next

Wip rgw d4n next

commit | commitdiff | tree

Adam Kupczyk [Fri, 25 Apr 2025 10:18:34 +0000 (12:18 +0200)]

Merge pull request #56975 from aclamk/wip-aclamk-bs-compression-recompression

os/bluestore: Recompression, part 4. Scanner, Estimator and core recompression.

commit | commitdiff | tree

Ville Ojamo [Fri, 25 Apr 2025 07:16:52 +0000 (14:16 +0700)]

doc/radosgw: Fix section header levels in multisite-sync-policy.rst

The section header levels are reversed so the hierarchy in the TOC is
incorrect. Switch around the section header levels to make the TOC
hierarchy correct, for example individual examples are children of the
"Examples" section.

Signed-off-by: Ville Ojamo <14869000+bluikko@users.noreply.github.com>

commit | commitdiff | tree

Shraddha Agrawal [Fri, 25 Apr 2025 05:56:15 +0000 (11:26 +0530)]

Merge pull request #59673 from shraddhaag/availability-score-feature

monitor: add availability score feature

commit | commitdiff | tree

Gil Bregman [Fri, 25 Apr 2025 05:34:07 +0000 (08:34 +0300)]

Merge pull request #62937 from gbregman/main

mgr/cephadm/nvmeof: Allow setting NVMEoF gateway huge pages count in the spec file

commit | commitdiff | tree

Patrick Donnelly [Fri, 25 Apr 2025 02:41:14 +0000 (22:41 -0400)]

Merge PR #62658 into main

* refs/pull/62658/head:
libcephfs_proxy: Remove arithmetic on `void*`

Reviewed-by: Patrick Donnelly <pdonnell@ibm.com>
Reviewed-by: Matan Breizman <mbreizma@redhat.com>
Reviewed-by: Xavi Hernandez <xhernandez@gmail.com>

commit | commitdiff | tree

Yingxin Cheng [Wed, 23 Apr 2025 14:05:35 +0000 (22:05 +0800)]

crimson/os/seastore/cache: init root as dirty

To simplify checks that root won't appear in lru.

Also, make sure root has a root paddr.

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Yingxin Cheng [Wed, 23 Apr 2025 09:30:24 +0000 (17:30 +0800)]

crimson/os/seastore: introduce strict paddr type checks in cache and transaction

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Yingxin Cheng [Wed, 23 Apr 2025 09:27:04 +0000 (17:27 +0800)]

crimson/os/seastore/seastore_types: tolerate fake paddrs as absolute addresses

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Yingxin Cheng [Wed, 23 Apr 2025 09:02:54 +0000 (17:02 +0800)]

crimson/os/seastore: fake paddr is only possible with UT, wrap with UNIT_TESTS_BUILT

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Yingxin Cheng [Wed, 23 Apr 2025 06:59:25 +0000 (14:59 +0800)]

crimson/os/seastore/seastore_types: prefer paddr_t::is_absolute_*()

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Yingxin Cheng [Wed, 23 Apr 2025 06:38:43 +0000 (14:38 +0800)]

crimson/os/seastore: more accurate checks to the paddr types

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Gil Bregman [Wed, 23 Apr 2025 20:55:24 +0000 (23:55 +0300)]

mgr/cephadm/nvmeof: Allow setting NVMEoF gateway huge pages count in the spec file
Fixes https://tracker.ceph.com/issues/71043

Signed-off-by: Gil Bregman <gbregman@il.ibm.com>

commit | commitdiff | tree

Adam King [Thu, 24 Apr 2025 18:40:28 +0000 (14:40 -0400)]

Merge pull request #62561 from rkachach/fix_issue_70359_v2

mgr/cephadm: harmonize mgmt-gateway and oauth2-proxy spec fields

Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>

commit | commitdiff | tree

Adam King [Thu, 24 Apr 2025 18:34:21 +0000 (14:34 -0400)]

Merge pull request #62302 from thegreenbear/cephadm-sd-custom-containers

mgr/cephadm: enhanced service to allow discovery of custom containers

Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: Redouane Kachach <rkachach@redhat.com>

commit | commitdiff | tree

Casey Bodley [Thu, 24 Apr 2025 15:35:48 +0000 (11:35 -0400)]

Merge pull request #62936 from cbodley/wip-doc-rgw-getobjattrs

doc/rgw: release note for GetObjectAttributes

Reviewed-by: Adam C. Emerson <aemerson@redhat.com>
Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

commit | commitdiff | tree

Pedro Gonzalez Gomez [Thu, 24 Apr 2025 15:26:18 +0000 (17:26 +0200)]

Merge pull request #62845 from rhcs-dashboard/fix-path

mgr/dashboard: fix smb edit resources

Reviewed-by: Afreen Misbah <afreen@ibm.com>

commit | commitdiff | tree

Casey Bodley [Thu, 24 Apr 2025 14:59:33 +0000 (10:59 -0400)]

Merge pull request #62715 from cbodley/wip-qa-rgw-no-gc

qa/rgw: run verify tests with garbage collection disabled

Reviewed-by: Jane Zhu <jzhu116@bloomberg.net>

commit | commitdiff | tree

Ilya Dryomov [Thu, 24 Apr 2025 14:36:46 +0000 (16:36 +0200)]

Merge pull request #62921 from idryomov/wip-71026

librbd: disallow "rbd trash mv" if image is in a group

Reviewed-by: Ramana Raja <rraja@redhat.com>

commit | commitdiff | tree

Shraddha Agrawal [Mon, 6 Jan 2025 07:12:11 +0000 (07:12 +0000)]

qa/standalone/misc/availability.sh: add tests

This commit adds a standalone test for verifying if
the availability score of a pool comes down when there
are unfound objects present.

Fixes: https://tracker.ceph.com/issues/67777
Signed-off-by: Shraddha Agrawal <shraddha.agrawal000@gmail.com>

commit | commitdiff | tree

Shraddha Agrawal [Mon, 7 Oct 2024 06:16:34 +0000 (11:46 +0530)]

src/mon/PGMap.cc: check unfound obejcts in `get_unavailable_pg_in_pool_map`

If a pool has any PG with unfound objects, we should consider
it unavailable for the availability score. If a PG has unfound
objects, it will be recorded in PGMap.

In `get_unavailable_pg_in_map`, if a PG has unfound obejcts,
we add it to `pool_pg_unavailable_map`.

Fixes: https://tracker.ceph.com/issues/67777
Signed-off-by: Shraddha Agrawal <shraddha.agrawal000@gmail.com>

commit | commitdiff | tree

Kamoltat [Tue, 21 Nov 2023 18:55:29 +0000 (18:55 +0000)]

src/osd/PeeringState.cc: update last_unstale properly

Problem:

When we update the `pg_stat` we don't
check whether the pg state is in `stale`.
Therefore, the attribute `last_unstale`
will always get updated even if the pg
state actually contains `stale`.

Solution:

Place a condition to only update
the attribute `last_unstale` when
we the pg truly doesn't have `stale`
in its state.

Fixes: https://tracker.ceph.com/issues/67777
Signed-off-by: Kamoltat <ksirivad@redhat.com>

commit | commitdiff | tree

Kamoltat [Tue, 10 Oct 2023 15:15:35 +0000 (15:15 +0000)]

src/mgr/OSDMonitor.cc Add command `ceph osd pool availability-status`

```
ceph osd pool availability-status
```
outputs:

`POOL`
`UPTIME`
`DOWNTIME`
`NUMFAILURES`
`MTBF`
`MTTR`
`SCORE`
`AVAILABLE`

Fixes: https://tracker.ceph.com/issues/67777
Signed-off-by: Kamoltat <ksirivad@redhat.com>

commit | commitdiff | tree

Kamoltat [Thu, 26 Oct 2023 19:08:37 +0000 (19:08 +0000)]

src/mon/PGMap.cc: init pool_availability

Added PoolAvailability Struct

Modified PGMap.cc to include a k,v map:
`pool_availability`.

The key being the `poolid` and value
is `PoolAvailability`

Init the function:
`PGMap::get_unavailable_pg_in_pool_map()`
to identify and aggregate all the PGs we
mark as `unavailable` as well as the pool
that associates with the unavailable PG.

Also, included `pool_availability`
to `PGMapDigest::dump()`.

Fixes: https://tracker.ceph.com/issues/67777
Signed-off-by: Kamoltat <ksirivad@redhat.com>

commit | commitdiff | tree

Max Kellermann [Thu, 24 Apr 2025 09:12:12 +0000 (11:12 +0200)]

Merge pull request #62941 from MaxKellermann/mds_Locker__abort

mds/Locker: use ceph_abort_msg() instead of ceph_assert()

Reviewed-by: Venky Shankar <vshankar@redhat.com>

commit | commitdiff | tree

Adam Kupczyk [Tue, 22 Apr 2025 11:23:35 +0000 (11:23 +0000)]

qa/suites/rados/bluestore: Add standalone tests for write_v2

Standalone tests for ceph_test_objectstore require separate instances
for testing write_v2=true.

Signed-off-by: Adam Kupczyk <akupczyk@ibm.com>

commit | commitdiff | tree

Adam Kupczyk [Thu, 13 Jun 2024 17:34:57 +0000 (17:34 +0000)]

os/bluestore: Add "bluestore compression stats"

Add new admin socket command to inspect Estimator stats per collection.

Signed-off-by: Adam Kupczyk <akupczyk@ibm.com>

commit | commitdiff | tree

Adam Kupczyk [Mon, 10 Jun 2024 16:03:24 +0000 (16:03 +0000)]

os/bluestore: Add admin socket commands to inspect onode metadata

Add admin socket commands:
1) bluestore collections
Lists collections.
2) bluestore list <coll> [start object] [max count]
Lists collection coll starting from object (optional). Default 100 entries. 0 = unlimited.
3) bluestore onode metadata <object>
Prints onode metadata as seen by BlueStore.

It might happen (usually in tests) that 2 BlueStore instances are created at the same time.
Since admin commands are unique, it fails to register.
Use first register to detect whether we can register at all.

Signed-off-by: Adam Kupczyk <akupczyk@ibm.com>

commit | commitdiff | tree

Adam Kupczyk [Wed, 29 May 2024 06:34:23 +0000 (06:34 +0000)]

test/objectstore/store_test: Adapt tests to write_v2

Tests that use original write path specific knowledge are failing now.
Falling back to write_v1 in these tests.

Signed-off-by: Adam Kupczyk <akupczyk@ibm.com>

commit | commitdiff | tree

Adam Kupczyk [Tue, 8 Apr 2025 08:36:21 +0000 (08:36 +0000)]

os/bluestore: Add do_write_v2_compressed()

Modify do_write_v2() to branch into do_write_v2_compressed().
Segmented and regular cases are recognized and handled properly.
New do_write_v2_compressed() oversees compression / recompression.

Make one Estimator per Collection.
It makes possible for estimator to learn in collection specific compressibility.
In write_v2_compressed use compressor already selected in choose_write_options.
Make Collection create Estimator on first use.

Signed-off-by: Adam Kupczyk <akupczyk@ibm.com>

commit | commitdiff | tree

Adam Kupczyk [Tue, 8 Apr 2025 11:03:22 +0000 (11:03 +0000)]

os/bluestore/compression: Main part of recompression feature

Add feature of recompression scanner that looks around write region to see how much
would be gained, if we read some more around and wrote more.
Added Compression.h / Compression.cc.
Added debug_bluestore_compression dout.
Created Scanner class.
Provides write_lookaround() for scanning loaded extents.

Signed-off-by: Adam Kupczyk <akupczyk@ibm.com>

commit | commitdiff | tree

Adam Kupczyk [Wed, 9 Apr 2025 16:03:52 +0000 (16:03 +0000)]

os/bluestore/compression: Estimator class

Add CMake rules to compile.
Add bluestore_compression dout subsys.

Created Estimator class.
It is used by Scanner to decide if specific extent is to be recompressed.
Prepare for future machine learning / adaptive algorithm for estimation.

So far logic of Estimator is relatively simple.
It learns expected recompression values and uses them in next iterations to predict.

Signed-off-by: Adam Kupczyk <akupczyk@ibm.com>

commit | commitdiff | tree

Radoslaw Zarzynski [Thu, 24 Apr 2025 06:17:51 +0000 (08:17 +0200)]

Merge pull request #59248 from kamoltat/wip-ksirivad-improve-netsplit-warning

HealthMonitor: Add topology-aware netsplit detection and warning

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
Reviewed-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Max Kellermann [Thu, 24 Apr 2025 05:17:48 +0000 (07:17 +0200)]

mds/Locker: use ceph_abort_msg() instead of ceph_assert()

This ceph_assert() always fails, but depending on the configuration
value `ceph_assert_supresssions`, execution may continue, but the
`dir` variable is left uninitialized.  This leads to a compiler
warning:

/home/jenkins-build/build/workspace/ceph-api/src/mds/Locker.cc:451:22: error: variable 'dir' is used uninitialized whenever 'if' condition is false [-Werror,-Wsometimes-uninitialized]

clang then suggests to nullptr-initialize the variable:

/home/jenkins-build/build/workspace/ceph-api/src/mds/Locker.cc:447:11: note: initialize the variable 'dir' to silence this warning
   447 |         CDir *dir;
       |                  ^
       |                   = nullptr

This, however, is a very bad idea because all this does is suppress
the warning; it still crashes the process.

Since there's no recovery from this problem, let's switch to
ceph_abort_msg() which is [[noreturn]] and the compiler can deduce
that `dir` is always initialized when it's used.

Signed-off-by: Max Kellermann <max.kellermann@ionos.com>

commit | commitdiff | tree

Ronen Friedman [Thu, 24 Apr 2025 05:17:33 +0000 (08:17 +0300)]

Merge pull request #62693 from ronen-fr/wip-rf-iocnt

osd/scrub: performance counters for I/O performed by the scrubber

Reviewed-by: Alex Ainscow <aainscow@uk.ibm.com>
Reviewed-by: Bill Scales <bill_scales@uk.ibm.com>
Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>

commit | commitdiff | tree

Yingxin Cheng [Wed, 23 Apr 2025 06:37:43 +0000 (14:37 +0800)]

crimson/os/seastore/seastore_types: introduce and use paddr_t::is_absolute_segmented/random_block()

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Yingxin Cheng [Wed, 23 Apr 2025 06:22:58 +0000 (14:22 +0800)]

crimson/os/seastore/lba_mapping: fix LBAMapping::is_zero_reserved()

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Yingxin Cheng [Wed, 23 Apr 2025 06:21:30 +0000 (14:21 +0800)]

crimson/os/seastore/seastore_types: adjust paddr_t::is_real_location()

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Ilya Dryomov [Wed, 23 Apr 2025 22:28:52 +0000 (00:28 +0200)]

Merge pull request #62898 from nbalacha/wip-nbalacha-70963

rbd: display mirror state creating

Reviewed-by: Ramana Raja <rraja@redhat.com>
Reviewed-by: Ilya Dryomov <idryomov@gmail.com>

commit | commitdiff | tree

Casey Bodley [Wed, 23 Apr 2025 22:28:16 +0000 (18:28 -0400)]

Merge pull request #60899 from clwluvw/curl-einval

rgw: handle EINVAL translation in forward_request

Reviewed-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

Casey Bodley [Wed, 23 Apr 2025 20:42:08 +0000 (16:42 -0400)]

Merge pull request #62888 from clwluvw/neorados-fifotrim

neorados: relax fifo trim error for ENODATA

Reviewed-by: Adam C. Emerson <aemerson@redhat.com>

commit | commitdiff | tree

Bernard Landon [Fri, 14 Mar 2025 14:25:00 +0000 (14:25 +0000)]

src/pybind/mgr/cephadm/service_discovery: enhanced service to allow discovery of custom containers

Fixes: https://tracker.ceph.com/issues/70482
Signed-off-by: Bernard Landon <bernard@lndn.ch>

commit | commitdiff | tree

Casey Bodley [Wed, 23 Apr 2025 19:06:19 +0000 (15:06 -0400)]

doc/rgw: release note for GetObjectAttributes

Signed-off-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

Casey Bodley [Wed, 23 Apr 2025 18:46:59 +0000 (14:46 -0400)]

Merge pull request #62902 from cbodley/wip-70700-disable

cmake/common: temporarily remove decode_start_v_checker tests

Reviewed-by: Dan Mick <dmick@redhat.com>
Reviewed-by: Laura Flores <lflores@redhat.com>

commit | commitdiff | tree

Casey Bodley [Wed, 23 Apr 2025 18:02:16 +0000 (14:02 -0400)]

Merge pull request #60227 from clwluvw/zonegroup-delbucket

rgw: skip empty check on non-owned buckets by zonegroup

Reviewed-by: Casey Bodley <cbodley@redhat.com>
Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

commit | commitdiff | tree

Casey Bodley [Wed, 23 Apr 2025 18:00:56 +0000 (14:00 -0400)]

Merge pull request #62738 from clwluvw/copy-obj-remote-zonegroup

rgw: dont store replication attrs on remote copy obj

Reviewed-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

Ronen Friedman [Tue, 15 Apr 2025 08:34:06 +0000 (03:34 -0500)]

osd/scrub: count scrub I/O

Implement I/O counting in the PGBackend::be_scan_list()
and relevant functions it calls.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>

commit | commitdiff | tree

Matan Breizman [Wed, 23 Apr 2025 15:35:28 +0000 (18:35 +0300)]

Merge pull request #62699 from Matan-B/wip-matanb-crimson-ignore-abort-v2

crimson/common/errorator: rework aborts error handlers

Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Radoslaw Zarzynski [Wed, 23 Apr 2025 15:19:31 +0000 (17:19 +0200)]

Merge pull request #62556 from aainscow/ec_pr_and_prereqs

osd: Optimised EC

Reviewed-by: Radoslaw Zarzynski <rzarzynski@redhat.com>
Reviewed-by: Laura Flores <lflores@redhat.com>

commit | commitdiff | tree

N Balachandran [Mon, 21 Apr 2025 11:34:08 +0000 (17:04 +0530)]

rbd: display correct mirror state when creating

The mirror image state is set to MIRROR_IMAGE_STATE_CREATING
when the image is first created on the secondary, but was displayed
as "unknown" by the rbd info command. This has been fixed.

Fixes: https://tracker.ceph.com/issues/70963
Signed-off-by: N Balachandran <nithya.balachandran@ibm.com>

commit | commitdiff | tree

Laura Flores [Wed, 23 Apr 2025 15:06:56 +0000 (10:06 -0500)]

Merge pull request #62710 from bill-scales/ec_backfill

osd: EC Optimizations: Backfill changes for partial writes

commit | commitdiff | tree

Vallari Agrawal [Wed, 23 Apr 2025 13:17:12 +0000 (18:47 +0530)]

Merge pull request #62725 from VallariAg/nvmeof-teuthology-fio

qa/suites/nvmeof: Fix thrasher and fio script

commit | commitdiff | tree

Rishabh Dave [Wed, 23 Apr 2025 12:15:54 +0000 (17:45 +0530)]

Merge pull request #60731 from joscollin/wip-B68954-check-headers-journal-recovery

cephfs-journal-tool: check the headers in dump file after journal recovery

Reviewed-by: Dhairya Parmar <dparmar@redhat.com>
Reviewed-by: Rishabh Dave <ridave@redhat.com>

commit | commitdiff | tree

baum [Wed, 23 Apr 2025 10:36:16 +0000 (13:36 +0300)]

Merge pull request #62914 from baum/ms_dispatch2_clean_up

src/nvmeof/NVMeofGwMonitorClient.cc: ms_dispatch2 clean up

commit | commitdiff | tree

Zac Dover [Wed, 23 Apr 2025 09:27:00 +0000 (19:27 +1000)]

Merge pull request #62696 from anthonyeleven/mgr-prom

doc/mgr: Improve prometheus.rst

Reviewed-by: Zac Dover <zac.dover@proton.me>

commit | commitdiff | tree

Venky Shankar [Wed, 23 Apr 2025 09:16:03 +0000 (14:46 +0530)]

Merge PR #62577 into main

* refs/pull/62577/head:
libcephfs_proxy: avoid libc buffering for logging

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Anoop C S <anoopcs@cryptolab.net>

commit | commitdiff | tree

Zac Dover [Wed, 23 Apr 2025 09:15:19 +0000 (19:15 +1000)]

Merge branch 'main' into mgr-prom

Signed-off-by: Zac Dover <zac.dover@proton.me>

commit | commitdiff | tree

Jos Collin [Tue, 11 Feb 2025 10:45:51 +0000 (16:15 +0530)]

qa: test 'journal import' recognizes invalid headers post journal recovery

Fixes: https://tracker.ceph.com/issues/68954
Signed-off-by: Jos Collin <jcollin@redhat.com>

commit | commitdiff | tree

Jos Collin [Thu, 14 Nov 2024 05:12:18 +0000 (10:42 +0530)]

cephfs-journal-tool: check the headers in dump file after journal recovery

Fixes: https://tracker.ceph.com/issues/68954
Signed-off-by: Jos Collin <jcollin@redhat.com>

commit | commitdiff | tree

Anthony D'Atri [Mon, 7 Apr 2025 03:03:53 +0000 (23:03 -0400)]

doc/mgr: Improve prometheus.rst

Signed-off-by: Anthony D'Atri <anthonyeleven@users.noreply.github.com>

commit | commitdiff | tree

Anthony D'Atri [Wed, 23 Apr 2025 03:25:27 +0000 (23:25 -0400)]

Merge pull request #62911 from bluikko/doc-cleanup-radosgw

doc/radosgw: Fix indentation in admin.rst

commit | commitdiff | tree

Zac Dover [Tue, 22 Apr 2025 23:31:21 +0000 (09:31 +1000)]

Merge pull request #62896 from zdover23/wip-doc-2025-04-21-revert-62782-c4f0f8e

doc: Revert "doc/mgr: Promptify CLI commands and other formatting fixes"

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

commit | commitdiff | tree

Kamoltat Sirivadhna [Fri, 23 Aug 2024 20:24:36 +0000 (20:24 +0000)]

doc/rados/operations/health-checks: Add MON_NETSPLIT Warning

Fixes: https://tracker.ceph.com/issues/67371
Signed-off-by: Kamoltat Sirivadhna <ksirivad@redhat.com>

commit | commitdiff | tree

Kamoltat Sirivadhna [Thu, 15 Aug 2024 20:25:43 +0000 (20:25 +0000)]

HealthMonitor: Add topology-aware netsplit detection and warning

Problem:
Currently, Ceph cannot detect and report network partitions (netsplits)
between monitors in different topology locations in a consolidated way.
While stretch mode can handle partitions through monitor elections,
users lack visibility into the topology-level view of network
disconnections, making troubleshooting difficult.

Solution:
This implementation adds a hierarchical netsplit detection mechanism that:
- Uses DirectedGraph structure for netsplit detection
- Maps monitor disconnections to relevant CRUSH topology levels
- Aggregates individual disconnections into location-level reports when appropriate
- Detects complete location-level netsplits when ALL monitors between locations
  cannot communicate
- Reports specific topology locations experiencing complete communication failures
- Falls back to individual monitor-level reporting for partial disconnections
- Handles monitors with missing location data gracefully
- Leverages HealthMonitor::check_for_mon_down to receive a set of down monitors,
  efficiently avoiding false netsplit reports for monitors already known to be down
- Implements smart filtering that correctly excludes down monitors from location-based
  analysis, ensuring accurate netsplit reporting at both individual and topology levels

The implementation produces user-friendly health warnings:
1. For complete location netsplits: "Netsplit detected between dc1 and dc2"
2. For individual monitor disconnections: "Netsplit detected between mon.a and mon.d"

Performance considerations:
- Time complexity: O(m²) where m is the number of monitors
- Space complexity: O(m²) for connection tracking
- Practical impact is minimal as monitor count is typically small (3-7)

Fixes: https://tracker.ceph.com/issues/67371
Signed-off-by: Kamoltat Sirivadhna <ksirivad@redhat.com>
Conflicts:
src/mon/Elector.cc - Trivial Fix

Unnamed repository; edit this file 'description' to name the repository.

RSS Atom