git.apps.os.sepia.ceph.com Git

Merge PR #60398 into wip-jcollin-testing-20251001.014437-squid

* refs/pull/60398/head:
doc/cephfs: update about changing max_mds FS setting variable
qa/cephfs: add tests for confirmationn required to change max_mds
mon,cephfs: require confirmation when changing max_mds on unhealthy cluster

Merge PR #64886 into wip-jcollin-testing-20251001.014437-squid

* refs/pull/64886/head:
mds/MDSDaemon: unlock `mds_lock` while shutting down Beacon and others

Merge PR #65267 into wip-jcollin-testing-20251001.014437-squid

* refs/pull/65267/head:
mds: include auth credential in session dump

Merge PR #65359 into wip-jcollin-testing-20251001.014437-squid

* refs/pull/65359/head:
qa: Disable a test for kernel mount
src/test/mds: Fix TestMDSAuthCaps
client: Fix the multifs auth caps check
mds: Fix multifs auth caps check
qa: Fix validation of client_version
qa: Test cross fs access by single client in multifs
qa: Run test_admin with the squid client
client: Fix leading / issue with mds_check_access
qa: Add mds caps test for testing fs read and a path rw

Merge PR #65363 into wip-jcollin-testing-20251001.014437-squid

* refs/pull/65363/head:
test/libcephfs: use more entries to reproduce snapdiff fragmentation
mds: rollback the snapdiff fragment entries with the same name if needed.
test/libcephfs: Polisihing SnapdiffDeletionRecreation case
Test failure: LibCephFS.SnapdiffDeletionRecreation

Merge PR #65602 into wip-jcollin-testing-20251001.014437-squid

* refs/pull/65602/head:
cephfs-journal-tool:: Don't reset the journal trim position
qa: Validate cephfs-journal-tool reset trim

Merge pull request #65709 from aaSharma14/wip-73293-squid

squid: monitoring: fix MTU Mismatch alert rule and expr

Reviewed-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>

Merge pull request #65706 from rhcs-dashboard/wip-73274-squid

squid: mgr/dashboard: Blank entry for Storage Capacity in dashboard under Cluster > Expand Cluster > Review

Reviewed-by: Pedro Gonzalez Gomez <pegonzal@ibm.com>

monitoring: fix MTU Mismatch alert rule and expr

Fixes: https://tracker.ceph.com/issues/73290
Signed-off-by: Aashish Sharma <aasharma@redhat.com>
(cherry picked from commit bee24dec441b9e6b263e4498c2ab333b0a60a52d)

Conflicts:
monitoring/ceph-mixin/prometheus_alerts.yml
monitoring/ceph-mixin/tests_alerts/test_alerts.yml
src/pybind/mgr/dashboard/frontend/src/app/ceph/cluster/prometheus/active-alert-list/active-alert-list.component.html
src/pybind/mgr/dashboard/frontend/src/app/ceph/cluster/prometheus/active-alert-list/active-alert-list.component.ts
src/pybind/mgr/dashboard/frontend/src/app/shared/datatable/table-key-value/table-key-value.component.scss

mgr/dashboard: Blank entry for Storage Capacity in dashboard under Cluster > Expand Cluster > Review

https://tracker.ceph.com/issues/73220

Signed-off-by: Naman Munet <naman.munet@ibm.com>
(cherry picked from commit a01909e7588c7ff757079475e3ea6f1dc3054db7)

Merge pull request #64456 from cbodley/wip-72090-squid

squid: deb/mgr: remove deprecated distutils from ceph-mgr.requires

Reviewed-by: Nizamudeen A <nia@redhat.com>

Merge pull request #65141 from mchangir/wip-70925-squid

squid: mds: fix heap-use-after-free in C_Flush_Journal

Merge pull request #65620 from aaSharma14/wip-73167-squid

squid: mgr/dashboard: fix zone update API forcing STANDARD storage class

Reviewed-by: Afreen Misbah <afreen@ibm.com>

Merge pull request #65671 from aaSharma14/wip-73231-squid

squid: monitoring: fix "In" OSDs in Cluster-Advanced grafana panel. Also change units from decbytes to bytes wherever used in the panel

Reviewed-by: Afreen Misbah <afreen@ibm.com>

monitoring/ceph_mixin: fix Cluster - Advanced OSD grafana panel

1. Fixes the promql expr used to calculate "In" OSDs in
ceph-cluster-advanced.json.
2. Fixes the color coding for the single state panels used in the OSDs
grafana panel like "In", "Out" etc

Fixes: https://tracker.ceph.com/issues/72810
Signed-off-by: Aashish Sharma <aasharma@redhat.com>
(cherry picked from commit 53a6856d603e0fe4ff31f76e19263a80359a9f1d)

Merge pull request #65659 from ceph/wip-squid-noble

squid: cmake: remove _FORTIFY_SOURCE define

Merge pull request #64605 from cbodley/wip-72190-squid

squid: deb/cephadm: add explicit --home for cephadm user

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

cmake: remove _FORTIFY_SOURCE define

according to `dpkg-buildflags`, ubuntu 24 raised this value to
`-D_FORTIFY_SOURCE=3` which causes `error: "_FORTIFY_SOURCE" redefined`
compilation failures because Ceph itself adds `-D_FORTIFY_SOURCE=2`

`_FORTIFY_SOURCE` is a hardening option. both our rpm and debian builds
already specify that via environment variables, so Ceph's cmake should
leave it alone

Fixes: https://tracker.ceph.com/issues/72361
Signed-off-by: Casey Bodley <cbodley@redhat.com>
(cherry picked from commit 66bec97b0dc90b91f5be586351f52082beb6374a)

Merge pull request #61166 from anoopcs9/wip-69306-squid

squid: client: Handle empty pathnames for `ceph_chownat()` and `ceph_statxat()`

Reviewed-by: Rishabh Dave <ridave@redhat.com>

Merge pull request #65636 from adk3798/squid-cephadm-pin-cheroot

squid: pybind/mgr: pin cheroot version in requirements-required.txt

Reviewed-by: John Mulligan <jmulligan@redhat.com>

Merge pull request #65588 from adamemerson/wip-perfcounters-unique-string-squid

squid: common: Allow PerfCounters to return a provided service ID

Reviewed-by: Casey Bodley <cbodley@redhat.com>

Merge pull request #65556 from jzhu116-bloomberg/wip-72972-squid

squid: rgw: discard olh_ attributes when copying object from a versioning-suspended bucket to a versioning-disabled bucket

Reviewed-by: Adam Emerson <aemerson@redhat.com>

mgr/dashboard: bump cheroot to > 10.0

Fixes: https://tracker.ceph.com/issues/55837
Signed-off-by: Nizamudeen A <nia@redhat.com>
(cherry picked from commit 1ec74a8360d1c4abb39754320eba118d080e3499)

client: Gracefully handle empty pathname for statxat()

man statx(2)[1] says the following:
. . .
AT_EMPTY_PATH
    If pathname is an empty string, operate on the file referred to by
    dirfd (which may have been obtained using the open(2) O_PATH flag).
    In this case, dirfd can refer to any type of file, not just a
    directory.

    If dirfd is AT_FDCWD, the call operates on the current working
    directory.
. . .

Look out for an empty pathname and use the relative fd's inode in the
presence of AT_EMPTY_PATH flag before calling internal _getattr().

Fixes: https://tracker.ceph.com/issues/68189
Review with: git show -w

[1] https://www.man7.org/linux/man-pages/man2/statx.2.html

Signed-off-by: Anoop C S <anoopcs@cryptolab.net>
(cherry picked from commit edd7fe76c4919bc243377c6d7aae20b0606b89c3)

Conflicts:
        src/client/Client.cc
- path_walk() refactor from https://github.com/ceph/ceph/pull/62095
  included the required core changes.

libcephfs.h: Fix API documentation for ceph_statxat

flags parameter for ceph_statxat() API is supposed to accept only
AT_STATX_DONT_SYNC and AT_SYMLINK_NOFOLLOW. Modify the corresponding
documentation to reflect the acceptance of above two flags.

Signed-off-by: Anoop C S <anoopcs@cryptolab.net>
(cherry picked from commit 92c5ab99b8dcaae56e4a92cfe72a7e3d343b8a0c)

client: Gracefully handle empty pathname for chownat()

man fchownat(2)[1] says the following:
. . .
AT_EMPTY_PATH (since Linux 2.6.39)
    If pathname is an empty string, operate on the file referred to by
    dirfd (which may have been obtained using the open(2) O_PATH flag).
    In this case, dirfd can refer to any type of file, not just a
    directory. If dirfd is AT_FDCWD, the call operates on the current
    working directory.
. . .

Look out for an empty pathname and use the relative fd's inode in the
presence of AT_EMPTY_PATH flag before calling internal _setattr().

Fixes: https://tracker.ceph.com/issues/68189
Review with: git show -w

[1] https://www.man7.org/linux/man-pages/man2/fchownat.2.html

Signed-off-by: Anoop C S <anoopcs@cryptolab.net>
(cherry picked from commit 829f38899226fcd1f603ba446b018f53c5b0921d)

Conflicts:
        src/client/Client.cc
- path_walk() refactor from https://github.com/ceph/ceph/pull/62095
  included the required core changes.

Merge pull request #65639 from zdover23/wip-doc-2025-09-23-squid-remove-cloud-restore-rst

squid: doc/radosgw: remove cloud-restore from squid

Reviewed-by: Casey Bodley <cbodley@redhat.com>

Merge pull request #61451 from anoopcs9/wip-69556-squid

squid: mds: Fix invalid access of mdr->dn[0].back()

Reviewed-by: Rishabh Dave <ridave@redhat.com>

Merge pull request #62391 from neesingh-rh/wip-70416-squid

squid: cephfs-shell: add option to remove xattr

Reviewed-by: Rishabh Dave <ridave@redhat.com>

Merge pull request #64652 from rishabh-d-dave/wip-72200-squid

squid: mgr/vol: keep and show clone source info

Reviewed-by: Rishabh Dave <ridave@redhat.com>

Merge pull request #65279 from joscollin/wip-67809-squid

squid: mds: add more debug logs and log events

Reviewed-by: Rishabh Dave <ridave@redhat.com>

Merge pull request #65280 from joscollin/wip-69369-squid

squid: qa: use a larger timeout for kernel_untar_build workunit

Reviewed-by: Rishabh Dave <ridave@redhat.com>

Merge pull request #63019 from batrick/wip-71094-squid

squid: mds: check for snapshots on parent snaprealms

Reviewed-by: Jos Collin <jcollin@redhat.com>

Merge pull request #62499 from batrick/wip-70663-squid

squid: client: ll_walk will process absolute paths as relative

Reviewed-by: Jos Collin <jcollin@redhat.com>

Merge pull request #65629 from phlogistonjohn/jjm-s-65514

squid: build-with-container: add argument groups to organize options

doc/radosgw: remove cloud-restore from squid

Remove doc/radosgw/cloud-restore.rst from the Squid branch.

cloud-restore does not appear in index.rst, so its removal from
index.rst is unnecessary.

Signed-off-by: Zac Dover <zac.dover@proton.me>

Merge pull request #64090 from vshankar/wip-cephfs-client-fixes-squid

squid: client: cephfs user-space client fixes

Reviewed-by: Jos Collin <jcollin@redhat.com>

pybind/mgr: pin cheroot version in requirements-required.txt

With python 3.10 (didn't seem to happen with python 3.12) the
pybind/mgr/cephadm/tests/test_node_proxy.py test times out.
This appears to be related to a new release of the cheroot
package and a github issues describing the same problem
we're seeing has been opened by another user
https://github.com/cherrypy/cheroot/issues/769

It is worth noting that the workaround described in that
issue does also work for us. If you add

```
import cheroot
cheroot.server.HTTPServer._serve_unservicable = lambda: None
```

after the existing imports in test_node_proxy.py the
test hanging issue also disappears. Also worth noting the
particular pin of

cheroot~=10.0

was chosen as it matches the existing pin being used
in pybind/mgr/dashboard/constraints.txt

Signed-off-by: Adam King <adking@redhat.com>
(cherry picked from commit 6231955b5d00ae6b3630ee94e85b2449092ef0fe)

Merge pull request #61274 from kotreshhr/wip-68940-squid

squid: ceph-fuse: Improve fuse mount usage message

Reviewed-by: Jos Collin <jcollin@redhat.com>

Merge pull request #62517 from salieri11/wip-70631-squid

squid: mds: add MDS asok command for dumping stray directories

Reviewed-by: Jos Collin <jcollin@redhat.com>

Merge pull request #65133 from chrisphoffman/wip-72645-squid

squid: client: use path supplied in statfs

Reviewed-by: Jos Collin <jcollin@redhat.com>

build-with-container: add argument groups to organize options

Use the argparse add_argument_group feature to organize the mass of
arguments into more sensible categories. Hopefully, someone reading
over the `--help` output can now more easily see options that
are useful rather than being overwhelmed by a wall of text.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
(cherry picked from commit 71a1be4dd0aea004da56c2f518ee70a281a3f7d3)

mgr/dashboard: fix zone update API forcing STANDARD storage class

The zone update REST API (`edit_zone`) always attempted to configure a
placement target for the `STANDARD` storage class, even when the request
was intended for a different storage class name.
This caused failures in deployments where `STANDARD` is not defined.

Changes:
Club add placement target and add storage class methods into one single
add_placement_targets_storage_class_zone method which takes the storage
class as a param as well alongside the rest of the placement params.

Fixes: https://tracker.ceph.com/issues/73105
Signed-off-by: Aashish Sharma <aasharma@redhat.com>
(cherry picked from commit 135f3adb4973be493925839e946e7a5fc75e7d5c)

cephfs-journal-tool:: Don't reset the journal trim position

If the fs had to go through journal recovery and reset,
the cephfs-journal-tool resets the journal trim position
because of which the old unused journal objects just stay
forever in the metadata pool. The patch fixes the issue.
Now, the old stale journal objects are trimmed during the
regular trimming cycle helping to recover space in the
metadata pool.

Fixes: https://tracker.ceph.com/issues/69708
Signed-off-by: Kotresh HR <khiremat@redhat.com>
(cherry picked from commit 4f9a926a467c03a410e5ec5a81031e72f2193f25)

qa: Validate cephfs-journal-tool reset trim

Validates that the cephfs-journal-tool reset
doesn't reset the trim position so that the
journal trim takes care of trimming the older
unused journal objects helping to recover the
space in metadata pool.

Fixes: https://tracker.ceph.com/issues/69708
Signed-off-by: Kotresh HR <khiremat@redhat.com>
(cherry picked from commit 7e85556318ae2707730ed0d5f2ef9a1d817ec6e0)

mgr/vol: move reusable parts from async_cloner

Signed-off-by: Rishabh Dave <ridave@redhat.com>
(cherry picked from commit 8c536f78907fe1d42e1df612604d5872019b501d)

Conflicts:
src/pybind/mgr/volumes/fs/async_cloner.py
src/pybind/mgr/volumes/fs/operations/versions/subvolume_v1.py
- commit 8c536f78907f was missing which led to conflict.

rgw: Record the `service_unique_id`, if present, in the SrviceMap

For consistency and ease associating the two.

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
(cherry picked from commit 3a94a7b2ed02d20b2bc839b283e60cf4778f69e4)
Signed-off-by: Adam C. Emerson <aemerson@redhat.com>

common: Allow PerfCounters to return a provided service ID

Dashboard has asked for a unique identifier that can be associated
with services. This commit provides a component of that
functionality. Enforcing uniqueness is beyond the scope of this PR and
is the responsibility of cluster setup and orchestration. The scope of
uniqueness is a matter of policy and up to the design of cluster setup
and orchestration software.

We provide the `--service_unique_id` argument that can be passed on
the command line when executing a Ceph service that uses
`global_init`. If non-empty, a `service_unique_id` section is added to
the PerfCounters dump for that service. This section has a single
entry whose name is set to the argument of `service_unique_id` and
whose value is arbitrary. If unspecified or empty, no
`service_unique_id` section is added.

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
(cherry picked from commit 6dc322421f7a3758251fe29e3f35934231358011)
Conflicts:
src/common/options/global.yaml.in
- Preceding options not in Squid

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>

Merge pull request #65219 from cbodley/wip-72715-squid

squid: rgw/s3: remove 'aws-chunked' from Content-Encoding response

Reviewed-by: Adam Emerson <aemerson@redhat.com>

Merge pull request #62051 from k0ste/wip-65923-squid

squid: CephContext: acquire _fork_watchers_lock in notify_post_fork()

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>

Merge pull request #63720 from batrick/wip-71554-squid

squid: test/libcephfs: copy DT_NEEDED entries from input libraries

Merge pull request #64936 from pritha-srivastava/wip-72464-squid

squid: rgw: check all JWKS for STS

rgw: discard olh_ attributes when copying object from a versioning-suspended bucket to a versioning-disabled bucket

Signed-off-by: Jane Zhu <jzhu116@bloomberg.net>
(cherry picked from commit 3fed58f43c3cb3977130926a2d1bca551deefade)

Merge pull request #65550 from stackhpc/doc-balancer-squid

squid: doc: Fixes a typo in balancer operations

doc: Fixes a typo in balancer operations

Signed-off-by: Tyler Brekke <tbrekke@digitalocean.com>
(cherry picked from commit b038b8093d01a5e676ffa419607489a79261ef29)

Merge pull request #65486 from tobias-urdin/squid-rgw-admin-bucket-pagination

squid: rgw/admin: Add max-entries and marker to bucket list

Reviewed-by: Adam C. Emerson <aemerson@redhat.com>

Merge pull request #59922 from mchangir/wip-68077-squid

squid: qa: relocate subvol creation overrides and test

Merge pull request #61857 from batrick/wip-69956-squid

squid: mds: do not schedule quiesce for non-head imported inodes

Merge pull request #64539 from batrick/wip-72163-squid

squid: mds: nudge log for unstable locks after early reply

Merge pull request #65492 from rhcs-dashboard/wip-72987-squid

squid: mgr/dashboard : Fixed mirrored image usage info bar

Reviewed-by: Afreen Misbah <afreen@ibm.com>

Merge pull request #65162 from joscollin/wip-72154-squid

squid: mds: dump export_ephemeral_random_pin as double

mgr/dashboard : Fixed mirrored image usage info bar
fixes : https://tracker.ceph.com/issues/72431

Signed-off-by: Abhishek Desai <abhishek.desai1@ibm.com>
(cherry picked from commit 3a192b7c38e3f1669f3deee31702ba802d7411fd)

radosgw-admin: Pass max_entries for bucket list

The changes in [1] did not take into account that
radosgw-admin code calls `RGWBucketAdminOp::info`
directly and passes a `RGWBucketAdminOpState`
struct where max_entries is not initialized so
we should not assume that it's zero.

This in turn broke the `bucket list --uid foo` and
`bucket stats --uid foo` commands as the output was
changed and thus not keeping backward compatibility.

This change makes sure that we populate max_entries
in `RGWBucketAdminOpState` if `--max-entries` argument
was specified otherwise we set it to zero to keep the
backward compatibility in the output format.

[1] https://github.com/ceph/ceph/pull/62777

Fixes: https://tracker.ceph.com/issues/72049
Signed-off-by: Tobias Urdin <tobias.urdin@binero.com>
(cherry picked from commit 3909c6554cdfcf1b05b5e32297b2e65e9c67af2b)

rgw/qa: Move admin pagination tests

Move the tests into qa directory and add it to the
rgw/verify suite so that we can run it in teuthology.

Signed-off-by: Tobias Urdin <tobias.urdin@binero.com>
(cherry picked from commit 57cbc9b6599be1e84c5bc209936080c4a04bb891)

rgw/doc: Add doc for admin bucket list pagination

This adds the documentation for the admin bucket list
pagination change.

Signed-off-by: Tobias Urdin <tobias.urdin@binero.com>
(cherry picked from commit 7fa025e08a21b03ce91556ffb936b9f26ffdc00f)

rgw/admin: Add max-entries and marker to bucket list

This adds pagination to the /admin/bucket endpoint for the
Admin API.

If a user has a lot of buckets the /admin/bucket endpoint
that is listing buckets can be so long that the HTTP request
gets a timeout.

This adds the ``max-entries`` and ``marker`` query parameters
to the API to support pagination. If ``max-entries`` is given
we introduce a new format for the HTTP response body the same
way that metadata API does, if it's not given we return the
response with the same body as before and thus retaining the
backward compatibility of the API.

This adds a Python3 based test suite that tests all of this
functionality to verify the behaviour and the HTTP response
body itself.

This fixes the pagination mentioned in tracker [1] and thus
fixes (or atleast partially fixes) that.

[1] https://tracker.ceph.com/issues/22168

Fixes: https://tracker.ceph.com/issues/22168
Signed-off-by: Tobias Urdin <tobias.urdin@binero.com>
(cherry picked from commit 1d5523ec0bec916e0a87fdcb8d27b67753e477b6)

Merge pull request #65475 from rhcs-dashboard/wip-72965-squid

squid: monitoring: add user-agent headers to the urllib

monitoring: add user-agent headers to the urllib

The documentation started raising 403 suddenly. Adding User-Agent
headers to the request

Signed-off-by: Nizamudeen A <nia@redhat.com>
(cherry picked from commit b8fe487010483681bbc8ddb8dfe18b40ebfd346b)

Merge pull request #64741 from cbodley/wip-71544-squid

squid: rgw/lc: stop using merge_and_store_attrs in remove_bucket_config

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>

Merge pull request #65302 from guits/wip-72781-squid

squid: ceph-volume: drop udevadm subprocess calls

test/libcephfs: use more entries to reproduce snapdiff fragmentation
issue.

Snapdiff listing fragments have different boundaries in Reef and Squid+
releases hence original reproducer (made for Reef) doesn't work properly
in S+ releases. This patch fixes that at cost of longer execution.
This might be redundant/senseless when backporting to Reef.

Related-to: https://tracker.ceph.com/issues/72518
Signed-off-by: Igor Fedotov <igor.fedotov@croit.io>
(cherry picked from commit 23397d32607fc307359d63cd651df3c83ada3a7f)

mds: rollback the snapdiff fragment entries with the same name if needed.

This is required when more entries with the same name don't fit into the
fragment. With the existing means for fragment offset specification such a splitting to be
prohibited.

Fixes: https://tracker.ceph.com/issues/72518
Signed-off-by: Igor Fedotov <igor.fedotov@croit.io>
(cherry picked from commit 24955e66f4826f8623d2bec1dbfc580f0e4c39ae)

test/libcephfs: Polisihing SnapdiffDeletionRecreation case

Signed-off-by: Igor Fedotov <igor.fedotov@croit.io>
(cherry picked from commit daf3350621cfafa383cd9deea81b60b775a53093)

Test failure: LibCephFS.SnapdiffDeletionRecreation
Reproduces: https://tracker.ceph.com/issues/72518
Signed-off-by: Md Mahamudur Rahaman Sajib <mahamudur.sajib@croit.io>
(cherry picked from commit 4ff71386ac1529dc1f7c2640511f509bd6842862)
(cherry picked from commit 48f5a5d04fb2cef52c5e4a3daf452ccf988666d2)

qa: Disable a test for kernel mount

The kclient fix isn't yet landed in the kernel and hence
the test 'test_multifs_single_client_cross_access_r_caps_end'
would fail for kernel mount. So disable the failing validation
in the test for kclient.

Fixes: https://tracker.ceph.com/issues/72167
Signed-off-by: Kotresh HR <khiremat@redhat.com>
(cherry picked from commit 485f37ac1fe7d233685ce1a1f9ac5360c142b1f5)

src/test/mds: Fix TestMDSAuthCaps

Fix the TestMDSAuthCaps after fixing
multifs authcaps comparison.

Fixes: https://tracker.ceph.com/issues/72167
Signed-off-by: Kotresh HR <khiremat@redhat.com>
(cherry picked from commit 92ab603e110e349342f3611e29f92fc64ae7d3ec)

client: Fix the multifs auth caps check

The fsname needs to be passed to validate the mds
auth caps check. This patch fixes the same.

Fixes: https://tracker.ceph.com/issues/72167
Signed-off-by: Kotresh HR <khiremat@redhat.com>
(cherry picked from commit b1d6bb95d3c805af37883ef481b96a1aa33cedf0)

mds: Fix multifs auth caps check

The fsname is not take into consideration while validating
the access check for the operations. This patch fixes
the same.

Fixes: https://tracker.ceph.com/issues/72167
Signed-off-by: Kotresh HR <khiremat@redhat.com>
(cherry picked from commit 1a5e083eb297dc611c8098abb67faf34fd8e4499)

qa: Fix validation of client_version

The multifs auth caps bug has a fix both in client and mds.
If it's old client and not patched, we expect that the fs
with 'rw' would end up having 'r' caps with the multifs
auth caps used as in the test
'test_multifs_single_client_cross_access_r_caps_end'.
This patch adds the conditional to validate the same.

This commit makes use of the PR #64005

Fixes: https://tracker.ceph.com/issues/72167
Signed-off-by: Kotresh HR <khiremat@redhat.com>
(cherry picked from commit f10e34d0350d216f02d4e73ec695340daae11dd4)

qa: Test cross fs access by single client in multifs

Fixes: https://tracker.ceph.com/issues/72167
Signed-off-by: Kotresh HR <khiremat@redhat.com>
(cherry picked from commit 3516db300d3688cd048542dbed2e0318f9ac5ad3)

Conflicts:
qa/tasks/cephfs/test_admin.py - The commit 1fda8ed2d4a9 isn't
backported

qa: Run test_admin with the squid client

This is required to test the features involving
fixes both in client and mds. This is to make
sure the older clients are not broken with the
fix. The version 19.2.2 is used for client.

The test suite sets up the cluster with squid
19.2.2 and upgrades only the ceph cluster node
leaving the client node.

Signed-off-by: Kotresh HR <khiremat@redhat.com>
(cherry picked from commit a4f97c0aa92c37113b33d63b57f2fae870f403a1)

client: Fix leading / issue with mds_check_access

The "Client::mds_check_access" expects the target_path without
leading '/' as it eventually calls the "MDSCapMatch::match_path"
which expects the target_path passed to be with out leading '/'
as well.

The single leading '/' was being removed. But absolute path
constructed did have leading '//', so removing all the leading
'/' was necessary.

This causes the clients not to be able to access a particular
path even though it has a rw permission on the specific path.

The patche fixes the leading '//' issue.

Fixes: https://tracker.ceph.com/issues/67212
Signed-off-by: Kotresh HR <khiremat@redhat.com>
(cherry picked from commit 2e2adb2719c40a87d9fc556f5fa492ef6a2580dc)

qa: Add mds caps test for testing fs read and a path rw

Fixes: https://tracker.ceph.com/issues/67212
Signed-off-by: Kotresh HR <khiremat@redhat.com>
(cherry picked from commit 983f893fb938b097682c10f3120460bfb2094bfd)

Merge pull request #65417 from ceph/fix-api-tests-squid

squid: pybind/mgr/dashboard: Use teuthology's actual requirements

pybind/mgr/dashboard: Use teuthology's actual requirements

Signed-off-by: David Galloway <david.galloway@ibm.com>
(cherry picked from commit 22a87d959bca74478de1e2d9f86859676385491d)

Merge pull request #65244 from ceph/wip-yuriw-squid-p2p-squid

qa/tests: added 19.2.3 to the mix

Reviewed-by: Laura Flores <lflores@redhat.com>

Merge pull request #65379 from zdover23/wip-doc-2025-09-04-backport-65325-to-squid

squid: doc/cephfs: edit troubleshooting.rst

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

doc/cephfs: edit troubleshooting.rst

Update the "Disconnected+Remounted FS" section in
doc/cephfs/troubleshooting.rst, as suggested by Venky Shankar in https://github.com/ceph/ceph/pull/65129/files#r2312903062

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit f4b40422fefaa993441396a5c31fbfd3d8714595)

Merge pull request #65202 from zdover23/wip-doc-2025-08-25-backport-65185-to-squid

squid: doc/cephfs: edit troubleshooting.rst (Slow MDS)

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

Merge pull request #65206 from zdover23/wip-doc-2025-08-26-backport-64074-to-squid

squid: doc/rados/configuration: Mention show-with-defaults and ceph-conf

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

Merge pull request #65211 from zdover23/wip-doc-2025-08-26-backport-65180-to-squid

squid: doc/dev:update blkin.rst doc for lttng trace

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

Merge pull request #65238 from zdover23/wip-doc-2025-08-26-backport-65230-to-squid

squid: doc/rados/operations: Improve health-checks.rst

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

Merge pull request #65311 from zdover23/wip-doc-2025-08-30-backport-8ff129c89-to-squid

squid: doc/dev/crimson: Update docs

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

doc/dev/crimson: Update docs

* CPU allocation missed some information and was confusing.
* Drop alienized term when possible
* introdice release/debug builds

Signed-off-by: Matan Breizman <mbreizma@redhat.com>
(cherry picked from commit 8ff129c89ffcd4831dcd9d8b8f0d49687cc57183)

Merge pull request #63130 from kshtsk/wip-71208-squid

squid: qa/tasks/cephfs/mount: use 'ip route' instead 'route'

Reviewed-by: Igor Fedotov <igor.fedotov@croit.io>
Reviewed-by: Venky Shankar <vshankar@redhat.com>

ceph-volume: drop udevadm subprocess calls

Calling udevadm via subprocess can cause processes to pile up
under heavy load on production clusters.
This commit switches to reading udev data directly from /run/udev/data,
which is mounted as tmpfs.

Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>
(cherry picked from commit 727e69db73ef39d51bdd45515536e59d3acff19e)

qa: use a larger timeout for kernel_untar_build workunit

Fixes: http://tracker.ceph.com/issues/68855
Signed-off-by: Venky Shankar <vshankar@redhat.com>
(cherry picked from commit b65d7492af0c0e31f5bf891554b7608fa4c9f395)

mds: add the event when creating the batch head

This could help us the debug the slow request or deadlock bugs and
we could know which stage the request has been in.

Fixes: https://tracker.ceph.com/issues/67280
Signed-off-by: Xiubo Li <xiubli@redhat.com>
(cherry picked from commit 7e08760a0e560759fa73e7b889494bdb60a03ff3)

mds: print the path depth to help debug mds issue

From the path depth we can get more useful info, such as whether
the path_traverse() acquired any locks, etc.

Fixes: https://tracker.ceph.com/issues/67280
Signed-off-by: Xiubo Li <xiubli@redhat.com>
(cherry picked from commit 5d8aa5245cb7e5d819651898533130e463b373fe)

doc/cephfs: edit troubleshooting.rst (Slow MDS)

Move the "Slow requests (MDS)" section immediately after the first
section in this document ("Slow/Stuck Operations"), because the first
procedure on the page directs the reader to undertake the operation in
"Slow requests (MDS)" before trying anything else.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 55af6643c9a119afc4e22e2591774e1d68ef5580)