git.apps.os.sepia.ceph.com Git

Merge PR #62050 into wip-jcollin-testing-20250926.150239-reef

* refs/pull/62050/head:
qa/cephfs: add tests for "fs subvolume create" command when...
mgr/vol: print proper message when subvolume metadata filename is too...

Merge PR #63069 into wip-jcollin-testing-20250926.150239-reef

* refs/pull/63069/head:
qa/cephfs: test that user created pool is not deleted by...
mgr/vol: don't delete user-created pool in "volume create" command
PendingReleaseNote: add note that "volume create" accepts pool names...
doc/cephfs: mention new options for "fs volume create" cmd
qa/cephfs: test passing pool names to "fs volume create" cmd
qa/cephfs: separate the tests for "ceph fs volume create" cmd
mgr/vol: allow passing pool names to "fs volume create" cmd

Merge PR #63224 into wip-jcollin-testing-20250926.150239-reef

* refs/pull/63224/head:
qa/cephfs: use "snapshot getpath" cmd instead of constructing...
qa/cephfs: add tests for "snapshot getpath" cmd against v1 and...
qa/cephfs: add a helper method to construct the snapshot path
qa/cephfs: move tests for "snapshot getpath" cmd to a separate class

Merge PR #65448 into wip-jcollin-testing-20250926.150239-reef

* refs/pull/65448/head:
qa/cephfs: ignore warning that pg is stuck peering for upgrade jobs

Merge PR #65603 into wip-jcollin-testing-20250926.150239-reef

* refs/pull/65603/head:
cephfs-journal-tool:: Don't reset the journal trim position
qa: Validate cephfs-journal-tool reset trim

Merge pull request #65621 from aaSharma14/wip-73165-reef

reef: mgr/dashboard: fix zone update API forcing STANDARD storage class

Reviewed-by: Afreen Misbah <afreen@ibm.com>

qa/cephfs: add tests for "fs subvolume create" command when...

subvolume metadata file name length is too long.

Signed-off-by: Rishabh Dave <ridave@redhat.com>
(cherry picked from commit 01f8f710c1a615bc9de03babc6d1fe7e0bc8a5bd)

Conflicts:
qa/tasks/cephfs/test_volumes.py
- Region where patch is to be applied has changed in Reef compared to
main.

mgr/vol: print proper message when subvolume metadata filename is too...

long.

When combination of subvolume group name and subvolume name is longer
than 248 characters, it leads to a failure because the metadata file is
a combination of both of these along with ":", "_" and ".meta".

Currently, when this comibnation longer than 248 characters, a bunch
stacktraces are printed along with multiple errors. This confuses the
user as well as looks bad.

Fixes: https://tracker.ceph.com/issues/69865
Signed-off-by: Rishabh Dave <ridave@redhat.com>
(cherry picked from commit 82fc1e7cac3d9157d9aaf95c45ded04c1b278183)

qa/cephfs: test that user created pool is not deleted by...

"volume create" command.

Signed-off-by: Rishabh Dave <ridave@redhat.com>
(cherry picked from commit ad1716ff15743dc45d88ddb529b4705b6bd780a6)

mgr/vol: don't delete user-created pool in "volume create" command

If one of the pool names passed to "ceph fs volume create" command
(through --data-pool and --meta-pool name) is absent, don't delete the
pool that is present and passed to this command during the cleanup code
of this command.

IOW, "volume create" command should continue deleting pool created by it
but not delete pool created by the user.

Fixes: https://tracker.ceph.com/issues/70945
Signed-off-by: Rishabh Dave <ridave@redhat.com>
(cherry picked from commit 4299f660ba9c83ef9305b2834c195da9008810a9)

PendingReleaseNote: add note that "volume create" accepts pool names...

in args.

Add a release note that "ceph fs volume create" command allows users to
pass pool names to "ceph fs volume create" command.

Signed-off-by: Rishabh Dave <ridave@redhat.com>
(cherry picked from commit a74dbbe8ee19d7b572c1a63984f6db8db203cd3b)

doc/cephfs: mention new options for "fs volume create" cmd

Command "ceph fs volume create" accepts 2 new options to allow users to
pass data and metadata pool name. Update docs to include mention of both
the options.

Signed-off-by: Rishabh Dave <ridave@redhat.com>
(cherry picked from commit 3044bf7e4b667fb5de5062c41e22e866fc6bb9a6)

qa/cephfs: test passing pool names to "fs volume create" cmd

Signed-off-by: Rishabh Dave <ridave@redhat.com>
(cherry picked from commit 3ae193035c8cca41a699d796e012ea5b23517767)

Conflicts:
qa/tasks/cephfs/test_volumes.py
- Imports on reef branch are slightly different from main branch leading
to a conflict.

qa/cephfs: separate the tests for "ceph fs volume create" cmd

Signed-off-by: Rishabh Dave <ridave@redhat.com>
(cherry picked from commit 058769c131754539efcb7f46164f7f164ecb1b13)

mgr/vol: allow passing pool names to "fs volume create" cmd

Fixes: https://tracker.ceph.com/issues/69878
Signed-off-by: Rishabh Dave <ridave@redhat.com>
(cherry picked from commit e794a19fd0da92c4331702ed4c35e9d659e53f81)

Merge pull request #62436 from rishabh-d-dave/mgr-vol-no-del-reef

reef: mgr/volumes: allow disabling async job threads

Reviewed-by: Jos Collin <jcollin@redhat.com>

PendingReleaseNote: add note for pause_purging and pause_cloninig

Added release notes for mgr/vol config option "pause_purging and
"pause_cloning".

Signed-off-by: Rishabh Dave <ridave@redhat.com>
(cherry picked from commit 03b90d589ca5759701164ea54b0dbf9b92c4efef)

doc/cephfs: add note for config option pause_purging and pause_cloning

Update documentation for add information about mgr/vol config options
"pause_purging" and "pause_cloning".

Signed-off-by: Rishabh Dave <ridave@redhat.com>
(cherry picked from commit 1f0bfe1c599769ac67d3b1b41f37bb3482e27839)

qa/cephfs: add tests for mgr/vol config pause_cloning

mgr/vol config option pause_cloning allows pausing of cloner threads.
Add tests for this.

Signed-off-by: Rishabh Dave <ridave@redhat.com>
(cherry picked from commit e7eb36e4393c8401c7cf1aa1f714b52c1ced9ca0)

qa/cephfs: extend wait for trash empty

Trash directory for a volume is not created by default. If
_wait_for_trash_empty() in test_volumes.py encounters absence of trash
directory, return true.

Signed-off-by: Rishabh Dave <ridave@redhat.com>
(cherry picked from commit 6d6be8b41c990acf2d9c08f35eb382996d59d5a7)

qa/cephfs: add tests for config option pause_purging

Setting MGR config option mgr/volumes/pause_purging to true halts
all ongoing purges and allows no new purging to begin until this option
is changed to false. Add tests for this.

Signed-off-by: Rishabh Dave <ridave@redhat.com>
(cherry picked from commit f38fcbc6109494e23e4948d794f59c927a9303ff)

Conflicts:
qa/tasks/cephfs/test_volumes.py
- First conflict occurred due to missing import of safe_while which
  in Reef branch compared to main branch. Along with resolving this
  conflict this has been imported as it used by the tests.
- Second conflict occured due to absence of some test methods right
  before where TestPausePurging was to be added.
- Third conflict occured because entire contextutil was imported instead
  of just safe_while and only CommandFailedError was imported from
  teuthology.exceptions while this commit imports MaxWhileTries too.

qa/cephfs: don't strip any whitespace for get_shell_stdout

Whitespace is not removed from the end of the stdout returned by the
method get_ceph_cmd_stdout(). Follow the same policy here since it is
better to not do so (this whitespace can be useful, when copying Ceph
auth keyrings from stdout to a file) and also for sake of uniformity of
interfaces.

Signed-off-by: Rishabh Dave <ridave@redhat.com>
(cherry picked from commit 9582b9b13a42fae4c7f38c22f9202eb893b6f1bc)

Conflicts:
qa/tasks/cephfs/mount.py
- Conflict occured for 2 reasons -
  - One, method get_shell_stdout() is absent on Reef branch but not in
    main so this patch which makes modification to it will obviously run
    in to conflict
   - Two, run_shell_payload() lies right next to get_shell_stdout() in
     main branch and its definition is quite different, leading to
     conflict again.

mgr/vol: add pause/resume mechanism for async jobs

Add mechansim that allows pausing/resuming of the entire async job
machinery that queues, launches and picks next async job; both async
jobs, clones as well as purges.

And then add mgr/vol config option pause_purging and pause_cloning so
that both of these async jobs can be paused and resumed individually.

Fixes: https://tracker.ceph.com/issues/61903
Fixes: https://tracker.ceph.com/issues/68630
Signed-off-by: Rishabh Dave <ridave@redhat.com>
(cherry picked from commit 01d37d5e1ba0e250e9d3a5f28ec7f3fa3597c63f)

Conflicts:
src/pybind/mgr/volumes/module.py
- Code where patch was to be applied was slighty different

Merge pull request #60630 from kamoltat/wip-68841-reef

reef: mon [stretch mode]: support disable_stretch_mode & qa/workunits/mon: ensure election strategy is "connectivity" for stretch mode

Merge pull request #65637 from adk3798/reef-cephadm-pin-cheroot

reef: pybind/mgr: pin cheroot version in requirements-required.txt

Reviewed-by: John Mulligan <jmulligan@redhat.com>

mgr/dashboard: bump cheroot to > 10.0

Fixes: https://tracker.ceph.com/issues/55837
Signed-off-by: Nizamudeen A <nia@redhat.com>
(cherry picked from commit 1ec74a8360d1c4abb39754320eba118d080e3499)

Merge pull request #65638 from zdover23/wip-doc-2025-09-23-reef-remove-cloud-restore-rst

reef: doc/radosgw: remove cloud-restore from reef

Reviewed-by: Casey Bodley <cbodley@redhat.com>

Merge pull request #61279 from vshankar/wip-68765-reef

reef: qa: increase the http.maxRequestBuffer to 100MB and enable the git debug logs

Reviewed-by: Jos Collin <jcollin@redhat.com>

Merge pull request #65630 from phlogistonjohn/jjm-r-65514

reef: build-with-container: add argument groups to organize options

pybind/mgr: pin cheroot version in requirements-required.txt

With python 3.10 (didn't seem to happen with python 3.12) the
pybind/mgr/cephadm/tests/test_node_proxy.py test times out.
This appears to be related to a new release of the cheroot
package and a github issues describing the same problem
we're seeing has been opened by another user
https://github.com/cherrypy/cheroot/issues/769

It is worth noting that the workaround described in that
issue does also work for us. If you add

```
import cheroot
cheroot.server.HTTPServer._serve_unservicable = lambda: None
```

after the existing imports in test_node_proxy.py the
test hanging issue also disappears. Also worth noting the
particular pin of

cheroot~=10.0

was chosen as it matches the existing pin being used
in pybind/mgr/dashboard/constraints.txt

Signed-off-by: Adam King <adking@redhat.com>
(cherry picked from commit 6231955b5d00ae6b3630ee94e85b2449092ef0fe)

doc/radosgw: remove cloud-restore from reef

Remove doc/radosgw/cloud-restore.rst from the reef branch.

cloud-restore does not appear in index.rst, so its removal from
index.rst is unnecessary.

Signed-off-by: Zac Dover <zac.dover@proton.me>

build-with-container: add argument groups to organize options

Use the argparse add_argument_group feature to organize the mass of
arguments into more sensible categories. Hopefully, someone reading
over the `--help` output can now more easily see options that
are useful rather than being overwhelmed by a wall of text.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
(cherry picked from commit 71a1be4dd0aea004da56c2f518ee70a281a3f7d3)

mgr/dashboard: fix zone update API forcing STANDARD storage class

The zone update REST API (`edit_zone`) always attempted to configure a
placement target for the `STANDARD` storage class, even when the request
was intended for a different storage class name.
This caused failures in deployments where `STANDARD` is not defined.

Changes:
Club add placement target and add storage class methods into one single
add_placement_targets_storage_class_zone method which takes the storage
class as a param as well alongside the rest of the placement params.

Fixes: https://tracker.ceph.com/issues/73105
Signed-off-by: Aashish Sharma <aasharma@redhat.com>
(cherry picked from commit 135f3adb4973be493925839e946e7a5fc75e7d5c)

qa/cephfs: ignore warning that pg is stuck peering for upgrade jobs

Health warning "pg .* is stuck peering" is seen while Ceph cluster is
under the upgrade process during fs/upgrade QA job. Being an expected
warning, it should be added to the ignorelist.

And besides this one, we already ignore more severe warnings ("pg is
stuck inactive" and "pg is degrarded") for fs/upgrade jobs.

Fixes: https://tracker.ceph.com/issues/70023
Signed-off-by: Rishabh Dave <ridave@redhat.com>
(cherry picked from commit 9748de76e02254c6dc284dcc20ec5d5761760dcb)

Merge pull request #65297 from joscollin/wip-71832-reef

reef: mgr/volumes: Keep mon caps if auth key has remaining mds/osd caps

Reviewed-by: Kotresh HR <khiremat@redhat.com>

Merge pull request #61297 from batrick/wip-68451-reef

reef: qa: ignore pg availability/degraded warnings

Reviewed-by: Jos Collin <jcollin@redhat.com>

Merge pull request #62092 from batrick/wip-70155-reef

reef: qa: ignore variant of down fs

Reviewed-by: Jos Collin <jcollin@redhat.com>

Merge pull request #63017 from batrick/wip-71092-reef

reef: qa/workunits/fs/misc: remove data pool cleanup

Reviewed-by: Jos Collin <jcollin@redhat.com>

cephfs-journal-tool:: Don't reset the journal trim position

If the fs had to go through journal recovery and reset,
the cephfs-journal-tool resets the journal trim position
because of which the old unused journal objects just stay
forever in the metadata pool. The patch fixes the issue.
Now, the old stale journal objects are trimmed during the
regular trimming cycle helping to recover space in the
metadata pool.

Fixes: https://tracker.ceph.com/issues/69708
Signed-off-by: Kotresh HR <khiremat@redhat.com>
(cherry picked from commit 4f9a926a467c03a410e5ec5a81031e72f2193f25)

qa: Validate cephfs-journal-tool reset trim

Validates that the cephfs-journal-tool reset
doesn't reset the trim position so that the
journal trim takes care of trimming the older
unused journal objects helping to recover the
space in metadata pool.

Fixes: https://tracker.ceph.com/issues/69708
Signed-off-by: Kotresh HR <khiremat@redhat.com>
(cherry picked from commit 7e85556318ae2707730ed0d5f2ef9a1d817ec6e0)

Merge pull request #65595 from aaSharma14/wip-73134-reef

reef: Handle failures in metric parsing

Reviewed-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>

Handle failures in metric parsing

fixes: https://bugzilla.redhat.com/show_bug.cgi?id=2345460
Signed-off-by: Anmol Babu <anmolbabu@Anmols-MacBook-Pro.local>
(cherry picked from commit f29e3f307c46401328e920204cbe893fbd837c65)

Conflicts:
src/exporter/DaemonMetricCollector.cc

Merge pull request #61978 from batrick/wip-70066-reef

reef: mds: dump next_snap when checking dentry corruption

Merge pull request #62278 from dparmar18/wip-70034-reef

reef: mgr/nfs: validate path when modifying cephfs export

Merge pull request #62409 from neesingh-rh/wip-70418-reef

reef: cephfs-shell: add option to remove xattr

Merge pull request #65251 from joscollin/wip-70031-reef

reef: qa: enable debug mds/client for fs/nfs suite

Merge pull request #65253 from joscollin/wip-71379-reef

reef: cephfs: session tracker accounts for killing sessions

qa/workunits/mon: ensure election strategy is "connectivity" for stretch mode

The election strategy is randomly chosen for this type of test. Sometimes,
the test passes if the "connectivity" election strategy happens to be picked.
But if a different strategy, i.e. "classic", is picked, then the test will fail.

We can ensure that the election strategy is "connectivity" by setting it in the
workunit with the ceph CLI command. Although connectivity was specified in
stretch-mode-5-mons-8-osds.yaml, that config ultimately gets overridden by
the "qa/mon_config" yaml.

Fixes: https://tracker.ceph.com/issues/69107
Signed-off-by: Laura Flores <lflores@ibm.com>
(cherry picked from commit de2d9186bddbd452d2e7939723418c200e3fec46)

src/mon/MonMap: modify dump function

Problem:
Current dump for "removed_ranks" and "disallowed_leaders"
doesn't have the correct format so the python test
script can parse through these values.

Solution:
Modified the values such that it is in the correct format

Conflict: src/osd/osd_types.cc: Added f->dump_bool("is_stretch_pool", is_stretch_pool());

Signed-off-by: Kamoltat Sirivadhna <ksirivad@redhat.com>
(cherry picked from commit a7f3b7b749acabd235d615a3f5b80e3398a6d80d)

qa: Added tests for disabling stretch mode

Test disabling stretch mode with the following scenario:

1. Healthy Stretch Mode
2. Degraded Stretch Mode

Fixes: https://tracker.ceph.com/issues/67467
Signed-off-by: Kamoltat Sirivadhna <ksirivad@redhat.com>
(cherry picked from commit 4d2f8879bed2abd10c00e5a1c5008bd56c11bf61)

doc/rados/operations/stretch-mode.rst: Added Exitting Stretch Mode

Added documentation about exiting stretch mode.

Fixes: https://tracker.ceph.com/issues/67467
Signed-off-by: Kamoltat Sirivadhna <ksirivad@redhat.com>
(cherry picked from commit 0680f17d7bab386429a013c254dd90c70fbabeb7)

mon [stretch mode]: support disable_stretch_mode

Problem:

Currently, Ceph lacks the ability
to exit stretch mode and move back
to normal cluster (non-stretched).

Solution:

Provide a command to allow
the user to exit stretch mode gracefully:

`ceph mon disable_stretch_mode <crush_rule> --yes-i-really-mean-it`

User can either specify a crush rule that
they want all pools to move to or not specify
a rule and Ceph will use a default replicated crush rule.

Fixes: https://tracker.ceph.com/issues/67467
Signed-off-by: Kamoltat Sirivadhna <ksirivad@redhat.com>
(cherry picked from commit 78ce68de41b1d5278e14cf56dff7f15394969255)

Conflicts:
src/mon/MonmapMonitor.cc - replace `goto reply` with
`goto reply_no_propose`
src/mon/OSDMonitorcc - replace `rule_valid_for_pool_type`
with `get_rule_type` since
`rule_valid_for_pool_type` is not
backported.

Merge pull request #65473 from rhcs-dashboard/wip-72963-reef

reef: monitoring: add user-agent headers to the urllib

monitoring: add user-agent headers to the urllib

The documentation started raising 403 suddenly. Adding User-Agent
headers to the request

Signed-off-by: Nizamudeen A <nia@redhat.com>
(cherry picked from commit b8fe487010483681bbc8ddb8dfe18b40ebfd346b)

Merge pull request #65002 from aaSharma14/wip-68481-reef

reef: mgr/dashboard: show non default realm sync status in rgw overview page

Reviewed-by: Afreen Misbah <afreen@ibm.com>

mgr/volumes: Fix json.loads for test on mon caps

Signed-off-by: Enrico Bocchi <enrico.bocchi@cern.ch>
(cherry picked from commit b008ef9eb690618608f902c67f8df1fb8a587e33)

mgr/volumes: Add test for mon caps if auth key has remaining mds/osd caps

Signed-off-by: Enrico Bocchi <enrico.bocchi@cern.ch>
(cherry picked from commit 403d5411364e2fddd70d98a6f120b26e416c1d99)

mgr/volumes: Keep mon caps if auth key has remaining mds/osd caps

Signed-off-by: Enrico Bocchi <enrico.bocchi@cern.ch>
(cherry picked from commit 0882bbe8a4470f82993d87b7c02b19aa7fe7fbcc)

Merge pull request #65418 from ceph/fix-api-tests-reef

reef: pybind/mgr/dashboard: Use teuthology's actual requirements

pybind/mgr/dashboard: Use teuthology's actual requirements

Signed-off-by: David Galloway <david.galloway@ibm.com>
(cherry picked from commit 22a87d959bca74478de1e2d9f86859676385491d)

Merge pull request #65380 from zdover23/wip-doc-2025-09-04-backport-65325-to-reef

reef: doc/cephfs: edit troubleshooting.rst

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

doc/cephfs: edit troubleshooting.rst

Update the "Disconnected+Remounted FS" section in
doc/cephfs/troubleshooting.rst, as suggested by Venky Shankar in https://github.com/ceph/ceph/pull/65129/files#r2312903062

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit f4b40422fefaa993441396a5c31fbfd3d8714595)

Merge pull request #65250 from ceph/reef-pipeline-backports

reef: Recent pipeline backports

Merge pull request #65094 from zdover23/wip-doc-2025-08-18-backport-64931-to-reef

reef: doc/cephfs: edit troubleshooting.rst

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

Merge pull request #65207 from zdover23/wip-doc-2025-08-26-backport-64074-to-reef

reef: doc/rados/configuration: Mention show-with-defaults and ceph-conf

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

Merge pull request #65212 from zdover23/wip-doc-2025-08-26-backport-65180-to-reef

reef: doc/dev:update blkin.rst doc for lttng trace

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

Merge pull request #65239 from zdover23/wip-doc-2025-08-26-backport-65230-to-reef

reef: doc/rados/operations: Improve health-checks.rst

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

Merge pull request #64843 from NitzanMordhai/wip-72419-reef

reef: monitor: Enhance historic ops command output and error handling

Merge pull request #63134 from kshtsk/wip-71215-reef

reef: tasks/cephfs/mount: use 192.168.144.0.0/20 for brxnet

mgr/dashboard: show non default realm sync status in rgw overview page

Currently, we just show the sync status of the default realm in rgw
overview page. This PR is to show the sync status of non-default realms
as well. Multisite sync status can be viewed for any of the active daemon
which runs in default/non-default realm.

Fixes: https://tracker.ceph.com/issues/68329
Signed-off-by: Aashish Sharma <aasharma@redhat.com>
(cherry picked from commit ea53aceb8d72187f7f8629aa6d3b66c7cca88a86)

Conflicts:
src/pybind/mgr/dashboard/frontend/src/app/ceph/rgw/rgw-overview-dashboard/rgw-overview-dashboard.component.ts
src/pybind/mgr/dashboard/openapi.yaml

Merge pull request #65201 from zdover23/wip-doc-2025-08-25-backport-65185-to-reef

reef: doc/cephfs: edit troubleshooting.rst (Slow MDS)

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

Merge pull request #65184 from zdover23/wip-doc-2025-08-22-backport-64726-to-reef

reef: doc/man/8: Improve mount.ceph.rst

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

Merge pull request #65138 from zdover23/wip-doc-2025-08-20-backport-65128-to-reef

reef: doc/rados: repair short underline

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

Merge pull request #65091 from zdover23/wip-doc-2025-08-18-backport-64928-to-reef

reef: doc/cephfs: edit troubleshooting.rst

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

cephfs: session tracker accounts for killing sessions

Fixes: https://tracker.ceph.com/issues/70770
Signed-off-by: Abhishek Lekshmanan <abhishek.lekshmanan@cern.ch>
(cherry picked from commit bb52cb72abe9540affce713fa3078e9b0f59d65a)

qa: enable debug mds/client for fs/nfs suite

Fixes: http://tracker.ceph.com/issues/63482
Signed-off-by: Venky Shankar <vshankar@redhat.com>
(cherry picked from commit d5834c9af6013a23324a808570b22c819988ce03)

Remove git clean -fdx

either
1) a source tarball is supplied, in which case the local dir is
irrelevant, or
2) make-debs calls make-dist, which doesn't care about a dirty cwd

so it just punishes the unaware by removing things that they may
have wanted to keep.

Signed-off-by: Dan Mick <dan.mick@redhat.com>
(cherry picked from commit ad529cce49c466daa02bb3b90804ff6a6ec548e8)

make-debs.sh: invoke tar with --no-same-owner

When running as a normal user, tar does not attempt to preserve
owners set on the tar content files. When running as root, it does.
Containerized builds are running as root. Stop make-debs.sh from
trying to set other owners for files, and leaving files in the
host system with mapped UIDs other than the user running the container
(which causes jenkins to be unable to clear the workspace).

Signed-off-by: Dan Mick <dan.mick@redhat.com>
(cherry picked from commit 46c540444dd0dc4b4572e71ef452436a3b580d51)

make-debs.sh: make "skip debug packages" conditional

Now that we're using make-debs.sh as a builder inside containers,
the default should be to build all the packages, including debug.
(Also, fix a typo.)

Signed-off-by: Dan Mick <dan.mick@redhat.com>
(cherry picked from commit 86d6e931bd10bc15252d76aa58e4835a72742fcd)

doc/rados/operations: Improve health-checks.rst

Signed-off-by: Anthony D'Atri <anthonyeleven@users.noreply.github.com>
(cherry picked from commit ba5cb7b8d63040730934a06d13baf2968952e813)

doc/dev:update blkin.rst doc for lttng trace
fixes:https://tracker.ceph.com/issues/72059

Signed-off-by: lizhipeng <qiuxinyidian@gmail.com>
(cherry picked from commit 3029cc9afdee352fb22db0895c5d3ec4a35277d3)

doc/rados/configuration: Mention show-with-defaults and ceph-conf

A small improvement based on
"Why is it still so difficult to just dump all config and where it comes from?"
https://lists.ceph.io/hyperkitty/list/ceph-users@ceph.io/thread/EZSLRYBYEWDA6YIARQVMUKQUWHAE3PGR/

`show-with-defaults` is very useful, and `ceph-conf` is mentioned
so that it's clear that it's legacy, and the user doesn't have to
wonder if it's actually useful but was forgotten in the list.

Signed-off-by: Niklas Hambüchen <mail@nh2.me>
(cherry picked from commit 978ab834c464b993ec77c914cb36da47211a1cd4)

doc/cephfs: edit troubleshooting.rst (Slow MDS)

Move the "Slow requests (MDS)" section immediately after the first
section in this document ("Slow/Stuck Operations"), because the first
procedure on the page directs the reader to undertake the operation in
"Slow requests (MDS)" before trying anything else.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 55af6643c9a119afc4e22e2591774e1d68ef5580)

Merge pull request #65188 from phlogistonjohn/jjm-reef-bwc

reef: backport build-with-container patches from main

build-with-container: improve source rpm detection

Improve source rpm detection by adding a new detection method that
executes and rpm command in a container to get exactly the version of
the source rpm that the ceph.spec file would have generated. For
backwards compatibility and that I don't entirely trust myself to have
tested this the old methods are still available.

The old `--rpm-no-match-sha` is now an alias for `--srpm-match=any` to
cause it to build any (unique) ceph srpm it finds.
`--srpm-match=versionglob` retains the previous default behavior of
using a glob matching on the git id or ceph version value. The new
default of `--srpm-match=auto` implements the rpm command based behavior
described above.

All of this is wrapped in a new step `find-rpm` but that's mostly an
implementation detail and for testing.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
(cherry picked from commit 916088a4e7380cd7ac1403fb4416ef91ab07aa52)

make-srpm.sh: don't shell out redundantly to pwd

Just something that annoyed me while reading the script.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
(cherry picked from commit 544d8ab5eb81fc5b8b950c2c1c116fad6b1a40c4)

pybind/mgr/dashboard/frontend: add NPM_CACHEDIR envvar, use in bwc

Add an optional NPM_CACHEDIR environment variable to serve as the
cache parameter for npm in the dashboard frontend build. The idea
is to allow it to persist across builds so that we decrease the load
on registry.npmjs.org, which has been throttling our requests when
using build-with-container.py, and also hopefully improve the time
of the frontend npm operations.

build-with-container.py also grows a --npm-cache-path option to allow
setting it for container builds and passing the envvar to the build.

Fixes: https://tracker.ceph.com/issues/72298
Signed-off-by: Dan Mick <dan.mick@redhat.com>
(cherry picked from commit ad7e6117a9e99061a3ad7e03709dd31e34832966)

dashboard: fix the workaround for unpacking node sources

My previous workaround in the dashboard for the unpacking of non-root
own tarball as the fake root of a container did not work because of the
strange quoting/escaping behavior of cmake (it tried to run `id -u` as a
single command, not a command and an argument).
Use single quoted string and old school backticks to work around this issue.

Fixes: 24dbfb5da4813c6588f9cd199b9f527bb67f1e88
Signed-off-by: John Mulligan <jmulligan@redhat.com>
(cherry picked from commit 3a36180a373d91adcf9726660204f0cc1dcecba3)

dashboard: ensure nodeenv downloaded content is owned by current user

When testing ceph builds in a container we discovered that certain files
could not be deleted by jenkins after a build. This was due to the way
the container maps IDs - files owned by the root user in the container
become owned by the "real" user/jenkins user on the "host".
However, the node tarball that is fetched and unpacked by nodeenv has
a different owner name/uid that is preserved in the tree and this id
gets mapped to something that can be managed by the "fake root" of the
container but not by the "regular" user outside the container.

The simplest workaround I can think of is to chown the tree back
to the current user and avoid leaving files on disk with uncleanly
mapped uids.

Conflicts:
src/pybind/mgr/dashboard/frontend/CMakeLists.txt

Signed-off-by: John Mulligan <jmulligan@redhat.com>
(cherry picked from commit 24dbfb5da4813c6588f9cd199b9f527bb67f1e88)

Dockerfile.build: make WITH_CRIMSON a build arg

We've chosen to enable crimson by default to match the CI, but that
is not always something a developer may want, so make WITH_CRIMSON
a build argument that can be toggled off if necessary.

Conflicts:
Dockerfile.build

Signed-off-by: John Mulligan <jmulligan@redhat.com>
(cherry picked from commit 9cbadc2e6009cbbf3bc757f801d3508da6fe498d)

mgr/dashboard: add a cobertura xml file workaround variable

Add an environment variable REWRITE_COVERAGE_ROOTDIR that
changes the "hardcoded" path in the cobertura-coverage.xml file.
This can be used to map the paths used in a container build to
the paths known to a jenkins job (or whatever else you want to
do with the file).

Signed-off-by: John Mulligan <jmulligan@redhat.com>
(cherry picked from commit f80dad2e0adfb5bca33fab292728f5c085ca3a78)

Dockerfile.build: Install sccache

Note that it is not used automatically.

Signed-off-by: Zack Cerza <zack@cerza.org>
(cherry picked from commit a5a983c7363ab185cdc48ecd392f8cff6ce087bd)

make-srpm.sh: Skip make-dist if tarball is present

Signed-off-by: Zack Cerza <zack@cerza.org>
(cherry picked from commit a1f607bd1be122b9491555b06f2ce3272d9c331b)

Merge pull request #65126 from zdover23/wip-doc-2025-08-19-backport-64929-to-reef

reef: doc/cephfs: edit troubleshooting.rst

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

Merge pull request #65123 from zdover23/wip-doc-2025-08-19-backport-65021-to-reef

reef: doc/cephfs: edit troubleshooting.rst

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

Merge pull request #65097 from zdover23/wip-doc-2025-08-18-backport-64940-to-reef

reef: doc/cephfs: edit troubleshooting.rst

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

doc/man/8: Improve mount.ceph.rst

Signed-off-by: Anthony D'Atri <anthonyeleven@users.noreply.github.com>
(cherry picked from commit 230d91c2e28f3df27dd5007b937b477922cb7655)

Merge pull request #65066 from ceph/reef-nogitclean

reef: build backports

doc/rados: repair short underline

Repair a too-short underline that was introduced in
https://github.com/ceph/ceph/commit/babd3c00341c59cb82e66f93f41c687ab4fc6928.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit ff93fcd28046e65a54368308f60308e84f93aa38)

Merge pull request #61230 from vshankar/wip-67478-reef

reef: mgr/volumes: periodically check for async work

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Rishabh Dave <ridave@redhat.com>