git.apps.os.sepia.ceph.com Git

]> git.apps.os.sepia.ceph.com Git - ceph-ci.git/log

Jos Collin [Thu, 16 Oct 2025 05:53:02 +0000 (11:23 +0530)]

Merge PR #65821 into wip-jcollin-testing-20251016.055245-squid

* refs/pull/65821/head:
client: Fix a deadlock when osd is full

commit | commitdiff | tree

Jos Collin [Thu, 16 Oct 2025 05:52:58 +0000 (11:22 +0530)]

Merge PR #65822 into wip-jcollin-testing-20251016.055245-squid

* refs/pull/65822/head:
mds/FSMap: fix join_fscid being incorrectly reset for active MDS during filesystem removal

commit | commitdiff | tree

Jos Collin [Thu, 16 Oct 2025 05:52:55 +0000 (11:22 +0530)]

Merge PR #65823 into wip-jcollin-testing-20251016.055245-squid

* refs/pull/65823/head:
mds: fix rank 0 marked damaged if stopping fails after Elid flush and log trimmed

commit | commitdiff | tree

Jos Collin [Thu, 16 Oct 2025 05:52:51 +0000 (11:22 +0530)]

Merge PR #65824 into wip-jcollin-testing-20251016.055245-squid

* refs/pull/65824/head:
mds: fix test that directory has no snaps
qa: test for child dir with first beyond parent snaps
qa: remove extraneous directory from test
qa: correct test description

commit | commitdiff | tree

Zack Cerza [Wed, 15 Oct 2025 19:40:10 +0000 (13:40 -0600)]

Merge pull request #65945 from phlogistonjohn/jjm-bwc-variants-s

squid: build-with-container: build image variants

commit | commitdiff | tree

afreen23 [Wed, 15 Oct 2025 09:24:35 +0000 (14:54 +0530)]

Merge pull request #65928 from rhcs-dashboard/wip-73509-squid

squid: mgr/dashboard : Fixed usage bar for secondary site in rbd mirroing

Reviewed-by: Afreen Misbah <afreen@ibm.com>

commit | commitdiff | tree

John Mulligan [Mon, 13 Oct 2025 20:24:11 +0000 (16:24 -0400)]

script/build-with-container: add build image variants

Allow the user to control the content of the build image with a
high-level `--image-variant=` switch. Currently the supported values are
`default` (the same maximal image we have been generating) and
`packages` a slimmer image that avoids installing certain test-only
dependencies.

Signed-off-by: John Mulligan <jmulligan@redhat.com>

commit | commitdiff | tree

John Mulligan [Mon, 13 Oct 2025 20:23:58 +0000 (16:23 -0400)]

Dockerfile.build: make FOR_MAKE_CHECK a build argument

Set it only during install time.

Signed-off-by: John Mulligan <jmulligan@redhat.com>

commit | commitdiff | tree

John Mulligan [Mon, 13 Oct 2025 20:23:10 +0000 (16:23 -0400)]

install-deps.sh: let FOR_MAKE_CHECK variable take precedence

Previously, the FOR_MAKE_CHECK variable could only enable installing
extra (test) dependencies when install-deps.sh was used and it was
ignored if `tty -s` exited true. This change allows FOR_MAKE_CHECK to
take precedence over the tty check and to specify one of true, 1, yes to
enable extra "for make check" deps or false, 0, no to explicitly disable
the extra deps.

Based-on-work-by: Dan Mick <dan.mick@redhat.com>
Signed-off-by: John Mulligan <jmulligan@redhat.com>

commit | commitdiff | tree

Rishabh Dave [Tue, 14 Oct 2025 13:37:50 +0000 (19:07 +0530)]

Merge pull request #60839 from vshankar/wip-68922-squid

squid: qa/cephfs: randomize configs in `fs:thrash:workloads`

Reviewed-by: Rishabh Dave <ridave@redhat.com>

commit | commitdiff | tree

Rishabh Dave [Tue, 14 Oct 2025 13:37:36 +0000 (19:07 +0530)]

Merge pull request #61301 from batrick/wip-68722-squid

squid: qa/cephfs: override testing kernel with -k option

Reviewed-by: Rishabh Dave <ridave@redhat.com>

commit | commitdiff | tree

Rishabh Dave [Tue, 14 Oct 2025 13:37:26 +0000 (19:07 +0530)]

Merge pull request #61303 from batrick/wip-68450-squid

squid: qa: ignore pg availability/degraded warnings

Reviewed-by: Rishabh Dave <ridave@redhat.com>

commit | commitdiff | tree

Rishabh Dave [Tue, 14 Oct 2025 13:37:11 +0000 (19:07 +0530)]

Merge pull request #61304 from batrick/wip-68244-squid

squid: qa: correct daemon for warning conf

Reviewed-by: Rishabh Dave <ridave@redhat.com>

commit | commitdiff | tree

Rishabh Dave [Tue, 14 Oct 2025 13:37:06 +0000 (19:07 +0530)]

Merge pull request #62091 from batrick/wip-70156-squid

squid: qa: ignore variant of down fs

Reviewed-by: Rishabh Dave <ridave@redhat.com>

commit | commitdiff | tree

Abhishek Desai [Thu, 9 Oct 2025 07:49:34 +0000 (13:19 +0530)]

mgr/dashboard : Fixed usage bar for secondary site in rbd mirroing
fixes : https://tracker.ceph.com/issues/73447
Signed-off-by: Abhishek Desai <abhishek.desai1@ibm.com>
(cherry picked from commit 60140b1ccc8006325632320e39fc209724524aef)

Conflicts:
src/pybind/mgr/dashboard/frontend/src/app/ceph/block/rbd-list/rbd-list.component.html

commit | commitdiff | tree

Christopher Hoffman [Fri, 10 Oct 2025 18:17:25 +0000 (14:17 -0400)]

Merge pull request #62918 from rishabh-d-dave/wip-71018-squid

squid: mgr/vol: add command to get snapshot path

commit | commitdiff | tree

Christopher Hoffman [Fri, 10 Oct 2025 18:13:25 +0000 (14:13 -0400)]

Merge pull request #63222 from rishabh-d-dave/wip-71276-squid

squid: mgr/vol: make "snapshot getpath" cmd work with v1 and legacy

commit | commitdiff | tree

Christopher Hoffman [Fri, 10 Oct 2025 18:09:53 +0000 (14:09 -0400)]

Merge pull request #64205 from rishabh-d-dave/wip-71854-squid

squid: mgr/vol: include group name in subvolume's pool namespace name

commit | commitdiff | tree

Dan Mick [Thu, 9 Oct 2025 16:39:36 +0000 (09:39 -0700)]

Merge pull request #65838 from phlogistonjohn/jjm-rmc-backport-squid

squid: run-make-check.sh: handle sudo and command that may not run in container

commit | commitdiff | tree

Patrick Donnelly [Thu, 20 Feb 2025 18:59:03 +0000 (13:59 -0500)]

qa: ignore variant of down fs

Fixes: https://tracker.ceph.com/issues/70107
Signed-off-by: Patrick Donnelly <pdonnell@ibm.com>
(cherry picked from commit 1c0359dcf00309049d1b2081c64ea8bade5dafa1)

Conflicts:
qa/cephfs/overrides/ignorelist_health.yaml: trivial

commit | commitdiff | tree

NitzanMordhai [Thu, 9 Oct 2025 05:51:42 +0000 (08:51 +0300)]

Merge pull request #65444 from NitzanMordhai/wip-72919-squid

squid: suites/rados/cephadm: typo in ignore list for still running message

commit | commitdiff | tree

David Galloway [Thu, 9 Oct 2025 00:22:56 +0000 (20:22 -0400)]

Merge pull request #65844 from phlogistonjohn/jjm-bwc-backports-s

squid: sync build-with-container patches from main

commit | commitdiff | tree

John Mulligan [Wed, 8 Oct 2025 20:41:36 +0000 (16:41 -0400)]

script/build-with-container: improve error handling for invalid distros

Instead of throwing a long obnoxious traceback at the user if the value
supplied to -d/--distro is invalid do something nicer. For example:
```
$ ./src/script/build-with-container.py -d trixy -e build
usage: build-with-container.py [-h] [--help-build-steps]
build-with-container.py: error: argument --distro/-d: unknown distro: 'trixy' not in centos10, centos10stream, centos8, centos9, centos9stream, rocky9, rockylinux9, rocky10, rockylinux10, fedora41, fc41, fedora42, fc42, fedora43, fc43, ubuntu20.04, ubuntu-focal, focal, ubuntu22.04, ubuntu-jammy, jammy, ubuntu24.04, ubuntu-noble, noble, debian12, debian-bookworm, bookworm, debian13, debian-trixie, trixie

```

Signed-off-by: John Mulligan <jmulligan@redhat.com>
(cherry picked from commit 72f3ad9549e84bdba7bdfd97d2ede3c55e02f103)

commit | commitdiff | tree

John Mulligan [Wed, 8 Oct 2025 21:04:27 +0000 (17:04 -0400)]

script/build-with-container: add debian 13 (trixie)

Signed-off-by: John Mulligan <jmulligan@redhat.com>
(cherry picked from commit a13fa091dd6bad35c44076cb7c46cb7bcc17a7ac)

commit | commitdiff | tree

John Mulligan [Wed, 8 Oct 2025 17:30:18 +0000 (13:30 -0400)]

script/build-with-container: add ubuntu 20.04 (focal)

Add ubuntu 20.04 (focal) to the available list of distro kinds.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
(cherry picked from commit 7c40f7bd07ac935d0657b9284118da8590a5cf0d)

commit | commitdiff | tree

John Mulligan [Wed, 8 Oct 2025 14:28:30 +0000 (10:28 -0400)]

script/build-with-container: add a pair of fedora distro versions

Add fedora 42 and the soon-to-be-released fedora 43.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
(cherry picked from commit 76fe5ad298ee5626eeb63591a702e8f8cc9be7d0)

commit | commitdiff | tree

John Mulligan [Wed, 8 Oct 2025 14:26:58 +0000 (10:26 -0400)]

script/build-with-container: lightly organize the distro kind aliases

Do a tiny reorg of the distro kind aliases and container images to keep
the EL distros together and comment out each "section".

Signed-off-by: John Mulligan <jmulligan@redhat.com>
(cherry picked from commit 4430a5ad6be6f26309d5f5bea0e448a4bbd432e1)

commit | commitdiff | tree

John Mulligan [Wed, 8 Oct 2025 14:23:25 +0000 (10:23 -0400)]

script/build-with-container: be consistent with naming in distro kinds

Update the DistroKind enum and related items so that the naming is
applied consistently. That is: the canonical (no pun indented) form
of the name is "<name><version>" and codenames, such as "jammy" or
"bookworm" are aliases. This matches the previously existing code.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
(cherry picked from commit ac11a80a63ab1909fbdf682d830acde96856f502)

commit | commitdiff | tree

Dan Mick [Tue, 7 Oct 2025 01:48:42 +0000 (18:48 -0700)]

src/script: add bookworm to build-with-container.py

..and its friend buildcontainer-setup.sh

Signed-off-by: Dan Mick <dan.mick@redhat.com>
(cherry picked from commit 34b497c2f3652e7d30c7b7476b711fd9f1f4ecac)

commit | commitdiff | tree

John Mulligan [Thu, 28 Aug 2025 23:39:06 +0000 (19:39 -0400)]

build-with-container: ensure npm dir is set up before configure

When the npm cache path option is passed the npm cache dir is passed
to all container `run` commands, ensure the dir has been created
before the first container command (configure) is used.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
(cherry picked from commit 79166af192ea0b4b982b56ce521516d5a29e7a0d)

commit | commitdiff | tree

John Mulligan [Tue, 18 Mar 2025 23:26:59 +0000 (19:26 -0400)]

run-make-check.sh: handle sudo and command that may not run in container

Work around a known failure that sudo is not expected to be present in
container images. Prepare to handle a failure to set a sysctl param.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
(cherry picked from commit 9f44155dff195015186315968a0a1e8ce925ed5d)

commit | commitdiff | tree

John Mulligan [Sat, 15 Mar 2025 16:44:00 +0000 (12:44 -0400)]

install-deps: extract SUDO variable logic into a reusable function

While the function is pretty simple and could be copy-pasted I
prefer to extract things into functions to indicate that the
logic is used/repeated elsewhere to ward off making changes to
one copy vs the other.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
(cherry picked from commit bbd7933598e11d84758a6f09fd176f47c744aaa2)

commit | commitdiff | tree

Patrick Donnelly [Tue, 27 May 2025 14:20:22 +0000 (10:20 -0400)]

mds: fix test that directory has no snaps

To look if the directory's first is beyond the last snap. This matches the behavior of lssnaps.

Signed-off-by: Patrick Donnelly <pdonnell@ibm.com>
Fixes: https://tracker.ceph.com/issues/71462
(cherry picked from commit c22db4e683cf2e6b0decc937e9ab92ba15d46487)

commit | commitdiff | tree

Patrick Donnelly [Tue, 27 May 2025 14:03:12 +0000 (10:03 -0400)]

qa: test for child dir with first beyond parent snaps

If the parent directory has snapshots but the child was created after, then we
should be able to modify its charmap.

Signed-off-by: Patrick Donnelly <pdonnell@ibm.com>
Fixes: https://tracker.ceph.com/issues/71462
(cherry picked from commit 659e4262d042dc50a381846c25640c76a06bdec2)

commit | commitdiff | tree

Patrick Donnelly [Tue, 27 May 2025 14:39:25 +0000 (10:39 -0400)]

qa: remove extraneous directory from test

Signed-off-by: Patrick Donnelly <pdonnell@ibm.com>
Fixes: https://tracker.ceph.com/issues/71462
(cherry picked from commit 7678dbfd8830141ece420fde66bbb1687c616206)

commit | commitdiff | tree

Patrick Donnelly [Tue, 27 May 2025 14:38:37 +0000 (10:38 -0400)]

qa: correct test description

This test is checking for failure conditions.

Signed-off-by: Patrick Donnelly <pdonnell@ibm.com>
Fixes: https://tracker.ceph.com/issues/71462
(cherry picked from commit c428149b9cb12c9e9b90d305131b669211a56b4b)

commit | commitdiff | tree

ethanwu [Thu, 11 Sep 2025 07:40:09 +0000 (15:40 +0800)]

mds: fix rank 0 marked damaged if stopping fails after Elid flush and log trimmed

steps to reproduce
../src/vstart.sh --debug --new -x --localhost --bluestore
./bin/ceph tell mds.<rank 0> config set mds_kill_shutdown_at 10
./bin/ceph fs set <fs name> down true

wait for a few seconds and will see the following log from take-over mds
and rank 0 is marked damaged
2025-09-11T16:47:24.591+0800 785dabeaa6c0 -1 log_channel(cluster) log [ERR] : No subtrees found for root MDS rank!
2025-09-11T16:47:24.591+0800 785dabeaa6c0 5 mds.beacon.b set_want_state: up:rejoin -> down:damaged

During shutdown_pass after submitting Elid and trimming mdlog, mds log
will now have only ELid event which does nothing at replay.
After replay, no subtree is found.

Fix this by checking whther MDLog contains only one event.
If so, skip the subtree check for rank 0, and allow it to request
STATE_STOPPED just like the other ranks.

Fixes: https://tracker.ceph.com/issues/72983
Signed-off-by: ethanwu <ethanwu@synology.com>
(cherry picked from commit adb448b4f4e421f75275874f5a67c3a2ceb0214c)

commit | commitdiff | tree

ethanwu [Tue, 23 Sep 2025 01:45:36 +0000 (09:45 +0800)]

mds/FSMap: fix join_fscid being incorrectly reset for active MDS during filesystem removal

Fix bug where active MDS daemons in remaining filesystems incorrectly
have their join_fscid cleared to FS_CLUSTER_ID_NONE when any other
filesystem is removed.

The issue was caused by variable name shadowing in erase_filesystem()
where the loop variable 'fscid' shadowed the function parameter 'fscid':
Inside loop: if (info.join_fscid == fscid) compared against the
loop variable (remaining FS ID) instead of parameter (removed FS ID)

Renamed the loop variable to 'remaining_fscid' to eliminate the shadowing
and ensure the comparison uses the correct filesystem ID.

Reproducer:
../src/vstart.sh --new -x --localhost --bluestore
FS=b
./bin/ceph osd pool create cephfs.${FS}.meta 64 64 replicated
./bin/ceph osd pool create cephfs.${FS}.data 64 64 replicated
./bin/ceph fs new ${FS} cephfs.${FS}.meta cephfs.${FS}.data
./bin/ceph config set mds.a mds_join_fs a
./bin/ceph config set mds.b mds_join_fs a
./bin/ceph fs fail ${FS}
./bin/ceph fs rm ${FS} --yes-i-really-mean-it

Then from ./bin/ceph fs dump
We can see join_fscid in all active mds filesystem 'a' are reset.
Since there are standby mds with join_fscid=1
MDSMonitor think they have better affinity and trigger switch over.

Fixes: https://tracker.ceph.com/issues/73183
Signed-off-by: ethanwu <ethanwu@synology.com>
(cherry picked from commit cfecf7c867d20d7d05ab3f341844c7c2b9b733d0)

commit | commitdiff | tree

Kotresh HR [Wed, 23 Oct 2024 19:00:41 +0000 (00:30 +0530)]

client: Fix a deadlock when osd is full

Problem:
When osd is full, the client receives the notification
and cancels the ongoing writes. If the ongoing writes
are async, it could cause a dead lock as the async
callback registered also takes the 'client_lock' which
the handle_osd_map takes at the beginning.

The op_cancel_writes calls the callback registered for
the async write synchronously holding the 'client_lock'
causing the deadlock.

Earlier approach:
  It was tried to solve this issue by calling 'op_cancel_writes'
without holding 'client_lock'. But this failed lock dependency
between objecter's 'rwlock' and async write's callback taking
'client_lock'. The 'client_lock' should always be taken before
taking 'rwlock'. So this approach is dropped against the current
approach.

Solution:
Use C_OnFinisher for objecter async write callback i.e., wrap
the async write's callback using the Finisher. This queues the
callback to the Finisher's context queue which the finisher
thread picks up and executes thus avoiding the deadlock.

Testing:
The fix is tested in the vstart cluster with the following reproducer.
1. Mount the cephfs volume using nfs-ganesha at /mnt
2. Run fio on /mnt on one terminal
3. On the other terminal, blocklist the nfs client session
4. The fio would hang

It is reproducing in the vstart cluster most of the times. I think
that's because it's slow. The same test written for teuthology is
not reproducing the issue. The test expects one or more writes
to be on going in rados when the client is blocklisted for the deadlock
to be hit.

Stripped down version of Traceback:
----------
0  0x00007f4d77274960 in __lll_lock_wait ()
1  0x00007f4d7727aff2 in pthread_mutex_lock@@GLIBC_2.2.5 ()
2  0x00007f4d7491b0a1 in __gthread_mutex_lock (__mutex=0x7f4d200f99b0)
3  std::mutex::lock (this=<optimized out>)
4  std::scoped_lock<std::mutex>::scoped_lock (__m=..., this=<optimized out>, this=<optimized out>, __m=...)
5  Client::C_Lock_Client_Finisher::finish (this=0x7f4ca0103550, r=-28)
6  0x00007f4d74888dfd in Context::complete (this=0x7f4ca0103550, r=<optimized out>)
7  0x00007f4d7498850c in std::__do_visit<...>(...) (__visitor=...)
8  std::visit<Objecter::Op::complete(...) (__visitor=...)
9  Objecter::Op::complete(...) (e=..., e=..., r=-28, ec=..., f=...)
10 Objecter::Op::complete (e=..., r=-28, ec=..., this=0x7f4ca022c7f0)
11 Objecter::op_cancel (this=0x7f4d200fab20, s=<optimized out>, tid=<optimized out>, r=-28)
12 0x00007f4d7498ea12 in Objecter::op_cancel_writes (this=0x7f4d200fab20, r=-28, pool=103)
13 0x00007f4d748e1c8e in Client::_handle_full_flag (this=0x7f4d200f9830, pool=103)
14 0x00007f4d748ed20c in Client::handle_osd_map (m=..., this=0x7f4d200f9830)
15 Client::ms_dispatch2 (this=0x7f4d200f9830, m=...)
16 0x00007f4d75b8add2 in Messenger::ms_deliver_dispatch (m=..., this=0x7f4d200ed3e0)
17 DispatchQueue::entry (this=0x7f4d200ed6f0)
18 0x00007f4d75c27fa1 in DispatchQueue::DispatchThread::entry (this=<optimized out>)
19 0x00007f4d77277c02 in start_thread ()
20 0x00007f4d772fcc40 in clone3 ()
--------

Fixes: https://tracker.ceph.com/issues/68641
Signed-off-by: Kotresh HR <khiremat@redhat.com>
(cherry picked from commit 60c58013c53d0f280b8f96b7caf9c255b54640fb)

commit | commitdiff | tree

Yuri Weinstein [Mon, 6 Oct 2025 14:55:29 +0000 (07:55 -0700)]

Merge pull request #65462 from pdvian/wip-72853-squid

squid: mgr/DaemonState: Minimise time we hold the DaemonStateIndex lock

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Mon, 6 Oct 2025 14:52:59 +0000 (07:52 -0700)]

Merge pull request #65214 from ifed01/wip-ifed-discard-threads-better-lifecycle-squi

squid: blk/kernel: improve DiscardThread life cycle.

Reviewed-by: YiteGu <yitegu0@gmail.com>

commit | commitdiff | tree

Yuri Weinstein [Mon, 6 Oct 2025 14:51:59 +0000 (07:51 -0700)]

Merge pull request #65006 from mchangir/wip-72564-squid

squid: mgr: avoid explicit dropping of ref

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>

commit | commitdiff | tree

Ilya Dryomov [Sun, 5 Oct 2025 13:17:48 +0000 (15:17 +0200)]

Merge pull request #65335 from abitdrag/wip-72817-squid

squid: auth: msgr2 can return incorrect allowed_modes through AuthBadMethodFrame

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>

commit | commitdiff | tree

Yuri Weinstein [Thu, 2 Oct 2025 17:27:31 +0000 (10:27 -0700)]

Merge pull request #64739 from VinayBhaskar-V/wip-72319-squid

squid: rbd-mirror: prevent image deletion if remote image is not primary

Reviewed-by: Ilya Dryomov <idryomov@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Wed, 1 Oct 2025 18:31:45 +0000 (11:31 -0700)]

Merge pull request #65665 from kchheda3/wip-73055-squid

squid: rgw/account: bucket acls are not completely migrated once the user is migrated to an account

Reviewed-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

Aashish Sharma [Tue, 30 Sep 2025 05:15:24 +0000 (10:45 +0530)]

Merge pull request #65709 from aaSharma14/wip-73293-squid

squid: monitoring: fix MTU Mismatch alert rule and expr

Reviewed-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>

commit | commitdiff | tree

Pedro Gonzalez Gomez [Mon, 29 Sep 2025 11:52:15 +0000 (13:52 +0200)]

Merge pull request #65706 from rhcs-dashboard/wip-73274-squid

squid: mgr/dashboard: Blank entry for Storage Capacity in dashboard under Cluster > Expand Cluster > Review

Reviewed-by: Pedro Gonzalez Gomez <pegonzal@ibm.com>

commit | commitdiff | tree

Aashish Sharma [Wed, 2 Jul 2025 11:05:14 +0000 (16:35 +0530)]

monitoring: fix MTU Mismatch alert rule and expr

Fixes: https://tracker.ceph.com/issues/73290
Signed-off-by: Aashish Sharma <aasharma@redhat.com>
(cherry picked from commit bee24dec441b9e6b263e4498c2ab333b0a60a52d)

Conflicts:
monitoring/ceph-mixin/prometheus_alerts.yml
monitoring/ceph-mixin/tests_alerts/test_alerts.yml
src/pybind/mgr/dashboard/frontend/src/app/ceph/cluster/prometheus/active-alert-list/active-alert-list.component.html
src/pybind/mgr/dashboard/frontend/src/app/ceph/cluster/prometheus/active-alert-list/active-alert-list.component.ts
src/pybind/mgr/dashboard/frontend/src/app/shared/datatable/table-key-value/table-key-value.component.scss

commit | commitdiff | tree

Rishabh Dave [Tue, 3 Jun 2025 18:56:33 +0000 (00:26 +0530)]

release note: add note for change in format of name of pool...

namespace of CephFS volumes.

Signed-off-by: Rishabh Dave <ridave@redhat.com>
(cherry picked from commit f350d9800024661eecdfd7da6d57fa0e0324d981)

commit | commitdiff | tree

Naman Munet [Wed, 24 Sep 2025 07:23:40 +0000 (12:53 +0530)]

mgr/dashboard: Blank entry for Storage Capacity in dashboard under Cluster > Expand Cluster > Review

https://tracker.ceph.com/issues/73220

Signed-off-by: Naman Munet <naman.munet@ibm.com>
(cherry picked from commit a01909e7588c7ff757079475e3ea6f1dc3054db7)

commit | commitdiff | tree

Yuri Weinstein [Fri, 26 Sep 2025 16:08:49 +0000 (09:08 -0700)]

Merge pull request #64456 from cbodley/wip-72090-squid

squid: deb/mgr: remove deprecated distutils from ceph-mgr.requires

Reviewed-by: Nizamudeen A <nia@redhat.com>

commit | commitdiff | tree

Christopher Hoffman [Fri, 26 Sep 2025 16:07:55 +0000 (12:07 -0400)]

Merge pull request #65141 from mchangir/wip-70925-squid

squid: mds: fix heap-use-after-free in C_Flush_Journal

commit | commitdiff | tree

Aashish Sharma [Fri, 26 Sep 2025 14:07:55 +0000 (19:37 +0530)]

Merge pull request #65620 from aaSharma14/wip-73167-squid

squid: mgr/dashboard: fix zone update API forcing STANDARD storage class

Reviewed-by: Afreen Misbah <afreen@ibm.com>

commit | commitdiff | tree

Aashish Sharma [Fri, 26 Sep 2025 05:11:59 +0000 (10:41 +0530)]

Merge pull request #65671 from aaSharma14/wip-73231-squid

squid: monitoring: fix "In" OSDs in Cluster-Advanced grafana panel. Also change units from decbytes to bytes wherever used in the panel

Reviewed-by: Afreen Misbah <afreen@ibm.com>

commit | commitdiff | tree

Rishabh Dave [Mon, 7 Apr 2025 14:39:44 +0000 (20:09 +0530)]

release note: add a note for "snapshot getpath" command

Signed-off-by: Rishabh Dave <ridave@redhat.com>
(cherry picked from commit a59b1fa431e2b546877c160beb5f67f2970776f0)

commit | commitdiff | tree

Rishabh Dave [Mon, 7 Apr 2025 14:36:33 +0000 (20:06 +0530)]

doc/cephfs: add doc for "snapshot getpath" cmd

Signed-off-by: Rishabh Dave <ridave@redhat.com>
(cherry picked from commit 9e40a5c8d7a5cd6e4c1929559c4c7e3411653de5)

commit | commitdiff | tree

Rishabh Dave [Mon, 7 Apr 2025 14:14:57 +0000 (19:44 +0530)]

qa/cephfs: add tests for "subvolume snapshot getpath" cmd

Signed-off-by: Rishabh Dave <ridave@redhat.com>
(cherry picked from commit 870cbf62d288ae09ea06a5da112ea62156336924)

commit | commitdiff | tree

Rishabh Dave [Wed, 2 Apr 2025 15:25:32 +0000 (20:55 +0530)]

mgr/vol: add command to get snapshot path

Fixes: https://tracker.ceph.com/issues/70815
Signed-off-by: Rishabh Dave <ridave@redhat.com>
(cherry picked from commit 50d28992d99fcd67390815aa42f9da8ffaa82575)

Conflicts:
src/pybind/mgr/volumes/fs/volume.py
- Line where the original patch makes the change is slightly different
in main compared to Squid branch, leading to conflict.

commit | commitdiff | tree

Aashish Sharma [Wed, 17 Sep 2025 06:58:16 +0000 (12:28 +0530)]

monitoring/ceph_mixin: fix Cluster - Advanced OSD grafana panel

1. Fixes the promql expr used to calculate "In" OSDs in
ceph-cluster-advanced.json.
2. Fixes the color coding for the single state panels used in the OSDs
grafana panel like "In", "Out" etc

Fixes: https://tracker.ceph.com/issues/72810
Signed-off-by: Aashish Sharma <aasharma@redhat.com>
(cherry picked from commit 53a6856d603e0fe4ff31f76e19263a80359a9f1d)

commit | commitdiff | tree

David Galloway [Wed, 24 Sep 2025 19:35:22 +0000 (15:35 -0400)]

Merge pull request #65659 from ceph/wip-squid-noble

squid: cmake: remove _FORTIFY_SOURCE define

commit | commitdiff | tree

Yuri Weinstein [Wed, 24 Sep 2025 18:42:50 +0000 (11:42 -0700)]

Merge pull request #64605 from cbodley/wip-72190-squid

squid: deb/cephadm: add explicit --home for cephadm user

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

commit | commitdiff | tree

kchheda3 [Wed, 3 Sep 2025 14:48:42 +0000 (10:48 -0400)]

rgw/account: bucket acls are not completely migrated once the user is migrated to an account

Signed-off-by: kchheda3 <kchheda3@bloomberg.net>
(cherry picked from commit 23dc3697cfd309b4d8736ec99490cd57db621cf7)

commit | commitdiff | tree

Casey Bodley [Wed, 3 Sep 2025 17:22:30 +0000 (13:22 -0400)]

cmake: remove _FORTIFY_SOURCE define

according to `dpkg-buildflags`, ubuntu 24 raised this value to
`-D_FORTIFY_SOURCE=3` which causes `error: "_FORTIFY_SOURCE" redefined`
compilation failures because Ceph itself adds `-D_FORTIFY_SOURCE=2`

`_FORTIFY_SOURCE` is a hardening option. both our rpm and debian builds
already specify that via environment variables, so Ceph's cmake should
leave it alone

Fixes: https://tracker.ceph.com/issues/72361
Signed-off-by: Casey Bodley <cbodley@redhat.com>
(cherry picked from commit 66bec97b0dc90b91f5be586351f52082beb6374a)

commit | commitdiff | tree

Rishabh Dave [Wed, 24 Sep 2025 13:42:42 +0000 (19:12 +0530)]

Merge pull request #61166 from anoopcs9/wip-69306-squid

squid: client: Handle empty pathnames for `ceph_chownat()` and `ceph_statxat()`

Reviewed-by: Rishabh Dave <ridave@redhat.com>

commit | commitdiff | tree

Adam King [Tue, 23 Sep 2025 21:10:40 +0000 (17:10 -0400)]

Merge pull request #65636 from adk3798/squid-cephadm-pin-cheroot

squid: pybind/mgr: pin cheroot version in requirements-required.txt

Reviewed-by: John Mulligan <jmulligan@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Tue, 23 Sep 2025 19:12:58 +0000 (12:12 -0700)]

Merge pull request #65588 from adamemerson/wip-perfcounters-unique-string-squid

squid: common: Allow PerfCounters to return a provided service ID

Reviewed-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Tue, 23 Sep 2025 19:12:09 +0000 (12:12 -0700)]

Merge pull request #65556 from jzhu116-bloomberg/wip-72972-squid

squid: rgw: discard olh_ attributes when copying object from a versioning-suspended bucket to a versioning-disabled bucket

Reviewed-by: Adam Emerson <aemerson@redhat.com>

commit | commitdiff | tree

Nizamudeen A [Fri, 19 Apr 2024 08:07:11 +0000 (13:37 +0530)]

mgr/dashboard: bump cheroot to > 10.0

Fixes: https://tracker.ceph.com/issues/55837
Signed-off-by: Nizamudeen A <nia@redhat.com>
(cherry picked from commit 1ec74a8360d1c4abb39754320eba118d080e3499)

commit | commitdiff | tree

Anoop C S [Mon, 23 Sep 2024 07:06:55 +0000 (12:36 +0530)]

client: Gracefully handle empty pathname for statxat()

man statx(2)[1] says the following:
. . .
AT_EMPTY_PATH
    If pathname is an empty string, operate on the file referred to by
    dirfd (which may have been obtained using the open(2) O_PATH flag).
    In this case, dirfd can refer to any type of file, not just a
    directory.

    If dirfd is AT_FDCWD, the call operates on the current working
    directory.
. . .

Look out for an empty pathname and use the relative fd's inode in the
presence of AT_EMPTY_PATH flag before calling internal _getattr().

Fixes: https://tracker.ceph.com/issues/68189
Review with: git show -w

[1] https://www.man7.org/linux/man-pages/man2/statx.2.html

Signed-off-by: Anoop C S <anoopcs@cryptolab.net>
(cherry picked from commit edd7fe76c4919bc243377c6d7aae20b0606b89c3)

Conflicts:
        src/client/Client.cc
- path_walk() refactor from https://github.com/ceph/ceph/pull/62095
  included the required core changes.

commit | commitdiff | tree

Anoop C S [Thu, 17 Oct 2024 16:15:17 +0000 (21:45 +0530)]

libcephfs.h: Fix API documentation for ceph_statxat

flags parameter for ceph_statxat() API is supposed to accept only
AT_STATX_DONT_SYNC and AT_SYMLINK_NOFOLLOW. Modify the corresponding
documentation to reflect the acceptance of above two flags.

Signed-off-by: Anoop C S <anoopcs@cryptolab.net>
(cherry picked from commit 92c5ab99b8dcaae56e4a92cfe72a7e3d343b8a0c)

commit | commitdiff | tree

Anoop C S [Fri, 20 Sep 2024 08:49:01 +0000 (14:19 +0530)]

client: Gracefully handle empty pathname for chownat()

man fchownat(2)[1] says the following:
. . .
AT_EMPTY_PATH (since Linux 2.6.39)
    If pathname is an empty string, operate on the file referred to by
    dirfd (which may have been obtained using the open(2) O_PATH flag).
    In this case, dirfd can refer to any type of file, not just a
    directory. If dirfd is AT_FDCWD, the call operates on the current
    working directory.
. . .

Look out for an empty pathname and use the relative fd's inode in the
presence of AT_EMPTY_PATH flag before calling internal _setattr().

Fixes: https://tracker.ceph.com/issues/68189
Review with: git show -w

[1] https://www.man7.org/linux/man-pages/man2/fchownat.2.html

Signed-off-by: Anoop C S <anoopcs@cryptolab.net>
(cherry picked from commit 829f38899226fcd1f603ba446b018f53c5b0921d)

Conflicts:
        src/client/Client.cc
- path_walk() refactor from https://github.com/ceph/ceph/pull/62095
  included the required core changes.

commit | commitdiff | tree

Casey Bodley [Tue, 23 Sep 2025 13:24:28 +0000 (09:24 -0400)]

Merge pull request #65639 from zdover23/wip-doc-2025-09-23-squid-remove-cloud-restore-rst

squid: doc/radosgw: remove cloud-restore from squid

Reviewed-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

Rishabh Dave [Tue, 23 Sep 2025 12:10:57 +0000 (17:40 +0530)]

Merge pull request #61451 from anoopcs9/wip-69556-squid

squid: mds: Fix invalid access of mdr->dn[0].back()

Reviewed-by: Rishabh Dave <ridave@redhat.com>

commit | commitdiff | tree

Rishabh Dave [Tue, 23 Sep 2025 12:10:47 +0000 (17:40 +0530)]

Merge pull request #62391 from neesingh-rh/wip-70416-squid

squid: cephfs-shell: add option to remove xattr

Reviewed-by: Rishabh Dave <ridave@redhat.com>

commit | commitdiff | tree

Rishabh Dave [Tue, 23 Sep 2025 12:10:35 +0000 (17:40 +0530)]

Merge pull request #64652 from rishabh-d-dave/wip-72200-squid

squid: mgr/vol: keep and show clone source info

Reviewed-by: Rishabh Dave <ridave@redhat.com>

commit | commitdiff | tree

Rishabh Dave [Tue, 23 Sep 2025 12:10:25 +0000 (17:40 +0530)]

Merge pull request #65279 from joscollin/wip-67809-squid

squid: mds: add more debug logs and log events

Reviewed-by: Rishabh Dave <ridave@redhat.com>

commit | commitdiff | tree

Rishabh Dave [Tue, 23 Sep 2025 12:09:50 +0000 (17:39 +0530)]

Merge pull request #65280 from joscollin/wip-69369-squid

squid: qa: use a larger timeout for kernel_untar_build workunit

Reviewed-by: Rishabh Dave <ridave@redhat.com>

commit | commitdiff | tree

VinayBhaskar-V [Wed, 30 Jul 2025 10:10:27 +0000 (10:10 +0000)]

test/rbd-mirror: eliminate a race in ResyncRequestedRemoteNotPrimary

Adjust the wait_for_notification call in TestMockImageReplayerSnapshotReplayer.ResyncRequestedRemoteNotPrimary
to expect 2 notifications instead of 1. This allows the test to correctly wait for both expected events
i.e for finish_sync() and handle_replay_complete(locker, -EREMOTEIO, "remote image demoted"), ensuring the
replayer transitions to STATE_COMPLETE and is_replaying() returns false as intended.

Fixes: https://tracker.ceph.com/issues/72325
Signed-off-by: VinayBhaskar-V <vvarada@redhat.com>
(cherry picked from commit b5a013f6170bb4445da8f5469243e4869b760a81)

commit | commitdiff | tree

VinayBhaskar-V [Tue, 13 May 2025 20:25:44 +0000 (01:55 +0530)]

rbd-mirror: prevent image deletion if remote image is not primary

A resync on a mirrored image may incorrectly results in the local
image being deleted even when the remote image is no longer primary.
This issue can occur under the following conditions:
* if  resync is requested on the secondary before the remote image has
  been fully demoted
* if the demotion of the primary image is not mirrored
  due to the rbd-mirror daemon being offline.

This can be fixed by ensuring that image deletion during a resync is
only allowed when the remote image is confirmed to be primary.

This commit fixes the issue only for snapshot based mirroring mode

Fixes: https://tracker.ceph.com/issues/70948
Signed-off-by: VinayBhaskar-V <vvarada@redhat.com>
(cherry picked from commit e14afbc95a5fb8f5a33e7ea23a035992b966d671)

commit | commitdiff | tree

Jos Collin [Tue, 23 Sep 2025 03:09:30 +0000 (08:39 +0530)]

Merge pull request #63019 from batrick/wip-71094-squid

squid: mds: check for snapshots on parent snaprealms

Reviewed-by: Jos Collin <jcollin@redhat.com>

commit | commitdiff | tree

Jos Collin [Tue, 23 Sep 2025 02:43:42 +0000 (08:13 +0530)]

Merge pull request #62499 from batrick/wip-70663-squid

squid: client: ll_walk will process absolute paths as relative

Reviewed-by: Jos Collin <jcollin@redhat.com>

commit | commitdiff | tree

David Galloway [Tue, 23 Sep 2025 02:27:19 +0000 (19:27 -0700)]

Merge pull request #65629 from phlogistonjohn/jjm-s-65514

squid: build-with-container: add argument groups to organize options

commit | commitdiff | tree

Zac Dover [Tue, 23 Sep 2025 02:09:52 +0000 (12:09 +1000)]

doc/radosgw: remove cloud-restore from squid

Remove doc/radosgw/cloud-restore.rst from the Squid branch.

cloud-restore does not appear in index.rst, so its removal from
index.rst is unnecessary.

Signed-off-by: Zac Dover <zac.dover@proton.me>

commit | commitdiff | tree

Jos Collin [Tue, 23 Sep 2025 02:06:52 +0000 (07:36 +0530)]

Merge pull request #64090 from vshankar/wip-cephfs-client-fixes-squid

squid: client: cephfs user-space client fixes

Reviewed-by: Jos Collin <jcollin@redhat.com>

commit | commitdiff | tree

Adam King [Mon, 22 Sep 2025 21:05:07 +0000 (17:05 -0400)]

pybind/mgr: pin cheroot version in requirements-required.txt

With python 3.10 (didn't seem to happen with python 3.12) the
pybind/mgr/cephadm/tests/test_node_proxy.py test times out.
This appears to be related to a new release of the cheroot
package and a github issues describing the same problem
we're seeing has been opened by another user
https://github.com/cherrypy/cheroot/issues/769

It is worth noting that the workaround described in that
issue does also work for us. If you add

```
import cheroot
cheroot.server.HTTPServer._serve_unservicable = lambda: None
```

after the existing imports in test_node_proxy.py the
test hanging issue also disappears. Also worth noting the
particular pin of

cheroot~=10.0

was chosen as it matches the existing pin being used
in pybind/mgr/dashboard/constraints.txt

Signed-off-by: Adam King <adking@redhat.com>
(cherry picked from commit 6231955b5d00ae6b3630ee94e85b2449092ef0fe)

commit | commitdiff | tree

Jos Collin [Tue, 23 Sep 2025 01:41:10 +0000 (07:11 +0530)]

Merge pull request #61274 from kotreshhr/wip-68940-squid

squid: ceph-fuse: Improve fuse mount usage message

Reviewed-by: Jos Collin <jcollin@redhat.com>

commit | commitdiff | tree

Jos Collin [Tue, 23 Sep 2025 01:36:49 +0000 (07:06 +0530)]

Merge pull request #62517 from salieri11/wip-70631-squid

squid: mds: add MDS asok command for dumping stray directories

Reviewed-by: Jos Collin <jcollin@redhat.com>

commit | commitdiff | tree

Jos Collin [Tue, 23 Sep 2025 01:28:13 +0000 (06:58 +0530)]

Merge pull request #65133 from chrisphoffman/wip-72645-squid

squid: client: use path supplied in statfs

Reviewed-by: Jos Collin <jcollin@redhat.com>

commit | commitdiff | tree

John Mulligan [Fri, 12 Sep 2025 17:52:25 +0000 (13:52 -0400)]

build-with-container: add argument groups to organize options

Use the argparse add_argument_group feature to organize the mass of
arguments into more sensible categories. Hopefully, someone reading
over the `--help` output can now more easily see options that
are useful rather than being overwhelmed by a wall of text.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
(cherry picked from commit 71a1be4dd0aea004da56c2f518ee70a281a3f7d3)

commit | commitdiff | tree

Aashish Sharma [Thu, 18 Sep 2025 10:59:52 +0000 (16:29 +0530)]

mgr/dashboard: fix zone update API forcing STANDARD storage class

The zone update REST API (`edit_zone`) always attempted to configure a
placement target for the `STANDARD` storage class, even when the request
was intended for a different storage class name.
This caused failures in deployments where `STANDARD` is not defined.

Changes:
Club add placement target and add storage class methods into one single
add_placement_targets_storage_class_zone method which takes the storage
class as a param as well alongside the rest of the placement params.

Fixes: https://tracker.ceph.com/issues/73105
Signed-off-by: Aashish Sharma <aasharma@redhat.com>
(cherry picked from commit 135f3adb4973be493925839e946e7a5fc75e7d5c)

commit | commitdiff | tree

Rishabh Dave [Wed, 14 Feb 2024 09:52:47 +0000 (15:22 +0530)]

mgr/vol: move reusable parts from async_cloner

Signed-off-by: Rishabh Dave <ridave@redhat.com>
(cherry picked from commit 8c536f78907fe1d42e1df612604d5872019b501d)

Conflicts:
src/pybind/mgr/volumes/fs/async_cloner.py
src/pybind/mgr/volumes/fs/operations/versions/subvolume_v1.py
- commit 8c536f78907f was missing which led to conflict.

commit | commitdiff | tree

Adam C. Emerson [Mon, 8 Sep 2025 18:19:20 +0000 (14:19 -0400)]

rgw: Record the `service_unique_id`, if present, in the SrviceMap

For consistency and ease associating the two.

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
(cherry picked from commit 3a94a7b2ed02d20b2bc839b283e60cf4778f69e4)
Signed-off-by: Adam C. Emerson <aemerson@redhat.com>

commit | commitdiff | tree

Adam C. Emerson [Fri, 5 Sep 2025 15:31:40 +0000 (11:31 -0400)]

common: Allow PerfCounters to return a provided service ID

Dashboard has asked for a unique identifier that can be associated
with services. This commit provides a component of that
functionality. Enforcing uniqueness is beyond the scope of this PR and
is the responsibility of cluster setup and orchestration. The scope of
uniqueness is a matter of policy and up to the design of cluster setup
and orchestration software.

We provide the `--service_unique_id` argument that can be passed on
the command line when executing a Ceph service that uses
`global_init`. If non-empty, a `service_unique_id` section is added to
the PerfCounters dump for that service. This section has a single
entry whose name is set to the argument of `service_unique_id` and
whose value is arbitrary. If unspecified or empty, no
`service_unique_id` section is added.

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
(cherry picked from commit 6dc322421f7a3758251fe29e3f35934231358011)
Conflicts:
src/common/options/global.yaml.in
- Preceding options not in Squid

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Thu, 18 Sep 2025 15:20:45 +0000 (08:20 -0700)]

Merge pull request #65219 from cbodley/wip-72715-squid

squid: rgw/s3: remove 'aws-chunked' from Content-Encoding response

Reviewed-by: Adam Emerson <aemerson@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Thu, 18 Sep 2025 15:19:59 +0000 (08:19 -0700)]

Merge pull request #62051 from k0ste/wip-65923-squid

squid: CephContext: acquire _fork_watchers_lock in notify_post_fork()

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>

commit | commitdiff | tree

Christopher Hoffman [Wed, 17 Sep 2025 15:17:39 +0000 (11:17 -0400)]

Merge pull request #63720 from batrick/wip-71554-squid

squid: test/libcephfs: copy DT_NEEDED entries from input libraries

commit | commitdiff | tree

anrao19 [Wed, 17 Sep 2025 05:22:12 +0000 (10:52 +0530)]

Merge pull request #64936 from pritha-srivastava/wip-72464-squid

squid: rgw: check all JWKS for STS

commit | commitdiff | tree

Jane Zhu [Wed, 20 Aug 2025 18:38:23 +0000 (18:38 +0000)]

rgw: discard olh_ attributes when copying object from a versioning-suspended bucket to a versioning-disabled bucket

Signed-off-by: Jane Zhu <jzhu116@bloomberg.net>
(cherry picked from commit 3fed58f43c3cb3977130926a2d1bca551deefade)

commit | commitdiff | tree

Anthony D'Atri [Tue, 16 Sep 2025 20:48:05 +0000 (15:48 -0500)]

Merge pull request #65550 from stackhpc/doc-balancer-squid

squid: doc: Fixes a typo in balancer operations

commit | commitdiff | tree

Tyler Brekke [Tue, 24 Jun 2025 19:12:33 +0000 (12:12 -0700)]

doc: Fixes a typo in balancer operations

Signed-off-by: Tyler Brekke <tbrekke@digitalocean.com>
(cherry picked from commit b038b8093d01a5e676ffa419607489a79261ef29)

Unnamed repository; edit this file 'description' to name the repository.

RSS Atom