git.apps.os.sepia.ceph.com Git - ceph.git/log

]> git.apps.os.sepia.ceph.com Git - ceph.git/log

projects / ceph.git / log

summary | shortlog | log | commit | commitdiff | tree
first ⋅ prev ⋅ next

commit | commitdiff | tree

Vallari Agrawal [Wed, 25 Dec 2024 04:50:26 +0000 (10:20 +0530)]

src/common/options/mon.yaml.in: add mon_nvmeofgw_delete_grace

This config allows to configure the delay in triggering
NVMEOF_GATEWAY_DELETING healthcheck warning, which is
triggered when NVMeoF gateways are in DELETEING state
for too long (indicating a problem in namespace
load-balacing).
The default value for this config is 15 mins.

Signed-off-by: Vallari Agrawal <vallari.agrawal@ibm.com>

commit | commitdiff | tree

Vallari Agrawal [Wed, 18 Dec 2024 07:59:47 +0000 (13:29 +0530)]

mon/NVMeofGwMap: add healthcheck warning NVMEOF_GATEWAY_DELETING

Add a warning when NVMeoF gateways are in DELETING state.
This happens when there are namespaces under the deleted gateway's
ANA group ID.

The gateways are removed completely after users manually move these
namespaces to another load balancing group. Or if a new gateway is
deployed on that host.

Signed-off-by: Vallari Agrawal <vallari.agrawal@ibm.com>

commit | commitdiff | tree

Gil Bregman [Fri, 3 Jan 2025 10:12:08 +0000 (12:12 +0200)]

Merge pull request #61217 from gbregman/main

mgr/cephadm/nvmeof: Add key verification field to NVMeOF configuration

commit | commitdiff | tree

Gil Bregman [Thu, 2 Jan 2025 21:08:00 +0000 (23:08 +0200)]

mgr/cephadm/nvmeof: Add key verification field to NVMeOF configuration
Fixes https://tracker.ceph.com/issues/69413

Signed-off-by: Gil Bregman <gbregman@il.ibm.com>

commit | commitdiff | tree

J. Eric Ivancich [Thu, 2 Jan 2025 19:31:41 +0000 (14:31 -0500)]

Merge pull request #59980 from kchheda3/wip-fix-v1-v2-notification

rgw/notification: Forward Topic & Notification creation request to master when notification_v2 enabled

Reviewed-by: Yuval Lifshitz <ylifshit@ibm.com>

commit | commitdiff | tree

J. Eric Ivancich [Thu, 2 Jan 2025 19:30:51 +0000 (14:30 -0500)]

Merge pull request #60430 from ivancich/wip-fix-multipart-empty-storage-class

rgw: fix empty storage class on display of multipart uploads

Reviewed-by: Adam Emerson <aemerson@redhat.com>

commit | commitdiff | tree

J. Eric Ivancich [Thu, 2 Jan 2025 19:29:13 +0000 (14:29 -0500)]

Merge pull request #59631 from thotz/create-user-without-creds-cli

radosgw-admin: create user without creds cli

Reviewed-by: Matt Benjamin <mbenjamin@redhat.com>
Reviewed-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

Venky Shankar [Thu, 2 Jan 2025 06:45:55 +0000 (12:15 +0530)]

Merge PR #60411 into main

* refs/pull/60411/head:
client: Fix a deadlock when osd is full

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@ibm.com>
Reviewed-by: Dhairya Parmar <dparmar@redhat.com>

commit | commitdiff | tree

Yuval Lifshitz [Wed, 1 Jan 2025 09:03:41 +0000 (11:03 +0200)]

Merge pull request #61103 from yuvalif/wip-yuval-fix-test-names

test/rgw/noitifications: fix test names

Reviewed-By: Ali Masarwe <ali.masarwa@ibm.com>

commit | commitdiff | tree

Anthony D'Atri [Mon, 30 Dec 2024 15:05:10 +0000 (10:05 -0500)]

Merge pull request #61197 from bebehei/fix-haproxy-dashboard-docs

doc/mgr/dashboard: Fix HAProxy TLS example

commit | commitdiff | tree

Benedikt Heine [Mon, 30 Dec 2024 14:26:16 +0000 (15:26 +0100)]

doc/mgr/dashboard: Fix HAProxy TLS example

With `ssl` set on the `server` option, HAProxy strips the TLS protocol
for all clients. You would need to connect to it with `http://<ip>:443`.

To have an active health check, which uses SSL, but does not strip it
for clients, you'd need to add:

- `check` to enable active health checks.
- `check-ssl` to instruct the health check to use TLS
- `verify none` to skip verification on the health check requests from
HAProxy
- _REMOVE_ `ssl` to stop stripping TLS

The active health checks are required to not route any requests to the
inactive managers. These would redirect to any unusable IP from the
active mgr.

---

Alternatively you could add another certificate in the frontend and then
re-encrypt the traffic. But this would require tracking the certs also
in HAProxy.

Signed-off-by: Benedikt Heine <bebe@bebehei.de>

commit | commitdiff | tree

Yuval Lifshitz [Mon, 16 Dec 2024 17:16:36 +0000 (17:16 +0000)]

test/rgw/noitifications: fix test names

for persistent topic stats tests

Signed-off-by: Yuval Lifshitz <ylifshit@ibm.com>

commit | commitdiff | tree

Zac Dover [Mon, 30 Dec 2024 07:26:45 +0000 (17:26 +1000)]

Merge pull request #60794 from dparmar18/wip-68571

doc/cephfs: document purge queue and its perf counters

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

commit | commitdiff | tree

Zac Dover [Sun, 29 Dec 2024 19:21:39 +0000 (05:21 +1000)]

Merge pull request #61191 from zdover23/wip-doc-2024-12-29-README-cleanup

doc: README.md - format admonition

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

commit | commitdiff | tree

Zac Dover [Sun, 29 Dec 2024 13:24:46 +0000 (23:24 +1000)]

doc: README.md - format admonition

Format an admonition correctly. This commit is a prelude to a cleanup of
a recent addition to README.md.

Signed-off-by: Zac Dover <zac.dover@proton.me>

commit | commitdiff | tree

Zac Dover [Sat, 28 Dec 2024 15:32:26 +0000 (01:32 +1000)]

Merge pull request #61142 from Dedsec0098/wip-doc-shrish

doc: Update vstart section in readme.md

Reviewed-by: Zac Dover <zac.dover@proton.me>

commit | commitdiff | tree

Zac Dover [Sat, 28 Dec 2024 09:38:26 +0000 (19:38 +1000)]

Merge pull request #61156 from zdover23/wip-doc-2024-12-20-radosgw-uadk-accel

doc/radosgw: line edit uadk-accel.rst (1st half)

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

commit | commitdiff | tree

Ronen Friedman [Fri, 27 Dec 2024 13:29:57 +0000 (15:29 +0200)]

Merge pull request #61184 from ronen-fr/wip-rf-watch-scrubmax

osd/scrub: register for 'osd_max_scrubs' config changes

Reviewed-by: Nitzan Mordechai <nmordech@redhat.com>

commit | commitdiff | tree

Venky Shankar [Fri, 27 Dec 2024 11:06:10 +0000 (16:36 +0530)]

Merge PR #55616 into main

* refs/pull/55616/head:
PendingReleaseNotes: add note for replay completion warning
qa: test to verify `MDS_ESTIMATED_REPLAY_TIME` warning
doc: add a note for `MDS_ESTIMATED_REPLAY_TIME` MDS warning
mds: emit warning for estinated replay time

Reviewed-by: Patrick Donnelly <pdonnell@ibm.com>
Reviewed-by: Milind Changire <mchangir@redhat.com>

commit | commitdiff | tree

Venky Shankar [Fri, 27 Dec 2024 06:40:08 +0000 (12:10 +0530)]

Merge PR #60640 into main

* refs/pull/60640/head:
qa: use a larger timeout for kernel_untar_build workunit

Reviewed-by: Milind Changire <mchangir@redhat.com>

commit | commitdiff | tree

Ronen Friedman [Thu, 26 Dec 2024 13:06:10 +0000 (07:06 -0600)]

osd/scrub: register for 'osd_max_scrubs' config changes

Since https://github.com/ceph/ceph/pull/55340, the
osd_max_scrubs (also) affects the parameters of the
async scrub reserver used by the replicas. Thus,
the code must notice and acknowledge changes to this config.

Fixes: https://tracker.ceph.com/issues/69362
Signed-off-by: Ronen Friedman <rfriedma@redhat.com>

commit | commitdiff | tree

Yuval Lifshitz [Thu, 26 Dec 2024 10:48:54 +0000 (12:48 +0200)]

Merge pull request #60988 from thuvh/feat/kafka_multiple_brokers

rgw: allow bucket notification send message to kafka with multiple brokers

commit | commitdiff | tree

Yingxin Cheng [Thu, 26 Dec 2024 07:27:28 +0000 (15:27 +0800)]

Merge pull request #61144 from cyx1231st/wip-seastore-improve-backref

crimson/os/seastore/cache: improve committing allocations by backref entries

Reviewed-by: Xuehan Xu <xuxuehan@qianxin.com>
Reviewed-by: Zhang Song <zhangsong02@qianxin.com>

commit | commitdiff | tree

Dhairya Parmar [Thu, 21 Nov 2024 22:10:33 +0000 (03:40 +0530)]

doc/cephfs: document purge queue and its perf counters

Fixes: https://tracker.ceph.com/issues/68571
Signed-off-by: Dhairya Parmar <dparmar@redhat.com>

commit | commitdiff | tree

Venky Shankar [Thu, 26 Dec 2024 04:15:08 +0000 (09:45 +0530)]

Merge PR #60653 into main

* refs/pull/60653/head:
mds: do not process client metrics message with fast dispatch

Reviewed-by: Patrick Donnelly <pdonnell@ibm.com>
Reviewed-by: Jos Collin <jcollin@redhat.com>
Reviewed-by: Dhairya Parmar <dparmar@redhat.com>

commit | commitdiff | tree

Venky Shankar [Thu, 26 Dec 2024 04:11:58 +0000 (09:41 +0530)]

Merge pull request #61143 from kotreshhr/qa-nfsganesha-fio-test

qa: Add async io test to nfs ganesha using fio

Reviewed-by: Venky Shankar <vshankar@redhat.com>

commit | commitdiff | tree

Venky Shankar [Thu, 26 Dec 2024 04:04:15 +0000 (09:34 +0530)]

Merge pull request #55145 from mchangir/mgr-snap_schedule-handle-volume-delete

mgr/snap_schedule: handle volume delete

Reviewed-by: Venky Shankar <vshankar@redhat.com>

commit | commitdiff | tree

Hoai-Thu Vuong [Tue, 9 Jul 2024 16:56:51 +0000 (23:56 +0700)]

rgw: allow bucket notification send message to kafka with multiple brokers

- add new parameter to define list of broker
- update document
- change nose to nose-py3
- add test case for multiple brokers (happy case)

Signed-off-by: Hoai-Thu Vuong <thuvh87@gmail.com>

commit | commitdiff | tree

Anthony D'Atri [Wed, 25 Dec 2024 23:57:46 +0000 (18:57 -0500)]

Merge pull request #61177 from Skalador/wip-doc-typo-hardware-controllers

doc/start: Fixes a typo in controllers section of hardware recommendations

commit | commitdiff | tree

Skalador [Wed, 25 Dec 2024 22:57:00 +0000 (23:57 +0100)]

doc: Fixes a typo in controllers section of hardware recommendations

Signed-off-by: Kevin Niederwanger <k.niederwanger@gmail.com>

commit | commitdiff | tree

Yuri Weinstein [Tue, 24 Dec 2024 19:08:30 +0000 (11:08 -0800)]

Merge pull request #59116 from rzarzynski/wip-constbl-compressor

compressor: switch out of get_current_ptr() for the sake of const-stricter bufferlist

Reviewed-by: Bill Scales <bill_scales@uk.ibm.com>
Reviewed-by: Igor Fedotov <ifedotov@suse.com>

commit | commitdiff | tree

Ronen Friedman [Tue, 24 Dec 2024 18:32:39 +0000 (20:32 +0200)]

Merge pull request #60808 from ronen-fr/wip-rf-reservStat2

osd/scrub: show reservation status in 'pg dump' output

Reviewed-by: Nitzan Mordechai <nmordech@redhat.com>
Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>

commit | commitdiff | tree

Ronen Friedman [Fri, 22 Nov 2024 18:00:50 +0000 (12:00 -0600)]

osd/scrub: show reservation status in 'pg dump' output

Whenever a PG is selected for scrubbing, and is waiting for
remote reservations, the 'pg dump' output will include the
following text (under the 'SCRUB_SCHEDULING' column):
Reserving. Waiting Ns for OSD.k (n/m)

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>

commit | commitdiff | tree

Yingxin Cheng [Tue, 24 Dec 2024 02:29:42 +0000 (10:29 +0800)]

crimson/os/seastore/cache: move committing existing blocks to prepare_record()

So the 2 steps of remapping are atomically committed.

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>
Signed-off-by: Zhang Song <zhangsong02@qianxin.com>

commit | commitdiff | tree

Yingxin Cheng [Fri, 20 Dec 2024 08:23:26 +0000 (16:23 +0800)]

crimson/os/seastore/journal: hide RBM specific finish_commit()

By introducing a callback upon completing submission.

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Yingxin Cheng [Thu, 19 Dec 2024 07:13:38 +0000 (15:13 +0800)]

crimson/os/seastore/cache: minor cleanup

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Shrish0098 [Thu, 19 Dec 2024 03:57:56 +0000 (09:27 +0530)]

doc: Update vstart section in readme.md

Add guidance about vstart clusters to README.md.

Signed-off-by: Shrish Mishra <shrish409@gmail.com>

commit | commitdiff | tree

Matan Breizman [Mon, 23 Dec 2024 09:02:04 +0000 (11:02 +0200)]

Merge pull request #60821 from Matan-B/wip-matanb-crimson-seastore-deafult

crimson: switch to SeaStore as default object store

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Ilya Dryomov [Sun, 22 Dec 2024 22:56:28 +0000 (23:56 +0100)]

Merge pull request #61155 from idryomov/wip-69324

rbd: handle --{group,image}-namespace in "rbd group image {add,rm}"

Reviewed-by: Ramana Raja <rraja@redhat.com>

commit | commitdiff | tree

Ilya Dryomov [Sun, 22 Dec 2024 22:56:09 +0000 (23:56 +0100)]

Merge pull request #61150 from idryomov/wip-69319

rbd: open images in read-only mode for "rbd mirror pool status --verbose"

Reviewed-by: Ramana Raja <rraja@redhat.com>

commit | commitdiff | tree

Ilya Dryomov [Sun, 22 Dec 2024 15:23:12 +0000 (16:23 +0100)]

Merge pull request #61129 from idryomov/wip-68998

librbd: avoid data corruption on flatten when object map is inconsistent

Reviewed-by: Ramana Raja <rraja@redhat.com>

commit | commitdiff | tree

Laura Flores [Sat, 21 Dec 2024 02:58:38 +0000 (20:58 -0600)]

Merge pull request #61125 from ljflores/wip-release-note

commit | commitdiff | tree

Patrick Donnelly [Fri, 20 Dec 2024 16:05:57 +0000 (11:05 -0500)]

Merge PR #61149 into main

* refs/pull/61149/head:
script/ceph-backport: allow hyphen in project during ceph fork deduction

Reviewed-by: Patrick Donnelly <pdonnell@ibm.com>

commit | commitdiff | tree

afreen23 [Fri, 20 Dec 2024 13:16:48 +0000 (18:46 +0530)]

Merge pull request #60777 from VallariAg/wip-nvmeof-prometheus-rbd-image-reused

monitoring: Add prometheus alert NVMeoFMultipleNamespacesOfRBDImage

Reviewed-by: Afreen Misbah <afreen@ibm.com>

commit | commitdiff | tree

Ilya Dryomov [Fri, 20 Dec 2024 10:16:58 +0000 (11:16 +0100)]

rbd: drop --pool option from "rbd group image {add,rm}"

It stopped working with removal of get_special_pool_group_names() in
commit 3e8624f157a1 ("rbd: add support for namespaces") over six years
ago. Given how much time has passed, stop accepting this option.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>

commit | commitdiff | tree

Zac Dover [Fri, 20 Dec 2024 12:16:46 +0000 (22:16 +1000)]

doc/radosgw: line edit uadk-accel.rst (1st half)

Line edit the first half of doc/radosgw/uadk-accel.rst.

This is one of a series of pull requests intended to improve the content
of uadk-accel.rst. This series includes the following PRs:

https://github.com/ceph/ceph/pull/59514
https://github.com/ceph/ceph/pull/60953
https://github.com/ceph/ceph/pull/61128

Signed-off-by: Zac Dover <zac.dover@proton.me>

commit | commitdiff | tree

Zac Dover [Fri, 20 Dec 2024 12:13:35 +0000 (22:13 +1000)]

Merge pull request #61146 from zdover23/wip-doc-2024-12-19-radosgw-uadk-accel

doc/radosgw: edit uadk-accel.rst

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

commit | commitdiff | tree

Ilya Dryomov [Tue, 17 Dec 2024 15:06:17 +0000 (16:06 +0100)]

rbd: handle --{group,image}-namespace in "rbd group image {add,rm}"

Currently only passing the namespace as part of the group or image spec
works. If --group-namespace or --image-namespace options are used, the
namespace isn't picked up.

Fixes: https://tracker.ceph.com/issues/69324
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>

commit | commitdiff | tree

Laura Flores [Tue, 17 Dec 2024 23:18:11 +0000 (17:18 -0600)]

PendingReleaseNotes: add note about tracker #69012

We merged a fix for v19.2.1 that helps alleviate
the worst of this problem (https://tracker.ceph.com/issues/68657),
but it still comes up on occasion. This release note addresses the
remaining issues tracked in https://tracker.ceph.com/issues/69012.

Signed-off-by: Laura Flores <lflores@ibm.com>

commit | commitdiff | tree

Ilya Dryomov [Tue, 17 Dec 2024 12:31:09 +0000 (13:31 +0100)]

rbd: open images in read-only mode for "rbd mirror pool status --verbose"

This is cleaner and makes the command run a bit faster because watches
won't be established.

Fixes: https://tracker.ceph.com/issues/69319
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>

commit | commitdiff | tree

J. Eric Ivancich [Thu, 19 Dec 2024 17:11:06 +0000 (12:11 -0500)]

script/ceph-backport: allow hyphen in project during ceph fork deduction

This will allow the fork deduction code to recognize project names
that include a hyphen, such as:

git@github.com:USERNAME/ceph-fork.git

Signed-off-by: J. Eric Ivancich <ivancich@redhat.com>

commit | commitdiff | tree

leonidc [Thu, 19 Dec 2024 15:57:25 +0000 (17:57 +0200)]

Merge pull request #61113 from leonidc/wip-leonidc_no_listener_fix

nvmeofgw*: fix no-listeners

commit | commitdiff | tree

Casey Bodley [Thu, 19 Dec 2024 15:19:51 +0000 (10:19 -0500)]

Merge pull request #61107 from cbodley/wip-69241

rgw/posix: std::ignore return value of write()

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>

commit | commitdiff | tree

afreen23 [Thu, 19 Dec 2024 15:16:59 +0000 (20:46 +0530)]

Merge pull request #59703 from rhcs-dashboard/usermanagement-form

mgr/dashboard: carbonize user management form

Reviewed-by: Afreen Misbah <afreen@ibm.com>

commit | commitdiff | tree

baum [Thu, 19 Dec 2024 14:57:14 +0000 (16:57 +0200)]

Merge pull request #61130 from baum/wip-baum-20241218-00

src/nvmeof/NVMeofGwMonitorClient: remove MDS client, not needed

commit | commitdiff | tree

Zac Dover [Thu, 19 Dec 2024 13:19:22 +0000 (23:19 +1000)]

doc/radosgw: edit uadk-accel.rst

Edit the sections of doc/radosgw/uadk-accel.rst that concern the
automatic and manual building of UADK.

This is one in a series of uadk-accel.rst-related changes that includes
the following PRs:
https://github.com/ceph/ceph/pull/60953
https://github.com/ceph/ceph/pull/61128

Signed-off-by: Zac Dover <zac.dover@proton.me>

commit | commitdiff | tree

Zac Dover [Thu, 19 Dec 2024 13:13:14 +0000 (23:13 +1000)]

Merge pull request #61128 from zdover23/wip-doc-2024-12-18-radosgw-uadk-accel

doc/radosgw: edit uadk-accel.rst

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

commit | commitdiff | tree

Matan Breizman [Mon, 16 Dec 2024 16:56:41 +0000 (16:56 +0000)]

vstart: use crimson_osd_objectstore

Signed-off-by: Matan Breizman <mbreizma@redhat.com>

commit | commitdiff | tree

Matan Breizman [Mon, 25 Nov 2024 12:05:19 +0000 (12:05 +0000)]

doc/dev/crimson: update SeaStore as default backend

Signed-off-by: Matan Breizman <mbreizma@redhat.com>

commit | commitdiff | tree

Matan Breizman [Mon, 25 Nov 2024 12:04:18 +0000 (12:04 +0000)]

vstart.sh: set alien cores with bluestore only

Signed-off-by: Matan Breizman <mbreizma@redhat.com>

commit | commitdiff | tree

Matan Breizman [Mon, 25 Nov 2024 12:03:46 +0000 (12:03 +0000)]

qa/suites/crimson-rados: make use of crimson_osd_objectstore

Signed-off-by: Matan Breizman <mbreizma@redhat.com>

commit | commitdiff | tree

Yingxin Cheng [Wed, 18 Dec 2024 09:09:54 +0000 (17:09 +0800)]

crimson/os/seastore: make the updates to backref_entry_mset be consistent with extents

Generally, make alloc/free be consistent with add/commit/retire extents
in Cache.

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>
Signed-off-by: Zhang Song <zhangsong02@qianxin.com>

commit | commitdiff | tree

Yuval Lifshitz [Thu, 19 Dec 2024 06:18:42 +0000 (08:18 +0200)]

Merge pull request #60788 from oshrey16/tests-rgw-ps-list-topics-op

tests/rgw/notifications: Add tests for RGWPSListTopicsOp::execute()

commit | commitdiff | tree

Kotresh HR [Wed, 23 Oct 2024 19:00:41 +0000 (00:30 +0530)]

client: Fix a deadlock when osd is full

Problem:
When osd is full, the client receives the notification
and cancels the ongoing writes. If the ongoing writes
are async, it could cause a dead lock as the async
callback registered also takes the 'client_lock' which
the handle_osd_map takes at the beginning.

The op_cancel_writes calls the callback registered for
the async write synchronously holding the 'client_lock'
causing the deadlock.

Earlier approach:
  It was tried to solve this issue by calling 'op_cancel_writes'
without holding 'client_lock'. But this failed lock dependency
between objecter's 'rwlock' and async write's callback taking
'client_lock'. The 'client_lock' should always be taken before
taking 'rwlock'. So this approach is dropped against the current
approach.

Solution:
Use C_OnFinisher for objecter async write callback i.e., wrap
the async write's callback using the Finisher. This queues the
callback to the Finisher's context queue which the finisher
thread picks up and executes thus avoiding the deadlock.

Testing:
The fix is tested in the vstart cluster with the following reproducer.
1. Mount the cephfs volume using nfs-ganesha at /mnt
2. Run fio on /mnt on one terminal
3. On the other terminal, blocklist the nfs client session
4. The fio would hang

It is reproducing in the vstart cluster most of the times. I think
that's because it's slow. The same test written for teuthology is
not reproducing the issue. The test expects one or more writes
to be on going in rados when the client is blocklisted for the deadlock
to be hit.

Stripped down version of Traceback:
----------
0  0x00007f4d77274960 in __lll_lock_wait ()
1  0x00007f4d7727aff2 in pthread_mutex_lock@@GLIBC_2.2.5 ()
2  0x00007f4d7491b0a1 in __gthread_mutex_lock (__mutex=0x7f4d200f99b0)
3  std::mutex::lock (this=<optimized out>)
4  std::scoped_lock<std::mutex>::scoped_lock (__m=..., this=<optimized out>, this=<optimized out>, __m=...)
5  Client::C_Lock_Client_Finisher::finish (this=0x7f4ca0103550, r=-28)
6  0x00007f4d74888dfd in Context::complete (this=0x7f4ca0103550, r=<optimized out>)
7  0x00007f4d7498850c in std::__do_visit<...>(...) (__visitor=...)
8  std::visit<Objecter::Op::complete(...) (__visitor=...)
9  Objecter::Op::complete(...) (e=..., e=..., r=-28, ec=..., f=...)
10 Objecter::Op::complete (e=..., r=-28, ec=..., this=0x7f4ca022c7f0)
11 Objecter::op_cancel (this=0x7f4d200fab20, s=<optimized out>, tid=<optimized out>, r=-28)
12 0x00007f4d7498ea12 in Objecter::op_cancel_writes (this=0x7f4d200fab20, r=-28, pool=103)
13 0x00007f4d748e1c8e in Client::_handle_full_flag (this=0x7f4d200f9830, pool=103)
14 0x00007f4d748ed20c in Client::handle_osd_map (m=..., this=0x7f4d200f9830)
15 Client::ms_dispatch2 (this=0x7f4d200f9830, m=...)
16 0x00007f4d75b8add2 in Messenger::ms_deliver_dispatch (m=..., this=0x7f4d200ed3e0)
17 DispatchQueue::entry (this=0x7f4d200ed6f0)
18 0x00007f4d75c27fa1 in DispatchQueue::DispatchThread::entry (this=<optimized out>)
19 0x00007f4d77277c02 in start_thread ()
20 0x00007f4d772fcc40 in clone3 ()
--------

Fixes: https://tracker.ceph.com/issues/68641
Signed-off-by: Kotresh HR <khiremat@redhat.com>

commit | commitdiff | tree

Kotresh HR [Fri, 25 Oct 2024 10:10:42 +0000 (15:40 +0530)]

qa: Add async io test to nfs ganesha using fio

Fixes: https://tracker.ceph.com/issues/69311
Signed-off-by: Kotresh HR <khiremat@redhat.com>

commit | commitdiff | tree

Vallari Agrawal [Thu, 19 Dec 2024 03:49:31 +0000 (09:19 +0530)]

Merge pull request #60431 from VallariAg/fix-nvmeof-ns-test

qa: fix nvmeof scalability and namespace test

commit | commitdiff | tree

Patrick Donnelly [Wed, 18 Dec 2024 14:33:50 +0000 (09:33 -0500)]

Merge PR #61018 into main

* refs/pull/61018/head:
msg/async/Event: make the pipe byte static

Reviewed-by: Patrick Donnelly <pdonnell@ibm.com>
Reviewed-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Oshrey Avraham [Wed, 18 Dec 2024 14:23:40 +0000 (16:23 +0200)]

rgw/notifications: Add tests for RGWPSListTopicsOp::execute()

Tests:
Add comprehensive test cases to verify the behavior of `RGWPSListTopicsOp::execute()` under various scenarios:

    Migration case: Validate correct handling when `support_all_zones` is enabled, with v1 in a new state after migration and v2 topics present.
    v2 notification case: Ensure proper retrieval when v2 notifications are supported.
    v1 notification case: Verify fallback behavior when v2 notifications are unavailable.

Enhancements:
    Update `delete_all_topics` to handle v1 responses with the `result` key.

fixes: https://tracker.ceph.com/issues/68756
Signed-off-by: Oshrey Avraham <oshrey16@gmail.com>

commit | commitdiff | tree

Casey Bodley [Wed, 18 Dec 2024 14:05:50 +0000 (09:05 -0500)]

Merge pull request #60771 from YaZasnyal/fix/complete-abort-mp

rgw/abortmp: Race condition on AbortMultipartUpload

Reviewed-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

J. Eric Ivancich [Wed, 18 Dec 2024 13:56:33 +0000 (08:56 -0500)]

Merge pull request #60676 from awojno-bloomberg/rgw-admin-json

rgw-admin: Add --format option for bucket sync status

Reviewed-by: Adam Emerson <aemerson@redhat.com>

commit | commitdiff | tree

J. Eric Ivancich [Wed, 18 Dec 2024 13:56:08 +0000 (08:56 -0500)]

Merge pull request #60970 from clwluvw/redirect-qs

rgw: append query string to redirect URL if present

Reviewed-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

J. Eric Ivancich [Wed, 18 Dec 2024 13:55:45 +0000 (08:55 -0500)]

Merge pull request #60579 from clwluvw/empty-multidelete

rgw: return MalformedXML for empty objects list in DeleteObjects

Reviewed-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

Patrick Donnelly [Wed, 18 Dec 2024 13:33:56 +0000 (08:33 -0500)]

Merge PR #60217 into main

* refs/pull/60217/head:
CMakeLists.txt: do not compile {Objecter,Striper}.cc twice

Reviewed-by: Patrick Donnelly <pdonnell@ibm.com>

commit | commitdiff | tree

leonidc [Wed, 18 Dec 2024 12:08:11 +0000 (14:08 +0200)]

Merge pull request #61091 from leonidc/wip-leonidc_publish_rebalance_index

nvmeofgw* : fix publishing rebalance index

commit | commitdiff | tree

Ilya Dryomov [Tue, 17 Dec 2024 08:57:44 +0000 (09:57 +0100)]

test/librbd: add TestInternal.FlattenInconsistentObjectMap

Inject an object map with all possible inconsistencies before
flattening to ensure that something similar to commit 40af4f87b64f
("librbd: flatten operation should use object map") doesn't reappear
in a different form.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>

commit | commitdiff | tree

Ilya Dryomov [Wed, 27 Nov 2024 10:11:16 +0000 (11:11 +0100)]

librbd: avoid data corruption on flatten when object map is inconsistent

By making flatten skip copyup in case the object is marked
OBJECT_EXISTS or OBJECT_EXISTS_CLEAN, commit 40af4f87b64f ("librbd:
flatten operation should use object map") introduced a critical
regression.  If the object map becomes inconsistent (e.g. because
flatten gets interrupted by killing "rbd flatten" process or a client
running on the clone crashes after updating the object map but before
writing to the image), the following attempt to flatten would corrupt
the clone if the copyup is actually still needed.

By design, it's impossible to tell whether the object is "known to
exist" based on the object map -- only telling whether the object is
"known to NOT exist" is possible (i.e. only OBJECT_NONEXISTENT state
is reliable).  Negating OBJECT_NONEXISTENT tells that the object "may
exist", not that the object is "known to exist".  This is reflected in
the name of object_may_exist() helper that was introduced together with
the object map implementation.  Something like object_may_not_exist()
simply can't be constructed given the rest of librbd.

This effectively reverts commits 4c86bccf07b8 ("librbd: add
object_may_not_exist helper") and 40af4f87b64f ("librbd: flatten
operation should use object map").

Fixes: https://tracker.ceph.com/issues/68998
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>

commit | commitdiff | tree

Vallari Agrawal [Tue, 19 Nov 2024 11:55:49 +0000 (17:25 +0530)]

monitoring: Add prometheus alert NVMeoFMultipleNamespacesOfRBDImage

NVMeoFMultipleNamespacesOfRBDImage alerts the user if a RBD image
is used for multiple namespaces. This is important alerts for cases
where namespaces are created on same image for different gateway group.

Signed-off-by: Vallari Agrawal <vallari.agrawal@ibm.com>

commit | commitdiff | tree

Alexander Indenbaum [Wed, 18 Dec 2024 10:05:36 +0000 (10:05 +0000)]

src/nvmeof/NVMeofGwMonitorClient: remove MDS client, not needed

Signed-off-by: Alexander Indenbaum <aindenba@redhat.com>

commit | commitdiff | tree

Zac Dover [Wed, 18 Dec 2024 09:25:00 +0000 (19:25 +1000)]

doc/radosgw: edit uadk-accel.rst

Incorporate Anthony D'Atri's suggested changes from
https://github.com/ceph/ceph/pull/60953 into doc/radosgw/uadk-accel.rst.

Two questions from that PR remain unclear to me: one is about whether
IOMMU should be disabled for performance on AMD EPYC systems, and the
other is about UADK. The note about UADK will be rewritten in improved
English in a near-future PR and any remaining technical questions that
involve it can be discussed in that PR.

Signed-off-by: Zac Dover <zac.dover@proton.me>

commit | commitdiff | tree

Zac Dover [Wed, 18 Dec 2024 09:19:17 +0000 (19:19 +1000)]

Merge pull request #60953 from zdover23/wip-doc-2024-12-04-radosgw-uadk-accel

doc/radosgw: edit uadk-accel.rst

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

commit | commitdiff | tree

Yingxin Cheng [Wed, 18 Dec 2024 09:07:53 +0000 (17:07 +0800)]

crimson/os/seastore: move backref_entry_t to a separate header

For reuse.

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Yingxin Cheng [Wed, 18 Dec 2024 09:03:37 +0000 (17:03 +0800)]

crimson/os/seastore/cache: rework backref_batch_update()

Leverage the fact that seq won't go backwards.

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Yingxin Cheng [Wed, 18 Dec 2024 08:56:00 +0000 (16:56 +0800)]

crimson/os/seastore: introduce named factory methods for alloc/retire

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Yingxin Cheng [Wed, 18 Dec 2024 08:47:54 +0000 (16:47 +0800)]

crimson/os/seastore/seastore_types: use is_backref_mapped_type()

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Yingxin Cheng [Wed, 18 Dec 2024 08:42:40 +0000 (16:42 +0800)]

crimson/os/seastore/cache: don't mark allocations as L_ADDR_NULL

Which is used for deallocations.

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Yingxin Cheng [Wed, 18 Dec 2024 08:40:01 +0000 (16:40 +0800)]

crimson/os/seastore: misc related cleanups

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Yingxin Cheng [Wed, 18 Dec 2024 08:23:16 +0000 (16:23 +0800)]

crimson/os/seastore/cache: drop unused backref_entry_t::seq

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Pedro Gonzalez Gomez [Wed, 18 Dec 2024 08:20:40 +0000 (09:20 +0100)]

Merge pull request #60976 from rhcs-dashboard/fix-69144-main

mgr/dashboard: Fix Latency chart data units in rgw overview page

Reviewed-by: Afreen Misbah <afreen@ibm.com>
Reviewed-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>

commit | commitdiff | tree

Ronen Friedman [Wed, 18 Dec 2024 06:42:57 +0000 (08:42 +0200)]

Merge pull request #61115 from ronen-fr/wip-rf-tst-p-replicated

qa/standalone/scrub: fix TEST_periodic_scrub_replicated

Reviewed-by: Laura Flores <lflores@redhat.com>

commit | commitdiff | tree

Ronen Friedman [Wed, 18 Dec 2024 06:39:10 +0000 (08:39 +0200)]

Merge pull request #60863 from ronen-fr/wip-rf-deeperrors

osd/scrub: remove the 'has_deep_errors' scheduling flag

Reviewed-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

J. Eric Ivancich [Wed, 18 Dec 2024 00:30:44 +0000 (19:30 -0500)]

Merge pull request #61122 from ceph/revert-61024-wip_ops-replayer_jfw-signedness-warning

Revert "ops_replayer: fix signed/unsigned warning and add parameter range checking."

Reviewed-by: Laura Flores <lflores@redhat.com>

commit | commitdiff | tree

Casey Bodley [Tue, 17 Dec 2024 21:33:46 +0000 (16:33 -0500)]

Merge pull request #56583 from pritha-srivastava/wip-rgw-del-bucket-tag

rgw: fix to correctly store updated attrs in backend store after erasing an attr/attrs for delete ops on a bucket.

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>
Reviewed-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

Pedro Gonzalez Gomez [Tue, 17 Dec 2024 21:22:00 +0000 (22:22 +0100)]

Merge pull request #60940 from rhcs-dashboard/rgw-user-accounts-api

mgr/dashboard: RGW user accounts CRUD api's

Reviewed-by: Afreen Misbah <afreen@ibm.com>
Reviewed-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>

commit | commitdiff | tree

Casey Bodley [Tue, 17 Dec 2024 18:58:32 +0000 (13:58 -0500)]

Revert "ops_replayer: fix signed/unsigned warning and add parameter range checking."

Signed-off-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

Jesse Williamson [Tue, 17 Dec 2024 18:17:09 +0000 (10:17 -0800)]

Merge pull request #61024 from chardan/wip_ops-replayer_jfw-signedness-warning

ops_replayer: fix signed/unsigned warning and add parameter range checking.

commit | commitdiff | tree

Ali Masarwa [Tue, 17 Dec 2024 18:06:51 +0000 (20:06 +0200)]

Merge pull request #60942 from AliMasarweh/wip-alimasa-bucket-logging-filter

RGW/logging: add filtering for bucket logging

Reviewed-by: yuvalif<ylifshit@redhat.com>

commit | commitdiff | tree

Ali Masarwa [Tue, 17 Dec 2024 15:08:22 +0000 (17:08 +0200)]

Merge pull request #60697 from AliMasarweh/wip-alimasa-bucket-logging-flush

RGW/logging: add rest API to flush logging bucket

Reviewed-by: yuvalif<ylifshit@redhat.com>

commit | commitdiff | tree

Ronen Friedman [Tue, 17 Dec 2024 13:51:48 +0000 (15:51 +0200)]

qa/standalone/scrub: fix TEST_periodic_scrub_replicated

A bogus change introduced as part of PR#54363 (commit
fbb7d73) changed multiple 'scrub' commands to 'scheduled-scrub'.
In this one instance - that was wrong.

Fixes: https://tracker.ceph.com/issues/69276
Signed-off-by: Ronen Friedman <rfriedma@redhat.com>

commit | commitdiff | tree

Milind Changire [Tue, 17 Dec 2024 11:18:24 +0000 (16:48 +0530)]

Merge pull request #59936 from anoopcs9/client-chown-statx-fixes

client: Handle empty pathnames for `ceph_chownat()` and `ceph_statxat()`

commit | commitdiff | tree

Ali Masarwa [Tue, 10 Dec 2024 13:09:50 +0000 (15:09 +0200)]

RGW\logging: convert s3_filter to key_filter

Signed-off-by: Ali Masarwa <amasarwa@redhat.com>

Unnamed repository; edit this file 'description' to name the repository.