]> git.apps.os.sepia.ceph.com Git - ceph.git/log
ceph.git
14 months agomds/quiesce: drop remote authpins before waiting for the quiesce lock 57332/head
Leonid Usov [Thu, 16 May 2024 14:11:19 +0000 (17:11 +0300)]
mds/quiesce: drop remote authpins before waiting for the quiesce lock

Signed-off-by: Leonid Usov <leonid.usov@ibm.com>
Fixes: https://tracker.ceph.com/issues/65802
14 months agoqa/cephfs/test_quiesce: test proper handling of remote authpins
Leonid Usov [Thu, 16 May 2024 13:52:14 +0000 (16:52 +0300)]
qa/cephfs/test_quiesce: test proper handling of remote authpins

When a request is blocked on the quiesce lock, it should release
all remote authpins, especially those that make an inode AUTHPIN_FROZEN

Signed-off-by: Leonid Usov <leonid.usov@ibm.com>
14 months agomds: don't clear `AUTHPIN_FROZEN` until `FROZEN` in rename_prep
Leonid Usov [Sun, 12 May 2024 00:36:32 +0000 (03:36 +0300)]
mds: don't clear `AUTHPIN_FROZEN` until `FROZEN` in rename_prep

Signed-off-by: Leonid Usov <leonid.usov@ibm.com>
14 months agomds: enhance the `lock path` asok command
Leonid Usov [Sat, 11 May 2024 14:00:21 +0000 (17:00 +0300)]
mds: enhance the `lock path` asok command

* when the quiesce lock is taken by this op, don't consider the inode `quiesced`
* drop all locks taken during traversal
* drop all local authpins after the locks are taken
* add --await functionality that will block the command until locks are taken or an error is encountered
* return the RC that represents the operation result. 0 if the operation was scheduled and hasn't failed so far
* add authpin control flags
** --ap-freeze - to auth_pin_freeze the target inode
** --ap-dont-block - to pass auth_pin_nonblocking when acquiring the target inode locks

Signed-off-by: Leonid Usov <leonid.usov@ibm.com>
14 months agomds/quiesce: overdrive fragmenting that's still freezing
Leonid Usov [Thu, 9 May 2024 01:39:12 +0000 (04:39 +0300)]
mds/quiesce: overdrive fragmenting that's still freezing

Quiesce requires revocation of capabilities,
which is not working for a freezing/frozen nodes.
Since it is best effort, abort an ongoing fragmenting
for the sake of a faster quiesce.

Signed-off-by: Leonid Usov <leonid.usov@ibm.com>
Fixes: https://tracker.ceph.com/issues/65716
14 months agorevert: mds: provide a mechanism to authpin while freezing
Leonid Usov [Sun, 12 May 2024 16:19:34 +0000 (19:19 +0300)]
revert: mds: provide a mechanism to authpin while freezing

This is a functional revert of a9964a7ccc4394f923fb0f1c76eb8fa03fe8733d
git revert was giving too many conflicts, as the code has changed
too much since the original commit.

The bypass freezing mechanism lead us into several deadlocks,
and when we found out that a freezing inode defers reclaiming
client caps, we realized that we needed to try a different approach.
This commit removes the bypass freezing related changes to clear way
for a different approach to resolving the conflict between quiesce
and freezing.

Signed-off-by: Leonid Usov <leonid.usov@ibm.com>
14 months agoqa/cephfs/test_quiesce: enhance the fragmentation test
Leonid Usov [Thu, 2 May 2024 20:10:12 +0000 (23:10 +0300)]
qa/cephfs/test_quiesce: enhance the fragmentation test

Repeatedly quiesce under a heavy balancer load

Fixes: https://tracker.ceph.com/issues/65716
Signed-off-by: Leonid Usov <leonid.usov@ibm.com>
14 months agoMerge pull request #57011 from tchaikov/wip-cmake-legacy-option-header
Kefu Chai [Sun, 19 May 2024 13:56:02 +0000 (21:56 +0800)]
Merge pull request #57011 from tchaikov/wip-cmake-legacy-option-header

cmake: link against legacy-option-headers instead of depending on it

Reviewed-by: Casey Bodley <cbodley@redhat.com>
14 months agoMerge pull request #57542 from zdover23/wip-doc-2024-05-18-ceph-fs-volumes-subvolume...
Zac Dover [Sun, 19 May 2024 11:09:06 +0000 (21:09 +1000)]
Merge pull request #57542 from zdover23/wip-doc-2024-05-18-ceph-fs-volumes-subvolume-quiesce-squid-note

doc/cephfs: Squid and later - subvolume quiesce

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
14 months agoMerge pull request #57383 from athanatos/sjust/wip-65867-user-version
Matan Breizman [Sun, 19 May 2024 10:30:28 +0000 (13:30 +0300)]
Merge pull request #57383 from athanatos/sjust/wip-65867-user-version

crimson: fix user_version handling

Reviewed-by: Matan Breizman <mbreizma@redhat.com>
Reviewed-by: Xuehan Xu <xuxuehan@qianxin.com>
14 months agoMerge pull request #56114 from xxhdx1985126/wip-seastore-teuthology-max-test-attr-len
Matan Breizman [Sun, 19 May 2024 10:28:24 +0000 (13:28 +0300)]
Merge pull request #56114 from xxhdx1985126/wip-seastore-teuthology-max-test-attr-len

src/test/TestRados: add max-attr-len to control the max length of attributes sent to OSDs

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Matan Breizman <mbreizma@redhat.com>
14 months agoMerge pull request #55008 from NitzanMordhai/wip-nitzan-clear-data-digest-for-crc...
NitzanMordhai [Sun, 19 May 2024 07:29:44 +0000 (10:29 +0300)]
Merge pull request #55008 from NitzanMordhai/wip-nitzan-clear-data-digest-for-crc-check

osd: full-object read CRC mismatch due to 'truncate' modifying oi.size w/o clearing 'data_digest'

14 months agoceph_test_rados_io_pp: Add cxx test for append zero test 55008/head
NitzanMordhai [Tue, 26 Dec 2023 10:22:19 +0000 (10:22 +0000)]
ceph_test_rados_io_pp: Add cxx test for append zero test

1. adding allow_ec_overwrite option for cxx test
2. adding new test for crc failuer check with append zero length

Fixes: https://tracker.ceph.com/issues/53240
Signed-off-by: Nitzan Mordechai <nmordech@redhat.com>
14 months agotest/pybind: check crc fail after append zero
NitzanMordhai [Tue, 26 Dec 2023 09:02:22 +0000 (09:02 +0000)]
test/pybind: check crc fail after append zero

Add test for zero crc check failed.

Fixes: https://tracker.ceph.com/issues/53240
Signed-off-by: Nitzan Mordechai <nmordech@redhat.com>
14 months agoosd: Adjust code style
jiawd [Wed, 17 Nov 2021 09:28:59 +0000 (09:28 +0000)]
osd: Adjust code style

Fixes: https://tracker.ceph.com/issues/53240
Signed-off-by: jiawd <jiawendong@xtaotech.com>
14 months agoosd: Remove the duplicate oi.clear_data_digest
jiawd [Fri, 12 Nov 2021 04:03:15 +0000 (04:03 +0000)]
osd: Remove the duplicate oi.clear_data_digest

Fixes: https://tracker.ceph.com/issues/53240
Signed-off-by: jiawd <jiawendong@xtaotech.com>
14 months agoosd: full-object read crc is mismatch, because truncate modify oi.size and forget...
jiawd [Fri, 12 Nov 2021 03:48:56 +0000 (03:48 +0000)]
osd: full-object read crc is mismatch, because truncate modify oi.size and forget to clear data_digest

when write before truncate, need trim length, if truncate is to 0,
write is [0~128k], write change to [0~0], do nothing, oi.size is 0, x1 = set_data_digest(crc32(-1)).
write is [128k~128k], write change to [128k~0], truncate oi.size to offset 128k, x2 = set_data_digest(crc32(x1)).
write is [256k~128k], write change to [256k~0], truncate oi.size to offset 256k, x3 = set_data_digest(crc32(x2)).
...
write is [4063232~128k], write change to [4063232~0], truncate oi.size to offset 4063232, xn = set_data_digest(crs32(xn-1))
Now, we can see oi.size is 4063232, and data_digest is 0xffffffff, because thelength of in_data of crc is 0 every time.
when read verify crc will reply EIO. (EC pool).

so, when truncate in write, need clear data_digest and DIGEST flag,
when write before truncate, need to trim length, when offset over than oi.size, don't truncate oi.size to offset.

Fixes: https://tracker.ceph.com/issues/53240
Signed-off-by: jiawd <jiawendong@xtaotech.com>
14 months agodoc/cephfs: Squid and later - subvolume quiesce 57542/head
Zac Dover [Sun, 19 May 2024 00:00:29 +0000 (10:00 +1000)]
doc/cephfs: Squid and later - subvolume quiesce

Add a note to the "Subvolume quiesce" section that says that the
information in the section applies only to the Squid and later releases
of Ceph. This is included here so that I don't overwrite the Reef and
Quincy documentation with irrelevant information, and so that I don't
overwrite the Squid information with blank space where the "Subvolume
quiesce" section should be.

Signed-off-by: Zac Dover <zac.dover@proton.me>
14 months agoMerge pull request #57433 from idryomov/wip-65813
Ilya Dryomov [Sat, 18 May 2024 20:30:29 +0000 (22:30 +0200)]
Merge pull request #57433 from idryomov/wip-65813

librbd: don't crash on a zero-length read if buffer is NULL

Reviewed-by: Ramana Raja <rraja@redhat.com>
14 months agoMerge pull request #56310 from ajarr/wip-64800
Ilya Dryomov [Sat, 18 May 2024 20:28:56 +0000 (22:28 +0200)]
Merge pull request #56310 from ajarr/wip-64800

librbd: create rbd_trash object during pool initialization and namespace creation

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
14 months agoMerge pull request #57534 from zdover23/wip-doc-2024-05-17-ceph-fs-volumes-2-of-x
Zac Dover [Sat, 18 May 2024 11:27:45 +0000 (21:27 +1000)]
Merge pull request #57534 from zdover23/wip-doc-2024-05-17-ceph-fs-volumes-2-of-x

doc/cephfs: edit fs-volumes.rst (2 of x)

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
14 months agoMerge pull request #57528 from rishabh-d-dave/fix-test_rootsquash_nofeature
Rishabh Dave [Fri, 17 May 2024 12:26:22 +0000 (17:56 +0530)]
Merge pull request #57528 from rishabh-d-dave/fix-test_rootsquash_nofeature

qa/cephfs: add MDS_CLIENTS_BROKEN_ROOTSQUASH to ignorelist

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
14 months agoMerge pull request #57496 from rishabh-d-dave/block-test_idem_unaffected_root_squash
Rishabh Dave [Fri, 17 May 2024 12:26:01 +0000 (17:56 +0530)]
Merge pull request #57496 from rishabh-d-dave/block-test_idem_unaffected_root_squash

qa/cephfs: block buggy tests in test_admin.py

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
14 months agoMerge pull request #57333 from rishabh-d-dave/fs-fail-cache-kclient
Rishabh Dave [Fri, 17 May 2024 12:25:44 +0000 (17:55 +0530)]
Merge pull request #57333 from rishabh-d-dave/fs-fail-cache-kclient

qa/cephfs: set joinable on FS before exiting tests in TestFSFail

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
14 months agodoc/cephfs: edit fs-volumes.rst (2 of x) 57534/head
Zac Dover [Fri, 17 May 2024 10:46:28 +0000 (20:46 +1000)]
doc/cephfs: edit fs-volumes.rst (2 of x)

Edit doc/cephfs/fs-volumes to the section "Cloning Snapshots" (but not
including the section "Cloning Snapshots".

Follows https://github.com/ceph/ceph/pull/57415

Signed-off-by: Zac Dover <zac.dover@proton.me>
14 months agoMerge pull request #57497 from rhcs-dashboard/fix-66058-main
Aashish Sharma [Fri, 17 May 2024 09:07:10 +0000 (14:37 +0530)]
Merge pull request #57497 from rhcs-dashboard/fix-66058-main

mgr/dashboard: Fix host count per cluster and total hosts count on multi-cluster overview page

Reviewed-by: Nizamudeen A <nia@redhat.com>
14 months agoMerge pull request #57498 from idryomov/wip-link-to-mon-osd-blocklist-default-expire
Ilya Dryomov [Fri, 17 May 2024 07:53:09 +0000 (09:53 +0200)]
Merge pull request #57498 from idryomov/wip-link-to-mon-osd-blocklist-default-expire

common/options: link to mon_osd_blocklist_default_expire from RBD

Reviewed-by: Prasanna Kumar Kalever <prasanna.kalever@redhat.com>
Reviewed-by: N Balachandran <nibalach@redhat.com>
14 months agoMerge pull request #52560 from petrutlucian94/rbd_service_restart_test
Ilya Dryomov [Fri, 17 May 2024 07:09:38 +0000 (09:09 +0200)]
Merge pull request #52560 from petrutlucian94/rbd_service_restart_test

qa: add ceph-rbd windows service restart test

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
14 months agoMerge pull request #57253 from rhcs-dashboard/core_events_migration_for_c9
Nizamudeen A [Fri, 17 May 2024 06:31:02 +0000 (12:01 +0530)]
Merge pull request #57253 from rhcs-dashboard/core_events_migration_for_c9

mgr/k8sevents: update V1Events to CoreV1Events

Reviewed-by: Dan Mick <dmick@redhat.com>
Reviewed-by: Laura Flores <lflores@redhat.com>
Reviewed-by: Redouane Kachach <rkachach@redhat.com>
14 months agomgr/dashboard: Fix host count per cluster and total hosts count on multi-cluster... 57497/head
Aashish Sharma [Thu, 16 May 2024 08:26:34 +0000 (13:56 +0530)]
mgr/dashboard: Fix host count per cluster and total hosts count on multi-cluster overview page

Fixes: https://tracker.ceph.com/issues/66058
Signed-off-by: Aashish Sharma <aasharma@redhat.com>
14 months agoqa/cephfs: block buggy tests in test_admin.py 57496/head
Rishabh Dave [Thu, 16 May 2024 07:00:49 +0000 (12:30 +0530)]
qa/cephfs: block buggy tests in test_admin.py

Block test_idem_unaffected_root_squash temporarily and
test_multifs_single_path_rootsquash.

This test fails due to a known bug. Block it temporarily so that
test_admin.py can run fully and PRs under QA can be tested fully.
Otherwise, this test fails and that halts test_admin.py, which leaves
the PR partially untested.

This failure is then seen as an unrelated failure which lets the buggy
code get merged. This has happened recently.

Signed-off-by: Rishabh Dave <ridave@redhat.com>
14 months agoMerge pull request #57474 from xxhdx1985126/wip-seastore-rewrite-bug-fix
Yingxin [Fri, 17 May 2024 02:36:22 +0000 (10:36 +0800)]
Merge pull request #57474 from xxhdx1985126/wip-seastore-rewrite-bug-fix

crimson/os/seastore/transaction_manager: correct the offset of the data copied from the original extents

Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
14 months agoMerge PR #57302 into main
Patrick Donnelly [Fri, 17 May 2024 01:12:51 +0000 (21:12 -0400)]
Merge PR #57302 into main

* refs/pull/57302/head:
qa/tasks/quiescer: dump ops in parallel

Reviewed-by: Leonid Usov <leonid.usov@ibm.com>
14 months agoMerge pull request #57260 from tchaikov/test_back_trace-with-asan
Kefu Chai [Fri, 17 May 2024 00:46:33 +0000 (08:46 +0800)]
Merge pull request #57260 from tchaikov/test_back_trace-with-asan

test/common/test_back_trace: skip one more frame when testing w/ ASan

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
14 months agoMerge pull request #57429 from tchaikov/dencoder-unregister
Kefu Chai [Fri, 17 May 2024 00:41:59 +0000 (08:41 +0800)]
Merge pull request #57429 from tchaikov/dencoder-unregister

tools/ceph-dencoder: unregister dencoders in dtor

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
14 months agoqa/cephfs: add MDS_CLIENTS_BROKEN_ROOTSQUASH to ignorelist 57528/head
Rishabh Dave [Thu, 16 May 2024 16:30:01 +0000 (22:00 +0530)]
qa/cephfs: add MDS_CLIENTS_BROKEN_ROOTSQUASH to ignorelist

MDS_CLIENTS_BROKEN_ROOTSQUASH is generated and expected by
test_rootsquash_nofeature but it hasn't be added to ignorelist as a
result of which QA code marks the job as failed even though all tests
finished running successfully.

Introduced-by: bccc8ceb471c441ec04d7eb2c353630f8c5ce843
Fixes: https://tracker.ceph.com/issues/66075
Signed-off-by: Rishabh Dave <ridave@redhat.com>
14 months agoqa/cephfs: set joinable on FS before exiting tests in TestFSFail 57333/head
Rishabh Dave [Tue, 7 May 2024 14:50:55 +0000 (20:20 +0530)]
qa/cephfs: set joinable on FS before exiting tests in TestFSFail

After running TestFSFail, CephFSTestCase.tearDown() fails attempting
to unmount CephFS. Set joinable on FS and wait for the MDS to be up
before exiting the test. This will ensure that unmounting is
successful in teardown.

Fixes: https://tracker.ceph.com/issues/65841
Signed-off-by: Rishabh Dave <ridave@redhat.com>
14 months agoqa/tasks/quiescer: dump ops in parallel 57302/head
Patrick Donnelly [Mon, 6 May 2024 18:02:02 +0000 (14:02 -0400)]
qa/tasks/quiescer: dump ops in parallel

Since this --flags=locks takes the mds_lock and dumps thousands of ops, this
may take a long time to complete for each individual MDS. The entire quiesce
set may timeout (and all q ops killed) before we finish dumping ops.

Fixes: https://tracker.ceph.com/issues/65823
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
14 months agoMerge pull request #56640 from NitzanMordhai/wip-nitzan-public-addr-check-subnet...
Yuri Weinstein [Thu, 16 May 2024 15:12:51 +0000 (08:12 -0700)]
Merge pull request #56640 from NitzanMordhai/wip-nitzan-public-addr-check-subnet-unreachable

common/pick_address: check if address in subnet all public address

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
Reviewed-by: Prashant D <pdhange@redhat.com>
14 months agoMerge pull request #57479 from ktdreyer/wip-66016-qatlib-arches
Casey Bodley [Thu, 16 May 2024 14:15:23 +0000 (15:15 +0100)]
Merge pull request #57479 from ktdreyer/wip-66016-qatlib-arches

cmake: disable WITH_QATLIB/ZIP on non-x86

Reviewed-by: Casey Bodley <cbodley@redhat.com>
14 months agoMerge pull request #56743 from NitzanMordhai/wip-nitzan-backword-forword-dencoder...
NitzanMordhai [Thu, 16 May 2024 12:40:11 +0000 (15:40 +0300)]
Merge pull request #56743 from NitzanMordhai/wip-nitzan-backword-forword-dencoder-tests

suites: adding dencoder test multi versions

14 months agocommon/options: link to mon_osd_blocklist_default_expire from RBD 57498/head
Ilya Dryomov [Thu, 16 May 2024 10:40:58 +0000 (12:40 +0200)]
common/options: link to mon_osd_blocklist_default_expire from RBD

"number of seconds to blocklist - set to 0 for OSD default" in the
description of rbd_blocklist_expire_seconds refers to the value that is
controlled by mon_osd_blocklist_default_expire.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
14 months agoMerge PR #56944 into main
Venky Shankar [Thu, 16 May 2024 09:22:13 +0000 (14:52 +0530)]
Merge PR #56944 into main

* refs/pull/56944/head:
qa: add a YAML to ignore MGR_DOWN warning

Reviewed-by: Venky Shankar <vshankar@redhat.com>
14 months agoMerge pull request #55735 from xxhdx1985126/wip-crimson-errorator-void-handler
Yingxin [Thu, 16 May 2024 07:57:48 +0000 (15:57 +0800)]
Merge pull request #55735 from xxhdx1985126/wip-crimson-errorator-void-handler

crimson/common/errorator: disallow void-returning error handlers

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
Reviewed-by: Chunmei Liu <chunmei.liu@intel.com>
14 months agoMerge pull request #57432 from cyx1231st/wip-seastore-cleanup-objdata-read
Yingxin [Thu, 16 May 2024 06:18:46 +0000 (14:18 +0800)]
Merge pull request #57432 from cyx1231st/wip-seastore-cleanup-objdata-read

crimson/os/seastore/object_data_handler: cleanup read()

Reviewed-by: Myoungwon Oh <myoungwon.oh@samsung.com>
Reviewed-by: Xuehan Xu <xuxuehan@qianxin.com>
14 months agoMerge PR #56941 into main
Venky Shankar [Thu, 16 May 2024 05:59:17 +0000 (11:29 +0530)]
Merge PR #56941 into main

* refs/pull/56941/head:
mds: find a new head for the batch ops when the head is dead

Reviewed-by: Kotresh Hiremath Ravishankar <khiremat@redhat.com>
Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Dhairya Parmar <dparmar@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
14 months agoMerge PR #57275 into main
Venky Shankar [Thu, 16 May 2024 05:56:45 +0000 (11:26 +0530)]
Merge PR #57275 into main

* refs/pull/57275/head:
qa/fsx: use a specified sha1 to build the xfstest-dev

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Leonid Usov <leonid.usov@ibm.com>
14 months agosuites: adding dencoder test multi versions 56743/head
nmordech@redhat.com [Wed, 3 Apr 2024 07:02:15 +0000 (07:02 +0000)]
suites: adding dencoder test multi versions

We are currently conducting regular ceph-dencoder tests for backward compatibility.
However, we are omitting tests for forward compatibility.
This suite will introduce tests against the ceph-objects-corpus to address forward
compatibility issues that may arise.
the script will install N-2 version and run against the latest version corpus objects
that we have, then install N-1 to N version and check them as well.

Signed-off-by: Nitzan Mordechai <nmordech@redhat.com>
14 months agoMerge PR #57454 into main
Patrick Donnelly [Thu, 16 May 2024 03:01:16 +0000 (23:01 -0400)]
Merge PR #57454 into main

* refs/pull/57454/head:
mds/quiesce-db: optimize peer updates
mds/quiesce-db: track db epoch separately from the membership epoch
mds/quiesce-db: test that a peer on a newer membership epoch can ack a root

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
14 months agoMerge PR #57274 into main
Patrick Donnelly [Thu, 16 May 2024 02:56:38 +0000 (22:56 -0400)]
Merge PR #57274 into main

* refs/pull/57274/head:
mds: don't stall the asok thread for flush commands
qa/quiescer: relax some timing requirements in the quiescer

Reviewed-by: Xiubo Li <xiubli@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Venky Shankar <vshankar@redhat.com>
14 months agoMerge PR #57329 into main
Patrick Donnelly [Thu, 16 May 2024 02:56:11 +0000 (22:56 -0400)]
Merge PR #57329 into main

* refs/pull/57329/head:
qa: unmount clients before damaging the fs

Reviewed-by: Kotresh Hiremath Ravishankar <khiremat@redhat.com>
Reviewed-by: Venky Shankar <vshankar@redhat.com>
14 months agoMerge PR #57334 into main
Patrick Donnelly [Thu, 16 May 2024 02:55:19 +0000 (22:55 -0400)]
Merge PR #57334 into main

* refs/pull/57334/head:
mds: remove erroneous debug message

Reviewed-by: Leonid Usov <leonid.usov@ibm.com>
14 months agoMerge pull request #57476 from cyx1231st/wip-seastore-drop-unused
Yingxin [Thu, 16 May 2024 01:34:52 +0000 (09:34 +0800)]
Merge pull request #57476 from cyx1231st/wip-seastore-drop-unused

crimson/os/seastore/transaction_manager: drop unused code

Reviewed-by: Xuehan Xu <xuxuehan@qianxin.com>
14 months agoMerge pull request #55985 from Svelar/make-dist
Laura Flores [Wed, 15 May 2024 22:08:28 +0000 (17:08 -0500)]
Merge pull request #55985 from Svelar/make-dist

make-dist: remove old cruft recursively

14 months agocommon/options: make 'rbd_validate_pool' a dev option 56310/head
Ramana Raja [Tue, 9 Apr 2024 16:16:57 +0000 (12:16 -0400)]
common/options: make 'rbd_validate_pool' a dev option

'rbd_validate_pool' option is for testing purposes only. So change
its level appropriately to 'dev'.

Signed-off-by: Ramana Raja <rraja@redhat.com>
14 months agolibrbd/image: create rbd_trash object during RBD pool initialization
Ramana Raja [Tue, 30 Apr 2024 17:56:12 +0000 (13:56 -0400)]
librbd/image: create rbd_trash object during RBD pool initialization

... and RBD namespace creation.

It was not possible to remove a RBD image when OSDs were full and the
'rbd_trash' object was not already created in the image's pool or pool
namespace. The 'rbd_trash' object was created in a pool or namespace
during the first instance of image removal from the pool or namespace.
If no images were ever removed from a RBD pool or namespace and the
OSDs became full, removal of images using the CLI failed. The failure
occured when trying to move the images to trash since the 'rbd_trash'
object was missing in the pool or namespace.

Fix this issue by creating the rbd_trash object in a pool when
initalizing the pool as a RBD pool and when creating a RBD namespace.

Fixes: https://tracker.ceph.com/issues/64800
Signed-off-by: Ramana Raja <rraja@redhat.com>
14 months agoMerge pull request #56349 from rhcs-dashboard/add-landing-page-upgrade-notification
Pedro Gonzalez Gomez [Wed, 15 May 2024 18:23:46 +0000 (20:23 +0200)]
Merge pull request #56349 from rhcs-dashboard/add-landing-page-upgrade-notification

mgr/dashboard: add upgrade notification

Reviewed-by: Ankush Behl <cloudbehl@gmail.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>
14 months agoqa: update rbd-wnbd test, using MBR instead of GPT 52560/head
Lucian Petrut [Wed, 2 Aug 2023 13:08:01 +0000 (13:08 +0000)]
qa: update rbd-wnbd test, using MBR instead of GPT

We're getting the following error while initializing 64MB disks
on WS 2019: "The disk is not large enough to support a GPT
partition style.".

For this reason, we'll use MBR instead.

Signed-off-by: Lucian Petrut <lpetrut@cloudbasesolutions.com>
14 months agoqa: add ceph-rbd windows service restart test
Lucian Petrut [Thu, 12 Jan 2023 10:55:06 +0000 (12:55 +0200)]
qa: add ceph-rbd windows service restart test

We're adding a test that:

* maps a configurable number of images
* runs a specified test - we're reusing the ones from stress_test,
  making just a few minor changes to allow running the same test
  multiple times
* restarts the ceph-rbd Windows service
* waits for the images to be reconnected and refreshes the mount
  information
* reruns the test
* repeats the above workflow for a specified number of times,
  reusing the same images

This test ensures that:

* mounted images are still available after a service restart
* drive letters are retained
* the image content is retained
* there are no race conditions when connecting or disconnecting
  a large number of images in parallel
* the driver is capable of mapping a specified number of images
  simultaneously

Signed-off-by: Lucian Petrut <lpetrut@cloudbasesolutions.com>
14 months agoMerge PR #57478 into main
Patrick Donnelly [Wed, 15 May 2024 16:14:15 +0000 (12:14 -0400)]
Merge PR #57478 into main

* refs/pull/57478/head:
.github: remove description for action

Reviewed-by: John Mulligan <jmulligan@redhat.com>
14 months agoMerge pull request #57137 from sajibreadd/wip-65228
Yuri Weinstein [Wed, 15 May 2024 15:14:57 +0000 (08:14 -0700)]
Merge pull request #57137 from sajibreadd/wip-65228

osd: CEPH_OSD_OP_FLAG_BYPASS_CLEAN_CACHE flag is passed from ECBackend

Reviewed-by: Igor Fedotov <ifedotov@suse.com>
Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
14 months agoMerge pull request #56804 from sajibreadd/seg-fault-osd-shutdown-64373
Yuri Weinstein [Wed, 15 May 2024 15:14:06 +0000 (08:14 -0700)]
Merge pull request #56804 from sajibreadd/seg-fault-osd-shutdown-64373

osd: fix for segmentation fault on OSD fast shutdown

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
Reviewed-by: Igor Fedotov <ifedotov@suse.com>
14 months agoMerge pull request #55298 from ifed01/wip-ifed-fsck-stats
Yuri Weinstein [Wed, 15 May 2024 15:12:04 +0000 (08:12 -0700)]
Merge pull request #55298 from ifed01/wip-ifed-fsck-stats

os/bluestore: show some additional osd/pool stats on fsck completion.

Reviewed-by: Pere Diaz Bou <pdiazbou@redhat.com>
14 months ago.github: remove description for action 57478/head
Patrick Donnelly [Wed, 15 May 2024 13:29:28 +0000 (09:29 -0400)]
.github: remove description for action

It is rejected by github's parser.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
14 months agoMerge pull request #57099 from ceph/wip-lusov-quiesce-await-eperm
Leonid S. Usov [Wed, 15 May 2024 11:35:27 +0000 (14:35 +0300)]
Merge pull request #57099 from ceph/wip-lusov-quiesce-await-eperm

mds/quiesce: db: quiesce-await should EPERM if a set is past QS_QUIESCED
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
14 months agoMerge pull request #56882 from guits/cv-65477
Guillaume Abrioux [Wed, 15 May 2024 08:36:58 +0000 (10:36 +0200)]
Merge pull request #56882 from guits/cv-65477

ceph-volume: create LVs when using partitions

14 months agoMerge pull request #57138 from guits/node-proxy-improvements
Guillaume Abrioux [Wed, 15 May 2024 08:36:16 +0000 (10:36 +0200)]
Merge pull request #57138 from guits/node-proxy-improvements

node-proxy: make the daemon discover endpoints

14 months agocrimson/os/seastore/transaction_manager: drop unused code 57476/head
Yingxin Cheng [Wed, 15 May 2024 07:59:04 +0000 (15:59 +0800)]
crimson/os/seastore/transaction_manager: drop unused code

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>
14 months agocrimson/os/seastore: set last_committed_crc for rewritten extents 57474/head
Xuehan Xu [Wed, 15 May 2024 07:58:24 +0000 (15:58 +0800)]
crimson/os/seastore: set last_committed_crc for rewritten extents

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
14 months agocrimson/os/seastore/transaction_manager: correct the offset of the data
Xuehan Xu [Wed, 15 May 2024 07:31:14 +0000 (15:31 +0800)]
crimson/os/seastore/transaction_manager: correct the offset of the data
copied from the original extents

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
14 months agoMerge pull request #57164 from guits/bump-promtail
Guillaume Abrioux [Wed, 15 May 2024 07:13:38 +0000 (09:13 +0200)]
Merge pull request #57164 from guits/bump-promtail

cephadm: change loki/promtail default image tags

14 months agoMerge pull request #57000 from chenyuanrun/fix-mkdir-p
Guillaume Abrioux [Wed, 15 May 2024 07:12:33 +0000 (09:12 +0200)]
Merge pull request #57000 from chenyuanrun/fix-mkdir-p

ceph-volume: use os.makedirs for mkdir_p

14 months agoqa: reorganize Windows python test
Lucian Petrut [Tue, 10 Jan 2023 14:50:04 +0000 (16:50 +0200)]
qa: reorganize Windows python test

We're splitting the rbd-wnbd python test into separate files so
that the common code may easily be reused by other tests. This
also makes the code easier to read and maintain.

Signed-off-by: Lucian Petrut <lpetrut@cloudbasesolutions.com>
14 months agomgr/k8sevents: update V1Events to CoreV1Events 57253/head
Nizamudeen A [Fri, 3 May 2024 08:56:19 +0000 (14:26 +0530)]
mgr/k8sevents: update V1Events to CoreV1Events

centos9 only provides kubernetes 26.1.0 as base dep and hence the
k8sevents code needs to be updated accordingly. the api changes happened
in kuberenetes while 19.0.0 was released

Fixes: https://tracker.ceph.com/issues/65627
Fixes: https://tracker.ceph.com/issues/64981
Signed-off-by: Nizamudeen A <nia@redhat.com>
14 months agoMerge PR #57453 into main
Patrick Donnelly [Wed, 15 May 2024 00:19:30 +0000 (20:19 -0400)]
Merge PR #57453 into main

* refs/pull/57453/head:
doc: add status badge for backport creation
.github: use shorter name for backport tracker action
.github: document where runs/output can be examined

Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
14 months agoMerge pull request #57391 from soumyakoduri/wip-skoduri-sync-policy
Casey Bodley [Tue, 14 May 2024 21:09:07 +0000 (22:09 +0100)]
Merge pull request #57391 from soumyakoduri/wip-skoduri-sync-policy

Add new sync-policy related params to boto3 extension

Reviewed-by: Casey Bodley <cbodley@redhat.com>
14 months agocmake: disable WITH_QATLIB/ZIP on non-x86 57479/head
Ken Dreyer [Tue, 14 May 2024 18:53:51 +0000 (14:53 -0400)]
cmake: disable WITH_QATLIB/ZIP on non-x86

This feature is only relevant to x86 hosts.

Signed-off-by: Ken Dreyer <kdreyer@ibm.com>
Fixes: https://tracker.ceph.com/issues/66016
Signed-off-by: Ken Dreyer <kdreyer@ibm.com>
14 months agomgr/dashboard: add upgrade notification 56349/head
Pedro Gonzalez Gomez [Wed, 20 Mar 2024 17:47:10 +0000 (18:47 +0100)]
mgr/dashboard: add upgrade notification

- Adds upgradable component to manage upgrades from outside upgrade component.
- Cleans a couple subscriptions that were not being destroyed

Fixes: https://tracker.ceph.com/issues/65012
Signed-off-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>
14 months agoMerge pull request #53859 from rhcs-dashboard/add-pseudo-validation
Avan [Tue, 14 May 2024 13:42:21 +0000 (19:12 +0530)]
Merge pull request #53859 from rhcs-dashboard/add-pseudo-validation

mgr/dashboard: add absolute path validation for pseudo path of nfs export

Reviewed-by: Aashish Sharma <aasharma@redhat.com>
Reviewed-by: afreen23 <NOT@FOUND>
Reviewed-by: Ankush Behl <cloudbehl@gmail.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>
14 months agodoc: add status badge for backport creation 57453/head
Patrick Donnelly [Mon, 13 May 2024 18:14:27 +0000 (14:14 -0400)]
doc: add status badge for backport creation

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
14 months ago.github: use shorter name for backport tracker action
Patrick Donnelly [Tue, 14 May 2024 12:19:06 +0000 (08:19 -0400)]
.github: use shorter name for backport tracker action

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
14 months ago.github: document where runs/output can be examined
Patrick Donnelly [Mon, 13 May 2024 17:02:35 +0000 (13:02 -0400)]
.github: document where runs/output can be examined

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
14 months agoMerge pull request #57368 from myoungwon/wip-seastore-is-data-stable
Yingxin [Tue, 14 May 2024 08:38:25 +0000 (16:38 +0800)]
Merge pull request #57368 from myoungwon/wip-seastore-is-data-stable

crimson/os/seastore: add is_data_stable() to allow delta-overwrite on EXIST_CLEAN

Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
14 months agocrimson/os/seastore/object_data_handler: adjust code indention of read() 57432/head
Yingxin Cheng [Mon, 13 May 2024 07:05:49 +0000 (15:05 +0800)]
crimson/os/seastore/object_data_handler: adjust code indention of read()

To make it easier to read, make sure there are 2 spaces per nesting.

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>
14 months agocrimson/os/seastore/object_data_handler: cleanup read() and logs
Yingxin Cheng [Mon, 13 May 2024 06:52:43 +0000 (14:52 +0800)]
crimson/os/seastore/object_data_handler: cleanup read() and logs

Specifically, refactor based on read range, pin range and extent range.

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>
14 months agocrimson/os/seastore/object_data_handler: adjust l_current asserts for read()
Yingxin Cheng [Mon, 13 May 2024 05:37:34 +0000 (13:37 +0800)]
crimson/os/seastore/object_data_handler: adjust l_current asserts for read()

l_current is only allowed to be larger than pin_key for the first pin.

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>
14 months agocrimson/os/seastore/object_data_handler: cleanup and renames
Yingxin Cheng [Mon, 13 May 2024 02:36:42 +0000 (10:36 +0800)]
crimson/os/seastore/object_data_handler: cleanup and renames

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>
14 months agoMerge pull request #57444 from idryomov/wip-51845
Ilya Dryomov [Tue, 14 May 2024 08:06:01 +0000 (10:06 +0200)]
Merge pull request #57444 from idryomov/wip-51845

qa/suites/krbd: drop pre-single-major and move "layering only" coverage

Reviewed-by: Ramana Raja <rraja@redhat.com>
14 months agoMerge pull request #57434 from cyx1231st/wip-fix-seastore-capture
Yingxin [Tue, 14 May 2024 01:33:09 +0000 (09:33 +0800)]
Merge pull request #57434 from cyx1231st/wip-fix-seastore-capture

crimson/os/seastore/record_scanner: replace [=] capturing

Reviewed-by: Myoungwon Oh <myoungwon.oh@samsung.com>
14 months agocrimson/os/seastore: add comments to add a note that seastore_obj_data_write_amplific... 57368/head
myoungwon oh [Mon, 13 May 2024 04:21:26 +0000 (04:21 +0000)]
crimson/os/seastore: add comments to add a note that seastore_obj_data_write_amplification needs to be reconsidered

Signed-off-by: Myoungwon Oh <myoungwon.oh@samsung.com>
14 months agocrimson/os/seastore: replace is_left_stable/is_right_stable with is_left_fresh/is_rig...
myoungwon oh [Mon, 13 May 2024 04:16:46 +0000 (04:16 +0000)]
crimson/os/seastore: replace is_left_stable/is_right_stable with is_left_fresh/is_right_fresh

Signed-off-by: Myoungwon Oh <myoungwon.oh@samsung.com>
14 months agoMerge pull request #55488 from cyx1231st/wip-crimson-load-obc
Samuel Just [Tue, 14 May 2024 00:45:26 +0000 (17:45 -0700)]
Merge pull request #55488 from cyx1231st/wip-crimson-load-obc

crimson/osd/osd_operations/client_request: make loading-obc concurrent

Reviewed-by: Matan Breizman <mbreizma@redhat.com>
Reviewed-by: Samuel Just <sjust@redhat.com>
14 months agomds/quiesce-db: optimize peer updates 57454/head
Leonid Usov [Mon, 13 May 2024 22:40:21 +0000 (01:40 +0300)]
mds/quiesce-db: optimize peer updates

Prevent sending of the same version to the same peer more than once a second

Signed-off-by: Leonid Usov <leonid.usov@ibm.com>
14 months agomds/quiesce-db: track db epoch separately from the membership epoch
Leonid Usov [Mon, 13 May 2024 21:10:04 +0000 (00:10 +0300)]
mds/quiesce-db: track db epoch separately from the membership epoch

Tracking the db epoch separately will make sure that replicas
only follow leader's epoch choice, even if they are already on
the new membership epoch. This eliminates races due to the
random order of mdsmap updates.

Fixes: https://tracker.ceph.com/issues/65977
Signed-off-by: Leonid Usov <leonid.usov@ibm.com>
14 months agomds/quiesce-db: test that a peer on a newer membership epoch can ack a root
Leonid Usov [Mon, 13 May 2024 15:14:32 +0000 (18:14 +0300)]
mds/quiesce-db: test that a peer on a newer membership epoch can ack a root

Signed-off-by: Leonid Usov <leonid.usov@ibm.com>
14 months agoMerge pull request #57356 from soumyakoduri/wip-skoduri-lc-notify
Casey Bodley [Mon, 13 May 2024 17:45:33 +0000 (18:45 +0100)]
Merge pull request #57356 from soumyakoduri/wip-skoduri-lc-notify

rgw/cloud-transition: fix the crash with publish_commit

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>
Reviewed-by: Casey Bodley <cbodley@redhat.com>
14 months agoMerge pull request #56574 from badone/wip-tracker-59196-lazy-omap-regex-segfault
Laura Flores [Mon, 13 May 2024 17:05:53 +0000 (12:05 -0500)]
Merge pull request #56574 from badone/wip-tracker-59196-lazy-omap-regex-segfault

test/lazy-omap-stats: Convert to boost::regex

14 months agoAdd new sync-policy related params to boto3 extension 57391/head
Soumya Koduri [Fri, 10 May 2024 11:34:37 +0000 (17:04 +0530)]
Add new sync-policy related params to boto3 extension

Signed-off-by: Soumya Koduri <skoduri@redhat.com>
14 months agoMerge pull request #56979 from yuvalif/wip-yuval-65337
Yuval Lifshitz [Mon, 13 May 2024 14:25:27 +0000 (17:25 +0300)]
Merge pull request #56979 from yuvalif/wip-yuval-65337

rgw/notification: start/stop endpoint managers in notification manager

Reviewed-By: cbodley@ibm.com , kchheda3@bloomberg.net
14 months agoqa/suites/krbd: rename no-object-map to no-exclusive-lock 57444/head
Ilya Dryomov [Mon, 13 May 2024 11:45:26 +0000 (13:45 +0200)]
qa/suites/krbd: rename no-object-map to no-exclusive-lock

Exclusive lock has always been disabled by this facet, so it might as
well be reflected in its name.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>