git.apps.os.sepia.ceph.com Git - ceph.git/log

]> git.apps.os.sepia.ceph.com Git - ceph.git/log

projects / ceph.git / log

summary | shortlog | log | commit | commitdiff | tree
first ⋅ prev ⋅ next

commit | commitdiff | tree

Patrick Donnelly [Fri, 3 May 2024 00:49:22 +0000 (20:49 -0400)]

mon/MDSMonitor: add note about missing metadata inclusion

There is a "client_count" metadata on the health warning that apparently was
intended to be used for aggregating warnings but never was. Add a TODO item for
that.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>

commit | commitdiff | tree

Patrick Donnelly [Wed, 1 May 2024 01:41:14 +0000 (21:41 -0400)]

mds: check relevant caps for fs include root_squash

When denying client reconnects because the MDS caps include root_squash and the
client features do not include CEPHFS_FEATURE_MDS_AUTH_CAPS_CHECK, ensure those
caps are only for the file system the MDS is joined to.

Fixes: https://tracker.ceph.com/issues/65733
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>

commit | commitdiff | tree

Patrick Donnelly [Thu, 2 May 2024 12:55:36 +0000 (08:55 -0400)]

mds: refactor out fs_name match in MDSAuthCaps

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>

commit | commitdiff | tree

Patrick Donnelly [Thu, 2 May 2024 01:08:57 +0000 (21:08 -0400)]

qa: test for root_squash with multiple caps

Where the client has root_squash for one cap but not for another. The fs
without root_squash should not necessarily reject the client.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>

commit | commitdiff | tree

Patrick Donnelly [Thu, 2 May 2024 02:06:54 +0000 (22:06 -0400)]

qa: pass kwargs to mount from remount

So we can pass mntargs.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>

commit | commitdiff | tree

Patrick Donnelly [Thu, 2 May 2024 02:04:57 +0000 (22:04 -0400)]

qa: simplify update_attrs and only update relevant keys

So we can just pass the caller's kwargs to update_attrs.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>

commit | commitdiff | tree

Patrick Donnelly [Thu, 2 May 2024 00:51:59 +0000 (20:51 -0400)]

client: allow overriding client features

For testing purposes.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>

commit | commitdiff | tree

Casey Bodley [Mon, 6 May 2024 17:18:56 +0000 (18:18 +0100)]

Merge pull request #57257 from cbodley/wip-65746

rgw: fix CompleteMultipart error handling regression

Reviewed-by: Ali Masarwa <amasarwa@redhat.com>
Reviewed-by: Matt Benjamin <mbenjamin@redhat.com>

commit | commitdiff | tree

Yuval Lifshitz [Mon, 6 May 2024 16:34:53 +0000 (19:34 +0300)]

Merge pull request #57256 from yuvalif/wip-yuval-failed-test-hang

test/rgw/notification: prevent failed http tests from hanging

commit | commitdiff | tree

Ilya Dryomov [Mon, 6 May 2024 15:43:57 +0000 (17:43 +0200)]

Merge pull request #57212 from idryomov/wip-65624

pybind/rbd: expose CLONE_FORMAT and FLATTEN image options

Reviewed-by: Mykola Golub <mgolub@suse.com>

commit | commitdiff | tree

Ilya Dryomov [Mon, 6 May 2024 15:43:11 +0000 (17:43 +0200)]

Merge pull request #57163 from idryomov/wip-copyup-end-in-test

test/librbd: correct expected_overlap in SnapshotCopyup

Reviewed-by: Mykola Golub <mgolub@suse.com>

commit | commitdiff | tree

Ilya Dryomov [Mon, 6 May 2024 15:41:36 +0000 (17:41 +0200)]

Merge pull request #57082 from idryomov/wip-65487

rbd-mirror: clean up stale pool replayers and callouts better

Reviewed-by: N Balachandran <nibalach@redhat.com>

commit | commitdiff | tree

Ilya Dryomov [Mon, 6 May 2024 06:16:01 +0000 (08:16 +0200)]

qa/workunits/rbd: wait for replaying status in bootstrap tests

wait_for_replay_complete() doesn't wait for image status to get
updated. This didn't matter previously because these tests are run on
two different pools and nothing else was following.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>

commit | commitdiff | tree

Ilya Dryomov [Wed, 1 May 2024 13:49:54 +0000 (15:49 +0200)]

test/pybind: drop unnecessary data_pool=None arguments

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>

commit | commitdiff | tree

Ilya Dryomov [Wed, 1 May 2024 13:49:51 +0000 (15:49 +0200)]

pybind/rbd: adjust error message in deep_copy()

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>

commit | commitdiff | tree

Ilya Dryomov [Wed, 1 May 2024 13:49:47 +0000 (15:49 +0200)]

pybind/rbd: expose RBD_IMAGE_OPTION_FLATTEN option

It takes effect with deep_copy() and migration_prepare().

Fixes: https://tracker.ceph.com/issues/65624
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>

commit | commitdiff | tree

Ilya Dryomov [Sun, 28 Apr 2024 17:19:22 +0000 (19:19 +0200)]

pybind/rbd: expose RBD_IMAGE_OPTION_CLONE_FORMAT option

It takes effect with clone(), deep_copy() and migration_prepare().

Fixes: https://tracker.ceph.com/issues/65624
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>

commit | commitdiff | tree

Matan Breizman [Mon, 6 May 2024 07:33:35 +0000 (10:33 +0300)]

Merge pull request #57245 from Matan-B/wip-crimson-only-testing-report

qa/config/crimson_qa_overrides: adjust mgr_stats_period

Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Anthony D'Atri [Sun, 5 May 2024 19:49:20 +0000 (15:49 -0400)]

Merge pull request #57273 from zdover23/wip-doc-2024-05-06-mgr-dashboard

doc/mgr: edit "Resolve IP address to hostname before redirect"

commit | commitdiff | tree

Zac Dover [Sun, 5 May 2024 19:12:55 +0000 (05:12 +1000)]

doc/mgr: edit "Resolve IP address to hostname before redirect"

Edit the section "Resolve IP address to hostname before redirect" in
doc/mgr/dashboard.rst.

Signed-off-by: Zac Dover <zac.dover@proton.me>

commit | commitdiff | tree

Ilya Dryomov [Wed, 24 Apr 2024 10:45:27 +0000 (12:45 +0200)]

rbd-mirror: remove callout when destroying pool replayer

If a pool replayer is removed in an error state (e.g. after failing to
connect to the remote cluster), its callout should be removed as well.
Otherwise, the error would persist causing "daemon health: ERROR"
status to be reported even after a new pool replayer is created and
started successfully.

Fixes: https://tracker.ceph.com/issues/65487
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>

commit | commitdiff | tree

Kefu Chai [Sun, 5 May 2024 10:07:24 +0000 (18:07 +0800)]

Merge pull request #57261 from tchaikov/test_bluefs-fix-asan

test/objectstore/test_bluefs: fix heap-use-after-free

Reviewed-by: Igor Fedotov <igor.fedotov@croit.io>

commit | commitdiff | tree

Matan Breizman [Sun, 5 May 2024 08:11:29 +0000 (11:11 +0300)]

Merge pull request #57056 from rzarzynski/wip-crimson-coherent-log-and-at_version

crimson/osd: make osd_op_params::at_version coherent with last log entry

Reviewed-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Zac Dover [Sun, 5 May 2024 05:23:21 +0000 (15:23 +1000)]

Merge pull request #55380 from Thingee/foundation-mem-202401

doc/foundation: Updating members for Jan 2024

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

commit | commitdiff | tree

Kefu Chai [Sat, 4 May 2024 03:50:54 +0000 (11:50 +0800)]

test/objectstore/test_bluefs: fix heap-use-after-free

this change was created in the same spirit of b8c30a79.

in BlueFS.test_shared_alloc and BlueFS.test_shared_alloc_sparse, we
keep the return value of `fs.get_perf_counters()`, and dereference it
after umounting the fs, but the `PerfCounters*` pointer returned from
`fs.get_perf_counters()` is destroyed in `BlueFS::_shutdown_logger()`
which is in turn called by `BlueFS::umount()`. so ASan points this out:

```
==548153==ERROR: AddressSanitizer: heap-use-after-free on address 0x6110000336c0 at pc 0x7fc810326654 bp 0x7ffd869be8f0 sp 0x7ffd869be8e8
READ of size 8 at 0x6110000336c0 thread T0
    #0 0x7fc810326653 in ceph::common::PerfCounters::get(int) const /home/jenkins-build/build/workspace/ceph-pull-requests/src/common/perf_counters.cc:246:8
    #1 0x564e7a5397a5 in BlueFS_test_shared_alloc_sparse_Test::TestBody() /home/jenkins-build/build/workspace/ceph-pull-requests/src/test/objectstore/test_bluefs.cc:1265:3
    #2 0x564e7a644006 in void testing::internal::HandleSehExceptionsInMethodIfSupported<testing::Test, void>(testing::Test*, void (testing::Test::*)(), char const*) /home/jenkins-build/build/workspace/ceph-pull-requests/src/googletest/googletest/src/gtest.cc:2605:10
    #3 0x564e7a5fdbc2 in void testing::internal::HandleExceptionsInMethodIfSupported<testing::Test, void>(testing::Test*, void (testing::Test::*)(), char const*) /home/jenkins-build/build/workspace/ceph-pull-requests/src/googletest/googletest/src/gtest.cc:2641:14
    #4 0x564e7a5ae7ec in testing::Test::Run() /home/jenkins-build/build/workspace/ceph-pull-requests/src/googletest/googletest/src/gtest.cc:2680:5
    #5 0x564e7a5b0822 in testing::TestInfo::Run() /home/jenkins-build/build/workspace/ceph-pull-requests/src/googletest/googletest/src/gtest.cc:2858:11
    #6 0x564e7a5b1e5b in testing::TestSuite::Run() /home/jenkins-build/build/workspace/ceph-pull-requests/src/googletest/googletest/src/gtest.cc:3012:28
    #7 0x564e7a5cf2e8 in testing::internal::UnitTestImpl::RunAllTests() /home/jenkins-build/build/workspace/ceph-pull-requests/src/googletest/googletest/src/gtest.cc:5723:44
    #8 0x564e7a64c8b6 in bool testing::internal::HandleSehExceptionsInMethodIfSupported<testing::internal::UnitTestImpl, bool>(testing::internal::UnitTestImpl*, bool (testing::internal::UnitTestImpl::*)(), char const*) /home/jenkins-build/build/workspace/ceph-pull-requests/src/googletest/googletest/src/gtest.cc:2605:10
    #9 0x564e7a604662 in bool testing::internal::HandleExceptionsInMethodIfSupported<testing::internal::UnitTestImpl, bool>(testing::internal::UnitTestImpl*, bool (testing::internal::UnitTestImpl::*)(), char const*) /home/jenkins-build/build/workspace/ceph-pull-requests/src/googletest/googletest/src/gtest.cc:2641:14
    #10 0x564e7a5ce672 in testing::UnitTest::Run() /home/jenkins-build/build/workspace/ceph-pull-requests/src/googletest/googletest/src/gtest.cc:5306:10
    #11 0x564e7a55a410 in RUN_ALL_TESTS() /home/jenkins-build/build/workspace/ceph-pull-requests/src/googletest/googletest/include/gtest/gtest.h:2486:46
    #12 0x564e7a551295 in main /home/jenkins-build/build/workspace/ceph-pull-requests/src/test/objectstore/test_bluefs.cc:1609:10
    #13 0x7fc80d775d8f in __libc_start_call_main csu/../sysdeps/nptl/libc_start_call_main.h:58:16
    #14 0x7fc80d775e3f in __libc_start_main csu/../csu/libc-start.c:392:3
    #15 0x564e7a4296a4 in _start (/home/jenkins-build/build/workspace/ceph-pull-requests/build/bin/unittest_bluefs+0x2856a4) (BuildId: fd4e4e0b1c2f9a3b0c1a7051d8ed68b3576e3277)

0x6110000336c0 is located 0 bytes inside of 208-byte region [0x6110000336c0,0x611000033790)
freed by thread T0 here:
    #0 0x564e7a4e7b1d in operator delete(void*) (/home/jenkins-build/build/workspace/ceph-pull-requests/build/bin/unittest_bluefs+0x343b1d) (BuildId: fd4e4e0b1c2f9a3b0c1a7051d8ed68b3576e3277)
    #1 0x564e7a686ce3 in BlueFS::_shutdown_logger() /home/jenkins-build/build/workspace/ceph-pull-requests/src/os/bluestore/BlueFS.cc:462:3
    #2 0x564e7a6a9b55 in BlueFS::umount(bool) /home/jenkins-build/build/workspace/ceph-pull-requests/src/os/bluestore/BlueFS.cc:1076:3
    #3 0x564e7a539767 in BlueFS_test_shared_alloc_sparse_Test::TestBody() /home/jenkins-build/build/workspace/ceph-pull-requests/src/test/objectstore/test_bluefs.cc:1262:6
    #4 0x564e7a644006 in void testing::internal::HandleSehExceptionsInMethodIfSupported<testing::Test, void>(testing::Test*, void (testing::Test::*)(), char const*) /home/jenkins-build/build/workspace/ceph-pull-requests/src/googletest/googletest/src/gtest.cc:2605:10
    #5 0x564e7a5fdbc2 in void testing::internal::HandleExceptionsInMethodIfSupported<testing::Test, void>(testing::Test*, void (testing::Test::*)(), char const*) /home/jenkins-build/build/workspace/ceph-pull-requests/src/googletest/googletest/src/gtest.cc:2641:14
    #6 0x564e7a5ae7ec in testing::Test::Run() /home/jenkins-build/build/workspace/ceph-pull-requests/src/googletest/googletest/src/gtest.cc:2680:5
    #7 0x564e7a5b0822 in testing::TestInfo::Run() /home/jenkins-build/build/workspace/ceph-pull-requests/src/googletest/googletest/src/gtest.cc:2858:11
    #8 0x564e7a5b1e5b in testing::TestSuite::Run() /home/jenkins-build/build/workspace/ceph-pull-requests/src/googletest/googletest/src/gtest.cc:3012:28
    #9 0x564e7a5cf2e8 in testing::internal::UnitTestImpl::RunAllTests() /home/jenkins-build/build/workspace/ceph-pull-requests/src/googletest/googletest/src/gtest.cc:5723:44
    #10 0x564e7a64c8b6 in bool testing::internal::HandleSehExceptionsInMethodIfSupported<testing::internal::UnitTestImpl, bool>(testing::internal::UnitTestImpl*, bool (testing::internal::UnitTestImpl::*)(), char const*) /home/jenkins-build/build/workspace/ceph-pull-requests/src/googletest/googletest/src/gtest.cc:2605:10
    #11 0x564e7a604662 in bool testing::internal::HandleExceptionsInMethodIfSupported<testing::internal::UnitTestImpl, bool>(testing::internal::UnitTestImpl*, bool (testing::internal::UnitTestImpl::*)(), char const*) /home/jenkins-build/build/workspace/ceph-pull-requests/src/googletest/googletest/src/gtest.cc:2641:14
    #12 0x564e7a5ce672 in testing::UnitTest::Run() /home/jenkins-build/build/workspace/ceph-pull-requests/src/googletest/googletest/src/gtest.cc:5306:10
    #13 0x564e7a55a410 in RUN_ALL_TESTS() /home/jenkins-build/build/workspace/ceph-pull-requests/src/googletest/googletest/include/gtest/gtest.h:2486:46
    #14 0x564e7a551295 in main /home/jenkins-build/build/workspace/ceph-pull-requests/src/test/objectstore/test_bluefs.cc:1609:10
    #15 0x7fc80d775d8f in __libc_start_call_main csu/../sysdeps/nptl/libc_start_call_main.h:58:16
```

in this change, instead of keeping `logger` across the `umount()` and
`mount()` calls, we get another instance of `logger`, query it for
the perf counter that we are interested, and compare the value
to see if it is unchanged.

this should address the ASan warning above.

Signed-off-by: Kefu Chai <tchaikov@gmail.com>

commit | commitdiff | tree

Kefu Chai [Sat, 4 May 2024 03:34:50 +0000 (11:34 +0800)]

Merge pull request #56978 from Svelar/asan_rgw_lua

rgw/rgw_lua_utils: free std::string

Reviewed-by: Yuval Lifshitz <ylifshit@ibm.com>

commit | commitdiff | tree

Casey Bodley [Fri, 3 May 2024 19:43:39 +0000 (15:43 -0400)]

rgw: move publish_complete() back to RGWCompleteMultipart::execute()

move publish_complete() and meta_obj->delete_object() back to execute()
so they only run on success. this allows several member variables to
move back to execute()'s stack as well

Fixes: https://tracker.ceph.com/issues/65746
Signed-off-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

Casey Bodley [Fri, 3 May 2024 19:29:00 +0000 (15:29 -0400)]

rgw: CompleteMultipart uses s->object for Notification

get_notification() should be associated with the target object
s->object. the meta_obj has the wrong object name, so required passing
s->object->get_name() as an extra argument

importantly, Notification no longer depends on the lifetime of meta_obj
to avoid a dangling pointer, while the lifetime of s->object is guaranteed

Signed-off-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

Casey Bodley [Fri, 3 May 2024 19:17:48 +0000 (15:17 -0400)]

rgw: CompleteMultipart uses s->object instead of target_obj

most requests operate directly on s->object. there's no reason to
allocate a separate target_obj for the same purpose

Signed-off-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

Mike Perez [Tue, 30 Jan 2024 20:10:22 +0000 (12:10 -0800)]

doc/foundation: Updating members for Jan 2024

Signed-off-by: Mike Perez <thingee@gmail.com>

commit | commitdiff | tree

Rishabh Dave [Fri, 3 May 2024 13:42:58 +0000 (19:12 +0530)]

Merge pull request #56846 from rishabh-d-dave/test-fs-auth

qa/cephfs: fix and improve test_multifs_single_path_rootsquash

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Xiubo Li <xiubli@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Fri, 3 May 2024 13:42:18 +0000 (06:42 -0700)]

Merge pull request #56924 from markhpc/wip-ectransaction-truncate-fix

osd/ECTransaction: Remove incorrect asserts in generate_transactions

Reviewed-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Anthony D'Atri [Fri, 3 May 2024 13:41:59 +0000 (09:41 -0400)]

Merge pull request #57252 from zdover23/wip-doc-2024-05-03-rados-tshooting-tshooting-pg

doc/rados: PR#57022 unfinished business

commit | commitdiff | tree

Yuri Weinstein [Fri, 3 May 2024 13:40:09 +0000 (06:40 -0700)]

Merge pull request #56586 from tchaikov/wip-erasure-code-new

erasure-code: use new/delete to alloc/free coefficients array

Reviewed-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Fri, 3 May 2024 13:39:20 +0000 (06:39 -0700)]

Merge pull request #49619 from amathuria/wip-amat-fix-encode-payload

msg/async: Encode message once features are set

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Fri, 3 May 2024 13:38:22 +0000 (06:38 -0700)]

Merge pull request #49438 from cfsnyder/wip-58274-bluestore-collection-list-bounds

os/bluestore: set rocksdb iterator bounds for Bluestore::_collection_list()

Reviewed-by: Igor Fedotov <ifedotov@suse.com>
Reviewed-by: Adam Kupczyk <akupczyk@redhat.com>

commit | commitdiff | tree

Yuval Lifshitz [Fri, 3 May 2024 12:04:45 +0000 (12:04 +0000)]

test/rgw/notification: prevent failed http tests from hanging

this is a regressions from: 673adcbdbd01e64c8b76c3176e062571fb8710ac

Signed-off-by: Yuval Lifshitz <ylifshit@ibm.com>

commit | commitdiff | tree

Aashish Sharma [Fri, 3 May 2024 11:04:52 +0000 (16:34 +0530)]

Merge pull request #57232 from rhcs-dashboard/fix-65760-main

mgr/dashboard: fix cluster filter typo in multi-cluster-overview grafana dashboard

Reviewed-by: Nizamudeen A <nia@redhat.com>

commit | commitdiff | tree

Connor Fawcett [Fri, 3 May 2024 09:33:38 +0000 (10:33 +0100)]

Merge pull request #57235 from connorfawcett/ec-bench-update

qa/workunits/erasure-code: add bench data tables and graph support for additional jerasure techniques

commit | commitdiff | tree

Rishabh Dave [Fri, 3 May 2024 08:35:50 +0000 (14:05 +0530)]

Merge pull request #56732 from mchangir/mgr-snap_schedule-restore-yearly-spec-from-Y-to-y

mgr/snap_schedule: restore yearly spec to lowercase y

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Jos Collin <jcollin@redhat.com>
Reviewed-by: Rishabh Dave <ridave@redhat.com>

commit | commitdiff | tree

Rishabh Dave [Thu, 11 Apr 2024 18:12:18 +0000 (23:42 +0530)]

qa/cephfs: fix test_multifs_single_path_rootsquash

test_multifs_single_path_rootsquash was never run with vstart_runner.py
or with teuthology and is therefore full of bugs. Fix it to make sure it
runs fine.

Introduced-by: 1fda8ed2d4a9
Fixes: https://tracker.ceph.com/issues/65246
Signed-off-by: Rishabh Dave <ridave@redhat.com>

commit | commitdiff | tree

Rishabh Dave [Thu, 11 Apr 2024 16:39:45 +0000 (22:09 +0530)]

qa/cephfs: run test_multifs_single_path_rootsquash for kclient too

Root squash is valid for kclient too, Kotresh ran test recently fo it
against main branch. Therefore it is safe to remove.
https://github.com/ceph/ceph/pull/56846#discussion_r1587507868

Signed-off-by: Rishabh Dave <ridave@redhat.com>

commit | commitdiff | tree

Rishabh Dave [Fri, 3 May 2024 07:56:49 +0000 (13:26 +0530)]

Merge pull request #56065 from rishabh-d-dave/mds-counters

mds: don't add counters in warning for standby-replay MDS

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Dhairya Parmar <dparmar@redhat.com>

commit | commitdiff | tree

Rishabh Dave [Fri, 3 May 2024 07:43:34 +0000 (13:13 +0530)]

Merge pull request #56066 from rishabh-d-dave/mds-fail-confirm

mon,cephfs: require confirmation flag to bring down unhealthy MDS

Reviewed-by: Leonid Usov <leonid.usov@ibm.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>

commit | commitdiff | tree

Rishabh Dave [Fri, 3 May 2024 07:28:44 +0000 (12:58 +0530)]

Merge pull request #56981 from lxbsz/wip-65563

mds: remove the incorrect std::move for fsname and path

Reviewed-by: Leonid Usov <leonid.usov@ibm.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Milind Changire <mchangir@redhat.com>
Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Dhairya Parmar <dparmar@redhat.com>
Reviewed-by: Rishabh Dave <ridave@redhat.com>

commit | commitdiff | tree

Zac Dover [Fri, 3 May 2024 05:32:28 +0000 (15:32 +1000)]

doc/rados: PR#57022 unfinished business

Make the changes suggested by Anthony D'Atri in
https://github.com/ceph/ceph/pull/57022.

Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>

commit | commitdiff | tree

Nizamudeen A [Fri, 3 May 2024 05:30:18 +0000 (11:00 +0530)]

Merge pull request #57145 from ktdreyer/rm-dashboard-werkzeug

ceph.spec.in: ceph-mgr-dashboard does not require werkzeug

Reviewed-by: Nizamudeen A <nia@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Aashish Sharma [Fri, 3 May 2024 05:23:49 +0000 (10:53 +0530)]

Merge pull request #56579 from rhcs-dashboard/fix-alerts-verison-compatability-multi-cluster

mgr/dashboard: Filter alerts based on cluster fsid and do not allow to connect clusters with version less than hub cluster in multi-cluster

Reviewed-by: Nizamudeen A <nia@redhat.com>

commit | commitdiff | tree

Matan Breizman [Thu, 2 May 2024 13:54:18 +0000 (13:54 +0000)]

qa/config/crimson_qa_overrides: adjust mgr_stats_period

Some test jobs fail with `local_shared_foreign_ptr: Assertion `ptr && *ptr' failed`
It seems that we attempt to use a connection which is not yet ready to use
after setting up the daemons on boot.
Adjust the mgr_stats_period to allow more time for the daemons to set up.
See: https://tracker.ceph.com/issues/62162#note-10

Note: This is not a fix but more of a temporary solution to avoid noise
in the testing suite (Tracker stays open).

Signed-off-by: Matan Breizman <mbreizma@redhat.com>

commit | commitdiff | tree

Casey Bodley [Thu, 2 May 2024 14:58:30 +0000 (15:58 +0100)]

Merge pull request #57004 from kchheda3/wip-fix-retry

rgw/multisite-notification: retry storing bucket notification attrs for ECANCELED(ConcurrentModification) errors.

Reviewed-by: Casey Bodley <cbodley@redhat.com>
Reviewed-by: Yuval Lifshitz <ylifshit@ibm.com>

commit | commitdiff | tree

Patrick Donnelly [Thu, 2 May 2024 14:39:57 +0000 (10:39 -0400)]

Merge PR #57207 into main

* refs/pull/57207/head:
doc/rados: s/cepgsqlite/cephsqlite/

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Anthony D Atri <anthony.datri@gmail.com>

commit | commitdiff | tree

Radoslaw Zarzynski [Tue, 23 Apr 2024 14:10:22 +0000 (14:10 +0000)]

crimson/osd: make osd_op_params::at_version coherent with last log entry

Before this commit we were doing something like:

1. initialize `at_version` with PG::projected_last_update`
   **incremented by one**.
2. produce a log entry at such version.
3. increment `at_version` for the sake of a further production
   that may never come.

The problem is `osd_op_params::at_version` is higher by one
than the last log entry which hurts at later stages of
`osd_op_params` processing (I was hit in the shared EC code
by the assertion in `PG::op_applied`).

This patch changes the algorithm to:

A. initialize `at_version` with PG::projected_last_update`
   **incremented by one**.
B. increment `at_version` for the sake of the very next production.
C. produce a log entry at this version.

Co-authored-by: Matan Breizman <mbreizma@redhat.com>
Signed-off-by: Radoslaw Zarzynski <rzarzyns@redhat.com>

commit | commitdiff | tree

Connor Fawcett [Wed, 1 May 2024 23:07:47 +0000 (23:07 +0000)]

qa/workunits/erasure-code: add bench data tables and graph support for additional jerasure techniques

Signed-off-by: Connor Fawcett <connorfa@uk.ibm.com>

commit | commitdiff | tree

Adam King [Thu, 2 May 2024 12:04:30 +0000 (08:04 -0400)]

Merge pull request #57143 from adk3798/nvmeof-osd-profile-rbd

mgr/cephadm: set OSD cap for NVMEoF daemon to "profile rbd"

Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>

commit | commitdiff | tree

Aashish Sharma [Thu, 2 May 2024 11:42:26 +0000 (17:12 +0530)]

mgr/dashboard: fix cluster filter typo in multi-cluster-overview
grafana dashboard

Fixes: https://tracker.ceph.com/issues/65760
Signed-off-by: Aashish Sharma <aasharma@redhat.com>

commit | commitdiff | tree

Aashish Sharma [Thu, 2 May 2024 10:04:50 +0000 (15:34 +0530)]

Merge pull request #56575 from cloudbehl/ceph-cluster-json-update

monitoring/ceph-mixin: Add cluster variable to ceph-cluster.json

Reviewed-by: Aashish Sharma <aasharma@redhat.com>

commit | commitdiff | tree

Nizamudeen A [Thu, 2 May 2024 08:25:37 +0000 (13:55 +0530)]

Merge pull request #55495 from frittentheke/issue_64321

monitoring/ceph-mixin: Cleanup of variables, queries and tests (to fix showMultiCluster=True)

Reviewed-by: Aashish Sharma <aasharma@redhat.com>
Reviewed-by: Ankush Behl <cloudbehl@gmail.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>

commit | commitdiff | tree

Nizamudeen A [Thu, 2 May 2024 06:05:52 +0000 (11:35 +0530)]

Merge pull request #57151 from rhcs-dashboard/fix-65698-main

mgr/dashboard: exclude cloned-deleted RBD snaps

Reviewed-by: Ilya Dryomov <idryomov@redhat.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>

commit | commitdiff | tree

Zac Dover [Wed, 1 May 2024 16:42:46 +0000 (02:42 +1000)]

doc/rados: s/cepgsqlite/cephsqlite/

As stated in the commit-message line, this corrects the typo "cepg" to
the correct string "ceph".

This typo was discovered by https://github.com/test-erik and this was
brought to our attention way back in
https://github.com/ceph/ceph/pull/50420.

Signed-off-by: Zac Dover <zac.dover@proton.me>

commit | commitdiff | tree

Patrick Donnelly [Wed, 1 May 2024 15:21:09 +0000 (11:21 -0400)]

Merge PR #57183 into main

* refs/pull/57183/head:
script/ptl-tool: add switch to update existing qa ticket

Reviewed-by: Yuri Weinstein <yweins@redhat.com>

commit | commitdiff | tree

Patrick Donnelly [Wed, 1 May 2024 12:57:37 +0000 (08:57 -0400)]

Merge PR #57059 into main

* refs/pull/57059/head:
mds: do not try fragmenting or exporting a quiesced directory
mds: set/test ALL_LOCKED on fragment_dir request
mds: pass bypassfreezing to parent auth pin req
qa: add quiesce tests during fragmentation
qa: translate empty output from rank_tell to empty dict
qa: move reqid_tostr helper

Reviewed-by: Leonid Usov <leonid.usov@ibm.com>

commit | commitdiff | tree

Radoslaw Zarzynski [Tue, 23 Apr 2024 14:04:02 +0000 (14:04 +0000)]

crimson/osd: s/fill_op_params_bump_pg_version/fill_op_params/ in OpsExecuter

Since https://github.com/ceph/ceph/pull/53054 it doesn't
bump `PG::projected_last_update` anymore.

Signed-off-by: Radoslaw Zarzynski <rzarzyns@redhat.com>

commit | commitdiff | tree

Radoslaw Zarzynski [Tue, 23 Apr 2024 13:54:23 +0000 (13:54 +0000)]

crimson/osd: on the cloning path bump obj version closer to its usage

My understanding at the time of writing is this change is a pure
refactoring.

Signed-off-by: Radoslaw Zarzynski <rzarzyns@redhat.com>

commit | commitdiff | tree

Matan Breizman [Wed, 1 May 2024 07:10:58 +0000 (10:10 +0300)]

Merge pull request #57102 from xxhdx1985126/wip-65672

crimson/osd/osdop_params:Unify OpsExecuter::user_modify and osd_op_params_t::user_modify

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Matan Breizman <mbreizma@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Tue, 30 Apr 2024 23:32:33 +0000 (16:32 -0700)]

Merge pull request #56995 from athanatos/sjust/wip-65185-scrub-attr-error

osd: only call stat/getattrs once per object during deep-scrub

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>

commit | commitdiff | tree

Patrick Donnelly [Tue, 30 Apr 2024 20:46:06 +0000 (16:46 -0400)]

Merge PR #56997 into main

* refs/pull/56997/head:
pybind/mgr: disable sqlite3/python autocommit
qa/tasks/mgr: add tests for sqlite autocommit
qa/tasks/vstart_runner: run daemons in foreground
qa/tasks/vstart_runner: add missing poll method
qa/suites/rados/mgr: add cli/devicehealth tasks
qa: reorganize mgr unit tests
qa: use position-independent link
qa: add missing terminating newline
pybind/mgr: add killpoint for sqlite3 database setup
mgr: allow specifying module option level
mon/MgrMonitor: promote standby when unsetting down flag
mon/MgrMonitor: only drop active if exists

Reviewed-by: Ernesto Puerta <epuertat@redhat.com>

commit | commitdiff | tree

Adam King [Tue, 30 Apr 2024 19:48:41 +0000 (15:48 -0400)]

Merge pull request #57144 from adk3798/stray-laundry

qa/cephadm: ignore stray daemon warning during rados_api_tests

Reviewed-by: Laura Flores <lflores@ibm.com>

commit | commitdiff | tree

Adam King [Tue, 30 Apr 2024 19:41:23 +0000 (15:41 -0400)]

Merge pull request #56961 from adk3798/agent-check-error-before-json

cephadm: have agent check for errors before json loading mgr response

Reviewed-by: <mfritch@suse.com>

commit | commitdiff | tree

Adam King [Tue, 30 Apr 2024 19:38:20 +0000 (15:38 -0400)]

Merge pull request #56933 from adk3798/service-size-cleanup

mgr/cephadm: cleanup service size logic block

Reviewed-by: Michael Fritch <mfritch@suse.com>

commit | commitdiff | tree

Adam King [Tue, 30 Apr 2024 19:36:27 +0000 (15:36 -0400)]

Merge pull request #56928 from adk3798/grafana-anon-access-dropped

python-common: handle "anonymous_access: false" in to_json of Grafana spec

Reviewed-by: Nizamudeen A <nia@redhat.com>

commit | commitdiff | tree

Adam King [Tue, 30 Apr 2024 19:32:39 +0000 (15:32 -0400)]

Merge pull request #56350 from phlogistonjohn/jjm-mgr-smb-module

mgr: add a new smb cluster & share module

Reviewed-by: Adam King <adking@redhat.com>

commit | commitdiff | tree

Adam King [Tue, 30 Apr 2024 19:30:30 +0000 (15:30 -0400)]

Merge pull request #56817 from guits/cephadm-check-apply-spec-file

cephadm: check if file exists when passing `--apply_spec`

Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: John Mulligan <jmulligan@redhat.com>

commit | commitdiff | tree

Adam King [Tue, 30 Apr 2024 19:23:59 +0000 (15:23 -0400)]

Merge pull request #56487 from adk3798/bootstrap-custom-alerts

cephadm: allow passing custom prometheus alerts to bootstrap

Reviewed-by: John Mulligan <jmulligan@redhat.com>

commit | commitdiff | tree

Patrick Donnelly [Sat, 20 Apr 2024 19:57:22 +0000 (15:57 -0400)]

script/ptl-tool: add switch to update existing qa ticket

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>

commit | commitdiff | tree

Ernesto Puerta [Tue, 30 Apr 2024 08:51:23 +0000 (10:51 +0200)]

mgr/dashboard: exclude cloned-deleted RBD snaps

Fixes: https://tracker.ceph.com/issues/65698
Signed-off-by: Ernesto Puerta <epuertat@redhat.com>

commit | commitdiff | tree

Patrick Donnelly [Tue, 30 Apr 2024 16:22:41 +0000 (12:22 -0400)]

Merge PR #57044 into main

* refs/pull/57044/head:
qa: increase debugging for snap_schedule

Reviewed-by: Milind Changire <mchangir@redhat.com>

commit | commitdiff | tree

Patrick Donnelly [Tue, 30 Apr 2024 16:22:15 +0000 (12:22 -0400)]

Merge PR #57043 into main

* refs/pull/57043/head:
client: clear resend_mds only after sending request

Reviewed-by: Milind Changire <mchangir@redhat.com>

commit | commitdiff | tree

Patrick Donnelly [Tue, 30 Apr 2024 16:21:20 +0000 (12:21 -0400)]

Merge PR #57020 into main

* refs/pull/57020/head:
mds/cache: don't assume non-auth xlocks to be remote locks

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Milind Changire <mchangir@redhat.com>

commit | commitdiff | tree

Patrick Donnelly [Tue, 30 Apr 2024 16:20:17 +0000 (12:20 -0400)]

Merge PR #56935 into main

* refs/pull/56935/head:
mds: encode flags for all inode types
qa: test file inode with F_QUIESCE_BLOCK is replicated

Reviewed-by: Leonid Usov <leonid.usov@ibm.com>

commit | commitdiff | tree

Patrick Donnelly [Tue, 30 Apr 2024 16:19:31 +0000 (12:19 -0400)]

Merge PR #56934 into main

* refs/pull/56934/head:
mds: move drop_locks to directly after rdonly check
qa: test quiesce.block is replicated
qa: test that ceph.dir.subvolume is replicated properly
mds: add debug "lock path" command
qa: move reqid_tostr helper
qa: return run_shell process for waiters

Reviewed-by: Leonid Usov <leonid.usov@ibm.com>

commit | commitdiff | tree

Patrick Donnelly [Tue, 30 Apr 2024 16:18:45 +0000 (12:18 -0400)]

Merge PR #56923 into main

* refs/pull/56923/head:
qa: extend rank 1 lockup for test_quiesce_authpin_wait

Reviewed-by: Leonid Usov <leonid.usov@ibm.com>

commit | commitdiff | tree

Anthony D'Atri [Tue, 30 Apr 2024 14:10:17 +0000 (10:10 -0400)]

Merge pull request #57148 from zdover23/wip-doc-2024-04-30-cephadm-services-monitoring

doc/cephadm: add default monitor images

commit | commitdiff | tree

Ilya Dryomov [Tue, 30 Apr 2024 13:26:38 +0000 (15:26 +0200)]

test/librbd: correct expected_overlap in SnapshotCopyup

Changing the end of second interval from 2096640 to copyup_end - 512
with copyup_end potentially set to 1 << order in commit 750e61ac91d7
("librbd: clone copy-on-write operations should preserve sparseness")
was incorrect because the test image size is just 2M. There are no
end-to-end tests for enable_sparse_copyup = false case, so this went
unnoticed.

A year later, commit 38622b5ca12d ("librbd: copyup state machine
should always issue a sparse-read") dropped the respective branch in
CopyupRequest, thus eliminating the reason for branching on
enable_sparse_copyup altogether.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>

commit | commitdiff | tree

Ken Dreyer [Mon, 29 Apr 2024 20:29:11 +0000 (16:29 -0400)]

ceph.spec.in: ceph-mgr-dashboard does not require werkzeug

Nothing in the dashboard codebase imports werkzeug. It appears this was
leftover from the time when the dashboard was packaged with the rest of
the mgr modules.

Fixes: https://tracker.ceph.com/issues/65693
Signed-off-by: Ken Dreyer <kdreyer@ibm.com>

commit | commitdiff | tree

Zac Dover [Tue, 30 Apr 2024 12:24:10 +0000 (22:24 +1000)]

Merge pull request #57153 from rkhudov/doc-man-8-radosgw-admin_add-get-lifecycle-config-command

doc/man/8/radosgw-admin: add get lifecycle command

Reviewed-by: Zac Dover <zac.dover@proton.me>

commit | commitdiff | tree

Zac Dover [Tue, 30 Apr 2024 12:17:49 +0000 (22:17 +1000)]

Merge pull request #57154 from cityofships/main

doc/rados/operations: rephrase OSDs peering

Reviewed-by: Zac Dover <zac.dover@proton.me>

commit | commitdiff | tree

Ilya Dryomov [Tue, 30 Apr 2024 12:03:00 +0000 (14:03 +0200)]

Merge pull request #55978 from xxhdx1985126/wip-librbd-unittest-sparsify

test/librbd/test_internal: change to check for 4KB-aligned sparse read

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Ilya Dryomov <idryomov@gmail.com>

commit | commitdiff | tree

Piotr Parczewski [Tue, 30 Apr 2024 10:56:44 +0000 (12:56 +0200)]

doc/rados/operations: rephrase OSDs peering

Signed-off-by: Piotr Parczewski <piotr@stackhpc.com>

commit | commitdiff | tree

Leonid S. Usov [Tue, 30 Apr 2024 10:53:46 +0000 (13:53 +0300)]

Merge pull request #55823 from ceph/wip-lusov-quiescer

qa: cephfs quiesce thrasher

commit | commitdiff | tree

rkhudov [Tue, 30 Apr 2024 10:00:36 +0000 (11:00 +0100)]

doc/man/8/radosgw-admin: add get lifecycle command

Signed-off-by: rkhudov <rhudov@gmail.com>

commit | commitdiff | tree

Matan Breizman [Tue, 30 Apr 2024 09:29:59 +0000 (12:29 +0300)]

Merge pull request #56806 from xxhdx1985126/wip-65399

crimson/osd/pg_recovery: backoff if the recovery/backfill is deferred

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Matan Breizman <mbreizma@redhat.com>

commit | commitdiff | tree

Matan Breizman [Tue, 30 Apr 2024 09:29:16 +0000 (12:29 +0300)]

Merge pull request #56998 from xxhdx1985126/wip-65568

crimson/osd/pg: only kick off snap trimming when PGs are clean

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Matan Breizman <mbreizma@redhat.com>

commit | commitdiff | tree

Yingxin [Tue, 30 Apr 2024 08:40:50 +0000 (16:40 +0800)]

Merge pull request #57128 from cyx1231st/wip-fix-ut-seastore-timeout

crimson/os/seastore: misc cleanups and fixes

Reviewed-by: Xuehan Xu <xuxuehan@qianxin.com>

commit | commitdiff | tree

Zac Dover [Tue, 30 Apr 2024 07:09:05 +0000 (17:09 +1000)]

doc/cephadm: add default monitor images

Add a list of default monitor images to the documentation. This commit
is made in response to a request from Eugen Block, and is made using the
information developed by Mr Block here:
https://lists.ceph.io/hyperkitty/list/ceph-users@ceph.io/thread/QGC66QIFBKRTPZAQMQEYFXOGZJ7RLWBN/.

Signed-off-by: Zac Dover <zac.dover@proton.me>

commit | commitdiff | tree

Matan Breizman [Tue, 30 Apr 2024 06:50:45 +0000 (09:50 +0300)]

Merge pull request #56892 from xxhdx1985126/wip-crimson-urgent-recovery-version

crimson/osd/osd_operations/client_request: retrieve the correct version for objects to be recovered urgently

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Matan Breizman <mbreizma@redhat.com>

commit | commitdiff | tree

Matan Breizman [Tue, 30 Apr 2024 06:45:57 +0000 (09:45 +0300)]

Merge pull request #56610 from Matan-B/wip-matanb-crimson-replicated-head-obc

crimson/osd/replicated_recovery_backend: Fix recovery obc usage

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Xuehan Xu <xuxuehan@qianxin.com>

commit | commitdiff | tree

Yingxin [Tue, 30 Apr 2024 01:52:17 +0000 (09:52 +0800)]

Merge pull request #57135 from xxhdx1985126/wip-seastore-checksum-assertion-fix

crimson/os/seastore/transaction_manager: remove incorrect assertions

Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Yingxin [Tue, 30 Apr 2024 01:49:43 +0000 (09:49 +0800)]

Merge pull request #57129 from xxhdx1985126/wip-65585

crimson/os/seastore/transaction_manager: fix write pipeline phase leak

Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
Reviewed-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Patrick Donnelly [Tue, 23 Apr 2024 18:33:14 +0000 (14:33 -0400)]

mds: do not try fragmenting or exporting a quiesced directory

And handle inode becoming quiesced after op is created.

Fixes: https://tracker.ceph.com/issues/65603
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>

commit | commitdiff | tree

Patrick Donnelly [Mon, 29 Apr 2024 14:51:39 +0000 (10:51 -0400)]

mds: set/test ALL_LOCKED on fragment_dir request

This is an optimization to obviate repeated calls to acquire_locks.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>

Unnamed repository; edit this file 'description' to name the repository.