git-server-git.apps.pok.os.sepia.ceph.com Git - ceph.git/log

]> git-server-git.apps.pok.os.sepia.ceph.com Git - ceph.git/log

projects / ceph.git / log

summary | shortlog | log | commit | commitdiff | tree
first ⋅ prev ⋅ next

commit | commitdiff | tree

Zac Dover [Fri, 17 Jan 2025 12:33:49 +0000 (22:33 +1000)]

doc/cephfs: edit disaster-recovery-experts (2 of x)

Edit the third and fourth sections of
doc/cephfs/disaster-recovery-experts.rst in preparation for adding
deeper explanations of the contexts in which one should use the various
commands listed on that page.

Follows https://github.com/ceph/ceph/pull/61426

https://tracker.ceph.com/issues/69557

Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 4f3a69eb919fc0d99cdf943f095ca3a951c82897)

commit | commitdiff | tree

Zac Dover [Sat, 18 Jan 2025 03:44:14 +0000 (13:44 +1000)]

Merge pull request #61437 from zdover23/wip-doc-2025-01-18-backport-61272-to-squid

squid: doc/radosgw/config-ref: fix lc worker thread tuning

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

commit | commitdiff | tree

Zac Dover [Sat, 18 Jan 2025 03:42:48 +0000 (13:42 +1000)]

Merge pull request #61423 from zdover23/wip-doc-2025-01-17-backport-61411-to-squid

squid: doc/cephfs: edit disaster-recovery-experts

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

commit | commitdiff | tree

Zac Dover [Sat, 18 Jan 2025 03:42:39 +0000 (13:42 +1000)]

Merge pull request #61419 from zdover23/wip-doc-2025-01-17-revert-part-of-61331

squid: src/exporter: revert incorrect lines

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

commit | commitdiff | tree

Zac Dover [Sat, 18 Jan 2025 03:42:21 +0000 (13:42 +1000)]

Merge pull request #61401 from zdover23/wip-doc-2025-01-16-backport-61373-to-squid

squid: AsyncMessenger.cc : improve error messages

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

commit | commitdiff | tree

Zac Dover [Fri, 17 Jan 2025 22:18:53 +0000 (08:18 +1000)]

Merge pull request #61377 from zdover23/wip-doc-2025-01-15-backport-61072-to-squid

squid: test/pybind: Clean whitespace. (Doc and test fixes)

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

commit | commitdiff | tree

Laimis Juzeliūnas [Wed, 8 Jan 2025 22:05:34 +0000 (00:05 +0200)]

doc/radosgw/config-ref: fix lc worker thread tuning

This commit updates RGW Config Reference - Lifecycle Settings section. In particular it addresses an incorrect suggestion to decrease parallel threads in the workers pool for a more aggressive/accelerated per-bucket lifecycle processing. A more aggressive lifecycle processing for a bucket containing higher number of objects is achieved by increasing, not decreasing parallel threads.
Current suggestion is miss-leading.

Fixes: https://tracker.ceph.com/issues/63659
Signed-off-by: Laimis Juzeliunas <laimis.juzeliunas@oxylabs.io>
(cherry picked from commit b7ae18a292c7d1d5139dfb74c575f1af0de29a3e)

commit | commitdiff | tree

Adam Kupczyk [Fri, 17 Jan 2025 12:31:25 +0000 (13:31 +0100)]

Merge pull request #60240 from aclamk/wip-aclamk-bluefs-truncate-allocations-squid

squid: os/bluestore: Make truncate() drop unused allocations

commit | commitdiff | tree

Zac Dover [Thu, 16 Jan 2025 11:51:46 +0000 (21:51 +1000)]

doc/cephfs: edit disaster-recovery-experts

Edit the first two sections of doc/cephfs/disaster-recovery-experts.rst
in preparation for adding deeper explanations of the contexts in which
one should use the various commands listed on that page.

https://tracker.ceph.com/issues/69557

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit cc8cfeddbd290ef9b5e4e0c68ae94cefb34e1be9)

commit | commitdiff | tree

SrinivasaBharathKanta [Fri, 17 Jan 2025 00:12:24 +0000 (05:42 +0530)]

Merge pull request #59389 from jmundack/wip-67657-squid

squid: common: Leverage a better CRC32C implementation

commit | commitdiff | tree

Zac Dover [Thu, 16 Jan 2025 15:49:04 +0000 (01:49 +1000)]

squid: src/exporter: revert incorrect lines

Revert part of 99f4eccb6bef444f86e6d6cbf579dc3ba975af98. The lines
removed here were reported to me by Anthony D'Atri, who says that Juan
Miguel Olmo told him that they should not be in the Squid branch.

See https://github.com/ceph/ceph/pull/61331#issuecomment-2596058701

Signed-off-by: Zac Dover <zac.dover@proton.me>

commit | commitdiff | tree

Adam Kupczyk [Thu, 16 Jan 2025 13:02:00 +0000 (14:02 +0100)]

Merge pull request #60543 from aclamk/wip-68680-squid

squid: os/bluestore/ceph-bluestore-tool: Modify show-label for many devs

commit | commitdiff | tree

Anthony D'Atri [Wed, 15 Jan 2025 00:17:14 +0000 (19:17 -0500)]

AsyncMessenger.cc : improve error messages

Signed-off-by: Anthony D'Atri <anthonyeleven@users.noreply.github.com>
(cherry picked from commit 3d36a3b9bbeb8b21b99046aab0d0bdf8f1c30aa2)

commit | commitdiff | tree

Paulo E. Castro [Thu, 12 Dec 2024 22:33:10 +0000 (22:33 +0000)]

test/pybind: Clean whitespace. (Doc and test fixes)

Signed-off-by: Paulo E. Castro <pecastro@wormholenet.com>
(cherry picked from commit aec8ca8b2688be5b460c822cab7ae6ba47bf4c80)

test/pybind: Test method has been renamed in unittest 3.2

Signed-off-by: Paulo E. Castro <pecastro@wormholenet.com>
(cherry picked from commit 560d66e34edacef0bed3c44ff8a02be6f79b5fbe)

doc: Fix typo.

Signed-off-by: Paulo E. Castro <pecastro@wormholenet.com>
(cherry picked from commit 5a886ee6ce9e55e9f66da6cff45945fba354bd69)

tools/cephfs: fix flake8 f-string formatting for py3.12

Signed-off-by: Paulo E. Castro <pecastro@wormholenet.com>
(cherry picked from commit 2b2ce7871579252074eff7b6072890196b1e4f2c)

commit | commitdiff | tree

Adam Kupczyk [Fri, 10 Jan 2025 08:26:54 +0000 (08:26 +0000)]

os/bluestore: Fix BlueFS::truncate()

In `struct bluefs_fnode_t` there is a vector `extents` and
the vector `extents_index` that is a log2 seek cache.

Until modifications to truncate() we never removed extents from files.
Modified truncate() did not update extents_index.

For example 10 extents long files when truncated to 0 will have:
0 extents, 10 extents_index.
After writing some data to file:
1 extents, 11 extents_index.

Now, `bluefs_fnode_t::seek` will binary search extents_index,
lets say it located seek at item #3.
It will then jump up from #0 extent (that exists) to #3 extent which
does not exist at.
The worst part is that code is now broken, as #3 != extent.end().

There are 3 parts of the fix:
1) assert in `bluefs_fnode_t::seek` to protect against
   jumping outside extents
2) code in BlueFS::truncate to sync up `extents_index` with `extents`
3) dampening down assert in _replay to give a way out of cases
   where incorrect "offset 12345" (12345 is file size) instead of
   "offset 20000" (allocations occupied) was written to log.

Fixes: https://tracker.ceph.com/issues/69481
Signed-off-by: Adam Kupczyk <akupczyk@ibm.com>
(cherry picked from commit 7f3601089d41bfc23f530c7bf3fb7efad2d055ec)

commit | commitdiff | tree

Adam Kupczyk [Fri, 10 Jan 2025 10:07:18 +0000 (10:07 +0000)]

os/bluestore: bluefs unittest for truncate bug

Unittest showing 2 different flavours of problems:
1) bluefs log corruption
2) bluefs sigsegv

Signed-off-by: Adam Kupczyk <akupczyk@ibm.com>
(cherry picked from commit f2b5e2fa0a9274c1667fccafa597fff9be7a74b1)
+fixup for bad usage of std::string's fill constructor

commit | commitdiff | tree

SrinivasaBharathKanta [Tue, 14 Jan 2025 00:53:10 +0000 (06:23 +0530)]

Merge pull request #61069 from shraddhaag/wip-68690-squid

squid: mon/LogMonitor: Use generic cluster log level config

commit | commitdiff | tree

Anthony D'Atri [Mon, 13 Jan 2025 15:09:24 +0000 (10:09 -0500)]

Merge pull request #61342 from zdover23/wip-doc-2025-01-13-backport-61313-to-squid

squid: doc: improve tests-integration-testing-teuthology-workflow.rst

commit | commitdiff | tree

Vallari Agrawal [Thu, 9 Jan 2025 12:27:42 +0000 (17:57 +0530)]

doc: improve tests-integration-testing-teuthology-workflow.rst

This commit adds:
1. workflow summary in the first section along with an image.
2. sub-section "Pushing to ceph-ci repository" to second section.
3. file doc/dev/developer_guide/testing_integration_tests/workflow.png

Signed-off-by: Vallari Agrawal <vallari.agrawal@ibm.com>
(cherry picked from commit dc539b3ea8031d2b02da9d5a5b1f856d96d70362)

commit | commitdiff | tree

SrinivasaBharathKanta [Mon, 13 Jan 2025 06:59:48 +0000 (12:29 +0530)]

Merge pull request #60629 from kamoltat/wip-68840-squid

squid: mon [stretch mode]: support disable_stretch_mode

commit | commitdiff | tree

SrinivasaBharathKanta [Mon, 13 Jan 2025 06:57:43 +0000 (12:27 +0530)]

Merge pull request #59625 from NitzanMordhai/wip-67914-squid

squid: test: test_rados_tools compare output without trimming newline

commit | commitdiff | tree

Ronen Friedman [Sun, 12 Jan 2025 13:42:17 +0000 (15:42 +0200)]

Merge pull request #61088 from ronen-fr/wip-rf-59437-squid

squid: test/scrub: only instruct clean PGs to scrub
Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>

commit | commitdiff | tree

Ilya Dryomov [Sat, 11 Jan 2025 17:24:55 +0000 (18:24 +0100)]

Merge pull request #60618 from badone/wip-68825-squid

squid: qa: Add ignorelist entries for reef-x tests

Reviewed-by: Laura Flores <lflores@ibm.com>
Reviewed-by: Ilya Dryomov <idryomov@gmail.com>

commit | commitdiff | tree

Ilya Dryomov [Sat, 11 Jan 2025 16:32:23 +0000 (17:32 +0100)]

Merge pull request #59245 from ljflores/wip-67577-squid

squid: qa/suites/upgrade: ignore PG_AVAILABILITY and MON_DOWN for quincy-x and reef-x upgrade suites

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
Reviewed-by: Ilya Dryomov <idryomov@gmail.com>

commit | commitdiff | tree

Laura Flores [Tue, 13 Aug 2024 22:58:17 +0000 (17:58 -0500)]

qa/suites/upgrade: ignore MON_DOWN and PG_AVAILABILITY warnings in upgrade tests

Signed-off-by: Laura Flores <lflores@ibm.com>
(cherry picked from commit e225b127eaf5375493bc1a667058de092f10e4dd)

commit | commitdiff | tree

Anthony D'Atri [Sat, 11 Jan 2025 01:32:17 +0000 (20:32 -0500)]

Merge pull request #61331 from zdover23/wip-doc-2025-01-11-backport-60081-to-squid

squid: src/exporter: improve usage message

commit | commitdiff | tree

Anthony D'Atri [Tue, 1 Oct 2024 16:21:56 +0000 (12:21 -0400)]

src/exporter: improve usage message

Signed-off-by: Anthony D'Atri <anthonyeleven@users.noreply.github.com>
(cherry picked from commit 725b4e184798dc38ec60ab81766577b39fd6e488)

commit | commitdiff | tree

Adam Emerson [Fri, 10 Jan 2025 06:32:36 +0000 (01:32 -0500)]

Merge pull request #61306 from adamemerson/wip-69303-squid

squid: rgw: Don't crash on exceptions from pool listing

Reviewed-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

Adam Emerson [Thu, 9 Jan 2025 16:46:32 +0000 (11:46 -0500)]

rgw: Don't crash on exceptions from pool listing

Fixes: https://tracker.ceph.com/issues/69303
Signed-off-by: Adam Emerson <aemerson@redhat.com>
(cherry picked from commit 4318b188740634ead65a986680f9186541c9d1b0)

Fixes: https://tracker.ceph.com/issues/69476
Signed-off-by: Adam Emerson <aemerson@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Wed, 8 Jan 2025 15:16:00 +0000 (16:16 +0100)]

Merge pull request #61260 from guits/wip-69437-squid

ceph-volume: fix Zap.ensure_associated_raw()

commit | commitdiff | tree

Guillaume Abrioux [Mon, 6 Jan 2025 16:12:22 +0000 (16:12 +0000)]

ceph-volume: fix Zap.ensure_associated_raw()

When an OSD creation fails, ceph-volume can zaps unrelated
existing raw based OSD as part of the 'rollback step'.

Fixes: https://tracker.ceph.com/issues/69430
Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>
(cherry picked from commit 95bdcbfd813d41aa07069cc95a5da7c321936afd)

commit | commitdiff | tree

Guillaume Abrioux [Wed, 8 Jan 2025 07:59:59 +0000 (08:59 +0100)]

Merge pull request #60487 from guits/wip-68720-squid

squid: ceph-volume: support zapping by osd-id for RAW OSDs

commit | commitdiff | tree

Adam Emerson [Tue, 7 Jan 2025 16:55:49 +0000 (11:55 -0500)]

Merge pull request #61023 from cbodley/wip-69181-squid

squid: qa/rgw: bump keystone/barbican from 2023.1 to 2024.1

Reviewed-by: Adam Emerson <aemerson@redhat.com>

commit | commitdiff | tree

Ronen Friedman [Tue, 7 Jan 2025 12:53:22 +0000 (14:53 +0200)]

Merge pull request #61185 from ronen-fr/wip-rf-61184-squid

squid: osd/scrub: register for 'osd_max_scrubs' config changes
Reviewed-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Ronen Friedman [Tue, 7 Jan 2025 11:18:52 +0000 (13:18 +0200)]

Merge pull request #61118 from ronen-fr/wip-rf-61115-squid

squid: qa/standalone/scrub: fix TEST_periodic_scrub_replicated
Reviewed-by: Laura Flores <lflores@redhat.com>

commit | commitdiff | tree

Milind Changire [Fri, 3 Jan 2025 09:50:51 +0000 (15:20 +0530)]

Merge pull request #60752 from chrisphoffman/wip-68942-squid

squid: mds: Implement remove for ceph vxattrs

commit | commitdiff | tree

Milind Changire [Fri, 3 Jan 2025 04:28:53 +0000 (09:58 +0530)]

Merge pull request #59919 from mchangir/wip-67826-squid

squid: qa: failfast mount for better performance and unblock `fs volume ls`

commit | commitdiff | tree

Milind Changire [Thu, 2 Jan 2025 10:18:03 +0000 (15:48 +0530)]

Merge pull request #59395 from lxbsz/wip-67587

squid: client: flush the caps release in filesystem sync

commit | commitdiff | tree

Milind Changire [Thu, 2 Jan 2025 10:17:08 +0000 (15:47 +0530)]

Merge pull request #59518 from batrick/wip-67751-squid

squid: mds: cephx path restriction incorrectly rejects snapshots of deleted directory

commit | commitdiff | tree

Milind Changire [Thu, 2 Jan 2025 10:16:00 +0000 (15:46 +0530)]

Merge pull request #59704 from vshankar/wip-67374-squid

squid: mon: fix `fs set down` to adjust max_mds only when cluster is not down

commit | commitdiff | tree

Milind Changire [Thu, 2 Jan 2025 10:12:01 +0000 (15:42 +0530)]

Merge pull request #60059 from vshankar/wip-68321-squid

squid: mds: invalid id for client eviction is to be treated as success

commit | commitdiff | tree

Milind Changire [Thu, 2 Jan 2025 10:11:21 +0000 (15:41 +0530)]

Merge pull request #60687 from vshankar/wip-67532-squid

squid: tools/rados: Fix extra NL in getxattr

commit | commitdiff | tree

Milind Changire [Thu, 2 Jan 2025 09:56:39 +0000 (15:26 +0530)]

Merge pull request #60688 from vshankar/wip-68112-squid

squid: mds: batch backtrace updates by pool-id when expiring a log segment

commit | commitdiff | tree

Milind Changire [Thu, 2 Jan 2025 09:55:56 +0000 (15:25 +0530)]

Merge pull request #60691 from vshankar/wip-67693-squid

squid: mds: getattr just waits the xlock to be released by the previous client

commit | commitdiff | tree

Milind Changire [Thu, 2 Jan 2025 09:55:10 +0000 (15:25 +0530)]

Merge pull request #60694 from vshankar/wip-68694-squid

squid: client: fixed a bug that read operation hung

commit | commitdiff | tree

Milind Changire [Thu, 2 Jan 2025 09:49:57 +0000 (15:19 +0530)]

Merge pull request #59685 from vshankar/wip-67744-squid

squid: qa: wait for file creation before changing mode

commit | commitdiff | tree

Milind Changire [Thu, 2 Jan 2025 09:48:51 +0000 (15:18 +0530)]

Merge pull request #59866 from dparmar18/wip-66620-squid

squid: mds: drop client metrics during recovery

commit | commitdiff | tree

Milind Changire [Thu, 2 Jan 2025 09:44:03 +0000 (15:14 +0530)]

Merge pull request #59899 from joscollin/wip-68168-squid

squid: cephfs-top: fix exceptions on small/large sized windows

commit | commitdiff | tree

Milind Changire [Thu, 2 Jan 2025 09:43:27 +0000 (15:13 +0530)]

Merge pull request #60181 from joscollin/wip-68408-squid

squid: qa: fix test failure test_cephfs_mirror_cancel_mirroring_and_readd

commit | commitdiff | tree

Milind Changire [Thu, 2 Jan 2025 09:42:30 +0000 (15:12 +0530)]

Merge pull request #60218 from dparmar18/wip-67674-squid

squid: client: contiguous read fails for non-contiguous write (in async I/O api)

commit | commitdiff | tree

Pedro Gonzalez Gomez [Thu, 2 Jan 2025 09:27:50 +0000 (10:27 +0100)]

Merge pull request #61181 from rhcs-dashboard/squid-configuration-not-updatable

Squid: mgr/dashboard: Administration > Configuration > Some of the config options are not updatable at runtime

Reviewed-by: Nizamudeen A <nia@redhat.com>
Reviewed-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>

commit | commitdiff | tree

Anthony D'Atri [Mon, 30 Dec 2024 15:11:26 +0000 (10:11 -0500)]

Merge pull request #61193 from zdover23/wip-doc-2024-12-30-backport-60794-to-squid

squid: doc/cephfs: document purge queue and its perf counters

commit | commitdiff | tree

Dhairya Parmar [Thu, 21 Nov 2024 22:10:33 +0000 (03:40 +0530)]

doc/cephfs: document purge queue and its perf counters

Fixes: https://tracker.ceph.com/issues/68571
Signed-off-by: Dhairya Parmar <dparmar@redhat.com>
(cherry picked from commit ae9277398840bf8576ea5a8c4a2ba4e23f8b9613)

commit | commitdiff | tree

Pedro Gonzalez Gomez [Fri, 27 Dec 2024 12:30:44 +0000 (13:30 +0100)]

Merge pull request #61189 from rhcs-dashboard/squid-multisite-e2e-fixes

Squid: mgr/dashboard: fix multisite e2e failures

Reviewed-by: Pedro Gonzaez Gomez <pegonzal@redhat.com>

commit | commitdiff | tree

Naman Munet [Fri, 22 Nov 2024 09:57:44 +0000 (15:27 +0530)]

mgr/dashboard: Administration > Configuration > Some of the config options are not updatable at runtime

Fixes: https://tracker.ceph.com/issues/68976
Fixes Includes:
1) by-passing 'can_update_at_runtime' flag for 'rgw' related configurations as the same can be updated at runtime via CLI.
Also implemented a warning popup for user to make force edit to rgw related configurations.
2) when navigated to Administration >> Configuration, modified configuration will be seen as we see in cli "ceph config dump",
instead of configuration with filter level:basic

Signed-off-by: Naman Munet <naman.munet@ibm.com>
(cherry picked from commit 3181acc223dafd04e3fc56d418389ad50c5868e4)

commit | commitdiff | tree

Naman Munet [Wed, 27 Nov 2024 17:27:39 +0000 (22:57 +0530)]

mgr/dashboard: fix multisite e2e failures

Fixes: https://tracker.ceph.com/issues/69072
Signed-off-by: Naman Munet <namanmunet@li-ff83bccc-26af-11b2-a85c-a4b04bfb1003.ibm.com>
(cherry picked from commit 3ae0e6d05375e41e002c262fec526e40120500ec)

commit | commitdiff | tree

Ronen Friedman [Thu, 26 Dec 2024 13:06:10 +0000 (07:06 -0600)]

osd/scrub: register for 'osd_max_scrubs' config changes

Since https://github.com/ceph/ceph/pull/55340, the
osd_max_scrubs (also) affects the parameters of the
async scrub reserver used by the replicas. Thus,
the code must notice and acknowledge changes to this config.

Fixes: https://tracker.ceph.com/issues/69362
Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
(cherry picked from commit 31e6bacfbf60e3e9222cae354d8527fa92282dbc)

commit | commitdiff | tree

Anthony D'Atri [Thu, 26 Dec 2024 00:27:26 +0000 (19:27 -0500)]

Merge pull request #61178 from zdover23/wip-doc-2024-12-26-backport-61177-to-squid

doc: Fixes a typo in controllers section of hardware recommendations

commit | commitdiff | tree

Skalador [Wed, 25 Dec 2024 22:57:00 +0000 (23:57 +0100)]

doc: Fixes a typo in controllers section of hardware recommendations

Signed-off-by: Kevin Niederwanger <k.niederwanger@gmail.com>
(cherry picked from commit 089636224910e1cd6231cadd2c422a78c3d08fea)

commit | commitdiff | tree

Ilya Dryomov [Mon, 23 Dec 2024 22:05:46 +0000 (23:05 +0100)]

Merge pull request #61172 from idryomov/wip-69324-squid

squid: rbd: handle --{group,image}-namespace in "rbd group image {add,rm}"

Reviewed-by: Ramana Raja <rraja@redhat.com>

commit | commitdiff | tree

Ilya Dryomov [Mon, 23 Dec 2024 22:04:58 +0000 (23:04 +0100)]

Merge pull request #61168 from idryomov/wip-68998-squid

squid: librbd: avoid data corruption on flatten when object map is inconsistent

Reviewed-by: Ramana Raja <rraja@redhat.com>

commit | commitdiff | tree

Ilya Dryomov [Fri, 20 Dec 2024 10:16:58 +0000 (11:16 +0100)]

rbd: drop --pool option from "rbd group image {add,rm}"

It stopped working with removal of get_special_pool_group_names() in
commit 3e8624f157a1 ("rbd: add support for namespaces") over six years
ago. Given how much time has passed, stop accepting this option.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit 1f71671dc65fa9e35d451e55d8963d60f3198a93)

commit | commitdiff | tree

Ilya Dryomov [Tue, 17 Dec 2024 15:06:17 +0000 (16:06 +0100)]

rbd: handle --{group,image}-namespace in "rbd group image {add,rm}"

Currently only passing the namespace as part of the group or image spec
works. If --group-namespace or --image-namespace options are used, the
namespace isn't picked up.

Fixes: https://tracker.ceph.com/issues/69324
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit f35e3a6e9d93c2d2122c31d5eeb9fabaef89f2e1)

Conflicts:
src/tools/rbd/action/Group.cc [ "rbd group info" and "rbd group
snap info" commands not in squid ]

commit | commitdiff | tree

Ilya Dryomov [Tue, 17 Dec 2024 08:57:44 +0000 (09:57 +0100)]

test/librbd: add TestInternal.FlattenInconsistentObjectMap

Inject an object map with all possible inconsistencies before
flattening to ensure that something similar to commit 40af4f87b64f
("librbd: flatten operation should use object map") doesn't reappear
in a different form.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit ffcd90313b9dd6e5aab8df0f9a5335a69785133c)

commit | commitdiff | tree

Ilya Dryomov [Wed, 27 Nov 2024 10:11:16 +0000 (11:11 +0100)]

librbd: avoid data corruption on flatten when object map is inconsistent

By making flatten skip copyup in case the object is marked
OBJECT_EXISTS or OBJECT_EXISTS_CLEAN, commit 40af4f87b64f ("librbd:
flatten operation should use object map") introduced a critical
regression.  If the object map becomes inconsistent (e.g. because
flatten gets interrupted by killing "rbd flatten" process or a client
running on the clone crashes after updating the object map but before
writing to the image), the following attempt to flatten would corrupt
the clone if the copyup is actually still needed.

By design, it's impossible to tell whether the object is "known to
exist" based on the object map -- only telling whether the object is
"known to NOT exist" is possible (i.e. only OBJECT_NONEXISTENT state
is reliable).  Negating OBJECT_NONEXISTENT tells that the object "may
exist", not that the object is "known to exist".  This is reflected in
the name of object_may_exist() helper that was introduced together with
the object map implementation.  Something like object_may_not_exist()
simply can't be constructed given the rest of librbd.

This effectively reverts commits 4c86bccf07b8 ("librbd: add
object_may_not_exist helper") and 40af4f87b64f ("librbd: flatten
operation should use object map").

Fixes: https://tracker.ceph.com/issues/68998
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit 97ed3fced13dd48520ec9c165537ff0bbc7cbb64)

commit | commitdiff | tree

Casey Bodley [Thu, 19 Dec 2024 18:04:21 +0000 (13:04 -0500)]

Merge pull request #61070 from ivancich/wip-69209-squid

squid: rgw: optimize bucket listing to skip past regions of namespaced entries

Reviewed-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Thu, 19 Dec 2024 16:44:03 +0000 (08:44 -0800)]

Merge pull request #61138 from cbodley/wip-69308-squid

squid: rgw/lc: Fix lifecycle not working while bucket versioning is suspended

Reviewed-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Thu, 19 Dec 2024 16:43:26 +0000 (08:43 -0800)]

Merge pull request #61120 from ivancich/wip-69269-squid

squid: qa/rgw: force Hadoop to run under Java 1.8

Reviewed-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Thu, 19 Dec 2024 16:42:58 +0000 (08:42 -0800)]

Merge pull request #61119 from yuvalif/wip-yuval-fix-notif-test-regression

squid: test/rgw/notifications: fix test regression

Reviewed-by: J. Eric Ivancich <ivancich@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Thu, 19 Dec 2024 16:41:54 +0000 (08:41 -0800)]

Merge pull request #61101 from ivancich/wip-69256-squid

squid: qa/rgw: pull Apache artifacts from mirror instead of archive.apache.org

Reviewed-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Thu, 19 Dec 2024 16:40:41 +0000 (08:40 -0800)]

Merge pull request #61053 from smanjara/wip-69210-squid

squid: qa/rgw: fix s3 java tests by forcing gradle to run on Java 8

Reviewed-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

Zac Dover [Wed, 18 Dec 2024 21:44:56 +0000 (07:44 +1000)]

Merge pull request #60467 from zdover23/wip-doc-2024-10-24-backport-60465-to-squid

squid: doc/cephfs: edit "Disabling Volumes Plugin"

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

commit | commitdiff | tree

Trang Tran [Thu, 24 Oct 2024 10:49:20 +0000 (17:49 +0700)]

rgw/lc: Fix lifecycle not working while bucket versioning is suspended

Signed-off-by: Trang Tran <tttrangptit@gmail.com>
(cherry picked from commit e1f3344216011a61b65013aec26220f180f4cfaf)

commit | commitdiff | tree

Ilya Dryomov [Wed, 18 Dec 2024 08:00:10 +0000 (09:00 +0100)]

Merge pull request #61095 from idryomov/wip-69178-squid

squid: librbd/migration/HttpClient: avoid reusing ssl_stream after shut down

Reviewed-by: Ramana Raja <rraja@redhat.com>

commit | commitdiff | tree

Laura Flores [Tue, 17 Dec 2024 18:35:24 +0000 (12:35 -0600)]

Merge pull request #58144 from ljflores/wip-66463-squid

squid: test/neorados: remove depreciated RemoteReads cls test

commit | commitdiff | tree

J. Eric Ivancich [Mon, 16 Dec 2024 18:21:35 +0000 (13:21 -0500)]

qa/rgw: force Hadoop to run under Java 1.8

The Hadoop test installs Java 1.8 but then just runs the default
version. This makes sure it will run the version it installed.

Signed-off-by: J. Eric Ivancich <ivancich@redhat.com>
(cherry picked from commit c5503187af96dc0179265dc84b2716df851e4cdf)

commit | commitdiff | tree

Yuval Lifshitz [Tue, 17 Dec 2024 17:52:23 +0000 (17:52 +0000)]

squid: test/rgw/notifications: fix test regression

from commit: 05e2ade13292e7d924e6385ea234f58c13eb2d51

Signed-off-by: Yuval Lifshitz <ylifshit@ibm.com>

commit | commitdiff | tree

Ronen Friedman [Tue, 17 Dec 2024 13:51:48 +0000 (15:51 +0200)]

qa/standalone/scrub: fix TEST_periodic_scrub_replicated

A bogus change introduced as part of PR#54363 (commit
fbb7d73) changed multiple 'scrub' commands to 'scheduled-scrub'.
In this one instance - that was wrong.

Fixes: https://tracker.ceph.com/issues/69276
Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
(cherry picked from commit ca189fb511f18fc82c62702bf904201b6347395b)

commit | commitdiff | tree

afreen23 [Tue, 17 Dec 2024 15:20:59 +0000 (20:50 +0530)]

Merge pull request #60604 from rhcs-dashboard/wip-68811-squid

squid: mgr/dashboard: rm nvmeof conf based on its daemon name

Reviewed-by: Afreen Misbah <afreen@ibm.com>

commit | commitdiff | tree

Matan Breizman [Tue, 17 Dec 2024 13:41:53 +0000 (15:41 +0200)]

Merge pull request #61109 from NitzanMordhai/wip-68972-squid

squid: qa/suites/crimson-rados/perf: add ssh keys

Reviewed-by: Matan Breizman <mbreizma@redhat.com>

commit | commitdiff | tree

afreen23 [Tue, 17 Dec 2024 08:05:51 +0000 (13:35 +0530)]

Merge pull request #61096 from afreen23/wip-69201-squid

squid: mgr/dashboard: handle infinite values for pools

Reviewed-by: Afreen Misbah <afreen@ibm.com>

commit | commitdiff | tree

Nitzan Mordechai [Mon, 7 Oct 2024 10:21:58 +0000 (10:21 +0000)]

qa/suites/crimson-rados/perf: add ssh keys

cbt use ssh connection by defualt, without ssh_keys the task
won't generate public key and the cbt task will fail

Fixes: https://tracker.ceph.com/issues/68421
Signed-off-by: Nitzan Mordechai <nmordech@redhat.com>
(cherry picked from commit 471ebae9240192a4e143c00894e1736dd1921308)

commit | commitdiff | tree

J. Eric Ivancich [Fri, 13 Dec 2024 20:47:30 +0000 (15:47 -0500)]

qa/rgw: pull Apache artifacts from mirror instead of archive.apache.org

Currently maven and kafka are pulled from archive.apache.org. This
uses Apache's "closer" calculator to find a mirror to use instead.

Signed-off-by: J. Eric Ivancich <ivancich@redhat.com>
(cherry picked from commit 3aae66611dd7f05612056a757cb7a87dfcf95de0)

commit | commitdiff | tree

Afreen [Wed, 6 Mar 2024 20:22:16 +0000 (01:52 +0530)]

mgr/dashboard: handle infinite values for pools

Fixes https://tracker.ceph.com/issues/64724

Issue:
======
Json parsing is failing because of Infinity values present in pools
meteadata. "read_balance": {"score_acting": Infinity, "score_stable":
Infinity,}
Due to this entire pool list is not rendered.

Fix:
====
Added a handler for checking "inf" values and replacing them with a
string "Infinity" so that json parsing does not fail on frontend.

Signed-off-by: Afreen <afreen23.git@gmail.com>
(cherry picked from commit 82d100ad264c35d79262c1983a8005d8d4791855)

commit | commitdiff | tree

Ilya Dryomov [Thu, 12 Dec 2024 20:32:39 +0000 (21:32 +0100)]

librbd/migration/HttpClient: socket isn't shut down on some state transitions

If shut_down() gets delayed until a) the state transition from
STATE_RESET_CONNECTING completes and the reconnect is unsuccessful or
b) the state transition from STATE_RESET_DISCONNECTING completes (i.e.
next_state is STATE_UNINITIALIZED or STATE_RESET_CONNECTING), the
socket needs to be shut down before m_on_shutdown is invoked. The line
of thought here is the same as for the corresponding state transitions
that don't involve STATE_SHUTTING_DOWN.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit 88557dff2fe14c7df96992fbb0a5208979c20bf1)

commit | commitdiff | tree

Ilya Dryomov [Wed, 11 Dec 2024 15:25:13 +0000 (16:25 +0100)]

librbd/migration/HttpClient: avoid hitting an assert in advance_state()

If the shutdown gets delayed until the state transition from
STATE_RESET_CONNECTING completes and the reconnect is successful
(i.e. next_state is STATE_READY), we eventually hit "unexpected
state transition" assert in advance_state(). The reason is that
advance_state() would update m_state and call disconnect() under
STATE_READY instead of STATE_SHUTTING_DOWN. After the disconnect
maybe_finalize_shutdown() would enter advance_state() again with
STATE_SHUTDOWN as next_state, but the transition to that from
STATE_READY is invalid.

Plug this by not transitioning to next_state if current_state is
STATE_SHUTTING_DOWN.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit 1046d610e3d6852258e6c4bf0355d0d13fb197b4)

commit | commitdiff | tree

Ilya Dryomov [Mon, 9 Dec 2024 10:19:57 +0000 (11:19 +0100)]

librbd/migration/HttpClient: ignore stream_truncated when shutting down SSL

Propagate ec to handle_disconnect() and use it to suppress
stream_truncated errors.  Here is a quote from Beast documentation [1]:

  // Gracefully shutdown the SSL/TLS connection
  error_code ec;
  stream.shutdown(ec);
  // Non-compliant servers don't participate in the SSL/TLS shutdown process and
  // close the underlying transport layer. This causes the shutdown operation to
  // complete with a `stream_truncated` error. One might decide not to log such
  // errors as there are many non-compliant servers in the wild.
  if(ec != net::ssl::error::stream_truncated)
      log(ec);

... and a commit that made ignoring stream_truncated safe [2]:

  // ssl::error::stream_truncated, also known as an SSL "short read",
  // indicates the peer closed the connection without performing the
  // required closing handshake
  // [...]
  // When a short read would cut off the end of an HTTP message,
  // Beast returns the error beast::http::error::partial_message.
  // Therefore, if we see a short read here, it has occurred
  // after the message has been completed, so it is safe to ignore it.

[1] https://www.boost.org/doc/libs/develop/libs/beast/doc/html/beast/using_io/ssl_tls_shutdown.html
[2] https://github.com/boostorg/beast/commit/094f5ec5cb3be1c3ce2d985564f1f39e9bed74ff

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit 9fa0bcc67d79d90996cd4ec2b5af56d051ef6be7)

commit | commitdiff | tree

Ilya Dryomov [Sat, 7 Dec 2024 11:37:29 +0000 (12:37 +0100)]

librbd/migration/HttpClient: propagate ec to handle_handshake()

Get rid of get_callback_adapter() which only obfuscates the error:

handle_handshake: failed to complete SSL handshake: (337047686) Unknown error 337047686

vs

handle_handshake: failed to complete SSL handshake: certificate verify failed (SSL routines, tls_process_server_certificate)

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit e305a5908bd7bd3f2fa906af8521aea989f0c0ca)

commit | commitdiff | tree

Ilya Dryomov [Sat, 7 Dec 2024 12:52:41 +0000 (13:52 +0100)]

librbd/migration/HttpClient: drop SslHttpSession::m_ssl_enabled

The remaining callers of disconnect() call it only when m_ssl_enabled
is set to true (i.e. after the handshake is completed):

- shut_down(), in STATE_READY
- maybe_finalize_reset(), very shortly after transitioning out of
STATE_READY as part of performing a reset
- advance_state(), on a transition to STATE_READY that is intercepted
by a previously delayed shut down

m_ssl_enabled isn't used outside of disconnect() and on top of that
is never cleared.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit 8566224e9406abca42925f8045077141c2724bed)

commit | commitdiff | tree

Ilya Dryomov [Sat, 7 Dec 2024 11:22:52 +0000 (12:22 +0100)]

librbd/migration/HttpClient: don't call disconnect() in handle_handshake()

With m_ssl_enabled set to false, disconnect() is a no-op. Since
m_ssl_enabled is flipped to true only when the handshake succeeds,
calling disconnect() on "failed to complete handshake" error is bogus
(as would be attempting to shut down SSL there).

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit 331b5ea322537d380996ac6b250898ba474500be)

commit | commitdiff | tree

Ilya Dryomov [Fri, 6 Dec 2024 15:51:51 +0000 (16:51 +0100)]

librbd/migration/HttpClient: avoid reusing ssl_stream after shut down

ssl_stream objects can't be reused after shut down: despite
a successful reconnect and handshake, any attempt to read or write
fails with "end of stream" (beast.http:1) or "protocol is shutdown"
(asio.ssl:337690831) error respectively. This doesn't appear to be
documented, but Beast and ASIO authors both mention that the stream
must be destroyed and recreated [1][2].

This was missed because the only integration test with a big enough
image used http instead of https.

[1] https://github.com/boostorg/beast/issues/821#issuecomment-338354949
[2] https://github.com/chriskohlhoff/asio/issues/804#issuecomment-872746894

Fixes: https://tracker.ceph.com/issues/69178
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit 20885b11794ba80d5cddd178994865a83da7240f)

commit | commitdiff | tree

Ilya Dryomov [Fri, 6 Dec 2024 13:42:55 +0000 (14:42 +0100)]

librbd/migration/HttpClient: don't shut down socket in resolve_host()

resolve_host() is called from init() and issue() when transitioning out
of STATE_UNINITIALIZED and from advance_state() right after the call to
shutdown_socket(). In all three cases the socket should get closed, so
drop the redundant call and place asserts in connect() implementations
instead.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit 5699e4b871c66035c2c425bc9de7022fa2474c78)

commit | commitdiff | tree

Ilya Dryomov [Tue, 3 Dec 2024 15:20:16 +0000 (16:20 +0100)]

librbd/migration/HttpClient: fix some error messages in handle_receive()

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit d1a83f264ae97ec56004e6bf67a00f5753c6d611)

commit | commitdiff | tree

Ilya Dryomov [Tue, 3 Dec 2024 15:16:39 +0000 (16:16 +0100)]

librbd/migration/HttpClient: log ec details in handle_{issue,receive}()

Logging just the (negated) value makes tracking down the error category
and message unnecessarily hard.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit eb773496d53e924e6ff99ab1c3a9bcd36a8f81ea)

commit | commitdiff | tree

Ronen Friedman [Sun, 25 Aug 2024 17:05:11 +0000 (12:05 -0500)]

test/scrub: only instruct clean PGs to scrub

Recent changes to the scrub scheduling mechanism, especially
regarding the 'must_scrub' flag, cause operator scrub commands
issued on a not-clean PG to be rejected - and forgotten.

This commit changes the tests to issue a scrub command only
after the target PG is clean.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
(cherry picked from commit b01c0e9b50a2d10d2eabf2f0dcd7f53beeb66b35)

commit | commitdiff | tree

Anthony D'Atri [Fri, 13 Dec 2024 20:08:29 +0000 (15:08 -0500)]

Merge pull request #61080 from zdover23/wip-doc-2024-12-14-backport-61077-to-squid

squid: doc/cephfs: edit 3rd 3rd of mount-using-kernel-driver

commit | commitdiff | tree

Zac Dover [Fri, 13 Dec 2024 06:12:49 +0000 (16:12 +1000)]

doc/cephfs: edit 3rd 3rd of mount-using-kernel-driver

Edit the third third of doc/cephfs/mount-using-kernel-driver.rst in
preparation for correcting mount commands that may not work in Reef as
described in this documentation.

This commit edits only English-language strings in
doc/cephfs/mount-using-kernel-driver.rst. No technical content (that is,
no commands and no settings) have been altered in this commit.

Technical alterations to this file will be made only after the English
is unambiguous.

This PR follows the following two PRs:
https://github.com/ceph/ceph/pull/61048 - 1st 3rd
https://github.com/ceph/ceph/pull/61049 - 2nd 3rd

Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 9c7580a2935511d009c9e66885e76635aa504ee8)

commit | commitdiff | tree

J. Eric Ivancich [Fri, 22 Nov 2024 17:40:24 +0000 (12:40 -0500)]

rgw: optimize bucket listing to skip past regions of namespaced entries

When listing a bucket and the parameters are such that we're not
listing namespaced entries, this commit adds an optimization to
advance the marker such that we skip past a whole region of namespaced
entries rather than evaluating each entry one-by-one.

Signed-off-by: J. Eric Ivancich <ivancich@redhat.com>
(cherry picked from commit 91488321277d15b877d8b5c5a5204e18d7ab7fad)

commit | commitdiff | tree

Prashant D [Mon, 8 Aug 2022 14:55:23 +0000 (10:55 -0400)]

mon/LogMonitor: Use generic cluster log level config

We do not control the verbosity of the LogEntry
which is getting logged to stderr, graylog and
journald. This causes excessive flooding of logs
to /var/log, making a filesystem to fill up quickly.
Also we have different config variables namely
mon_cluster_log_file_level and mon_cluster_log_to_syslog_level
to control verbosity at cluster log file and
syslog level respectively. Add a generic cluster log
level config variable which controls cluster log
verbosity for all external entities.

Additionally, this patch addresses the regression of
`mon_cluster_log_file_level` option which doesn't take effect
because of code refactoring of LogMonitor::update_from_paxos
(commit : 7c84e06).

Fixes: https://tracker.ceph.com/issues/57061
Fixes: https://tracker.ceph.com/issues/57049
Signed-off-by: Prashant D <pdhange@redhat.com>
(cherry picked from commit cb0d4a57bd25d32919f7ed44815b4301f41e9508)

commit | commitdiff | tree

Yuri Weinstein [Thu, 12 Dec 2024 16:59:31 +0000 (08:59 -0800)]

Merge pull request #61026 from adk3798/wip-69184-squid

squid: qa/tasks/nvme_loop: update task to work with new nvme list format

Reviewed-by: Laura Flores <lflores@redhat.com>

Unnamed repository; edit this file 'description' to name the repository.