git.apps.os.sepia.ceph.com Git

]> git.apps.os.sepia.ceph.com Git - ceph.git/log

projects / ceph.git / log

commit | commitdiff | tree

Anthony D'Atri [Wed, 4 Sep 2024 11:41:34 +0000 (07:41 -0400)]

Merge pull request #59586 from zdover23/wip-doc-2024-09-04-backport-59577-to-reef

reef: doc/mds: improve wording

commit | commitdiff | tree

Piotr Parczewski [Tue, 3 Sep 2024 11:25:26 +0000 (13:25 +0200)]

doc/mds: improve wording

Signed-off-by: Piotr Parczewski <piotr@stackhpc.com>
(cherry picked from commit 332804bad58c892d01d2d2da557e42104365ef8a)

commit | commitdiff | tree

Anthony D'Atri [Mon, 2 Sep 2024 13:04:43 +0000 (09:04 -0400)]

Merge pull request #59560 from zdover23/wip-doc-2024-09-02-backport-59556-to-reef

reef: doc: Correct link to Prometheus docs

commit | commitdiff | tree

Matthew Vernon [Mon, 2 Sep 2024 09:16:36 +0000 (10:16 +0100)]

doc: Correct link to Prometheus docs

The link is to the `#http_sd_config` anchor in the prometheus config docs; that link only works without the trailing `/`.

This correction would ideally get backported to at least reef & squid.

Signed-off-by: Matthew Vernon <mvernon@wikimedia.org>
(cherry picked from commit 84a30ba6b94b34806faac8217ccaa299c9ee68d6)

commit | commitdiff | tree

Anthony D'Atri [Sun, 1 Sep 2024 15:16:03 +0000 (11:16 -0400)]

Merge pull request #59549 from zdover23/wip-doc-2024-09-01-backport-59544-to-reef

reef: doc: update tests-integration-testing-teuthology-workflow.rst

commit | commitdiff | tree

Vallari Agrawal [Sat, 31 Aug 2024 14:27:25 +0000 (19:57 +0530)]

doc: update tests-integration-testing-teuthology-workflow.rst

* add "Infrastructure" section.

* move "Naming the ceph-ci branch" section under
   "Getting binaries - Build Ceph". Also mention
   about centos9-only trick.

* in "Teuthology Archives", mention about developer
   playground machines and ceph log files.

Signed-off-by: Vallari Agrawal <val.agl002@gmail.com>
(cherry picked from commit 9bfcb8e17db8c61e523e10856d12b237433d831a)

commit | commitdiff | tree

Anthony D'Atri [Sat, 31 Aug 2024 14:29:17 +0000 (10:29 -0400)]

Merge pull request #59541 from zdover23/wip-doc-2024-08-31-backport-59528-to-reef

reef: doc/ceph-volume: add spillover fix procedure

commit | commitdiff | tree

Zac Dover [Fri, 30 Aug 2024 11:16:57 +0000 (21:16 +1000)]

doc/ceph-volume: add spillover fix procedure

Add a procedure that explains how, after an upgrade, to move bytes that
have spilled over to a relatively slow device back to the faster device.

This procedure was developed by Chris Dunlop on the [ceph-users] mailing
list, here: https://lists.ceph.io/hyperkitty/list/ceph-users@ceph.io/message/POPUFSZGXR3P2RPYPJ4WJ4HGHZ3QESF6/

Eugen Block requested the addition of this procedure to the
documentation on 30 Aug 2024.

Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 98618aaa1c8b786c7d240a210b62cc737fdb048d)

commit | commitdiff | tree

Adam King [Thu, 29 Aug 2024 11:40:02 +0000 (07:40 -0400)]

Merge pull request #59462 from adk3798/wip-66428-reef

reef: mgr/cephadm: make SMB and NVMEoF upgrade last in staggered upgrade

Reviewed-by: Guillaume Abrioux <gabrioux@ibm.com>

commit | commitdiff | tree

Adam King [Thu, 29 Aug 2024 11:39:22 +0000 (07:39 -0400)]

Merge pull request #59461 from adk3798/wip-66426-reef

reef: cephadm: CephExporter doesn't bind to IPv6 in dual stack

Reviewed-by: Guillaume Abrioux <gabrioux@ibm.com>

commit | commitdiff | tree

Adam King [Thu, 29 Aug 2024 11:39:07 +0000 (07:39 -0400)]

Merge pull request #59460 from adk3798/wip-65969-reef

reef: mgr/cephadm: make setting --cgroups=split configurable for adopted daemons

Reviewed-by: Guillaume Abrioux <gabrioux@ibm.com>

commit | commitdiff | tree

Adam King [Thu, 29 Aug 2024 11:37:39 +0000 (07:37 -0400)]

Merge pull request #59455 from adk3798/wip-65723-reef

reef: cephadm: have agent check for errors before json loading mgr response

Reviewed-by: Guillaume Abrioux <gabrioux@ibm.com>

commit | commitdiff | tree

Adam King [Thu, 29 Aug 2024 11:36:33 +0000 (07:36 -0400)]

Merge pull request #57519 from asm0deuz/backport_PR54158

reef: cephadm: added check for `--skip-firewalld` to section on adding explicit Ports to firewalld

Reviewed-by: Adam King <adking@redhat.com

commit | commitdiff | tree

Adam King [Thu, 29 Aug 2024 11:35:37 +0000 (07:35 -0400)]

Merge pull request #57234 from adk3798/wip-65763-reef

reef: mgr/cephadm: set OSD cap for NVMEoF daemon to "profile rbd"

Reviewed-by: Guillaume Abrioux <gabrioux@ibm.com>

commit | commitdiff | tree

Adam King [Thu, 29 Aug 2024 11:34:46 +0000 (07:34 -0400)]

Merge pull request #56909 from adk3798/wip-65383-reef

reef: mgr/cephadm: Allows enabling NFS Ganesha NLM

Reviewed-by: Guillaume Abrioux <gabrioux@ibm.com>

commit | commitdiff | tree

Adam King [Thu, 29 Aug 2024 11:33:41 +0000 (07:33 -0400)]

Merge pull request #56490 from adk3798/wip-64991-reef

reef: cephadm: fix `cephadm shell --name <daemon-name>` for stopped/failed daemon

Reviewed-by: John Mulligan <jmulligan@redhat.com>

commit | commitdiff | tree

Kamoltat (Junior) Sirivadhna [Thu, 29 Aug 2024 02:47:22 +0000 (22:47 -0400)]

Merge pull request #59268 from k0ste/wip-64671-reef

reef: qa/tasks/ceph_manager.py: Rewrite test_pool_min_size
Reviewed-by: Kamoltat Sirivadhna <ksirivad@redhat.com>

commit | commitdiff | tree

Adam King [Thu, 2 May 2024 17:35:41 +0000 (13:35 -0400)]

mgr/cephadm: make SMB and NVMEoF upgrade last in staggered upgrade

This needs to happen as some work on the NVMEoF side (still unmerged
as of writing this) will make the NVMEoF daemon dependent on the mon.
Prior to this patch, in a staggered upgrade, all daemons not using the
ceph image were upgraded after the mgr since we typically only care
about the default image changing or potential changes to how we handle
our systemd units which only needs the mgr to be upgraded to be applied.
This NVMEoF dependency on the mon changes this and we can no longer
upgrade it directly after the mgr. This patch changes it so the NVMEoF
daemon is instead upgraded after all ceph image daemons have been
upgraded in a staggered upgrade scenario. Non-staggered upgrades
are unaffected as the NVMEoF daemon was already upgraded near the
end in that scenario. The SMB dameon has no reason it needs to be
upgraded later, but it's in the (small) pool of daemons that don't
use the ceph image and aren't for monitoring, so it's been affected
by this as well.

NOTE: This is a bit of an ugly patch imo and shows that a refactoring
of the upgrade code is likely required. Hopefully this patch is more
of a stopgap until that larger effort can be made

Fixes: https://tracker.ceph.com/issues/65809
Signed-off-by: Adam King <adking@redhat.com>
(cherry picked from commit 5e7a3c2147d87c1fc5be71acbadedefb70e024bf)

commit | commitdiff | tree

Mouratidis Theofilos [Fri, 10 May 2024 10:17:12 +0000 (12:17 +0200)]

Fix CephExporter protocol bind logic

In a dual stack configuration ceph-exporter binds to ipv4 only and the metrics fail in ipv6

Signed-off-by: Mouratidis Theofilos <mtheofilos@gmail.com>
(cherry picked from commit 110bc665078fe19c31e3680c4197587e69e4e751)

Conflicts:
src/cephadm/cephadmlib/daemons/ceph.py

commit | commitdiff | tree

Gilad Sid [Wed, 1 May 2024 14:55:41 +0000 (17:55 +0300)]

cephadm: Adding support to pass --no-cgroups-split flag when adopting legacy daemons

Signed-off-by: Gilad Sid <sid.gilad@gmail.com>
(cherry picked from commit 20ffd4d6e330095c8cf2816a36f61bd950e213a5)

commit | commitdiff | tree

Adam King [Wed, 17 Apr 2024 15:36:12 +0000 (11:36 -0400)]

cephadm: have agent check for errors before json loading mgr response

Currently, since it tries to json.loads the response
payload before checking the return code, if there was
an error it fails with

Failed to send metadata to mgr: the JSON object must be str, bytes or bytearray, not ConnectionRefusedError

which is masking the actual failure.

Also adds more context to the RuntimeError raised

Fixes: https://tracker.ceph.com/issues/65553
Signed-off-by: Adam King <adking@redhat.com>
(cherry picked from commit 287bd34eec09815602700747c62e0a709e6e5ff0)

commit | commitdiff | tree

Anthony D'Atri [Mon, 26 Aug 2024 13:23:23 +0000 (09:23 -0400)]

Merge pull request #59431 from zdover23/wip-doc-2024-08-26-backport-59428-to-reef

reef: doc/cephadm: how to get exact size_spec from device

commit | commitdiff | tree

Zac Dover [Sun, 25 Aug 2024 20:03:34 +0000 (06:03 +1000)]

doc/cephadm: how to get exact size_spec from device

Add instructions for retrieving the exact size of block devices.

Fixes: https://tracker.ceph.com/issues/66754
Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit d00d1b52d50b5575d918c3be7b7a8249ef31f0a8)

commit | commitdiff | tree

Anthony D'Atri [Sun, 25 Aug 2024 03:07:02 +0000 (23:07 -0400)]

Merge pull request #59425 from zdover23/wip-doc-2024-08-25-backport-59418-to-reef

reef: doc/glossary: add "object storage"

commit | commitdiff | tree

Zac Dover [Fri, 23 Aug 2024 12:36:16 +0000 (22:36 +1000)]

doc/glossary: add "object storage"

Add a (very basic) definition of object storage.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 43057b88538e868b817acb04d5e6c4e95b4c716e)

commit | commitdiff | tree

Yuri Weinstein [Fri, 23 Aug 2024 22:04:04 +0000 (15:04 -0700)]

Merge pull request #57625 from sajibreadd/wip-65938-reef

reef: os/bluestore: set rocksdb iterator bounds for Bluestore::_collection_list()

Reviewed-by: Igor Fedotov <ifedotov@suse.com>

commit | commitdiff | tree

Yuri Weinstein [Fri, 23 Aug 2024 22:03:21 +0000 (15:03 -0700)]

Merge pull request #57621 from sajibreadd/wip-66145-reef

reef: osd: CEPH_OSD_OP_FLAG_BYPASS_CLEAN_CACHE flag is passed from ECBackend

Reviewed-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Fri, 23 Aug 2024 22:01:20 +0000 (15:01 -0700)]

Merge pull request #55110 from k0ste/wip-63977-reef

reef: mgr/BaseMgrModule: Optimize CPython Call in Finish Function

Reviewed-by: Nitzan Mordechai <nmordech@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Fri, 23 Aug 2024 22:00:12 +0000 (15:00 -0700)]

Merge pull request #53269 from YiteGu/backport-always-generate-random-nonce

reef: msg: always generate random nonce; don't try to reuse PID

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>

commit | commitdiff | tree

Adam King [Fri, 23 Aug 2024 18:00:14 +0000 (14:00 -0400)]

Merge pull request #59411 from adk3798/wip-67682-reef

reef: mgr/cephadm: add "original_weight" parameter to OSD class

Reviewed-by: John Mulligan <jmulligan@redhat.com>

commit | commitdiff | tree

Zac Dover [Fri, 23 Aug 2024 11:36:20 +0000 (21:36 +1000)]

Merge pull request #59381 from zdover23/wip-doc-2024-08-21-backport-59348-to-reef

reef: doc/rados: document unfound object cache-tiering scenario

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

commit | commitdiff | tree

Adam King [Mon, 19 Aug 2024 16:30:24 +0000 (12:30 -0400)]

mgr/cephadm: add "original_weight" parameter to OSD class

Fixes: https://tracker.ceph.com/issues/67329
Signed-off-by: Adam King <adking@redhat.com>
(cherry picked from commit 04330f5df92994882efcd4879d5c37279138e97b)

commit | commitdiff | tree

Yuri Weinstein [Thu, 22 Aug 2024 15:48:13 +0000 (08:48 -0700)]

Merge pull request #59075 from tobias-urdin/reef-keystone-admin-token

reef: rgw: invalidate and retry keystone admin token

Reviewed-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Thu, 22 Aug 2024 15:47:36 +0000 (08:47 -0700)]

Merge pull request #59018 from Svelar/wip-67072-reef

reef: rgw/amqp: lock erase and create connection before emplace

Reviewed-by: Yuval Lifshitz <ylifshit@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Thu, 22 Aug 2024 15:46:49 +0000 (08:46 -0700)]

Merge pull request #59056 from yuvalif/wip-67363-reef

reef: common/dout: fix FTBFS on GCC 14

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Thu, 22 Aug 2024 15:46:15 +0000 (08:46 -0700)]

Merge pull request #57197 from k0ste/wip-63315-reef

reef: os/bluestore: fix crash caused by dividing by 0

Reviewed-by: Igor Fedotov <ifedotov@suse.com>

commit | commitdiff | tree

Yuri Weinstein [Thu, 22 Aug 2024 15:45:44 +0000 (08:45 -0700)]

Merge pull request #57194 from k0ste/wip-64590-reef

reef: os/bluestore: fix the problem of l_bluefs_log_compactions double recording

Reviewed-by: Igor Fedotov <ifedotov@suse.com>

commit | commitdiff | tree

Yuri Weinstein [Thu, 22 Aug 2024 15:45:08 +0000 (08:45 -0700)]

Merge pull request #56813 from Matan-B/wip-65305-reef

reef: osd/SnapMapper: fix _lookup_purged_snap

Reviewed-by: Ronen Friedman <rfriedma@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Thu, 22 Aug 2024 15:44:36 +0000 (08:44 -0700)]

Merge pull request #56431 from Matan-B/wip-65096-reef

reef: mon/OSDMonitor: fix rmsnap command

Reviewed-by: Ronen Friedman <rfriedma@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Thu, 22 Aug 2024 15:43:59 +0000 (08:43 -0700)]

Merge pull request #55778 from ifed01/wip-ifed-fix-63795-reef

reef: test/store_test: fix deferred writing test cases

Reviewed-by: Adam Kupczyk <akupczyk@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Thu, 22 Aug 2024 15:43:16 +0000 (08:43 -0700)]

Merge pull request #55220 from ifed01/wip-ifed-cache-ratios

reef: osd: make _set_cache_sizes ratio aware of cache_kv_onode_ratio

Reviewed-by: Mark Nelson <mnelson@redhat.com>
Reviewed-by: Adam Kupczyk <akupczyk@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Thu, 22 Aug 2024 15:42:03 +0000 (08:42 -0700)]

Merge pull request #58312 from cbodley/wip-66710-reef

reef: rgw/notifications/test: fix rabbitmq and kafka issues in centos9

Reviewed-by: Yuval Lifshitz <ylifshit@redhat.com>

commit | commitdiff | tree

Samuel Just [Wed, 21 Aug 2024 21:16:44 +0000 (14:16 -0700)]

Merge pull request #58846 from idryomov/wip-58120-reef

reef: osd: avoid watcher remains after "rados watch" is interrupted

Reviewed-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Zac Dover [Tue, 20 Aug 2024 12:45:29 +0000 (22:45 +1000)]

doc/rados: document unfound object cache-tiering scenario

Explain how to deal with "unfound objects" when restarting OSDs in a
cache-tiered environment.

Fixes: https://tracker.ceph.com/issues/44286
Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit f01d7a8d5b85170c034acb962b9833913853a1c5)

commit | commitdiff | tree

Yuri Weinstein [Mon, 19 Aug 2024 20:42:57 +0000 (13:42 -0700)]

Merge pull request #58513 from k0ste/wip-66890-reef

reef: mgr/Mgr.cc: clear daemon health metrics instead of removing down/out osd from daemon state

Reviewed-by: Aishwarya Mathuria <amathuri@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Mon, 19 Aug 2024 20:42:06 +0000 (13:42 -0700)]

Merge pull request #57487 from ljflores/wip-65014-reef

reef: qa/suites/rados/singleton: add POOL_APP_NOT_ENABLED to ignorelist

Reviewed-by: Sridhar Seshasayee <sseshasa@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Mon, 19 Aug 2024 20:41:12 +0000 (13:41 -0700)]

Merge pull request #57408 from k0ste/wip-62927-reef

reef: mon: stuck peering since warning is misleading

Reviewed-by: Laura Flores <lflores@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Mon, 19 Aug 2024 20:40:23 +0000 (13:40 -0700)]

Merge pull request #57402 from k0ste/wip-65916-reef

reef: kv/RocksDBStore: Configure compact-on-deletion for all CFs

Reviewed-by: Igor Fedotov <ifedotov@suse.com>

commit | commitdiff | tree

Anthony D'Atri [Mon, 19 Aug 2024 12:54:11 +0000 (05:54 -0700)]

Merge pull request #59295 from zdover23/wip-doc-2024-08-19-backport-59256-to-reef

doc/cephfs: s/mountpoint/mount point/

commit | commitdiff | tree

Zac Dover [Sat, 17 Aug 2024 03:37:58 +0000 (13:37 +1000)]

commit | commitdiff | tree

Anthony D'Atri [Sat, 17 Aug 2024 21:20:43 +0000 (14:20 -0700)]

Merge pull request #59287 from zdover23/wip-doc-2024-08-18-backport-59257-to-reef

reef: doc/cephfs: s/mountpoint/mount point/

commit | commitdiff | tree

Zac Dover [Sat, 17 Aug 2024 03:44:30 +0000 (13:44 +1000)]

doc/cephfs: s/mountpoint/mount point/

Change the string "mountpoint" to "mount point" in English-language
strings (as opposed to in commands, where the string "mountpoint"
sometimes appears and is correct).

cf. https://github.com/ceph/ceph/pull/58908#discussion_r1697715486 in
which page 345 of The IBM Style Guide is referenced to back up this
change.

This commit alters only English-language text and example commands in
which the string "{mount point}" is meant to be replaced. No commands
meant for cutting-and-pasting have been altered in this commit.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit a0f81cfb5094164630f55a717efbbcdce45bce58)

commit | commitdiff | tree

Kamoltat [Thu, 19 Oct 2023 15:57:39 +0000 (15:57 +0000)]

qa/tasks/ceph_manager.py: Added more loggings for all_active_or_peered()

Signed-off-by: Kamoltat <ksirivad@redhat.com>
(cherry picked from commit 9762656160c9ae12d06b29a3e8a8d0dd13847328)

commit | commitdiff | tree

Kamoltat [Wed, 18 Oct 2023 22:52:20 +0000 (22:52 +0000)]

qa/tasks/ceph_manager.py: Rewrite test_pool_min_size

Problem:

Failed the test in EC Pool configuration because PGs are
not going into active+clean (our fault for over thrashing and checking the wrong thing).
Also, PG would not go into active because we thrash below min_size
in an EC pool config, not enough shards in the acting set.
Therefore, failed the wait_for_recovery check.
Moreover, When we revive osds, we didn't add the osd back in the cluster,
this messes up true count for live_osds in the test.

Solution:

Instead of randomly choosing OSDs to thrash,
we randomly select a PG from each pool and
thrash the OSDs in the PG's acting set until
we reach min_size, then we check to see if the
PG is still active. After that we revive all
the OSDs to see if the PG recovered cleanly.

We removed some of the unnecessary part such
as `min_dead`, `min_live`, `min_out` and etc.

Also, we refractored the part of where we are
assigning k,m for the EC pools so that we get
better code readablility.

Fixes: Fixes: https://tracker.ceph.com/issues/59172
Signed-off-by: Kamoltat <ksirivad@redhat.com>
(cherry picked from commit 8c4768ecb3ec38c8fce209eae9fe931e974d0495)

commit | commitdiff | tree

Kamoltat [Thu, 28 Sep 2023 18:03:45 +0000 (18:03 +0000)]

qa/tasks/rados.py: Allow rados task to override config

Problem:

Currently, no option override the config in rados task.

Solution:

Enable override of the config file in rados task.

Signed-off-by: Kamoltat <ksirivad@redhat.com>
(cherry picked from commit 92bf1a8aa8d0d208577c4076d4a86644c01548d5)

commit | commitdiff | tree

Kamoltat [Mon, 25 Sep 2023 21:29:35 +0000 (21:29 +0000)]

qa/tasks/ceph_manager.py: init test_min_size_duration

Added comment about test_min_size_duration
in qa/tasks/thrashosds.

But also use the variable in ceph_manager.py

Signed-off-by: Kamoltat <ksirivad@redhat.com>
(cherry picked from commit 9f19dffc93463513e03908f3506c62e65364c0cd)

commit | commitdiff | tree

Kamoltat [Thu, 17 Aug 2023 20:01:38 +0000 (20:01 +0000)]

qa/suites/rados: Added wait_for_all_active_clean_pgs flag

Added flag to not allow rados suite to delete
the pool unless all pgs are active+clean
and all OSDs are up in the thrashosds side
of the test.

Fixes: https://tracker.ceph.com/issues/59172
Signed-off-by: Kamoltat <ksirivad@redhat.com>
(cherry picked from commit 3ccd10f266cfd7ec6dd1ad930598bfe4ca422a90)

commit | commitdiff | tree

Anthony D'Atri [Fri, 16 Aug 2024 22:53:43 +0000 (15:53 -0700)]

Merge pull request #59251 from zdover23/wip-doc-2024-08-16-backport-59167-to-reef

reef: doc/cephfs: improve "layout fields" text

commit | commitdiff | tree

Zac Dover [Mon, 12 Aug 2024 12:38:14 +0000 (22:38 +1000)]

doc/cephfs: improve "layout fields" text

Improve "layout fields" text in doc/cephfs/file-layouts.rst, as suggesed
by Anthony D'Atri in these comments:

https://github.com/ceph/ceph/pull/59021#discussion_r1704108581
https://github.com/ceph/ceph/pull/59021#discussion_r1704112320

Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 0949b410433837f0509fb73169fa7f22c8f6c256)

commit | commitdiff | tree

Zac Dover [Fri, 16 Aug 2024 09:26:41 +0000 (19:26 +1000)]

Merge pull request #59022 from zdover23/wip-doc-2024-08-05-backport-58891-to-reef

reef: doc/cephfs: edit "Layout Fields" text

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

commit | commitdiff | tree

Anthony D'Atri [Thu, 15 Aug 2024 23:27:15 +0000 (16:27 -0700)]

Merge pull request #59234 from zdover23/wip-doc-2024-08-15-backport-59219-to-reef

reef: doc/rgw/notification: persistent notification queue full behavior

commit | commitdiff | tree

Ilya Dryomov [Thu, 15 Aug 2024 22:27:34 +0000 (00:27 +0200)]

Merge pull request #59231 from idryomov/wip-67353-reef

reef: qa: adjust expected io_opt in krbd_discard_granularity.t

Reviewed-by: Ramana Raja <rraja@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Thu, 15 Aug 2024 14:29:56 +0000 (07:29 -0700)]

Merge pull request #59151 from idryomov/wip-53674-reef

reef: librbd/crypto: fix issue when live-migrating from encrypted export

Reviewed-by: Ramana Raja <rraja@redhat.com>

commit | commitdiff | tree

Yuval Lifshitz [Wed, 14 Aug 2024 11:02:09 +0000 (11:02 +0000)]

doc/rgw/notification: persistent notification queue full behavior

Fixes: https://tracker.ceph.com/issues/50610
Signed-off-by: Yuval Lifshitz <ylifshit@ibm.com>
(cherry picked from commit d12ba11741dc749bcce315bf467078595fa95b24)

commit | commitdiff | tree

Ilya Dryomov [Thu, 8 Aug 2024 20:01:47 +0000 (22:01 +0200)]

qa: cover a custom object size in krbd_discard_granularity.t

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit e8126bf2f6411069da5074ac3a5a2ea16c0bba0c)

commit | commitdiff | tree

Ilya Dryomov [Thu, 8 Aug 2024 19:50:40 +0000 (21:50 +0200)]

qa: adjust expected io_opt in krbd_discard_granularity.t

With linux.git commit a00d4bfce7c6 ("rbd: increase io_opt again"),
io_opt is set to object set size.

Fixes: https://tracker.ceph.com/issues/67353
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit 144270eb737850159614abd63c261baaa83a8afc)

commit | commitdiff | tree

Anthony D'Atri [Wed, 14 Aug 2024 13:18:34 +0000 (06:18 -0700)]

Merge pull request #59215 from zdover23/wip-doc-2024-08-14-backport-59168-to-reef

reef: doc/cephfs: improve cache-configuration.rst

commit | commitdiff | tree

Zac Dover [Mon, 12 Aug 2024 12:47:08 +0000 (22:47 +1000)]

doc/cephfs: improve cache-configuration.rst

Improve the text in the section about dealing with cache-pressure alerts
that was added in https://github.com/ceph/ceph/pull/59077. The changes
in this commit were suggested by Anthony D'Atri.

Co-authored-by: Patrick Donnelly <pdonnelly@redhat.com>
Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit aa3bdae2314fef2fca8fc12dca006af657235e17)

commit | commitdiff | tree

Ilya Dryomov [Fri, 2 Aug 2024 07:27:42 +0000 (09:27 +0200)]

librbd/migration: make ImageDispatch handle encryption for non-native formats

With NativeFormat now being handled via dispatch, handling encryption
for non-native formats (i.e. mapping to raw image extents and performing
decryption/mapping back on completion) in the migration layer is really
straightforward.

Note that alignment doesn't need to be performed in the migration layer
because it happens on the destination image -- the "align and resubmit"
logic in C_UnalignedObjectReadRequest should kick in before the call to
read_parent().

Fixes: https://tracker.ceph.com/issues/53674
Co-authored-by: Or Ozeri <oro@il.ibm.com>
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit 0000c3447407772039121bb4499f243df1c889da)

Conflicts:
src/librbd/migration/ImageDispatch.cc [ commit 20aee5bbbcb5
("neorados: Make IOContext getters/setters less weird") not
in reef ]

commit | commitdiff | tree

Ilya Dryomov [Mon, 29 Jul 2024 09:01:17 +0000 (11:01 +0200)]

librbd: don't make an extra copy of image_extents in C_ImageReadRequest ctor

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit b20a897061feabc4e22c339c4e7a8aa5155151e8)

commit | commitdiff | tree

Ilya Dryomov [Tue, 6 Aug 2024 11:24:02 +0000 (13:24 +0200)]

qa/workunits/rbd: perform cleanup in test_clone_encryption()

... so that RAW_DEV can be unmapped and future tests can reuse testimg
and other image names without bumping into watchers and older snapshots.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit 82d26909cb12b62d88f69f90eb8af692e497bddb)

commit | commitdiff | tree

Ilya Dryomov [Sat, 3 Aug 2024 17:31:03 +0000 (19:31 +0200)]

qa/workunits/rbd: no need to chmod in luks-encryption.sh

Most workunits expect the user to be a member of "disk" group, so we
can pretty much rely on that being the case at this point.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit 830cbee7a5f875af04f335266b02ad96e4cd71c4)

commit | commitdiff | tree

Ilya Dryomov [Fri, 26 Jul 2024 14:54:31 +0000 (16:54 +0200)]

librbd/migration: make FormatInterface::read() void again

Now that NativeFormat is handled via dispatch, FormatInterface::read()
can be void again for consistency with FormatInterface::list_snaps().

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit b6c7f69b8044f9206aa982c2aba6466c49fd2bea)

Conflicts:
src/librbd/migration/ImageDispatch.cc [ commit 20aee5bbbcb5
("neorados: Make IOContext getters/setters less weird") not
in reef ]

commit | commitdiff | tree

Ilya Dryomov [Fri, 26 Jul 2024 10:13:08 +0000 (12:13 +0200)]

librbd/migration: close source image in OpenSourceImageRequest

Currently, on errors in FormatInterface::open(), RawFormat disposes
of src_image_ctx, but QCOWFormat doesn't, which is a leak. Rather than
having each format do it internally, do it in OpenSourceImageRequest.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit 63159d6b431470f5edc4b110cebf46865c550689)

commit | commitdiff | tree

Ilya Dryomov [Thu, 18 Jul 2024 16:11:18 +0000 (18:11 +0200)]

librbd/migration: don't instantiate NativeFormat, handle it via dispatch

Trying to shoehorn NativeFormat under FormatInterface doesn't really
work.  It fundamentally doesn't fit in:

- Unlike for RawFormat and QCOWFormat, src_image_ctx for NativeFormat
  is not dummy -- it's an ImageCtx for a real RBD image.  Pre-creating
  it in OpenSourceImageRequest with the expectation that placeholder
  values would be overridden later forces NativeFormat to reach into
  ImageCtx guts, duplicating the logic in the constructor.  This also
  necessitates calling snap_set() in a separate step, since snap_id
  isn't known at the time ImageCtx is created.

- Unlike for RawFormat and QCOWFormat, get_image_size() and
  get_snapshots() implementations for NativeFormat are dummy.

- read() and list_snaps() implementations for NativeFormat are
  inconsistent: read() passes through io::ImageDispatch layer, but
  list_snaps() doesn't.  Both can be passing through, meaning that in
  essence these are also dummy.

All of this is with today's code.  Additional complications arise with
planned support for migrating from external clusters where src_image_ctx
would require more invasive patching to "move" to an IoCtx belonging to
an external cluster's CephContext and also with other work.

With the above in mind, NativeFormat actually consists of:

1. Code that parses the "type: native" source spec
2. Code that patches ImageCtx, working around the fact that it's
   pre-created in OpenSourceImageRequest
3. A bunch of dummy implementations for FormatInterface

With this change, (1) is wrapped into a static method that also creates
ImageCtx after all required parameters are known and (2) and (3) go away
entirely.  NativeFormat no longer implements FormatInterface and doesn't
get instantiated at all.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit cacf7ca941876f64f9a04867ffc6cdcb484d89b9)

commit | commitdiff | tree

Ilya Dryomov [Wed, 17 Jul 2024 19:11:51 +0000 (21:11 +0200)]

librbd/migration/NativeFormat: refactor source spec parsing

In preparation for not instantiating NativeFormat and losing a copy of
the source spec JSON object in m_json_object, refactor the parsing code
to use only const methods (which std::map's operator[] isn't) and local
variables where possible.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit 3bbf1f5ddbaa4a8c252d70a384e23852f0c537c1)

commit | commitdiff | tree

Ilya Dryomov [Wed, 17 Jul 2024 18:05:08 +0000 (20:05 +0200)]

librbd/migration/NativeFormat: do pool lookup instead of creating io_ctx

A Rados instance is sufficient to map the pool name to the pool ID,
no need to involve an IoCtx instance as well. While at it, report
distinctive errors for a non-existing pool and an invalid JSON value
for pool_name key cases.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit 1ba9a32598f50073b574b4649736d76b678a1c58)

commit | commitdiff | tree

Ilya Dryomov [Wed, 17 Jul 2024 13:06:33 +0000 (15:06 +0200)]

librbd/migration: make SourceSpecBuilder::parse_source_spec() static

In preparation for divorcing NativeFormat from FormatInterface and
changing when/how src_image_ctx is created, make parse_source_spec()
independent of src_image_ctx. The "invalid source-spec JSON" error is
duplicated by the "failed to parse migration source-spec" error, so
just get rid of the former to spare having to pass CephContext to
parse_source_spec().

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit f172fb97be9a6129be7cdbaa87346dc6c8e8ccb1)

commit | commitdiff | tree

Ilya Dryomov [Tue, 30 Jul 2024 20:56:17 +0000 (22:56 +0200)]

librbd/migration/OpenSourceImageRequest: rename io_ctx -> dst_io_ctx

For now, this is just slightly clearer. The distinction would become
important with planned support for migrating from external clusters.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit c14356b1f9eea0988e071f40dc0df005f70edd4d)

commit | commitdiff | tree

Ilya Dryomov [Sun, 14 Jul 2024 17:48:33 +0000 (19:48 +0200)]

librbd/migration: massage some error messages

Add missing spaces, don't use the word stream when reporting errors
on POSIX file operations (open() and lseek64()) and fix a cut-and-paste
typo in RawSnapshot.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit 789df2ce38f35ffb3e86974e02868e5fff71e72c)

commit | commitdiff | tree

Ilya Dryomov [Sun, 14 Jul 2024 17:21:47 +0000 (19:21 +0200)]

librbd/api: clean up leftovers in Migration::prepare_import()

Dead code after return and an unused variable.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit b92ad49a35536fff87d840ffbf171aee61b05424)

commit | commitdiff | tree

Anthony D'Atri [Sun, 11 Aug 2024 15:37:44 +0000 (08:37 -0700)]

Merge pull request #59149 from zdover23/wip-doc-2024-08-11-backport-59077-to-reef

reef: doc/cephfs: add cache pressure information

commit | commitdiff | tree

Yuri Weinstein [Sun, 11 Aug 2024 15:03:00 +0000 (08:03 -0700)]

Merge pull request #58853 from idryomov/wip-67051-reef

reef: qa/workunits/rbd: avoid caching effects in luks-encryption.sh

Reviewed-by: Ramana Raja <rraja@redhat.com>
Reviewed-by: Mykola Golub <mgolub@suse.com>

commit | commitdiff | tree

Yuri Weinstein [Sun, 11 Aug 2024 15:01:57 +0000 (08:01 -0700)]

Merge pull request #58540 from idryomov/wip-66886-reef

reef: qa: account for rbd_trash object in krbd_data_pool.sh + related ceph{,adm} task fixes

Reviewed-by: Ramana Raja <rraja@redhat.com>
Reviewed-by: Adam King adking@redhat.com

commit | commitdiff | tree

Zac Dover [Wed, 7 Aug 2024 13:11:11 +0000 (23:11 +1000)]

doc/cephfs: add cache pressure information

Add information to doc/cephfs/cache-configuration.rst about how to deal
with a message that reads "clients failing to respond to cache
pressure". This procedure explains how to slow the growth of the
recall_caps value so that it does not exceed the
mds_recall_warning_threshold.

The information in this commit was developed by Eugen Block. See
https://lists.ceph.io/hyperkitty/list/ceph-users@ceph.io/thread/5ROH5CWKKOEIQMVXOVRT5OO7CWK2HPM3/#J65DFUPP4BY57MICPANXKI7KAXSZ5Z5P
and https://www.spinics.net/lists/ceph-users/msg73188.html.

Fixes: https://tracker.ceph.com/issues/57115
Co-authored-by: Eugen Block <eblock@nde.ag>
Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit bf26274ae4737417193f8c2b56bea20eb2a358aa)

commit | commitdiff | tree

Shilpa Jagannath [Fri, 9 Aug 2024 17:14:52 +0000 (10:14 -0700)]

Merge pull request #57901 from adamemerson/wip-62292-reef

reef: rgw: modify string match_wildcards with fnmatch

commit | commitdiff | tree

Yuri Weinstein [Fri, 9 Aug 2024 13:56:23 +0000 (06:56 -0700)]

Merge pull request #57301 from cbodley/wip-65822-reef

reef: rgw: fix CompleteMultipart error handling regression

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Fri, 9 Aug 2024 13:55:23 +0000 (06:55 -0700)]

Merge pull request #52611 from cbodley/wip-62142-reef

reef: valgrind: update suppression for SyscallParam under call_init

Reviewed-by: Shilpa Jagannath <smanjara@redhat.com>

commit | commitdiff | tree

Anthony D'Atri [Fri, 9 Aug 2024 01:02:52 +0000 (18:02 -0700)]

Merge pull request #59100 from zdover23/wip-doc-2024-08-09-backport-59807-to-reef

reef: docs/rados/operations/stretch-mode: warn device class is not supported

commit | commitdiff | tree

Kamoltat Sirivadhna [Wed, 7 Aug 2024 19:20:41 +0000 (19:20 +0000)]

docs/rados/operations/stretch-mode: warn device class is not supported

Signed-off-by: Kamoltat Sirivadhna <ksirivad@redhat.com>
(cherry picked from commit aa1d8cf4fa321e24e850bd5f687a6ddad3ce05e3)

commit | commitdiff | tree

Casey Bodley [Fri, 3 May 2024 19:43:39 +0000 (15:43 -0400)]

rgw: move publish_complete() back to RGWCompleteMultipart::execute()

move publish_complete() and meta_obj->delete_object() back to execute()
so they only run on success. this allows several member variables to
move back to execute()'s stack as well

Fixes: https://tracker.ceph.com/issues/65746
Signed-off-by: Casey Bodley <cbodley@redhat.com>
(cherry picked from commit ebb37c7bb3aee4663220054c6516164bf046fa8c)

commit | commitdiff | tree

Casey Bodley [Fri, 3 May 2024 19:29:00 +0000 (15:29 -0400)]

rgw: CompleteMultipart uses s->object for Notification

get_notification() should be associated with the target object
s->object. the meta_obj has the wrong object name, so required passing
s->object->get_name() as an extra argument

importantly, Notification no longer depends on the lifetime of meta_obj
to avoid a dangling pointer, while the lifetime of s->object is guaranteed

Signed-off-by: Casey Bodley <cbodley@redhat.com>
(cherry picked from commit 91dc7f3be945dccd8f59e070e9bc43a2a5df12db)

commit | commitdiff | tree

Casey Bodley [Fri, 3 May 2024 19:17:48 +0000 (15:17 -0400)]

rgw: CompleteMultipart uses s->object instead of target_obj

most requests operate directly on s->object. there's no reason to
allocate a separate target_obj for the same purpose

Signed-off-by: Casey Bodley <cbodley@redhat.com>
(cherry picked from commit d09b8ab2e077ceb6a0c6dfb99ce1b45d63a28be4)

commit | commitdiff | tree

Shilpa Jagannath [Thu, 8 Aug 2024 20:34:18 +0000 (13:34 -0700)]

Merge pull request #58793 from ivancich/wip-67155-reef

reef: test/rgw: address potential race condition in reshard testing

commit | commitdiff | tree

Shilpa Jagannath [Thu, 8 Aug 2024 20:33:21 +0000 (13:33 -0700)]

Merge pull request #58435 from cbodley/wip-64465-reef

reef: rgw: cumulatively fix 6 AWS SigV4 request failure cases

commit | commitdiff | tree

Shilpa Jagannath [Thu, 8 Aug 2024 20:31:52 +0000 (13:31 -0700)]

Merge pull request #58168 from cbodley/wip-66580-reef

reef: rgw: optimize gc chain size calculation

commit | commitdiff | tree

Shilpa Jagannath [Thu, 8 Aug 2024 20:30:59 +0000 (13:30 -0700)]

Merge pull request #57425 from mkogan1/wip-65886-reef

reef: rgw/beast: fix crash observed in SSL stream.async_shutdown()

commit | commitdiff | tree

Shilpa Jagannath [Thu, 8 Aug 2024 20:30:41 +0000 (13:30 -0700)]

Merge pull request #57127 from jzhu116-bloomberg/wip-64325-reef

reef: rgw/multisite: avoid writing multipart parts to the bucket index log

commit | commitdiff | tree

Shilpa Jagannath [Thu, 8 Aug 2024 20:26:50 +0000 (13:26 -0700)]

Merge pull request #56615 from prazumovsky/wip-63621

reef: rgw/swift: preserve dashes/underscores in swift user metadata names

commit | commitdiff | tree

Shilpa Jagannath [Thu, 8 Aug 2024 20:25:12 +0000 (13:25 -0700)]

Merge pull request #56554 from petrutlucian94/wip-64326-reef

reef: RGW: fix cloud-sync not being able to sync folders

Unnamed repository; edit this file 'description' to name the repository.

RSS Atom