git-server-git.apps.pok.os.sepia.ceph.com Git - ceph.git/log

]> git-server-git.apps.pok.os.sepia.ceph.com Git - ceph.git/log

projects / ceph.git / log

summary | shortlog | log | commit | commitdiff | tree
first ⋅ prev ⋅ next

commit | commitdiff | tree

Ilya Dryomov [Thu, 12 Dec 2024 20:32:39 +0000 (21:32 +0100)]

librbd/migration/HttpClient: socket isn't shut down on some state transitions

If shut_down() gets delayed until a) the state transition from
STATE_RESET_CONNECTING completes and the reconnect is unsuccessful or
b) the state transition from STATE_RESET_DISCONNECTING completes (i.e.
next_state is STATE_UNINITIALIZED or STATE_RESET_CONNECTING), the
socket needs to be shut down before m_on_shutdown is invoked. The line
of thought here is the same as for the corresponding state transitions
that don't involve STATE_SHUTTING_DOWN.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit 88557dff2fe14c7df96992fbb0a5208979c20bf1)

commit | commitdiff | tree

Ilya Dryomov [Wed, 11 Dec 2024 15:25:13 +0000 (16:25 +0100)]

librbd/migration/HttpClient: avoid hitting an assert in advance_state()

If the shutdown gets delayed until the state transition from
STATE_RESET_CONNECTING completes and the reconnect is successful
(i.e. next_state is STATE_READY), we eventually hit "unexpected
state transition" assert in advance_state(). The reason is that
advance_state() would update m_state and call disconnect() under
STATE_READY instead of STATE_SHUTTING_DOWN. After the disconnect
maybe_finalize_shutdown() would enter advance_state() again with
STATE_SHUTDOWN as next_state, but the transition to that from
STATE_READY is invalid.

Plug this by not transitioning to next_state if current_state is
STATE_SHUTTING_DOWN.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit 1046d610e3d6852258e6c4bf0355d0d13fb197b4)

commit | commitdiff | tree

Ilya Dryomov [Mon, 9 Dec 2024 10:19:57 +0000 (11:19 +0100)]

librbd/migration/HttpClient: ignore stream_truncated when shutting down SSL

Propagate ec to handle_disconnect() and use it to suppress
stream_truncated errors.  Here is a quote from Beast documentation [1]:

  // Gracefully shutdown the SSL/TLS connection
  error_code ec;
  stream.shutdown(ec);
  // Non-compliant servers don't participate in the SSL/TLS shutdown process and
  // close the underlying transport layer. This causes the shutdown operation to
  // complete with a `stream_truncated` error. One might decide not to log such
  // errors as there are many non-compliant servers in the wild.
  if(ec != net::ssl::error::stream_truncated)
      log(ec);

... and a commit that made ignoring stream_truncated safe [2]:

  // ssl::error::stream_truncated, also known as an SSL "short read",
  // indicates the peer closed the connection without performing the
  // required closing handshake
  // [...]
  // When a short read would cut off the end of an HTTP message,
  // Beast returns the error beast::http::error::partial_message.
  // Therefore, if we see a short read here, it has occurred
  // after the message has been completed, so it is safe to ignore it.

[1] https://www.boost.org/doc/libs/develop/libs/beast/doc/html/beast/using_io/ssl_tls_shutdown.html
[2] https://github.com/boostorg/beast/commit/094f5ec5cb3be1c3ce2d985564f1f39e9bed74ff

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit 9fa0bcc67d79d90996cd4ec2b5af56d051ef6be7)

commit | commitdiff | tree

Ilya Dryomov [Sat, 7 Dec 2024 11:37:29 +0000 (12:37 +0100)]

librbd/migration/HttpClient: propagate ec to handle_handshake()

Get rid of get_callback_adapter() which only obfuscates the error:

handle_handshake: failed to complete SSL handshake: (337047686) Unknown error 337047686

vs

handle_handshake: failed to complete SSL handshake: certificate verify failed (SSL routines, tls_process_server_certificate)

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit e305a5908bd7bd3f2fa906af8521aea989f0c0ca)

commit | commitdiff | tree

Ilya Dryomov [Sat, 7 Dec 2024 12:52:41 +0000 (13:52 +0100)]

librbd/migration/HttpClient: drop SslHttpSession::m_ssl_enabled

The remaining callers of disconnect() call it only when m_ssl_enabled
is set to true (i.e. after the handshake is completed):

- shut_down(), in STATE_READY
- maybe_finalize_reset(), very shortly after transitioning out of
STATE_READY as part of performing a reset
- advance_state(), on a transition to STATE_READY that is intercepted
by a previously delayed shut down

m_ssl_enabled isn't used outside of disconnect() and on top of that
is never cleared.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit 8566224e9406abca42925f8045077141c2724bed)

commit | commitdiff | tree

Ilya Dryomov [Sat, 7 Dec 2024 11:22:52 +0000 (12:22 +0100)]

librbd/migration/HttpClient: don't call disconnect() in handle_handshake()

With m_ssl_enabled set to false, disconnect() is a no-op. Since
m_ssl_enabled is flipped to true only when the handshake succeeds,
calling disconnect() on "failed to complete handshake" error is bogus
(as would be attempting to shut down SSL there).

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit 331b5ea322537d380996ac6b250898ba474500be)

commit | commitdiff | tree

Ilya Dryomov [Fri, 6 Dec 2024 15:51:51 +0000 (16:51 +0100)]

librbd/migration/HttpClient: avoid reusing ssl_stream after shut down

ssl_stream objects can't be reused after shut down: despite
a successful reconnect and handshake, any attempt to read or write
fails with "end of stream" (beast.http:1) or "protocol is shutdown"
(asio.ssl:337690831) error respectively. This doesn't appear to be
documented, but Beast and ASIO authors both mention that the stream
must be destroyed and recreated [1][2].

This was missed because the only integration test with a big enough
image used http instead of https.

[1] https://github.com/boostorg/beast/issues/821#issuecomment-338354949
[2] https://github.com/chriskohlhoff/asio/issues/804#issuecomment-872746894

Fixes: https://tracker.ceph.com/issues/69178
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit 20885b11794ba80d5cddd178994865a83da7240f)

commit | commitdiff | tree

Ilya Dryomov [Fri, 6 Dec 2024 13:42:55 +0000 (14:42 +0100)]

librbd/migration/HttpClient: don't shut down socket in resolve_host()

resolve_host() is called from init() and issue() when transitioning out
of STATE_UNINITIALIZED and from advance_state() right after the call to
shutdown_socket(). In all three cases the socket should get closed, so
drop the redundant call and place asserts in connect() implementations
instead.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit 5699e4b871c66035c2c425bc9de7022fa2474c78)

commit | commitdiff | tree

Ilya Dryomov [Tue, 3 Dec 2024 15:20:16 +0000 (16:20 +0100)]

librbd/migration/HttpClient: fix some error messages in handle_receive()

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit d1a83f264ae97ec56004e6bf67a00f5753c6d611)

commit | commitdiff | tree

Ilya Dryomov [Tue, 3 Dec 2024 15:16:39 +0000 (16:16 +0100)]

librbd/migration/HttpClient: log ec details in handle_{issue,receive}()

Logging just the (negated) value makes tracking down the error category
and message unnecessarily hard.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit eb773496d53e924e6ff99ab1c3a9bcd36a8f81ea)

commit | commitdiff | tree

Anthony D'Atri [Fri, 13 Dec 2024 20:08:14 +0000 (15:08 -0500)]

Merge pull request #61081 from zdover23/wip-doc-2024-12-14-backport-61077-to-reef

reef: doc/cephfs: edit 3rd 3rd of mount-using-kernel-driver

commit | commitdiff | tree

Zac Dover [Fri, 13 Dec 2024 06:12:49 +0000 (16:12 +1000)]

doc/cephfs: edit 3rd 3rd of mount-using-kernel-driver

Edit the third third of doc/cephfs/mount-using-kernel-driver.rst in
preparation for correcting mount commands that may not work in Reef as
described in this documentation.

This commit edits only English-language strings in
doc/cephfs/mount-using-kernel-driver.rst. No technical content (that is,
no commands and no settings) have been altered in this commit.

Technical alterations to this file will be made only after the English
is unambiguous.

This PR follows the following two PRs:
https://github.com/ceph/ceph/pull/61048 - 1st 3rd
https://github.com/ceph/ceph/pull/61049 - 2nd 3rd

Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 9c7580a2935511d009c9e66885e76635aa504ee8)

commit | commitdiff | tree

Anthony D'Atri [Thu, 12 Dec 2024 13:44:52 +0000 (08:44 -0500)]

Merge pull request #61064 from zdover23/wip-doc-2024-12-12-backport-60955-to-reef

reef: doc/dev: instruct devs to backport

commit | commitdiff | tree

Zac Dover [Wed, 4 Dec 2024 20:43:12 +0000 (21:43 +0100)]

doc/dev: instruct devs to backport

Add a note to doc/dec/development-workflow.rst that instructs developers
to do their own backports. This change was requested by Laura Flores on
04 Dec 2024.

Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 5d584b4badb606d372c266424f59076408f62f40)

commit | commitdiff | tree

Zac Dover [Wed, 11 Dec 2024 21:02:11 +0000 (22:02 +0100)]

Merge pull request #61042 from zdover23/wip-doc-2024-12-11-mount-using-kernel-driver

doc/cephfs: edit first 3rd of mount-using-kernel-driver

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

commit | commitdiff | tree

Zac Dover [Wed, 11 Dec 2024 14:15:14 +0000 (15:15 +0100)]

doc/cephfs: edit first 3rd of mount-using-kernel-driver

Edit the first third of doc/cephfs/mount-using-kernel-driver.rst in
preparation for correcting mount commands that may not work in Reef as
described in this documentation.

Signed-off-by: Zac Dover <zac.dover@proton.me>

commit | commitdiff | tree

Zac Dover [Wed, 11 Dec 2024 13:33:17 +0000 (14:33 +0100)]

Merge pull request #61032 from neha-ojha/wip-update-foundation-reef

reef: doc/foundation.rst: update Intel point of contact

Reviewed-by: Zac Dover <zac.dover@proton.me>

commit | commitdiff | tree

Neha Ojha [Tue, 10 Dec 2024 22:36:07 +0000 (22:36 +0000)]

doc/foundation.rst: update Intel point of contact

This is for reef since the new foundation structure has changed.

Signed-off-by: Neha Ojha <nojha@redhat.com>

commit | commitdiff | tree

Ilya Dryomov [Tue, 10 Dec 2024 08:43:44 +0000 (09:43 +0100)]

Merge pull request #60750 from idryomov/wip-68743-reef

reef: Make mon addrs consistent with mon info

Reviewed-by: Aishwarya Mathuria <amathuri@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Mon, 9 Dec 2024 19:29:40 +0000 (11:29 -0800)]

Merge pull request #60061 from sajibreadd/wip-68324-reef

reef: common/Formatter: dump inf/nan as null

Reviewed-by: Igor Fedotov <ifedotov@suse.com>

commit | commitdiff | tree

Anthony D'Atri [Fri, 6 Dec 2024 05:54:39 +0000 (06:54 +0100)]

Merge pull request #60972 from zdover23/wip-doc-2024-12-06-backport-60929-to-reef

reef: doc/dev/release-process.rst: New container build/release process

commit | commitdiff | tree

Dan Mick [Wed, 4 Dec 2024 01:15:41 +0000 (17:15 -0800)]

doc/dev/release-process.rst: New container build/release process

Signed-off-by: Dan Mick <dan.mick@redhat.com>
(cherry picked from commit 178036924e7d1ec8eb62663f00f82d0b092c7ca1)

commit | commitdiff | tree

Dan Mick [Thu, 5 Dec 2024 23:25:42 +0000 (15:25 -0800)]

Merge pull request #60961 from dmick/wip-69118-reef

reef: Modify container/ software to support release containers and the promotion of prerelease containers

commit | commitdiff | tree

SrinivasaBharathKanta [Thu, 5 Dec 2024 10:32:35 +0000 (16:02 +0530)]

Merge pull request #56199 from ifed01/wip-ifed-fix-64443-reef

reef: test/store_test: fix DeferredWrite test when prefer_deferred_size=0

commit | commitdiff | tree

Dan Mick [Thu, 21 Nov 2024 03:11:24 +0000 (19:11 -0800)]

container/{Containerfile,build.sh}: use a secret to pass secrets

Use podman's build --secret mechanism to pass credentials for
download.ceph.com

Signed-off-by: Dan Mick <dan.mick@redhat.com>
(cherry picked from commit 4b3c0cb8f3f130563d9e1309beb7a8e0b59ea83e)

commit | commitdiff | tree

Dan Mick [Thu, 21 Nov 2024 02:23:50 +0000 (18:23 -0800)]

container/Containerfile: allow failure from curl to stop build

curl doesn't actually fail unless -f is used; also, add
some sh -e flags.

Signed-off-by: Dan Mick <dan.mick@redhat.com>
(cherry picked from commit 18d14b22f31c65ad3db79c7930a841157b68deee)

commit | commitdiff | tree

Dan Mick [Thu, 21 Nov 2024 02:18:59 +0000 (18:18 -0800)]

container/Containerfile: purge .repo files with secrets before commit

ceph.repo had creds in it for download.ceph.com/prerelease.
Remove the .repo files we construct, since they're not necessary
once the container is built (no one should be dnf'ing anything
in the container).

Signed-off-by: Dan Mick <dan.mick@redhat.com>
(cherry picked from commit 08432ef54212aa2aba724102854082e76f0cd847)

commit | commitdiff | tree

Dan Mick [Wed, 20 Nov 2024 00:44:17 +0000 (16:44 -0800)]

container/Containerfile fix boolean IS_RELEASE usage

Existing syntax was just wrong.

Signed-off-by: Dan Mick <dan.mick@redhat.com>
(cherry picked from commit ce74f14de5042774b8386b0eff04497b517eaacc)

commit | commitdiff | tree

Dan Mick [Wed, 20 Nov 2024 00:40:05 +0000 (16:40 -0800)]

container/make-manifest-list.py: use LOCALMANIFEST throughout

Signed-off-by: Dan Mick <dan.mick@redhat.com>
(cherry picked from commit a3a9f188cabf714de2c39e310bccbe848ed66671)

commit | commitdiff | tree

Dan Mick [Sat, 16 Nov 2024 00:21:26 +0000 (16:21 -0800)]

containers/make-manifest.py: Add --promote

The promote function finds the latest image on the prerelease repo,
finds all its tags, and copies them all to the release repo.

Signed-off-by: Dan Mick <dan.mick@redhat.com>
(cherry picked from commit 4f050d7811fa6ec347c3414d5749e0f86480d7f4)

commit | commitdiff | tree

Dan Mick [Fri, 15 Nov 2024 01:49:15 +0000 (17:49 -0800)]

container/Containerfile, build.sh: Add PRERELEASE_USERNAME/PASSWORD

Used for accessing download.ceph.com

Signed-off-by: Dan Mick <dan.mick@redhat.com>
(cherry picked from commit 7374b9b1114da08f1fdb9dfcfe2ef37c2178c8f5)

commit | commitdiff | tree

Dan Mick [Thu, 14 Nov 2024 22:28:32 +0000 (14:28 -0800)]

container/build.sh:

- change repo path to prerelease
- remove image id arg for release push
- include 'v' in version name for release push

Signed-off-by: Dan Mick <dan.mick@redhat.com>
(cherry picked from commit 6c8e9984253a61c261cfe6e0981dda58895500ae)

commit | commitdiff | tree

Dan Mick [Fri, 1 Nov 2024 02:55:36 +0000 (19:55 -0700)]

container/make-manifest-list.py

- don't print command failure in worker; let the caller print them
  if desired (allow silent failure)
- allow for empty tags list
- look for CEPH_SHA1.  GIT_COMMIT was the sha1 of the ceph-container.git
  commit
- change default paths to prerelease
- add --dry-run to avoid final push
- rename 'HOST' to 'CONTAINER_HOST'
- Use ARCH_SPECIFIC_HOST instead of CONTAINER_HOST (which is used by podman)

Signed-off-by: Dan Mick <dan.mick@redhat.com>
(cherry picked from commit 44b2fc4d5e1fd909370018ca9ec210686f98cefa)

commit | commitdiff | tree

Anthony D'Atri [Wed, 4 Dec 2024 21:38:07 +0000 (22:38 +0100)]

Merge pull request #60950 from zdover23/wip-doc-2024-12-04-backport-60930-to-reef

reef: doc/rados: fix sentences in health-checks (3 of x)

commit | commitdiff | tree

Zac Dover [Wed, 4 Dec 2024 02:13:05 +0000 (03:13 +0100)]

doc/rados: fix sentences in health-checks (3 of x)

Make sentences agree at the head of each section in
doc/rados/operations/health-checks.rst. The sentences were sometimes in
the imperative mood and sometimes in the declarative mood.

This commit edits the second third of
doc/rados/operations/health-checks.rst.

Note to (I hope soon) future Zac: There are a a couple of places near
the end of this file where the sentences are ungrammatical. Update these
in a separate PR (in isolation, so that the grammar and technical
accuracy of these sentences can be the primary focus of the reviewers).

Zac: cf. 000228

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 97df4479d90b7f1b659b71ac172ff1094d58a50e)

commit | commitdiff | tree

Anthony D'Atri [Wed, 4 Dec 2024 07:58:36 +0000 (08:58 +0100)]

Merge pull request #60932 from zdover23/wip-doc-2024-12-04-backport-60619-to-reef

reef: doc/rados: fix sentences in health-checks (2 of x)

commit | commitdiff | tree

Zac Dover [Tue, 3 Dec 2024 11:02:43 +0000 (12:02 +0100)]

doc/rados: fix sentences in health-checks (2 of x)

Make sentences agree at the head of each section in
doc/rados/operations/health-checks.rst. The sentences were sometimes in
the imperative mood and sometimes in the declarative mood.

This commit edits the second third of
doc/rados/operations/health-checks.rst.

Zac: cf. 000228

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit ee0ef765349ed8e170f8cd31c4fa0c7570105a96)

commit | commitdiff | tree

Anthony D'Atri [Tue, 3 Dec 2024 15:01:52 +0000 (16:01 +0100)]

Merge pull request #60921 from zdover23/wip-doc-2024-12-03-backport-60917-to-reef

reef: doc/rados: make sentences agree in health-checks.rst

commit | commitdiff | tree

Zac Dover [Tue, 3 Dec 2024 08:28:09 +0000 (09:28 +0100)]

doc/rados: make sentences agree in health-checks.rst

Make sentences agree at the head of each section in
doc/rados/operations/health-checks.rst. The sentences were sometimes in
the imperative mood and sometimes in the declarative mood.

This commit edits the first third of
doc/rados/operations/health-checks.rst.

Zac: cf. 000228

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit aec87b9188a92d2722c7310e753462a054cb12c7)

commit | commitdiff | tree

SrinivasaBharathKanta [Tue, 3 Dec 2024 04:09:40 +0000 (09:39 +0530)]

Merge pull request #59034 from sseshasa/wip-67357-reef

reef: osd/scheduler: add mclock queue length perfcounter

commit | commitdiff | tree

SrinivasaBharathKanta [Tue, 3 Dec 2024 04:09:13 +0000 (09:39 +0530)]

Merge pull request #58786 from sseshasa/wip-67109-reef

reef: qa/suites/rados/verify/validater: increase heartbeat grace timeout

commit | commitdiff | tree

SrinivasaBharathKanta [Tue, 3 Dec 2024 04:07:19 +0000 (09:37 +0530)]

Merge pull request #58692 from ljflores/wip-67054-reef

reef: qa: suppress Leak_StillReachable mon leak in centos 9 jobs

commit | commitdiff | tree

SrinivasaBharathKanta [Tue, 3 Dec 2024 04:06:54 +0000 (09:36 +0530)]

Merge pull request #58675 from ifed01/wip-ifed-many-many-extents-read-reef

reef: blk/aio: fix long batch (64+K entries) submission

commit | commitdiff | tree

Anthony D'Atri [Sun, 1 Dec 2024 15:40:04 +0000 (10:40 -0500)]

Merge pull request #60894 from zdover23/wip-doc-2024-12-01-backport-60882-to-reef

reef: os/bluestore: Improve documentation introduced by #57722

commit | commitdiff | tree

Anthony D'Atri [Sun, 1 Dec 2024 13:47:10 +0000 (08:47 -0500)]

Merge pull request #60897 from zdover23/wip-doc-2024-12-01-backport-60892-to-reef

reef: doc/glossary.rst: add "Dashboard Plugin"

commit | commitdiff | tree

Zac Dover [Sat, 30 Nov 2024 16:50:53 +0000 (17:50 +0100)]

doc/glossary.rst: add "Dashboard Plugin"

Add an entry below the (Mimic-era and therefore outdated but
nonetheless historically important) Dashboard Plugin key word in the
glosssary, which before now had never been added to the glossary.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit a1cec3de5704cedd7f7c8ce185144a96d72520d3)

commit | commitdiff | tree

Anthony D'Atri [Thu, 28 Nov 2024 18:09:39 +0000 (13:09 -0500)]

os/bluestore: Improve documentation introduced by #57722

Signed-off-by: Anthony D'Atri <anthonyeleven@users.noreply.github.com>
(cherry picked from commit b6eb98cc10895e0cdc4bc147f6f83145c10cba41)

commit | commitdiff | tree

Anthony D'Atri [Fri, 29 Nov 2024 22:52:37 +0000 (17:52 -0500)]

Merge pull request #60886 from zdover23/wip-doc-2024-11-29-backport-60884-to-reef

reef: doc/radosgw: update rgw_dns_name doc

commit | commitdiff | tree

Zac Dover [Fri, 29 Nov 2024 03:12:02 +0000 (13:12 +1000)]

doc/radosgw: update rgw_dns_name doc

Update doc/radosgw/s3/commons.rst with the changes made by Jiffin Tony
Thottan in https://github.com/ceph/ceph/pull/54524 and the suggestions
made in that same PR by Anthony D'Atri.

Explain how to set rgw_dns_name to a domain name in order to configure
access to virtual hosted buckets.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 45ff7e3365eaaa01ccb4ead71a14dd1dd9741c2f)

commit | commitdiff | tree

Anthony D'Atri [Fri, 29 Nov 2024 01:55:44 +0000 (20:55 -0500)]

Merge pull request #60868 from zdover23/wip-doc-2024-11-28-backport-60860-to-reef

reef: doc/radosgw: Improve archive-sync-module.rst more

commit | commitdiff | tree

Anthony D'Atri [Thu, 28 Nov 2024 18:22:36 +0000 (13:22 -0500)]

Merge pull request #60876 from zdover23/wip-doc-2024-11-28-backport-60857-to-reef

reef: doc/glossary: add "DC" (Data Center) to glossary

commit | commitdiff | tree

Hemanth [Thu, 28 Nov 2024 17:02:33 +0000 (22:32 +0530)]

Merge pull request #60182 from joscollin/wip-68409-reef

reef: cephfs_mirror, qa: fix test failure test_cephfs_mirror_cancel_mirroring_and_readd

commit | commitdiff | tree

SrinivasaBharathKanta [Thu, 28 Nov 2024 14:19:27 +0000 (19:49 +0530)]

Merge pull request #59466 from sajibreadd/wip-67038-reef

reef: os/bluestore: Warning added for slow operations and stalled read

commit | commitdiff | tree

SrinivasaBharathKanta [Thu, 28 Nov 2024 14:16:27 +0000 (19:46 +0530)]

Merge pull request #60284 from Matan-B/wip-68501-reef

reef: osd/PeeringState: introduce osd_skip_check_past_interval_bounds

commit | commitdiff | tree

SrinivasaBharathKanta [Thu, 28 Nov 2024 14:15:44 +0000 (19:45 +0530)]

Merge pull request #59972 from sseshasa/wip-68224-reef

reef: common/options: Change HDD OSD shard configuration defaults for mClock

commit | commitdiff | tree

SrinivasaBharathKanta [Thu, 28 Nov 2024 12:16:45 +0000 (17:46 +0530)]

Merge pull request #60108 from cbodley/wip-68379-reef

reef: cmake: use ExternalProjects to build isa-l and isa-l_crypto libraries

commit | commitdiff | tree

sajibreadd [Mon, 27 May 2024 07:30:06 +0000 (13:30 +0600)]

Warning added for slow operations and stalled read in BlueStore. User can control how much time the warning should persist after last occurence and maximum number of operations as a threshold will be considered for the warning.

Fixes: https://tracker.ceph.com/issues/62500
Signed-off-by: Md Mahamudur Rahaman Sajib <mahamudur.sajib@croit.io>
(cherry picked from commit 73b80a9a2c38259346fb646f85fa2ba4dcbb1329)

Conflicts:
src/common/options/global.yaml.in
src/os/bluestore/BlueStore.cc
src/os/bluestore/BlueStore.h
doc/rados/operations/health-checks.rst

commit | commitdiff | tree

SrinivasaBharathKanta [Thu, 28 Nov 2024 11:26:41 +0000 (16:56 +0530)]

Merge pull request #59157 from badone/wip-tracker-59380-admin-socket-injectfull-reef

reef: qa/suites/rados: Cancel injectfull to allow cleanup

commit | commitdiff | tree

Sridhar Seshasayee [Tue, 3 Sep 2024 05:39:08 +0000 (11:09 +0530)]

common/options: Change HDD OSD shard configuration defaults for mClock

Based on tests performed at scale on a HDD based cluster, it was found
that scheduling with mClock was not optimal with multiple OSD shards. For
e.g., in the scaled cluster with multiple OSD node failures, the client
throughput was found to be inconsistent across test runs coupled with
multiple reported slow requests.

However, the same test with a single OSD shard and with multiple worker
threads yielded significantly better results in terms of consistency of
client and recovery throughput across multiple test runs.

For more details see https://tracker.ceph.com/issues/66289.

Therefore, as an interim measure until the issue with multiple OSD shards
(or multiple mClock queues per OSD) is investigated and fixed, the
following change to the default HDD OSD shard configuration is made:

- osd_op_num_shards_hdd = 1 (was 5)
- osd_op_num_threads_per_shard_hdd = 5 (was 1)

The other changes in this commit include:
- Doc change to the OSD and mClock config reference describing
   this change.
- OSD troubleshooting entry on the procedure to change the shard
   configuration for clusters affected by this issue running on older
   releases.
- Add release note for this change.

Fixes: https://tracker.ceph.com/issues/66289
Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>
(cherry picked from commit 0d81e721378e6d7a647c5a4f6aab3cede1a828d3)

Conflicts:
        doc/rados/troubleshooting/troubleshooting-osd.rst
- Included the troubleshooting entry before the "Flapping OSDs" section.
        PendingReleaseNotes
- Moved the release note under 18.2.4 section and removed unrelated entries

commit | commitdiff | tree

Zac Dover [Wed, 27 Nov 2024 06:17:41 +0000 (16:17 +1000)]

doc/glossary: add "DC" (Data Center) to glossary

Add the initialism "DC" (Data Center) to the glossary.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit e595f353b53d50291f7cf58e6d782adf30449704)

commit | commitdiff | tree

Zac Dover [Thu, 28 Nov 2024 05:04:37 +0000 (15:04 +1000)]

Merge pull request #60865 from zdover23/wip-doc-2024-11-28-backport-59225-to-reef

reef: doc/glossary: add "flapping OSD"

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

commit | commitdiff | tree

Anthony D'Atri [Wed, 27 Nov 2024 16:39:38 +0000 (11:39 -0500)]

doc/radosgw: Improve archive-sync-module.rst more

Signed-off-by: Anthony D'Atri <anthonyeleven@users.noreply.github.com>
(cherry picked from commit c24dc78484cbca8660c87742a6612c15935953c4)

commit | commitdiff | tree

Zac Dover [Wed, 14 Aug 2024 18:08:14 +0000 (04:08 +1000)]

doc/glossary: add "flapping OSD"

Add an entry for "Flapping OSD" to the glossary.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 1ca89e6ca3d9e0042702b7ac3af9a3e596330c1d)

commit | commitdiff | tree

Anthony D'Atri [Tue, 26 Nov 2024 21:59:28 +0000 (16:59 -0500)]

Merge pull request #60853 from zdover23/wip-doc-2024-11-27-backport-60846-to-reef

reef: doc/radosgw: Improve archive-sync-module.rst

commit | commitdiff | tree

Anthony D'Atri [Tue, 26 Nov 2024 16:13:06 +0000 (11:13 -0500)]

doc/radosgw: Improve archive-sync-module.rst

Signed-off-by: Anthony D'Atri <anthony.datri@gmail.com>
(cherry picked from commit 303ce155a34fdf2cde63b948d204aca339dd5682)

commit | commitdiff | tree

Venky Shankar [Tue, 26 Nov 2024 16:56:36 +0000 (22:26 +0530)]

Merge pull request #58789 from batrick/wip-67121-reef

reef: qa/cephfs: ignore variant of MDS_UP_LESS_THAN_MAX

Reviewed-by: Venky Shankar <vshankar@redhat.com>

commit | commitdiff | tree

Venky Shankar [Tue, 26 Nov 2024 16:56:28 +0000 (22:26 +0530)]

Merge pull request #58791 from batrick/wip-67154-reef

reef: qa: suppress __trans_list_add valgrind warning

Reviewed-by: Venky Shankar <vshankar@redhat.com>

commit | commitdiff | tree

Venky Shankar [Tue, 26 Nov 2024 16:56:18 +0000 (22:26 +0530)]

Merge pull request #58805 from lxbsz/wip-67148

reef: cephfs: Fixed a bug in the readdir_cache_cb function that may have us…

Reviewed-by: Venky Shankar <vshankar@redhat.com>

commit | commitdiff | tree

Venky Shankar [Tue, 26 Nov 2024 16:56:06 +0000 (22:26 +0530)]

Merge pull request #58809 from lxbsz/wip-67118

reef: qa: ignore container checkpoint/restore related selinux denials for c…

Reviewed-by: Venky Shankar <vshankar@redhat.com>

commit | commitdiff | tree

Venky Shankar [Tue, 26 Nov 2024 16:55:57 +0000 (22:25 +0530)]

Merge pull request #58812 from lxbsz/wip-67114

reef: cephfs-shell: excute cmd 'rmdir_helper' reported error

Reviewed-by: Venky Shankar <vshankar@redhat.com>

commit | commitdiff | tree

Venky Shankar [Tue, 26 Nov 2024 16:55:42 +0000 (22:25 +0530)]

Merge pull request #58842 from lxbsz/wip-67162

reef: mds: try to choose a new batch head in request_clientup()

Reviewed-by: Venky Shankar <vshankar@redhat.com>

commit | commitdiff | tree

Venky Shankar [Tue, 26 Nov 2024 16:48:26 +0000 (22:18 +0530)]

Merge pull request #59071 from joscollin/wip-67386-reef

reef: cephfs_mirror: provide metrics for last successful snapshot sync

Reviewed-by: Venky Shankar <vshankar@redhat.com>

commit | commitdiff | tree

Venky Shankar [Tue, 26 Nov 2024 16:48:13 +0000 (22:18 +0530)]

Merge pull request #59069 from joscollin/wip-67383-reef

reef: qa/suites: drop --show-reachable=yes from fs:valgrind tests

Reviewed-by: Venky Shankar <vshankar@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Tue, 26 Nov 2024 15:48:42 +0000 (07:48 -0800)]

Merge pull request #60659 from sseshasa/wip-68857-reef

reef: common,osd: Use last valid OSD IOPS value if measured IOPS is unrealistic

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
Reviewed-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Tue, 26 Nov 2024 15:47:12 +0000 (07:47 -0800)]

Merge pull request #59264 from k0ste/wip-62781-reef

reef: os/bluestore: fix btree allocator

Reviewed-by: Igor Fedotov <ifedotov@suse.com>

commit | commitdiff | tree

Venky Shankar [Tue, 26 Nov 2024 10:36:14 +0000 (16:06 +0530)]

Merge pull request #59399 from lxbsz/wip-67434

reef: qa/cephfs: fix TestRenameCommand and unmount the clinet before failin…

Reviewed-by: Venky Shankar <vshankar@redhat.com>

commit | commitdiff | tree

Venky Shankar [Tue, 26 Nov 2024 10:35:35 +0000 (16:05 +0530)]

Merge pull request #59519 from batrick/wip-67750-reef

reef: mds: cephx path restriction incorrectly rejects snapshots of deleted directory

Reviewed-by: Venky Shankar <vshankar@redhat.com>

commit | commitdiff | tree

Venky Shankar [Tue, 26 Nov 2024 10:35:23 +0000 (16:05 +0530)]

Merge pull request #59522 from batrick/wip-67807-reef

reef: qa: load all dirfrags before testing altname recovery

Reviewed-by: Venky Shankar <vshankar@redhat.com>

commit | commitdiff | tree

Venky Shankar [Tue, 26 Nov 2024 10:34:12 +0000 (16:04 +0530)]

Merge pull request #59648 from mchangir/wip-67941-reef

reef: mgr/snap_schedule: correctly fetch mds_max_snaps_per_dir from mds

Reviewed-by: Venky Shankar <vshankar@redhat.com>

commit | commitdiff | tree

Venky Shankar [Tue, 26 Nov 2024 05:36:51 +0000 (11:06 +0530)]

Merge pull request #59197 from neesingh-rh/wip-67301-reef

reef: pybind/ceph_argparse: Fix error message for ceph tell command

Reviewed-by: Venky Shankar <vshankar@redhat.com>

commit | commitdiff | tree

Venky Shankar [Tue, 26 Nov 2024 05:30:13 +0000 (11:00 +0530)]

Merge pull request #59397 from lxbsz/wip-67586

reef: client: flush the caps release in filesystem sync

Reviewed-by: Venky Shankar <vshankar@redhat.com>

commit | commitdiff | tree

Milind Changire [Tue, 26 Nov 2024 04:38:45 +0000 (10:08 +0530)]

Merge pull request #60728 from mchangir/wip-68651-reef

reef: log: save/fetch thread name infra

commit | commitdiff | tree

Anthony D'Atri [Mon, 25 Nov 2024 23:26:08 +0000 (18:26 -0500)]

Merge pull request #60825 from zdover23/wip-doc-2024-11-25-backport-60813-to-reef

reef: doc/rados/troubleshooting: Improve log-and-debug.rst

commit | commitdiff | tree

Anthony D'Atri [Mon, 25 Nov 2024 01:24:18 +0000 (20:24 -0500)]

doc/rados/troubleshooting: Improve log-and-debug.rst

Signed-off-by: Anthony D'Atri <anthonyeleven@users.noreply.github.com>
(cherry picked from commit 44c9519cc3eb013104c52014c302ba71ba806f8f)

commit | commitdiff | tree

neeraj pratap singh [Thu, 25 Apr 2024 08:04:09 +0000 (13:34 +0530)]

qa: add test for ceph tell with unknown cephtype

Fixes: https://tracker.ceph.com/issues/59624
Signed-off-by: Neeraj Pratap Singh <neesingh@redhat.com>
(cherry picked from commit decf32e8234a00bd3af7291eac7845008d402717)

commit | commitdiff | tree

neeraj pratap singh [Thu, 2 Nov 2023 03:57:40 +0000 (09:27 +0530)]

pybind/ceph_argparse: fixing error message for ceph tell command

Fixes: https://tracker.ceph.com/issues/59624
Signed-off-by: Neeraj Pratap Singh <neesingh@redhat.com>
(cherry picked from commit 2811bb0dfe3b00ba831e34eb7db50e528d8fbb13)

commit | commitdiff | tree

Venky Shankar [Mon, 25 Nov 2024 12:26:02 +0000 (17:56 +0530)]

Merge pull request #59307 from batrick/wip-67477-reef

reef: mon/MonClient: handle ms_handle_fast_authentication return

Reviewed-by: Venky Shankar <vshankar@redhat.com>

commit | commitdiff | tree

Hemanth [Mon, 25 Nov 2024 10:10:06 +0000 (15:40 +0530)]

Merge pull request #59828 from joscollin/wip-68107-reef

reef: qa: do the set/get attribute on the remote filesystem

commit | commitdiff | tree

Hemanth [Mon, 25 Nov 2024 10:07:59 +0000 (15:37 +0530)]

Merge pull request #60410 from neesingh-rh/wip-68593-reef

reef: cephfs-shell: fixing cephfs-shell test failures

commit | commitdiff | tree

Hemanth [Mon, 25 Nov 2024 08:41:38 +0000 (14:11 +0530)]

Merge pull request #60178 from joscollin/wip-68415-reef

reef: cephfs_mirror: 'ceph fs snapshot mirror ls' command

commit | commitdiff | tree

Zac Dover [Mon, 25 Nov 2024 03:11:19 +0000 (13:11 +1000)]

Merge pull request #60738 from zdover23/wip-doc-2024-11-15-backport-60703-to-reef

reef: doc/rados: correct "full ratio" note

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

commit | commitdiff | tree

Anthony D'Atri [Sun, 24 Nov 2024 15:34:08 +0000 (10:34 -0500)]

Merge pull request #60810 from zdover23/wip-doc-2024-11-24-backport-60804-to-reef-test-2

reef: doc/cephadm: Clarify "Deploying a new Cluster"

commit | commitdiff | tree

Zac Dover [Sat, 23 Nov 2024 12:32:13 +0000 (22:32 +1000)]

doc/cephadm: Clarify "Deploying a new Cluster"

Change the title of the section "Deploying a new Ceph cluster" to "Using
cephadm to Deploy a New Ceph Cluster". This is part of the initiative to
separate package-related documentation from container-based
documenation.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 8cf77d03a1f3ecb73931ddf96d13ef098aa61c1a)

commit | commitdiff | tree

Anthony D'Atri [Thu, 21 Nov 2024 13:01:14 +0000 (08:01 -0500)]

Merge pull request #60789 from zdover23/wip-doc-2024-11-21-os-recommendations-reef

reef: doc/start: separate package and container support tables

commit | commitdiff | tree

Zac Dover [Thu, 21 Nov 2024 11:39:40 +0000 (21:39 +1000)]

doc/start: separate package and container support tables

Separate out container-related information from package-related information and put container-related support information in its own table.

Signed-off-by: Zac Dover <zac.dover@proton.me>

commit | commitdiff | tree

Nizamudeen A [Thu, 21 Nov 2024 05:24:15 +0000 (10:54 +0530)]

Merge pull request #60729 from YiteGu/wip-68935-reef

reef: mgr/dashboard: add cephfs rename REST API

Reviewed-by: Nizamudeen A <nia@redhat.com>

commit | commitdiff | tree

Nizamudeen A [Wed, 20 Nov 2024 06:32:45 +0000 (12:02 +0530)]

Merge pull request #60633 from rhcs-dashboard/wip-68845-reef

reef: mgr/dashboard: remove cherrypy_backports.py

commit | commitdiff | tree

shenjiatong [Sat, 2 Nov 2024 08:10:38 +0000 (16:10 +0800)]

Make mon addrs consistent with mon info

For legacy librbd clients, sanitize_mons will panic if addrs are not
consistent with mon_info_t.

Signed-off-by: shenjiatong <yshxxsjt715@163.com>
fixes: https://tracker.ceph.com/issues/68743
(cherry picked from commit 14b09519739f3b2dc715647fd0f04dd22c17aeb3)

commit | commitdiff | tree

Zac Dover [Mon, 11 Nov 2024 23:31:28 +0000 (09:31 +1000)]

doc/rados: correct "full ratio" note

Correct a note that directed users not to add an OSD after the cluster
has reached its "full ratio". The note now says "Do not let your cluster
reach its full ratio before adding an OSD."

Hat tip: Oskar Berggren

Fixes: https://tracker.ceph.com/issues/68900
Co-authored-by: Oskar Berggren <oskar.berggren@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit f1a2637c79a15c26a769661dd72ca68d766b2f0d)

commit | commitdiff | tree

Zac Dover [Thu, 14 Nov 2024 14:59:14 +0000 (00:59 +1000)]

Merge pull request #60699 from zdover23/wip-doc-2024-11-12-backport-60529-to-reef

reef: doc/start: separate package chart from container chart

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

Unnamed repository; edit this file 'description' to name the repository.