git.apps.os.sepia.ceph.com Git - ceph.git/log

]> git.apps.os.sepia.ceph.com Git - ceph.git/log

projects / ceph.git / log

summary | shortlog | log | commit | commitdiff | tree
first ⋅ prev ⋅ next

commit | commitdiff | tree

Mark Kogan [Thu, 23 May 2024 12:17:22 +0000 (12:17 +0000)]

rgw: delay the RGW process exit until all actice requests have completed

Wait for up to `rgw_exit_timeout_secs` for all outstanding requests
to complete before exiting unconditionally.
(new HTTP requests will not be accepted during this time.)

Fixes: https://tracker.ceph.com/issues/66205
Signed-off-by: Mark Kogan <mkogan@ibm.com>

commit | commitdiff | tree

Patrick Donnelly [Sun, 23 Jun 2024 18:32:41 +0000 (14:32 -0400)]

Merge PR #58125 into main

* refs/pull/58125/head:
common/TrackedOp: do not count the ops marked as nowarn

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Venky Shankar <vshankar@redhat.com>

commit | commitdiff | tree

Patrick Donnelly [Sun, 23 Jun 2024 18:32:00 +0000 (14:32 -0400)]

Merge PR #58121 into main

* refs/pull/58121/head:
doc: add documentation for `ceph auth rotate`
PendingReleaseNotes: add note for new `auth rotate`
qa: test `auth rotate`
mon/AuthMonitor: add `ceph auth rotate` command

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>

commit | commitdiff | tree

Patrick Donnelly [Sun, 23 Jun 2024 18:31:28 +0000 (14:31 -0400)]

Merge PR #57993 into main

* refs/pull/57993/head:
mds: initialize epoch for quiescedb

Reviewed-by: Leonid Usov <leonid.usov@ibm.com>

commit | commitdiff | tree

Patrick Donnelly [Sun, 23 Jun 2024 18:29:53 +0000 (14:29 -0400)]

Merge PR #57754 into main

* refs/pull/57754/head:
mds: set the proper extra bl for the create request
mds: encode the correct extra info depending on the feature bits
mds: add set_reply_extra_bl() helper support
mds: cleanup the code to make it to be more readable

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Greg Farnum <gfarnum@redhat.com>

commit | commitdiff | tree

Patrick Donnelly [Sun, 23 Jun 2024 18:29:13 +0000 (14:29 -0400)]

Merge PR #57084 into main

* refs/pull/57084/head:
mds: drop client metrics during recovery

Reviewed-by: Rishabh Dave <ridave@redhat.com>
Reviewed-by: Xiubo Li <xiubli@redhat.com>
Reviewed-by: Leonid Usov <leonid.usov@ibm.com>

commit | commitdiff | tree

Patrick Donnelly [Sun, 23 Jun 2024 18:28:38 +0000 (14:28 -0400)]

Merge PR #56828 into main

* refs/pull/56828/head:
mds: do remove the cap when seqs equal or larger than last issue

Reviewed-by: Leonid Usov <leonid.usov@ibm.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>

commit | commitdiff | tree

Zac Dover [Sun, 23 Jun 2024 08:15:26 +0000 (18:15 +1000)]

Merge pull request #58200 from zdover23/wip-doc-2024-06-22-start-begin-links

doc/start: add links to Beginner's Guide

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

commit | commitdiff | tree

Zac Dover [Sat, 22 Jun 2024 06:28:12 +0000 (16:28 +1000)]

doc/start: add links to Beginner's Guide

Add two links to Sage Weil's overview talks about Ceph.

Signed-off-by: Zac Dover <zac.dover@proton.me>

commit | commitdiff | tree

Samuel Just [Sun, 23 Jun 2024 00:42:37 +0000 (17:42 -0700)]

Merge pull request #58099 from athanatos/sjust/wip-66461-obc-with-lock

crimson: fix ObjectContext::_with_lock to only unlock if lock is taken

Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
Reviewed-by: Matan Breizman <mbreizma@redhat.com>

commit | commitdiff | tree

Nizamudeen A [Sat, 22 Jun 2024 11:18:58 +0000 (16:48 +0530)]

Merge pull request #57866 from rhcs-dashboard/fix-66348-main

mgr/dashboard: Block -> Images -> Create form improvements

Reviewed-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>
Reviewed-by: afreen23 <NOT@FOUND>
Reviewed-by: Ankush Behl <cloudbehl@gmail.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>
Reviewed-by: Sunil Angadi <Sunil.Angadi@ibm.com>

commit | commitdiff | tree

Zac Dover [Sat, 22 Jun 2024 06:21:20 +0000 (16:21 +1000)]

Merge pull request #58156 from zdover23/wip-doc-2024-06-20-rados-troubleshooting-mon

doc/rados: followup to PR#58057

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

commit | commitdiff | tree

Samuel Just [Thu, 13 Jun 2024 00:41:55 +0000 (00:41 +0000)]

crimson: fix ObjectContext::_with_lock to only unlock if lock is taken

Fixes: https://tracker.ceph.com/issues/66461
Signed-off-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Laura Flores [Fri, 21 Jun 2024 21:48:27 +0000 (16:48 -0500)]

Merge pull request #57974 from ljflores/wip-tracker-65189

Telemetry upgrade test fixes

commit | commitdiff | tree

Laura Flores [Wed, 19 Jun 2024 23:01:31 +0000 (18:01 -0500)]

qa/suites/upgrade/telemetry-upgrade/quincy-x: update image reference

`quay.io/ceph/daemon-base:latest-quincy` points to the latest quincy tag (v17.2.7),
not the tip of quincy, which is actually what we want to test.

Pulling the daemon-base:latest-quincy image shows it is running v17.2.7:
```
lflores:ceph$ podman run --rm -it daemon-base:latest-quincy
[root@e4eb265e7fa0 /]# ceph -v
ceph version 17.2.7 (b12291d110049b2f35e32e0de30d70e9a4c060d2) quincy (stable)
```

It is important that we test the tip of quincy in case there is a fix there that
we want to pick up.

I referenced this commit to make the change: https://github.com/ceph/ceph/commit/d0381ed7e68a6de7c9ff1e4220698ed8eab28ac6

Signed-off-by: Laura Flores <lflores@ibm.com>

commit | commitdiff | tree

Laura Flores [Wed, 19 Jun 2024 21:57:45 +0000 (16:57 -0500)]

qa/suites/upgrade/telemetry-upgrade/reef-x: update how cephadm is pulled and change image reference

Update how cephadm is pulled:

`cephadm_git_url` and `cephadm_branch` are used in releases older than reef
to install cephadm. Both of these keys are needed to install it from the github
repo.

However, in reef and on, the compiled zipapp cephadm needs to be pulled differently
than the old single python script `cephadm` from earlier releases.

This commit is what I referenced for this change: https://github.com/ceph/ceph/commit/967716f2e6c8cd3f7d810a5f8e5e49448717e513

-------

Change image reference:

`quay.io/ceph/daemon-base:latest-reef` points to the latest reef tag (v18.2.2),
not the tip of reef, which is actually what we want to test.

Pulling the daemon-base:latest-reef image shows it is running v18.2.2:
```
lflores:ceph$ podman run --rm -it daemon-base:latest-reef
[root@d4d9a0eb04be /]# ceph -v
ceph version 18.2.2 (531c0d11a1c5d39fbfe6aa8a521f023abf3bf3e2) reef (stable)
```

It is important that we test the tip of reef in case there is a fix there that we
want to pick up.

I referenced this commit to make the change: https://github.com/ceph/ceph/commit/d0381ed7e68a6de7c9ff1e4220698ed8eab28ac6

Signed-off-by: Laura Flores <lflores@ibm.com>

commit | commitdiff | tree

Laura Flores [Wed, 19 Jun 2024 21:07:31 +0000 (16:07 -0500)]

qa/suites/upgrade/telemetry-upgrade: add more ignorelist items and require_osd_release=squid

The warnings added to the ignorelist show up in the cluster log, but they are
expected during upgrades and should thus be ignored.

We also need to set require_osd_release=squid to avoid this warning:
```
cluster [WRN] Health check failed: all OSDs are running squid or later but require_osd_release < squid (OSD_UPGRADE_FINISHED)
```

Signed-off-by: Laura Flores <lflores@ibm.com>

commit | commitdiff | tree

Radoslaw Zarzynski [Fri, 21 Jun 2024 14:41:12 +0000 (16:41 +0200)]

Merge pull request #58178 from ljflores/wip-tracker-66556

Revert "test: reproduce flap peering bug (wip)"

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
Reviewed-by: Dan van der Ster <daniel.vanderster@cern.ch>

commit | commitdiff | tree

Ilya Dryomov [Fri, 21 Jun 2024 13:21:40 +0000 (15:21 +0200)]

Merge pull request #58175 from batrick/label-scripts

.github/labeler: add script label to script changes

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>

commit | commitdiff | tree

Zac Dover [Fri, 21 Jun 2024 09:17:54 +0000 (19:17 +1000)]

Merge pull request #55675 from douglasawh/patch-2

doc/src: invadvisable is not a word

Reviewed-by: Zac Dover <zac.dover@proton.me>

commit | commitdiff | tree

Aashish Sharma [Tue, 4 Jun 2024 12:29:34 +0000 (17:59 +0530)]

mgr/dashboard: Block -> Images -> Create form improvements

Fixes: https://tracker.ceph.com/issues/66348
Signed-off-by: Aashish Sharma <aasharma@redhat.com>

commit | commitdiff | tree

Zac Dover [Fri, 21 Jun 2024 05:49:01 +0000 (15:49 +1000)]

Merge pull request #58109 from zdover23/wip-doc-2024-06-18-rados-ops-stretch-mode

doc/rados: add stretch_rule workaround

Reviewed-by: Prashant D <pdhange@redhat.com>

commit | commitdiff | tree

Patrick Donnelly [Fri, 21 Jun 2024 01:50:25 +0000 (21:50 -0400)]

Merge PR #58174 into main

* refs/pull/58174/head:
script/backport-create-issue: update tag custom field

Reviewed-by: Ernesto Puerta <epuertat@redhat.com>

commit | commitdiff | tree

Patrick Donnelly [Tue, 18 Jun 2024 18:07:31 +0000 (14:07 -0400)]

doc: add documentation for `ceph auth rotate`

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>

commit | commitdiff | tree

Patrick Donnelly [Tue, 18 Jun 2024 17:47:29 +0000 (13:47 -0400)]

PendingReleaseNotes: add note for new `auth rotate`

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>

commit | commitdiff | tree

Patrick Donnelly [Tue, 18 Jun 2024 18:17:00 +0000 (14:17 -0400)]

qa: test `auth rotate`

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>

commit | commitdiff | tree

Patrick Donnelly [Tue, 18 Jun 2024 17:31:14 +0000 (13:31 -0400)]

mon/AuthMonitor: add `ceph auth rotate` command

Add command to rotate the permanent key of an entity. This avoids the need to
delete / recreate the key when it is compromised, lost, or just scheduled for
rotation.

Fixes: https://tracker.ceph.com/issues/66509
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>

commit | commitdiff | tree

Doug Whitfield [Tue, 20 Feb 2024 21:13:14 +0000 (15:13 -0600)]

doc/src: invadvisable is not a word

changed to inadvisable on line 4997

Signed-off-by: Doug Whitfield <douglasawh@gmail.com>

commit | commitdiff | tree

Laura Flores [Thu, 20 Jun 2024 18:52:07 +0000 (13:52 -0500)]

Revert "test: reproduce flap peering bug (wip)"

This reverts commit c89f9c4e80a4ac45ae5ea86ea7e9c41999299f27.

Note from Dan van der Ster:
This is a test that should succeed,
it definitely used to succeed back in the L/O days of Ceph. At some
point peering code changed and this behaviour regressed. In short,
an OSD goes down then comes up, and no objects were modified in the mean time.
There should be no degraded PGs in this case.

As this commit is currently breaking make check on all PRs, I think it should
be re-evaluated and merged so whatever fix is needed along with this test to
make it work are merged together.

Fixes: https://tracker.ceph.com/issues/66556
Signed-off-by: Laura Flores <lflores@ibm.com>

commit | commitdiff | tree

Patrick Donnelly [Thu, 20 Jun 2024 16:00:05 +0000 (12:00 -0400)]

.github/labeler: add script label to script changes

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>

commit | commitdiff | tree

Patrick Donnelly [Thu, 20 Jun 2024 15:43:17 +0000 (11:43 -0400)]

script/backport-create-issue: update tag custom field

The redmineup_tags was added and, apparently, the old "Tags" custom field with
id == 3 was deleted and then recreated with id == 31. This broke the script.
Update and refactor the id for that "Tags" custom field.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>

commit | commitdiff | tree

Adam King [Thu, 20 Jun 2024 16:06:18 +0000 (12:06 -0400)]

Merge pull request #57412 from adk3798/stray-laundry2

qa/cephadm: fix ignorelist of CEPHADM_STRAY_DAEMON for rados_api_tests

Reviewed-by: Laura Flores <lflores@ibm.com>

commit | commitdiff | tree

Ilya Dryomov [Thu, 20 Jun 2024 15:05:11 +0000 (17:05 +0200)]

Merge pull request #58055 from zmc/qemu-task-version

qa/tasks/qemu: Fix OS version comparison

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>

commit | commitdiff | tree

Casey Bodley [Thu, 20 Jun 2024 14:19:18 +0000 (15:19 +0100)]

Merge pull request #57079 from kchheda3/wip-lc-notification

rgw/lifecycle-notification: Do not block lc processing for notification errors.

Reviewed-by: Yuval Lifshitz <ylifshit@ibm.com>

commit | commitdiff | tree

Casey Bodley [Thu, 20 Jun 2024 14:04:47 +0000 (15:04 +0100)]

Merge pull request #57933 from samarahu/d4n_copy_test_fix

d4n/directory: Comment out `copy` unit tests and remove unnecessary `copy_object` implementation in filter

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>
Reviewed-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

Zac Dover [Thu, 20 Jun 2024 13:53:42 +0000 (23:53 +1000)]

Merge pull request #58147 from zdover23/wip-doc-2024-06-20-start-beginners-guide-wiki

doc/start: add wiki to Beginner's Guide

Reviewed-by: Cole Mitchell <cole.mitchell.ceph@gmail.com>

commit | commitdiff | tree

Nizamudeen A [Thu, 20 Jun 2024 12:48:24 +0000 (18:18 +0530)]

Merge pull request #58069 from rhcs-dashboard/multiple-alert-panel-dashboard

mgr/dashboard: fix alert broken for multiple alerts

Reviewed-by: afreen23 <NOT@FOUND>
Reviewed-by: ivoalmeida <NOT@FOUND>

commit | commitdiff | tree

Avan [Thu, 20 Jun 2024 12:29:05 +0000 (17:59 +0530)]

Merge pull request #51566 from rhcs-dashboard/fix-pool-application

mgr/dashboard: make application field for pool creation mandatory

Reviewed-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>
Reviewed-by: afreen23 <NOT@FOUND>
Reviewed-by: Ankush Behl <cloudbehl@gmail.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>
Reviewed-by: Pere Diaz Bou <pdiazbou@redhat.com>
Reviewed-by: Sunil Angadi <Sunil.Angadi@ibm.com>

commit | commitdiff | tree

Zac Dover [Thu, 20 Jun 2024 11:43:53 +0000 (21:43 +1000)]

doc/rados: followup to PR#58057

Incorporate Anthony D'Atri's suggestions in
https://github.com/ceph/ceph/pull/58057

Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>

commit | commitdiff | tree

Nizamudeen A [Thu, 20 Jun 2024 06:25:44 +0000 (11:55 +0530)]

Merge pull request #58068 from rhcs-dashboard/disable-telemetry-notifications-in-e2e

mgr/dashboard: disable telemetry notification in e2e

Reviewed-by: afreen23 <NOT@FOUND>
Reviewed-by: nmunet <NOT@FOUND>

commit | commitdiff | tree

Guillaume Abrioux [Thu, 20 Jun 2024 05:45:03 +0000 (07:45 +0200)]

Merge pull request #58138 from guits/fix-dmcrypt-check

ceph-volume: fix regex usage in `set_dmcrypt_no_workqueue`

commit | commitdiff | tree

Leonid S. Usov [Thu, 20 Jun 2024 04:56:09 +0000 (07:56 +0300)]

Merge pull request #57946 from ceph/wip-lusov-qdb-bootstrap-event

mds: QuiesceDbManager: mark next retry event during bootstrap

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>

commit | commitdiff | tree

Leonid S. Usov [Thu, 20 Jun 2024 04:55:36 +0000 (07:55 +0300)]

Merge pull request #57980 from ceph/wip-lusov-qdb-root-ttl

mds/quiesce-db: calculate_quiesce_map: aggregate quiesce roots' TTL as `max`

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>

commit | commitdiff | tree

Zac Dover [Thu, 20 Jun 2024 03:54:24 +0000 (13:54 +1000)]

doc/start: add wiki to Beginner's Guide

Add the Redmine Ceph Wiki to the Ceph Beginner's Guide.

Signed-off-by: Zac Dover <zac.dover@proton.me>

commit | commitdiff | tree

Xiubo Li [Wed, 19 Jun 2024 03:27:33 +0000 (11:27 +0800)]

common/TrackedOp: do not count the ops marked as nowarn

If an op is marked as nowarn then it won't be counted as the slow
requests, but currently it will count the initiated time when
iterating the inflight ops.

For example:

[WRN] : 1 slow requests, 1 included below; oldest blocked for > 38.764892 secs
[WRN] : slow request 33.875059 seconds old, received at 2024-06-17T14:14:34.228261+0000: client_request(client.78109915:11369251 mkdir #0x1008ecedea2/chk-89588 2024-06-17T14:14:34.097825+0000 caller_uid=1002960000, caller_gid=0{0,1002960000,}) currently failed to wrlock, waiting

The oldest blocked request is 38.764892 old, but the oldest slow
request reported is 33.875059 old.

Fixes: commit e4160d7e783 ("mds: don't report slow request for blocked filelock request")
Fixes: https://tracker.ceph.com/issues/66557
Signed-off-by: Xiubo Li <xiubli@redhat.com>

commit | commitdiff | tree

Patrick Donnelly [Wed, 19 Jun 2024 19:19:56 +0000 (15:19 -0400)]

Merge PR #55792 into main

* refs/pull/55792/head:
tools/cephfs: recover alternate_name of dentries from journal
qa: add test to verify recovery of alternate_name from journal
tools/cephfs/JournalTool: add some more debugging
tools/cephfs/JournalTool: remove extraneous 0x in debug output
mds: dump alternate_name to formatter
mds: add warning about encoding new fields

Reviewed-by: Christopher Hoffman <choffman@redhat.com>
Reviewed-by: Xiubo Li <xiubli@redhat.com>

commit | commitdiff | tree

Ilya Dryomov [Wed, 19 Jun 2024 18:17:15 +0000 (20:17 +0200)]

Merge pull request #57973 from idryomov/wip-66418

librbd: diff-iterate shouldn't crash on an empty byte range

Reviewed-by: Ramana Raja <rraja@redhat.com>
Reviewed-by: Mykola Golub <mgolub@suse.com>

commit | commitdiff | tree

Ilya Dryomov [Wed, 19 Jun 2024 18:15:35 +0000 (20:15 +0200)]

Merge pull request #58074 from idryomov/wip-66300

librbd: disallow group snap rollback if memberships don't match

Reviewed-by: Ramana Raja <rraja@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Wed, 19 Jun 2024 13:06:52 +0000 (15:06 +0200)]

ceph-volume: fix regex usage in `set_dmcrypt_no_workqueue`

- Updated the regex pattern to `r'(\d+\.?)+'` to more accurately
  capture version numbers.

- Replaced `re.match` with `re.search` to properly match the cryptsetup
  version in the output.

- `re.match` only checks for a match at the beginning of the string,
   while `re.search` looks for a match anywhere in the string.

This fix ensures that the function correctly retrieves the
cryptsetup version from the output.

Fixes: https://tracker.ceph.com/issues/66393
Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>

commit | commitdiff | tree

Nizamudeen A [Sun, 16 Jun 2024 12:46:52 +0000 (18:16 +0530)]

mgr/dashboard: fix alert broken for multiple alerts

After carbon the alert panel was broken when there are multiple alerts
present (telemetry, motd, password expiration).

Applying carbon banner to the existing alert banner

Fixes: https://tracker.ceph.com/issues/66512
Signed-off-by: Nizamudeen A <nia@redhat.com>

commit | commitdiff | tree

Adam King [Wed, 19 Jun 2024 13:17:51 +0000 (09:17 -0400)]

Merge pull request #57294 from phlogistonjohn/jjm-smb-free-customize

mgr/smb: add custom config options to share and cluster resources

Reviewed-by: Adam King <adking@redhat.com>

commit | commitdiff | tree

Casey Bodley [Wed, 19 Jun 2024 12:14:30 +0000 (13:14 +0100)]

Merge pull request #57430 from tchaikov/wip-rgw-driver-posi

rgw/driver/posix: s/is_value()/success()/

Reviewed-by: Matt Benjamin <mbenjamin@redhat.com>

commit | commitdiff | tree

Matan Breizman [Wed, 19 Jun 2024 10:21:25 +0000 (13:21 +0300)]

Merge pull request #57691 from xxhdx1985126/wip-66216

crimson/osd/osd_operations/client_request_common: `PeeringState::needs_recovery()` may fail if the object is under backfill

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Matan Breizman <mbreizma@redhat.com>

commit | commitdiff | tree

Afreen Misbah [Fri, 14 Jun 2024 05:21:47 +0000 (10:51 +0530)]

mgr/dashboard: fix service page e2e tests

- service page now uses defaults value for the placement count due to which mds test failing
- in test we pass "1" while "2" which is the default count for mds is already populated, making it 21 and causing unable to create mds service

Fixes: https://tracker.ceph.com/issues/66540
Signed-off-by: Afreen Misbah <afreen23.git@gmail.com>
(cherry picked from commit 5a718229dc43953f140a16342f19523a1c075863)

commit | commitdiff | tree

Nizamudeen A [Wed, 19 Jun 2024 06:49:56 +0000 (12:19 +0530)]

Merge pull request #53832 from rhcs-dashboard/fix-cephadm-daemon-failed-alert

mgr/dashboard: Show which daemons failed in CEPHADM_FAILED_DAEMON healthcheck

Reviewed-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>
Reviewed-by: Ankush Behl <cloudbehl@gmail.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>

commit | commitdiff | tree

Yingxin [Wed, 19 Jun 2024 03:10:44 +0000 (11:10 +0800)]

Merge pull request #58083 from myoungwon/wip-seastore-rbm-stat

crimson/os/seastore: add writer level stats to RBM

Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Zack Cerza [Fri, 14 Jun 2024 19:37:16 +0000 (13:37 -0600)]

qa/tasks/qemu: Fix OS version comparison

See: https://sentry.ceph.com/share/issue/21ed88d705854238bdafbf6711e795ee/
They're strings, not floats.
This surfaced as a result of https://github.com/ceph/teuthology/pull/1953

Signed-off-by: Zack Cerza <zack@redhat.com>

commit | commitdiff | tree

Anthony D'Atri [Wed, 19 Jun 2024 00:50:21 +0000 (20:50 -0400)]

Merge pull request #58123 from gsoneill/clarify-ec-architecture

doc: clarify availability vs integrity

commit | commitdiff | tree

Gregory O'Neill [Tue, 18 Jun 2024 21:17:00 +0000 (17:17 -0400)]

doc: clarify availability vs integrity

commit | commitdiff | tree

Patrick Donnelly [Wed, 12 Jun 2024 16:09:11 +0000 (12:09 -0400)]

mds: initialize epoch for quiescedb

Fixes: https://tracker.ceph.com/issues/66449
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>

commit | commitdiff | tree

Ilya Dryomov [Tue, 18 Jun 2024 14:18:35 +0000 (16:18 +0200)]

Merge pull request #58112 from nbalacha/wip-doc-typo-180624

doc/rbd: fix typos in NVMe-oF docs

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>

commit | commitdiff | tree

Matan Breizman [Tue, 18 Jun 2024 13:43:50 +0000 (16:43 +0300)]

Merge pull request #57978 from athanatos/sjust/wip-63647-snaptrim-pipeline

crimson: simplify snaptrim operation pipline usage

Reviewed-by: Matan Breizman <mbreizma@redhat.com>

commit | commitdiff | tree

Nizamudeen A [Sun, 16 Jun 2024 11:15:43 +0000 (16:45 +0530)]

mgr/dashboard: disable telemetry notification in e2e

After the new UI shell, the telemetry notifications are displayed on the
bottom side, which kind of exists on top of some buttons and that causes
some cypress failures while clicking Submit button in the form.

I am disabling the notification itself in e2e because its not checked in
the e2e at all so we can afford to disable it. Incase we decide to add
an e2e for the notification, we can just toggle it on later on and we
can check it

Fixes: https://tracker.ceph.com/issues/66506
Signed-off-by: Nizamudeen A <nia@redhat.com>

commit | commitdiff | tree

Ilya Dryomov [Tue, 18 Jun 2024 13:23:03 +0000 (15:23 +0200)]

Merge pull request #58100 from petrutlucian94/conn_timeout_test

ceph_test_rados_api_misc: adjust LibRadosMiscConnectFailure.ConnectTimeout timeout

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>

commit | commitdiff | tree

Aashish Sharma [Thu, 5 Oct 2023 05:38:30 +0000 (11:08 +0530)]

mgr/dashboard: Show which daemons failed in CEPHADM_FAILED_DAEMON
healthcheck

Fixes: https://tracker.ceph.com/issues/63792
Signed-off-by: Aashish Sharma <aasharma@redhat.com>

commit | commitdiff | tree

Xuehan Xu [Fri, 24 May 2024 09:30:41 +0000 (17:30 +0800)]

crimson/osd/osd_operations/client_request_common: `PeeringState::needs_recovery()`
may fail if the object is under backfill

Meanwhile, set the correct version for backfill:
From Classic:
```
    if (is_degraded_or_backfilling_object(head)) {
      if (can_backoff && g_conf()->osd_backoff_on_degraded) {
        add_backoff(session, head, head);
        maybe_kick_recovery(head);
      }
```

```
void PrimaryLogPG::maybe_kick_recovery(
  const hobject_t &soid)
{
  eversion_t v;
  bool work_started = false;
  if (!recovery_state.get_missing_loc().needs_recovery(soid, &v))
    return;
```

Fixes: https://tracker.ceph.com/issues/66216
Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
Signed-off-by: Matan Breizman <mbreizma@redhat.com>

commit | commitdiff | tree

Matan Breizman [Tue, 18 Jun 2024 10:56:00 +0000 (13:56 +0300)]

Merge pull request #57352 from Matan-B/wip-crimson-conf-docs

doc/dev/crimson: CPU cores allocation

Reviewed-by: Zac Dover <zac.dover@proton.me>
Reviewed-by: Anthony D'Atri <anthonyeleven@users.noreply.github.com>
Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

N Balachandran [Tue, 18 Jun 2024 10:39:51 +0000 (16:09 +0530)]

doc/rbd: fix typos in NVMe-oF docs.

Signed-off-by: N Balachandran <nibalach@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Tue, 18 Jun 2024 09:34:47 +0000 (11:34 +0200)]

Merge pull request #58086 from guits/fix_cv-migrate-tests

ceph-volume: fix TestMigrate tests

commit | commitdiff | tree

myoungwon oh [Tue, 18 Jun 2024 07:10:38 +0000 (07:10 +0000)]

crimson/os/seastore: replace record_group_data_bytes with data_bytes

Signed-off-by: Myoungwon Oh <myoungwon.oh@samsung.com>

commit | commitdiff | tree

myoungwon oh [Mon, 17 Jun 2024 10:12:50 +0000 (10:12 +0000)]

crimson/os/seastore: add writer level stats to RBM

Signed-off-by: Myoungwon Oh <myoungwon.oh@samsung.com>

commit | commitdiff | tree

avanthakkar [Thu, 18 May 2023 12:14:50 +0000 (17:44 +0530)]

mgr/dashboard: make application field for pool creation mandatory

Fixes: https://tracker.ceph.com/issues/61238
Signed-off-by: avanthakkar <avanjohn@gmail.com>

commit | commitdiff | tree

Lucian Petrut [Mon, 17 Jun 2024 18:37:11 +0000 (18:37 +0000)]

ceph_test_rados_api_misc: adjust LibRadosMiscConnectFailure.ConnectTimeout timeout

The test sets a 2 second connection timeout and asserts that the
attempt takes less than 4 seconds.

Under load, the asserted timeout can be exceeded. In particular,
this affects the Windows CI job, which uses a VM with a reduced
amount of resources.

    [ RUN      ] LibRadosMiscConnectFailure.ConnectTimeout
    /home/ubuntu/ceph/src/test/librados/misc.cc:87: Failure
    Expected: (dur) < (utime_t(4, 0)), actual: 5.987712 vs 4.000000

We'll adjust the timeouts, using a 5s configured timeout, asserting
that the connection attempt doesn't take more than 15s.

Fixes: https://tracker.ceph.com/issues/66534
Signed-off-by: Lucian Petrut <lpetrut@cloudbasesolutions.com>

commit | commitdiff | tree

Matan Breizman [Wed, 8 May 2024 08:23:30 +0000 (11:23 +0300)]

doc/dev/crimson: CPU cores allocation

CPU cores allocation and
add crimson-smp vstart flag

Signed-off-by: Matan Breizman <mbreizma@redhat.com>
Signed-off-by: Matan Breizman <mbreizma@redhat.com>

commit | commitdiff | tree

Zac Dover [Tue, 18 Jun 2024 05:22:41 +0000 (15:22 +1000)]

doc/rados: add stretch_rule workaround

Add a method for defining a CRUSH rule that returns the actual value of
the total available size.

Fixes: https://tracker.ceph.com/issues/56650
Signed-off-by: Zac Dover <zac.dover@proton.me>

commit | commitdiff | tree

Ronen Friedman [Tue, 18 Jun 2024 04:22:35 +0000 (07:22 +0300)]

Merge pull request #58003 from ronen-fr/wip-rf-targets-j9

osd/scrub: move more of the scrub initiation login into the scrubber

Reviewed-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Zac Dover [Tue, 18 Jun 2024 01:12:02 +0000 (11:12 +1000)]

Merge pull request #58091 from zdover23/wip-doc-2024-06-17-dev-developer-guide-essentials

doc/dev: link to ceph.io leads list

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Dan Mick <dmick@redhat.com>

commit | commitdiff | tree

Laura Flores [Mon, 17 Jun 2024 18:23:09 +0000 (13:23 -0500)]

Merge pull request #57988 from mohit84/issue_62934

unittest_osdmap aborted during OSDMapTest.BUG_42485

commit | commitdiff | tree

Samuel Just [Mon, 17 Jun 2024 18:08:53 +0000 (11:08 -0700)]

Merge pull request #57147 from xxhdx1985126/wip-65696

crimson/osd/pg_recovery: skip unfound objects when recovering the primary

Reviewed-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Radoslaw Zarzynski [Mon, 17 Jun 2024 17:50:45 +0000 (19:50 +0200)]

Merge pull request #40849 from dvanders/drainpeeringbug

test: test osd drain flap peering bug

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>

commit | commitdiff | tree

Adam King [Mon, 17 Jun 2024 17:15:15 +0000 (13:15 -0400)]

Merge pull request #57955 from phlogistonjohn/jjm-fix-issue66389-apparmor

cephadm: fix apparmor profiles with spaces in the names

Reviewed-by: Adam King <adking@redhat.com>

commit | commitdiff | tree

John Mulligan [Thu, 9 May 2024 18:43:24 +0000 (14:43 -0400)]

mgr/smb: add unit tests for custom_options funcs

Signed-off-by: John Mulligan <jmulligan@redhat.com>

commit | commitdiff | tree

John Mulligan [Thu, 2 May 2024 21:05:44 +0000 (17:05 -0400)]

doc: add subsections on custom share and custom global options

Document the options, the risk of using it, and the "magic key-value"
pair needed to enable it.

Signed-off-by: John Mulligan <jmulligan@redhat.com>

commit | commitdiff | tree

John Mulligan [Thu, 2 May 2024 20:41:47 +0000 (16:41 -0400)]

mgr/smb: pass custom options to samba(cc) configuration

Pass the custom share and/or global option key-value pairs to the
generated configurations.

Signed-off-by: John Mulligan <jmulligan@redhat.com>

commit | commitdiff | tree

John Mulligan [Thu, 2 May 2024 20:41:37 +0000 (16:41 -0400)]

mgr/smb: add custom config options to share and cluster resources

Allow devs/testers/experimenters to add custom smb config params to our
managed shares or clusters. Use at your own risk.

Signed-off-by: John Mulligan <jmulligan@redhat.com>

commit | commitdiff | tree

John Mulligan [Thu, 2 May 2024 20:41:15 +0000 (16:41 -0400)]

mgr/smb: add validation funcs for custom parameter dictionaries

Custom parameter dictionaries will be used to pass options to samba
config without much filtering and control by the smb mgr module. Because
the risks that it entails the user must "agree" that using these options
can break their setup with a "magic" key-value pair.
This pair will be filtered out of the eventual data passed to samba.

Signed-off-by: John Mulligan <jmulligan@redhat.com>

commit | commitdiff | tree

John Mulligan [Thu, 2 May 2024 20:40:53 +0000 (16:40 -0400)]

mgr/smb: convert failures to create a valid resource into error results

Convert failures to create a valid resource into error results that can
be reported back to the caller like the other error result types
generated by actually attempting to apply the resources.

Signed-off-by: John Mulligan <jmulligan@redhat.com>

commit | commitdiff | tree

John Mulligan [Thu, 2 May 2024 20:40:41 +0000 (16:40 -0400)]

mgr/smb: use correct module when referring to result classes

The result classes are not based in handler.py (any more) but the
module.py code was not updated to reflect that. Update it now.

Signed-off-by: John Mulligan <jmulligan@redhat.com>

commit | commitdiff | tree

John Mulligan [Thu, 2 May 2024 20:39:34 +0000 (16:39 -0400)]

mgr/smb: allow ResultGroup to take an initial list of results

Can save a line of code later.

Signed-off-by: John Mulligan <jmulligan@redhat.com>

commit | commitdiff | tree

John Mulligan [Thu, 2 May 2024 20:33:01 +0000 (16:33 -0400)]

mgr/smb: add result type for reporting resource validation errors

This error type can not take a real resource object because the resource
object could not be constructed from the data. Use the raw data for
reporting the error result.

Signed-off-by: John Mulligan <jmulligan@redhat.com>

commit | commitdiff | tree

John Mulligan [Thu, 2 May 2024 20:32:35 +0000 (16:32 -0400)]

mgr/smb: add resource construction error handling exception

Use error hook function to wrap plain ValueError instances to an
InvalidResourcError. This error retains the simplified data being
(re)constructed into an object and thus will be used later to generate
proper error results.

Signed-off-by: John Mulligan <jmulligan@redhat.com>

commit | commitdiff | tree

John Mulligan [Thu, 2 May 2024 20:28:46 +0000 (16:28 -0400)]

mgr/smb: add error handling & conversion hook to resourcelib

Add a method to supply the Resource instances with a callback that
can handle errors that occur during object construction from simplified
data. If the callback is not set, the exceptions are handled as usual.

Signed-off-by: John Mulligan <jmulligan@redhat.com>

commit | commitdiff | tree

Adam King [Mon, 17 Jun 2024 15:16:53 +0000 (11:16 -0400)]

Merge pull request #58054 from phlogistonjohn/jjm-cpatch-py-vers

script/cpatch.py: add support for multiple valid python versions

Reviewed-by: Adam King <adking@redhat.com>

commit | commitdiff | tree

Adam King [Mon, 17 Jun 2024 15:16:02 +0000 (11:16 -0400)]

Merge pull request #58053 from rkhudov/src-cephadm-box_remove_unused_imports

src/cephadm/box: remove unused imports

Reviewed-by: Adam King <adking@redhat.com>

commit | commitdiff | tree

Adam King [Mon, 17 Jun 2024 15:14:38 +0000 (11:14 -0400)]

Merge pull request #57293 from phlogistonjohn/jjm-smb-create-only

smb: have create cluster and create share commands only create

Reviewed-by: Adam King <adking@redhat.om>
Reviewed-by: Avan Thakkar <athakkar@redhat.com>

commit | commitdiff | tree

Zac Dover [Mon, 17 Jun 2024 14:39:10 +0000 (00:39 +1000)]

doc/dev: link to ceph.io leads list

Remove the docs.ceph.com list of team leads, and link to the list of
leads on ceph.io.

Fixes: https://tracker.ceph.com/issues/66392
Signed-off-by: Zac Dover <zac.dover@proton.me>

commit | commitdiff | tree

Guillaume Abrioux [Mon, 17 Jun 2024 13:14:56 +0000 (15:14 +0200)]

ceph-volume: tests shouldn't require lvm2 to pass

some tests are currently failing when `lvm2` isn't installed:

```
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestList::test_empty_device_json_zero_exit_status - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestList::test_empty_device_zero_exit_status - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestFullReport::test_no_ceph_lvs - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestFullReport::test_ceph_data_lv_reported - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestFullReport::test_ceph_journal_lv_reported - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestFullReport::test_ceph_wal_lv_reported - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestFullReport::test_physical_2nd_device_gets_reported[journal] - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestFullReport::test_physical_2nd_device_gets_reported[db] - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestFullReport::test_physical_2nd_device_gets_reported[wal] - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestSingleReport::test_not_a_ceph_lv - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestSingleReport::test_report_a_ceph_lv - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestSingleReport::test_report_a_ceph_journal_device - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestSingleReport::test_report_by_osd_id_for_just_block_dev - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestSingleReport::test_report_by_osd_id_for_just_data_dev - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestSingleReport::test_report_by_osd_id_for_just_block_wal_and_db_dev - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestSingleReport::test_report_by_osd_id_for_data_and_journal_dev - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestSingleReport::test_report_by_nonexistent_osd_id - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestSingleReport::test_report_a_ceph_lv_with_no_matching_devices - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_migrate.py::TestNew::test_newdb_not_target_lvm - FileNotFoundError: [Errno 2] No such file or directory: 'lvs'
FAILED ceph_volume/tests/devices/lvm/test_zap.py::TestEnsureAssociatedLVs::test_nothing_is_found - FileNotFoundError: [Errno 2] No such file or directory: 'lvs'
FAILED ceph_volume/tests/devices/lvm/test_zap.py::TestEnsureAssociatedLVs::test_multiple_journals_are_found - FileNotFoundError: [Errno 2] No such file or directory: 'lvs'
FAILED ceph_volume/tests/devices/lvm/test_zap.py::TestEnsureAssociatedLVs::test_multiple_dbs_are_found - FileNotFoundError: [Errno 2] No such file or directory: 'lvs'
FAILED ceph_volume/tests/devices/lvm/test_zap.py::TestEnsureAssociatedLVs::test_multiple_wals_are_found - FileNotFoundError: [Errno 2] No such file or directory: 'lvs'
FAILED ceph_volume/tests/devices/lvm/test_zap.py::TestEnsureAssociatedLVs::test_multiple_backing_devs_are_found - FileNotFoundError: [Errno 2] No such file or directory: 'lvs'
FAILED ceph_volume/tests/objectstore/test_lvmbluestore.py::TestLvmBlueStore::test_activate_all_osd_is_active - FileNotFoundError: [Errno 2] No such file or directory: 'lvs'
```

Everything should be actually mocked. This commit addresses that.

Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>

commit | commitdiff | tree

Guillaume Abrioux [Mon, 17 Jun 2024 12:02:11 +0000 (14:02 +0200)]

ceph-volume: fix TestMigrate tests

ae5ef432845 broke some unit tests.
This commit fixes that.

Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>

commit | commitdiff | tree

Igor Fedotov [Mon, 17 Jun 2024 11:02:13 +0000 (14:02 +0300)]

Merge pull request #58079 from jrchyang/main

os/bluestore: corrected the description of the bluefs stats output

Reviewed-by: Igor Fedotov <igor.fedotov@croit.io>

commit | commitdiff | tree

Venky Shankar [Mon, 17 Jun 2024 09:27:52 +0000 (14:57 +0530)]

Merge PR #57991 into main

* refs/pull/57991/head:
qa: upgrade sub-suite upgraded_client from from n-1|n-2 releases
qa: upgrade sub-suite nofs from n-1 and n-2 releases
qa: use supported releases for featureful_client

Reviewed-by: Leonid Usov <leonid.usov@ibm.com>

Unnamed repository; edit this file 'description' to name the repository.