Samuel Just [Wed, 19 Jun 2024 04:10:34 +0000 (21:10 -0700)]
crimson/.../object_context: drop recovery_read_marker
This doesn't seem to serve a purpose with current crimson. classic
uses ObjectState::recovery_read_marker to indicate that backfill
should be requeued upon wakeup, but that hasn't been necessary so
far in crimson. We can reintroduce this if it becomes useful.
Samuel Just [Thu, 13 Jun 2024 00:47:08 +0000 (00:47 +0000)]
crimson/.../tri_mutex: lock() methods return normal future
f63d76a2 modified the lock() variants on tri_mutex so that the obc
loading pathway wouldn't invoke .then() on a returned future known
statically to be ready. Now that the loading pathway uses demotion
mechanisms that cannot block and do not return futures, we no longer
have any users like that and can drop the extra std::nullopt
possibility.
In a larger sense, if lock() *can* return a non-ready future in a
particular pathway, there's no semantic difference between returning
std::optional<future<>> and future<>: the caller still has to deal with
a possible non-ready future even when std::nullopt is also possible. If
the pathway can be demonstrated statically to be non-blocking, as with
the obc loading mechanism, we really want a mechanism that obviously
cannot block rather than relying on a mechanism with a return signature
of std::optional<future<>> to return std::nullopt.
Because we just constructed the obc, we know that we can get an
exclusive lock without blocking. Introduce
ObjectContext::load_then_with_lock to take an exclusive lock
unconditionally, load, downgrade (which we also know must be safe), and
then run the passed function.
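To make the shape of this concrete, here is a rough Python analogue of
load_then_with_lock (a sketch only; the real code is C++/Seastar, and all
names below are illustrative rather than crimson's actual API):
```
class ToyRWLock:
    """Toy lock with non-blocking exclusive acquisition and
    exclusive-to-shared demotion, mirroring the tri_mutex idea."""
    def __init__(self):
        self.writer = False
        self.readers = 0

    def try_lock_exclusive(self):
        # Non-blocking: succeeds only when nobody holds the lock.
        if self.writer or self.readers:
            return False
        self.writer = True
        return True

    def demote(self):
        # Exclusive -> shared; cannot block since we already hold exclusive.
        assert self.writer
        self.writer = False
        self.readers += 1

    def unlock_shared(self):
        self.readers -= 1

async def load_then_with_lock(obc, loader, func):
    # The obc was just constructed, so this non-blocking acquire must succeed.
    assert obc.lock.try_lock_exclusive()
    await loader(obc)   # load the object state
    obc.lock.demote()   # downgrade, also known to be safe here
    try:
        return await func(obc)
    finally:
        obc.lock.unlock_shared()
```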
Laura Flores [Wed, 19 Jun 2024 21:57:45 +0000 (16:57 -0500)]
qa/suites/upgrade/telemetry-upgrade/reef-x: update how cephadm is pulled and change image reference
Update how cephadm is pulled:
`cephadm_git_url` and `cephadm_branch` are used in releases older than reef
to install cephadm; both of these keys are needed to install it from the
GitHub repo.
However, from reef onward, the compiled zipapp cephadm needs to be pulled
differently from the old single Python script `cephadm` of earlier releases.
Laura Flores [Wed, 19 Jun 2024 21:07:31 +0000 (16:07 -0500)]
qa/suites/upgrade/telemetry-upgrade: add more ignorelist items and require_osd_release=squid
The warnings added to the ignorelist show up in the cluster log, but they are
expected during upgrades and should thus be ignored.
We also need to set `require_osd_release=squid` (e.g. with `ceph osd require-osd-release squid`) to avoid this warning:
```
cluster [WRN] Health check failed: all OSDs are running squid or later but require_osd_release < squid (OSD_UPGRADE_FINISHED)
```
Note from Dan van der Ster:
This is a test that should succeed; it
definitely used to succeed back in the L/O days of Ceph. At some
point peering code changed and this behaviour regressed. In short,
an OSD goes down then comes up, and no objects were modified in the meantime.
There should be no degraded PGs in this case.
As this commit currently breaks make check on all PRs, I think it should
be re-evaluated and merged together with whatever fix is needed to make
the test pass.
Fixes: https://tracker.ceph.com/issues/66556
Signed-off-by: Laura Flores <lflores@ibm.com>
Patrick Donnelly [Thu, 20 Jun 2024 15:43:17 +0000 (11:43 -0400)]
script/backport-create-issue: update tag custom field
The redmineup_tags plugin was added and, apparently, the old "Tags" custom
field with id == 3 was deleted and then recreated with id == 31. This broke
the script. Update and refactor the id for that "Tags" custom field.
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Patrick Donnelly [Wed, 19 Jun 2024 19:19:56 +0000 (15:19 -0400)]
Merge PR #55792 into main
* refs/pull/55792/head:
tools/cephfs: recover alternate_name of dentries from journal
qa: add test to verify recovery of alternate_name from journal
tools/cephfs/JournalTool: add some more debugging
tools/cephfs/JournalTool: remove extraneous 0x in debug output
mds: dump alternate_name to formatter
mds: add warning about encoding new fields
Reviewed-by: Christopher Hoffman <choffman@redhat.com>
Reviewed-by: Xiubo Li <xiubli@redhat.com>
Afreen Misbah [Fri, 14 Jun 2024 05:21:47 +0000 (10:51 +0530)]
mgr/dashboard: fix service page e2e tests
- the service page now uses a default value for the placement count, which made the mds test fail
- the test enters "1" while "2", the default count for mds, is already populated, making the field "21" and causing the mds service creation to fail
Zack Cerza [Fri, 14 Jun 2024 19:37:16 +0000 (13:37 -0600)]
qa/tasks/qemu: Fix OS version comparison
See: https://sentry.ceph.com/share/issue/21ed88d705854238bdafbf6711e795ee/
They're strings, not floats.
This surfaced as a result of https://github.com/ceph/teuthology/pull/1953
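For illustration (a sketch of the pitfall, not the actual teuthology code):
string comparison is lexicographic, and floats don't work either, so version
strings should be compared component-wise as integers:
```
assert "10.04" < "9.10"               # lexicographic: wrong order
assert float("8.10") < float("8.9")   # floats are no better: 8.10 becomes 8.1

def version_tuple(v):
    # compare versions component-wise as integers
    return tuple(int(part) for part in v.split("."))

assert version_tuple("10.04") > version_tuple("9.10")  # correct
assert version_tuple("8.10") > version_tuple("8.9")    # correct
```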
Nizamudeen A [Sun, 16 Jun 2024 11:15:43 +0000 (16:45 +0530)]
mgr/dashboard: disable telemetry notification in e2e
After the new UI shell, the telemetry notifications are displayed at the
bottom, where they sit on top of some buttons and cause Cypress failures
when clicking the Submit button in a form.
I am disabling the notification itself in e2e because it is not checked in
the e2e at all, so we can afford to disable it. In case we decide to add an
e2e test for the notification, we can just toggle it back on and check it
then.
Fixes: https://tracker.ceph.com/issues/66506
Signed-off-by: Nizamudeen A <nia@redhat.com>
Xuehan Xu [Fri, 24 May 2024 09:30:41 +0000 (17:30 +0800)]
crimson/osd/osd_operations/client_request_common: `PeeringState::needs_recovery()` may fail if the object is under backfill
Meanwhile, set the correct version for backfill:
From Classic:
```
if (is_degraded_or_backfilling_object(head)) {
if (can_backoff && g_conf()->osd_backoff_on_degraded) {
add_backoff(session, head, head);
maybe_kick_recovery(head);
}
```
John Mulligan [Thu, 2 May 2024 20:41:15 +0000 (16:41 -0400)]
mgr/smb: add validation funcs for custom parameter dictionaries
Custom parameter dictionaries will be used to pass options to the samba
config without much filtering or control by the smb mgr module. Because of
the risks this entails, the user must "agree", via a "magic" key-value
pair, that using these options can break their setup.
This pair will be filtered out of the eventual data passed to samba.
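A minimal sketch of what such validation could look like (the key name and
helper names below are hypothetical, not the smb module's actual API):
```
_MAGIC_KEY = "_allow_customization"     # hypothetical magic key
_MAGIC_VALUE = "i-take-responsibility"  # hypothetical opt-in value

def check_custom_options(opts):
    # Reject custom samba options unless the user has opted in explicitly.
    if opts.get(_MAGIC_KEY) != _MAGIC_VALUE:
        raise ValueError(
            "custom samba options can break your setup; "
            "set %r to acknowledge the risk" % _MAGIC_KEY)

def clean_custom_options(opts):
    # Drop the magic pair so it never reaches the samba config.
    return {k: v for k, v in opts.items() if k != _MAGIC_KEY}
```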
Signed-off-by: John Mulligan <jmulligan@redhat.com>
John Mulligan [Thu, 2 May 2024 20:40:53 +0000 (16:40 -0400)]
mgr/smb: convert failures to create a valid resource into error results
Convert failures to create a valid resource into error results that can
be reported back to the caller like the other error result types
generated by actually attempting to apply the resources.
Signed-off-by: John Mulligan <jmulligan@redhat.com>
John Mulligan [Thu, 2 May 2024 20:33:01 +0000 (16:33 -0400)]
mgr/smb: add result type for reporting resource validation errors
This error type cannot take a real resource object because the resource
object could not be constructed from the data. Use the raw data for
reporting the error result.
Signed-off-by: John Mulligan <jmulligan@redhat.com>
John Mulligan [Thu, 2 May 2024 20:32:35 +0000 (16:32 -0400)]
mgr/smb: add resource construction error handling exception
Use the error hook function to wrap plain ValueError instances into an
InvalidResourceError. This error retains the simplified data being
(re)constructed into an object and thus will be used later to generate
proper error results.
Signed-off-by: John Mulligan <jmulligan@redhat.com>
John Mulligan [Thu, 2 May 2024 20:28:46 +0000 (16:28 -0400)]
mgr/smb: add error handling & conversion hook to resourcelib
Add a method to supply the Resource instances with a callback that
can handle errors that occur during object construction from simplified
data. If the callback is not set, the exceptions are handled as usual.
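Taken together, the flow described in these commits might look roughly like
this (illustrative names only; the real resourcelib API may differ):
```
class InvalidResourceError(ValueError):
    def __init__(self, msg, data):
        super().__init__(msg)
        self.data = data  # keep the simplified input for the error result

def wrap_value_error(exc, data):
    # Error hook: wrap plain ValueErrors so the raw data survives
    # to the result-reporting stage.
    if isinstance(exc, ValueError) and not isinstance(exc, InvalidResourceError):
        return InvalidResourceError(str(exc), data)
    return exc

class Resource:
    on_error = None  # optional callback; when unset, errors raise as usual

    @classmethod
    def from_simplified(cls, data):
        try:
            if "name" not in data:
                raise ValueError("resource requires a 'name'")
            obj = cls()
            obj.name = data["name"]
            return obj
        except Exception as exc:
            if cls.on_error is not None:
                raise cls.on_error(exc, data) from exc
            raise

Resource.on_error = staticmethod(wrap_value_error)
```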
Signed-off-by: John Mulligan <jmulligan@redhat.com>
Some tests are currently failing when `lvm2` isn't installed:
```
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestList::test_empty_device_json_zero_exit_status - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestList::test_empty_device_zero_exit_status - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestFullReport::test_no_ceph_lvs - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestFullReport::test_ceph_data_lv_reported - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestFullReport::test_ceph_journal_lv_reported - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestFullReport::test_ceph_wal_lv_reported - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestFullReport::test_physical_2nd_device_gets_reported[journal] - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestFullReport::test_physical_2nd_device_gets_reported[db] - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestFullReport::test_physical_2nd_device_gets_reported[wal] - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestSingleReport::test_not_a_ceph_lv - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestSingleReport::test_report_a_ceph_lv - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestSingleReport::test_report_a_ceph_journal_device - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestSingleReport::test_report_by_osd_id_for_just_block_dev - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestSingleReport::test_report_by_osd_id_for_just_data_dev - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestSingleReport::test_report_by_osd_id_for_just_block_wal_and_db_dev - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestSingleReport::test_report_by_osd_id_for_data_and_journal_dev - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestSingleReport::test_report_by_nonexistent_osd_id - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestSingleReport::test_report_a_ceph_lv_with_no_matching_devices - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_migrate.py::TestNew::test_newdb_not_target_lvm - FileNotFoundError: [Errno 2] No such file or directory: 'lvs'
FAILED ceph_volume/tests/devices/lvm/test_zap.py::TestEnsureAssociatedLVs::test_nothing_is_found - FileNotFoundError: [Errno 2] No such file or directory: 'lvs'
FAILED ceph_volume/tests/devices/lvm/test_zap.py::TestEnsureAssociatedLVs::test_multiple_journals_are_found - FileNotFoundError: [Errno 2] No such file or directory: 'lvs'
FAILED ceph_volume/tests/devices/lvm/test_zap.py::TestEnsureAssociatedLVs::test_multiple_dbs_are_found - FileNotFoundError: [Errno 2] No such file or directory: 'lvs'
FAILED ceph_volume/tests/devices/lvm/test_zap.py::TestEnsureAssociatedLVs::test_multiple_wals_are_found - FileNotFoundError: [Errno 2] No such file or directory: 'lvs'
FAILED ceph_volume/tests/devices/lvm/test_zap.py::TestEnsureAssociatedLVs::test_multiple_backing_devs_are_found - FileNotFoundError: [Errno 2] No such file or directory: 'lvs'
FAILED ceph_volume/tests/objectstore/test_lvmbluestore.py::TestLvmBlueStore::test_activate_all_osd_is_active - FileNotFoundError: [Errno 2] No such file or directory: 'lvs'
```
Everything should actually be mocked. This commit addresses that.
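A sketch of the general approach (the commit's actual fixtures may differ):
stub ceph-volume's process helper so the tests never spawn `pvs`/`lvs`:
```
from ceph_volume import process

def fake_call(command, **kw):
    # pretend the lvm2 tool ran and produced no output;
    # process.call returns (stdout_lines, stderr_lines, returncode)
    return ([], [], 0)

def test_no_ceph_lvs_sketch(monkeypatch):
    monkeypatch.setattr(process, "call", fake_call)
    # ... exercise the listing code; 'pvs' is never executed now ...
```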
Venky Shankar [Mon, 17 Jun 2024 09:27:52 +0000 (14:57 +0530)]
Merge PR #57991 into main
* refs/pull/57991/head:
qa: upgrade sub-suite upgraded_client from n-1|n-2 releases
qa: upgrade sub-suite nofs from n-1 and n-2 releases
qa: use supported releases for featureful_client
Ilya Dryomov [Fri, 14 Jun 2024 12:04:39 +0000 (14:04 +0200)]
librbd: disallow group snap rollback if memberships don't match
Before proceeding with group rollback, ensure that the set of images
that took part in the group snapshot matches the set of images that are
currently part of the group. Otherwise, because we preserve affected
snapshots when an image is removed from the group, data loss can ensue
where an image gets rolled back while part of another group or not part
of any group but long repurposed for something else.
Similarly, ensure that the group snapshot is complete.
After the rollback assert in TestGroup.add_snapshot{,PP} was made
meaningful in the previous commit, it fails in mock tests which means
that rollback has never been exercised properly...
While I confess to not following the file->snap_id == CEPH_NOSNAP branch,
especially given how the file variable is shadowed, it's pretty clear that
get_snap_read() doesn't belong here -- the snapshot selected for reads
has nothing to do with rollback. Replacing it with the rollback snap
ID makes sense of the other branches and makes the tests in question
pass.
Ilya Dryomov [Thu, 13 Jun 2024 14:24:43 +0000 (16:24 +0200)]
test/librbd: make rollback in TestGroup.add_snapshot{,PP} meaningful
The rollback assert doesn't really test anything -- because orig_data
and test_data are written to non-overlapping areas, the test would pass
even if rbd_group_snap_rollback() does nothing (i.e. rollback isn't
performed) as long as the call returns 0.
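For illustration, a toy Python version of the flaw (the real test is C++):
with non-overlapping writes, even a rollback that restores nothing
satisfies the assert:
```
image = bytearray(512)
orig_data = b"original"
test_data = b"overwrite"

image[0:len(orig_data)] = orig_data           # orig_data at offset 0
group_snap = bytes(image)                     # "take the group snapshot"
image[256:256 + len(test_data)] = test_data   # test_data does NOT overlap

def broken_rollback(img, snap):
    return 0  # claims success, restores nothing

assert broken_rollback(image, group_snap) == 0
# The assert only re-reads orig_data's range, which the second write never
# touched -- so it passes even though no rollback happened:
assert bytes(image[0:len(orig_data)]) == orig_data
```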