Xuehan Xu [Fri, 24 May 2024 09:30:41 +0000 (17:30 +0800)]
crimson/osd/osd_operations/client_request_common: `PeeringState::needs_recovery()`
may fail if the object is under backfill
Also set the correct version for backfill objects.
From Classic:
```
if (is_degraded_or_backfilling_object(head)) {
  if (can_backoff && g_conf()->osd_backoff_on_degraded) {
    add_backoff(session, head, head);
    maybe_kick_recovery(head);
  }
```
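A minimal standalone sketch of the idea (stand-in types and names such as
`RecoverySketch`, `ObjectId` and `Version` are illustrative, not the actual
crimson or Classic code): the "does this object need recovery before we can
touch it?" check must consider both the missing set, which is what
`PeeringState::needs_recovery()` consults, and objects that have not yet
reached all backfill targets, analogous to Classic's
`is_degraded_or_backfilling_object()`.
```
#include <optional>
#include <set>
#include <string>

using ObjectId = std::string;  // stand-in for hobject_t
using Version  = unsigned;     // stand-in for eversion_t

struct RecoverySketch {
  std::set<ObjectId> missing;         // objects recorded in a missing set
  std::set<ObjectId> under_backfill;  // objects not yet on all backfill targets
  Version last_update = 0;

  // Version to recover/wait on, or nullopt if the object is fully available.
  std::optional<Version> version_to_recover(const ObjectId& oid) const {
    if (missing.count(oid)) {
      return last_update;  // the needs_recovery()-style case
    }
    if (under_backfill.count(oid)) {
      return last_update;  // the backfill case the old check missed
    }
    return std::nullopt;
  }
};
```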
Some tests currently fail when `lvm2` isn't installed:
```
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestList::test_empty_device_json_zero_exit_status - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestList::test_empty_device_zero_exit_status - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestFullReport::test_no_ceph_lvs - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestFullReport::test_ceph_data_lv_reported - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestFullReport::test_ceph_journal_lv_reported - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestFullReport::test_ceph_wal_lv_reported - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestFullReport::test_physical_2nd_device_gets_reported[journal] - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestFullReport::test_physical_2nd_device_gets_reported[db] - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestFullReport::test_physical_2nd_device_gets_reported[wal] - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestSingleReport::test_not_a_ceph_lv - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestSingleReport::test_report_a_ceph_lv - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestSingleReport::test_report_a_ceph_journal_device - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestSingleReport::test_report_by_osd_id_for_just_block_dev - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestSingleReport::test_report_by_osd_id_for_just_data_dev - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestSingleReport::test_report_by_osd_id_for_just_block_wal_and_db_dev - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestSingleReport::test_report_by_osd_id_for_data_and_journal_dev - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestSingleReport::test_report_by_nonexistent_osd_id - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_listing.py::TestSingleReport::test_report_a_ceph_lv_with_no_matching_devices - FileNotFoundError: [Errno 2] No such file or directory: 'pvs'
FAILED ceph_volume/tests/devices/lvm/test_migrate.py::TestNew::test_newdb_not_target_lvm - FileNotFoundError: [Errno 2] No such file or directory: 'lvs'
FAILED ceph_volume/tests/devices/lvm/test_zap.py::TestEnsureAssociatedLVs::test_nothing_is_found - FileNotFoundError: [Errno 2] No such file or directory: 'lvs'
FAILED ceph_volume/tests/devices/lvm/test_zap.py::TestEnsureAssociatedLVs::test_multiple_journals_are_found - FileNotFoundError: [Errno 2] No such file or directory: 'lvs'
FAILED ceph_volume/tests/devices/lvm/test_zap.py::TestEnsureAssociatedLVs::test_multiple_dbs_are_found - FileNotFoundError: [Errno 2] No such file or directory: 'lvs'
FAILED ceph_volume/tests/devices/lvm/test_zap.py::TestEnsureAssociatedLVs::test_multiple_wals_are_found - FileNotFoundError: [Errno 2] No such file or directory: 'lvs'
FAILED ceph_volume/tests/devices/lvm/test_zap.py::TestEnsureAssociatedLVs::test_multiple_backing_devs_are_found - FileNotFoundError: [Errno 2] No such file or directory: 'lvs'
FAILED ceph_volume/tests/objectstore/test_lvmbluestore.py::TestLvmBlueStore::test_activate_all_osd_is_active - FileNotFoundError: [Errno 2] No such file or directory: 'lvs'
```
All of these `pvs`/`lvs` invocations should be mocked so the tests don't
depend on lvm2 being installed; this commit addresses that.
Venky Shankar [Mon, 17 Jun 2024 09:27:52 +0000 (14:57 +0530)]
Merge PR #57991 into main
* refs/pull/57991/head:
qa: upgrade sub-suite upgraded_client from n-1|n-2 releases
qa: upgrade sub-suite nofs from n-1 and n-2 releases
qa: use supported releases for featureful_client
Ronen Friedman [Tue, 4 Jun 2024 09:02:55 +0000 (04:02 -0500)]
osd/scrub: do not track reserving state at OSD level
As we no longer block the initiation of new scrub sessions for an OSD
for which any of its PGs is in the process of reserving scrub resources,
there is no need to track the reserving state at the OSD level.
Ronen Friedman [Tue, 4 Jun 2024 08:53:04 +0000 (03:53 -0500)]
osd/scrub: allow new scrubs while reserving
Allow a new scrub session to be initiated by an OSD even while one of its
PGs is in the process of reserving scrub resources.
The existing restriction made sense when the replica reservation process
was expected to succeed or fail within a few milliseconds. It makes less
sense now that the reservation process is queue-based (Reserver-based)
and can take unlimited time (hours, days, ...) to complete.
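A minimal sketch of the gate being removed (assumed names, not the actual
OsdScrub code): the OSD-level scheduler used to refuse to start new scrubs
while any reservation was in flight, which is untenable once reservations
can stay queued for hours or days.
```
#include <atomic>

struct OsdScrubSketch {
  // Used to record "some PG on this OSD is currently reserving replica
  // scrub resources"; this flag (and the check below) go away.
  std::atomic_bool reserving_now{false};

  bool can_initiate_new_scrub() const {
    // Old behaviour: refuse to start any new scrub session while a
    // reservation is pending. With Reserver-based reservations that can
    // stay queued indefinitely, this would stall scrubbing OSD-wide.
    return !reserving_now.load();
  }
};
```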
Zac Dover [Sat, 15 Jun 2024 11:55:18 +0000 (21:55 +1000)]
doc/rados: explain replaceable parts of command
Add an explanation that directs the reader to replace the "X" part of
the command "ceph tell mon.X mon_status" with the value specific to the
reader's Ceph cluster (which is probably not "X").
In the future, such replaceable strings in commands may be bounded by
angle brackets ("<" and ">").
This improvement to the documentation was suggested on the [ceph-users]
email list by Joel Davidow. This email, an absolute model of user
engagement with an upstream project, can be reviewed here:
https://lists.ceph.io/hyperkitty/list/ceph-users@ceph.io/message/KF67F5TXFSSTPXV7EKL6JKLA5KZQDLDQ/
John Mulligan [Fri, 14 Jun 2024 14:07:07 +0000 (10:07 -0400)]
script/cpatch.py: add support for multiple valid python versions
Fix running cpatch.py with the latest centos9s-based container images.
Future-proof a little by adding multiple valid, existing Python version
numbers to probe.
Signed-off-by: John Mulligan <jmulligan@redhat.com>
Samuel Just [Tue, 28 May 2024 20:45:26 +0000 (20:45 +0000)]
crimson/.../snaptrim_event: SnapTrimObjSubEvent should enter WaitRepop
Otherwise, it parks on Process until the repop completes, blocking any
other repops, including client IO. Since we don't actually care about
ordering, simply calling handle.complete() would also be viable, but
this is a valid usage of the stage and does provide information to an
operator.
SnapTrimEvent doesn't actually do or block on GetOBC or Process --
remove those stages entirely. Entering Process, in particular, causes
problems unless we immediately leave it, as SnapTrimObjSubEvent needs to
enter and leave it to complete. Entering one of the stages removed in
a prior commit had a side effect of exiting Process -- without that
exit, SnapTrimEvent and SnapTrimObjSubEvent mutually block, preventing
snap trim or client IO from making progress.
This leaves no actual pipeline stages on SnapTrimEvent, which makes
sense as only SnapTrimObjSubEvent actually does IO.
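A simplified model of the ordering concern described above (plain mutexes
standing in for crimson's pipeline stages; none of these are the actual
types): an op that waits for its repop while still holding Process stalls
every later op, whereas moving the wait into a later stage releases Process
first.
```
#include <mutex>

struct PipelineModel {
  std::mutex process;     // admits one operation at a time
  std::mutex wait_repop;  // later stage; waiting here doesn't hold up Process
};

void handle_op_badly(PipelineModel& p, void (*wait_for_repop)()) {
  std::scoped_lock g{p.process};
  wait_for_repop();  // still holding Process: other repops and client IO stall
}

void handle_op_better(PipelineModel& p, void (*wait_for_repop)()) {
  { std::scoped_lock g{p.process}; /* submit the repop */ }
  std::scoped_lock g{p.wait_repop};
  wait_for_repop();  // Process already released; other operations can proceed
}
```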
Samuel Just [Tue, 28 May 2024 16:48:42 +0000 (09:48 -0700)]
crimson/.../snaptrim_event: remove pipeline stages located on event
WaitSubop, WaitTrimTimer, and WaitRepop are pipeline stages local to
the operation. As such they don't actually provide any ordering
guarantees, as only one operation will ever enter them. Rather, the
intent is to hook into the event system to expose information to an
administrator.
This poses a problem for OrderedConcurrentPhase as it is currently
implemented. PipelineHandle::exit() is invoked prior to the op being
destructed. PipelineHandle::exit() does:
```
void exit() {
  barrier.reset();
}
```
For OrderedConcurrentPhase, ~ExitBarrier() invokes ExitBarrier::exit(),
which unlocks phase->mutex asynchronously.
The problem comes in not waiting for that phase->mutex.unlock() to occur.
For SnapTrimEvent, phase is actually in the operation itself. It's
possible for the continuation that performs the unlock to run after the
last finally() in ShardServices::start_operation completes and releases
the final reference to SnapTrimEvent. This is normally harmless provided
that the PG or connection outlives it, but it's a problem for these stages.
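A deliberately simplified, standalone illustration of that lifetime hazard
(plain C++, none of the crimson types): the deferred "exit" work captures a
raw pointer into the operation, and the operation can be destroyed before
that work runs.
```
#include <functional>
#include <memory>
#include <mutex>
#include <vector>

std::vector<std::function<void()>> pending;  // stands in for deferred continuations

struct Phase {      // stands in for a pipeline phase with a mutex
  std::mutex mutex;
};

struct Operation {  // stands in for SnapTrimEvent
  Phase wait_repop; // the phase lives inside the operation itself
};

void exit_barrier(Phase* phase) {
  // The continuation captures a raw pointer into the operation...
  pending.push_back([phase] { phase->mutex.unlock(); });
}

int main() {
  auto op = std::make_unique<Operation>();
  op->wait_repop.mutex.lock();
  exit_barrier(&op->wait_repop);
  op.reset();               // last reference to the operation is dropped here
  for (auto& c : pending) {
    c();                    // use-after-free: unlocks a mutex in freed memory
  }
}
```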
For now, let's just remove these stages. We can reintroduce another
mechanism later to set these event flags without an actual pipeline
stage.
This is likely a bug even with pipelines not embedded in an operation,
but we can fix it later -- https://tracker.ceph.com/issues/64545.
Fixes: https://tracker.ceph.com/issues/63647
Signed-off-by: Samuel Just <sjust@redhat.com>
John Mulligan [Wed, 1 May 2024 14:57:02 +0000 (10:57 -0400)]
mgr/smb: share and cluster create commands only create resources
Prior to this change, the create commands could be used, counter to the
term 'create', as create-or-update commands. IMO this violates the
principle of least surprise, so make them create-only.
Signed-off-by: John Mulligan <jmulligan@redhat.com>
John Mulligan [Wed, 1 May 2024 14:55:27 +0000 (10:55 -0400)]
mgr/smb: add create_only arg for handler apply function
Add a create_only argument to the handler class apply function. This
flag is used to prevent modification of existing resources. This flag
will be used by the 'cluster create' and 'share create' commands to make
them true to their names and not sneaky modify-or-create commands.
Signed-off-by: John Mulligan <jmulligan@redhat.com>
Afreen [Fri, 31 May 2024 07:54:27 +0000 (13:24 +0530)]
mgr/dashboard: Configure NVMe/TCP
Fixes: https://tracker.ceph.com/issues/63686
- creation of Nvme-oF/TCP service
- deletion of Nvme-oF/TCP service
- edit/update Nvme-oF/TCP service
- added unit tests for Nvme-oF/TCP service
- changed Id -> Service Name
- added prefix of service type in service name (similar to <client.> in
fs access)
- service name and pool are required fields for nvmeof
- placement count now takes default value as mentioned in cephadm
- slight refactors
- prepopulate serviceId for each service type via setServiceId()
- if serviceId is the same as the service type, do not create the service
  name in the <servicetype>.<serviceId> format
Ilya Dryomov [Fri, 7 Jun 2024 10:12:29 +0000 (12:12 +0200)]
librbd: add rbd_snap_get_trash_namespace2() API to return full namespace
The existing rbd_snap_get_trash_namespace() API returns only the
original name of the deleted snapshot, omitting its namespace type.
While non-user snapshots have distinctive names, there is nothing
preventing the user from creating user snapshots with identical names
(i.e. starting with ".group" or ".mirror" prefix). After cloning from
non-user snapshots is allowed, it's possible for such user snapshots to
get mixed up with non-user snapshots in the trash, so let's provide
means for disambiguation.
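A hedged usage sketch of the new call. The struct fields and exact
signature below are assumptions based on the description above; verify
them against the installed `librbd.h` before relying on this.
```
#include <cstdint>
#include <cstdio>
#include <rbd/librbd.h>

int print_trash_origin(rbd_image_t image, uint64_t snap_id) {
  // Assumed shape: the "2" variant fills a struct carrying both the
  // original namespace type and the original snapshot name.
  rbd_snap_trash_namespace_t trash = {};
  int r = rbd_snap_get_trash_namespace2(image, snap_id, &trash, sizeof(trash));
  if (r < 0) {
    return r;
  }
  // The namespace type disambiguates a user snapshot named ".group..." or
  // ".mirror..." from a genuine group/mirror snapshot sitting in the trash.
  printf("original namespace type: %d, original name: %s\n",
         (int)trash.original_namespace_type, trash.original_name);
  // NOTE: release the struct as documented in librbd.h (the cleanup helper
  // is omitted here to avoid guessing its name).
  return 0;
}
```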
Ilya Dryomov [Thu, 30 May 2024 14:54:53 +0000 (16:54 +0200)]
qa/workunits/rbd: fix bogus grep -v asserts in test_clone()
The intent of "rbd ls | grep -v clone" was probably to check that an
image with the name "clone" shows up in rbd2 pool and not in rbd pool.
However, it's very far from that -- "grep -v clone" would succeed
regardless because of an image with the name "test1" in rbd pool.
Ilya Dryomov [Fri, 24 May 2024 10:06:09 +0000 (12:06 +0200)]
librbd: add rbd_clone4() API to take parent snapshot by ID
Allow cloning from non-user snapshots -- namely snapshots in group
and mirror namespaces. The motivation is to provide a building block
for cloning new groups from group snapshots ("rbd group snap create").
Otherwise, group snapshots as they are today can be used only for
rolling back the group as a whole, which is very limiting.
While at it, there doesn't seem to be anything wrong with making it
possible to clone from mirror snapshots as well.
Snapshots in a trash namespace can't be cloned from since they are
considered to be deleted.
Cloning from non-user snapshots is limited to clone v2 just because
protecting/unprotecting is limited to snapshots in a user namespace.
This happens to simplify some invariants.
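A hedged usage sketch, assuming `rbd_clone4()` parallels `rbd_clone3()`
with the parent snapshot name replaced by a snapshot ID; the pool contexts
and image names are placeholders, and the exact signature should be checked
in `librbd.h`.
```
#include <cstdint>
#include <rbd/librbd.h>

int clone_from_snap_id(rados_ioctx_t parent_ioctx, rados_ioctx_t child_ioctx,
                       uint64_t parent_snap_id) {
  rbd_image_options_t opts;
  rbd_image_options_create(&opts);
  // Cloning from a non-user (group/mirror) snapshot is clone v2 only, since
  // protect/unprotect applies only to snapshots in the user namespace.
  int r = rbd_clone4(parent_ioctx, "parent_image", parent_snap_id,
                     child_ioctx, "child_image", opts);
  rbd_image_options_destroy(opts);
  return r;
}
```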
librbd: replace assert with error check in clone()
With an error check for p_snap_name, it doesn't make much sense to
crash if the "either p_id or p_name" contract is violated. Replace the
assert with a similar error check.
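A minimal sketch of the pattern (not librbd's actual code; the helper name
and error value are illustrative): a violated "either p_id or p_name" input
contract now yields an error instead of an assert.
```
#include <cerrno>
#include <string>

int validate_parent_spec(const std::string& p_id, const std::string& p_name) {
  // Previously: ceph_assert(p_id.empty() ^ p_name.empty());
  if (p_id.empty() == p_name.empty()) {
    return -EINVAL;  // exactly one of p_id / p_name must be provided
  }
  return 0;
}
```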
Ivo Almeida [Wed, 15 May 2024 08:42:47 +0000 (09:42 +0100)]
mgr/dashboard: carbon initial setup
* replace header and side navigation with carbon components
* added carbon specific style overrides
* added carbon icons
* created custom theme based on current color scheme
Fixes: https://tracker.ceph.com/issues/66217
Signed-off-by: Ivo Almeida <ialmeida@redhat.com>
Afreen Misbah [Wed, 12 Jun 2024 15:50:04 +0000 (21:20 +0530)]
mgr/dashboard: Fix login and notification e2e tests
Fixes: https://tracker.ceph.com/issues/66453
- the `#rbdMirroring` checkbox is not found, which causes both of these tests
  to fail on most of the PRs
- this is due to the pool helper function, which checks for an existing app
  passed as a parameter
- if the app is not found, the mirroring checkbox remains hidden
Mohit Agrawal [Wed, 12 Jun 2024 11:49:18 +0000 (17:19 +0530)]
unittest_osdmap aborted during OSDMapTest.BUG_42485
The test case is aborted during the run of the clean_upmap_tp thread.
The function clean_pg_upmaps() spawns a number of worker threads to
process a PGMapper job. A worker thread fetches a job from the queue,
processes it, and then calls _process_finish() on it. The _process()
function of the PGMapper class destroys the object, so when the worker
thread calls _process_finish() it crashes because the job pointer has
become a dangling pointer.
Solution: to avoid the crash, destroy the object in _process_finish()
instead of in _process().
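A simplified sketch of the ordering the fix relies on (stand-in types, not
the actual Ceph work-queue/PGMapper code): the worker calls _process() and
then _process_finish() on the same item, so the item must not be destroyed
before the second call.
```
struct Job {
  bool done = false;
};

struct QueueSketch {
  void _process(Job* job) {
    // ... do the mapping work ...
    job->done = true;
    // BAD (old behaviour): deleting `job` here leaves _process_finish()
    // with a dangling pointer.
  }
  void _process_finish(Job* job) {
    // ... final bookkeeping that still dereferences `job` ...
    delete job;  // GOOD (the fix): destroy the item at its last use
  }
  void worker_loop(Job* job) {
    _process(job);
    _process_finish(job);  // crashes if _process() already destroyed the job
  }
};
```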