git.apps.os.sepia.ceph.com Git - ceph-ci.git/log
ceph-ci.git
16 months agoqa/suites/upgrade/telemetry-upgrade: add more ignorelist items and require_osd_releas...
Laura Flores [Wed, 19 Jun 2024 21:07:31 +0000 (16:07 -0500)]
qa/suites/upgrade/telemetry-upgrade: add more ignorelist items and require_osd_release=squid

The warnings added to the ignorelist show up in the cluster log, but they are
expected during upgrades and should thus be ignored.

We also need to set require_osd_release=squid to avoid this warning:
```
cluster [WRN] Health check failed: all OSDs are running squid or later but require_osd_release < squid (OSD_UPGRADE_FINISHED)
```

Signed-off-by: Laura Flores <lflores@ibm.com>
16 months agoqa/workunits: Opt in to new telemetry collections right away after upgrading
Laura Flores [Tue, 11 Jun 2024 20:19:06 +0000 (15:19 -0500)]
qa/workunits: Opt in to new telemetry collections right away after upgrading

If this does not happen, the test may fail due to the "TELEMETRY_CHANGED" warning
appearing for too long in the cluster log.

Fixes: https://tracker.ceph.com/issues/64458
Signed-off-by: Laura Flores <lflores@ibm.com>
16 months agoqa/suites/upgrade/telemetry-upgrade: upgrade from reef instead of pacific
Laura Flores [Tue, 11 Jun 2024 20:10:01 +0000 (15:10 -0500)]
qa/suites/upgrade/telemetry-upgrade: upgrade from reef instead of pacific

With cephadm upgrades, we are only allowed to upgrade from as far back as N-2
releases. On the main branch, that means we can only upgrade from quincy and reef, and
we can no longer upgrade from pacific.

This test was trying to upgrade from pacific, which isn't allowed and led to an
`UPGRADE_BAD_TARGET_VERSION` cluster error.
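
The N-2 rule described above can be illustrated with a minimal sketch. This is not cephadm's actual check: the `RELEASES` list, the `upgrade_allowed` helper, and the choice of squid as the target release are assumptions made for illustration only.

```python
# Hypothetical illustration of the "N-2" upgrade window; not cephadm code.
# Release names are listed oldest to newest (assumed ordering for this sketch).
RELEASES = ["pacific", "quincy", "reef", "squid"]


def upgrade_allowed(source, target, max_gap=2):
    """Return True if `source` is at most `max_gap` releases behind `target`."""
    gap = RELEASES.index(target) - RELEASES.index(source)
    return 0 <= gap <= max_gap


if __name__ == "__main__":
    for source in ("pacific", "quincy", "reef"):
        print(source, "->", "squid", upgrade_allowed(source, "squid"))
```

With squid as the assumed target, quincy and reef fall inside the window and pacific falls outside it, which matches the situation the commit describes.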

Fixes: https://tracker.ceph.com/issues/65189
Signed-off-by: Laura Flores <lflores@ibm.com>
16 months agoMerge pull request #57106 from phlogistonjohn/jjm-mgr-quiet-notify-types-err
Adam King [Tue, 11 Jun 2024 16:42:28 +0000 (12:42 -0400)]
Merge pull request #57106 from phlogistonjohn/jjm-mgr-quiet-notify-types-err

mgr: do not require NOTIFY_TYPES in python modules

Reviewed-by: Adam King <adking@redhat.com>
16 months agoMerge pull request #57847 from thegreenbear/keepalived-security-context
Adam King [Tue, 11 Jun 2024 16:38:41 +0000 (12:38 -0400)]
Merge pull request #57847 from thegreenbear/keepalived-security-context

cephadm/services/ingress: configure security user in keepalived template

Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: John Mulligan <jmulligan@redhat.com>
16 months agoMerge pull request #57829 from gukaifeng/adjust_bootstrap_config_priorities
Adam King [Tue, 11 Jun 2024 16:33:07 +0000 (12:33 -0400)]
Merge pull request #57829 from gukaifeng/adjust_bootstrap_config_priorities

cephadm: the user config is preferred during the bootstrap

Reviewed-by: Adam King <adking@redhat.com>
16 months agoMerge pull request #57685 from tchaikov/replace-pkg_resources-with-importlib
Adam King [Tue, 11 Jun 2024 16:28:09 +0000 (12:28 -0400)]
Merge pull request #57685 from tchaikov/replace-pkg_resources-with-importlib

cephadm: use importlib.metadata for querying ceph_iscsi's version

Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: John Mulligan <jmulligan@redhat.com>
16 months agoMerge pull request #57292 from adk3798/staggered-upgrade-non-ceph-daemons
Adam King [Tue, 11 Jun 2024 16:23:12 +0000 (12:23 -0400)]
Merge pull request #57292 from adk3798/staggered-upgrade-non-ceph-daemons

mgr/cephadm: make SMB and NVMEoF upgrade last in staggered upgrade

Reviewed-by: John Mulligan <jmulligan@redhat.com>
16 months agoMerge pull request #57389 from thmour/patch-1
Adam King [Tue, 11 Jun 2024 16:22:15 +0000 (12:22 -0400)]
Merge pull request #57389 from thmour/patch-1

cephadm: CephExporter doesn't bind to IPv6 in dual stack

Reviewed-by: Adam King <adking@redhat.com>
16 months agoMerge pull request #57180 from phlogistonjohn/jjm-smb-linked-res
Adam King [Tue, 11 Jun 2024 16:20:30 +0000 (12:20 -0400)]
Merge pull request #57180 from phlogistonjohn/jjm-smb-linked-res

mgr/smb: cluster linked join auth and users/groups resource types

Reviewed-by: Adam King <adking@redhat.com>
16 months agoMerge PR #57881 into main
Patrick Donnelly [Tue, 11 Jun 2024 15:59:45 +0000 (11:59 -0400)]
Merge PR #57881 into main

* refs/pull/57881/head:
qa/tasks/vstart_runner: avoid deprecation warning

Reviewed-by: Leonid Usov <leonid.usov@ibm.com>
16 months agoMerge pull request #57319 from TRYTOBE8TME/wip-bug-59488
Yuval Lifshitz [Tue, 11 Jun 2024 14:21:05 +0000 (17:21 +0300)]
Merge pull request #57319 from TRYTOBE8TME/wip-bug-59488

src/rgw: Fixing event name

16 months agoMerge pull request #57931 from rhcs-dashboard/bucket-policy-other-daemon
Nizamudeen A [Tue, 11 Jun 2024 14:10:51 +0000 (19:40 +0530)]
Merge pull request #57931 from rhcs-dashboard/bucket-policy-other-daemon

mgr/dashboard: fix edit bucket failing in other selected gateways

Reviewed-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>
Reviewed-by: Ankush Behl <cloudbehl@gmail.com>
16 months agoMerge pull request #57957 from zdover23/wip-doc-2024-06-11-rados-troubleshooting-mon
Anthony D'Atri [Mon, 10 Jun 2024 22:46:59 +0000 (18:46 -0400)]
Merge pull request #57957 from zdover23/wip-doc-2024-06-11-rados-troubleshooting-mon

doc/rados: improve leader/peon monitor explanation

16 months agodoc/rados: improve leader/peon monitor explanation
Zac Dover [Mon, 10 Jun 2024 22:00:00 +0000 (08:00 +1000)]
doc/rados: improve leader/peon monitor explanation

Add an explanation of leader-peon conditions that obtain when the
cluster is in the "HEALTH_OK" state. Previously, the text discussed
these two monitor states only in the context of a health detail entry.

This improvement to the documentation was suggested on the [ceph-users]
email list by Joel Davidow. This email, an absolute model of user
engagement with an upstream project, can be reviewed here: https://lists.ceph.io/hyperkitty/list/ceph-users@ceph.io/message/KF67F5TXFSSTPXV7EKL6JKLA5KZQDLDQ/

I will list Joel Davidow here as the co-author for the sake of more
expediently getting this change into the documentation, but though he is
listed as the co-author, he is the true author.

Co-authored-by: Joel Davidow <jdavidow@nso.edu>
Signed-off-by: Zac Dover <zac.dover@proton.me>
16 months agoMerge pull request #56744 from benhanokh/ncb_async_discard_fix
Yuri Weinstein [Mon, 10 Jun 2024 19:42:47 +0000 (12:42 -0700)]
Merge pull request #56744 from benhanokh/ncb_async_discard_fix

os/BlueStore: NCB fix for leaked space when bdev_async_discard is ena…

Reviewed-by: Igor Fedotov <ifedotov@suse.com>
Reviewed-by: Ronen Friedman <rfriedma@redhat.com>
Reviewed-by: Joshua Baergen <jbaergen@digitalocean.com>
16 months agoMerge pull request #53477 from jianwei1216/fix_msgr_worker_cpu_high
Yuri Weinstein [Mon, 10 Jun 2024 19:39:45 +0000 (12:39 -0700)]
Merge pull request #53477 from jianwei1216/fix_msgr_worker_cpu_high

src/msg: fix high CPU consumption of msgr worker thread

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
Reviewed-by: Pere Diaz Bou <pdiazbou@redhat.com>
Reviewed-by: Mark Nelson <mnelson@redhat.com>
16 months agoMerge pull request #51171 from amathuria/wip-amat-scrub-cost-related-changes
Yuri Weinstein [Mon, 10 Jun 2024 19:38:28 +0000 (12:38 -0700)]
Merge pull request #51171 from amathuria/wip-amat-scrub-cost-related-changes

osd/scrub: Change scrub cost to average object size

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Pere Diaz Bou <pdiazbou@redhat.com>
Reviewed-by: Sridhar Seshasayee <sseshasa@redhat.com>
16 months agoMerge pull request #57593 from Matan-B/wip-matanb-crimson-default-cpu-cores
Matan Breizman [Mon, 10 Jun 2024 11:36:20 +0000 (14:36 +0300)]
Merge pull request #57593 from Matan-B/wip-matanb-crimson-default-cpu-cores

Crimson: Support basic deployments

Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
16 months agoMerge pull request #57313 from xxhdx1985126/wip-65832
Matan Breizman [Mon, 10 Jun 2024 08:14:10 +0000 (11:14 +0300)]
Merge pull request #57313 from xxhdx1985126/wip-65832

crimson/osd/ops_executor: calculation of clone_overlap shouldn't consider snap contexts

Reviewed-by: Matan Breizman <mbreizma@redhat.com>
16 months agoMerge pull request #57279 from xxhdx1985126/wip-65806
Matan Breizman [Mon, 10 Jun 2024 08:12:53 +0000 (11:12 +0300)]
Merge pull request #57279 from xxhdx1985126/wip-65806

crimson/osd/pg: trigger wait_for_active_blocker on replica osds when the activate event is committed

Reviewed-by: Samuel Just <sjust@redhat.com>
16 months agoMerge pull request #57947 from zdover23/wip-doc-2024-06-10-start-rm-intro
Zac Dover [Sun, 9 Jun 2024 21:31:36 +0000 (07:31 +1000)]
Merge pull request #57947 from zdover23/wip-doc-2024-06-10-start-rm-intro

doc/start: remove "intro.rst"

Reviewed-by: Cole Mitchell <cole.mitchell.ceph@gmail.com>

16 months agodoc/start: remove "intro.rst"
Zac Dover [Sun, 9 Jun 2024 18:55:13 +0000 (04:55 +1000)]
doc/start: remove "intro.rst"

Remove "start/intro.rst", which has been renamed "start/index.rst" in
order to follow the conventions followed elsewhere in the documentation.

Follows https://github.com/ceph/ceph/pull/57900.

Signed-off-by: Zac Dover <zac.dover@proton.me>
16 months agoMerge pull request #57908 from xxhdx1985126/wip-66374
Matan Breizman [Sun, 9 Jun 2024 14:51:07 +0000 (17:51 +0300)]
Merge pull request #57908 from xxhdx1985126/wip-66374

crimson/osd/osd_operations: correct connection pipelines for osd operations

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
16 months agoMerge pull request #57692 from xxhdx1985126/wip-crimson-unnecessary-move
Matan Breizman [Sun, 9 Jun 2024 13:47:52 +0000 (16:47 +0300)]
Merge pull request #57692 from xxhdx1985126/wip-crimson-unnecessary-move

crimson/osd/pg_shard_manager: remove the unnecessary "std::move" call

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
16 months agoMerge pull request #57561 from guojidan/rollback
Matan Breizman [Sun, 9 Jun 2024 13:46:04 +0000 (16:46 +0300)]
Merge pull request #57561 from guojidan/rollback

crimson/osd/ops_executer: LIST_SNAPS only on CEPH_SNAPDIR

Reviewed-by: Matan Breizman <mbreizma@redhat.com>
16 months agoMerge pull request #57416 from xxhdx1985126/wip-65972
Matan Breizman [Sun, 9 Jun 2024 13:34:12 +0000 (16:34 +0300)]
Merge pull request #57416 from xxhdx1985126/wip-65972

crimson/osd/pg: hold PGs' references to the last minute of snap trim events executions

Reviewed-by: Matan Breizman <mbreizma@redhat.com>
16 months agoMerge pull request #57455 from xxhdx1985126/wip-66002
Matan Breizman [Sun, 9 Jun 2024 13:32:18 +0000 (16:32 +0300)]
Merge pull request #57455 from xxhdx1985126/wip-66002

crimson/osd/replicated_backend: no need to set_rollback_to for repops

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Matan Breizman <mbreizma@redhat.com>
16 months agoMerge pull request #57386 from xxhdx1985126/wip-65894
Matan Breizman [Sun, 9 Jun 2024 13:30:02 +0000 (16:30 +0300)]
Merge pull request #57386 from xxhdx1985126/wip-65894

crimson/osd/recovery_backend: change recovery waiters' promises into optional ones

Reviewed-by: Samuel Just <sjust@redhat.com>
16 months agoMerge pull request #57276 from xxhdx1985126/wip-65804
Matan Breizman [Sun, 9 Jun 2024 12:07:41 +0000 (15:07 +0300)]
Merge pull request #57276 from xxhdx1985126/wip-65804

crimson/osd/pg_backend: DONOT modify OSDOp::indata when handling CEPH_OSD_OP_CHECKSUM

Reviewed-by: Samuel Just <sjust@redhat.com>
16 months agoMerge pull request #57251 from xxhdx1985126/wip-65773
Matan Breizman [Sun, 9 Jun 2024 12:07:02 +0000 (15:07 +0300)]
Merge pull request #57251 from xxhdx1985126/wip-65773

crimson/osd/pg_backend: only create new onode when it doesn't exist

Reviewed-by: Samuel Just <sjust@redhat.com>
16 months agoMerge pull request #57204 from rzarzynski/wip-crimson-osd-read-clamp
Matan Breizman [Sun, 9 Jun 2024 12:06:39 +0000 (15:06 +0300)]
Merge pull request #57204 from rzarzynski/wip-crimson-osd-read-clamp

crimson/osd: clamp reads to object size and bring full read trimming

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Matan Breizman <mbreizma@redhat.com>
16 months agoMerge pull request #57069 from xxhdx1985126/wip-65632
Matan Breizman [Sun, 9 Jun 2024 12:02:56 +0000 (15:02 +0300)]
Merge pull request #57069 from xxhdx1985126/wip-65632

crimson/common/operation: detach blockers from blocking events when they are destroyed

Reviewed-by: Samuel Just <sjust@redhat.com>
16 months agoMerge pull request #57788 from cyx1231st/wip-seastore-report-disk
Matan Breizman [Sun, 9 Jun 2024 07:56:15 +0000 (10:56 +0300)]
Merge pull request #57788 from cyx1231st/wip-seastore-report-disk

crimson/os/seastore: implement disk and writer level stats reporting

Reviewed-by: Xuehan Xu <xuxuehan@qianxin.com>
16 months agoMerge pull request #57939 from zdover23/wip-doc-2024-06-09-glossary-swift
Zac Dover [Sun, 9 Jun 2024 02:27:30 +0000 (12:27 +1000)]
Merge pull request #57939 from zdover23/wip-doc-2024-06-09-glossary-swift

doc/glossary.rst: add "OpenStack Swift" and "Swift"

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
16 months agodoc/glossary.rst: add "OpenStack Swift" and "Swift"
Zac Dover [Sat, 8 Jun 2024 20:24:43 +0000 (06:24 +1000)]
doc/glossary.rst: add "OpenStack Swift" and "Swift"

Add "OpenStack Swift" and "Swift" entries to the glossary.

Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
16 months agoMerge pull request #57940 from zdover23/wip-doc-2024-06-090-start-documenting-ceph...
Anthony D'Atri [Sat, 8 Jun 2024 23:26:02 +0000 (19:26 -0400)]
Merge pull request #57940 from zdover23/wip-doc-2024-06-090-start-documenting-ceph-squash

doc/start: format squash procedure

16 months agodoc/start: format squash procedure
Zac Dover [Sat, 8 Jun 2024 20:32:19 +0000 (06:32 +1000)]
doc/start: format squash procedure

Add letters to substeps in a procedure so that readers won't get too
confused by having numbered steps nested inside numbered steps.

Signed-off-by: Zac Dover <zac.dover@proton.me>
16 months agoMerge PR #48130 into main
Patrick Donnelly [Sat, 8 Jun 2024 19:56:12 +0000 (15:56 -0400)]
Merge PR #48130 into main

* refs/pull/48130/head:
qa: add killpoint testing for dirfrags
qa: stringify arguments to setfattr
qa: move some configs to cluster-conf
qa: restore default for config to split exports
qa/tasks/ceph_test_case: rollback configs using `config reset`
qa/cephfs: set confs using cluster-conf
qa/tasks/ceph: provide configuration for setting configs via mon
mds: optimize MDBalancer code path config access
mds: add killpoints for directory fragmentation

Reviewed-by: Venky Shankar <vshankar@redhat.com>
16 months agoMerge pull request #57912 from ceph/wip-lusov-qdb-exclude-or-cancel
Leonid S. Usov [Sat, 8 Jun 2024 07:43:30 +0000 (10:43 +0300)]
Merge pull request #57912 from ceph/wip-lusov-qdb-exclude-or-cancel

mds: QuiesceDbRequest: update the internal encoding of ops

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
16 months agoMerge pull request #57923 from zdover23/wip-doc-2024-06-07-global-yaml-in-inline...
Zac Dover [Fri, 7 Jun 2024 23:05:41 +0000 (09:05 +1000)]
Merge pull request #57923 from zdover23/wip-doc-2024-06-07-global-yaml-in-inline-literals

doc/src: add inline literals (``) to variables

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
16 months agoMerge pull request #57755 from rhcs-dashboard/fix-66267-main
Nizamudeen A [Fri, 7 Jun 2024 13:59:34 +0000 (19:29 +0530)]
Merge pull request #57755 from rhcs-dashboard/fix-66267-main

mgr/dashboard: provide option to enable pool based mirroring mode while creating a pool

Reviewed-by: Ankush Behl <cloudbehl@gmail.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>
16 months agomgr/dashboard: fix edit bucket failing in other selected gateways
Nizamudeen A [Fri, 7 Jun 2024 13:49:42 +0000 (19:19 +0530)]
mgr/dashboard: fix edit bucket failing in other selected gateways

Even when gateway 8002 is selected, the bucket policy request goes through gateway 8000, which does not find the bucket:
```
2024-06-07T13:40:33.161+0000 7f563be00700  0 [dashboard DEBUG rest_client] RGW REST API GET req: /hello?policy data: None
2024-06-07T13:40:33.164+0000 7f563be00700  0 [dashboard DEBUG urllib3.connectionpool] http://172.20.0.5:8000 "GET /hello?policy HTTP/1.1" 404 174
2024-06-07T13:40:33.164+0000 7f563be00700  0 [dashboard ERROR rest_client] RGW REST API failed GET req status: 404
2024-06-07T13:40:33.164+0000 7f563be00700  0 [dashboard ERROR exception] Internal Server Error
Traceback (most recent call last):
  File "/ceph/src/pybind/mgr/dashboard/services/exception.py", line 47, in dashboard_exception_handler
    return handler(*args, **kwargs)
  File "/usr/lib/python3.6/site-packages/cherrypy/_cpdispatch.py", line 54, in __call__
    return self.callable(*self.args, **self.kwargs)
  File "/ceph/src/pybind/mgr/dashboard/controllers/_base_controller.py", line 263, in inner
    ret = func(*args, **kwargs)
  File "/ceph/src/pybind/mgr/dashboard/controllers/_rest_controller.py", line 193, in wrapper
    return func(*vpath, **params)
  File "/ceph/src/pybind/mgr/dashboard/controllers/rgw.py", line 463, in get
    result['bucket_policy'] = self._get_policy(bucket_name)
  File "/ceph/src/pybind/mgr/dashboard/controllers/rgw.py", line 381, in _get_policy
    return rgw_client.get_bucket_policy(bucket)
  File "/ceph/src/pybind/mgr/dashboard/rest_client.py", line 543, in func_wrapper
    **kwargs)
  File "/ceph/src/pybind/mgr/dashboard/services/rgw_client.py", line 957, in get_bucket_policy
    raise e
  File "/ceph/src/pybind/mgr/dashboard/services/rgw_client.py", line 949, in get_bucket_policy
    request = request()
  File "/ceph/src/pybind/mgr/dashboard/rest_client.py", line 325, in __call__
    data, raw_content, headers)
  File "/ceph/src/pybind/mgr/dashboard/rest_client.py", line 428, in do_request
    resp.content)
dashboard.rest_client.RequestException: RGW REST API failed request with status code 404
(b'{"Code":"NoSuchBucket","Message":"","BucketName":"hello","RequestId":"tx0000'
 b'0d73bbbad485175ea-0066630dd1-18785-zone1-zg1-realm1","HostId":"18785-zone1-z'
 b'g1-realm1-zg1-realm1"}')
```

For the same bucket, however, the encryption and other requests go through the correct gateway:

```
2024-06-07T13:40:32.704+0000 7f563be00700  0 [dashboard DEBUG urllib3.connectionpool] http://172.20.0.5:8002 "GET /hello?versioning HTTP/1.1" 200 2
2024-06-07T13:40:32.745+0000 7f563be00700  0 [dashboard DEBUG rest_client] RGW REST API GET res status: 200 content: {}
2024-06-07T13:40:32.745+0000 7f563be00700  0 [dashboard INFO rgw_client] Found RGW daemon with configuration: host=172.20.0.5, port=8000, ssl=False
2024-06-07T13:40:32.746+0000 7f563be00700  0 [dashboard INFO rgw_client] Found RGW daemon with configuration: host=172.20.0.5, port=8002, ssl=False
2024-06-07T13:40:32.746+0000 7f563be00700  0 [dashboard DEBUG rest_client] RGW REST API GET req: /hello?encryption data: None
2024-06-07T13:40:32.747+0000 7f563be00700  0 [dashboard DEBUG urllib3.connectionpool] http://172.20.0.5:8002 "GET /hello?encr
```
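
One way to spot this kind of mismatch is to extract the gateway port from the `urllib3.connectionpool` debug lines. The sketch below is only an illustration: the `LINE_RE` pattern and the `gateway_requests` helper are hypothetical and not part of the dashboard code, and the sample lines are copied from the logs above.

```python
import re

# Hypothetical pattern for the urllib3.connectionpool debug lines shown above.
LINE_RE = re.compile(
    r'http://(?P<host>[\d.]+):(?P<port>\d+) '
    r'"(?P<method>[A-Z]+) (?P<path>\S+) HTTP/[\d.]+" (?P<status>\d{3})'
)


def gateway_requests(log_lines):
    """Return (port, path, status) for every line that matches LINE_RE."""
    results = []
    for line in log_lines:
        match = LINE_RE.search(line)
        if match:
            results.append(
                (int(match.group("port")), match.group("path"), int(match.group("status")))
            )
    return results


SAMPLE = [
    '2024-06-07T13:40:33.164+0000 7f563be00700  0 [dashboard DEBUG urllib3.connectionpool] '
    'http://172.20.0.5:8000 "GET /hello?policy HTTP/1.1" 404 174',
    '2024-06-07T13:40:32.704+0000 7f563be00700  0 [dashboard DEBUG urllib3.connectionpool] '
    'http://172.20.0.5:8002 "GET /hello?versioning HTTP/1.1" 200 2',
]

if __name__ == "__main__":
    for port, path, status in gateway_requests(SAMPLE):
        print(port, path, status)
```

For the two sample lines, the policy request shows up on port 8000 with a 404 and the versioning request on port 8002 with a 200.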

Fixes: https://tracker.ceph.com/issues/66395
Signed-off-by: Nizamudeen A <nia@redhat.com>
16 months agoMerge PR #57882 into main
Patrick Donnelly [Fri, 7 Jun 2024 12:24:09 +0000 (08:24 -0400)]
Merge PR #57882 into main

* refs/pull/57882/head:
mds: add debug message when conf changes are processed

Reviewed-by: Rishabh Dave <ridave@redhat.com>
16 months agoMerge PR #57919 into main
Patrick Donnelly [Fri, 7 Jun 2024 12:22:54 +0000 (08:22 -0400)]
Merge PR #57919 into main

* refs/pull/57919/head:
script/ceph-backport: lookup "Release" custom_field by id

Reviewed-by: Leonid Usov <leonid.usov@ibm.com>
16 months agoMerge pull request #57894 from AliMasarweh/wip-alimasa-bn-http-teuthology
Ali Masarwa [Fri, 7 Jun 2024 11:20:46 +0000 (14:20 +0300)]
Merge pull request #57894 from AliMasarweh/wip-alimasa-bn-http-teuthology

RGW|BN: make http tests run on a separate task

Reviewed-by: yuvalif <ylifshit@redhat.com>
16 months agoMerge pull request #57746 from ceph/nodejs-lts-update
Ivo Almeida [Fri, 7 Jun 2024 09:23:29 +0000 (10:23 +0100)]
Merge pull request #57746 from ceph/nodejs-lts-update

mgr/dashboard: update node js version

Reviewed-by: Nizamudeen A <nia@redhat.com>
16 months agomgr/dashboard: provide option to enable pool based mirroring mode while
Aashish Sharma [Wed, 29 May 2024 04:34:13 +0000 (10:04 +0530)]
mgr/dashboard: provide option to enable pool based mirroring mode while
creating a pool

Fixes: https://tracker.ceph.com/issues/66267
Signed-off-by: Aashish Sharma <aasharma@redhat.com>
16 months agodoc/src: add inline literals (``) to variables
Zac Dover [Fri, 7 Jun 2024 01:24:00 +0000 (11:24 +1000)]
doc/src: add inline literals (``) to variables

Add inline-literal RST markup (double backticks, ``) around variable
names in src/common/options/global.yaml.in.

This follows https://github.com/ceph/ceph/pull/57887.

Signed-off-by: Zac Dover <zac.dover@proton.me>
16 months agoMerge pull request #56874 from smanjara/wip-shilpa-multisite-tests
Shilpa Jagannath [Thu, 6 Jun 2024 22:28:38 +0000 (15:28 -0700)]
Merge pull request #56874 from smanjara/wip-shilpa-multisite-tests

qa/multisite: stabilize multisite testing

16 months agomultisite/qa: resolve rebase conflicts
Shilpa Jagannath [Thu, 6 Jun 2024 19:06:49 +0000 (12:06 -0700)]
multisite/qa: resolve rebase conflicts

Signed-off-by: Shilpa Jagannath <smanjara@redhat.com>
16 months agomultisite/qa: disable sync error injection
Shilpa Jagannath [Tue, 21 May 2024 17:31:31 +0000 (10:31 -0700)]
multisite/qa: disable sync error injection

Signed-off-by: Shilpa Jagannath <smanjara@redhat.com>
16 months agoqa/multisite: disable two-zonegroup and three-zone configs.
Shilpa Jagannath [Mon, 15 Apr 2024 14:54:57 +0000 (10:54 -0400)]
qa/multisite: disable two-zonegroup and three-zone configs.

Signed-off-by: Shilpa Jagannath <smanjara@redhat.com>
16 months agoqa/rgw/multisite: enable notification_v2 feature
Casey Bodley [Mon, 15 Apr 2024 14:06:11 +0000 (10:06 -0400)]
qa/rgw/multisite: enable notification_v2 feature

Signed-off-by: Casey Bodley <cbodley@redhat.com>
(cherry picked from commit 1e7fc66683e4dea7294f0ec95f47b22f48b33b7e)

16 months agoqa/multisite: disable sync policy, endpoint and data_sync_init tests until fixed.
Shilpa Jagannath [Fri, 12 Apr 2024 20:46:11 +0000 (16:46 -0400)]
qa/multisite: disable sync policy, endpoint and data_sync_init tests until fixed.

Signed-off-by: Shilpa Jagannath <smanjara@redhat.com>
16 months agoqa/rgw/multisite: add two-zone configuration
Casey Bodley [Thu, 9 Nov 2023 21:22:36 +0000 (16:22 -0500)]
qa/rgw/multisite: add two-zone configuration

Signed-off-by: Casey Bodley <cbodley@redhat.com>
(cherry picked from commit 14204442ac2eaaaa3f40a76403da49f0f4ed92c1)

16 months agoqa/rgw/multisite: double cluster nodes and osd count
Casey Bodley [Tue, 31 Oct 2023 14:53:25 +0000 (10:53 -0400)]
qa/rgw/multisite: double cluster nodes and osd count

> WARNING: The OSD cluster is overloaded and struggling to complete ops. You need more capacity to serve this level of demand.

give each cluster 2 nodes instead of 1, and 6 osds instead of 3

Signed-off-by: Casey Bodley <cbodley@redhat.com>
(cherry picked from commit 9a80cf4bf473c95c8d0b18fcdce88be20f74f973)

16 months agoqa/rgw/multisite: reduce polling intervals from 20s to 5s
Casey Bodley [Tue, 31 Oct 2023 14:51:28 +0000 (10:51 -0400)]
qa/rgw/multisite: reduce polling intervals from 20s to 5s

Signed-off-by: Casey Bodley <cbodley@redhat.com>
(cherry picked from commit bc084c0ac64477abdbf54d96bc4981edf41e1343)

16 months agorgw/multisite: reconfigure checkpoint delay to 90s
Shilpa Jagannath [Mon, 30 Oct 2023 15:29:06 +0000 (11:29 -0400)]
rgw/multisite: reconfigure checkpoint delay to 90s

Signed-off-by: Shilpa Jagannath <smanjara@redhat.com>
16 months agoMerge pull request #57725 from Matan-B/wip-matanb-crimson-load-exist
Samuel Just [Thu, 6 Jun 2024 18:39:14 +0000 (11:39 -0700)]
Merge pull request #57725 from Matan-B/wip-matanb-crimson-load-exist

crimson/osd/object_context_loader: Fix obc cache existence usage

Reviewed-by: Aishwarya Mathuria <amathuri@redhat.com>
Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Xuehan Xu <xuxuehan@qianxin.com>
Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
16 months agoscript/ceph-backport: lookup "Release" custom_field by id
Patrick Donnelly [Thu, 6 Jun 2024 18:17:46 +0000 (14:17 -0400)]
script/ceph-backport: lookup "Release" custom_field by id

Right now the script assumes the Release custom_field will be ordered first in
the array from the REST API.
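
The difference between the two lookups can be sketched as follows. This is a Python illustration, not the script's own code: the `release_field` helper, the field id `16`, and the sample issue are hypothetical, and the issue is assumed to carry a `custom_fields` array of objects with `id`, `name`, and `value` keys.

```python
def release_field(issue, release_field_id):
    """Return the value of the custom field with the given id, or None."""
    for field in issue.get("custom_fields", []):
        if field.get("id") == release_field_id:
            return field.get("value")
    return None


# Hypothetical issue payload in which "Release" is NOT the first custom field.
SAMPLE_ISSUE = {
    "id": 12345,
    "custom_fields": [
        {"id": 3, "name": "Source", "value": "Development"},
        {"id": 16, "name": "Release", "value": "squid"},
    ],
}

if __name__ == "__main__":
    # Positional lookup picks up the wrong field once the ordering changes.
    print(SAMPLE_ISSUE["custom_fields"][0]["value"])
    # Lookup by id does not depend on the ordering.
    print(release_field(SAMPLE_ISSUE, 16))
```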

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
16 months agoqa: add killpoint testing for dirfrags
Patrick Donnelly [Thu, 15 Sep 2022 13:59:01 +0000 (09:59 -0400)]
qa: add killpoint testing for dirfrags

Fixes: https://tracker.ceph.com/issues/7320
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Signed-off-by: Sidharth Anupkrishnan <sanupkri@redhat.com>
16 months agoqa: stringify arguments to setfattr
Patrick Donnelly [Thu, 15 Sep 2022 13:58:01 +0000 (09:58 -0400)]
qa: stringify arguments to setfattr

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
16 months agoqa: move some configs to cluster-conf
Patrick Donnelly [Sat, 16 Sep 2023 00:50:15 +0000 (20:50 -0400)]
qa: move some configs to cluster-conf

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
16 months agoqa: restore default for config to split exports
Patrick Donnelly [Fri, 15 Sep 2023 15:44:29 +0000 (11:44 -0400)]
qa: restore default for config to split exports

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
16 months agoqa/tasks/ceph_test_case: rollback configs using `config reset`
Patrick Donnelly [Tue, 8 Aug 2023 01:14:55 +0000 (21:14 -0400)]
qa/tasks/ceph_test_case: rollback configs using `config reset`

This is faster and catches config changes made outside this interface.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
16 months agoqa/cephfs: set confs using cluster-conf
Patrick Donnelly [Tue, 8 Aug 2023 01:14:03 +0000 (21:14 -0400)]
qa/cephfs: set confs using cluster-conf

So that these configs can be easily overridden by tests.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
16 months agoqa/tasks/ceph: provide configuration for setting configs via mon
Patrick Donnelly [Tue, 8 Aug 2023 01:09:21 +0000 (21:09 -0400)]
qa/tasks/ceph: provide configuration for setting configs via mon

These configs may be set using:

ceph:
  cluster-config:
    entity:
      foo: bar

same as the current:

ceph:
  config:
    entity:
      foo: bar

The configs will be set in parallel using the `ceph config set` command.

The main benefit here is to avoid using ceph.conf to set configs, which
cannot be overridden by subsequent `ceph config` commands. The only way to
override them is to change the ceph.conf in the test (yuck) or to use the admin
socket (whose settings get reset when the daemon restarts).

Finally, we can now exploit the `ceph config reset` command, which lets us
trivially roll back config changes after a test completes. That is exposed
as the `ctx.config_epoch` variable.
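
As a rough sketch of the idea, a nested entity/option mapping like the one above could be turned into `ceph config set` invocations, with a matching `ceph config reset` for rollback. The helper names `config_set_commands` and `config_reset_command` are hypothetical and are not the teuthology task's code; the sketch only builds argument lists, runs nothing, and leaves out the parallel execution mentioned above.

```python
def config_set_commands(cluster_conf):
    """Build one `ceph config set <entity> <option> <value>` argv per option."""
    commands = []
    for entity, options in cluster_conf.items():
        for option, value in options.items():
            commands.append(["ceph", "config", "set", entity, option, str(value)])
    return commands


def config_reset_command(epoch):
    """Build the argv that reverts the config to a previously recorded epoch."""
    return ["ceph", "config", "reset", str(epoch)]


if __name__ == "__main__":
    # Mirrors the placeholder entity/foo/bar mapping from the commit message.
    conf = {"entity": {"foo": "bar"}}
    for argv in config_set_commands(conf):
        print(" ".join(argv))
    # The epoch value 42 is an arbitrary example.
    print(" ".join(config_reset_command(42)))
```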

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
16 months agomds: optimize MDBalancer code path config access
Sidharth Anupkrishnan [Thu, 1 Oct 2020 15:34:56 +0000 (21:04 +0530)]
mds: optimize MDBalancer code path config access

This change is necessary because the new way of setting config is to use the
ceph config command or the asok interface rather than the old way, which
involved editing the ceph.conf and restarting the daemons to reflect the
changes. The code has been updated to support runtime config changes.

Signed-off-by: Sidharth Anupkrishnan <sanupkri@redhat.com>
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
16 months agomds: add killpoints for directory fragmentation
Sidharth Anupkrishnan [Thu, 21 May 2020 15:25:54 +0000 (20:55 +0530)]
mds: add killpoints for directory fragmentation

Defined a set of killpoints for testing directory fragmentation during
cluster recovery after failure.

Signed-off-by: Sidharth Anupkrishnan <sanupkri@redhat.com>
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
16 months agoMerge PR #57879 into main
Patrick Donnelly [Thu, 6 Jun 2024 17:56:46 +0000 (13:56 -0400)]
Merge PR #57879 into main

* refs/pull/57879/head:
qa: add common conf links
qa: disable mon_warn_on_pool_no_app in fs suite

Reviewed-by: Venky Shankar <vshankar@redhat.com>
16 months agoMerge PR #57877 into main
Patrick Donnelly [Thu, 6 Jun 2024 17:56:00 +0000 (13:56 -0400)]
Merge PR #57877 into main

* refs/pull/57877/head:
qa: correct json lookup for new `lock path` output

Reviewed-by: Leonid Usov <leonid.usov@ibm.com>
16 months agoMerge PR #57813 into main
Patrick Donnelly [Thu, 6 Jun 2024 17:55:41 +0000 (13:55 -0400)]
Merge PR #57813 into main

* refs/pull/57813/head:
mds: avoid use of swap to initialize

Reviewed-by: Leonid Usov <leonid.usov@ibm.com>
16 months agoMerge pull request #57887 from zdover23/wip-doc-2024-06-05-rados-config-network-confi...
Zac Dover [Thu, 6 Jun 2024 17:21:43 +0000 (03:21 +1000)]
Merge pull request #57887 from zdover23/wip-doc-2024-06-05-rados-config-network-config-ref

doc/rados: add options to network config ref

Reviewed-by: Cole Mitchell <cole.mitchell.ceph@gmail.com>
16 months agoMerge pull request #57641 from cbodley/wip-rgw-delete-objects-throttle
Casey Bodley [Thu, 6 Jun 2024 16:21:04 +0000 (17:21 +0100)]
Merge pull request #57641 from cbodley/wip-rgw-delete-objects-throttle

rgw: RGWDeleteMultiObj uses spawn_throttle for concurrency

Reviewed-by: Adam Emerson <aemerson@redhat.com>
16 months agoMerge pull request #57702 from Suyashd999/fix-uam2
Casey Bodley [Thu, 6 Jun 2024 14:43:39 +0000 (15:43 +0100)]
Merge pull request #57702 from Suyashd999/fix-uam2

rados/cls_fifo_legacy: 'oid' used after it was moved

Reviewed-by: Yuval Lifshitz <ylifshit@ibm.com>
Reviewed-by: Adam Emerson <aemerson@redhat.com>
16 months agocrimson/osd/object_context_loader: get_or_load to support atomicity
Matan Breizman [Thu, 6 Jun 2024 09:48:09 +0000 (09:48 +0000)]
crimson/osd/object_context_loader: get_or_load to support atomicity

make use of try_lock in order to support atomicity when called in
ObjectContext::_with_lock()

Co-authored-by: Yingxin Cheng <yingxin.cheng@intel.com>
Signed-off-by: Matan Breizman <mbreizma@redhat.com>
16 months agocrimson/common/tri_mutex: make lock() atomic if doesn't need wait
Yingxin Cheng [Mon, 3 Jun 2024 06:33:26 +0000 (14:33 +0800)]
crimson/common/tri_mutex: make lock() atomic if doesn't need wait

Otherwise, promotion cannot be atomic with the 1st locker.

Identified by: Matan Breizman <mbreizma@redhat.com>

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>
16 months agocrimson/common/tri_mutex: minor cleanup to be consistent
Yingxin Cheng [Mon, 3 Jun 2024 06:31:35 +0000 (14:31 +0800)]
crimson/common/tri_mutex: minor cleanup to be consistent

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>
Signed-off-by: Matan Breizman <mbreizma@redhat.com>
16 months agocrimson/common/tri_mutex: add waiter_t::waiter_name
Matan Breizman [Sun, 2 Jun 2024 15:17:36 +0000 (15:17 +0000)]
crimson/common/tri_mutex: add waiter_t::waiter_name

Signed-off-by: Matan Breizman <mbreizma@redhat.com>
16 months agocrimson/common/tri_mutex: add debug logs
Matan Breizman [Sun, 2 Jun 2024 15:00:37 +0000 (15:00 +0000)]
crimson/common/tri_mutex: add debug logs

to be used only for testing

Signed-off-by: Matan Breizman <mbreizma@redhat.com>
16 months agocrimson/osd/object_context_loader: with_head_obc to log `existed`
Matan Breizman [Thu, 30 May 2024 11:48:46 +0000 (11:48 +0000)]
crimson/osd/object_context_loader: with_head_obc to log `existed`

Signed-off-by: Matan Breizman <mbreizma@redhat.com>
16 months agocrimson/osd/object_context_loader: cleanup `loaded`
Matan Breizman [Tue, 28 May 2024 13:52:08 +0000 (13:52 +0000)]
crimson/osd/object_context_loader: cleanup `loaded`

Signed-off-by: Matan Breizman <mbreizma@redhat.com>
16 months agoMerge pull request #57696 from kotreshhr/fix_headcheck_first_damage
Venky Shankar [Thu, 6 Jun 2024 13:45:30 +0000 (19:15 +0530)]
Merge pull request #57696 from kotreshhr/fix_headcheck_first_damage

tools/first-damage: Make CEPH_NOSNAP int64

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
16 months agomds: QuiesceDbRequest: update the internal encoding of ops
Leonid Usov [Thu, 6 Jun 2024 11:48:56 +0000 (14:48 +0300)]
mds: QuiesceDbRequest: update the internal encoding of ops

Excluding the last root from a set will automatically mark it as QS_CANCELED.
Hence, it makes more sense if `exclude` and `cancel` share the same op code,
rather than `exclude` and `release`.

Signed-off-by: Leonid Usov <leonid.usov@ibm.com>
Fixes: https://tracker.ceph.com/issues/66383
16 months agoRGW|BN: make http tests run on a separate task
Ali Masarwa [Wed, 5 Jun 2024 09:31:51 +0000 (12:31 +0300)]
RGW|BN: make http tests run on a separate task

Signed-off-by: Ali Masarwa <amasarwa@redhat.com>
16 months agotools/first-damage: Make CEPH_NOSNAP int64
Kotresh HR [Fri, 24 May 2024 11:33:10 +0000 (17:03 +0530)]
tools/first-damage: Make CEPH_NOSNAP int64

Make CEPH_NOSNAP int64 and pack 8 bytes when updating the omap
value.
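
A minimal sketch of what packing the value as 8 bytes could look like in
Python. It assumes the usual Ceph definition of CEPH_NOSNAP as (__u64)(-2);
the little-endian "<Q" format and the helper name pack_snapid are illustrative
assumptions, not taken from the tool itself.

```python
import struct

# Assumption: CEPH_NOSNAP is (__u64)(-2), i.e. the 64-bit pattern below.
CEPH_NOSNAP = 0xFFFFFFFFFFFFFFFE


def pack_snapid(snapid: int) -> bytes:
    """Pack a snapshot id as an 8-byte little-endian unsigned integer.

    Hypothetical helper: the byte order is an assumption for illustration.
    """
    return struct.pack("<Q", snapid)


packed = pack_snapid(CEPH_NOSNAP)
# The same 8 bytes result from packing -2 as a signed 64-bit integer.
same_as_signed = packed == struct.pack("<q", -2)
```

A 4-byte format such as "<I" cannot hold this value, which is why the width
of the packed field matters here.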

Signed-off-by: Kotresh HR <khiremat@redhat.com>
Signed-off-by: Venky Shankar <vshankar@redhat.com>
16 months agocrimson/osd/osd_operations: correct connection pipelines for osd
Xuehan Xu [Thu, 6 Jun 2024 03:55:38 +0000 (11:55 +0800)]
crimson/osd/osd_operations: correct connection pipelines for osd
operations

Fixes: https://tracker.ceph.com/issues/66374
Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
16 months agoMerge pull request #56275 from smanjara/wip-set-attr-mtime
Shilpa Jagannath [Wed, 5 Jun 2024 21:12:39 +0000 (14:12 -0700)]
Merge pull request #56275 from smanjara/wip-set-attr-mtime

rgw/multisite: allow PutACL replication.

16 months agocephadm/services/ingress: configure security user in keepalived template
Bernard Landon [Tue, 4 Jun 2024 21:29:54 +0000 (23:29 +0200)]
cephadm/services/ingress: configure security user in keepalived template

It is cleaner to enable script security and define a script user for keepalived.

Signed-off-by: Bernard Landon <bernard@lndn.ch>
16 months agoMerge pull request #57900 from zdover23/wip-doc-2024-06-06-start-intro-to-index
Zac Dover [Wed, 5 Jun 2024 20:04:42 +0000 (06:04 +1000)]
Merge pull request #57900 from zdover23/wip-doc-2024-06-06-start-intro-to-index

doc/start: s/intro.rst/index.rst/

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
16 months agoqa/multisite: add object set acl test
Shilpa Jagannath [Wed, 27 Mar 2024 06:14:44 +0000 (02:14 -0400)]
qa/multisite: add object set acl test

Signed-off-by: Shilpa Jagannath <smanjara@redhat.com>
16 months agodoc/start: s/intro.rst/index.rst/
Zac Dover [Wed, 5 Jun 2024 16:43:15 +0000 (02:43 +1000)]
doc/start: s/intro.rst/index.rst/

Change the filename "doc/start/intro.rst" to "doc/start/index.rst" so
that Sphinx finds the root filename for the "/start" directory in the
default location.

Signed-off-by: Zac Dover <zac.dover@proton.me>
16 months agorgw/multisite: allow modified object acls to be replicated
Shilpa Jagannath [Tue, 19 Mar 2024 14:59:35 +0000 (10:59 -0400)]
rgw/multisite: allow modified object acls to be replicated

Signed-off-by: Shilpa Jagannath <smanjara@redhat.com>
16 months agoMerge pull request #57015 from liangmingyuanneo/wip-bluefs-max-alloc-size
Yuri Weinstein [Wed, 5 Jun 2024 14:50:04 +0000 (07:50 -0700)]
Merge pull request #57015 from liangmingyuanneo/wip-bluefs-max-alloc-size

bluefs: bluefs alloc unit should only shrink

Reviewed-by: Igor Fedotov <ifedotov@suse.com>
16 months agoMerge pull request #56428 from tchaikov/wip-crush-no-vla
Yuri Weinstein [Wed, 5 Jun 2024 14:47:28 +0000 (07:47 -0700)]
Merge pull request #56428 from tchaikov/wip-crush-no-vla

crush: use std::vector instead of variable length arrays

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
16 months agoMerge pull request #56389 from ronen-fr/wip-rf-qtimeout
Yuri Weinstein [Wed, 5 Jun 2024 14:46:08 +0000 (07:46 -0700)]
Merge pull request #56389 from ronen-fr/wip-rf-qtimeout

osd/scrub: remove detection & handling of reservation timeouts from the code

Reviewed-by: Samuel Just <sjust@redhat.com>
16 months agocephadm: the user config is preferred during the bootstrap
gukaifeng [Wed, 5 Jun 2024 13:11:13 +0000 (21:11 +0800)]
cephadm: the user config is preferred during the bootstrap

Signed-off-by: gukaifeng <gukaifeng@xiaomi.com>
16 months agoqa/tasks/vstart_runner: avoid deprecation warning
Patrick Donnelly [Sat, 25 May 2024 18:31:44 +0000 (14:31 -0400)]
qa/tasks/vstart_runner: avoid deprecation warning

    /home/pdonnell/ceph/qa/tasks/vstart_runner.py:1280: DeprecationWarning: datetime.datetime.utcnow() is deprecated and scheduled for removal in a future version. Use timezone-aware objects to represent datetimes in UTC: datetime.datetime.now(datetime.UTC).
    time_elapsed = (datetime.datetime.utcnow() - started_at).total_seconds()
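
A minimal sketch of the timezone-aware replacement the warning suggests. The
function name elapsed_seconds is hypothetical, and this is not necessarily the
exact change made in vstart_runner.py. It uses datetime.timezone.utc, which
also works on Python versions that predate datetime.UTC.

```python
import datetime


def elapsed_seconds(started_at: datetime.datetime) -> float:
    """Return the seconds elapsed since a timezone-aware start time."""
    # now(timezone.utc) returns an aware datetime, unlike utcnow().
    now = datetime.datetime.now(datetime.timezone.utc)
    return (now - started_at).total_seconds()


# The start time must be aware as well: subtracting a naive datetime
# from an aware one raises TypeError.
started_at = datetime.datetime.now(datetime.timezone.utc)
time_elapsed = elapsed_seconds(started_at)
```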

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
16 months agoMerge pull request #57886 from zdover23/wip-doc-2024-06-05-dev-perf-counters
Ilya Dryomov [Wed, 5 Jun 2024 06:26:50 +0000 (08:26 +0200)]
Merge pull request #57886 from zdover23/wip-doc-2024-06-05-dev-perf-counters

doc/dev: origin of Labeled Perf Counters

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>