]> git.apps.os.sepia.ceph.com Git - ceph.git/log
ceph.git
21 months agoMerge pull request #53800 from batrick/cmake-debug-wrn
Ilya Dryomov [Tue, 17 Oct 2023 11:14:25 +0000 (13:14 +0200)]
Merge pull request #53800 from batrick/cmake-debug-wrn

CMakeLists: warn if CMAKE_BUILD_TYPE is set/unset

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
21 months agoMerge pull request #53802 from phlogistonjohn/jjm-cephadm-dtypes-2
Adam King [Mon, 16 Oct 2023 19:33:53 +0000 (15:33 -0400)]
Merge pull request #53802 from phlogistonjohn/jjm-cephadm-dtypes-2

cephadm: convert more types to be based on ContainerDaemonForm

Reviewed-by: Adam King <adking@redhat.com>
21 months agoMerge pull request #53351 from ceph/QE_defect_fixes
Casey Bodley [Mon, 16 Oct 2023 16:02:52 +0000 (17:02 +0100)]
Merge pull request #53351 from ceph/QE_defect_fixes

rgw/s3select: fix per QE defect.

21 months agoCMakeLists: warn if CMAKE_BUILD_TYPE is set/unset 53800/head
Patrick Donnelly [Tue, 3 Oct 2023 16:01:40 +0000 (16:01 +0000)]
CMakeLists: warn if CMAKE_BUILD_TYPE is set/unset

At some point the debug builds for wip branches no longer had the .git
directory available so the Debug build type was unset. This meant we are
no longer doing numerous checks (like mutex ownership checks) that we
would normally be doing in the qa suite.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
21 months agoMerge pull request #54022 from zdover23/wip-doc-2023-10-15-rados-troubleshooting...
zdover23 [Sun, 15 Oct 2023 23:39:20 +0000 (09:39 +1000)]
Merge pull request #54022 from zdover23/wip-doc-2023-10-15-rados-troubleshooting-troubleshooting-osd-3-of-x

doc/rados: Edit troubleshooting-osd (3 of x)

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
21 months agodoc/rados: Edit troubleshooting-osd (3 of x) 54022/head
Zac Dover [Sun, 15 Oct 2023 12:43:39 +0000 (22:43 +1000)]
doc/rados: Edit troubleshooting-osd (3 of x)

Edit doc/rados/troubleshooting/troubleshooting-osd.rst (3 of x).

Follows https://github.com/ceph/ceph/pull/53986.

Signed-off-by: Zac Dover <zac.dover@proton.me>
21 months agoMerge pull request #53547 from yuvalif/wip-yuval-req-context
Yuval Lifshitz [Sun, 15 Oct 2023 06:51:25 +0000 (09:51 +0300)]
Merge pull request #53547 from yuvalif/wip-yuval-req-context

rgw: adding request context structure

reviewed-by: dang, cbodley

21 months agoMerge pull request #54006 from zdover23/wip-doc-2023-10-14-filestore-deprecation
Anthony D'Atri [Fri, 13 Oct 2023 18:50:21 +0000 (14:50 -0400)]
Merge pull request #54006 from zdover23/wip-doc-2023-10-14-filestore-deprecation

doc/rados: remove FileStore material (in Reef)

21 months agodoc/rados: remove FileStore material (in Reef) 54006/head
Zac Dover [Fri, 13 Oct 2023 18:40:26 +0000 (04:40 +1000)]
doc/rados: remove FileStore material (in Reef)

Remove FileStore-related material because FileStore is not supported in
Reef.

Signed-off-by: Zac Dover <zac.dover@proton.me>
21 months agoMerge pull request #54005 from zdover23/wip-doc-2023-10-14-dev-cache-pool-removal
Anthony D'Atri [Fri, 13 Oct 2023 18:25:59 +0000 (14:25 -0400)]
Merge pull request #54005 from zdover23/wip-doc-2023-10-14-dev-cache-pool-removal

doc/dev: remove cache-pool

21 months agodoc/dev: remove cache-pool 54005/head
Zac Dover [Fri, 13 Oct 2023 18:17:44 +0000 (04:17 +1000)]
doc/dev: remove cache-pool

Remove cache-pool, which is deprecated in Reef.

Signed-off-by: Zac Dover <zac.dover@proton.me>
21 months agoMerge pull request #53979 from kamoltat/wip-ksirivad-fix-63183
Kamoltat (Junior) Sirivadhna [Fri, 13 Oct 2023 15:44:36 +0000 (11:44 -0400)]
Merge pull request #53979 from kamoltat/wip-ksirivad-fix-63183

src/mon/Monitor: Fix set_elector_disallowed_leaders
Reviewed-by: Greg Farnum <gfarnum@redhat.com>
21 months agoMerge pull request #53732 from pereman2/bluefs-expand-bug
Adam Kupczyk [Fri, 13 Oct 2023 14:07:57 +0000 (16:07 +0200)]
Merge pull request #53732 from pereman2/bluefs-expand-bug

os/bluestore: fix bluefs _extend_log seq update

21 months agoMerge pull request #53986 from zdover23/wip-doc-2023-10-12-rados-troubleshooting...
zdover23 [Fri, 13 Oct 2023 13:40:38 +0000 (23:40 +1000)]
Merge pull request #53986 from zdover23/wip-doc-2023-10-12-rados-troubleshooting-troubleshooting-osd-2-of-x

doc/rados: Edit troubleshooting-osd (2 of x)

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
21 months agodoc/rados: Edit troubleshooting-osd (2 of x) 53986/head
Zac Dover [Thu, 12 Oct 2023 08:33:58 +0000 (18:33 +1000)]
doc/rados: Edit troubleshooting-osd (2 of x)

Edit doc/rados/troubleshooting/troubleshooting.rst (2 of x).

Follows https://github.com/ceph/ceph/pull/53936.

Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
21 months agoMerge pull request #53234 from xxhdx1985126/wip-seastore-onode-manager-replay
Yingxin [Fri, 13 Oct 2023 08:28:34 +0000 (16:28 +0800)]
Merge pull request #53234 from xxhdx1985126/wip-seastore-onode-manager-replay

crimson/os/seastore/onode_manager: populate delta recorders for each onode modification

Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
21 months agofix per QE defect. 53351/head
galsalomon66 [Sat, 9 Sep 2023 13:38:23 +0000 (16:38 +0300)]
fix per QE defect.
s3select submodule

s3select engine changes

s3select submodule
aggregation functions upon no data

Signed-off-by: galsalomon66 <gal.salomon@gmail.com>
21 months agosrc/mon/Monitor: Fix set_elector_disallowed_leaders 53979/head
Kamoltat [Wed, 11 Oct 2023 21:12:03 +0000 (21:12 +0000)]
src/mon/Monitor: Fix set_elector_disallowed_leaders

Problem:

In the monitors we hold 2 copies of disallowed_leader ...
1. MonMap class 2. Elector class.
When computing the ConnectivityScore for the monitors during
the election, we use the `disallowed_leader` from Elector
class to determine which monitors we shouldn't allow to lead.

Now, we rely on the function `set_elector_disallowed_leaders`
to set the `disallowed_leader` of the Elector class, MonMap
class copy of the `disallowed_leader` contains the
`tiebreaker_monitor` so we inherit that plus we also add the
monitors that are dead due to a zone failure.

Hence, the `adding dead monitors` phase is only allowed if we can
enter stretch_mode. However, there is a problem when failing over a stretch cluster
zone and reviving the entire zone back up, the revived monitors
couldn't enter stretch_mode when they are at the state of "probing"
since PaxosServices like osdmon becomes unreadable (this is expected)

Solution:

We unconditionally add monitors that are in
`monmap->stretch_marked_down_mons` to the
`disallowed_leaders` list in
`Monitor::set_elector_disallowed_leaders` since
if the monitors are in `monmap->stretch_marked_down_mons`
we know that they probably belong in a marked down
zone and is not fit for lead.

This will fix the problem of newly revived monitors
having different disallowed_leaders set
and getting stuck in election.

Fixes: https://tracker.ceph.com/issues/63183
Signed-off-by: Kamoltat <ksirivad@redhat.com>
21 months agoMerge pull request #53914 from idryomov/wip-63149
Ilya Dryomov [Thu, 12 Oct 2023 19:29:05 +0000 (21:29 +0200)]
Merge pull request #53914 from idryomov/wip-63149

qa/suites/rbd: drop cache tiering workload tests

Reviewed-by: Ramana Raja <rraja@redhat.com>
Reviewed-by: Mykola Golub <mgolub@suse.com>
21 months agoMerge pull request #51416 from adk3798/jaeger-agent-dep
Adam King [Thu, 12 Oct 2023 19:24:26 +0000 (15:24 -0400)]
Merge pull request #51416 from adk3798/jaeger-agent-dep

mgr/cephadm: make jaeger-collector a dep for jaeger-agent

Reviewed-by: John Mulligan <jmulligan@redhat.com>
21 months agoMerge pull request #53980 from ceph/wip-yuri-crontab-main
Yuri Weinstein [Thu, 12 Oct 2023 14:56:01 +0000 (07:56 -0700)]
Merge pull request #53980 from ceph/wip-yuri-crontab-main

qa/tests: fixed typo and added `--force-priority` in missing places

Reviewed-by: Laura Flores <lflores@redhat.com>
21 months agoMerge pull request #53062 from xxhdx1985126/wip-seastore-list-objects
Yingxin [Thu, 12 Oct 2023 06:44:11 +0000 (14:44 +0800)]
Merge pull request #53062 from xxhdx1985126/wip-seastore-list-objects

crimson/os/seastore: return ghobject_t::max as the end when list_objects reaches the end of the listing

Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
Reviewed-by: Chunmei Liu <chunmei.liu@intel.com>
21 months agoMerge pull request #53936 from zdover23/wip-doc-2023-10-11-rados-troubleshooting...
zdover23 [Thu, 12 Oct 2023 05:15:58 +0000 (15:15 +1000)]
Merge pull request #53936 from zdover23/wip-doc-2023-10-11-rados-troubleshooting-troubleshooting-osd-1-of-x

doc/rados: edit troubleshooting-osd (1 of x)

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
21 months agodoc/rados: edit troubleshooting-osd (1 of x) 53936/head
Zac Dover [Wed, 11 Oct 2023 10:38:55 +0000 (20:38 +1000)]
doc/rados: edit troubleshooting-osd (1 of x)

Edit doc/rados/troubleshooting/troubleshooting-osd.

Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
21 months agotracing/osd/rgw: using jspan as alias to opentelemetry::Span 53547/head
Yuval Lifshitz [Tue, 10 Oct 2023 17:30:54 +0000 (17:30 +0000)]
tracing/osd/rgw: using jspan as alias to opentelemetry::Span

and jspan_ptr as jspan shared pointer

Signed-off-by: Yuval Lifshitz <ylifshit@redhat.com>
21 months agorgw: adding request context structure
Yuval Lifshitz [Wed, 20 Sep 2023 09:46:54 +0000 (09:46 +0000)]
rgw: adding request context structure

this structure should be created at the frontend and trickle all the way
to the RADOS layer. holding: dout prefix, optional yield and trace.
in this commit, so far it was only added to the "complete()" sal interface,
and to the "write_meta()" rados interface.

in the future, it should be added to more sal interfaces, replacing the
current way where dpp and optional yield are passed as sepearte
arguments to all functions.
in addition, if more information would be needed, it should be possible
to add that information to the request context struct without changing
many function prototypes

basic test instructions:
https://gist.github.com/yuvalif/1c7f1e80126bed5fa79345efb27fe1b1

Signed-off-by: Yuval Lifshitz <ylifshit@redhat.com>
21 months agoMerge pull request #53971 from zdover23/wip-doc-2023-10-12-start-get-involved-slack
zdover23 [Thu, 12 Oct 2023 03:44:24 +0000 (13:44 +1000)]
Merge pull request #53971 from zdover23/wip-doc-2023-10-12-start-get-involved-slack

doc/start: add Slack to get-involved

Reviewed-by: Laura Flores <lflores@redhat.com>
Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
21 months agoqa/tests: fixed typo and added `--force-priority` in missing places 53980/head
Yuri Weinstein [Wed, 11 Oct 2023 22:07:02 +0000 (15:07 -0700)]
qa/tests: fixed typo and added `--force-priority` in missing places

Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
21 months agoMerge pull request #53926 from zmc/workunit-assert-client-role
Zack Cerza [Wed, 11 Oct 2023 21:10:14 +0000 (15:10 -0600)]
Merge pull request #53926 from zmc/workunit-assert-client-role

 qa/tasks/workunit: improve assert on client names

21 months agosrc/mon: Added more loggings for disallowed_leaders
Kamoltat [Wed, 11 Oct 2023 21:08:19 +0000 (21:08 +0000)]
src/mon: Added more loggings for disallowed_leaders

Added more loggings regarding the disallowed_leaders
set so that it is easier to debug problems like:

https://tracker.ceph.com/issues/63183

Fixes: https://tracker.ceph.com/issues/63183
Signed-off-by: Kamoltat <ksirivad@redhat.com>
21 months agodoc/start: add Slack to get-involved 53971/head
Zac Dover [Wed, 11 Oct 2023 16:29:12 +0000 (02:29 +1000)]
doc/start: add Slack to get-involved

Add a link to the Ceph Slack to doc/start/get-involved.rst.

Signed-off-by: Zac Dover <zac.dover@proton.me>
21 months agoMerge pull request #53846 from cbodley/wip-62989-again
Casey Bodley [Wed, 11 Oct 2023 15:59:20 +0000 (16:59 +0100)]
Merge pull request #53846 from cbodley/wip-62989-again

rgw: fix http error checks in keystone/barbican/vault clients

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>
21 months agoMerge pull request #53579 from Matan-B/wip-matanb-reset_purged_snaps_last
Radoslaw Zarzynski [Wed, 11 Oct 2023 15:01:44 +0000 (17:01 +0200)]
Merge pull request #53579 from Matan-B/wip-matanb-reset_purged_snaps_last

osd/OSD: introduce reset_purged_snaps_last

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
21 months agoMerge pull request #53908 from zdover23/wip-doc-2023-10-10-troubleshooting-troublesho...
zdover23 [Wed, 11 Oct 2023 00:39:53 +0000 (10:39 +1000)]
Merge pull request #53908 from zdover23/wip-doc-2023-10-10-troubleshooting-troubleshooting-memory-profiling

doc/rados: edit memory-profiling.rst

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
21 months agoMerge pull request #53930 from ceph/wip-yuri-crontab-main
Yuri Weinstein [Tue, 10 Oct 2023 23:01:19 +0000 (16:01 -0700)]
Merge pull request #53930 from ceph/wip-yuri-crontab-main

qa/tests: Added upgrade/reef-x to the mix

Reviewed-by: Laura Flores <lflores@redhat.com>
21 months agoqa/tests: Added upgrade/reef-x to the mix 53930/head
Yuri Weinstein [Tue, 10 Oct 2023 19:00:12 +0000 (12:00 -0700)]
qa/tests: Added upgrade/reef-x to the mix

Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
21 months agoMerge pull request #53085 from ceph/wip-yuriw-reef-x-main
Yuri Weinstein [Tue, 10 Oct 2023 18:55:24 +0000 (11:55 -0700)]
Merge pull request #53085 from ceph/wip-yuriw-reef-x-main

qa/tests: initial reef-x check in

Reviewed-by: Laura Flores <lflores@redhat.com>
21 months agoworkunit: Improve assert on client names 53926/head
Zack Cerza [Tue, 10 Oct 2023 17:23:20 +0000 (11:23 -0600)]
workunit: Improve assert on client names

Instead of a bare AssertionError, we can mention the problematic value.

Signed-off-by: Zack Cerza <zack@redhat.com>
21 months agoMerge pull request #53862 from adk3798/nvmeof-upgrade
Adam King [Tue, 10 Oct 2023 16:08:43 +0000 (12:08 -0400)]
Merge pull request #53862 from adk3798/nvmeof-upgrade

mgr/cephadm: fix upgrades with nvmeof

Reviewed-by: Redouane Kachach <rkachach@redhat.com>
21 months agoMerge pull request #53900 from ljflores/wip-fix-stress-split-tests
Laura Flores [Tue, 10 Oct 2023 15:52:01 +0000 (11:52 -0400)]
Merge pull request #53900 from ljflores/wip-fix-stress-split-tests

qa/suites/upgrade: fix env indentation in stress-split upgrade tests

21 months agocephadm: convert ceph exporter type to a ContainerDaemonForm 53802/head
John Mulligan [Tue, 3 Oct 2023 20:52:09 +0000 (16:52 -0400)]
cephadm: convert ceph exporter type to a ContainerDaemonForm

CephExporter was being (partially) over-shadowed by the Ceph class as
the Ceph class listed 'ceph-exporter' as one of the daemon types it
handled. This change updates CephExporter to a ContainerDaemonForm while
simultaneously breaking the link between Ceph and 'ceph-exporter',
allowing CephExporter to handle all the duty of managing ceph-exporter,
continuing the process of having clearer logical responsibilities and
class hierarchy in cephadm.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
21 months agocephadm: mock os.path.listdir in daemon forms test
John Mulligan [Tue, 3 Oct 2023 20:51:49 +0000 (16:51 -0400)]
cephadm: mock os.path.listdir in daemon forms test

Prevent classes that want to check the filesystem from breaking the
simple daemon forms instantiation test case. A better future fix would
be avoiding checking the file system during __init__ of the class but
that is left for future improvements.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
21 months agocephadm: stop directly using Ceph.daemons property
John Mulligan [Tue, 3 Oct 2023 20:43:59 +0000 (16:43 -0400)]
cephadm: stop directly using Ceph.daemons property

The Ceph.daemons property has two unfortunate behaviors: most important,
it includes ceph-exporter which causes the other CephExporter class to
be over-shadowed the DaemonForms mechanism. Second, it couples all
functions that want to know the names of ceph daemon types to the Ceph
class preventing future refactoring of that class.

Break the existing coupling by adding a new `ceph_daemons` function
similar to `get_supported_daemons` but returning the same value that
Ceph.daemons used to provide. This will permit future fixes and
improvements.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
21 months agocephadm: eliminate _dispatch_deploy function
John Mulligan [Thu, 28 Sep 2023 18:15:55 +0000 (14:15 -0400)]
cephadm: eliminate _dispatch_deploy function

Eliminate the _dispatch_deploy function, folding it into the
_common_deploy function, because the mass of if-elif lines have
been replaced and keeping it as a separate function no longer
serves much of a useful purpose.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
21 months agocephadm: convert ceph class to a ContainerDaemonForm
John Mulligan [Thu, 28 Sep 2023 15:04:49 +0000 (11:04 -0400)]
cephadm: convert ceph class to a ContainerDaemonForm

Signed-off-by: John Mulligan <jmulligan@redhat.com>
21 months agodoc/rados: edit memory-profiling.rst 53908/head
Zac Dover [Tue, 10 Oct 2023 08:26:53 +0000 (18:26 +1000)]
doc/rados: edit memory-profiling.rst

Edit doc/rados/troubleshooting/memory-profiling.rst.

Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
21 months agoqa/tasks/ceph_manager: thrash - add reset_purged_snaps_last 53579/head
Matan Breizman [Thu, 31 Aug 2023 09:55:33 +0000 (09:55 +0000)]
qa/tasks/ceph_manager: thrash - add reset_purged_snaps_last

Signed-off-by: Matan Breizman <mbreizma@redhat.com>
21 months agoMerge pull request #50132 from aruniiird/add-rbd-mirror-mon-alerts
Juan Miguel Olmo [Tue, 10 Oct 2023 11:37:01 +0000 (13:37 +0200)]
Merge pull request #50132 from aruniiird/add-rbd-mirror-mon-alerts

ceph-mixin: Add RBD Mirror monitoring alerts

21 months agoqa/suites/rbd: drop redundant ignorelist entries 53914/head
Ilya Dryomov [Tue, 10 Oct 2023 10:31:28 +0000 (12:31 +0200)]
qa/suites/rbd: drop redundant ignorelist entries

CACHE_POOL_NO_HIT_SET is retained in *api_tests*.yaml and
rbd_mirror.yaml snippets for TestLibRBD.ListChildrenTiered and
TestClusterWatcher.CachePools tests.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
21 months agocrimson/os/seastore: return ghobject_t::max as the end when list_objects 53062/head
Xuehan Xu [Mon, 21 Aug 2023 08:33:16 +0000 (16:33 +0800)]
crimson/os/seastore: return ghobject_t::max as the end when list_objects
reaches the end of the listing

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
21 months agoqa/suites/rbd: deduplicate (data) pool facets
Ilya Dryomov [Sat, 30 Sep 2023 09:39:32 +0000 (11:39 +0200)]
qa/suites/rbd: deduplicate (data) pool facets

With cache tiering facets gone, "pool" facets are strictly about
--data-pool option now.  Rename to "data-pool" and create symlinks
to a common directory.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
21 months agoqa/suites/rbd: drop cache tiering workload tests
Ilya Dryomov [Sat, 30 Sep 2023 11:34:44 +0000 (13:34 +0200)]
qa/suites/rbd: drop cache tiering workload tests

Cache tiering facets have been a constant source of job timeouts
accompanied by "slow request" warnings on the OSDs for at least two
years.  Same workloads pass without pool/small-cache-pool.yaml or
thrashers/cache.yaml.

See cache tiering deprecation note added in commit 535b8db33ea0 ("doc:
deprecate the cache tiering").

Fixes: https://tracker.ceph.com/issues/63149
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
21 months agoMerge pull request #53719 from rishabh-d-dave/mon-fscmds
Rishabh Dave [Tue, 10 Oct 2023 05:56:35 +0000 (11:26 +0530)]
Merge pull request #53719 from rishabh-d-dave/mon-fscmds

 mon/FSCommands: clean up FSCommands

Reviewed-by: Leonid Usov <leonid.usov@ibm.com>
Reviewed-by: Xiubo Li <xiubli@redhat.com>
Reviewed-by: Kotresh HR <khiremat@redhat.com>
Reviewed-by: Laura Flores <lflores@redhat.com>
21 months agocrimson/os/seastore/onode_manager: drop unnecessary status 53234/head
Xuehan Xu [Tue, 10 Oct 2023 03:47:38 +0000 (11:47 +0800)]
crimson/os/seastore/onode_manager: drop unnecessary status

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
21 months agocrimson/os/seastore/onode_manager: avoid unnecessary delta related
Xuehan Xu [Sat, 7 Oct 2023 06:01:32 +0000 (14:01 +0800)]
crimson/os/seastore/onode_manager: avoid unnecessary delta related
overhead

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
21 months agoMerge PR #53885 into main
Venky Shankar [Tue, 10 Oct 2023 04:05:37 +0000 (09:35 +0530)]
Merge PR #53885 into main

* refs/pull/53885/head:
Revert "mds: disable delegating inode ranges to clients"

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
21 months agoMerge pull request #53517 from cbodley/wip-qa-distros-s
Yuri Weinstein [Mon, 9 Oct 2023 21:04:54 +0000 (14:04 -0700)]
Merge pull request #53517 from cbodley/wip-qa-distros-s

qa/distros: remove centos/rhel8 and ubuntu20.04 from supported distros

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Ilya Dryomov <idryomov@redhat.com>
Reviewed-by: Ali Maredia <amaredia@redhat.com>
21 months agoqa/suites/upgrade: fix env indentation in stress-split upgrade tests 53900/head
Laura Flores [Mon, 9 Oct 2023 20:24:14 +0000 (20:24 +0000)]
qa/suites/upgrade: fix env indentation in stress-split upgrade tests

This is an issue with the stress-split yaml files, as introduced in https://github.com/ceph/ceph/pull/51889.

The stress-split tests have an incorrectly-intented "env" section, which teuthology detects as an entry for "clients".

Fixes: https://tracker.ceph.com/issues/63158
Signed-off-by: Laura Flores <lflores@ibm.com>
21 months agoMerge pull request #53829 from ajarr/wip-63009
Ilya Dryomov [Mon, 9 Oct 2023 19:38:21 +0000 (21:38 +0200)]
Merge pull request #53829 from ajarr/wip-63009

librbd: kick ExclusiveLock state machine stalled waiting for lock from reacquire_lock()

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
21 months agoMerge pull request #53722 from rishabh-d-dave/mon-authmon
Rishabh Dave [Mon, 9 Oct 2023 17:23:42 +0000 (22:53 +0530)]
Merge pull request #53722 from rishabh-d-dave/mon-authmon

mon/AuthMonitor: clean up AuthMonitor

Reviewed-by: Leonid Usov <leonid.usov@ibm.com>
Reviewed-by: Xiubo Li <xiubli@redhat.com>
Reviewed-by: Kotresh HR <khiremat@redhat.com>
Reviewed-by: Laura Flores <lflores@redhat.com>
21 months agoMerge pull request #53721 from rishabh-d-dave/mon-mdsmon
Rishabh Dave [Mon, 9 Oct 2023 17:22:44 +0000 (22:52 +0530)]
Merge pull request #53721 from rishabh-d-dave/mon-mdsmon

mon/MDSMonitor: clean up MDSMonitor

Reviewed-by: Leonid Usov <leonid.usov@ibm.com>
Reviewed-by: Xiubo Li <xiubli@redhat.com>
Reviewed-by: Kotresh HR <khiremat@redhat.com>
Reviewed-by: Laura Flores <lflores@redhat.com>
21 months agoMerge pull request #53405 from rishabh-d-dave/ceph-auth-caps-val-caps
Rishabh Dave [Mon, 9 Oct 2023 17:19:28 +0000 (22:49 +0530)]
Merge pull request #53405 from rishabh-d-dave/ceph-auth-caps-val-caps

mon/AuthMonitor: make "ceph auth caps" print error messsages

Reviewed-by: Xiubo Li <xiubli@redhat.com>
Reviewed-by: Laura Flores <lflores@redhat.com>
21 months agoMerge PR #53099 into main
Patrick Donnelly [Mon, 9 Oct 2023 17:03:37 +0000 (13:03 -0400)]
Merge PR #53099 into main

* refs/pull/53099/head:
script: update ceph-debug-docker for centos 9.stream

Reviewed-by: Laura Flores <lflores@redhat.com>
21 months agoRevert "mds: disable delegating inode ranges to clients" 53885/head
Venky Shankar [Mon, 9 Oct 2023 05:06:49 +0000 (10:36 +0530)]
Revert "mds: disable delegating inode ranges to clients"

This isn't necessary -- the MDS handles delegating inode ranges
to clients from its preallocated inode set properly - the suspected
bug involving not persisting the sessionmap and causing asserts
during replay isn't an issue. The preallocated set is persisted
with the log event and the MDS correctly rebuild the set from
this information during replay.

Signed-off-by: Venky Shankar <vshankar@redhat.com>
21 months agoMerge PR #53206 into main
Patrick Donnelly [Mon, 9 Oct 2023 15:53:31 +0000 (11:53 -0400)]
Merge PR #53206 into main

* refs/pull/53206/head:
mds: use LogSegment dump for debugging

Reviewed-by: Leonid Usov <leonid.usov@ibm.com>
Reviewed-by: Venky Shankar <vshankar@redhat.com>
21 months agoMerge pull request #52254 from vedanshbhartia/coverity_uninit
Yuval Lifshitz [Mon, 9 Oct 2023 14:58:42 +0000 (17:58 +0300)]
Merge pull request #52254 from vedanshbhartia/coverity_uninit

rgw: Add coverity uninitialized variable and initialize RGWBucketEntryMetadataObject

reviewed-by: yuvalif

21 months agoMerge pull request #52328 from vedanshbhartia/coverity_1512267
Yuval Lifshitz [Mon, 9 Oct 2023 14:57:47 +0000 (17:57 +0300)]
Merge pull request #52328 from vedanshbhartia/coverity_1512267

rgw: fix potential null dereference in rgw_iam_policy.c: ParseState::do_string

reviewed-by: yuvalif

21 months agoMerge pull request #52472 from vedanshbhartia/coverity_1510724
Yuval Lifshitz [Mon, 9 Oct 2023 14:56:40 +0000 (17:56 +0300)]
Merge pull request #52472 from vedanshbhartia/coverity_1510724

rgw: Remove unnecessary null check from valid_s3_bucket_name

reviwed-by: soumyakoduri , yuvalif

21 months agoMerge pull request #52734 from vedanshbhartia/coverity_ostream
Yuval Lifshitz [Mon, 9 Oct 2023 14:54:34 +0000 (17:54 +0300)]
Merge pull request #52734 from vedanshbhartia/coverity_ostream

rgw: Restore ostream format state after changing it

reviwed-by: yuvalif

21 months agoMerge pull request #52326 from yuvalif/wip-yuval-lua-reload
Yuval Lifshitz [Mon, 9 Oct 2023 14:53:15 +0000 (17:53 +0300)]
Merge pull request #52326 from yuvalif/wip-yuval-lua-reload

rgw/lua: support reloading lua packages on all RGWs

reviwed-by: dang, cbodle, anthonyeleven

21 months agoMerge pull request #53890 from zdover23/wip-doc-2023-10-09-troubleshooting-troublesho...
zdover23 [Mon, 9 Oct 2023 13:53:05 +0000 (23:53 +1000)]
Merge pull request #53890 from zdover23/wip-doc-2023-10-09-troubleshooting-troubleshooting-mon-4-of-x

doc/rados: edit troubleshooting-mon.rst (4 of x)

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
21 months agoMerge pull request #53892 from rishabh-d-dave/fix-test_cephfs.py
Rishabh Dave [Mon, 9 Oct 2023 13:47:07 +0000 (19:17 +0530)]
Merge pull request #53892 from rishabh-d-dave/fix-test_cephfs.py

src/test/pybind: don't use decorator "with_setup"

Reviewed-by: Venky Shankar <vshankar@redhat.com>
21 months agodoc/rados: edit troubleshooting-mon.rst (4 of x) 53890/head
Zac Dover [Mon, 9 Oct 2023 08:55:07 +0000 (18:55 +1000)]
doc/rados: edit troubleshooting-mon.rst (4 of x)

Edit doc/rados/troubleshooting/troubleshooting-mon.rst.

Follows https://github.com/ceph/ceph/pull/53875

Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
21 months agoMerge pull request #53884 from leonid-s-usov/rgw-posix-test
Daniel Gryniewicz [Mon, 9 Oct 2023 12:57:13 +0000 (08:57 -0400)]
Merge pull request #53884 from leonid-s-usov/rgw-posix-test

test/rgw: don't compile POSIX test unless enabled

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>
21 months agoMerge PR #53873 into main
Venky Shankar [Mon, 9 Oct 2023 12:19:30 +0000 (17:49 +0530)]
Merge PR #53873 into main

* refs/pull/53873/head:
qa: typo fix when checking for perf counter - s/md_thresh_evicted/mdthresh_evicted
qa: lower mds_session_metadata_threshold for tests

Reviewed-by: Milind Changire <mchangir@redhat.com>
Reviewed-by: Rishabh Dave <ridave@redhat.com>
Reviewed-by: Leonid Usov <leonid.usov@ibm.com>
21 months agosrc/test/pybind: don't use decorator "with_setup" 53892/head
Rishabh Dave [Mon, 9 Oct 2023 11:28:16 +0000 (16:58 +0530)]
src/test/pybind: don't use decorator "with_setup"

Signed-off-by: Rishabh Dave <ridave@redhat.com>
21 months agoqa: typo fix when checking for perf counter - s/md_thresh_evicted/mdthresh_evicted 53873/head
Venky Shankar [Mon, 9 Oct 2023 04:29:45 +0000 (09:59 +0530)]
qa: typo fix when checking for perf counter - s/md_thresh_evicted/mdthresh_evicted

Signed-off-by: Venky Shankar <vshankar@redhat.com>
21 months agoqa: lower mds_session_metadata_threshold for tests
Venky Shankar [Sat, 7 Oct 2023 15:12:29 +0000 (20:42 +0530)]
qa: lower mds_session_metadata_threshold for tests

... and increase the number of files that are created so as to
hit the threshold with a high probability.

Fixes: http://tracker.ceph.com/issues/62873
Signed-off-by: Venky Shankar <vshankar@redhat.com>
21 months agoMerge pull request #53817 from cloudbehl/active-alert-filter
Nizamudeen A [Mon, 9 Oct 2023 05:11:21 +0000 (10:41 +0530)]
Merge pull request #53817 from cloudbehl/active-alert-filter

mgr/dashboard: Filter active alerts

Reviewed-by: Nizamudeen A <nia@redhat.com>
21 months agotest/rgw: don't compile POSIX test unless enabled 53884/head
Leonid Usov [Sun, 8 Oct 2023 14:14:23 +0000 (17:14 +0300)]
test/rgw: don't compile POSIX test unless enabled

Signed-off-by: Leonid Usov <leonid.usov@ibm.com>
21 months agolibrbd/ManagedLock: kickstart ExclusiveLock state machine 53829/head
Ramana Raja [Mon, 2 Oct 2023 16:39:26 +0000 (12:39 -0400)]
librbd/ManagedLock: kickstart ExclusiveLock state machine

... that is stalled waiting for lock. Do this when trying to reacquire
lock in the ImageWatcher's rewatch mechanism. This would enable the
ExclusiveLock state machine to propagate the blocklist error to the
caller trying to perform an image operation requiring an exclusive
lock.

Previous attempt, e66db763, to fix the hang due to exclusive lock
acquisiton (stuck waiting for lock) racing with client blocklisting
did not always work. e66db763 kickstarted the ExclusiveLock state
machine when the ImageWatcher tried to schedule a exclusive lock
request and the blocklisting was detected. However, there is a short
window between a watch getting deregistered and client blocklisting
getting detected as part of rewatching. If hit when trying to schedule
a lock request, the ExclusiveLock state machine wasn't kickstarted,
blocklist error wasn't propagated, and the hang resurfaced.

A more robust approach is taken to resume the ExclusiveLock state
machine stuck waiting for lock during client blocklisting. Whenever
a client's ImageWatcher loses connection to the cluster, as it happens
during blocklising, the ImageWatcher initiates a mechanism to rewatch
the image and tries to reacquire the lock. Piggyback on this rewatch
mechanism that gets triggered during client blocklisting. And when
trying to reacquire the lock, kickstart the ExclusiveLock state
machine stalled waiting for lock (STATE_WAITING_FOR_LOCK).

Fixes: https://tracker.ceph.com/issues/63009
Signed-off-by: Ramana Raja <rraja@redhat.com>
21 months agocrimson/os/seastore/onode_manager: drop write_dirty
Xuehan Xu [Wed, 27 Sep 2023 09:15:29 +0000 (17:15 +0800)]
crimson/os/seastore/onode_manager: drop write_dirty

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
21 months agocrimson/os/seastore/onode_manager: populate delta recorders for each
Xuehan Xu [Wed, 27 Sep 2023 09:13:44 +0000 (17:13 +0800)]
crimson/os/seastore/onode_manager: populate delta recorders for each
onode modification

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
21 months agoMerge pull request #53874 from zdover23/wip-doc-2023-10-07-rados-troubleshooting...
zdover23 [Sun, 8 Oct 2023 05:10:20 +0000 (16:10 +1100)]
Merge pull request #53874 from zdover23/wip-doc-2023-10-07-rados-troubleshooting-community

doc/rados: edit troubleshooting/community.rst

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
21 months agoMerge pull request #53875 from zdover23/wip-doc-2023-10-07-troubleshooting-troublesho...
zdover23 [Sun, 8 Oct 2023 04:50:45 +0000 (15:50 +1100)]
Merge pull request #53875 from zdover23/wip-doc-2023-10-07-troubleshooting-troubleshooting-mon-3-of-x

doc/rados: edit troubleshooting-mon.rst (3 of x)

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
21 months agodoc/rados: edit troubleshooting/community.rst 53874/head
Zac Dover [Sat, 7 Oct 2023 20:39:23 +0000 (06:39 +1000)]
doc/rados: edit troubleshooting/community.rst

Edit doc/rados/troubleshooting/community.rst.

Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
21 months agodoc/rados: edit troubleshooting-mon.rst (3 of x) 53875/head
Zac Dover [Sat, 7 Oct 2023 21:33:17 +0000 (07:33 +1000)]
doc/rados: edit troubleshooting-mon.rst (3 of x)

Edit doc/rados/troubleshooting/troubleshooting-mon.rst.

Follows https://github.com/ceph/ceph/pull/52827

Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
21 months agoMerge pull request #53876 from zdover23/wip-doc-2023-10-08-architecture-rbd-sentence...
Anthony D'Atri [Sun, 8 Oct 2023 00:45:38 +0000 (20:45 -0400)]
Merge pull request #53876 from zdover23/wip-doc-2023-10-08-architecture-rbd-sentence-repair

doc/architecture: repair RBD sentence

21 months agodoc/architecture: repair RBD sentence 53876/head
Zac Dover [Sat, 7 Oct 2023 21:43:43 +0000 (07:43 +1000)]
doc/architecture: repair RBD sentence

Improve an ambiguous sentence in doc/architecture.rst.

The problem presented by the original sentence is that the phrasal verb
"to provide with" is implicated in one of its possible readings.
Interpreted in that way, the sentence seems to express the incorrect
idea that RBD furnishes block devices with snapshotting and cloning, as
though snapshotting and cloning are being delivered to the block
devices. In fact, snapshotting and cloning are just features of RBD, and
are features that are described on this page:
https://docs.ceph.com/en/quincy/rbd/rbd-snapshot/.

Signed-off-by: Zac Dover <zac.dover@proton.me>
21 months agoMerge pull request #53790 from zdover23/wip-doc-2023-10-03-architecture-17-of-x
zdover23 [Sat, 7 Oct 2023 04:51:27 +0000 (15:51 +1100)]
Merge pull request #53790 from zdover23/wip-doc-2023-10-03-architecture-17-of-x

doc/architecture: edit "Peering and Sets"

21 months agodoc/architecture: edit "Peering and Sets" 53790/head
Zac Dover [Tue, 3 Oct 2023 12:11:50 +0000 (22:11 +1000)]
doc/architecture: edit "Peering and Sets"

Edit the English in the section "Peering and Sets" in the file
doc/architecture.rst.

Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
21 months agoMerge PR #53855 into main
Patrick Donnelly [Sat, 7 Oct 2023 00:15:23 +0000 (20:15 -0400)]
Merge PR #53855 into main

* refs/pull/53855/head:
script: add option for debug build

Reviewed-by: Ilya Dryomov <idryomov@redhat.com>
21 months agoscript: add option for debug build 53855/head
Patrick Donnelly [Fri, 6 Oct 2023 00:11:29 +0000 (20:11 -0400)]
script: add option for debug build

See: https://github.com/ceph/ceph-build/pull/2167

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
21 months agomgr/cephadm: fix upgrades with nvmeof 53862/head
Adam King [Fri, 6 Oct 2023 15:20:57 +0000 (11:20 -0400)]
mgr/cephadm: fix upgrades with nvmeof

Currently, nvmeof was being treated as if it used
a ceph image during upgrades. This would cause logging
of messages like (I've removed the nvmeof daemon id)

log [WRN] :     Upgrade daemon: nvmeof.<id>: Cannot redeploy
nvmeof.<id> with a new image: Supported types are: mgr, mon,
crash, osd, mds, rgw, rbd-mirror, cephfs-mirror, ceph-exporter,
iscsi, nfs

and if you had set a custom image for the
mgr/cephadm/container_image_nvmeof setting, this would
be undone as part of the upgrade process.

Fixes: https://tracker.ceph.com/issues/63127
Signed-off-by: Adam King <adking@redhat.com>
21 months agoMerge pull request #53417 from jrchyang/fix_mclock_scheduling_slow_main
Yuri Weinstein [Fri, 6 Oct 2023 13:58:11 +0000 (06:58 -0700)]
Merge pull request #53417 from jrchyang/fix_mclock_scheduling_slow_main

osd: fix: slow scheduling when item_cost is large

Reviewed-by: Sridhar Seshasayee <sseshasa@redhat.com>
21 months agoMerge pull request #52430 from vedanshbhartia/coverity_datarace
Yuval Lifshitz [Fri, 6 Oct 2023 10:07:31 +0000 (13:07 +0300)]
Merge pull request #52430 from vedanshbhartia/coverity_datarace

rgw: Add coverity annotations for missing mutex locks

reviewed-by: yuvalif, mkogan1

21 months agorgw/lua/doc: support reloading lua packages on all RGWs 52326/head
Yuval Lifshitz [Wed, 5 Jul 2023 18:33:55 +0000 (18:33 +0000)]
rgw/lua/doc: support reloading lua packages on all RGWs

without requiring a restart of the RGWs
test instructions:
https://gist.github.com/yuvalif/95b8ed9ea73ab4591c59644a050e01e2
also use capitalized "Lua" in logs/doc

Signed-off-by: Yuval Lifshitz <ylifshit@ibm.com>
21 months agomon/AuthMonitor: check if entity is absent before creating it 53405/head
Rishabh Dave [Thu, 28 Sep 2023 17:34:51 +0000 (23:04 +0530)]
mon/AuthMonitor: check if entity is absent before creating it

Although this code path is not used for creating entities yet, it is
better to fix the bug sooner than later. Method
AuthMonitor::_update_or_create_entity() must exit (with appropriate
error code) when entity to be created on the Ceph cluster is already
present.

Signed-off-by: Rishabh Dave <ridave@redhat.com>
21 months agomds/MDSAuthCaps: re-word an error message for better clarity
Rishabh Dave [Thu, 28 Sep 2023 14:29:24 +0000 (19:59 +0530)]
mds/MDSAuthCaps: re-word an error message for better clarity

Signed-off-by: Rishabh Dave <ridave@redhat.com>
21 months agoMerge pull request #53834 from dparmar18/remove-egg-fragment-from-doc
zdover23 [Thu, 5 Oct 2023 23:21:14 +0000 (10:21 +1100)]
Merge pull request #53834 from dparmar18/remove-egg-fragment-from-doc

doc: remove egg fragment from dev/developer_guide/running-tests-locally

Reviewed-by: John Mulligan <jmulligan@redhat.com>
Reviewed-by: Rishabh Dave <ridave@redhat.com>
Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
Reviewed-by: Zac Dover <zac.dover@proton.me>