At some point the debug builds for wip branches no longer had the .git
directory available, so the Debug build type was unset. This meant we were
no longer doing numerous checks (like mutex ownership checks) that we
would normally be doing in the qa suite.
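For illustration, here is a minimal self-contained sketch (hypothetical names,
not Ceph's actual ceph::mutex machinery) of the kind of ownership check that
silently disappears when no Debug build type is set and NDEBUG ends up defined:

    // Hypothetical sketch: an ownership-checking mutex wrapper.  Without a
    // Debug build type (NDEBUG defined), assert() compiles to nothing and a
    // caller that forgot to take the lock goes unnoticed.
    #include <cassert>
    #include <mutex>
    #include <thread>

    class checked_mutex {
      std::mutex m;
      std::thread::id owner;   // only meaningful while the mutex is held
    public:
      void lock()   { m.lock(); owner = std::this_thread::get_id(); }
      void unlock() { owner = std::thread::id(); m.unlock(); }
      bool is_locked_by_me() const { return owner == std::this_thread::get_id(); }
    };

    checked_mutex lock;
    int protected_value = 0;

    void update() {
      assert(lock.is_locked_by_me());  // the check we lose without a Debug build
      ++protected_value;
    }

    int main() {
      lock.lock();
      update();
      lock.unlock();
      return 0;
    }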
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Ilya Dryomov [Thu, 12 Oct 2023 17:03:10 +0000 (19:03 +0200)]
pybind/rbd: don't produce info on errors in aio_mirror_image_get_info()
Check the completion return value before attempting to decode c_info.
Otherwise, when the client is blocklisted for example, we are guaranteed
to access invalid memory in decode_cstr() while trying to compute the
global_id string length.
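As a minimal self-contained illustration of the pattern the fix enforces
(hypothetical types, not the actual pybind/librbd code): inspect the
completion's return value first and only decode the info payload on success:

    // Hypothetical sketch: never decode the info buffer when the completion
    // reports an error -- on error the buffer was never filled in, so taking
    // strlen() of global_id would read invalid memory.
    #include <cerrno>
    #include <cstring>
    #include <iostream>
    #include <optional>
    #include <string>

    struct completion { int ret; };                  // stand-in for an aio completion
    struct mirror_image_info { const char* global_id; };

    std::optional<std::string>
    decode_info(const completion& c, const mirror_image_info& raw) {
      if (c.ret < 0) {
        return std::nullopt;                         // e.g. blocklisted client
      }
      return std::string(raw.global_id, std::strlen(raw.global_id));
    }

    int main() {
      mirror_image_info never_filled{};              // call failed, buffer untouched
      completion failed{-EIO};
      if (!decode_info(failed, never_filled)) {
        std::cout << "completion reported an error, info not decoded\n";
      }
      return 0;
    }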
In the monitors we hold two copies of disallowed_leaders:
1. in the MonMap class
2. in the Elector class
When computing the ConnectivityScore for the monitors during the
election, we use the disallowed_leaders from the Elector class to
determine which monitors we shouldn't allow to lead.
We rely on the function `set_elector_disallowed_leaders` to set the
disallowed_leaders of the Elector class. The MonMap copy of
disallowed_leaders contains the `tiebreaker_monitor`, so we inherit
that and additionally add the monitors that are dead due to a zone
failure.
The "adding dead monitors" step, however, is only allowed if we can
enter stretch_mode. This is a problem when failing over a stretch
cluster zone and reviving the entire zone: the revived monitors
can't enter stretch_mode while they are still "probing", since
PaxosServices like osdmon become unreadable (this is expected).
Solution:
We unconditionally add monitors that are in
`monmap->stretch_marked_down_mons` to the
`disallowed_leaders` list in
`Monitor::set_elector_disallowed_leaders`, since monitors in
`monmap->stretch_marked_down_mons` most likely belong to a
marked-down zone and are not fit to lead.
This fixes the problem of newly revived monitors
having a different disallowed_leaders set
and getting stuck in election.
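A simplified standalone sketch of the intended behavior (hypothetical helper,
not the actual Monitor::set_elector_disallowed_leaders code): the
stretch-marked-down monitors are merged into the disallowed set
unconditionally, independent of whether this monitor can enter stretch_mode yet:

    // Hypothetical, simplified sketch of building the elector's disallowed set.
    #include <iostream>
    #include <set>
    #include <string>

    std::set<std::string> compute_disallowed_leaders(
        const std::set<std::string>& monmap_disallowed,       // includes tiebreaker
        const std::set<std::string>& stretch_marked_down_mons) {
      std::set<std::string> disallowed = monmap_disallowed;    // inherit from MonMap
      // Unconditionally treat monitors of a marked-down zone as unfit to lead,
      // even on a monitor that is still probing and can't enter stretch mode yet.
      disallowed.insert(stretch_marked_down_mons.begin(),
                        stretch_marked_down_mons.end());
      return disallowed;
    }

    int main() {
      for (const auto& m :
           compute_disallowed_leaders({"tiebreaker"}, {"mon.c", "mon.d"})) {
        std::cout << m << '\n';    // mon.c, mon.d, tiebreaker
      }
      return 0;
    }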
this structure should be created at the frontend and trickle all the way
down to the RADOS layer, holding: the dout prefix, the optional yield,
and the trace.
in this commit it was only added to the "complete()" SAL interface
and to the "write_meta()" RADOS interface.
in the future it should be added to more SAL interfaces, replacing the
current way in which the dpp and optional yield are passed as separate
arguments to all functions.
in addition, if more information is needed, it should be possible
to add it to the request context struct without changing
many function prototypes.
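as a rough standalone sketch (placeholder types and names, not the actual RGW
definitions), the request context bundles the per-request pieces that are
currently passed as separate arguments:

    // Hypothetical sketch of a per-request context handed from the frontend
    // down through the SAL interfaces to the RADOS layer.
    #include <iostream>
    #include <optional>
    #include <string>

    struct DoutPrefix  { std::string prefix; };   // placeholder for the dout prefix
    struct YieldHandle { };                       // placeholder for optional yield
    struct TraceSpan   { std::string name; };     // placeholder for the trace

    struct req_context {
      const DoutPrefix* dpp = nullptr;
      std::optional<YieldHandle> y;               // optional yield
      TraceSpan* trace = nullptr;
      // more per-request information can be added here later without
      // changing every function prototype that takes the context
    };

    // instead of f(dpp, y, ...) everywhere, interfaces take one context:
    void write_meta(const req_context& rctx) {
      std::cout << (rctx.dpp ? rctx.dpp->prefix : "") << "writing metadata\n";
    }

    int main() {
      DoutPrefix dpp{"req 1234: "};
      req_context rctx{&dpp, std::nullopt, nullptr};
      write_meta(rctx);
      return 0;
    }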
John Mulligan [Tue, 3 Oct 2023 20:52:09 +0000 (16:52 -0400)]
cephadm: convert ceph exporter type to a ContainerDaemonForm
CephExporter was being (partially) over-shadowed by the Ceph class,
which listed 'ceph-exporter' as one of the daemon types it handled.
This change updates CephExporter to a ContainerDaemonForm and breaks
the link between Ceph and 'ceph-exporter', allowing CephExporter to
handle all the duties of managing ceph-exporter and continuing the
process of establishing clearer logical responsibilities and a clearer
class hierarchy in cephadm.
Signed-off-by: John Mulligan <jmulligan@redhat.com>
John Mulligan [Tue, 3 Oct 2023 20:51:49 +0000 (16:51 -0400)]
cephadm: mock os.path.listdir in daemon forms test
Prevent classes that want to check the filesystem from breaking the
simple daemon forms instantiation test case. A better fix would be to
avoid checking the file system during __init__ of the class, but that
is left for future improvements.
Signed-off-by: John Mulligan <jmulligan@redhat.com>
John Mulligan [Tue, 3 Oct 2023 20:43:59 +0000 (16:43 -0400)]
cephadm: stop directly using Ceph.daemons property
The Ceph.daemons property has two unfortunate behaviors: most
importantly, it includes ceph-exporter, which causes the separate
CephExporter class to be over-shadowed in the DaemonForms mechanism.
Second, it couples all functions that want to know the names of ceph
daemon types to the Ceph class, preventing future refactoring of that
class.
Break the existing coupling by adding a new `ceph_daemons` function
similar to `get_supported_daemons` but returning the same value that
Ceph.daemons used to provide. This will permit future fixes and
improvements.
Signed-off-by: John Mulligan <jmulligan@redhat.com>
John Mulligan [Thu, 28 Sep 2023 18:15:55 +0000 (14:15 -0400)]
cephadm: eliminate _dispatch_deploy function
Eliminate the _dispatch_deploy function, folding it into the
_common_deploy function: the mass of if-elif lines has been replaced,
so keeping it as a separate function no longer serves a useful
purpose.
Signed-off-by: John Mulligan <jmulligan@redhat.com>
Ilya Dryomov [Tue, 10 Oct 2023 10:31:28 +0000 (12:31 +0200)]
qa/suites/rbd: drop redundant ignorelist entries
CACHE_POOL_NO_HIT_SET is retained in *api_tests*.yaml and
rbd_mirror.yaml snippets for TestLibRBD.ListChildrenTiered and
TestClusterWatcher.CachePools tests.
With cache tiering facets gone, "pool" facets are strictly about the
--data-pool option now. Rename them to "data-pool" and create symlinks
to a common directory.
Cache tiering facets have been a constant source of job timeouts
accompanied by "slow request" warnings on the OSDs for at least two
years. Same workloads pass without pool/small-cache-pool.yaml or
thrashers/cache.yaml.
See cache tiering deprecation note added in commit 535b8db33ea0 ("doc:
deprecate the cache tiering").
Venky Shankar [Mon, 9 Oct 2023 05:06:49 +0000 (10:36 +0530)]
Revert "mds: disable delegating inode ranges to clients"
This isn't necessary -- the MDS handles delegating inode ranges
to clients from its preallocated inode set properly. The suspected
bug involving not persisting the sessionmap and causing asserts
during replay isn't an issue: the preallocated set is persisted
with the log event and the MDS correctly rebuilds the set from
this information during replay.
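For intuition, a toy standalone sketch (hypothetical types, not the MDS code)
of why replay can rebuild the state: the delegated range is recorded in the
journaled event, so the session's preallocated set can be reconstructed even
without a freshly persisted SessionMap:

    // Hypothetical sketch: rebuild a session's preallocated inode set from
    // journaled log events during replay.
    #include <cstdint>
    #include <iostream>
    #include <set>

    struct LogEventSketch {
      uint64_t client_id = 0;
      std::set<uint64_t> prealloc_inos;   // recorded when the event was journaled
    };

    struct SessionSketch {
      std::set<uint64_t> prealloc_inos;   // empty after a restart
    };

    void replay(SessionSketch& session, const LogEventSketch& ev) {
      // the journal carries the delegated inode range, so the set can be rebuilt
      session.prealloc_inos.insert(ev.prealloc_inos.begin(), ev.prealloc_inos.end());
    }

    int main() {
      SessionSketch s;
      replay(s, {42, {1001, 1002, 1003}});
      std::cout << "rebuilt " << s.prealloc_inos.size() << " preallocated inodes\n";
      return 0;
    }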
Ramana Raja [Mon, 2 Oct 2023 16:39:26 +0000 (12:39 -0400)]
librbd/ManagedLock: kickstart ExclusiveLock state machine
... that is stalled waiting for lock. Do this when trying to reacquire
the lock in the ImageWatcher's rewatch mechanism. This enables the
ExclusiveLock state machine to propagate the blocklist error to the
caller trying to perform an image operation that requires an exclusive
lock.
The previous attempt, e66db763, to fix the hang caused by exclusive lock
acquisition (stuck waiting for lock) racing with client blocklisting
did not always work. e66db763 kickstarted the ExclusiveLock state
machine when the ImageWatcher tried to schedule an exclusive lock
request and the blocklisting was detected. However, there is a short
window between a watch getting deregistered and client blocklisting
getting detected as part of rewatching. If that window was hit when
trying to schedule a lock request, the ExclusiveLock state machine
wasn't kickstarted, the blocklist error wasn't propagated, and the
hang resurfaced.
A more robust approach is taken to resume an ExclusiveLock state
machine stuck waiting for lock during client blocklisting. Whenever
a client's ImageWatcher loses its connection to the cluster, as happens
during blocklisting, the ImageWatcher initiates a mechanism to rewatch
the image and tries to reacquire the lock. Piggyback on this rewatch
mechanism that gets triggered during client blocklisting: when trying
to reacquire the lock, kickstart the ExclusiveLock state machine
stalled waiting for lock (STATE_WAITING_FOR_LOCK).
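A toy standalone sketch of the idea (hypothetical classes, not the actual
librbd code): when the rewatch path runs after the connection is lost, any
acquisition stalled in the waiting-for-lock state is kicked so it completes
with the blocklist error instead of hanging:

    // Hypothetical sketch: resume a lock state machine stalled in
    // WAITING_FOR_LOCK from the rewatch/reacquire path.
    #include <functional>
    #include <iostream>
    #include <utility>
    #include <vector>

    enum class State { UNLOCKED, WAITING_FOR_LOCK, LOCKED };

    struct ExclusiveLockSketch {
      State state = State::UNLOCKED;
      std::vector<std::function<void(int)>> waiters;   // stalled acquire callbacks

      void request_lock(std::function<void(int)> on_finish) {
        state = State::WAITING_FOR_LOCK;
        waiters.push_back(std::move(on_finish));       // stalled until kicked
      }

      // called from the rewatch path with the error to propagate
      void kickstart(int err) {
        if (state != State::WAITING_FOR_LOCK) return;
        state = State::UNLOCKED;
        for (auto& w : waiters) w(err);                // unblock the callers
        waiters.clear();
      }
    };

    int main() {
      ExclusiveLockSketch lock;
      lock.request_lock([](int r) { std::cout << "acquire finished: " << r << '\n'; });
      // watcher lost its connection (blocklisted) -> rewatch -> reacquire kicks us
      lock.kickstart(-108 /* blocklist error, for example */);
      return 0;
    }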
Fixes: https://tracker.ceph.com/issues/63009
Signed-off-by: Ramana Raja <rraja@redhat.com>
Zac Dover [Sat, 7 Oct 2023 21:43:43 +0000 (07:43 +1000)]
doc/architecture: repair RBD sentence
Improve an ambiguous sentence in doc/architecture.rst.
The problem with the original sentence is that, in one of its possible
readings, the phrasal verb "to provide with" is implicated.
Interpreted that way, the sentence seems to express the incorrect
idea that RBD furnishes block devices with snapshotting and cloning, as
though snapshotting and cloning were being delivered to the block
devices. In fact, snapshotting and cloning are simply features of RBD,
and are described on this page:
https://docs.ceph.com/en/quincy/rbd/rbd-snapshot/.
Adam King [Fri, 6 Oct 2023 15:20:57 +0000 (11:20 -0400)]
mgr/cephadm: fix upgrades with nvmeof
Currently, nvmeof is treated as if it uses a ceph image during
upgrades. This causes logging of messages like the following
(I've removed the nvmeof daemon id):
log [WRN] : Upgrade daemon: nvmeof.<id>: Cannot redeploy
nvmeof.<id> with a new image: Supported types are: mgr, mon,
crash, osd, mds, rgw, rbd-mirror, cephfs-mirror, ceph-exporter,
iscsi, nfs
Additionally, if you had set a custom image via the
mgr/cephadm/container_image_nvmeof setting, it would
be undone as part of the upgrade process.
Fixes: https://tracker.ceph.com/issues/63127
Signed-off-by: Adam King <adking@redhat.com>
rgw/lua/doc: support reloading lua packages on all RGWs
without requiring a restart of the RGWs
test instructions:
https://gist.github.com/yuvalif/95b8ed9ea73ab4591c59644a050e01e2
also use capitalized "Lua" in logs/doc