]> git.apps.os.sepia.ceph.com Git - ceph.git/log
ceph.git
4 years agocephadm: normalize unqualified repo digests to docker.io 40577/head
Sage Weil [Sat, 3 Apr 2021 13:14:00 +0000 (09:14 -0400)]
cephadm: normalize unqualified repo digests to docker.io

A RepoDigests returned by docker|podman image inspect can either include
the docker.io/ prefix or not.  For reasons that aren't entirely clear,
this may vary between hosts in a cluster.  However, ceph/ceph@sha256:abc...
is the same thing as docker.io/ceph/ceph@sha256:abc..., and should be
treated as such.  Otherwise, upgrade can get into a loop where it pulls
the image on a new host, finds the other variant of the repodigests,
sees no overlap, updates target_digests, and restarts.  (It will then
find the first variant again on the first host and loop.)

Avoid this by normalizing any docker.io digests by always including the
docker.io/ prefix.

Note that it is technically possible that this assumption is wrong: it
may be that the image that already exists on the local host is from a
different registry in registries.conf's unqualified-search-registries.
However, we don't know which, since this is a search list.  In practice,
it should be exceeding rare that an image that *we* are installing using
a fully-qualified image name will end up having an unqualified repodigest
in the local registry.  Hopefully!

Fixes: https://tracker.ceph.com/issues/50114
Signed-off-by: Sage Weil <sage@newdream.net>
4 years agomgr/cephadm/upgrade: normalize unqualified target image
Sage Weil [Tue, 6 Apr 2021 13:36:31 +0000 (09:36 -0400)]
mgr/cephadm/upgrade: normalize unqualified target image

If we get an unqualified target image, assume it's docker.io.  This
ensures that we're passing a fully-qualified target to docker|podman on
the various hosts and don't end up with something different based on the
per-host search path for unqualified image names.

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agoMerge pull request #40445 from liewegas/cephadm-upgrade-progress-version
Kefu Chai [Sun, 28 Mar 2021 12:08:51 +0000 (20:08 +0800)]
Merge pull request #40445 from liewegas/cephadm-upgrade-progress-version

mgr/cephadm: make upgrade progress bar mention target version, not repo digest

Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: Michael Fritch <mfritch@suse.com>
4 years agoMerge pull request #40447 from Daniel-Pivonka/cephadmdocapplyrgwflags
Kefu Chai [Sun, 28 Mar 2021 12:06:01 +0000 (20:06 +0800)]
Merge pull request #40447 from Daniel-Pivonka/cephadmdocapplyrgwflags

mgr/orchestrator: fix rgw realm and zone flags

Reviewed-by: Sage Weil <sage@redhat.com>
4 years agoMerge pull request #40379 from adk3798/no-overwrite
Kefu Chai [Sun, 28 Mar 2021 12:03:22 +0000 (20:03 +0800)]
Merge pull request #40379 from adk3798/no-overwrite

mgr/cephadm: --no-overwite flag for apply command

Reviewed-by: Juan Miguel Olmo <jolmomar@redhat.com>
Reviewed-by: Sage Weil <sage@redhat.com>
4 years agoMerge pull request #39969 from batrick/i49684
Kefu Chai [Sun, 28 Mar 2021 12:01:32 +0000 (20:01 +0800)]
Merge pull request #39969 from batrick/i49684

qa: wait for daemons to come up via cephadm

Reviewed-by: Sage Weil <sage@redhat.com>
4 years agoMerge pull request #40396 from maydaycc/wip-49968
Kefu Chai [Sun, 28 Mar 2021 11:59:53 +0000 (19:59 +0800)]
Merge pull request #40396 from maydaycc/wip-49968

mgr: attr_list reassigned before use in diskprediction_local module

Reviewed-by: Kefu Chai <kchai@redhat.com>
4 years agoMerge pull request #40421 from badone/wip-fix-typo-get_device_id
Kefu Chai [Sun, 28 Mar 2021 11:59:06 +0000 (19:59 +0800)]
Merge pull request #40421 from badone/wip-fix-typo-get_device_id

common/blkdev: Remove trailing single quote

Reviewed-by: Kefu Chai <kchai@redhat.com>
4 years agoMerge pull request #40450 from smithfarm/wip-50007-alt
Nathan Cutler [Sat, 27 Mar 2021 18:34:53 +0000 (19:34 +0100)]
Merge pull request #40450 from smithfarm/wip-50007-alt

rpm: drop extraneous explicit sqlite-libs runtime dependency

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
4 years agorpm: drop extraneous explicit sqlite-libs runtime dependency 40450/head
Nathan Cutler [Fri, 26 Mar 2021 10:03:34 +0000 (11:03 +0100)]
rpm: drop extraneous explicit sqlite-libs runtime dependency

Commit 75980798f19b8c11efd75ba4aae3e491d4c99f98 introduced a new package,
libcephsqlite, with a hard RPM dependency on a package "sqlite-libs" which
does not exist in openSUSE.

Since the runtime library dependencies of libcephsqlite are handled by RPM
transparently, this line is not needed.

Fixes: https://tracker.ceph.com/issues/50007
Signed-off-by: Nathan Cutler <ncutler@suse.com>
4 years agoMerge pull request #40442 from Thingee/update-foundation-page
Kefu Chai [Sat, 27 Mar 2021 08:57:43 +0000 (16:57 +0800)]
Merge pull request #40442 from Thingee/update-foundation-page

doc: Update Foundation page

Reviewed-by: Sage Weil <sage@redhat.com>
4 years agodoc: Updating Foundation membership page 40442/head
Mike Perez [Fri, 26 Mar 2021 15:52:49 +0000 (08:52 -0700)]
doc: Updating Foundation membership page

Signed-off-by: Mike Perez <thingee@gmail.com>
4 years agoMerge pull request #40408 from varshar16/wip-doc-update-ninja
Kefu Chai [Sat, 27 Mar 2021 07:45:38 +0000 (15:45 +0800)]
Merge pull request #40408 from varshar16/wip-doc-update-ninja

doc/install: update about ninja

Reviewed-by: Rishabh Dave <ridave@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
4 years agoMerge pull request #40345 from rzarzynski/wip-crimson-doc-waitstates-follow-up
Kefu Chai [Sat, 27 Mar 2021 03:28:33 +0000 (11:28 +0800)]
Merge pull request #40345 from rzarzynski/wip-crimson-doc-waitstates-follow-up

doc/crimson: improve wording about the pipeline / wait states.

Reviewed-by: Ronen Friedman <rfriedma@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
4 years agoMerge PR #38732 into master
Patrick Donnelly [Fri, 26 Mar 2021 22:13:07 +0000 (15:13 -0700)]
Merge PR #38732 into master

* refs/pull/38732/head:
client: fire the finish_cap_snap() after buffer being flushed
client: simplify the iterating code
client: remove unused _flushed_cap_snap
client: clean up the code

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
4 years agoMerge PR #40326 into master
Patrick Donnelly [Fri, 26 Mar 2021 22:11:28 +0000 (15:11 -0700)]
Merge PR #40326 into master

* refs/pull/40326/head:
client: rebuild bl to avoid too many vector(> IOV_MAX)

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Xiubo Li <xiubli@redhat.com>
4 years agoMerge PR #40394 into master
Sage Weil [Fri, 26 Mar 2021 20:41:31 +0000 (16:41 -0400)]
Merge PR #40394 into master

* refs/pull/40394/head:
osd: handle MOSDPGInfo
osd/PeeringState: init last_require_osd_release in ctor

Reviewed-by: Brad Hubbard <bhubbard@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
4 years agodoc/cephadm: fix rgw realm and zone flags 40447/head
Daniel Pivonka [Fri, 26 Mar 2021 19:13:18 +0000 (15:13 -0400)]
doc/cephadm: fix rgw realm and zone flags

Signed-off-by: Daniel Pivonka <dpivonka@redhat.com>
4 years agoMerge pull request #40415 from sseshasa/wip-qa-wait-for-clean-check
Neha Ojha [Fri, 26 Mar 2021 18:25:16 +0000 (11:25 -0700)]
Merge pull request #40415 from sseshasa/wip-qa-wait-for-clean-check

qa/tasks: Add wait_for_clean() check prior to initiating scrubbing.

Reviewed-by: Neha Ojha <nojha@redhat.com>
4 years agomgr/cephadm: make upgrade progress bar mention target version, not repo digest 40445/head
Sage Weil [Thu, 25 Mar 2021 20:05:02 +0000 (15:05 -0500)]
mgr/cephadm: make upgrade progress bar mention target version, not repo digest

The repo digest is super long and meaningless for a human user.  Instead,
use the target version (as soon as we know what it is--until then, use
the target image name).

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agoMerge pull request #40438 from idryomov/wip-crimson-reopen-session
Kefu Chai [Fri, 26 Mar 2021 16:11:54 +0000 (00:11 +0800)]
Merge pull request #40438 from idryomov/wip-crimson-reopen-session

crimson/monc: discard active/pending connections when reopening

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
4 years agocrimson/monc: discard active/pending connections when reopening 40438/head
Ilya Dryomov [Tue, 23 Mar 2021 09:40:18 +0000 (10:40 +0100)]
crimson/monc: discard active/pending connections when reopening

Otherwise pending_conns vector just keeps growing with redundant
connections all trying to reach the same set of monitors.  When one
of the attempts finally succeeds, _finish_auth() will pick the first
connection with a matching entity_addr_t, designate it as active and
close all others.  The match is very likely to be wrong and hence the
actual authenticated connection gets closed, leaving the OSD with
a bogus active_con and no monitor session.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
4 years agoMerge PR #40404 into master
Sage Weil [Fri, 26 Mar 2021 12:24:52 +0000 (08:24 -0400)]
Merge PR #40404 into master

* refs/pull/40404/head:
auth: require CEPHX_V2 by default

Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: Sage Weil <sage@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
4 years agoMerge PR #40321 into master
Sage Weil [Fri, 26 Mar 2021 12:19:06 +0000 (08:19 -0400)]
Merge PR #40321 into master

* refs/pull/40321/head:
cephadm: fix parsing of keepalived version (drop leading 'v')
cephadm: keepalived needs --cap-add=NET_RAW
cephadm: fix --cap-add=NET_ADMIN
cephadm: fix quoting for keepalived env var
mgr/cephadm: ha-rgw: use correct port

Reviewed-by: Michael Fritch <mfritch@suse.com>
4 years agoMerge PR #40015 into master
Sage Weil [Fri, 26 Mar 2021 12:18:35 +0000 (08:18 -0400)]
Merge PR #40015 into master

* refs/pull/40015/head:
cephadm: validate fsid during cephadm shell command

Reviewed-by: Sage Weil <sage@redhat.com>
Reviewed-by: Juan Miguel Olmo <jolmomar@redhat.com>
4 years agoMerge PR #40341 into master
Sage Weil [Fri, 26 Mar 2021 12:16:19 +0000 (08:16 -0400)]
Merge PR #40341 into master

* refs/pull/40341/head:
os/bluestore: separate omap per-pool vs per-pg alerts

Reviewed-by: Igor Fedotov <ifedotov@suse.com>
4 years agoMerge pull request #40429 from varshar16/wip-fix-fs-top-error
Venky Shankar [Fri, 26 Mar 2021 11:55:03 +0000 (17:25 +0530)]
Merge pull request #40429 from varshar16/wip-fix-fs-top-error

src/tools/cephfs-top: fix flake8 line too long error

Reviewed-by: Venky Shankar <vshankar@redhat.com>
4 years agoMerge pull request #40401 from lnussel/master
Nathan Cutler [Fri, 26 Mar 2021 10:23:55 +0000 (11:23 +0100)]
Merge pull request #40401 from lnussel/master

ceph.spec: prepare openSUSE usrmerge (boo#1029961)

4 years agosrc/tools/cephfs-top: fix flake8 line too long error 40429/head
Varsha Rao [Fri, 26 Mar 2021 09:38:33 +0000 (15:08 +0530)]
src/tools/cephfs-top: fix flake8 line too long error

Fixes: https://tracker.ceph.com/issues/50005
Signed-off-by: Varsha Rao <varao@redhat.com>
4 years agoMerge pull request #40170 from Rethan/fix-sync-pause-max-lat
Kefu Chai [Fri, 26 Mar 2021 07:08:25 +0000 (15:08 +0800)]
Merge pull request #40170 from Rethan/fix-sync-pause-max-lat

os/FileStore: fix sync_pause_max_lat which should not be inced

Reviewed-by: Kefu Chai <kchai@redhat.com>
4 years agoREADME.md: remove extra 'the' and format para related to ninja 40408/head
Varsha Rao [Thu, 25 Mar 2021 12:06:57 +0000 (17:36 +0530)]
README.md: remove extra 'the' and format para related to ninja

Signed-off-by: Varsha Rao <varao@redhat.com>
4 years agodoc/install: replace do_cmake note with link to ceph README
Varsha Rao [Fri, 26 Mar 2021 06:46:05 +0000 (12:16 +0530)]
doc/install: replace do_cmake note with link to ceph README

Ceph README.md doc already has the do_cmake note and other build related info.

Signed-off-by: Varsha Rao <varao@redhat.com>
4 years agodoc/install: Update about ninja
Varsha Rao [Thu, 25 Mar 2021 11:59:21 +0000 (17:29 +0530)]
doc/install: Update about ninja

'ninja' replaced 'make' in this PR[1]. This patch updates the doc about it.

[1] https://github.com/ceph/ceph/pull/39826

Signed-off-by: Varsha Rao <varao@redhat.com>
4 years agoMerge pull request #40334 from dvanders/pick
Kefu Chai [Fri, 26 Mar 2021 06:31:10 +0000 (14:31 +0800)]
Merge pull request #40334 from dvanders/pick

common/ipaddr: skip loopback interfaces named 'lo' and test it

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
4 years agoMerge pull request #40292 from tchaikov/wip-39191
Kefu Chai [Fri, 26 Mar 2021 06:28:34 +0000 (14:28 +0800)]
Merge pull request #40292 from tchaikov/wip-39191

debian,cmake,cephsqlite: hide non-public symbols

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
4 years agoMerge pull request #40171 from swegener/bluefs-assert-always-true
Kefu Chai [Fri, 26 Mar 2021 06:25:38 +0000 (14:25 +0800)]
Merge pull request #40171 from swegener/bluefs-assert-always-true

bluefs: fix assert always being true

Reviewed-by: Kefu Chai <kchai@redhat.com>
4 years agoMerge pull request #40304 from tchaikov/wip-telegraph-ann
Kefu Chai [Fri, 26 Mar 2021 06:24:56 +0000 (14:24 +0800)]
Merge pull request #40304 from tchaikov/wip-telegraph-ann

pybind/mgr/telegraf: add type annotations

Reviewed-by: Wido den Hollander <wido@widodh.nl>
4 years agoMerge pull request #40403 from vshankar/wip-cephfs-top-add-metrics
Venky Shankar [Fri, 26 Mar 2021 06:22:31 +0000 (11:52 +0530)]
Merge pull request #40403 from vshankar/wip-cephfs-top-add-metrics

cephfs-top: include additional metrics reported by `fs perf stats`

Reviewed-by: Venky Shankar <vshankar@redhat.com>
4 years agoMerge pull request #40167 from singuliere/wip-49781
Kefu Chai [Fri, 26 Mar 2021 06:13:18 +0000 (14:13 +0800)]
Merge pull request #40167 from singuliere/wip-49781

common/mempool: only fail tests if sharding is very bad

Reviewed-by: Adam Kupczyk <akupczyk@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
4 years agocommon/blkdev: Remove trailing single quote 40421/head
Brad Hubbard [Fri, 26 Mar 2021 04:58:04 +0000 (14:58 +1000)]
common/blkdev: Remove trailing single quote

Example output:

e1 collect_metadata :  no unique device id for : fallback method has no
model nor serial'

Signed-off-by: Brad Hubbard <bhubbard@redhat.com>
4 years agoMerge pull request #40331 from athanatos/sjust/wip-seatore-gc-refactor
Samuel Just [Fri, 26 Mar 2021 04:21:28 +0000 (21:21 -0700)]
Merge pull request #40331 from athanatos/sjust/wip-seatore-gc-refactor

crimson/seastore: refactor gc into concurrent task

Reviewed-by: Chunmei Liu <chunmei.liu@intel.com>
4 years agoMerge pull request #40336 from tchaikov/wip-cmake-fio
Kefu Chai [Fri, 26 Mar 2021 01:28:31 +0000 (09:28 +0800)]
Merge pull request #40336 from tchaikov/wip-cmake-fio

cmake: pass compile options by fio interface library

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
4 years agoMerge pull request #40323 from aclamk/wip-onode-resiliant-to-split-cache
Neha Ojha [Fri, 26 Mar 2021 01:05:28 +0000 (18:05 -0700)]
Merge pull request #40323 from aclamk/wip-onode-resiliant-to-split-cache

os/bluestore: Make Onode::put/get resiliant to split_cache

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Igor Fedotov <ifedotov@suse.com>
4 years agomgr/volumes: do not overwrite existant mds specs 40379/head
Adam King [Thu, 25 Mar 2021 20:29:03 +0000 (16:29 -0400)]
mgr/volumes: do not overwrite existant mds specs

Signed-off-by: Adam King <adking@redhat.com>
4 years agocrimson/os/seastore/segment_cleaner: bump reclaim_ratio limits 40331/head
Samuel Just [Thu, 25 Mar 2021 20:15:24 +0000 (13:15 -0700)]
crimson/os/seastore/segment_cleaner: bump reclaim_ratio limits

GC behavior still needs a ton of work, but this adjustment should
reduce gc activity for now until we have time to refine the params
and introduce the ability to write cold extents to a segment other
than the current journal segment.

Signed-off-by: Samuel Just <sjust@redhat.com>
4 years agocrimson/os/seastore/.../btree_range_pin: use ceph_assert
Samuel Just [Thu, 25 Mar 2021 20:12:11 +0000 (13:12 -0700)]
crimson/os/seastore/.../btree_range_pin: use ceph_assert

There's a performance overhead, but seastore is quite immature and
detecting these logic errors saves a ton of debugging time.

Signed-off-by: Samuel Just <sjust@redhat.com>
4 years agocrimson/os/seastore/.../block.cc: add error message on alloc error to segment_write
Samuel Just [Thu, 25 Mar 2021 18:56:40 +0000 (11:56 -0700)]
crimson/os/seastore/.../block.cc: add error message on alloc error to segment_write

Signed-off-by: Samuel Just <sjust@redhat.com>
4 years agomgr/cephadm: no-overwite flag for apply command
Adam King [Wed, 24 Mar 2021 18:29:28 +0000 (14:29 -0400)]
mgr/cephadm: no-overwite flag for apply command

no-overwrite flag makes it so cephadm will only apply the spec
if the spec does not already exist.

Signed-off-by: Adam King <adking@redhat.com>
4 years agoqa/tasks: Add wait_for_clean() check prior to initiating scrubbing. 40415/head
Sridhar Seshasayee [Thu, 25 Mar 2021 12:37:49 +0000 (18:07 +0530)]
qa/tasks: Add wait_for_clean() check prior to initiating scrubbing.

Fixes: https://tracker.ceph.com/issues/49983
Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>
4 years agoMerge pull request #39776 from alimaredia/wip-rgw-log-req-latency
Mark Kogan [Thu, 25 Mar 2021 16:16:39 +0000 (18:16 +0200)]
Merge pull request #39776 from alimaredia/wip-rgw-log-req-latency

rgw: add latency to the request summary of an op

4 years agocephadm: validate fsid during cephadm shell command 40015/head
Daniel Pivonka [Wed, 10 Mar 2021 23:01:35 +0000 (18:01 -0500)]
cephadm: validate fsid during cephadm shell command

Signed-off-by: Daniel Pivonka <dpivonka@redhat.com>
4 years agoMerge PR #40366 into master
Sage Weil [Thu, 25 Mar 2021 12:39:34 +0000 (08:39 -0400)]
Merge PR #40366 into master

* refs/pull/40366/head:
mgr/cephadm: Fix dashboard gateway configuration when using IPV6

Reviewed-by: Sage Weil <sage@redhat.com>
4 years agoMerge pull request #40395 from rhcs-dashboard/fix-49952-master
Kefu Chai [Thu, 25 Mar 2021 11:28:59 +0000 (19:28 +0800)]
Merge pull request #40395 from rhcs-dashboard/fix-49952-master

mgr/dashboard:Simplify some complex calculations in test_alerts.yml

Reviewed-by: Kefu Chai <kchai@redhat.com>
Reviewed-by: Avan Thakkar <athakkar@redhat.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>
Reviewed-by: Rishabh Dave <ridave@redhat.com>
4 years agoauth: require CEPHX_V2 by default 40404/head
Ilya Dryomov [Wed, 24 Mar 2021 15:23:44 +0000 (16:23 +0100)]
auth: require CEPHX_V2 by default

It's been almost three years and support is present in all relevant
clients.

From the security perspective, roughly the same could be achieved
with "ceph osd set-require-min-compat-client nautilus", but this is
more user friendly as the client gets ENOTSUP instead of spinning on
"feature set mismatch" faults.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
4 years agocephfs-top: include additional metrics reported by `fs perf stats`. 40403/head
Venky Shankar [Thu, 25 Mar 2021 09:40:28 +0000 (05:40 -0400)]
cephfs-top: include additional metrics reported by `fs perf stats`.

Without this, `cephfs-top` hits an exception since the additional
metrics keys were not configured.

Also, include a validation suring selftest that checks if `ceph
fs perf stats` metrics match what cephfs-top is configured to
report.

Fixes: http://tracker.ceph.com/issues/49974
Signed-off-by: Venky Shankar <vshankar@redhat.com>
4 years agoMerge pull request #40400 from tchaikov/wip-ctest-jenkins
Kefu Chai [Thu, 25 Mar 2021 10:24:22 +0000 (18:24 +0800)]
Merge pull request #40400 from tchaikov/wip-ctest-jenkins

run-make-check.sh: let ctest generate XML output

Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
4 years agoMerge pull request #40327 from racpatel/wip-cephfs-top-refresh-interval
Venky Shankar [Thu, 25 Mar 2021 09:46:39 +0000 (15:16 +0530)]
Merge pull request #40327 from racpatel/wip-cephfs-top-refresh-interval

cephfs-top: allow configurable stats refresh interval

4 years agoMerge pull request #40210 from joscollin/wip-B49736-missing-client-metadata
Venky Shankar [Thu, 25 Mar 2021 09:45:01 +0000 (15:15 +0530)]
Merge pull request #40210 from joscollin/wip-B49736-missing-client-metadata

cephfs-top: be resilient to missing client metadata keys

4 years agoceph.spec: prepare openSUSE usrmerge (boo#1029961) 40401/head
Ludwig Nussel [Thu, 25 Mar 2021 09:32:21 +0000 (10:32 +0100)]
ceph.spec: prepare openSUSE usrmerge (boo#1029961)

The compat symlink in /sbin is no longer required and actually in
the way in the usrmerge case.

Signed-off-by: Ludwig Nussel <ludwig.nussel@suse.de>
4 years agorun-make-check.sh: let ctest generate XML output 40400/head
Kefu Chai [Thu, 25 Mar 2021 09:08:48 +0000 (17:08 +0800)]
run-make-check.sh: let ctest generate XML output

to enable XUnit plugin of jenkins to consume the ctest output and
publish it in the dashboard, we need to

* let ctest generate XML output instead of plain text output
* do not fail the test if any test case fails. this allows the publisher
  to do its job by checking the XML output.
* prevent ctest from compressing the output. see
  https://issues.jenkins.io/browse/JENKINS-21737

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agomgr/dashboard:Simplify some complex calculations in test_alerts.yml 40395/head
Aashish Sharma [Thu, 25 Mar 2021 05:55:37 +0000 (11:25 +0530)]
mgr/dashboard:Simplify some complex calculations in test_alerts.yml

run-promtool-unittests is failing with difference in floating point values in some complex calculations. This PR intends to simplify those calculations and fix this issue.

Fixes: https://tracker.ceph.com/issues/49952
Signed-off-by: Aashish Sharma <aasharma@redhat.com>
4 years agoosd: handle MOSDPGInfo 40394/head
Kefu Chai [Thu, 25 Mar 2021 04:33:50 +0000 (12:33 +0800)]
osd: handle MOSDPGInfo

this change partially reverts b37e959238e75690d38873744515fa54413f4393,
which introduced a regression where we fail to handle MOSDPGInfo
messages sent by osd before aba13c7661bf8a62f2879513512053dc9857c003
and/or f7130db7a9fa85e475f14e4feed0097bf2d0b964.

Fixes: https://tracker.ceph.com/issues/49963
Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agoosd/PeeringState: init last_require_osd_release in ctor
Kefu Chai [Thu, 25 Mar 2021 04:04:37 +0000 (12:04 +0800)]
osd/PeeringState: init last_require_osd_release in ctor

so we can use the latest osdmap to tell what is the required osd release in
*current* cluster before consuming a new osdmap which is fed to
PeeringState only if we advance to a new map.

a osd shares its pg_info using MOSDPGInfo2 or MOSDPGInfo according to
PeeringState::last_require_osd_release, this happens before a new osdmap
is fully consumed.

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agomgr: attr_list reassigned before use in diskprediction_local module 40396/head
wanwencong [Thu, 25 Mar 2021 03:34:24 +0000 (11:34 +0800)]
mgr: attr_list reassigned before use in diskprediction_local module

According to the document description, attr_list should be intersection
set of all disk days.

Fixes: https://tracker.ceph.com/issues/49968
Signed-off-by: wencong wan <wanwc@chinatelecom.cn>
4 years agoMerge pull request #40377 from tchaikov/wip-doc-fixes
Kefu Chai [Thu, 25 Mar 2021 02:39:14 +0000 (10:39 +0800)]
Merge pull request #40377 from tchaikov/wip-doc-fixes

doc/man: fix formatting

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
4 years agoMerge pull request #38044 from dvanders/dvanders_buffered
Neha Ojha [Thu, 25 Mar 2021 00:45:17 +0000 (17:45 -0700)]
Merge pull request #38044 from dvanders/dvanders_buffered

common/options: bluefs_buffered_io=true by default

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
4 years agoMerge pull request #40380 from adamemerson/wip-num-rados-handles-die-die-die
Ali Maredia [Wed, 24 Mar 2021 22:39:35 +0000 (18:39 -0400)]
Merge pull request #40380 from adamemerson/wip-num-rados-handles-die-die-die

Die, `num_rados_handles`, die.

Reviewed-by: Ali Maredia <amaredia@redhat.com>
4 years agocephadm: fix parsing of keepalived version (drop leading 'v') 40321/head
Sage Weil [Wed, 24 Mar 2021 16:25:08 +0000 (12:25 -0400)]
cephadm: fix parsing of keepalived version (drop leading 'v')

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agocephadm: keepalived needs --cap-add=NET_RAW
Sage Weil [Wed, 24 Mar 2021 16:06:48 +0000 (12:06 -0400)]
cephadm: keepalived needs --cap-add=NET_RAW

This makes

Mar 24 12:00:32 dael conmon[3969650]: Wed Mar 24 16:00:32 2021: cant open raw socket. errno=1

go away and allows it to enter the MASTER state.

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agocephadm: fix --cap-add=NET_ADMIN
Sage Weil [Wed, 24 Mar 2021 15:58:34 +0000 (11:58 -0400)]
cephadm: fix --cap-add=NET_ADMIN

Podman wants the = sign.  This aligns us with the other --cap-add user
(SYS_PTRACE), which uses =.

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agocephadm: fix quoting for keepalived env var
Sage Weil [Wed, 24 Mar 2021 15:57:45 +0000 (11:57 -0400)]
cephadm: fix quoting for keepalived env var

This was broken by 3ea514c5525cd21722ef3fe9b90363c21e483596

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agorgw: Remove spurious `num_rados_handles` option 40380/head
Adam C. Emerson [Wed, 24 Mar 2021 19:47:42 +0000 (15:47 -0400)]
rgw: Remove spurious `num_rados_handles` option

While any code that used the option is gone, it appears the option
itself was accidentally resurrected in various options.cc refactors
and transitions.

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
4 years agocrimson/os/seastore/segment_cleaner: introduce byte limit on rewrite_dirty
Samuel Just [Wed, 24 Mar 2021 18:53:13 +0000 (18:53 +0000)]
crimson/os/seastore/segment_cleaner: introduce byte limit on rewrite_dirty

Signed-off-by: Samuel Just <sjust@redhat.com>
4 years agodoc/install/windows-troubleshooting: fix typo 40377/head
Kefu Chai [Wed, 24 Mar 2021 15:10:44 +0000 (23:10 +0800)]
doc/install/windows-troubleshooting: fix typo

and tweak the format a little bit

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agodoc/man/8/ceph-volume: fix formatting
Kefu Chai [Wed, 24 Mar 2021 17:01:26 +0000 (01:01 +0800)]
doc/man/8/ceph-volume: fix formatting

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agoadmin/build-doc: build man page in proper directory
Kefu Chai [Wed, 24 Mar 2021 17:00:23 +0000 (01:00 +0800)]
admin/build-doc: build man page in proper directory

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agoMerge PR #40350 into master
Sage Weil [Wed, 24 Mar 2021 15:39:53 +0000 (11:39 -0400)]
Merge PR #40350 into master

* refs/pull/40350/head:
doc/cephadm: explicitly show host requirments in adding host section

Reviewed-by: Sage Weil <sage@redhat.com>
4 years agoMerge pull request #40235 from tchaikov/wip-cmake-test-fixture
Kefu Chai [Wed, 24 Mar 2021 15:20:26 +0000 (23:20 +0800)]
Merge pull request #40235 from tchaikov/wip-cmake-test-fixture

cmake: use fixture for preparing venv

Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
4 years agomgr/cephadm: Fix dashboard gateway configuration when using IPV6 40366/head
Juan Miguel Olmo Martínez [Wed, 24 Mar 2021 14:30:38 +0000 (15:30 +0100)]
mgr/cephadm: Fix dashboard gateway configuration when using IPV6

Fixes: https://tracker.ceph.com/issues/49957
Signed-off-by: Juan Miguel Olmo Martínez <jolmomar@redhat.com>
4 years agocmake: use fixture for preparing venv 40235/head
Kefu Chai [Fri, 19 Mar 2021 09:26:37 +0000 (17:26 +0800)]
cmake: use fixture for preparing venv

this change should allow us to decouple "ninja tests" from "ctest".
in other words, we can just run

ctest -R run-tox-python-common -V

without running "ninja tests" first. before this change

${name}-venv is added as a dependency of "tests" target.

after this change,

setup-venv-for-${name} is added as a test, which is in turn a test of
run-tox-${name}, so we can just

ctest -R run-tox-${name}

now for preparing the venv and then testing the tox test of ${name}.

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agoMerge pull request #40344 from zdover23/wip-doc-cephadm-index-rewrite-2021-Mar-23
zdover23 [Wed, 24 Mar 2021 14:09:01 +0000 (00:09 +1000)]
Merge pull request #40344 from zdover23/wip-doc-cephadm-index-rewrite-2021-Mar-23

doc/cephadm: rewrite "index.rst"

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
4 years agoMerge PR #40353 into master
Patrick Donnelly [Wed, 24 Mar 2021 14:07:31 +0000 (07:07 -0700)]
Merge PR #40353 into master

* refs/pull/40353/head:
cephsqlite: add julian day offset in milliseconds

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
4 years agoMerge pull request #40291 from tchaikov/wip-debian-cleanups
Kefu Chai [Wed, 24 Mar 2021 13:31:34 +0000 (21:31 +0800)]
Merge pull request #40291 from tchaikov/wip-debian-cleanups

debian: cleanups and exclude ceph_crypto from dh_shlibdep

Reviewed-by: Yuval Lifshitz <ylifshit@redhat.com>
4 years agoos/bluestore: separate omap per-pool vs per-pg alerts 40341/head
Sage Weil [Tue, 23 Mar 2021 16:56:59 +0000 (11:56 -0500)]
os/bluestore: separate omap per-pool vs per-pg alerts

Currently the health alert raised does not match the docs, and the docs
do not describe what the health alert indicates.

Octopus added per-pool omap storage.  This improves space accounting
and reporting.

Pacific added per-pg omap storage (object hash in key).  This speeds up
PG removal.

Separate everthing out into two distinct alerts raised from bluestore
and surfaced as health alerts, with corresponding config options to
disable, and update the docs accordingly.

Also update the fsck options for warn vs error, and raise separate
errors for the per-pg and per-pool cases.

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agotools/cephfs: fix missing keys in cephfs-top client_metadata 40210/head
Jos Collin [Wed, 17 Mar 2021 12:18:09 +0000 (17:48 +0530)]
tools/cephfs: fix missing keys in cephfs-top client_metadata

Fixes: https://tracker.ceph.com/issues/49736
Signed-off-by: Jos Collin <jcollin@redhat.com>
4 years agoMerge pull request #39683 from maydaycc/wip-49487
Kefu Chai [Wed, 24 Mar 2021 12:21:11 +0000 (20:21 +0800)]
Merge pull request #39683 from maydaycc/wip-49487

osd: fix scrub reschedule bug

Reviewed-by: Kefu Chai <kchai@redhat.com>
Reviewed-by: David Zafman <dzafman@redhat.com>
4 years agodebian/rules: add with_system_libs option 40291/head
Kefu Chai [Sun, 21 Mar 2021 15:04:49 +0000 (23:04 +0800)]
debian/rules: add with_system_libs option

so the maintainer can choose to use the packages shipped by distro.

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agodebian/control: remove cython from Build-Depends
Kefu Chai [Sun, 21 Mar 2021 14:55:37 +0000 (22:55 +0800)]
debian/control: remove cython from Build-Depends

as cython3 is enough. and we've dropped the python2 support.

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agodebian/rules: exclude ceph_crypto from dh_shlibdep
Kefu Chai [Sun, 21 Mar 2021 14:59:03 +0000 (22:59 +0800)]
debian/rules: exclude ceph_crypto from dh_shlibdep

as libceph_crypto* are plugins, and they are not self-contained. they
reference symbols offered by the executable loading them. dh_shlibdep
should not complain when checking them, so add them to the exclude list.

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agoMerge pull request #40346 from zdover23/wip-doc-cephadm-osd-service-2021-Mar-23-first
zdover23 [Wed, 24 Mar 2021 10:19:44 +0000 (20:19 +1000)]
Merge pull request #40346 from zdover23/wip-doc-cephadm-osd-service-2021-Mar-23-first

doc/cephadm: rewrite osd.rst - list devices

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
4 years agodoc/cephadm: rewrite "index.rst" 40344/head
Zac Dover [Tue, 23 Mar 2021 15:19:11 +0000 (01:19 +1000)]
doc/cephadm: rewrite "index.rst"

This PR rewrites the three paragraphs at the
front of the cephadm guide, increasing their
elegance and removing ambiguities.

Signed-off-by: Zac Dover <zac.dover@gmail.com>
4 years agoMerge pull request #40357 from ybwang0211/quota_doc
Kefu Chai [Wed, 24 Mar 2021 09:33:28 +0000 (17:33 +0800)]
Merge pull request #40357 from ybwang0211/quota_doc

doc: remove duplicated 'quota' parameter in URL

Reviewed-by: Kefu Chai <kchai@redhat.com>
4 years agocrimson/os/seastore/journal: add error output for submit_record erange
Samuel Just [Wed, 24 Mar 2021 09:00:19 +0000 (09:00 +0000)]
crimson/os/seastore/journal: add error output for submit_record erange

Signed-off-by: Samuel Just <sjust@redhat.com>
4 years agodebian/control: remove libsqlite3-mod-ceph from dep of ceph-mgr 40292/head
Kefu Chai [Sun, 21 Mar 2021 14:57:16 +0000 (22:57 +0800)]
debian/control: remove libsqlite3-mod-ceph from dep of ceph-mgr

as the subvar of ${shlibs:Depends} is able to take care of it

Fixes: https://tracker.ceph.com/issues/49899
Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agodoc: remove duplicated 'quota' parameter in URL 40357/head
wangyingbin [Wed, 24 Mar 2021 06:05:18 +0000 (14:05 +0800)]
doc: remove duplicated 'quota' parameter in URL

Only one "quota" parameter is needed in the URL to indicate that it is a quota request.

Signed-off-by: wangyingbin <ybwang0211@163.com>
4 years agocephfs-top: signal main thread to exit on SIGINT/SIGTERM 40327/head
Rachana Patel [Tue, 23 Mar 2021 08:47:34 +0000 (08:47 +0000)]
cephfs-top: signal main thread to exit on SIGINT/SIGTERM

Fixes: http://tracker.ceph.com/issues/49953
Signed-off-by: Rachana Patel <racpatel@redhat.com>
4 years agocephfs-top: allow configurable stats refresh interval
Rachana Patel [Tue, 23 Mar 2021 04:40:56 +0000 (04:40 +0000)]
cephfs-top: allow configurable stats refresh interval

Signed-off-by: Rachana Patel <racpatel@redhat.com>
4 years agocrimson/os/seastore/segment_cleaner: add more method comments
Samuel Just [Wed, 24 Mar 2021 05:40:39 +0000 (22:40 -0700)]
crimson/os/seastore/segment_cleaner: add more method comments

Signed-off-by: Samuel Just <sjust@redhat.com>
4 years agocrimson/os/seastore/segment_cleaner: don't consider journal segments reclaimable
Samuel Just [Tue, 23 Mar 2021 21:27:33 +0000 (14:27 -0700)]
crimson/os/seastore/segment_cleaner: don't consider journal segments reclaimable

Otherwise, we might end up looping trying to gc based on the reclaimable
metric but be unable to actually find a reclaimable segment.  Mainly a
problem in unit tests that artificially force replay prior rolling the
first journal segment.

Signed-off-by: Samuel Just <sjust@redhat.com>
4 years agocrimson/.../transaction_manager: wait_io in get_extent_if_cached
Samuel Just [Sun, 21 Mar 2021 02:18:18 +0000 (02:18 +0000)]
crimson/.../transaction_manager: wait_io in get_extent_if_cached

Exposing wait_io to TransactionManager directly would complicate
any future users.

Signed-off-by: Samuel Just <sjust@redhat.com>