]> git.apps.os.sepia.ceph.com Git - ceph.git/log
ceph.git
4 years agolibrbd/crypto: fix bad return checks from libcryptsetup 40705/head
Or Ozeri [Thu, 8 Apr 2021 17:13:49 +0000 (20:13 +0300)]
librbd/crypto: fix bad return checks from libcryptsetup

This commit fixes the return checks for libcryptsetup functions
that may return non-zero success codes.

Fixes: https://tracker.ceph.com/issues/50461
Signed-off-by: Or Ozeri <oro@il.ibm.com>
4 years agoMerge pull request #40272 from tchaikov/wip-install-dep-remove-existing-boost
Kefu Chai [Sun, 21 Mar 2021 05:43:24 +0000 (13:43 +0800)]
Merge pull request #40272 from tchaikov/wip-install-dep-remove-existing-boost

install-deps.sh: remove existing ceph-libboost of different version

Reviewed-by: David Galloway <dgallowa@redhat.com>
4 years agoMerge PR #40147 into master
Sage Weil [Sat, 20 Mar 2021 23:57:23 +0000 (19:57 -0400)]
Merge PR #40147 into master

* refs/pull/40147/head:
python-common: Validate characters in service_id for container names

Reviewed-by: Sebastian Wagner <swagner@suse.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
4 years agoMerge PR #40244 into master
Sage Weil [Sat, 20 Mar 2021 23:56:58 +0000 (19:56 -0400)]
Merge PR #40244 into master

* refs/pull/40244/head:
qa/suites/rados/cephadm/smoke-roleless: deploy additional daemon types

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
4 years agoMerge pull request #40266 from jdurgin/wip-release-notes-retry
David Galloway [Sat, 20 Mar 2021 18:58:42 +0000 (14:58 -0400)]
Merge pull request #40266 from jdurgin/wip-release-notes-retry

script/ceph-release-notes: add retries to pull request fetching

4 years agoMerge pull request #40271 from liu-chunmei/seastore_fix_segment_cleaner
Kefu Chai [Sat, 20 Mar 2021 14:34:53 +0000 (22:34 +0800)]
Merge pull request #40271 from liu-chunmei/seastore_fix_segment_cleaner

crimson/seastore: fix segment_cleaner bugs

Reviewed-by: Samuel Just <sjust@redhat.com>
4 years agoMerge PR #40219 into master
Sage Weil [Sat, 20 Mar 2021 13:17:26 +0000 (09:17 -0400)]
Merge PR #40219 into master

* refs/pull/40219/head:
mon/MgrStatMonitor: ignore MMgrReport from non-active mgr
mgr: tell monc when we get new servicemap, fsmap

Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: Mykola Golub <mgolub@suse.com>
Reviewed-by: Jason Dillaman <dillaman@redhat.com>
4 years agoMerge PR #40117 into master
Sage Weil [Sat, 20 Mar 2021 13:16:41 +0000 (09:16 -0400)]
Merge PR #40117 into master

* refs/pull/40117/head:
mgr/orchestrator: DG loads properly the unmanaged attribute

Reviewed-by: Sebastian Wagner <swagner@suse.com>
4 years agoMerge PR #40103 into master
Sage Weil [Sat, 20 Mar 2021 13:16:14 +0000 (09:16 -0400)]
Merge PR #40103 into master

* refs/pull/40103/head:
cephadm: fix a minor typo in logging message

Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: Juan Miguel Olmo <jolmomar@redhat.com>
4 years agoMerge PR #40220 into master
Sage Weil [Sat, 20 Mar 2021 13:15:58 +0000 (09:15 -0400)]
Merge PR #40220 into master

* refs/pull/40220/head:
mgr/cephadm: identify rgw, cepfs-mirror in servicemap
mgr/ServiceMap: adjust 'ceph -s' summary
rgw: register daemons in servicemap by gid; include id
cephadm: fix rbd-mirror auth name

Reviewed-by: Casey Bodley <cbodley@redhat.com>
4 years agoMerge PR #40222 into master
Sage Weil [Sat, 20 Mar 2021 13:14:31 +0000 (09:14 -0400)]
Merge PR #40222 into master

* refs/pull/40222/head:
mgr/orchestrator: remove image name field from 'orch ps' and 'orch ls'

Reviewed-by: Michael Fritch <mfritch@suse.com>
Reviewed-by: Sage Weil <sage@redhat.com>
4 years agoMerge PR #40224 into master
Sage Weil [Sat, 20 Mar 2021 13:14:21 +0000 (09:14 -0400)]
Merge PR #40224 into master

* refs/pull/40224/head:
qa/suites/rados/cephadm/dashboard: test on centos

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
4 years agoMerge PR #40241 into master
Sage Weil [Sat, 20 Mar 2021 13:14:10 +0000 (09:14 -0400)]
Merge PR #40241 into master

* refs/pull/40241/head:
cephadm: use debug verbosity during container exec

Reviewed-by: Adam King <adking@redhat.com>
4 years agoinstall-deps.sh: remove existing ceph-libboost of different version 40272/head
Kefu Chai [Sat, 20 Mar 2021 05:00:01 +0000 (13:00 +0800)]
install-deps.sh: remove existing ceph-libboost of different version

we install different versions of precompiled ceph-libboost packages
for different branches when building and testing them on ubuntu test
nodes. for instance,

- nautilus: v1.72
- octopus, pacific: v1.73

they share the same set of test nodes. and these ceph-libboost packages
conflict with each other, because they install files to the same places.

in order to avoid the confliction, we should uninstall existing packages
before installing a different version of ceph-libboost packages.

ceph-libboost${version}-dev is a package providing the shared headers of
boost library, so, in this change we check if it is installed before
returning or removing the existing packages.

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agocrimson/seastore: fix segment_cleaner bugs 40271/head
chunmei-liu [Sat, 20 Mar 2021 04:16:44 +0000 (21:16 -0700)]
crimson/seastore: fix segment_cleaner bugs

Signed-off-by: chunmei-liu <chunmei.liu@intel.com>
4 years agoMerge PR #40214 into master
Patrick Donnelly [Sat, 20 Mar 2021 03:11:47 +0000 (20:11 -0700)]
Merge PR #40214 into master

* refs/pull/40214/head:
mgr/volumes: Retain suid/guid bits in subvolume clone
pybind/cephfs: Add lchmod python binding
client/libcephfs: Add lchmod

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Ramana Raja <rraja@redhat.com>
4 years agoscript/ceph-release-notes: add retries to pull request fetching 40266/head
Josh Durgin [Sat, 20 Mar 2021 01:11:29 +0000 (21:11 -0400)]
script/ceph-release-notes: add retries to pull request fetching

API rate limits are easily hit without this for major releases.

Signed-off-by: Josh Durgin <jdurgin@redhat.com>
4 years agoMerge PR #40242 into master
Sage Weil [Fri, 19 Mar 2021 20:42:14 +0000 (16:42 -0400)]
Merge PR #40242 into master

* refs/pull/40242/head:
mgr/cephadm/upgrade: do not repeat crash message
mgr/cephadm/upgrade: a little less verbose
mgr/cephadm: don't log not-ok-to-stop at ERR level
mgr/cephadm: is presumed -> appears
mgr/cephadm: don't double-log ok-to-stop results
mgr/cephadm/upgrade: include upgrade progress in ceph -s

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
4 years agomgr/cephadm: identify rgw, cepfs-mirror in servicemap 40220/head
Sage Weil [Thu, 18 Mar 2021 21:27:08 +0000 (17:27 -0400)]
mgr/cephadm: identify rgw, cepfs-mirror in servicemap

Like rbd-mirror, cephfs-mirror and rgw daemons register under their gid.

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agomgr/ServiceMap: adjust 'ceph -s' summary
Sage Weil [Fri, 19 Mar 2021 12:21:18 +0000 (08:21 -0400)]
mgr/ServiceMap: adjust 'ceph -s' summary

- Do not list individual daemon ids as this won't scale for larger
  clusters
- Do not contemplate multile daemons of the same type that register with
  different "daemon_type" -- not until we actually have any that do that.
- Present counts by various groupings: distinct hosts and rgw zones to
  start.

  services:
    mon:           1 daemons, quorum a (age 4m)
    mgr:           x(active, since 3m)
    osd:           1 osds: 1 up (since 3m), 1 in (since 3m)
    cephfs-mirror: 1 daemon active (1 hosts)
    rbd-mirror:    2 daemons active (1 hosts)
    rgw:           2 daemons active (1 hosts, 1 zones)

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agoMerge pull request #40230 from tchaikov/wip-rgw-test-boost-asio
Kefu Chai [Fri, 19 Mar 2021 16:13:16 +0000 (00:13 +0800)]
Merge pull request #40230 from tchaikov/wip-rgw-test-boost-asio

cmake: define BOOST_ASIO_USE_TS_EXECUTOR_AS_DEFAULT for rgw tests

Reviewed-by: Casey Bodley <cbodley@redhat.com>
4 years agoqa/suites/rados/cephadm/smoke-roleless: deploy additional daemon types 40244/head
Sage Weil [Fri, 19 Mar 2021 15:57:58 +0000 (10:57 -0500)]
qa/suites/rados/cephadm/smoke-roleless: deploy additional daemon types

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agomgr/cephadm/upgrade: do not repeat crash message 40242/head
Sage Weil [Fri, 19 Mar 2021 14:46:09 +0000 (10:46 -0400)]
mgr/cephadm/upgrade: do not repeat crash message

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agomgr/cephadm/upgrade: a little less verbose
Sage Weil [Fri, 19 Mar 2021 14:44:19 +0000 (10:44 -0400)]
mgr/cephadm/upgrade: a little less verbose

The _do_upgrade() method runs a zillion times; try to report fewer
repetitive messages on every iteration.

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agocmake: define BOOST_ASIO_USE_TS_EXECUTOR_AS_DEFAULT for rgw tests 40230/head
Kefu Chai [Fri, 19 Mar 2021 04:46:17 +0000 (12:46 +0800)]
cmake: define BOOST_ASIO_USE_TS_EXECUTOR_AS_DEFAULT for rgw tests

otherwise unittest_rgw_iam_policy does not compile with boost v1.75

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agomgr/cephadm: don't log not-ok-to-stop at ERR level
Sage Weil [Fri, 19 Mar 2021 14:38:06 +0000 (10:38 -0400)]
mgr/cephadm: don't log not-ok-to-stop at ERR level

This is normal during the upgrade; INF is fine.

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agomgr/cephadm: is presumed -> appears
Sage Weil [Fri, 19 Mar 2021 14:37:37 +0000 (10:37 -0400)]
mgr/cephadm: is presumed -> appears

The old wording was weird.

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agomgr/cephadm: don't double-log ok-to-stop results
Sage Weil [Fri, 19 Mar 2021 14:37:16 +0000 (10:37 -0400)]
mgr/cephadm: don't double-log ok-to-stop results

The calling upgrade code also reports this.

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agomgr/cephadm/upgrade: include upgrade progress in ceph -s
Sage Weil [Fri, 19 Mar 2021 14:31:24 +0000 (10:31 -0400)]
mgr/cephadm/upgrade: include upgrade progress in ceph -s

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agoMerge PR #40218 into master
Sage Weil [Fri, 19 Mar 2021 14:21:20 +0000 (10:21 -0400)]
Merge PR #40218 into master

* refs/pull/40218/head:
cephadm: make default image the daily master build

Reviewed-by: Michael Fritch <mfritch@suse.com>
4 years agocephadm: use debug verbosity during container exec 40241/head
Michael Fritch [Thu, 18 Mar 2021 21:41:06 +0000 (15:41 -0600)]
cephadm: use debug verbosity during container exec

avoid failures from appearing on the consle when exec'ing within the
container during the `ls` command

Signed-off-by: Michael Fritch <mfritch@suse.com>
4 years agoMerge pull request #40232 from tchaikov/wip-rgw-drop-unused-var
Kefu Chai [Fri, 19 Mar 2021 14:05:55 +0000 (22:05 +0800)]
Merge pull request #40232 from tchaikov/wip-rgw-drop-unused-var

rgw/rgw_zone: drop unused variable

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>
4 years agoMerge pull request #40205 from tchaikov/wip-promtool-podman-docker
Kefu Chai [Fri, 19 Mar 2021 14:03:50 +0000 (22:03 +0800)]
Merge pull request #40205 from tchaikov/wip-promtool-podman-docker

test: run promtool test without docker on focal

Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Brad Hubbard <bhubbard@redhat.com>
Reviewed-by: Avan Thakkar <athakkar@redhat.com>
Reviewed-by: Aashish Sharma <aasharma@redhat.com>
4 years agoqa/suites/rados/cephadm/dashboard: test on centos 40224/head
Sage Weil [Fri, 19 Mar 2021 13:16:55 +0000 (08:16 -0500)]
qa/suites/rados/cephadm/dashboard: test on centos

Fixes: https://tracker.ceph.com/issues/49638
Signed-off-by: Sage Weil <sage@newdream.net>
4 years agorgw: register daemons in servicemap by gid; include id
Sage Weil [Fri, 19 Mar 2021 12:25:23 +0000 (08:25 -0400)]
rgw: register daemons in servicemap by gid; include id

Registering by gid allows multiple radosgw instances to share an auth
key/identity.  Including the id in the metadata allows them to still be
identified by name (even if not uniquely).

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agocmake: define BOOST_ASIO_USE_TS_EXECUTOR_AS_DEFAULT for rgw tests 40205/head
Kefu Chai [Fri, 19 Mar 2021 04:46:17 +0000 (12:46 +0800)]
cmake: define BOOST_ASIO_USE_TS_EXECUTOR_AS_DEFAULT for rgw tests

otherwise unittest_rbd_mirror does not compile with boost v1.75

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agotest: run promtool test without docker on ubuntu/focal
Kefu Chai [Fri, 19 Mar 2021 02:32:16 +0000 (10:32 +0800)]
test: run promtool test without docker on ubuntu/focal

before this change, we use docker for running promtools offered by
a docker image, but this is not efficient, and quite a few developers
do not want to use docker for running "make check". this change was
introduced by #39246, the reason was that, in Ceph's CI process, we
are using Ubuntu/Bionic for running "make check" jobs, but prometheus
packaged by Bionic does not offer the "test rules" command. so, to
address problem, we are using "dnanexus/promtool:2.9.2" docker image
for verifying monitoring/prometheus/alerts/test_alerts.yml.

after this change, we use prometheus packaged by debian derivatives
instead of pulling a docker image.

* debian/control: add prometheus as a "make check" dependency
* install-deps.sh: partially revert
  53a5816deda0874a3a37e131e9bc22d88bb2a588, as we don't need to
  pull docker or start docker service for using promtool anymore.
* cmake: check if promtool is capable of running "test rules"
  command, bail out if it is not.

see also: https://tracker.ceph.com/issues/49653

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agoinstall-deps.sh: install boost 1.75 on focal
Kefu Chai [Thu, 18 Mar 2021 11:50:58 +0000 (19:50 +0800)]
install-deps.sh: install boost 1.75 on focal

we bump boost on regular basis. let's take the opportunity of moving to
focal to use boost v1.75.

v1.73 was used before this change. since both boost 1.75 and boost 1.73
install some files at the same places, we need to remove boost 1.73
before installing boost 1.75.

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agocmake: adapt FindBoost.cmake to our needs
Kefu Chai [Thu, 18 Mar 2021 13:16:34 +0000 (21:16 +0800)]
cmake: adapt FindBoost.cmake to our needs

the vanilla FindBoost.cmake pulled from cmake has couple assumptions
which do not hold in our environment. so address them case by case.

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agocmake: add 1.75 to known versions
Kefu Chai [Thu, 18 Mar 2021 13:08:08 +0000 (21:08 +0800)]
cmake: add 1.75 to known versions

sync with
https://github.com/Kitware/CMake/blob/507710438d6033011cf980dc8a3c739b8bc1bb1c/Modules/FindBoost.cmake

for v1.75 support

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agoinstall-deps.sh: install libzbd on focal
Kefu Chai [Thu, 18 Mar 2021 11:43:06 +0000 (19:43 +0800)]
install-deps.sh: install libzbd on focal

WITH_ZBD is enabled for testing the build of zbd bluestore backend, and
we plan to migrate to Ubuntu/Focal for testing "make check", so need to
install libzbd when the distro version is focal.

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agoMerge PR #40200 into master
Sage Weil [Fri, 19 Mar 2021 12:31:56 +0000 (08:31 -0400)]
Merge PR #40200 into master

* refs/pull/40200/head:
mgr/cephadm: clean up misc messages
mgr/cephadm/configcheck: do not spam info every minute

Reviewed-by: Adam King <adking@redhat.com>
4 years agoMerge PR #40223 into master
Sage Weil [Fri, 19 Mar 2021 12:31:24 +0000 (08:31 -0400)]
Merge PR #40223 into master

* refs/pull/40223/head:
cephadm: prevent podman from breaking socket.getfqdn()

Reviewed-by: Daniel Pivonka <dpivonka@redhat.com>
4 years agoMerge pull request #40236 from tchaikov/wip-cbt-perf
Kefu Chai [Fri, 19 Mar 2021 12:10:33 +0000 (20:10 +0800)]
Merge pull request #40236 from tchaikov/wip-cbt-perf

script/run-cbt.sh: set kernel.perf_event_paranoid for running perf

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
4 years agoscript/run-make.sh: quote targets with double quote 40236/head
Kefu Chai [Fri, 19 Mar 2021 11:23:09 +0000 (19:23 +0800)]
script/run-make.sh: quote targets with double quote

in
ceph-build/ceph-perf-pull-requests/config/definitions/ceph-perf-pull-requests.yml,
we pass "vstart-base crimson-osd" as the targets argument, but the
build() function in ceph/src/script/run-make.sh fails to quote them, so
they are expanded into two argument of `test -n`. hence it breaks like

src/script/run-make.sh: line 124: test: vstart-base: binary operator expected
make will run with option(s) -j40
Unknown argument vstart-base
Unknown argument crimson-osd

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agoscript/run-cbt.sh: set kernel.perf_event_paranoid for running perf
Kefu Chai [Fri, 19 Mar 2021 11:04:03 +0000 (19:04 +0800)]
script/run-cbt.sh: set kernel.perf_event_paranoid for running perf

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agoMerge pull request #40233 from tchaikov/wip-make-check-aio-max
Kefu Chai [Fri, 19 Mar 2021 09:50:16 +0000 (17:50 +0800)]
Merge pull request #40233 from tchaikov/wip-make-check-aio-max

run-make-check.sh: increase fs.aio-max-nr

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
4 years agocmake: use --smp 1 --memory 256M to crimson tests 40233/head
Jenkins Build Slave User [Fri, 19 Mar 2021 08:32:59 +0000 (08:32 +0000)]
cmake: use --smp 1 --memory 256M to crimson tests

to reduce the resource usage when running tests

there is an exception though, as we want to test test_config.cc with
multiple reactors.

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agoMerge pull request #40229 from tchaikov/wip-dashboard-flake8
Kefu Chai [Fri, 19 Mar 2021 08:25:14 +0000 (16:25 +0800)]
Merge pull request #40229 from tchaikov/wip-dashboard-flake8

pybind/mgr/dashboard: bump flake8 to 3.9.0

Reviewed-by: Nizamudeen A <nia@redhat.com>
4 years agorun-make-check.sh: increase fs.aio-max-nr
Kefu Chai [Fri, 19 Mar 2021 08:18:23 +0000 (16:18 +0800)]
run-make-check.sh: increase fs.aio-max-nr

without this change the seastar based tests fail on host with 48 cores,
because the /proc/sys/fs/aio-nr used by the tests is greater than
1048576. if run-make-check.sh is used to launch the test, the default
job number is `$(nproc) / 2`, and the peak number of /proc/sys/fs/aio-nr
when running ctest was 3190848 when testing on the 48-core host.

so we need to increase fs.aio-max-nr accordingly to the available cores
on the host.

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agorgw/rgw_zone: drop unused variable 40232/head
Kefu Chai [Fri, 19 Mar 2021 06:52:06 +0000 (14:52 +0800)]
rgw/rgw_zone: drop unused variable

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agopybind/mgr/dashboard: remove "python_version >= 3' 40229/head
Kefu Chai [Fri, 19 Mar 2021 04:24:28 +0000 (12:24 +0800)]
pybind/mgr/dashboard: remove "python_version >= 3'

remove "python_version >= '3'" from requirements-lint.txt, as we've
dropped the Python2 support.

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agopybind/mgr/dashboard: bump flake8 to 3.9.0
Kefu Chai [Fri, 19 Mar 2021 04:05:45 +0000 (12:05 +0800)]
pybind/mgr/dashboard: bump flake8 to 3.9.0

to address the failure of

ERROR: Cannot install -r requirements-lint.txt (line 2) and -r requirements-lint.txt (line 8) because these package versions have conflicting dependencies.

The conflict is caused by:
    flake8 3.8.4 depends on pycodestyle<2.7.0 and >=2.6.0a1
    autopep8 1.5.6 depends on pycodestyle>=2.7.0

To fix this you could try to:
1. loosen the range of package versions you've specified
2. remove package versions to allow pip attempt to solve the dependency conflict

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agomgr/volumes: Retain suid/guid bits in subvolume clone 40214/head
Kotresh HR [Thu, 18 Mar 2021 12:54:44 +0000 (18:24 +0530)]
mgr/volumes: Retain suid/guid bits in subvolume clone

Fixes: https://tracker.ceph.com/issues/49882
Signed-off-by: Kotresh HR <khiremat@redhat.com>
4 years agopybind/cephfs: Add lchmod python binding
Kotresh HR [Thu, 18 Mar 2021 12:51:05 +0000 (18:21 +0530)]
pybind/cephfs: Add lchmod python binding

Fixes: https://tracker.ceph.com/issues/49882
Signed-off-by: Kotresh HR <khiremat@redhat.com>
4 years agoclient/libcephfs: Add lchmod
Kotresh HR [Thu, 18 Mar 2021 12:51:05 +0000 (18:21 +0530)]
client/libcephfs: Add lchmod

Fixes: https://tracker.ceph.com/issues/49882
Signed-off-by: Kotresh HR <khiremat@redhat.com>
4 years agoMerge pull request #40227 from neha-ojha/wip-message-cap-val
Neha Ojha [Fri, 19 Mar 2021 00:28:01 +0000 (17:28 -0700)]
Merge pull request #40227 from neha-ojha/wip-message-cap-val

qa/suites/rados/perf: set osd client message cap to 5000

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
4 years agoMerge pull request #40185 from ronen-fr/wip-ronenf-extra-scrub-assert
Neha Ojha [Thu, 18 Mar 2021 21:54:37 +0000 (14:54 -0700)]
Merge pull request #40185 from ronen-fr/wip-ronenf-extra-scrub-assert

osd: remove a ceph_assert() from a legitimate path

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Sam Just <sjust@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
4 years agoqa/suites/rados/perf: set osd client message cap to 5000 40227/head
Neha Ojha [Thu, 18 Mar 2021 21:28:52 +0000 (21:28 +0000)]
qa/suites/rados/perf: set osd client message cap to 5000

Related to https://tracker.ceph.com/issues/49894
Signed-off-by: Neha Ojha <nojha@redhat.com>
4 years agocephadm: fix rbd-mirror auth name
Sage Weil [Thu, 18 Mar 2021 20:30:52 +0000 (16:30 -0400)]
cephadm: fix rbd-mirror auth name

Broken by 8fa941b35d89db6a40f7d2912b69eadf40c5004c

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agoMerge PR #40048 into master
Sage Weil [Thu, 18 Mar 2021 20:11:38 +0000 (16:11 -0400)]
Merge PR #40048 into master

* refs/pull/40048/head:
mgr/cephadm: stop conflicting daemon when deploying to a specific port
mgr/cephadm: make DaemonPlacement print nicer
mgr/cephadm: fix --force remove comment
mgr/cephadm/schedule: choose an IP from a subnet list
mgr/cephadm: rgw: clean up config and config-key values on removal
mgr/cephadm: rgw: drop .crt extension when storing cert in config-key
mgr/cephadm/services: allow beast/civetweb to bind to a particular IP
python-common: add 'networks' property to ServiceSpec
mgr/cephadm/schedule: match placement ip only combination with port

Reviewed-by: Sebastian Wagner <swagner@suse.com>
Reviewed-by: Juan Miguel Olmo <jolmomar@redhat.com>
4 years agoMerge pull request #39139 from TRYTOBE8TME/wip-rgw-bucket-tests-separation-new
Yuval Lifshitz [Thu, 18 Mar 2021 18:33:00 +0000 (20:33 +0200)]
Merge pull request #39139 from TRYTOBE8TME/wip-rgw-bucket-tests-separation-new

Wip rgw bucket tests separation new

4 years agocephadm: prevent podman from breaking socket.getfqdn() 40223/head
Sage Weil [Thu, 18 Mar 2021 18:26:48 +0000 (14:26 -0400)]
cephadm: prevent podman from breaking socket.getfqdn()

socket.getfqdn() will return the reverse lookup for 127.0.1.1, which is
the last item listed for that IP in /etc/hosts.  Podman, by default, will
append the container name (ceph-$fsid-$name) to that line, which is not
a valid hostname, and not what we want the dashbaord to use for the URI
it advertises in the service map.

Pass --no-hosts to podman to disable this.

Docker does not appear to modify /etc/hosts by default--or, more
importantly, does not add the container name there.

Explicitly instruct podman (and docker) to add a

Fixes: https://tracker.ceph.com/issues/49890
Signed-off-by: Sage Weil <sage@newdream.net>
4 years agomgr/orchestrator: remove image name field from 'orch ps' and 'orch ls' 40222/head
Adam King [Thu, 18 Mar 2021 17:20:46 +0000 (13:20 -0400)]
mgr/orchestrator: remove image name field from 'orch ps' and 'orch ls'

Now that we're typically using the image digests the name isn't as helpful. We also
end up in scenarios where some images use tags for their name and others use the
digest so the image name comes out as "mix" in orch ls despite it being the same image.

Fixes: https://tracker.ceph.com/issues/47333
Signed-off-by: Adam King <adking@redhat.com>
4 years agomon/MgrStatMonitor: ignore MMgrReport from non-active mgr 40219/head
Sage Weil [Thu, 18 Mar 2021 16:45:48 +0000 (11:45 -0500)]
mon/MgrStatMonitor: ignore MMgrReport from non-active mgr

If it's not the active mgr, we should ignore it.

Since the mgr instance is best identified by the gid, add that to the
message.  (We can't use the source_addrs for the message since that is
the MgrStandby monc addr, not the active mgr addrs in the MgrMap.)

This fixes a problem where a just-demoted mgr report gets processed and a
new mgr gets a ServiceMap with an epoch >= its pending map.  (At least,
that is my theory!)

Fixes: https://tracker.ceph.com/issues/48022
Signed-off-by: Sage Weil <sage@newdream.net>
4 years agoMerge pull request #40199 from dillaman/wip-rbd-lockdep
Mykola Golub [Thu, 18 Mar 2021 16:46:13 +0000 (18:46 +0200)]
Merge pull request #40199 from dillaman/wip-rbd-lockdep

test: ignore failures to force-enable lockdep

Reviewed-by: Mykola Golub <mgolub@suse.com>
4 years agoMerge pull request #40194 from dillaman/wip-49848
Mykola Golub [Thu, 18 Mar 2021 16:44:53 +0000 (18:44 +0200)]
Merge pull request #40194 from dillaman/wip-49848

test/pybind/rbd: fixed functional change in encryption API

Reviewed-by: Mykola Golub <mgolub@suse.com>
4 years agoMerge pull request #40161 from sseshasa/wip-fix-wait-for-clean
Neha Ojha [Thu, 18 Mar 2021 16:24:31 +0000 (09:24 -0700)]
Merge pull request #40161 from sseshasa/wip-fix-wait-for-clean

qa/tasks: Add additional wait_for_clean() check in lost_unfound tasks.

Reviewed-by: Neha Ojha <nojha@redhat.com>
4 years agomgr: tell monc when we get new servicemap, fsmap
Sage Weil [Thu, 18 Mar 2021 15:40:42 +0000 (10:40 -0500)]
mgr: tell monc when we get new servicemap, fsmap

Otherwise, when we re-subscribe we'll request an old map again.  In the
case of the servicemap, that can lead to a failed assertion.

Fixes: https://tracker.ceph.com/issues/48022
Signed-off-by: Sage Weil <sage@newdream.net>
4 years agocephadm: make default image the daily master build 40218/head
Sage Weil [Thu, 18 Mar 2021 15:26:36 +0000 (10:26 -0500)]
cephadm: make default image the daily master build

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agoosd: remove a ceph_assert() from a legitimate path 40185/head
Ronen Friedman [Wed, 17 Mar 2021 15:21:10 +0000 (17:21 +0200)]
osd: remove a ceph_assert() from a legitimate path

on_replica_init() might be legitimately called twice,
if the replica was waiting for updates to complete
before servicing the request.

Fixes: https://tracker.ceph.com/issues/49867
Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
4 years agoMerge PR #40207 into master
Patrick Donnelly [Thu, 18 Mar 2021 13:46:03 +0000 (06:46 -0700)]
Merge PR #40207 into master

* refs/pull/40207/head:
doc: max_maps -> max_caps

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
4 years agoMerge pull request #40187 from ideepika/wip-tracing-intial-doc
zdover23 [Thu, 18 Mar 2021 11:45:39 +0000 (21:45 +1000)]
Merge pull request #40187 from ideepika/wip-tracing-intial-doc

dev/developer_guide: add jaegertracing intial developer documentation

Reviewed-by: Zac Dover <zac.dover@gmail.com>
4 years agodoc/dev/developer_guide: add jaegertracing intial developer documentation 40187/head
Deepika Upadhyay [Wed, 17 Mar 2021 16:05:50 +0000 (21:35 +0530)]
doc/dev/developer_guide: add jaegertracing intial developer documentation

Signed-off-by: Deepika Upadhyay <dupadhya@redhat.com>
4 years agoqa/tasks: Add additional wait_for_clean() check in lost_unfound tasks. 40161/head
Sridhar Seshasayee [Tue, 16 Mar 2021 19:48:40 +0000 (01:18 +0530)]
qa/tasks: Add additional wait_for_clean() check in lost_unfound tasks.

At the end of the lost_unfound tests add an additional wait_for_clean()
check to ensure that recoveries get enough time to complete before
proceeding and avoid failures down the line. For e.g. failure like
"Scrubbing terminated -- not all pgs were active and clean." is because
recoveries on the PGs did not get sufficient time to complete even though
they were bound to eventually complete.

Fixes: https://tracker.ceph.com/issues/49844
Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>
4 years agodoc: max_maps -> max_caps 40207/head
Dan van der Ster [Thu, 18 Mar 2021 07:05:49 +0000 (08:05 +0100)]
doc: max_maps -> max_caps

Signed-off-by: Dan van der Ster <daniel.vanderster@cern.ch>
4 years agoqa/rgw: notifications suite runs single job 39139/head
Casey Bodley [Mon, 1 Mar 2021 17:19:25 +0000 (12:19 -0500)]
qa/rgw: notifications suite runs single job

pin to the beast frontend, default bluestore, replicated pools, and run
against a random distro

Signed-off-by: Casey Bodley <cbodley@redhat.com>
4 years agoMerge pull request #40163 from ktdreyer/resource-agents-noarch
Kefu Chai [Thu, 18 Mar 2021 03:02:22 +0000 (11:02 +0800)]
Merge pull request #40163 from ktdreyer/resource-agents-noarch

rpm: ceph-resource-agents package is noarch

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
4 years agoMerge PR #40058 into master
Patrick Donnelly [Thu, 18 Mar 2021 02:41:09 +0000 (19:41 -0700)]
Merge PR #40058 into master

* refs/pull/40058/head:
doc: mds cap acquisition readdir throttle documentation

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
4 years agoMerge PR #40193 into master
Patrick Donnelly [Thu, 18 Mar 2021 02:31:43 +0000 (19:31 -0700)]
Merge PR #40193 into master

* refs/pull/40193/head:
ceph-debug-docker: podman build doesn't accept input via stdin

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
4 years agoMerge pull request #31514 from simon-rock/simon_work_fou
Brad Hubbard [Thu, 18 Mar 2021 02:30:26 +0000 (12:30 +1000)]
Merge pull request #31514 from simon-rock/simon_work_fou

osd:modify conf, timeout & suicide timeout, of workqueue at runtime to av…

Reviewed-by: Brad Hubbard <bhubbard@redhat.com>
4 years agomgr/cephadm: stop conflicting daemon when deploying to a specific port 40048/head
Sage Weil [Wed, 17 Mar 2021 19:39:15 +0000 (15:39 -0400)]
mgr/cephadm: stop conflicting daemon when deploying to a specific port

If we are deploying a daemon to bind to a specific port and there is
an existing daemon we are removing that also binds to that port, stop
it first.  Unless we are both binding to different IPs.

This resolves the case where daemons bind to * and we redeploy with a
subnet to bind to.  It would eventually converge before, but would
throw a bind error in the process and take longer.

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agomgr/cephadm: make DaemonPlacement print nicer
Sage Weil [Wed, 17 Mar 2021 19:38:57 +0000 (15:38 -0400)]
mgr/cephadm: make DaemonPlacement print nicer

'host(ip:port)' or 'host(*:port)' so we can show it to a user.

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agoMerge pull request #39911 from cyx1231st/wip-seastore-onode-tree-fix-cache
Samuel Just [Wed, 17 Mar 2021 20:43:32 +0000 (13:43 -0700)]
Merge pull request #39911 from cyx1231st/wip-seastore-onode-tree-fix-cache

crimson/onode-staged-tree: fix tree_cursor_t::Cursor to be aware of extent duplication

Reviewed-by: Kefu Chai <kchai@redhat.com>
Reviewed-by: Samuel Just <sjust@redhat.com>
4 years agoMerge PR #40160 into master
Sage Weil [Wed, 17 Mar 2021 19:50:50 +0000 (15:50 -0400)]
Merge PR #40160 into master

* refs/pull/40160/head:
qa/suites/rados/cephadm/orchestrator_cli: random-distro$ -> 0-random-distro$
qa/suites/rados/cephadm/smoke-roleless: distro -> 0-distro
qa/distros/podman: install kubic once per host, in parallel
qa/suites/fs/multiclient: use clients: not all: for pexec

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
4 years agomgr/cephadm: clean up misc messages 40200/head
Sage Weil [Wed, 17 Mar 2021 19:49:47 +0000 (15:49 -0400)]
mgr/cephadm: clean up misc messages

- join list with ' '
- key, not keyring
- -ing, not ': '

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agomgr/cephadm/configcheck: do not spam info every minute
Sage Weil [Wed, 17 Mar 2021 19:43:33 +0000 (15:43 -0400)]
mgr/cephadm/configcheck: do not spam info every minute

It doesn't make to spam INF every minute.  Reducing this to DBG means
it'll never be seen.  Just remove it.

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agoMerge pull request #39915 from CongMinYin/fix-vm-io-hang
Jason Dillaman [Wed, 17 Mar 2021 19:35:58 +0000 (15:35 -0400)]
Merge pull request #39915 from CongMinYin/fix-vm-io-hang

librbd/cache/pwl: set max size of continuous data

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
Reviewed-by: Jianpeng Ma <jianpeng.ma@intel.com>
4 years agotest: ignore failures to force-enable lockdep 40199/head
Jason Dillaman [Wed, 17 Mar 2021 19:29:37 +0000 (15:29 -0400)]
test: ignore failures to force-enable lockdep

PR #40062 tweaked the behavior of lockdep to compile it out
of the code entirely for release builds. This fixes several
gtests where lockdep was force-enabled.

Signed-off-by: Jason Dillaman <dillaman@redhat.com>
4 years agoMerge pull request #40175 from idryomov/wip-krbd-blkroset-5.12
Ilya Dryomov [Wed, 17 Mar 2021 18:44:14 +0000 (19:44 +0100)]
Merge pull request #40175 from idryomov/wip-krbd-blkroset-5.12

qa: krbd_blkroset.t: update for separate hw and user read-only flags

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
4 years agomgr/cephadm: fix --force remove comment
Sage Weil [Wed, 17 Mar 2021 18:42:34 +0000 (14:42 -0400)]
mgr/cephadm: fix --force remove comment

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agotest/pybind/rbd: fixed functional change in encryption API 40194/head
Jason Dillaman [Wed, 17 Mar 2021 18:14:48 +0000 (14:14 -0400)]
test/pybind/rbd: fixed functional change in encryption API

The encryption format API now also implicitly loads the encryption
layer. This tweaks the tests to account for this functional
difference.

Fixes: https://tracker.ceph.com/issues/49848
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
4 years agoceph-debug-docker: podman build doesn't accept input via stdin 40193/head
Jeff Layton [Wed, 17 Mar 2021 17:12:09 +0000 (13:12 -0400)]
ceph-debug-docker: podman build doesn't accept input via stdin

podman on centos 8 at least doesn't accept the Dockerfile being fed to
it via stdin. Change that branch of the script to use the same method
that the ubuntu side does.

This gets the script working on senta03 for me.

Signed-off-by: Jeff Layton <jlayton@redhat.com>
4 years agoMerge pull request #40051 from strenuous-life/wip-cephadm-osd-exact-match-size
Sebastian Wagner [Wed, 17 Mar 2021 17:22:05 +0000 (18:22 +0100)]
Merge pull request #40051 from strenuous-life/wip-cephadm-osd-exact-match-size

mgr/cephadm: When device size contains the decimal, it can not match size exactly

Reviewed-by: Juan Miguel Olmo Martínez <jolmomar@redhat.com>
Reviewed-by: Sebastian Wagner <sebastian.wagner@suse.com>
4 years agoMerge pull request #39880 from adk3798/upgrade-status
Sebastian Wagner [Wed, 17 Mar 2021 17:08:23 +0000 (18:08 +0100)]
Merge pull request #39880 from adk3798/upgrade-status

mgr/cephadm: add info to 'ceph orch upgrade status' in cephadm

Reviewed-by: Juan Miguel Olmo Martínez <jolmomar@redhat.com>
Reviewed-by: Sebastian Wagner <sebastian.wagner@suse.com>
4 years agoMerge PR #40177 into master
Patrick Donnelly [Wed, 17 Mar 2021 16:00:56 +0000 (09:00 -0700)]
Merge PR #40177 into master

* refs/pull/40177/head:
doc: update Windows MSI link

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Jason Dillaman <dillaman@redhat.com>
4 years agoMerge pull request #40178 from idryomov/wip-rbd-map-sanity-check-fixup
Ilya Dryomov [Wed, 17 Mar 2021 15:43:26 +0000 (16:43 +0100)]
Merge pull request #40178 from idryomov/wip-rbd-map-sanity-check-fixup

krbd: check device node accessibility only if we actually mapped

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
4 years agoMerge pull request #39678 from yehudasa/wip-rgw-aws4-client
Casey Bodley [Wed, 17 Mar 2021 15:42:08 +0000 (11:42 -0400)]
Merge pull request #39678 from yehudasa/wip-rgw-aws4-client

rgw: awsv4 client support

Reviewed-by: Casey Bodley <cbodley@redhat.com>
4 years agoMerge pull request #36549 from markhpc/wip-bl-dynamic_alen
Kefu Chai [Wed, 17 Mar 2021 14:43:07 +0000 (22:43 +0800)]
Merge pull request #36549 from markhpc/wip-bl-dynamic_alen

common/buffer.cc: Implement dynamic alen in refill_append_space

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
4 years agoMerge pull request #39757 from aclamk/wip-qa-test-bluestore-reshard
Kefu Chai [Wed, 17 Mar 2021 14:41:34 +0000 (22:41 +0800)]
Merge pull request #39757 from aclamk/wip-qa-test-bluestore-reshard

qa: Add bluestore resharding test

Reviewed-by: Josh Durgin <jdurgin@redhat.com>