]> git.apps.os.sepia.ceph.com Git - ceph.git/log
ceph.git
4 years agodoc/cephfs/nfs: Add note about cephadm NFS-Ganesha daemon port 40305/head
Varsha Rao [Mon, 22 Mar 2021 11:51:53 +0000 (17:21 +0530)]
doc/cephfs/nfs: Add note about cephadm NFS-Ganesha daemon port

Fixes: https://tracker.ceph.com/issues/49921
Signed-off-by: Varsha Rao <varao@redhat.com>
4 years agoMerge pull request #40198 from jdurgin/wip-quick-fix-on-mount
Kefu Chai [Mon, 22 Mar 2021 07:23:48 +0000 (15:23 +0800)]
Merge pull request #40198 from jdurgin/wip-quick-fix-on-mount

common/options: turn off bluestore_fsck_quick_fix_on_mount by default

Reviewed-by: Igor Fedotov <ifedotov@suse.com>
Reviewed-by: Adam Kupczyk <akupczyk@redhat.com>
Reviewed-by: xie xingguo <xie.xingguo@zte.com.cn>
Reviewed-by: Neha Ojha <nojha@redhat.com>
4 years agoMerge pull request #40239 from tchaikov/wip-osd-silence-warning
Kefu Chai [Mon, 22 Mar 2021 07:22:16 +0000 (15:22 +0800)]
Merge pull request #40239 from tchaikov/wip-osd-silence-warning

osd/PeeringState: remove unused variable

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
4 years agoMerge pull request #40143 from neha-ojha/wip-objectstore-debug
Kefu Chai [Mon, 22 Mar 2021 07:21:20 +0000 (15:21 +0800)]
Merge pull request #40143 from neha-ojha/wip-objectstore-debug

qa/suites/rados/objectstore/backends: use default level for filestore

Reviewed-by: Kefu Chai <kchai@redhat.com>
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
4 years agoMerge pull request #40243 from liewegas/fix-pg-health-messages
Kefu Chai [Mon, 22 Mar 2021 07:20:21 +0000 (15:20 +0800)]
Merge pull request #40243 from liewegas/fix-pg-health-messages

mon/PGMap: nicely format pg acting in health messages

Reviewed-by: Kefu Chai <kchai@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
4 years agoMerge pull request #40238 from ygtzf/wip-blob-info
Kefu Chai [Mon, 22 Mar 2021 07:19:12 +0000 (15:19 +0800)]
Merge pull request #40238 from ygtzf/wip-blob-info

src/os: Remove redundant hexadecimal symbols from the log output

Reviewed-by: Kefu Chai <kchai@redhat.com>
4 years agoMerge pull request #40112 from lxbsz/lockdep2
Kefu Chai [Mon, 22 Mar 2021 07:15:42 +0000 (15:15 +0800)]
Merge pull request #40112 from lxbsz/lockdep2

BackTrace: reduce the array max number to 32

Reviewed-by: Kefu Chai <kchai@redhat.com>
4 years agoMerge pull request #39412 from tchaikov/wip-rocksdb
Kefu Chai [Mon, 22 Mar 2021 07:12:14 +0000 (15:12 +0800)]
Merge pull request #39412 from tchaikov/wip-rocksdb

bluestore: use string_view and bump rocksdb to v6.15.5

Reviewed-by: Igor Fedotov <ifedotov@suse.com>
4 years agoMerge pull request #40033 from tchaikov/wip-47380
Kefu Chai [Mon, 22 Mar 2021 07:11:23 +0000 (15:11 +0800)]
Merge pull request #40033 from tchaikov/wip-47380

mon/OSDMonitor: drop stale failure_info after a grace period

Reviewed-by: Sage Weil <sage@redhat.com>
4 years agoMerge pull request #40102 from tchaikov/wip-doc-fixes
Kefu Chai [Sun, 21 Mar 2021 14:04:02 +0000 (22:04 +0800)]
Merge pull request #40102 from tchaikov/wip-doc-fixes

doc: theme, cmake and formatting related fixes

Reviewed-by: Zac Dover <zac.dover@gmail.com>
4 years agoMerge pull request #40272 from tchaikov/wip-install-dep-remove-existing-boost
Kefu Chai [Sun, 21 Mar 2021 05:43:24 +0000 (13:43 +0800)]
Merge pull request #40272 from tchaikov/wip-install-dep-remove-existing-boost

install-deps.sh: remove existing ceph-libboost of different version

Reviewed-by: David Galloway <dgallowa@redhat.com>
4 years agoMerge PR #40147 into master
Sage Weil [Sat, 20 Mar 2021 23:57:23 +0000 (19:57 -0400)]
Merge PR #40147 into master

* refs/pull/40147/head:
python-common: Validate characters in service_id for container names

Reviewed-by: Sebastian Wagner <swagner@suse.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
4 years agoMerge PR #40244 into master
Sage Weil [Sat, 20 Mar 2021 23:56:58 +0000 (19:56 -0400)]
Merge PR #40244 into master

* refs/pull/40244/head:
qa/suites/rados/cephadm/smoke-roleless: deploy additional daemon types

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
4 years agoMerge pull request #40266 from jdurgin/wip-release-notes-retry
David Galloway [Sat, 20 Mar 2021 18:58:42 +0000 (14:58 -0400)]
Merge pull request #40266 from jdurgin/wip-release-notes-retry

script/ceph-release-notes: add retries to pull request fetching

4 years agoMerge pull request #40271 from liu-chunmei/seastore_fix_segment_cleaner
Kefu Chai [Sat, 20 Mar 2021 14:34:53 +0000 (22:34 +0800)]
Merge pull request #40271 from liu-chunmei/seastore_fix_segment_cleaner

crimson/seastore: fix segment_cleaner bugs

Reviewed-by: Samuel Just <sjust@redhat.com>
4 years agoMerge PR #40219 into master
Sage Weil [Sat, 20 Mar 2021 13:17:26 +0000 (09:17 -0400)]
Merge PR #40219 into master

* refs/pull/40219/head:
mon/MgrStatMonitor: ignore MMgrReport from non-active mgr
mgr: tell monc when we get new servicemap, fsmap

Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: Mykola Golub <mgolub@suse.com>
Reviewed-by: Jason Dillaman <dillaman@redhat.com>
4 years agoMerge PR #40117 into master
Sage Weil [Sat, 20 Mar 2021 13:16:41 +0000 (09:16 -0400)]
Merge PR #40117 into master

* refs/pull/40117/head:
mgr/orchestrator: DG loads properly the unmanaged attribute

Reviewed-by: Sebastian Wagner <swagner@suse.com>
4 years agoMerge PR #40103 into master
Sage Weil [Sat, 20 Mar 2021 13:16:14 +0000 (09:16 -0400)]
Merge PR #40103 into master

* refs/pull/40103/head:
cephadm: fix a minor typo in logging message

Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: Juan Miguel Olmo <jolmomar@redhat.com>
4 years agoMerge PR #40220 into master
Sage Weil [Sat, 20 Mar 2021 13:15:58 +0000 (09:15 -0400)]
Merge PR #40220 into master

* refs/pull/40220/head:
mgr/cephadm: identify rgw, cepfs-mirror in servicemap
mgr/ServiceMap: adjust 'ceph -s' summary
rgw: register daemons in servicemap by gid; include id
cephadm: fix rbd-mirror auth name

Reviewed-by: Casey Bodley <cbodley@redhat.com>
4 years agoMerge PR #40222 into master
Sage Weil [Sat, 20 Mar 2021 13:14:31 +0000 (09:14 -0400)]
Merge PR #40222 into master

* refs/pull/40222/head:
mgr/orchestrator: remove image name field from 'orch ps' and 'orch ls'

Reviewed-by: Michael Fritch <mfritch@suse.com>
Reviewed-by: Sage Weil <sage@redhat.com>
4 years agoMerge PR #40224 into master
Sage Weil [Sat, 20 Mar 2021 13:14:21 +0000 (09:14 -0400)]
Merge PR #40224 into master

* refs/pull/40224/head:
qa/suites/rados/cephadm/dashboard: test on centos

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
4 years agoMerge PR #40241 into master
Sage Weil [Sat, 20 Mar 2021 13:14:10 +0000 (09:14 -0400)]
Merge PR #40241 into master

* refs/pull/40241/head:
cephadm: use debug verbosity during container exec

Reviewed-by: Adam King <adking@redhat.com>
4 years agoinstall-deps.sh: remove existing ceph-libboost of different version 40272/head
Kefu Chai [Sat, 20 Mar 2021 05:00:01 +0000 (13:00 +0800)]
install-deps.sh: remove existing ceph-libboost of different version

we install different versions of precompiled ceph-libboost packages
for different branches when building and testing them on ubuntu test
nodes. for instance,

- nautilus: v1.72
- octopus, pacific: v1.73

they share the same set of test nodes. and these ceph-libboost packages
conflict with each other, because they install files to the same places.

in order to avoid the confliction, we should uninstall existing packages
before installing a different version of ceph-libboost packages.

ceph-libboost${version}-dev is a package providing the shared headers of
boost library, so, in this change we check if it is installed before
returning or removing the existing packages.

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agocrimson/seastore: fix segment_cleaner bugs 40271/head
chunmei-liu [Sat, 20 Mar 2021 04:16:44 +0000 (21:16 -0700)]
crimson/seastore: fix segment_cleaner bugs

Signed-off-by: chunmei-liu <chunmei.liu@intel.com>
4 years agoMerge PR #40214 into master
Patrick Donnelly [Sat, 20 Mar 2021 03:11:47 +0000 (20:11 -0700)]
Merge PR #40214 into master

* refs/pull/40214/head:
mgr/volumes: Retain suid/guid bits in subvolume clone
pybind/cephfs: Add lchmod python binding
client/libcephfs: Add lchmod

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Ramana Raja <rraja@redhat.com>
4 years agoscript/ceph-release-notes: add retries to pull request fetching 40266/head
Josh Durgin [Sat, 20 Mar 2021 01:11:29 +0000 (21:11 -0400)]
script/ceph-release-notes: add retries to pull request fetching

API rate limits are easily hit without this for major releases.

Signed-off-by: Josh Durgin <jdurgin@redhat.com>
4 years agoMerge PR #40242 into master
Sage Weil [Fri, 19 Mar 2021 20:42:14 +0000 (16:42 -0400)]
Merge PR #40242 into master

* refs/pull/40242/head:
mgr/cephadm/upgrade: do not repeat crash message
mgr/cephadm/upgrade: a little less verbose
mgr/cephadm: don't log not-ok-to-stop at ERR level
mgr/cephadm: is presumed -> appears
mgr/cephadm: don't double-log ok-to-stop results
mgr/cephadm/upgrade: include upgrade progress in ceph -s

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
4 years agomgr/cephadm: identify rgw, cepfs-mirror in servicemap 40220/head
Sage Weil [Thu, 18 Mar 2021 21:27:08 +0000 (17:27 -0400)]
mgr/cephadm: identify rgw, cepfs-mirror in servicemap

Like rbd-mirror, cephfs-mirror and rgw daemons register under their gid.

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agomgr/ServiceMap: adjust 'ceph -s' summary
Sage Weil [Fri, 19 Mar 2021 12:21:18 +0000 (08:21 -0400)]
mgr/ServiceMap: adjust 'ceph -s' summary

- Do not list individual daemon ids as this won't scale for larger
  clusters
- Do not contemplate multile daemons of the same type that register with
  different "daemon_type" -- not until we actually have any that do that.
- Present counts by various groupings: distinct hosts and rgw zones to
  start.

  services:
    mon:           1 daemons, quorum a (age 4m)
    mgr:           x(active, since 3m)
    osd:           1 osds: 1 up (since 3m), 1 in (since 3m)
    cephfs-mirror: 1 daemon active (1 hosts)
    rbd-mirror:    2 daemons active (1 hosts)
    rgw:           2 daemons active (1 hosts, 1 zones)

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agosrc/os: Remove redundant hexadecimal symbols from the log output 40238/head
Yao guotao [Fri, 19 Mar 2021 12:01:31 +0000 (12:01 +0000)]
src/os: Remove redundant hexadecimal symbols from the log output
 because they are already printed in the region_t struct.

Signed-off-by: Yao guotao <yaoguo_tao@163.com>
4 years agoMerge pull request #40230 from tchaikov/wip-rgw-test-boost-asio
Kefu Chai [Fri, 19 Mar 2021 16:13:16 +0000 (00:13 +0800)]
Merge pull request #40230 from tchaikov/wip-rgw-test-boost-asio

cmake: define BOOST_ASIO_USE_TS_EXECUTOR_AS_DEFAULT for rgw tests

Reviewed-by: Casey Bodley <cbodley@redhat.com>
4 years agoqa/suites/rados/cephadm/smoke-roleless: deploy additional daemon types 40244/head
Sage Weil [Fri, 19 Mar 2021 15:57:58 +0000 (10:57 -0500)]
qa/suites/rados/cephadm/smoke-roleless: deploy additional daemon types

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agoosd: drop entry in failure_pending when resetting stale peer 40033/head
Kefu Chai [Sun, 14 Mar 2021 03:56:59 +0000 (11:56 +0800)]
osd: drop entry in failure_pending when resetting stale peer

no need to keep it in the pending list anymore.

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agoosd: mark HeartbeatInfo::is_stale() and friends "const"
Kefu Chai [Sun, 14 Mar 2021 03:56:06 +0000 (11:56 +0800)]
osd: mark HeartbeatInfo::is_stale() and friends "const"

just for more const correctness.

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agomon/OSDMonitor: drop stale failure_info
Kefu Chai [Thu, 11 Mar 2021 13:13:13 +0000 (21:13 +0800)]
mon/OSDMonitor: drop stale failure_info

failure_info keeps strong references of the MOSDFailure messages
sent by osd or peon monitors, whenever monitor starts to handle
an MOSDFailure message, it registers it in its OpTracker. and
the failure report messageis unregistered when monitor acks them
by either canceling them or replying the reporters with a new
osdmap marking the target osd down. but if this does not happen,
the failure reports just pile up in OpTracker. and monitor considers
them as slow ops. and they are reported as SLOW_OPS health warning.

in theory, it does not take long to mark an unresponsive osd down if
we have enough reporters. but there is chance, that a reporter fails
to cancel its report before it reboots, and the monitor also fails
to collect enough reports and mark the target osd down. so the
target osd never gets an osdmap marking it down, so it won't send
an alive message to monitor to fix this.

in this change, we check for the stale failure info in tick(), and
simply drop the stale reports. so the messages can released and
marked "done".

Fixes: https://tracker.ceph.com/issues/47380
Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agomon/OSDMonitor: restructure OSDMonitor::check_failures() loop
Kefu Chai [Thu, 11 Mar 2021 10:28:18 +0000 (18:28 +0800)]
mon/OSDMonitor: restructure OSDMonitor::check_failures() loop

will add a trim failures call in the loop, which mutates failure_info,
while we are still iterating this map. so have to restructure the loop
a little bit.

Fixes: https://tracker.ceph.com/issues/47380
Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agomon/OSDMonitor: extract get_grace_time()
Kefu Chai [Thu, 11 Mar 2021 11:49:36 +0000 (19:49 +0800)]
mon/OSDMonitor: extract get_grace_time()

for better readability

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agomon/OSDMonitor: do not return old failure report when updating it
Kefu Chai [Thu, 11 Mar 2021 09:47:50 +0000 (17:47 +0800)]
mon/OSDMonitor: do not return old failure report when updating it

there is no need to return stale report, as the caller is not interested
in it.

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agomon/OSDMonitor: do not return no_reply() again
Kefu Chai [Thu, 11 Mar 2021 09:45:49 +0000 (17:45 +0800)]
mon/OSDMonitor: do not return no_reply() again

we always return "no_op" message to proxy monitor in
`OSDMonitor::prepare_failure()` at the very beginning of this method. so
no need to reply the peon again when discarding the failure report.

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agomon/Monitor: early return if routed request is not found
Kefu Chai [Thu, 11 Mar 2021 09:09:57 +0000 (17:09 +0800)]
mon/Monitor: early return if routed request is not found

* early return if routed request is not found in routed_requests.
  reduce the indent level, for better readability.
* do not look up the request twice. for better performance.
* use unique_ptr<> for holding the request, for better readability

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agomon/PGMap: nicely format pg acting in health messages 40243/head
Sage Weil [Fri, 19 Mar 2021 15:11:45 +0000 (10:11 -0500)]
mon/PGMap: nicely format pg acting in health messages

For 'pg ls' we show CRUSH_ITEM_NONE as "NONE" and not "2147483647". Do
the same for the health messages.

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agomgr/cephadm/upgrade: do not repeat crash message 40242/head
Sage Weil [Fri, 19 Mar 2021 14:46:09 +0000 (10:46 -0400)]
mgr/cephadm/upgrade: do not repeat crash message

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agomgr/cephadm/upgrade: a little less verbose
Sage Weil [Fri, 19 Mar 2021 14:44:19 +0000 (10:44 -0400)]
mgr/cephadm/upgrade: a little less verbose

The _do_upgrade() method runs a zillion times; try to report fewer
repetitive messages on every iteration.

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agocmake: define BOOST_ASIO_USE_TS_EXECUTOR_AS_DEFAULT for rgw tests 40230/head
Kefu Chai [Fri, 19 Mar 2021 04:46:17 +0000 (12:46 +0800)]
cmake: define BOOST_ASIO_USE_TS_EXECUTOR_AS_DEFAULT for rgw tests

otherwise unittest_rgw_iam_policy does not compile with boost v1.75

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agomgr/cephadm: don't log not-ok-to-stop at ERR level
Sage Weil [Fri, 19 Mar 2021 14:38:06 +0000 (10:38 -0400)]
mgr/cephadm: don't log not-ok-to-stop at ERR level

This is normal during the upgrade; INF is fine.

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agomgr/cephadm: is presumed -> appears
Sage Weil [Fri, 19 Mar 2021 14:37:37 +0000 (10:37 -0400)]
mgr/cephadm: is presumed -> appears

The old wording was weird.

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agomgr/cephadm: don't double-log ok-to-stop results
Sage Weil [Fri, 19 Mar 2021 14:37:16 +0000 (10:37 -0400)]
mgr/cephadm: don't double-log ok-to-stop results

The calling upgrade code also reports this.

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agomgr/cephadm/upgrade: include upgrade progress in ceph -s
Sage Weil [Fri, 19 Mar 2021 14:31:24 +0000 (10:31 -0400)]
mgr/cephadm/upgrade: include upgrade progress in ceph -s

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agoMerge PR #40218 into master
Sage Weil [Fri, 19 Mar 2021 14:21:20 +0000 (10:21 -0400)]
Merge PR #40218 into master

* refs/pull/40218/head:
cephadm: make default image the daily master build

Reviewed-by: Michael Fritch <mfritch@suse.com>
4 years agocephadm: use debug verbosity during container exec 40241/head
Michael Fritch [Thu, 18 Mar 2021 21:41:06 +0000 (15:41 -0600)]
cephadm: use debug verbosity during container exec

avoid failures from appearing on the consle when exec'ing within the
container during the `ls` command

Signed-off-by: Michael Fritch <mfritch@suse.com>
4 years agoMerge pull request #40232 from tchaikov/wip-rgw-drop-unused-var
Kefu Chai [Fri, 19 Mar 2021 14:05:55 +0000 (22:05 +0800)]
Merge pull request #40232 from tchaikov/wip-rgw-drop-unused-var

rgw/rgw_zone: drop unused variable

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>
4 years agoMerge pull request #40205 from tchaikov/wip-promtool-podman-docker
Kefu Chai [Fri, 19 Mar 2021 14:03:50 +0000 (22:03 +0800)]
Merge pull request #40205 from tchaikov/wip-promtool-podman-docker

test: run promtool test without docker on focal

Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Brad Hubbard <bhubbard@redhat.com>
Reviewed-by: Avan Thakkar <athakkar@redhat.com>
Reviewed-by: Aashish Sharma <aasharma@redhat.com>
4 years agoqa/suites/rados/cephadm/dashboard: test on centos 40224/head
Sage Weil [Fri, 19 Mar 2021 13:16:55 +0000 (08:16 -0500)]
qa/suites/rados/cephadm/dashboard: test on centos

Fixes: https://tracker.ceph.com/issues/49638
Signed-off-by: Sage Weil <sage@newdream.net>
4 years agorgw: register daemons in servicemap by gid; include id
Sage Weil [Fri, 19 Mar 2021 12:25:23 +0000 (08:25 -0400)]
rgw: register daemons in servicemap by gid; include id

Registering by gid allows multiple radosgw instances to share an auth
key/identity.  Including the id in the metadata allows them to still be
identified by name (even if not uniquely).

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agocmake: define BOOST_ASIO_USE_TS_EXECUTOR_AS_DEFAULT for rgw tests 40205/head
Kefu Chai [Fri, 19 Mar 2021 04:46:17 +0000 (12:46 +0800)]
cmake: define BOOST_ASIO_USE_TS_EXECUTOR_AS_DEFAULT for rgw tests

otherwise unittest_rbd_mirror does not compile with boost v1.75

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agotest: run promtool test without docker on ubuntu/focal
Kefu Chai [Fri, 19 Mar 2021 02:32:16 +0000 (10:32 +0800)]
test: run promtool test without docker on ubuntu/focal

before this change, we use docker for running promtools offered by
a docker image, but this is not efficient, and quite a few developers
do not want to use docker for running "make check". this change was
introduced by #39246, the reason was that, in Ceph's CI process, we
are using Ubuntu/Bionic for running "make check" jobs, but prometheus
packaged by Bionic does not offer the "test rules" command. so, to
address problem, we are using "dnanexus/promtool:2.9.2" docker image
for verifying monitoring/prometheus/alerts/test_alerts.yml.

after this change, we use prometheus packaged by debian derivatives
instead of pulling a docker image.

* debian/control: add prometheus as a "make check" dependency
* install-deps.sh: partially revert
  53a5816deda0874a3a37e131e9bc22d88bb2a588, as we don't need to
  pull docker or start docker service for using promtool anymore.
* cmake: check if promtool is capable of running "test rules"
  command, bail out if it is not.

see also: https://tracker.ceph.com/issues/49653

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agoinstall-deps.sh: install boost 1.75 on focal
Kefu Chai [Thu, 18 Mar 2021 11:50:58 +0000 (19:50 +0800)]
install-deps.sh: install boost 1.75 on focal

we bump boost on regular basis. let's take the opportunity of moving to
focal to use boost v1.75.

v1.73 was used before this change. since both boost 1.75 and boost 1.73
install some files at the same places, we need to remove boost 1.73
before installing boost 1.75.

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agocmake: adapt FindBoost.cmake to our needs
Kefu Chai [Thu, 18 Mar 2021 13:16:34 +0000 (21:16 +0800)]
cmake: adapt FindBoost.cmake to our needs

the vanilla FindBoost.cmake pulled from cmake has couple assumptions
which do not hold in our environment. so address them case by case.

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agocmake: add 1.75 to known versions
Kefu Chai [Thu, 18 Mar 2021 13:08:08 +0000 (21:08 +0800)]
cmake: add 1.75 to known versions

sync with
https://github.com/Kitware/CMake/blob/507710438d6033011cf980dc8a3c739b8bc1bb1c/Modules/FindBoost.cmake

for v1.75 support

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agoinstall-deps.sh: install libzbd on focal
Kefu Chai [Thu, 18 Mar 2021 11:43:06 +0000 (19:43 +0800)]
install-deps.sh: install libzbd on focal

WITH_ZBD is enabled for testing the build of zbd bluestore backend, and
we plan to migrate to Ubuntu/Focal for testing "make check", so need to
install libzbd when the distro version is focal.

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agoMerge PR #40200 into master
Sage Weil [Fri, 19 Mar 2021 12:31:56 +0000 (08:31 -0400)]
Merge PR #40200 into master

* refs/pull/40200/head:
mgr/cephadm: clean up misc messages
mgr/cephadm/configcheck: do not spam info every minute

Reviewed-by: Adam King <adking@redhat.com>
4 years agoMerge PR #40223 into master
Sage Weil [Fri, 19 Mar 2021 12:31:24 +0000 (08:31 -0400)]
Merge PR #40223 into master

* refs/pull/40223/head:
cephadm: prevent podman from breaking socket.getfqdn()

Reviewed-by: Daniel Pivonka <dpivonka@redhat.com>
4 years agoosd/PeeringState: remove unused variable 40239/head
Kefu Chai [Fri, 19 Mar 2021 12:20:32 +0000 (20:20 +0800)]
osd/PeeringState: remove unused variable

recovery_ec_pool_below_min_size was used to verify if the osd in clsuter
are octopus and up, but since we are now quincy and up, there is no need
to verify this. so drop it for better readability and for silencing
the -Wunused-variable warning in Release build.

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agoMerge pull request #40236 from tchaikov/wip-cbt-perf
Kefu Chai [Fri, 19 Mar 2021 12:10:33 +0000 (20:10 +0800)]
Merge pull request #40236 from tchaikov/wip-cbt-perf

script/run-cbt.sh: set kernel.perf_event_paranoid for running perf

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
4 years agoscript/run-make.sh: quote targets with double quote 40236/head
Kefu Chai [Fri, 19 Mar 2021 11:23:09 +0000 (19:23 +0800)]
script/run-make.sh: quote targets with double quote

in
ceph-build/ceph-perf-pull-requests/config/definitions/ceph-perf-pull-requests.yml,
we pass "vstart-base crimson-osd" as the targets argument, but the
build() function in ceph/src/script/run-make.sh fails to quote them, so
they are expanded into two argument of `test -n`. hence it breaks like

src/script/run-make.sh: line 124: test: vstart-base: binary operator expected
make will run with option(s) -j40
Unknown argument vstart-base
Unknown argument crimson-osd

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agoscript/run-cbt.sh: set kernel.perf_event_paranoid for running perf
Kefu Chai [Fri, 19 Mar 2021 11:04:03 +0000 (19:04 +0800)]
script/run-cbt.sh: set kernel.perf_event_paranoid for running perf

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agoMerge pull request #40233 from tchaikov/wip-make-check-aio-max
Kefu Chai [Fri, 19 Mar 2021 09:50:16 +0000 (17:50 +0800)]
Merge pull request #40233 from tchaikov/wip-make-check-aio-max

run-make-check.sh: increase fs.aio-max-nr

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
4 years agocmake: use --smp 1 --memory 256M to crimson tests 40233/head
Jenkins Build Slave User [Fri, 19 Mar 2021 08:32:59 +0000 (08:32 +0000)]
cmake: use --smp 1 --memory 256M to crimson tests

to reduce the resource usage when running tests

there is an exception though, as we want to test test_config.cc with
multiple reactors.

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agoMerge pull request #40229 from tchaikov/wip-dashboard-flake8
Kefu Chai [Fri, 19 Mar 2021 08:25:14 +0000 (16:25 +0800)]
Merge pull request #40229 from tchaikov/wip-dashboard-flake8

pybind/mgr/dashboard: bump flake8 to 3.9.0

Reviewed-by: Nizamudeen A <nia@redhat.com>
4 years agorun-make-check.sh: increase fs.aio-max-nr
Kefu Chai [Fri, 19 Mar 2021 08:18:23 +0000 (16:18 +0800)]
run-make-check.sh: increase fs.aio-max-nr

without this change the seastar based tests fail on host with 48 cores,
because the /proc/sys/fs/aio-nr used by the tests is greater than
1048576. if run-make-check.sh is used to launch the test, the default
job number is `$(nproc) / 2`, and the peak number of /proc/sys/fs/aio-nr
when running ctest was 3190848 when testing on the 48-core host.

so we need to increase fs.aio-max-nr accordingly to the available cores
on the host.

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agorgw/rgw_zone: drop unused variable 40232/head
Kefu Chai [Fri, 19 Mar 2021 06:52:06 +0000 (14:52 +0800)]
rgw/rgw_zone: drop unused variable

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agopybind/mgr/dashboard: remove "python_version >= 3' 40229/head
Kefu Chai [Fri, 19 Mar 2021 04:24:28 +0000 (12:24 +0800)]
pybind/mgr/dashboard: remove "python_version >= 3'

remove "python_version >= '3'" from requirements-lint.txt, as we've
dropped the Python2 support.

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agopybind/mgr/dashboard: bump flake8 to 3.9.0
Kefu Chai [Fri, 19 Mar 2021 04:05:45 +0000 (12:05 +0800)]
pybind/mgr/dashboard: bump flake8 to 3.9.0

to address the failure of

ERROR: Cannot install -r requirements-lint.txt (line 2) and -r requirements-lint.txt (line 8) because these package versions have conflicting dependencies.

The conflict is caused by:
    flake8 3.8.4 depends on pycodestyle<2.7.0 and >=2.6.0a1
    autopep8 1.5.6 depends on pycodestyle>=2.7.0

To fix this you could try to:
1. loosen the range of package versions you've specified
2. remove package versions to allow pip attempt to solve the dependency conflict

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agomgr/volumes: Retain suid/guid bits in subvolume clone 40214/head
Kotresh HR [Thu, 18 Mar 2021 12:54:44 +0000 (18:24 +0530)]
mgr/volumes: Retain suid/guid bits in subvolume clone

Fixes: https://tracker.ceph.com/issues/49882
Signed-off-by: Kotresh HR <khiremat@redhat.com>
4 years agopybind/cephfs: Add lchmod python binding
Kotresh HR [Thu, 18 Mar 2021 12:51:05 +0000 (18:21 +0530)]
pybind/cephfs: Add lchmod python binding

Fixes: https://tracker.ceph.com/issues/49882
Signed-off-by: Kotresh HR <khiremat@redhat.com>
4 years agoclient/libcephfs: Add lchmod
Kotresh HR [Thu, 18 Mar 2021 12:51:05 +0000 (18:21 +0530)]
client/libcephfs: Add lchmod

Fixes: https://tracker.ceph.com/issues/49882
Signed-off-by: Kotresh HR <khiremat@redhat.com>
4 years agoMerge pull request #40227 from neha-ojha/wip-message-cap-val
Neha Ojha [Fri, 19 Mar 2021 00:28:01 +0000 (17:28 -0700)]
Merge pull request #40227 from neha-ojha/wip-message-cap-val

qa/suites/rados/perf: set osd client message cap to 5000

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
4 years agoMerge pull request #40185 from ronen-fr/wip-ronenf-extra-scrub-assert
Neha Ojha [Thu, 18 Mar 2021 21:54:37 +0000 (14:54 -0700)]
Merge pull request #40185 from ronen-fr/wip-ronenf-extra-scrub-assert

osd: remove a ceph_assert() from a legitimate path

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Sam Just <sjust@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
4 years agoqa/suites/rados/perf: set osd client message cap to 5000 40227/head
Neha Ojha [Thu, 18 Mar 2021 21:28:52 +0000 (21:28 +0000)]
qa/suites/rados/perf: set osd client message cap to 5000

Related to https://tracker.ceph.com/issues/49894
Signed-off-by: Neha Ojha <nojha@redhat.com>
4 years agocephadm: fix rbd-mirror auth name
Sage Weil [Thu, 18 Mar 2021 20:30:52 +0000 (16:30 -0400)]
cephadm: fix rbd-mirror auth name

Broken by 8fa941b35d89db6a40f7d2912b69eadf40c5004c

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agoMerge PR #40048 into master
Sage Weil [Thu, 18 Mar 2021 20:11:38 +0000 (16:11 -0400)]
Merge PR #40048 into master

* refs/pull/40048/head:
mgr/cephadm: stop conflicting daemon when deploying to a specific port
mgr/cephadm: make DaemonPlacement print nicer
mgr/cephadm: fix --force remove comment
mgr/cephadm/schedule: choose an IP from a subnet list
mgr/cephadm: rgw: clean up config and config-key values on removal
mgr/cephadm: rgw: drop .crt extension when storing cert in config-key
mgr/cephadm/services: allow beast/civetweb to bind to a particular IP
python-common: add 'networks' property to ServiceSpec
mgr/cephadm/schedule: match placement ip only combination with port

Reviewed-by: Sebastian Wagner <swagner@suse.com>
Reviewed-by: Juan Miguel Olmo <jolmomar@redhat.com>
4 years agoMerge pull request #39139 from TRYTOBE8TME/wip-rgw-bucket-tests-separation-new
Yuval Lifshitz [Thu, 18 Mar 2021 18:33:00 +0000 (20:33 +0200)]
Merge pull request #39139 from TRYTOBE8TME/wip-rgw-bucket-tests-separation-new

Wip rgw bucket tests separation new

4 years agocephadm: prevent podman from breaking socket.getfqdn() 40223/head
Sage Weil [Thu, 18 Mar 2021 18:26:48 +0000 (14:26 -0400)]
cephadm: prevent podman from breaking socket.getfqdn()

socket.getfqdn() will return the reverse lookup for 127.0.1.1, which is
the last item listed for that IP in /etc/hosts.  Podman, by default, will
append the container name (ceph-$fsid-$name) to that line, which is not
a valid hostname, and not what we want the dashbaord to use for the URI
it advertises in the service map.

Pass --no-hosts to podman to disable this.

Docker does not appear to modify /etc/hosts by default--or, more
importantly, does not add the container name there.

Explicitly instruct podman (and docker) to add a

Fixes: https://tracker.ceph.com/issues/49890
Signed-off-by: Sage Weil <sage@newdream.net>
4 years agomgr/orchestrator: remove image name field from 'orch ps' and 'orch ls' 40222/head
Adam King [Thu, 18 Mar 2021 17:20:46 +0000 (13:20 -0400)]
mgr/orchestrator: remove image name field from 'orch ps' and 'orch ls'

Now that we're typically using the image digests the name isn't as helpful. We also
end up in scenarios where some images use tags for their name and others use the
digest so the image name comes out as "mix" in orch ls despite it being the same image.

Fixes: https://tracker.ceph.com/issues/47333
Signed-off-by: Adam King <adking@redhat.com>
4 years agomon/MgrStatMonitor: ignore MMgrReport from non-active mgr 40219/head
Sage Weil [Thu, 18 Mar 2021 16:45:48 +0000 (11:45 -0500)]
mon/MgrStatMonitor: ignore MMgrReport from non-active mgr

If it's not the active mgr, we should ignore it.

Since the mgr instance is best identified by the gid, add that to the
message.  (We can't use the source_addrs for the message since that is
the MgrStandby monc addr, not the active mgr addrs in the MgrMap.)

This fixes a problem where a just-demoted mgr report gets processed and a
new mgr gets a ServiceMap with an epoch >= its pending map.  (At least,
that is my theory!)

Fixes: https://tracker.ceph.com/issues/48022
Signed-off-by: Sage Weil <sage@newdream.net>
4 years agoMerge pull request #40199 from dillaman/wip-rbd-lockdep
Mykola Golub [Thu, 18 Mar 2021 16:46:13 +0000 (18:46 +0200)]
Merge pull request #40199 from dillaman/wip-rbd-lockdep

test: ignore failures to force-enable lockdep

Reviewed-by: Mykola Golub <mgolub@suse.com>
4 years agoMerge pull request #40194 from dillaman/wip-49848
Mykola Golub [Thu, 18 Mar 2021 16:44:53 +0000 (18:44 +0200)]
Merge pull request #40194 from dillaman/wip-49848

test/pybind/rbd: fixed functional change in encryption API

Reviewed-by: Mykola Golub <mgolub@suse.com>
4 years agoMerge pull request #40161 from sseshasa/wip-fix-wait-for-clean
Neha Ojha [Thu, 18 Mar 2021 16:24:31 +0000 (09:24 -0700)]
Merge pull request #40161 from sseshasa/wip-fix-wait-for-clean

qa/tasks: Add additional wait_for_clean() check in lost_unfound tasks.

Reviewed-by: Neha Ojha <nojha@redhat.com>
4 years agomgr: tell monc when we get new servicemap, fsmap
Sage Weil [Thu, 18 Mar 2021 15:40:42 +0000 (10:40 -0500)]
mgr: tell monc when we get new servicemap, fsmap

Otherwise, when we re-subscribe we'll request an old map again.  In the
case of the servicemap, that can lead to a failed assertion.

Fixes: https://tracker.ceph.com/issues/48022
Signed-off-by: Sage Weil <sage@newdream.net>
4 years agocephadm: make default image the daily master build 40218/head
Sage Weil [Thu, 18 Mar 2021 15:26:36 +0000 (10:26 -0500)]
cephadm: make default image the daily master build

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agoosd: remove a ceph_assert() from a legitimate path 40185/head
Ronen Friedman [Wed, 17 Mar 2021 15:21:10 +0000 (17:21 +0200)]
osd: remove a ceph_assert() from a legitimate path

on_replica_init() might be legitimately called twice,
if the replica was waiting for updates to complete
before servicing the request.

Fixes: https://tracker.ceph.com/issues/49867
Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
4 years agoMerge PR #40207 into master
Patrick Donnelly [Thu, 18 Mar 2021 13:46:03 +0000 (06:46 -0700)]
Merge PR #40207 into master

* refs/pull/40207/head:
doc: max_maps -> max_caps

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
4 years agoMerge pull request #40187 from ideepika/wip-tracing-intial-doc
zdover23 [Thu, 18 Mar 2021 11:45:39 +0000 (21:45 +1000)]
Merge pull request #40187 from ideepika/wip-tracing-intial-doc

dev/developer_guide: add jaegertracing intial developer documentation

Reviewed-by: Zac Dover <zac.dover@gmail.com>
4 years agodoc/dev/developer_guide: add jaegertracing intial developer documentation 40187/head
Deepika Upadhyay [Wed, 17 Mar 2021 16:05:50 +0000 (21:35 +0530)]
doc/dev/developer_guide: add jaegertracing intial developer documentation

Signed-off-by: Deepika Upadhyay <dupadhya@redhat.com>
4 years agoqa/tasks: Add additional wait_for_clean() check in lost_unfound tasks. 40161/head
Sridhar Seshasayee [Tue, 16 Mar 2021 19:48:40 +0000 (01:18 +0530)]
qa/tasks: Add additional wait_for_clean() check in lost_unfound tasks.

At the end of the lost_unfound tests add an additional wait_for_clean()
check to ensure that recoveries get enough time to complete before
proceeding and avoid failures down the line. For e.g. failure like
"Scrubbing terminated -- not all pgs were active and clean." is because
recoveries on the PGs did not get sufficient time to complete even though
they were bound to eventually complete.

Fixes: https://tracker.ceph.com/issues/49844
Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>
4 years agodoc: max_maps -> max_caps 40207/head
Dan van der Ster [Thu, 18 Mar 2021 07:05:49 +0000 (08:05 +0100)]
doc: max_maps -> max_caps

Signed-off-by: Dan van der Ster <daniel.vanderster@cern.ch>
4 years agoqa/rgw: notifications suite runs single job 39139/head
Casey Bodley [Mon, 1 Mar 2021 17:19:25 +0000 (12:19 -0500)]
qa/rgw: notifications suite runs single job

pin to the beast frontend, default bluestore, replicated pools, and run
against a random distro

Signed-off-by: Casey Bodley <cbodley@redhat.com>
4 years agoMerge pull request #40163 from ktdreyer/resource-agents-noarch
Kefu Chai [Thu, 18 Mar 2021 03:02:22 +0000 (11:02 +0800)]
Merge pull request #40163 from ktdreyer/resource-agents-noarch

rpm: ceph-resource-agents package is noarch

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
4 years agoMerge PR #40058 into master
Patrick Donnelly [Thu, 18 Mar 2021 02:41:09 +0000 (19:41 -0700)]
Merge PR #40058 into master

* refs/pull/40058/head:
doc: mds cap acquisition readdir throttle documentation

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
4 years agoMerge PR #40193 into master
Patrick Donnelly [Thu, 18 Mar 2021 02:31:43 +0000 (19:31 -0700)]
Merge PR #40193 into master

* refs/pull/40193/head:
ceph-debug-docker: podman build doesn't accept input via stdin

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>