]> git.apps.os.sepia.ceph.com Git - ceph-ci.git/log
ceph-ci.git
4 years agocephadm: add --single-host-defaults option to bootstrap
Sage Weil [Mon, 12 Apr 2021 15:51:33 +0000 (11:51 -0400)]
cephadm: add --single-host-defaults option to bootstrap

[global] osd_crush_choose_leaf_type = 0
[global] osd_pool_default_size = 2
[mgr] mgr_standby_modules = false

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agomgr/cephadm: allow mgr colo if mgr_standby_modules=false
Sage Weil [Mon, 12 Apr 2021 14:17:17 +0000 (10:17 -0400)]
mgr/cephadm: allow mgr colo if mgr_standby_modules=false

If the standby mgr daemons' modules aren't listening on any ports, then we
can schedule multiple on the same host.

Note that this may make 'orch ps' output misleading, as ports will be
reported for each mgr instance, but only one of them will actually be
listening at any one time (if they are behaving, at least!).  Treat a
mgr port check error as non-fatal.

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agomgr/MgrStandby: add mgr_standby_modules option
Sage Weil [Tue, 13 Apr 2021 14:11:31 +0000 (10:11 -0400)]
mgr/MgrStandby: add mgr_standby_modules option

Add config option to control whether the standby modules are started.
Default to true (no change in behavior), but if set to false the standby
mgr modules don't do the redirect business.

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agoqa/tasks/cephadm: ignore --keep-logs failure
Sage Weil [Fri, 16 Apr 2021 19:12:49 +0000 (14:12 -0500)]
qa/tasks/cephadm: ignore --keep-logs failure

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agoqa/tasks/cephadm: use yaml.dump_all()
Sage Weil [Fri, 16 Apr 2021 15:30:05 +0000 (10:30 -0500)]
qa/tasks/cephadm: use yaml.dump_all()

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agoqa/suites/rados/cephadm/smoke-*: use cephadm.wait_for_service
Sage Weil [Tue, 13 Apr 2021 16:53:24 +0000 (12:53 -0400)]
qa/suites/rados/cephadm/smoke-*: use cephadm.wait_for_service

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agoqa/tasks/cephadm: tear down clsuter before gathering logs
Sage Weil [Thu, 15 Apr 2021 22:55:00 +0000 (17:55 -0500)]
qa/tasks/cephadm: tear down clsuter before gathering logs

We dont' always stop all services, because teuthology doesn't know about
things it didn't start.  Use rm-cluster to tear things down, but do not
remove the logs themselves.  After we get logs, we'll clean up completely.

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agoqa/suites/rados/cephadm/smoke-roleless: test rgw-ingress
Sage Weil [Thu, 15 Apr 2021 22:45:21 +0000 (17:45 -0500)]
qa/suites/rados/cephadm/smoke-roleless: test rgw-ingress

Test this properly by downing each rgw and haproxy in turn and ensuring
that things remain up.

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agomgr/cephadm: remove virtual_ip check during scheduling
Sage Weil [Thu, 15 Apr 2021 22:22:26 +0000 (17:22 -0500)]
mgr/cephadm: remove virtual_ip check during scheduling

In 2f33c6ebbc8e2a6c3844a6921c857fb0796a1552 we made the keepalived task
set the necessary sysctls  to add a virtual_ip, so we don't need this
check anymore.

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agomgr/orchestrator: orch ls: leave off virtual_ip prefixlen
Sage Weil [Thu, 15 Apr 2021 21:59:09 +0000 (16:59 -0500)]
mgr/orchestrator: orch ls: leave off virtual_ip prefixlen

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agoqa/tasks/cephadm: add wait_for_service
Sage Weil [Tue, 13 Apr 2021 16:52:49 +0000 (12:52 -0400)]
qa/tasks/cephadm: add wait_for_service

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agoqa/tasks/cephadm: allow skip_monitor_stack=true
Sage Weil [Thu, 15 Apr 2021 19:01:48 +0000 (14:01 -0500)]
qa/tasks/cephadm: allow skip_monitor_stack=true

(Useful for roleless when we want to go faster)

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agoqa/tasks/cephadm: do subst_vip for cephadm.shell and .apply
Sage Weil [Thu, 15 Apr 2021 19:01:18 +0000 (14:01 -0500)]
qa/tasks/cephadm: do subst_vip for cephadm.shell and .apply

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agoqa/tasks/vip: add vip task to allocate virtual IPs
Sage Weil [Thu, 15 Apr 2021 19:00:57 +0000 (14:00 -0500)]
qa/tasks/vip: add vip task to allocate virtual IPs

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agoqa/suites/rados/cephadm/smoke-roleless: add rgw-ingress test case
Sage Weil [Tue, 13 Apr 2021 14:21:41 +0000 (10:21 -0400)]
qa/suites/rados/cephadm/smoke-roleless: add rgw-ingress test case

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agoqa/tasks/cephadm: shell: take 'all-roles' or 'all-hosts'
Sage Weil [Tue, 13 Apr 2021 14:58:09 +0000 (10:58 -0400)]
qa/tasks/cephadm: shell: take 'all-roles' or 'all-hosts'

'all' is ambiguous

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agoqa/tasks/cephadm: let cephadm.shell take string or list
Sage Weil [Mon, 12 Apr 2021 20:12:01 +0000 (16:12 -0400)]
qa/tasks/cephadm: let cephadm.shell take string or list

Make it a bit more forgiving.

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agoMerge PR #40870 into master
Sage Weil [Fri, 16 Apr 2021 12:14:28 +0000 (08:14 -0400)]
Merge PR #40870 into master

* refs/pull/40870/head:
auth/cephx: make KeyServer::build_session_auth_info() less confusing
auth/cephx: cap ticket validity by expiration of "next" key
auth/cephx: drop redundant KeyServerData::get_service_secret() overload

Reviewed-by: Sage Weil <sage@redhat.com>
4 years agoMerge PR #40711 into master
Sage Weil [Fri, 16 Apr 2021 12:01:15 +0000 (08:01 -0400)]
Merge PR #40711 into master

* refs/pull/40711/head:
mgr/cephadm: fix nfs-rgw stray daemon

Reviewed-by: Guillaume Abrioux <gabrioux@redhat.com>
Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: Juan Miguel Olmo <jolmomar@redhat.com>
Reviewed-by: Michael Fritch <mfritch@suse.com>
4 years agoMerge pull request #40883 from tchaikov/wip-option-auth_mon_ticket_ttl
Ilya Dryomov [Fri, 16 Apr 2021 08:43:59 +0000 (10:43 +0200)]
Merge pull request #40883 from tchaikov/wip-option-auth_mon_ticket_ttl

common/options/global: correct default of auth_mon_ticket_ttl

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
4 years agocommon/options/global: correct default of auth_mon_ticket_ttl
Kefu Chai [Fri, 16 Apr 2021 05:39:06 +0000 (13:39 +0800)]
common/options/global: correct default of auth_mon_ticket_ttl

in 5505fc0051a302fed46aa2ecaa8c8aa2a75b9e91, during rebasing against
master it was set to 12_hr by accident.

this setting was changed to 72_hr in
522a52e6c258932274f0753feb623ce008519216

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agoMerge pull request #40319 from cypherean/skip_ssh_patch
Kefu Chai [Fri, 16 Apr 2021 04:46:36 +0000 (12:46 +0800)]
Merge pull request #40319 from cypherean/skip_ssh_patch

mgr/cephadm: skip-ssh flag enables cephadm mgr module

Reviewed-by: Adam King <adking@redhat.com>
4 years agoMerge pull request #40858 from adk3798/upgrade-fail
Kefu Chai [Fri, 16 Apr 2021 04:44:48 +0000 (12:44 +0800)]
Merge pull request #40858 from adk3798/upgrade-fail

mgr/cephadm: report exception during upgrade in upgrade status

Reviewed-by: Michael Fritch <mfritch@suse.com>
Reviewed-by: Juan Miguel Olmo <jolmomar@redhat.com>
4 years agoMerge pull request #40837 from liewegas/cephadm-qa-shorten-thrash
Kefu Chai [Fri, 16 Apr 2021 04:42:05 +0000 (12:42 +0800)]
Merge pull request #40837 from liewegas/cephadm-qa-shorten-thrash

qa/suites/rados/thrash: shorten radosbench

Reviewed-by: Kefu Chai <kchai@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
4 years agoMerge pull request #40860 from liewegas/cephadm-cleanup-ha-rgw
Kefu Chai [Fri, 16 Apr 2021 04:41:05 +0000 (12:41 +0800)]
Merge pull request #40860 from liewegas/cephadm-cleanup-ha-rgw

mgr/cephadm: remove old haproxy and keepalived templates

Reviewed-by: Michael Fritch <mfritch@suse.com>
Reviewed-by: Adam King <adking@redhat.com>
4 years agoMerge pull request #40552 from Daniel-Pivonka/cephadmverifylistsinspecjson
Kefu Chai [Fri, 16 Apr 2021 04:30:11 +0000 (12:30 +0800)]
Merge pull request #40552 from Daniel-Pivonka/cephadmverifylistsinspecjson

mgr/orchestrator: validate lists in spec jsons

Reviewed-by: Kefu Chai <kchai@redhat.com>
Reviewed-by: Juan Miguel Olmo <jolmomar@redhat.com>
4 years agoMerge pull request #40841 from tchaikov/wip-yamlize-legacy-option
Kefu Chai [Fri, 16 Apr 2021 04:28:56 +0000 (12:28 +0800)]
Merge pull request #40841 from tchaikov/wip-yamlize-legacy-option

common: generate legacy_config_opts.h from .yaml.in files

Reviewed-by: Sage Weil <sage@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Casey Bodley <cbodley@redhat.com>
4 years agoMerge PR #40828 into master
Patrick Donnelly [Fri, 16 Apr 2021 04:08:21 +0000 (21:08 -0700)]
Merge PR #40828 into master

* refs/pull/40828/head:
mds: set fscrypt flag on create if xattr buffer has an encryption context

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Xiubo Li <xiubli@redhat.com>
4 years agoMerge PR #40741 into master
Patrick Donnelly [Fri, 16 Apr 2021 04:07:08 +0000 (21:07 -0700)]
Merge PR #40741 into master

* refs/pull/40741/head:
mgr/volumes: remove unused semicolon

Reviewed-by: Kefu Chai <kchai@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
4 years agoMerge PR #40539 into master
Patrick Donnelly [Fri, 16 Apr 2021 04:06:31 +0000 (21:06 -0700)]
Merge PR #40539 into master

* refs/pull/40539/head:
cephfs-top: set the cursor to be invisible
cephfs-top: self-adapt the display according the window size
cephfs-top: use the default window object from curses.wrapper()
cephfs-top: improve the output

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Venky Shankar <vshankar@redhat.com>
4 years agoMerge PR #40412 into master
Patrick Donnelly [Fri, 16 Apr 2021 04:05:51 +0000 (21:05 -0700)]
Merge PR #40412 into master

* refs/pull/40412/head:
vstart_runner: reuse code in LocalRemoteProcess

Reviewed-by: Xiubo Li <xiubli@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
4 years agoMerge PR #39660 into master
Patrick Donnelly [Fri, 16 Apr 2021 04:03:19 +0000 (21:03 -0700)]
Merge PR #39660 into master

* refs/pull/39660/head:
qa: Update the mdsmap schema in mgr/dashboard/test_health.py
doc: add lsflags command to Administrative Commands document
qa: test fs lsflags command
mon: add command to print fs flags
mds: print each flag value

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
4 years agoMerge pull request #40824 from batrick/i50304
Kefu Chai [Fri, 16 Apr 2021 02:10:16 +0000 (10:10 +0800)]
Merge pull request #40824 from batrick/i50304

pybind/mgr/devicehealth: fix command name

Reviewed-by: Kefu Chai <kchai@redhat.com>
Reviewed-by: Sage Weil <sage@redhat.com>
Reviewed-by: Varsha Rao <varao@redhat.com>
4 years agoMerge pull request #40829 from batrick/i50307
Kefu Chai [Fri, 16 Apr 2021 01:55:12 +0000 (09:55 +0800)]
Merge pull request #40829 from batrick/i50307

SimpleRADOSStriper: use debug_cephsqlite

Reviewed-by: Kefu Chai <kchai@redhat.com>
4 years agoMerge pull request #40859 from wjwithagen/wjw-fix-make-rook-optional
Kefu Chai [Fri, 16 Apr 2021 01:53:21 +0000 (09:53 +0800)]
Merge pull request #40859 from wjwithagen/wjw-fix-make-rook-optional

cmake: Add an option for enabling rook client in dashboard

Reviewed-by: Kefu Chai <kchai@redhat.com>
4 years agoMerge pull request #40876 from tchaikov/wip-top-bar-spacing
Kefu Chai [Fri, 16 Apr 2021 00:25:13 +0000 (08:25 +0800)]
Merge pull request #40876 from tchaikov/wip-top-bar-spacing

doc/_theme: increasing the padding in navigation bar

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
4 years agomgr/cephadm: fix nfs-rgw stray daemon
Daniel Pivonka [Thu, 8 Apr 2021 19:20:18 +0000 (15:20 -0400)]
mgr/cephadm: fix nfs-rgw stray daemon

nfs-rgw registers under a gid cephadm needs covert that to its known name during the stray daemon check

Signed-off-by: Daniel Pivonka <dpivonka@redhat.com>
4 years agoMerge pull request #40718 from CongMinYin/fix-dead-lock-in-pwl
Mykola Golub [Thu, 15 Apr 2021 18:05:00 +0000 (21:05 +0300)]
Merge pull request #40718 from CongMinYin/fix-dead-lock-in-pwl

librbd/cache/pwl: Fix dead lock issue when pwl initialization failed

Reviewed-by: Mykola Golub <mgolub@suse.com>
4 years agodoc/_theme: increasing the padding in navigation bar
Kefu Chai [Thu, 15 Apr 2021 16:52:25 +0000 (00:52 +0800)]
doc/_theme: increasing the padding in navigation bar

otherwise the "home-icon" and the right most "Edit on GitHub" are displayed
right next to the left and right borders of the top-bar.

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agoMerge pull request #40875 from batrick/i50378
Kefu Chai [Thu, 15 Apr 2021 16:25:57 +0000 (00:25 +0800)]
Merge pull request #40875 from batrick/i50378

test: bump DecayCounter.steady acceptable error

Reviewed-by: Kefu Chai <kchai@redhat.com>
4 years agoMerge pull request #40847 from tchaikov/wip-doc-build
Kefu Chai [Thu, 15 Apr 2021 15:06:15 +0000 (23:06 +0800)]
Merge pull request #40847 from tchaikov/wip-doc-build

admin/build-doc: use htmldir builder for livehtml

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
4 years agoMerge PR #40845 into master
Patrick Donnelly [Thu, 15 Apr 2021 14:30:11 +0000 (07:30 -0700)]
Merge PR #40845 into master

* refs/pull/40845/head:
systemd: remove `ProtectClock=true` for `ceph-osd@.service`

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
4 years agoMerge pull request #40843 from tchaikov/wip-cmake-rgw-civetweb.h
Kefu Chai [Thu, 15 Apr 2021 14:08:43 +0000 (22:08 +0800)]
Merge pull request #40843 from tchaikov/wip-cmake-rgw-civetweb.h

cmake: update civetweb.h on demand

Reviewed-by: Casey Bodley <cbodley@redhat.com>
4 years agotest: bump DecayCounter.steady acceptable error
Patrick Donnelly [Thu, 15 Apr 2021 13:57:57 +0000 (06:57 -0700)]
test: bump DecayCounter.steady acceptable error

The test is often run on heavily loaded machines which will effect the
precision of the test.

Fixes: https://tracker.ceph.com/issues/50378
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
4 years agoMerge pull request #40873 from tchaikov/wip-50374
Kefu Chai [Thu, 15 Apr 2021 13:33:12 +0000 (21:33 +0800)]
Merge pull request #40873 from tchaikov/wip-50374

vstart.sh: disable "auth_allow_insecure_global_id_reclaim"

Reviewed-by: Sage Weil <sage@redhat.com>
Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
4 years agoMerge pull request #40867 from tchaikov/wip-50339
Kefu Chai [Thu, 15 Apr 2021 13:24:38 +0000 (21:24 +0800)]
Merge pull request #40867 from tchaikov/wip-50339

test/cls_cas: allow multi hobjects tracked by cls_cas

Reviewed-by: Myoungwon Oh <myoungwon.oh@samsung.com>
Reviewed-by: Sam Just <sjust@redhat.com>
4 years agoauth/cephx: make KeyServer::build_session_auth_info() less confusing
Ilya Dryomov [Thu, 15 Apr 2021 13:18:58 +0000 (15:18 +0200)]
auth/cephx: make KeyServer::build_session_auth_info() less confusing

The second KeyServer::build_session_auth_info() overload is used only
by the monitor, for mon <-> mon authentication.  The monitor passes in
service_secret (mon secret) and secret_id (-1).  The TTL is irrelevant
because there is no rotation.

However the signature doesn't make it obvious.  Clarify that
service_secret and secret_id are input parameters and info is the only
output parameter.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
4 years agovstart.sh: disable "auth_allow_insecure_global_id_reclaim"
Kefu Chai [Thu, 15 Apr 2021 13:07:53 +0000 (21:07 +0800)]
vstart.sh: disable "auth_allow_insecure_global_id_reclaim"

to silence the health warning of "mons are allowing insecure global_id
reclaim", which prevents the cluster from being active+clean. couple
tests are expecting a warning free cluster before they starts.

as this option is enabled by default for appeasing the old clients, but when it
comes to most of upstream testing, we can just disable it.

Fixes: https://tracker.ceph.com/issues/50374
Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agomgr/cephadm: report exception during upgrade in upgrade status
Adam King [Wed, 14 Apr 2021 15:39:10 +0000 (11:39 -0400)]
mgr/cephadm: report exception during upgrade in upgrade status

Fixes: https://tracker.ceph.com/issues/50361
Signed-off-by: Adam King <adking@redhat.com>
4 years agoMerge pull request #40196 from ceph/wip-15210-notes
Ilya Dryomov [Thu, 15 Apr 2021 11:36:22 +0000 (13:36 +0200)]
Merge pull request #40196 from ceph/wip-15210-notes

doc/releases/octopus.rst: 15.2.10 Release Notes

Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Casey Bodley <cbodley@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
4 years agodoc/releases/releases: add 15.2.10 release
Ilya Dryomov [Thu, 15 Apr 2021 11:22:22 +0000 (13:22 +0200)]
doc/releases/releases: add 15.2.10 release

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
4 years agodoc/releases/octopus.rst: 15.2.10 Release Notes
David Galloway [Wed, 17 Mar 2021 19:08:11 +0000 (15:08 -0400)]
doc/releases/octopus.rst: 15.2.10 Release Notes

Signed-off-by: David Galloway <dgallowa@redhat.com>
4 years agoauth/cephx: cap ticket validity by expiration of "next" key
Ilya Dryomov [Thu, 15 Apr 2021 07:48:13 +0000 (09:48 +0200)]
auth/cephx: cap ticket validity by expiration of "next" key

If auth_mon_ticket_ttl is increased by several times as done in
commit 522a52e6c258 ("auth/cephx: rotate auth tickets less often"),
active clients eventually get stuck because the monitor sends out an
auth ticket with a bogus validity.  The ticket is secured with the
"current" secret that is scheduled to expire according to the old TTL,
but the validity of the ticket is set to the new TTL.  As a result,
the client simply doesn't attempt to renew, letting the secrets rotate
potentially more than once.  When that happens, the client first hits
auth authorizer errors as it tries to renew service tickets and when
it finally gets to renewing the auth ticket, it hits the insecure
global_id reclaim wall.

Cap TTL by expiration of "next" key -- the "current" key may be
milliseconds away from expiration and still be used, legitimately.
Do it in KeyServerData alongside key rotation code and propagate the
capped TTL to the upper layer.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
4 years agoauth/cephx: drop redundant KeyServerData::get_service_secret() overload
Ilya Dryomov [Thu, 15 Apr 2021 07:47:50 +0000 (09:47 +0200)]
auth/cephx: drop redundant KeyServerData::get_service_secret() overload

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
4 years agoMerge pull request #40866 from hualongfeng/pmdk_in_dist
Kefu Chai [Thu, 15 Apr 2021 08:43:28 +0000 (16:43 +0800)]
Merge pull request #40866 from hualongfeng/pmdk_in_dist

make-dist: add pmdk to dist tarball

Reviewed-by: Kefu Chai <kchai@redhat.com>
4 years agocmake: Add an option for enabling rook client in dashboard
Willem Jan Withagen [Wed, 14 Apr 2021 16:36:51 +0000 (18:36 +0200)]
cmake: Add an option for enabling rook client in dashboard

Signed-off-by: Willem Jan Withagen <wjw@digiware.nl>
4 years agocephfs-top: set the cursor to be invisible
Xiubo Li [Fri, 2 Apr 2021 01:03:02 +0000 (09:03 +0800)]
cephfs-top: set the cursor to be invisible

Signed-off-by: Xiubo Li <xiubli@redhat.com>
4 years agocephfs-top: self-adapt the display according the window size
Xiubo Li [Thu, 1 Apr 2021 05:39:23 +0000 (13:39 +0800)]
cephfs-top: self-adapt the display according the window size

This will allow change the window size when the cephfs-top tool is
running and will adapt the display according to the real time window
size.

Fixes: https://tracker.ceph.com/issues/50091
Signed-off-by: Xiubo Li <xiubli@redhat.com>
4 years agocephfs-top: use the default window object from curses.wrapper()
Xiubo Li [Thu, 1 Apr 2021 05:04:07 +0000 (13:04 +0800)]
cephfs-top: use the default window object from curses.wrapper()

Signed-off-by: Xiubo Li <xiubli@redhat.com>
4 years agocephfs-top: improve the output
Xiubo Li [Wed, 31 Mar 2021 14:00:03 +0000 (22:00 +0800)]
cephfs-top: improve the output

When adding more metrics the top line will be too long and maybe
wrapped with serval lines, which will make it hard to read.

Signed-off-by: Xiubo Li <xiubli@redhat.com>
4 years agorgw: cast rgw_max_chunk_size to off_t
Kefu Chai [Thu, 15 Apr 2021 05:31:25 +0000 (13:31 +0800)]
rgw: cast rgw_max_chunk_size to off_t

as the left-hand operator is promoted to off_t which is a signed
integer, while rgw_max_chunk_size will be an unsigned after the
yaml-to-cxx migration. so let's cast it to `off_t` before comparing
them.

the same applies to rgw_copy_obj_progress_every_bytes.

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agoos/bluestore: do not cast min_alloc_size to int64_t
Kefu Chai [Wed, 14 Apr 2021 03:25:17 +0000 (11:25 +0800)]
os/bluestore: do not cast min_alloc_size to int64_t

as the left-hand operator will be an unsigned integer after the
yaml-to-cxx migration.

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agomsg/Message: use an unsigned type for get_data_len()
Kefu Chai [Wed, 14 Apr 2021 03:21:42 +0000 (11:21 +0800)]
msg/Message: use an unsigned type for get_data_len()

to avoid warnings like:

../src/osd/PrimaryLogPG.cc:2123:27: warning: comparison of integer expressions of different signedness: ‘off_t’ {aka ‘long int’} and ‘uint64_t’ {aka ‘long unsigned int’} [-Wsign-compare]
 2123 |         m->get_data_len() > cct->_conf->osd_max_write_size << 20) {
      |         ~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

after change the size options' type to unsigned or uint64_t.

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agocommon: generate legacy_config_opts.h from .yaml.in files
Kefu Chai [Tue, 13 Apr 2021 05:43:57 +0000 (13:43 +0800)]
common: generate legacy_config_opts.h from .yaml.in files

* add a setting named "with_legacy" to .yaml.in files, so
  each option with a true "with_legacy" will have an entry
  in legacy_config_opts.h.
* preserve the comments from legacy_config_opts.h to .yaml.in,
  some of them are solely for developers, but some of them are
  good reading for users as well. we can use them for "desc"
  field in a follow-up change.
* move common/legacy_config_opts.h to common/options/legacy_config_opts.h
  as legacy_config_opts.h is "closer" to the options directory
  than other sources files under src/common.
* update y2c.py to generate separate .h files which are in turn
  included by legacy_config_opts.h
* add a target named "legacy-option-headers", and let
  some targets depend on it so that these headers generated by
  y2c.py can be generated before the .cc files including them
  are compiled.

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agorgw: use min<offset_t> instead of min()
Kefu Chai [Wed, 14 Apr 2021 02:19:28 +0000 (10:19 +0800)]
rgw: use min<offset_t> instead of min()

as its parameters' types might not be identical. to be specific,
the 2nd parameter's type will be an unsigned one after the c++ => yaml
migration.

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agotest/cls_cas: allow multi hobjects tracked by cls_cas
Kefu Chai [Thu, 15 Apr 2021 03:28:35 +0000 (11:28 +0800)]
test/cls_cas: allow multi hobjects tracked by cls_cas

in d2737fd41a146e8efe3162cdc39845226bd5a756, we started to use multiset
for tracking the references of hobject for snapshot support. as the same
hobject maps to multiple snapshots. and we don't want to consider
different snapshots as the same entry tracked by cls_cas.

but cls_cas.dup_get() still tries to verify that the `get` operation
is able to dedup the same referenced "source". but this does not apply
to "by_object" trunk ref type anymore.

since we cannot check/choose the chunk ref type used by OSD from the
client of the cls_cas, in this change, cls_cas.dup_get() is updated
to adapt the change solely for "by_object". otherwise we could skip
this test for "by_object" type and/or define another test for other
chunk ref types.

Fixes: https://tracker.ceph.com/issues/50339
Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agocmake: use src/pmdk for building pmdk if it exists
Feng Hualong [Wed, 14 Apr 2021 05:20:17 +0000 (13:20 +0800)]
cmake: use src/pmdk for building pmdk if it exists

so we can build with pmdk enabled if the dist tarball
contains pmdk

Signed-off-by: Feng Hualong <hualong.feng@intel.com>
4 years agomake-dist: add pmdk to dist tarball
Feng Hualong [Wed, 14 Apr 2021 01:26:55 +0000 (09:26 +0800)]
make-dist: add pmdk to dist tarball

Signed-off-by: Feng Hualong <hualong.feng@intel.com>
4 years agoMerge pull request #40862 from ivancich/wip-rgw-reshard-contention-logging
J. Eric Ivancich [Wed, 14 Apr 2021 20:13:17 +0000 (16:13 -0400)]
Merge pull request #40862 from ivancich/wip-rgw-reshard-contention-logging

rgw: during reshard lock contention, adjust logging

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>
Reviewed-by: Casey Bodley <cbodley@redhat.com>
4 years agomgr/orchestrator: validate lists in spec jsons
Daniel Pivonka [Thu, 1 Apr 2021 17:56:48 +0000 (13:56 -0400)]
mgr/orchestrator: validate lists in spec jsons

Signed-off-by: Daniel Pivonka <dpivonka@redhat.com>
4 years agoMerge pull request #40821 from cbodley/wip-50218
Casey Bodley [Wed, 14 Apr 2021 18:45:00 +0000 (14:45 -0400)]
Merge pull request #40821 from cbodley/wip-50218

qa/rgw: use symlinks to specify distro

Reviewed-by: Ali Maredia <amaredia@redhat.com>
4 years agorgw: during reshard lock contention, adjust logging
J. Eric Ivancich [Wed, 14 Apr 2021 17:55:22 +0000 (13:55 -0400)]
rgw: during reshard lock contention, adjust logging

When RGW fails to get a lock on a reshard log, we log it in such a way
that it looks like an error. Instead we'll make sure that the log
message is informational.

Signed-off-by: J. Eric Ivancich <ivancich@redhat.com>
4 years agoMerge branch 'CVE-2021-20288' into master
Ilya Dryomov [Wed, 14 Apr 2021 17:58:49 +0000 (19:58 +0200)]
Merge branch 'CVE-2021-20288' into master

* CVE-2021-20288:
  qa/standalone: default to disable insecure global id reclaim
  qa/suites/upgrade/octopus-x: disable insecure global_id reclaim health warnings
  qa/tasks/ceph[adm].conf[.template]: disable insecure global_id reclaim health alerts
  cephadm: set auth_allow_insecure_global_id_reclaim for mon on bootstrap
  mon/HealthMonitor: raise AUTH_INSECURE_GLOBAL_ID_RENEWAL[_ALLOWED]
  auth/cephx: ignore CEPH_ENTITY_TYPE_AUTH in requested keys
  auth/cephx: rotate auth tickets less often
  mon: fail fast when unauthorized global_id (re)use is disallowed
  auth/cephx: option to disallow unauthorized global_id (re)use
  auth/cephx: make cephx_decode_ticket() take a const ticket_blob
  auth/AuthServiceHandler: keep track of global_id and whether it is new
  auth/AuthServiceHandler: build_cephx_response_header() is cephx-specific
  auth/AuthServiceHandler: drop unused start_session() args
  mon/MonClient: drop global_id arg from _add_conn() and _add_conns()
  mon/MonClient: reset auth state in shutdown()
  mon/MonClient: preserve auth state on reconnects
  mon/MonClient: claim active_con's auth explicitly
  mon/MonClient: resurrect "waiting for monmap|config" timeouts

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
Reviewed-by: Sage Weil <sage@redhat.com>
4 years agomgr/cephadm: remove old haproxy and keepalived templates
Sage Weil [Wed, 14 Apr 2021 16:47:54 +0000 (11:47 -0500)]
mgr/cephadm: remove old haproxy and keepalived templates

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agodoc/_ext: extract confval into an extension
Kefu Chai [Wed, 14 Apr 2021 16:03:10 +0000 (00:03 +0800)]
doc/_ext: extract confval into an extension

better readability this way

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agoadmin/doc-requirements.txt: pin sphinx and sphinx_rtd_theme to a certain version
Kefu Chai [Wed, 14 Apr 2021 15:24:36 +0000 (23:24 +0800)]
admin/doc-requirements.txt: pin sphinx and sphinx_rtd_theme to a certain version

to ensure that we can have reproducible builds

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agoMerge pull request #40614 from smithfarm/wip-39556
Kefu Chai [Wed, 14 Apr 2021 14:41:35 +0000 (22:41 +0800)]
Merge pull request #40614 from smithfarm/wip-39556

make-dist: refuse to run if script path contains a colon

Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
4 years agoMerge pull request #40839 from gregsfortytwo/wip-mon-quorum-leader
Gregory Farnum [Wed, 14 Apr 2021 14:41:05 +0000 (07:41 -0700)]
Merge pull request #40839 from gregsfortytwo/wip-mon-quorum-leader

mon: MMonProbe: direct MMonJoin messages to the leader, instead of the first mon

Reviewed-by: Sam Just <sjust@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
4 years agoMerge pull request #40807 from tchaikov/wip-common-include
Kefu Chai [Wed, 14 Apr 2021 14:39:45 +0000 (22:39 +0800)]
Merge pull request #40807 from tchaikov/wip-common-include

common/Formatter: include used header

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
4 years agoMerge pull request #40830 from gregsfortytwo/wip-sqlite3-build
Kefu Chai [Wed, 14 Apr 2021 14:38:55 +0000 (22:38 +0800)]
Merge pull request #40830 from gregsfortytwo/wip-sqlite3-build

mgr: do not require sqlite3.h if WITH_LIBCEPHSQLITE=OFF

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
4 years agoMerge pull request #40840 from tchaikov/wip-silence-gcc-warnings
Kefu Chai [Wed, 14 Apr 2021 14:35:09 +0000 (22:35 +0800)]
Merge pull request #40840 from tchaikov/wip-silence-gcc-warnings

tools: silence gcc warnings

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
4 years agosystemd: remove `ProtectClock=true` for `ceph-osd@.service`
Wong Hoi Sing Edison [Wed, 14 Apr 2021 07:36:17 +0000 (15:36 +0800)]
systemd: remove `ProtectClock=true` for `ceph-osd@.service`

Ceph 16.2.0 Pacific by https://github.com/ceph/ceph/commit/9a84d5a introduce following new systemd restriction:

    ProtectClock=true
    ProtectHostname=true
    ProtectKernelLogs=true
    RestrictSUIDSGID=true

BTW, `ceph-osd@.service` failed with `ProtectClock=true` unexpectly, also see:

  - <https://lists.ceph.io/hyperkitty/list/ceph-users@ceph.io/thread/TNBGGNN6STGDKARAQTQCIPTU4KLIVJQV/>
  - <https://serverfault.com/questions/1059317/bluestore-var-lib-ceph-osd-ceph-2-block-read-bdev-label-failed-to-open-var-l>

This PR intruduce:

  - Remove `ProtectClock=true` for our systemd service templates

Fixes: https://tracker.ceph.com/issues/50347
Signed-off-by: Wong Hoi Sing Edison <hswong3i@pantarei-design.com>
4 years agoMerge PR #40735 into master
Sage Weil [Wed, 14 Apr 2021 13:53:31 +0000 (09:53 -0400)]
Merge PR #40735 into master

* refs/pull/40735/head:
mgr/orchestractor: rgw realm and zone flags must both be provided

Reviewed-by: Juan Miguel Olmo <jolmomar@redhat.com>
Reviewed-by: Sage Weil <sage@redhat.com>
Reviewed-by: Adam King <adking@redhat.com>
4 years agoMerge PR #40734 into master
Sage Weil [Wed, 14 Apr 2021 13:39:38 +0000 (09:39 -0400)]
Merge PR #40734 into master

* refs/pull/40734/head:
mgr/cephadm: make prometheus scrape ingress haproxy
doc/cephadm: remove big warning about stability
doc/cepham/compatibility: rgw-ha -> ingress; note possibility of breaking changes
mgr/cephadm: ingress: add optional virtual_interface_networks
doc/cephadm/rgw: clean up example spec
mgr/cephadm/services/ingress: less verbose about prepare_create
doc/cephadm/rgw: add note about which ethernet interface is used
cephadm: make keepalived unit fiddle sysctl settings
mgr/orchestrator: report external endpoints from 'orch ls'
mgr/orchestrator: drop - when no ports
doc/cephadm/rgw: update docs for ingress service
mgr/cephadm: use per_host_daemon feature in scheduler
mgr/cephadm/schedule: add per_host_daemon_type support
mgr/cephadm: HA_RGW -> Ingress
mgr/cephadm: include daemon_type in DaemonPlacement
mgr/cephadm: update list-networks to report interface names too
mgr/orchestrator: streamline 'orch ps' PORTS formatting
mgr/cephadm/schedule: handle multiple ports per daemon
mgr/cephadm/utils: resolve_ip(): prefer IPv4

Reviewed-by: Sebastian Wagner <swagner@suse.com>
4 years agodoc/_ext: claim ceph_releses ext to be parallel read safe
Kefu Chai [Wed, 14 Apr 2021 09:07:23 +0000 (17:07 +0800)]
doc/_ext: claim ceph_releses ext to be parallel read safe

so we can pass `-j auto` to sphinx-build to speed up the build

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agoinclude/rados: document parameters
Kefu Chai [Wed, 14 Apr 2021 08:31:07 +0000 (16:31 +0800)]
include/rados: document parameters

less warnings from sphinx

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agoadmin/build-doc: use htmldir builder for livehtml
Kefu Chai [Wed, 14 Apr 2021 08:29:39 +0000 (16:29 +0800)]
admin/build-doc: use htmldir builder for livehtml

otherwise livehtml uses the default html builder. so we can copy and
paste the partial URL to cross check the rendering result.

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agoMerge pull request #40149 from tchaikov/wip-cmake-job-pool
Kefu Chai [Wed, 14 Apr 2021 07:06:50 +0000 (15:06 +0800)]
Merge pull request #40149 from tchaikov/wip-cmake-job-pool

cmake: use ninja job pool

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
4 years agoMerge pull request #40785 from tchaikov/wip-doc-list-spacing
Kefu Chai [Wed, 14 Apr 2021 04:42:31 +0000 (12:42 +0800)]
Merge pull request #40785 from tchaikov/wip-doc-list-spacing

doc/_themes: remove spacing after `ul li p`

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
4 years agocmake: update civetweb.h on demand
Kefu Chai [Wed, 14 Apr 2021 03:58:57 +0000 (11:58 +0800)]
cmake: update civetweb.h on demand

instead of using a dedicated target for updating civetweb.h, use
add_custom_command() with OUTPUT option.

so the generated rule does not rerun the command specified by the
civetweb_h target every time we build the targets depending on it.

use the header file as the dependency helps to improve the readability
as there is one less link in the dependency chain, and by specifying the
OUTPUT, cmake is able to figure out the dependency on the header file so
it does not try to regenerate it every time.

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agotools/osdmaptool: mark unused variable [[maybe_unused]]
Kefu Chai [Wed, 14 Apr 2021 02:46:14 +0000 (10:46 +0800)]
tools/osdmaptool: mark unused variable [[maybe_unused]]

to silence warning from GCC when performing release build, like:

../src/tools/osdmaptool.cc: In function ‘int main(int, const char**)’:
../src/tools/osdmaptool.cc:472:9: warning: variable ‘r’ set but not used [-Wunused-but-set-variable]
  472 |     int r = clock_gettime(CLOCK_MONOTONIC, &round_start);
      |         ^

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agodoc/radosgw: fix formatting of command line block
Kefu Chai [Wed, 14 Apr 2021 01:33:35 +0000 (09:33 +0800)]
doc/radosgw: fix formatting of command line block

should add an empty line after the directive.

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agoadmin: require sphinx>=3.2.1
Kefu Chai [Wed, 14 Apr 2021 00:51:08 +0000 (08:51 +0800)]
admin: require sphinx>=3.2.1

loosen the required sphinx version.

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agodoc/_themes: remove spacing after `ul li p`
Kefu Chai [Sun, 11 Apr 2021 03:09:25 +0000 (11:09 +0800)]
doc/_themes: remove spacing after `ul li p`

in the latest document generated from RtD, the spacing after `ul li p`
elements is set to 24px as the plain `p` elements. but this the lists
more sparse and difficult to read.

in this change, the spacing is restored to 0 as it was in old theme.css
in sphinx_rtd_theme.

Signed-off-by: Kefu Chai <kchai@redhat.com>
4 years agomon: MMonProbe: direct MMonJoin messages to the leader, instead of the first mon
Greg Farnum [Tue, 13 Apr 2021 22:53:03 +0000 (22:53 +0000)]
mon: MMonProbe: direct MMonJoin messages to the leader, instead of the first mon

When monitors are joining a cluster, they may send an MMonJoin message to place
themselves correctly in the map in either handle_probe_reply() or
finish_election(). These messages must be sent to the leader -- monitors do not
forward each other's messages.

Unfortunately, this scenario was missed when converting the monitors to support
connectivity-based elections, and they're sending these messages to
quorum.begin(). Fix this by including an explicit leader in MMonProbe (that the
new monitor may reference in handle_probe_reply) and using the leader
value in both locations.

Fixes: https://tracker.ceph.com/issues/50345
Signed-off-by: Greg Farnum <gfarnum@redhat.com>
4 years agoqa/suites/rados/thrash: shorten radosbench
Sage Weil [Tue, 13 Apr 2021 22:42:21 +0000 (17:42 -0500)]
qa/suites/rados/thrash: shorten radosbench

This is the longest of the thrash workloads; reducing it will bring
this test in line with the others (<= 45 min).

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agoMerge pull request #40835 from gregsfortytwo/wip-stretch-mon-state
Gregory Farnum [Tue, 13 Apr 2021 22:38:01 +0000 (15:38 -0700)]
Merge pull request #40835 from gregsfortytwo/wip-stretch-mon-state

Fix issues with in-memory monitor stretch state

Reviewed-by: Sam Just <sjust@redhat.com>
4 years agomgr/cephadm: make prometheus scrape ingress haproxy
Sage Weil [Tue, 13 Apr 2021 22:20:21 +0000 (18:20 -0400)]
mgr/cephadm: make prometheus scrape ingress haproxy

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agodoc/cephadm: remove big warning about stability
Sage Weil [Tue, 13 Apr 2021 14:20:48 +0000 (10:20 -0400)]
doc/cephadm: remove big warning about stability

It's the first item on the toctree that follows.

Signed-off-by: Sage Weil <sage@newdream.net>
4 years agodoc/cepham/compatibility: rgw-ha -> ingress; note possibility of breaking changes
Sage Weil [Tue, 13 Apr 2021 14:20:27 +0000 (10:20 -0400)]
doc/cepham/compatibility: rgw-ha -> ingress; note possibility of breaking changes

Signed-off-by: Sage Weil <sage@newdream.net>