Sage Weil [Sat, 7 Dec 2019 18:28:15 +0000 (12:28 -0600)]
Merge PR #32039 into master
* refs/pull/32039/head:
test: Improve races by using kill_daemons which waits for OSDs terminate
test: run-standalone.sh: Only run execs in the subdirectories of qa/standalone
test: Use activate_osd() when restarting OSDs
test: osd-scrub-snaps.sh: Fix race with osd restart and doing a scrub
Sage Weil [Fri, 6 Dec 2019 21:54:06 +0000 (15:54 -0600)]
qa/tasks/mgr/test_orchestrator_cli: test with valid network or IP
Passing 'network' will fail with
Error EINVAL: Failed to parse host list: '['host1:network', 'host2:network', 'host3:network']': u'network' does not appear to be an IPv4 or IPv6 address
Sage Weil [Fri, 6 Dec 2019 21:14:31 +0000 (15:14 -0600)]
ceph-daemon: fix bootstrap ownership of tmp monmap file
The file is created with mode 0644 by the ceph:ceph user, but root
cannot write to that. Instead, create the tmp file owned by root, write
to it, then chown it to user ceph.
David Zafman [Fri, 6 Dec 2019 17:01:41 +0000 (09:01 -0800)]
test: run-standalone.sh: Only run execs in the subdirectories of qa/standalone
This will ignore scripts placed at the qa/standalone level, though
I'm not sure if we should be putting any tests there. It does
allow support scripts present like ceph-helper.sh without modifying
run-standalone.sh to ignore it.
* refs/pull/29964/head:
qa/cephfs: test conf file reading for CephFS shell
qa/cephfs: allow passing conf to methods that runs shell command
qa/vstart_runner: add a method to create temporary file
test_cephfs_shell: update test since shell can now read ceph.conf
src/vstart.sh: add cephfs-shell section to ceph.conf
cephfs-shell: read shell variables from ceph.conf
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Rishabh Dave [Wed, 13 Nov 2019 12:42:13 +0000 (18:12 +0530)]
test_cephfs_shell: update test since shell can now read ceph.conf
In the test
test_cephfs_shell.TestMisc.test_issue_cephfs_shell_cmd_at_invocation,
don't set the shell option with in the command to be tested since
CephFS shell can now read it's value from ceph.conf.
Also, add methods to set config options for vstart as well teuthology
testing.
Rishabh Dave [Wed, 28 Aug 2019 13:57:29 +0000 (19:27 +0530)]
cephfs-shell: read shell variables from ceph.conf
Read ceph.conf and initialize shell variables by reading options set
under section cephfs-shell. If there are sections/options that are
duplicated in ceph.conf, CephFS shell overrides the options with the
values last.
Sage Weil [Thu, 5 Dec 2019 23:18:07 +0000 (17:18 -0600)]
mgr: simply exit on SIGINT or SIGTERM
The python modules don't reliable shut down. We've fixed a few shutdown
bugs chasing https://tracker.ceph.com/issues/42744.
This was compounded by https://tracker.ceph.com/issues/42981, which seems
to suggest something in the python module teardown is screwing with the
signal handlers.
Now that df507cde8d71063d5873a42f668156e4c32e86f9 is there, the mgr will
blacklist all rados clients the mgr instantiates, which means we can just
exit immediately and let the blacklisting clean things up.
Works-around: https://tracker.ceph.com/issues/42981 Fixes: https://tracker.ceph.com/issues/42744 Signed-off-by: Sage Weil <sage@redhat.com>
* refs/pull/29421/head:
qa/cephfs: add tests for ACLs
qa/cephfs: allow running tests from xfstests-dev
qa/tasks: add methods to get monitor's sockets
qa/cephfs: don't crash if mountpoint dir is already deleted
vstart_runner.py: set omit_sudo's default value to False
qa/vstart_runner.py: fix get_keyring_path()
qa/cephfs: don't abort if mountpoint is already present
qa/cephfs: allow specifying mountpoint for kernel mounts
qa/cephfs: allow specifying mountpoints for FUSE mounts
qa/vstart_runner.py: allow specifying mountpoint for local FUSE mounts
qa/mount.py: allow setting mountpoint
qa/vstart_runner.py: add a method to create a temporary directory
Reviewed-by: Jeff Layton <jlayton@redhat.com> Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Sage Weil [Thu, 5 Dec 2019 20:37:02 +0000 (14:37 -0600)]
mon/PGMap: use NONE for 'pg ls[-*]' output too
We already show CRUSH_ITEM_NONE as NONE in the 'pg dump' output as of 57bac8ea88e5cdcb3854852d439f7e6cbf9df900; do the same for 'pg ls' (which
is the friendlier of the two and what users should be using anyway!).
Rodrigo Severo [Thu, 5 Dec 2019 16:41:33 +0000 (13:41 -0300)]
doc: obsolete entries for allow_standby_replay
Nautilus release presented the allow_standby_replay fs setting that obsdoleted
several MDS config entries: mds_standby_for_*, mon_force_standby_active, and
mds_standby_replay.
Removing entries instead of just marking them as "Obsolete' as per batrick
suggestion.
Zac Dover [Thu, 5 Dec 2019 16:16:01 +0000 (02:16 +1000)]
doc: s/achieve/achieves (Fixed a verb disagreement)
This bug changes a clause of the form "When x remains in y status
and never achieve a z status" to a clause of the form "When x remains
in y status and never achieves a z status".
Jason Dillaman [Wed, 30 Oct 2019 18:55:41 +0000 (14:55 -0400)]
mgr/dashboard: block mirroring bootstrap UI
Two new modal windows allow an admin to create a base64-encoded bootstrap token
which can then be imported into dashboard on another cluster. The bootstrap
token embeds all the necessary data required to connect to a peer cluster for
RBD mirroring.
Fixes: http://tracker.ceph.com/issues/42355 Signed-off-by: Jason Dillaman <dillaman@redhat.com>
Jason Dillaman [Tue, 22 Oct 2019 17:58:31 +0000 (13:58 -0400)]
mgr/dashboard: integrate site name into block mirroring overview
The site name is used to differentiate clusters for RBD mirroring.
The local site name is now displayed on the block mirroring
overview page. It also includes a new modal window to edit the site
name.
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
* refs/pull/31986/head:
qa: test volumes plugin mount cleanup
pybind/mgr/volumes: use py3 items iterator
pybind/mgr/volumes: print errors in cleanup timer
qa: improve variable name
mgr/dashboard: show alert panel if prometheus/alertmanager is unconfigured
If the tabs under the "Monitoring" page aren't properly configured, a
notification is shown which explains the user which setting needs to be
enabled and also provides a link to the corresponding documentation.
Fixes: https://tracker.ceph.com/issues/42877 Signed-off-by: Patrick Seidensal <pseidensal@suse.com>
Sage Weil [Mon, 2 Dec 2019 13:43:54 +0000 (07:43 -0600)]
mon: cap keys in mon_sync messages
The previous cap was set at 1 MB. However, a user was experiencing mon
timeouts while syncing the purged_snap_epoch * keys, which are ~20 bytes
each. Reducing the max payload to 64K resolved the problem, which maps
to (very!) roughly 1500 keys per message. Set our limit a bit higher than
that since we just made this quite a bit more efficient. Most of the time
the keys are larger than 20 bytes and we wouldn't hit the key limit, but
having one ensures that we won't burn too much CPU in one go when we do
have lots of these little keys.
Zac Dover [Mon, 2 Dec 2019 15:40:03 +0000 (01:40 +1000)]
doc: add "chrony" to preflight checklist for Ubuntu 18.04
This commit updates the "Install NTP" section of the preflight
checklist, instructing readers who are using Ubuntu 18.04 to
use the package name "chrony" where elsewhere the package name
"ntp" would be used. "chrony" has replaced "ntp" in Ubnutu 18.04.
This commit fixes Bug number 8 in the list here:
https://pad.ceph.com/p/Report_Documentation_Bugs
Sage Weil [Wed, 4 Dec 2019 13:10:49 +0000 (07:10 -0600)]
Merge PR #31797 into master
* refs/pull/31797/head:
qa: test mgr cephfs mount blacklist
mgr: forward RADOS client instances for potential blacklist
mon/MgrMonitor: blacklist previous instance
mon: set RUNTIME flag on mon_mds_blacklist_interval
mgr: use more efficient data structure