]>
git.apps.os.sepia.ceph.com Git - ceph.git/log
Patrick Donnelly [Mon, 2 Nov 2020 18:01:59 +0000 (10:01 -0800)]
qa: leave one standby available to avoid warning
Four file systems will use all MDS and generate this warning:
2020-11-02T03:48:33.407 INFO:teuthology.orchestra.run.smithi003.stdout:2020-11-02T03:24:21.817337+0000 mon.a (mon.0) 481 : cluster [WRN] Health check failed: insufficient standby MDS daemons available (MDS_INSUFFICIENT_STANDBY)
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Patrick Donnelly [Mon, 2 Nov 2020 02:22:48 +0000 (18:22 -0800)]
qa: do not check access timestamps
kclient does not update these like ceph-fuse.
This avoids failures like:
2020-11-01T08:05:53.115 INFO:tasks.cephfs_test_runner:test_subvolume_clone_in_progress_getpath (tasks.cephfs.test_volumes.TestSubvolumeSnapshotClones) ... FAIL
2020-11-01T08:05:53.115 INFO:tasks.cephfs_test_runner:
2020-11-01T08:05:53.116 INFO:tasks.cephfs_test_runner:======================================================================
2020-11-01T08:05:53.116 INFO:tasks.cephfs_test_runner:FAIL: test_subvolume_clone_in_progress_getpath (tasks.cephfs.test_volumes.TestSubvolumeSnapshotClones)
2020-11-01T08:05:53.116 INFO:tasks.cephfs_test_runner:----------------------------------------------------------------------
2020-11-01T08:05:53.116 INFO:tasks.cephfs_test_runner:Traceback (most recent call last):
2020-11-01T08:05:53.117 INFO:tasks.cephfs_test_runner: File "/home/teuthworker/src/github.com_batrick_ceph_cephfs-qa-reorg/qa/tasks/cephfs/test_volumes.py", line 2295, in test_subvolume_clone_in_progress_getpath
2020-11-01T08:05:53.117 INFO:tasks.cephfs_test_runner: self._verify_clone(subvolume, snapshot, clone)
2020-11-01T08:05:53.117 INFO:tasks.cephfs_test_runner: File "/home/teuthworker/src/github.com_batrick_ceph_cephfs-qa-reorg/qa/tasks/cephfs/test_volumes.py", line 154, in _verify_clone
2020-11-01T08:05:53.117 INFO:tasks.cephfs_test_runner: self._verify_clone_attrs(path1, path2)
2020-11-01T08:05:53.117 INFO:tasks.cephfs_test_runner: File "/home/teuthworker/src/github.com_batrick_ceph_cephfs-qa-reorg/qa/tasks/cephfs/test_volumes.py", line 107, in _verify_clone_attrs
2020-11-01T08:05:53.118 INFO:tasks.cephfs_test_runner: self.assertEqual(sval, cval)
2020-11-01T08:05:53.118 INFO:tasks.cephfs_test_runner:AssertionError:
1604217951 !=
1604217941
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Patrick Donnelly [Mon, 2 Nov 2020 02:17:43 +0000 (18:17 -0800)]
qa: ignore skip errors for kclient
To avoid this failure:
2020-11-01T07:21:35.117 INFO:tasks.cephfs_test_runner:test_volume_without_namespace_isolation (tasks.cephfs.test_volume_client.TestVolumeClient) ... ok
2020-11-01T07:21:35.118 INFO:tasks.cephfs_test_runner:
2020-11-01T07:21:35.118 INFO:tasks.cephfs_test_runner:======================================================================
2020-11-01T07:21:35.119 INFO:tasks.cephfs_test_runner:FAIL: test_evict_client (tasks.cephfs.test_volume_client.TestVolumeClient)
2020-11-01T07:21:35.119 INFO:tasks.cephfs_test_runner:----------------------------------------------------------------------
2020-11-01T07:21:35.119 INFO:tasks.cephfs_test_runner:Requires FUSE client to inject client metadata
2020-11-01T07:21:35.119 INFO:tasks.cephfs_test_runner:----------------------------------------------------------------------
2020-11-01T07:21:35.119 INFO:tasks.cephfs_test_runner:Ran 18 tests in 732.749s
2020-11-01T07:21:35.120 INFO:tasks.cephfs_test_runner:
2020-11-01T07:21:35.120 INFO:tasks.cephfs_test_runner:FAILED (failures=1)
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Patrick Donnelly [Sun, 1 Nov 2020 04:06:29 +0000 (21:06 -0700)]
qa: add more clients for test_volume_client
It requires 4.
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Patrick Donnelly [Sun, 1 Nov 2020 04:02:03 +0000 (21:02 -0700)]
qa: accept client returning pool id for layout
Resolving this failure:
2020-10-30T22:52:25.898 INFO:tasks.cephfs_test_runner:test_subvolume_group_create_with_desired_data_pool_layout (tasks.cephfs.test_volumes.TestSubvolumeGroups) ... FAIL
2020-10-30T22:52:25.898 INFO:tasks.cephfs_test_runner:
2020-10-30T22:52:25.899 INFO:tasks.cephfs_test_runner:======================================================================
2020-10-30T22:52:25.899 INFO:tasks.cephfs_test_runner:FAIL: test_subvolume_group_create_with_desired_data_pool_layout (tasks.cephfs.test_volumes.TestSubvolumeGroups)
2020-10-30T22:52:25.899 INFO:tasks.cephfs_test_runner:----------------------------------------------------------------------
2020-10-30T22:52:25.899 INFO:tasks.cephfs_test_runner:Traceback (most recent call last):
2020-10-30T22:52:25.900 INFO:tasks.cephfs_test_runner: File "/home/teuthworker/src/github.com_batrick_ceph_cephfs-qa-reorg/qa/tasks/cephfs/test_volumes.py", line 568, in test_subvolume_group_create_with_desired_data_pool_layout
2020-10-30T22:52:25.900 INFO:tasks.cephfs_test_runner: self.assertEqual(desired_pool, new_pool)
2020-10-30T22:52:25.900 INFO:tasks.cephfs_test_runner:AssertionError: '34' != 'new_pool'
2020-10-30T22:52:25.900 INFO:tasks.cephfs_test_runner:- 34
2020-10-30T22:52:25.900 INFO:tasks.cephfs_test_runner:+ new_pool
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Patrick Donnelly [Sat, 31 Oct 2020 02:44:29 +0000 (19:44 -0700)]
qa: ignore expected warning about client missing features
2020-10-30T22:54:14.842 INFO:teuthology.orchestra.run.smithi168.stdout:2020-10-30T22:43:34.067194+0000 mds.a (mds.0) 1 : cluster [WRN] evicting session smithi168:0 (8757), missing required features '0x0000000000000400'
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Patrick Donnelly [Sat, 31 Oct 2020 02:05:09 +0000 (19:05 -0700)]
qa: do not fail on skip
Avoiding these failures:
2020-10-30T22:49:00.248 INFO:tasks.cephfs_test_runner:test_client_release_bug (tasks.cephfs.test_client_limits.TestClientLimits) ...
2020-10-30T22:49:00.248 INFO:tasks.cephfs_test_runner:======================================================================
2020-10-30T22:49:00.248 INFO:tasks.cephfs_test_runner:FAIL: test_client_cache_size (tasks.cephfs.test_client_limits.TestClientLimits)
2020-10-30T22:49:00.249 INFO:tasks.cephfs_test_runner:----------------------------------------------------------------------
2020-10-30T22:49:00.249 INFO:tasks.cephfs_test_runner:Require FUSE client to inject client release failure
2020-10-30T22:49:00.249 INFO:tasks.cephfs_test_runner:======================================================================
2020-10-30T22:49:00.250 INFO:tasks.cephfs_test_runner:FAIL: test_client_oldest_tid (tasks.cephfs.test_client_limits.TestClientLimits)
2020-10-30T22:49:00.250 INFO:tasks.cephfs_test_runner:----------------------------------------------------------------------
2020-10-30T22:49:00.251 INFO:tasks.cephfs_test_runner:Require FUSE client to inject client release failure
2020-10-30T22:49:00.251 INFO:tasks.cephfs_test_runner:======================================================================
2020-10-30T22:49:00.252 INFO:tasks.cephfs_test_runner:FAIL: test_client_release_bug (tasks.cephfs.test_client_limits.TestClientLimits)
2020-10-30T22:49:00.252 INFO:tasks.cephfs_test_runner:----------------------------------------------------------------------
2020-10-30T22:49:00.252 INFO:tasks.cephfs_test_runner:Require FUSE client to inject client release failure
2020-10-30T22:49:00.252 INFO:tasks.cephfs_test_runner:----------------------------------------------------------------------
2020-10-30T22:49:00.252 INFO:tasks.cephfs_test_runner:Ran 8 tests in 368.825s
2020-10-30T22:49:00.253 INFO:tasks.cephfs_test_runner:
2020-10-30T22:49:00.253 INFO:tasks.cephfs_test_runner:FAILED (failures=3)
2020-10-30T22:49:00.253 INFO:tasks.cephfs_test_runner:
2020-10-30T22:49:00.253 INFO:tasks.cephfs_test_runner:======================================================================
2020-10-30T22:49:00.253 INFO:tasks.cephfs_test_runner:FAIL: test_client_cache_size (tasks.cephfs.test_client_limits.TestClientLimits)
2020-10-30T22:49:00.254 INFO:tasks.cephfs_test_runner:----------------------------------------------------------------------
2020-10-30T22:49:00.254 INFO:tasks.cephfs_test_runner:Require FUSE client to inject client release failure
2020-10-30T22:49:00.254 INFO:tasks.cephfs_test_runner:======================================================================
2020-10-30T22:49:00.254 INFO:tasks.cephfs_test_runner:FAIL: test_client_oldest_tid (tasks.cephfs.test_client_limits.TestClientLimits)
2020-10-30T22:49:00.255 INFO:tasks.cephfs_test_runner:----------------------------------------------------------------------
2020-10-30T22:49:00.255 INFO:tasks.cephfs_test_runner:Require FUSE client to inject client release failure
2020-10-30T22:49:00.255 INFO:tasks.cephfs_test_runner:======================================================================
2020-10-30T22:49:00.255 INFO:tasks.cephfs_test_runner:FAIL: test_client_release_bug (tasks.cephfs.test_client_limits.TestClientLimits)
2020-10-30T22:49:00.255 INFO:tasks.cephfs_test_runner:----------------------------------------------------------------------
2020-10-30T22:49:00.256 INFO:tasks.cephfs_test_runner:Require FUSE client to inject client release failure
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Patrick Donnelly [Fri, 30 Oct 2020 21:54:44 +0000 (14:54 -0700)]
qa: fix mount directory
Otherwise we get this error:
2020-10-30T03:57:44.256 INFO:teuthology.run_tasks:Running task exec...
2020-10-30T03:57:44.265 INFO:teuthology.task.exec:Executing custom commands...
2020-10-30T03:57:44.266 INFO:teuthology.task.exec:Running commands on role client.0 host ubuntu@smithi095.front.sepia.ceph.com
2020-10-30T03:57:44.266 INFO:teuthology.orchestra.run.smithi095:> sudo TESTDIR=/home/ubuntu/cephtest bash -c 'cd $TESTDIR/mnt.* && dd if=/dev/zero of=./foo count=100'
2020-10-30T03:57:44.380 INFO:teuthology.orchestra.run.smithi095.stderr:bash: line 0: cd: too many arguments
From: /ceph/teuthology-archive/pdonnell-2020-10-30_02:26:51-fs-master-distro-basic-smithi/
5573179 /teuthology.log
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Patrick Donnelly [Fri, 30 Oct 2020 21:51:57 +0000 (14:51 -0700)]
qa: rename distro link to distros
This needs to become any overrides present in "mount". Otherwise kclient
distro overrides have no effect.
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Patrick Donnelly [Fri, 30 Oct 2020 21:45:52 +0000 (14:45 -0700)]
qa: accept eio for mount failure
Example:
2020-10-30T03:37:33.752 INFO:tasks.cephfs_test_runner:======================================================================
2020-10-30T03:37:33.752 INFO:tasks.cephfs_test_runner:FAIL: test_mount_mon_and_osd_caps_present_mds_caps_absent (tasks.cephfs.test_multifs_auth.TestClientsWithoutAuth)
2020-10-30T03:37:33.752 INFO:tasks.cephfs_test_runner:----------------------------------------------------------------------
2020-10-30T03:37:33.753 INFO:tasks.cephfs_test_runner:Traceback (most recent call last):
2020-10-30T03:37:33.753 INFO:tasks.cephfs_test_runner: File "/home/teuthworker/src/github.com_batrick_ceph_cephfs-qa-reorg/qa/tasks/cephfs/test_multifs_auth.py", line 311, in test_mount_mon_and_osd_caps_present_mds_caps_absent
2020-10-30T03:37:33.753 INFO:tasks.cephfs_test_runner: self.check_that_mount_failed_for_right_reason(retval[2])
2020-10-30T03:37:33.753 INFO:tasks.cephfs_test_runner: File "/home/teuthworker/src/github.com_batrick_ceph_cephfs-qa-reorg/qa/tasks/cephfs/test_multifs_auth.py", line 269, in check_that_mount_failed_for_right_reason
2020-10-30T03:37:33.753 INFO:tasks.cephfs_test_runner: raise AssertionError('can\'t find expected set of words in the '
2020-10-30T03:37:33.754 INFO:tasks.cephfs_test_runner:AssertionError: can't find expected set of words in the stderr
2020-10-30T03:37:33.754 INFO:tasks.cephfs_test_runner:self.errmsgs - ('permission denied', 'no mds server is up or the cluster is laggy', 'no such file or directory')
2020-10-30T03:37:33.754 INFO:tasks.cephfs_test_runner:stderr - mount error 5 = input/output error
From: /ceph/teuthology-archive/pdonnell-2020-10-30_02:26:51-fs-master-distro-basic-smithi/
5573109 /teuthology.log
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Patrick Donnelly [Fri, 30 Oct 2020 21:40:25 +0000 (14:40 -0700)]
qa: unmount volumes before removal
To avoid potential failures/hangs in umount.
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Patrick Donnelly [Fri, 30 Oct 2020 21:33:37 +0000 (14:33 -0700)]
qa: some sessionmap tests require fuse
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Patrick Donnelly [Fri, 30 Oct 2020 21:19:37 +0000 (14:19 -0700)]
qa: remove obsolete test
Since the work in [1], the kclient can be killed with network namespaces
without powercycling.
[1] https://tracker.ceph.com/issues/44044
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Patrick Donnelly [Fri, 30 Oct 2020 21:13:56 +0000 (14:13 -0700)]
qa: do not fail on test skip
test_client_config only works with ceph-fuse.
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Patrick Donnelly [Fri, 30 Oct 2020 20:46:04 +0000 (13:46 -0700)]
qa: use more OSDs for workload tests
Running out of space / slow requests are less likely this way.
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Patrick Donnelly [Fri, 30 Oct 2020 20:31:30 +0000 (13:31 -0700)]
qa: use 4 clients for volumes testing
The test_volume_client.py tests require 4 clients.
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Patrick Donnelly [Fri, 30 Oct 2020 02:09:59 +0000 (19:09 -0700)]
qa: relocate tasks
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Patrick Donnelly [Fri, 30 Oct 2020 00:15:29 +0000 (17:15 -0700)]
qa: rename workload suite
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Patrick Donnelly [Thu, 29 Oct 2020 23:59:18 +0000 (16:59 -0700)]
qa: relocate new tasks from fs:basic_functional
These were not tested with kcephfs before, let's see if there's any
bugs!
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Patrick Donnelly [Thu, 29 Oct 2020 20:35:54 +0000 (13:35 -0700)]
qa: remove kcephfs suite
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Patrick Donnelly [Thu, 29 Oct 2020 20:35:37 +0000 (13:35 -0700)]
qa: move kcephfs:mixed-clients to fs
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Patrick Donnelly [Thu, 29 Oct 2020 18:28:48 +0000 (11:28 -0700)]
qa: remove dead kcephfs:cephfs suite
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Patrick Donnelly [Thu, 29 Oct 2020 18:28:24 +0000 (11:28 -0700)]
qa: merge snap tests for kcephfs/fs
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Patrick Donnelly [Thu, 29 Oct 2020 18:22:00 +0000 (11:22 -0700)]
qa: add new fs:shell sub-suite
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Patrick Donnelly [Thu, 29 Oct 2020 18:23:05 +0000 (11:23 -0700)]
qa: remove task-less kcephfs:recovery suite
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Patrick Donnelly [Thu, 29 Oct 2020 18:18:32 +0000 (11:18 -0700)]
qa: add new fs:volumes sub-suite
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Patrick Donnelly [Thu, 29 Oct 2020 18:12:44 +0000 (11:12 -0700)]
qa: merge fs:basic_functional and kcephfs:recovery
These functional tests can be run with both clients.
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Patrick Donnelly [Thu, 29 Oct 2020 03:23:40 +0000 (20:23 -0700)]
qa: move kcephfs:thrash to fs:thrash
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Patrick Donnelly [Thu, 29 Oct 2020 03:17:02 +0000 (20:17 -0700)]
qa: reorganize thrash tasks
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Patrick Donnelly [Thu, 29 Oct 2020 03:14:19 +0000 (20:14 -0700)]
qa: move multifs/failover tests
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Patrick Donnelly [Thu, 29 Oct 2020 03:04:12 +0000 (20:04 -0700)]
qa: run norstats workunit for klient
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Patrick Donnelly [Wed, 28 Oct 2020 03:19:36 +0000 (20:19 -0700)]
qa: merge workload tests from kcephfs with fs
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Patrick Donnelly [Wed, 28 Oct 2020 03:10:30 +0000 (20:10 -0700)]
qa: use consistent syntax
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Patrick Donnelly [Fri, 4 Sep 2020 03:15:34 +0000 (20:15 -0700)]
qa: remove inline testing
inline feature is now deprecated and TBR.
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Patrick Donnelly [Fri, 4 Sep 2020 03:33:01 +0000 (20:33 -0700)]
qa: move log-rotate config to common yaml
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Patrick Donnelly [Wed, 28 Oct 2020 03:00:13 +0000 (20:00 -0700)]
qa: remove fuse default permissions config
It is the default.
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Patrick Donnelly [Fri, 4 Sep 2020 03:35:27 +0000 (20:35 -0700)]
qa: fix whitespace
Fixes: https://tracker.ceph.com/issues/23718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Patrick Donnelly [Tue, 3 Nov 2020 20:54:16 +0000 (12:54 -0800)]
Merge PR #37923 into master
* refs/pull/37923/head:
client: increment file position on _read_sync near eof
Reviewed-by: Jeff Layton <jlayton@redhat.com>
Lenz Grimmer [Tue, 3 Nov 2020 12:57:51 +0000 (13:57 +0100)]
Merge pull request #37737 from votdev/issue_47926_orch_status
Reviewed-by: Aashish Sharma <aasharma@redhat.com>
Reviewed-by: Kiefer Chang <kiefer.chang@suse.com>
Lenz Grimmer [Tue, 3 Nov 2020 08:30:52 +0000 (09:30 +0100)]
Merge pull request #37757 from votdev/issue_47884_display_time
mgr/dashboard: displaying time in human-readable format
Reviewed-by: Alfonso Martínez <almartin@redhat.com>
Reviewed-by: Kiefer Chang <kiefer.chang@suse.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>
Mykola Golub [Tue, 3 Nov 2020 07:30:18 +0000 (09:30 +0200)]
Merge pull request #37827 from dillaman/wip-librbd-migration-2
librbd: added API and CLI support for source-spec import migrations
Reviewed-by: Mykola Golub <mgolub@suse.com>
Kefu Chai [Tue, 3 Nov 2020 01:22:08 +0000 (09:22 +0800)]
Merge pull request #37829 from liu-chunmei/seastore-extentmap-fix
crimson/seastore: add Extentmap node type in cache.cc
Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
Jason Dillaman [Sun, 1 Nov 2020 14:44:59 +0000 (09:44 -0500)]
doc/man: add new RBD migration prepare source-spec optionals
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
Jason Dillaman [Mon, 26 Oct 2020 18:47:17 +0000 (14:47 -0400)]
qa/suite/rbd: simple migration import test case
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
Jason Dillaman [Fri, 23 Oct 2020 18:31:36 +0000 (14:31 -0400)]
qa/tasks: support explicit disk configuration for qemu task
The 'disks' key will now be treated as a dictionary where all previous
global settings can be individually applied. Additionally, a disk can be
pre-created and provided for use by qemu.
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
Jason Dillaman [Wed, 21 Oct 2020 18:29:15 +0000 (14:29 -0400)]
rbd: add new 'migration prepare --import-only' CLI optional
The '--import-only' optional can be combined with either a
'--source-spec' or '--source-spec-path' optional to define
the source for the read-only import.
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
Josh Durgin [Mon, 2 Nov 2020 23:58:56 +0000 (15:58 -0800)]
Merge pull request #37872 from theanalyst/doc/releases/v14.2.13
doc: releases: v14.2.13 release notes
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Jan Fajerski <jfajerski@suse.com>
Reviewed-by: Nathan Cutler <ncutler@suse.com>
Patrick Donnelly [Mon, 2 Nov 2020 20:01:29 +0000 (12:01 -0800)]
client: increment file position on _read_sync near eof
Fixes: https://tracker.ceph.com/issues/48076
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
chunmei-liu [Wed, 28 Oct 2020 00:01:37 +0000 (17:01 -0700)]
crimson/seastore: add replay test for extent map tree
Signed-off-by: chunmei-liu <chunmei.liu@intel.com>
Lenz Grimmer [Mon, 2 Nov 2020 18:27:40 +0000 (19:27 +0100)]
Merge pull request #37287 from pcuzner/dashboard-goals
Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Juan Miguel Olmo Martínez <jolmomar@redhat.com>
Reviewed-by: Volker Theile <vtheile@suse.com>
Patrick Donnelly [Mon, 2 Nov 2020 17:00:09 +0000 (09:00 -0800)]
Merge PR #37878 into master
* refs/pull/37878/head:
doc/mgr/orchestrator: Update about "{mds, rgw} add" status in rook
Reviewed-by: Michael Fritch <mfritch@suse.com>
Reviewed-by: Travis Nielsen <tnielsen@redhat.com>
Patrick Donnelly [Mon, 2 Nov 2020 16:24:04 +0000 (08:24 -0800)]
Merge PR #37732 into master
* refs/pull/37732/head:
osdc: restart read on truncate/discard
Reviewed-by: Kefu Chai <kchai@redhat.com>
Abhishek Lekshmanan [Mon, 2 Nov 2020 15:54:23 +0000 (16:54 +0100)]
doc: releases/nautilus: add more changelog entries & fix date
Signed-off-by: Abhishek Lekshmanan <abhishek@suse.com>
Mykola Golub [Mon, 2 Nov 2020 12:53:26 +0000 (14:53 +0200)]
Merge pull request #37898 from lxbsz/nbd_reexpandmeta
rbd-nbd: reexpand the conf meta in child process
Reviewed-by: Jason Dillaman <dillaman@redhat.com>
Reviewed-by: Mykola Golub <mgolub@suse.com>
Igor Fedotov [Mon, 2 Nov 2020 12:26:04 +0000 (15:26 +0300)]
Merge pull request #37547 from ifed01/wip-ifed-fix-bmap-alloc
os/bluestore: fix segfault on out-of-bound offset provided to claim_…
Reviewed-by: Adam Kupczyk <akupczyk@redhat.com>
Kefu Chai [Mon, 2 Nov 2020 10:27:54 +0000 (18:27 +0800)]
Merge pull request #37879 from tchaikov/wip-seastar-v6
crimson: use Seastar API v6
Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
Reviewed-by: Xuehan Xu <xxhdx1985126@gmail.com>
Mykola Golub [Mon, 2 Nov 2020 09:02:59 +0000 (11:02 +0200)]
Merge pull request #37894 from dillaman/wip-48035
librbd: global config overrides do not apply to in-use images
Reviewed-by: Mykola Golub <mgolub@suse.com>
Kefu Chai [Mon, 2 Nov 2020 08:38:23 +0000 (16:38 +0800)]
Merge pull request #37907 from athanatos/sjust/wip-fix-init-weak
crimson/os/seastore/.../lba_btree_node_impl: hold reference to *this during lookup
Reviewed-by: Chunmei Liu <chunmei.liu@intel.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
Samuel Just [Fri, 30 Oct 2020 21:18:27 +0000 (14:18 -0700)]
crimson/os/seastore/.../lba_btree_node_impl: hold reference to *this during lookup
4f2f4f modified TransactionManager::mount() to use a weak_transaction
while calling init_cached_extents. This is fine, but lookup() needs
to hold a reference to *this until the child lookup completes in order
to ensure residence in the lba pinning set.
Signed-off-by: Samuel Just <sjust@redhat.com>
Paul Cuzner [Mon, 2 Nov 2020 04:50:42 +0000 (17:50 +1300)]
doc/dev/dashboard: typo fix
Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
Paul Cuzner [Thu, 22 Oct 2020 23:15:23 +0000 (12:15 +1300)]
doc/dev/dashboard: link designs to dev guide
Add a feature design section to the dashboard dev
guide, linking to the individual docs under dov/dev/dashboard
Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
Paul Cuzner [Mon, 21 Sep 2020 21:19:49 +0000 (09:19 +1200)]
doc/dev/dashboard: Document key UI design principles
Document which defines the high level design principles
for the Ceph Dashboard UI
Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
Kefu Chai [Mon, 2 Nov 2020 03:32:06 +0000 (11:32 +0800)]
Merge pull request #37590 from pcuzner/prometheus-slow-ops
mgr/prometheus: Add SLOW_OPS healthcheck as a metric
Reviewed-by: Kefu Chai <kchai@redhat.com>
Paul Cuzner [Thu, 8 Oct 2020 03:35:45 +0000 (16:35 +1300)]
mgr: add doc to describe relationship to prometheus
The healthcheck for slow_ops is used by the mgr/prometheus
module so it's important the alert generated matches the
format expected in mgr/prometheus.
Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
Paul Cuzner [Thu, 8 Oct 2020 03:30:56 +0000 (16:30 +1300)]
mgr/prometheus: Add healthcheck metric for SLOW_OPS
SLOW_OPS is triggered by op tracker, and generates a health
alert but healthchecks do not create metrics for prometheus to
use as alert triggers. This change adds SLOW_OPS metric, and
provides a simple means to extend to other relevant health
checks in the future
If the extract of the value from the health check message fails
we log an error and remove the metric from the metric set. In
addition the metric description has changed to better reflect
the scenarios where SLOW_OPS can be triggered.
Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
Jason Dillaman [Tue, 27 Oct 2020 00:29:57 +0000 (20:29 -0400)]
rbd: show migration source spec in image status
When using the non-legacy mode image migration, the 'rbd status'
command will now show the source-spec for the migration if the
source pool id is invalid.
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
Jason Dillaman [Wed, 21 Oct 2020 19:15:09 +0000 (15:15 -0400)]
rbd: ensure the help printer doesn't print past the end of the line
When long command names and long optional names are combined,
it's possible for the help text to be printed beyond the 80
character limit.
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
Jason Dillaman [Tue, 27 Oct 2020 18:57:54 +0000 (14:57 -0400)]
librbd: replace direct reads from ImageRequest with ImageDispatchSpec
Read requests will always need to go via the image dispatch layer to
ensure migrations are properly handled.
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
Jason Dillaman [Fri, 30 Oct 2020 14:27:32 +0000 (10:27 -0400)]
librbd/migration: allow native v2 images to be opened by name
When in import-only mode, allow the source image to be opened by
name in case it's a V2 image format w/o its image id specified in
the JSON.
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
Jason Dillaman [Fri, 30 Oct 2020 14:00:27 +0000 (10:00 -0400)]
librbd/migration: close image if failed to open source
To remain consistent between the native which will automatically close
the ImageCtx if it fails to open the image, the raw format should also
close the ImageCtx.
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
Jason Dillaman [Tue, 27 Oct 2020 15:27:33 +0000 (11:27 -0400)]
librbd/migration: initialize default source image layout
For backing source formats that don't care about the low-level
RBD layout details, provide safe default values for the layout
to ensure that striping logic will still function.
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
Jason Dillaman [Wed, 21 Oct 2020 19:58:20 +0000 (15:58 -0400)]
librbd: fix potential recursive lock of ImageCtx::image_lock
If the "list_snaps" IO call returns immediately, it can result
in an attempted recursive lock starting from CopyupRequest.
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
Jason Dillaman [Wed, 21 Oct 2020 19:39:34 +0000 (15:39 -0400)]
librbd: migration execute should allow "EXECUTING" state
If the "execute" action was interrupted, we need to be able to
restart it.
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
Jason Dillaman [Wed, 21 Oct 2020 19:32:47 +0000 (15:32 -0400)]
librbd: read image size when opening migration raw format
The migration prepare operation will need to know the size of
the source image so that it can create the destination image.
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
Jason Dillaman [Tue, 27 Oct 2020 01:25:02 +0000 (21:25 -0400)]
librbd/migration: fix issue with prepare-import and native-format
When preparing an import, the native format can not expect to have
the child image opened since it hasn't been created yet.
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
Jason Dillaman [Wed, 21 Oct 2020 18:27:38 +0000 (14:27 -0400)]
librbd: added new 'migration_prepare_import' API methods
These related methods accept a JSON-encoded source-spec that
describes how to read from the source image. The migration is
a one-way operation, so children/groups/etc won't be updated
to point to the new destination image.
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
Jason Dillaman [Wed, 21 Oct 2020 18:16:57 +0000 (14:16 -0400)]
librbd: store migration progress on both source+destination images
When migrating from an import-only source-spec like an external
file, it will be important to also store the progress on the
destination image.
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
Jason Dillaman [Tue, 20 Oct 2020 22:45:04 +0000 (18:45 -0400)]
librbd: switch to migration source-spec builder
Remove all the hard-coded references to the NativeFormat
handler and allow the RawFormat+FileStream handlers to be
instantiated.
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
Jason Dillaman [Tue, 20 Oct 2020 21:46:02 +0000 (17:46 -0400)]
librbd: switch migration native format to use source-spec JSON object
Remove the dependency on the MigrationInfo struct and allow the native
format to directly pull parameters from the JSON-encoded object
structure.
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
Jason Dillaman [Tue, 20 Oct 2020 21:05:30 +0000 (17:05 -0400)]
librbd: tweak native-format migration source-spec
Renamed "format" property to "type" and always include the original
"image_id".
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
Jason Dillaman [Tue, 20 Oct 2020 06:55:36 +0000 (02:55 -0400)]
librbd/api: ImageCtx must be closed instead of deleted
On error, the destination image context will need to be properly
closed instead of just deleted.
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
Jason Dillaman [Sun, 1 Nov 2020 13:42:01 +0000 (08:42 -0500)]
Merge pull request #36626 from lixiaoy1/rbd_image_discard
librbd/rwl: discard cache
Reviewed-by: Jason Dillaman <dillaman@redhat.com>
Kefu Chai [Sun, 1 Nov 2020 13:23:59 +0000 (21:23 +0800)]
Merge pull request #37870 from xxhdx1985126/wip-crimson-recovery-bug-fixes
crimson: recovery-related bug fixes
Reviewed-by: Kefu Chai <kchai@redhat.com>
Kefu Chai [Sun, 1 Nov 2020 13:21:42 +0000 (21:21 +0800)]
Merge pull request #37910 from tchaikov/wip-crimson-osd-send-pg-temp
crimson/osd: merge pg_temp_wanted before sending them
Reviewed-by: Xuehan Xu <xxhdx1985126@gmail.com>
Yuval Lifshitz [Sun, 1 Nov 2020 10:03:26 +0000 (12:03 +0200)]
Merge pull request #37656 from yuvalif/allow_lua_to_set_metadata
rgw/lua: allow setting metadata via lua
Volker Theile [Thu, 22 Oct 2020 09:17:31 +0000 (11:17 +0200)]
mgr/dashboard: displaying time in human-readable format
Fixes: https://tracker.ceph.com/issues/47884
Signed-off-by: Volker Theile <vtheile@suse.com>
Kefu Chai [Sun, 1 Nov 2020 03:23:13 +0000 (11:23 +0800)]
crimson/osd: merge pg_temp_wanted before sending them
there is chance that new pg_temp_wanted is added when we are sending
them in ShardServices::send_pg_temp(), but the pg temp are already
collected into the messages before seastar::parallel_for_each(). so,
we cannot move all elements from pg_temp_wanted to pg_temp_pending after
sending the messages, that'd also move the unsent pg temps to
pg_temp_pending as well. so we have to do this before do the async call,
this issue was root caused by Xuehan Xu <xxhdx1985126@gmail.com>.
Signed-off-by: Kefu Chai <kchai@redhat.com>
Kefu Chai [Sun, 1 Nov 2020 03:14:48 +0000 (11:14 +0800)]
crimson/osd: drop ShardServices::_sent_pg_temp()
there is no need to define a wrapper for moving pg_temp_wanted to
pg_temp_pending. and it hurts the readability.
Signed-off-by: Kefu Chai <kchai@redhat.com>
Xuehan Xu [Sun, 1 Nov 2020 03:30:38 +0000 (11:30 +0800)]
crimson/osd: report statfs data
Signed-off-by: Xuehan Xu <xxhdx1985126@gmail.com>
Xuehan Xu [Sun, 1 Nov 2020 03:29:57 +0000 (11:29 +0800)]
crimson/osd: rename heartbeat_timer to tick_timer
Signed-off-by: Xuehan Xu <xxhdx1985126@gmail.com>
Xuehan Xu [Sun, 1 Nov 2020 03:27:52 +0000 (11:27 +0800)]
crimson/common: make Gated handle gate_close_exception
Signed-off-by: Xuehan Xu <xxhdx1985126@gmail.com>
Kefu Chai [Sun, 1 Nov 2020 03:08:24 +0000 (11:08 +0800)]
crimson/osd: do not check HAVE_STDLIB_MAP_SPLICING
seastar always uses GCC-9 and up, so there is no need to check for
the existence of map::merge() before using it anymore.
Signed-off-by: Kefu Chai <kchai@redhat.com>
Xuehan Xu [Tue, 20 Oct 2020 06:52:41 +0000 (14:52 +0800)]
crimson/osd: clean up pending_txn when actingset is changed
Signed-off-by: Xuehan Xu <xxhdx1985126@gmail.com>
Xuehan Xu [Tue, 20 Oct 2020 06:51:30 +0000 (14:51 +0800)]
crimson/osd: fix pg info statistics error during recovery
Signed-off-by: Xuehan Xu <xxhdx1985126@gmail.com>
Xuehan Xu [Tue, 20 Oct 2020 10:21:04 +0000 (18:21 +0800)]
crimson/osd: make I/O reqs wait for object recovery if it's degraded or backfilling
Signed-off-by: Xuehan Xu <xxhdx1985126@gmail.com>
Kefu Chai [Sat, 31 Oct 2020 03:43:27 +0000 (11:43 +0800)]
Merge pull request #37327 from kamoltat/wip-mgr-progress-global-event
mgr/progress: Global Recovery Event in ceph -s
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Kefu Chai [Sat, 31 Oct 2020 03:42:35 +0000 (11:42 +0800)]
Merge pull request #37598 from ronen-fr/scrubstore-fix-rf
osd: fix signatures of get_store_errors() and friends
Reviewed-by: David Zafman <dzafman@redhat.com>
Reviewed-By: Neha Ojha <nojha@redhat.com>
Kefu Chai [Sat, 31 Oct 2020 03:41:19 +0000 (11:41 +0800)]
Merge pull request #37751 from changchengx/nasm_avx512
common: use nasm to build isa-l and isa-l_crypto to prepare for AVX512
Reviewed-by: Kefu Chai <kchai@redhat.com>
Josh Durgin [Fri, 30 Oct 2020 15:15:59 +0000 (08:15 -0700)]
Merge pull request #37773 from sunnyku/wip-f33-pyfix
do_cmake.sh: use python-3.9 with fedora version 33
Reviewed-by: Kefu Chai <kchai@redhat.com>
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Jason Dillaman [Fri, 30 Oct 2020 13:40:10 +0000 (09:40 -0400)]
Merge pull request #37789 from trociny/wip-47827
mgr/rbd_support: create mirror snapshots asynchronously
Reviewed-by: Jason Dillaman <dillaman@redhat.com>