git.apps.os.sepia.ceph.com Git - ceph.git/log
Nathan Cutler [Wed, 19 Jul 2017 10:50:22 +0000 (12:50 +0200)]
Merge pull request #14540 from smithfarm/wip-18910-kraken
kraken: rbd-nbd: check /sys/block/nbdX/size to ensure kernel mapped correctly
Reviewed-by: Mykola Golub <mgolub@mirantis.com>
Nathan Cutler [Wed, 19 Jul 2017 06:05:11 +0000 (08:05 +0200)]
Merge pull request #16140 from smithfarm/wip-20271-kraken
kraken: tests: LibRadosMiscConnectFailure.ConnectFailure hang
Reviewed-by: Kefu Chai <kchai@redhat.com>
Nathan Cutler [Tue, 18 Jul 2017 21:14:59 +0000 (23:14 +0200)]
Merge pull request #16112 from smithfarm/wip-20499-kraken
kraken: tests: ObjectStore/StoreTest.OnodeSizeTracking/2 fails on bluestore
Reviewed-by: Sage Weil <sage@redhat.com>
Yehuda Sadeh [Tue, 18 Jul 2017 20:58:14 +0000 (13:58 -0700)]
Merge pull request #14960 from yehudasa/wip-19704
kraken: civetweb: move to post 1.8 version
Reviewed-by: Yehuda Sadeh <yehuda@redhat.com>
Nathan Cutler [Tue, 18 Jul 2017 17:50:46 +0000 (19:50 +0200)]
Merge pull request #16143 from smithfarm/wip-20365-kraken
kraken: mon: osd crush set crushmap need sanity check
Reviewed-by: Loic Dachary <ldachary@redhat.com>
Nathan Cutler [Tue, 18 Jul 2017 17:48:51 +0000 (19:48 +0200)]
Merge pull request #16138 from smithfarm/wip-20034-kraken
kraken: ceph-disk: Racing between partition creation & device node creation
Reviewed-by: Loic Dachary <ldachary@redhat.com>
Nathan Cutler [Tue, 18 Jul 2017 17:47:34 +0000 (19:47 +0200)]
Merge pull request #16135 from smithfarm/wip-20010-kraken
kraken: ceph-disk: separate ceph-osd --check-needs-* logs
Reviewed-by: Loic Dachary <ldachary@redhat.com>
Sage Weil [Sun, 9 Jul 2017 02:39:59 +0000 (21:39 -0500)]
Merge pull request #16215 from liewegas/wip-17743
kraken: CMakeLists.txt: disable memstore make check test
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Sage Weil [Fri, 7 Jul 2017 15:38:25 +0000 (11:38 -0400)]
CMakeLists.txt: disable memstore make check test
This fails frequently on kraken, but is probably a wontfix.
See http://tracker.ceph.com/issues/17743
Signed-off-by: Sage Weil <sage@redhat.com>
Nathan Cutler [Fri, 7 Jul 2017 07:49:51 +0000 (09:49 +0200)]
Merge pull request #14067 from asheplyakov/19322-bp-kraken
kraken: rgw: fix break inside of yield in RGWFetchAllMetaCR
Reviewed-by: Orit Wasserman <owasserm@redhat.com>
Nathan Cutler [Fri, 7 Jul 2017 07:48:57 +0000 (09:48 +0200)]
Merge pull request #14509 from smithfarm/wip-18499-kraken
kraken: rgw: Realm set does not create a new period
Reviewed-by: Orit Wasserman <owasserm@redhat.com>
Nathan Cutler [Fri, 7 Jul 2017 07:48:01 +0000 (09:48 +0200)]
Merge pull request #14511 from smithfarm/wip-18772-kraken
kraken: rgw: crash when updating period with placement group
Reviewed-by: Orit Wasserman <owasserm@redhat.com>
Nathan Cutler [Fri, 7 Jul 2017 07:45:15 +0000 (09:45 +0200)]
Merge pull request #14513 from smithfarm/wip-18843-kraken
kraken: rgw: usage stats and quota are not operational for multi-tenant users
Reviewed-by: Orit Wasserman <owasserm@redhat.com>
Nathan Cutler [Fri, 7 Jul 2017 07:44:21 +0000 (09:44 +0200)]
Merge pull request #14515 from smithfarm/wip-18904-kraken
kraken: rgw: first write also tries to read object
Reviewed-by: Orit Wasserman <owasserm@redhat.com>
Nathan Cutler [Fri, 7 Jul 2017 07:42:36 +0000 (09:42 +0200)]
Merge pull request #14516 from smithfarm/wip-18909-kraken
kraken: rgw: the swift container acl does not support field .ref
Reviewed-by: Orit Wasserman <owasserm@redhat.com>
Reviewed-by: Radoslaw Zarzynski <rzarzynski@mirantis.com>
Nathan Cutler [Fri, 7 Jul 2017 07:25:43 +0000 (09:25 +0200)]
Merge pull request #14517 from smithfarm/wip-19047-kraken
kraken: rgw: fix use of marker in List::list_objects()
Reviewed-by: Orit Wasserman <owasserm@redhat.com>
Nathan Cutler [Fri, 7 Jul 2017 07:24:13 +0000 (09:24 +0200)]
Merge pull request #14524 from smithfarm/wip-19477-kraken
kraken: rgw: S3 v4 authentication issue with X-Amz-Expires
Reviewed-by: Orit Wasserman <owasserm@redhat.com>
Reviewed-by: Radoslaw Zarzynski <rzarzynski@mirantis.com>
Nathan Cutler [Fri, 7 Jul 2017 07:22:26 +0000 (09:22 +0200)]
Merge pull request #14525 from smithfarm/wip-19479-kraken
kraken: rgw: zonegroupmap set does not work
Reviewed-by: Orit Wasserman <owasserm@redhat.com>
Nathan Cutler [Fri, 7 Jul 2017 07:21:22 +0000 (09:21 +0200)]
Merge pull request #14528 from smithfarm/wip-19534-kraken
kraken: rgw: Error parsing xml when get bucket lifecycle
Reviewed-by: Daniel Gryniewicz <dang@redhat.com>
Reviewed-by: Orit Wasserman <owasserm@redhat.com>
Nathan Cutler [Fri, 7 Jul 2017 07:19:08 +0000 (09:19 +0200)]
Merge pull request #14606 from asheplyakov/19608-kraken
kraken: rgw: multisite: fetch_remote_obj() gets wrong version when copying from remote
Reviewed-by: Orit Wasserman <owasserm@redhat.com>
Nathan Cutler [Fri, 7 Jul 2017 07:18:10 +0000 (09:18 +0200)]
Merge pull request #15384 from asheplyakov/20015-bp-kraken
kraken: cls/rgw: list_plain_entries() stops before bi_log entries
Reviewed-by: Orit Wasserman <owasserm@redhat.com>
Nathan Cutler [Fri, 7 Jul 2017 07:03:16 +0000 (09:03 +0200)]
Merge pull request #14522 from smithfarm/wip-19472-kraken
kraken: rgw: add the remove-x-delete feature to cancel swift object expiration
Reviewed-by: Radoslaw Zarzynski <rzarzynski@mirantis.com>
Nathan Cutler [Fri, 7 Jul 2017 07:00:43 +0000 (09:00 +0200)]
Merge pull request #14523 from smithfarm/wip-19475-kraken
kraken: rgw: multisite: EPERM when trying to read SLO objects as system/admin user
Reviewed-by: Radoslaw Zarzynski <rzarzynski@mirantis.com>
Nathan Cutler [Fri, 7 Jul 2017 06:58:44 +0000 (08:58 +0200)]
Merge pull request #14519 from smithfarm/wip-19175-kraken
kraken: swift API: cannot disable object versioning with empty X-Versions-Location
Reviewed-by: Radoslaw Zarzynski <rzarzynski@mirantis.com>
Nathan Cutler [Fri, 7 Jul 2017 06:56:47 +0000 (08:56 +0200)]
Merge pull request #14526 from smithfarm/wip-19524-kraken
kraken: rgw: 'radosgw-admin zone create' command with specified zone-id creates a zone with different id
Reviewed-by: Radoslaw Zarzynski <rzarzynski@mirantis.com>
Nathan Cutler [Fri, 7 Jul 2017 06:55:30 +0000 (08:55 +0200)]
Merge pull request #14529 from smithfarm/wip-19573-kraken
kraken: rgw: Response header of swift API returned by radosgw does not contain x-openstack-request-id. But Swift returns it.
Reviewed-by: Radoslaw Zarzynski <rzarzynski@mirantis.com>
Nathan Cutler [Thu, 6 Jul 2017 15:46:06 +0000 (17:46 +0200)]
Merge pull request #16113 from smithfarm/wip-19807-kraken
kraken: tests: remove hard-coded image name from TestLibRBD.Mirror
Reviewed-by: Jason Dillaman <dillaman@redhat.com>
Reviewed-by: Mykola Golub <mgolub@mirantis.com>
Nathan Cutler [Thu, 6 Jul 2017 07:28:26 +0000 (09:28 +0200)]
Merge pull request #16132 from smithfarm/wip-20522-kraken
kraken: FAILED assert(object_contexts.empty()) (live on master only from Jan-Feb 2017, all other instances are different)
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Nathan Cutler [Thu, 6 Jul 2017 07:27:37 +0000 (09:27 +0200)]
Merge pull request #15962 from asheplyakov/20443-bp-kraken
kraken: osd: unlock sdata_op_ordering_lock with sdata_lock hold to avoid missing wakeup signal
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Nathan Cutler [Thu, 6 Jul 2017 07:26:40 +0000 (09:26 +0200)]
Merge pull request #15066 from asheplyakov/19916-bp-kraken
kraken: osd: osd/PrimaryLogPG: do not call on_shutdown() if (pg.deleting)
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Nathan Cutler [Thu, 6 Jul 2017 07:24:26 +0000 (09:24 +0200)]
Merge pull request #14942 from shinobu-x/wip-18293-kraken
kraken: osd: leaked MOSDMap
Reviewed-by: Sage Weil <sage@redhat.com>
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Nathan Cutler [Thu, 6 Jul 2017 07:20:56 +0000 (09:20 +0200)]
Merge pull request #14736 from smithfarm/wip-19326-kraken
kraken: bluestore bdev: flush no-op optimization is racy
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Nathan Cutler [Thu, 6 Jul 2017 07:19:50 +0000 (09:19 +0200)]
Merge pull request #14732 from smithfarm/wip-19560-kraken
kraken: objecter: full_try behavior not consistent with osd
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Nathan Cutler [Thu, 6 Jul 2017 07:18:24 +0000 (09:18 +0200)]
Merge pull request #13542 from shinobu-x/wip-18929-kraken
kraken: osd: bogus assert when checking acting set on recovery completion in rados/upgrade
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Nathan Cutler [Thu, 6 Jul 2017 07:11:09 +0000 (09:11 +0200)]
Merge pull request #14939 from rzarzynski/wip-rgw-19754-kraken
kraken: rgw: fix RadosGW hang during multi-chunk upload of AWSv4
Reviewed-by: Matt Benjamin <mbenjamin@redhat.com>
Nathan Cutler [Thu, 6 Jul 2017 06:54:16 +0000 (08:54 +0200)]
Merge pull request #14530 from smithfarm/wip-19574-kraken
kraken: rgw: unsafe access in RGWListBucket_ObjStore_SWIFT::send_response()
Reviewed-by: Yehuda Sadeh <yehuda@redhat.com>
Nathan Cutler [Thu, 6 Jul 2017 06:40:16 +0000 (08:40 +0200)]
Merge pull request #13868 from smithfarm/wip-19160-kraken
kraken: rgw: multisite: RGWMetaSyncShardControlCR gives up on EIO
Reviewed-by: Orit Wasserman <owasserm@redhat.com>
Loic Dachary [Sat, 18 Mar 2017 09:04:20 +0000 (10:04 +0100)]
mon: osd crush set crushmap need sanity check
The sanity check verifying the new crushmap does not remove crush rules
that are in use is not exclusive to ceph setcrushmap.
Fixes: http://tracker.ceph.com/issues/19302
Signed-off-by: Loic Dachary <loic@dachary.org>
(cherry picked from commit ed760457bf154c10adf75c6df046eecab7eb8e4b)
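The sanity check described in this commit can be sketched as a set comparison: a new crushmap is rejected if it drops a rule that a pool still references. This is only an illustrative Python sketch; the function names and the representation of rules as integer ids are assumptions, and the monitor's actual implementation is not shown in this log.

```python
def rules_removed_in_use(old_rules, new_rules, rules_in_use):
    """Return the in-use rule ids that the new crushmap would drop."""
    removed = set(old_rules) - set(new_rules)
    return sorted(removed & set(rules_in_use))


def check_new_crushmap(old_rules, new_rules, rules_in_use):
    """Raise ValueError if the new map removes a rule that is still in use.

    Hypothetical helper: the same check is meant to run for every
    command that installs a crushmap, not just one of them.
    """
    missing = rules_removed_in_use(old_rules, new_rules, rules_in_use)
    if missing:
        raise ValueError("new crushmap removes rules in use: %s" % missing)
```

Keeping the check in one helper is what lets several commands share it, which is the point the commit message makes.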
Sage Weil [Tue, 6 Jun 2017 19:13:50 +0000 (15:13 -0400)]
ceph_test_rados_api_misc: fix LibRadosMiscConnectFailure.ConnectFailure retry
Fixes: http://tracker.ceph.com/issues/19901
Signed-off-by: Sage Weil <sage@redhat.com>
(cherry picked from commit 087dff80cac707ee9bcb5bcfc98cb3ec047bd49f)
Erwan Velu [Fri, 31 Mar 2017 12:54:33 +0000 (14:54 +0200)]
ceph-disk: Adding retry loop in get_partition_dev()
There are very rare cases where get_partition_dev() is called before the actual partition is available in /sys/block/<device>.
It appears that waiting a very short time is usually enough for the partition to be populated.
Analysis:
update_partition() is supposed to be enough to avoid any race between the events sent by parted/sgdisk/partprobe and
the actual creation of the /sys/block/<device>/* entry.
On our CI that race occurs pretty often, but it has never been possible to reproduce it locally.
This patch is more a workaround than a fix for the real problem.
It retries after a very short delay to give the device a chance to appear.
This approach has been successful on the CI.
Note this patch does not change the timing when the device is created on time; it only adds a delay of 1/5th of a second up to 2 seconds when the bug occurs.
A typical output from the build running on a CI with that code:
command_check_call: Running command: /usr/bin/udevadm settle --timeout=600
get_dm_uuid: get_dm_uuid /dev/sda uuid path is /sys/dev/block/8:0/dm/uuid
get_partition_dev: Try 1/10 : partition 2 for /dev/sda does not in /sys/block/sda
get_partition_dev: Found partition 2 for /dev/sda after 1 tries
get_dm_uuid: get_dm_uuid /dev/sda uuid path is /sys/dev/block/8:0/dm/uuid
get_dm_uuid: get_dm_uuid /dev/sda2 uuid path is /sys/dev/block/8:2/dm/uuid
fixes: #19428
Signed-off-by: Erwan Velu <erwan@redhat.com>
(cherry picked from commit 93e7b95ed8b4c78daebf7866bb1f0826d7199075)
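The retry approach described in this commit can be sketched as a bounded polling loop. This is a minimal Python sketch, not the ceph-disk code itself: the function name wait_for_partition and its parameters are hypothetical, and the defaults (10 tries, 0.2 s apart) are inferred from the "Try 1/10" log line and the "1/5th up to 2 seconds" remark in the commit message.

```python
import os
import time


def wait_for_partition(sys_block_dir, partition_name, tries=10, delay=0.2):
    """Poll sys_block_dir until partition_name appears or tries run out.

    Returns the attempt number on which the entry was found and raises
    RuntimeError, naming the directory searched, if it never appears.
    """
    path = os.path.join(sys_block_dir, partition_name)
    for attempt in range(1, tries + 1):
        if os.path.exists(path):
            return attempt
        if attempt < tries:
            # Give the kernel/udev a short moment to create the entry.
            time.sleep(delay)
    raise RuntimeError(
        "partition %s does not appear to exist in %s"
        % (partition_name, sys_block_dir)
    )
```

When the entry already exists, the first check succeeds and no sleep happens, so the common path is not slowed down.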
Erwan Velu [Wed, 22 Mar 2017 09:11:44 +0000 (10:11 +0100)]
ceph-disk: Reporting /sys directory in get_partition_dev()
When get_partition_dev() fails, it reports the following message:
ceph_disk.main.Error: Error: partition 2 for /dev/sdb does not appear to exist
The code searches for a directory inside /sys/block/get_dev_name(os.path.realpath(dev)).
The issue is that the error message does not report that path on failure, even though the path may be relevant to the failure.
This patch reports where the code was looking when it tried to determine whether the partition was available.
Signed-off-by: Erwan Velu <erwan@redhat.com>
(cherry picked from commit 413c9fcfbe8e6ab33d73b8428090ccacc33c5d15)
Loic Dachary [Tue, 9 May 2017 10:32:51 +0000 (12:32 +0200)]
ceph-disk: separate ceph-osd --check-needs-* logs
It is using the OSD id zero but has nothing to do with OSD zero, and
this is confusing to the user. The logs themselves do not need to be kept
around and are stored in the run directory so that they can be disposed
of after reboot.
Fixes: http://tracker.ceph.com/issues/19888
Signed-off-by: Loic Dachary <loic@dachary.org>
(cherry picked from commit c7b3c46bd63b78475868e405bf20d9c142f0336a)
Samuel Just [Tue, 14 Feb 2017 20:47:37 +0000 (12:47 -0800)]
ReplicatedBackend: don't queue Context outside of ObjectStore with obc
We only flush the ObjectStore callbacks, not everything else. Thus,
there isn't a guarantee that the obc held by pull_complete_info will
be cleaned up before the Flush callback is triggered. Instead, just
defer clearing the pull state until the callback (it'll be cleaned up
during the interval change) and remove the ObjectContext from
pull_complete_info.
Introduced: 68defc2b0561414711d4dd0a76bc5d0f46f8a3f8
Fixes: http://tracker.ceph.com/issues/18927
Signed-off-by: Samuel Just <sjust@redhat.com>
(cherry picked from commit 51eee55c475a3d931844831e040ed3d66ee59af4)
Samuel Just [Fri, 3 Feb 2017 21:12:47 +0000 (13:12 -0800)]
osd/: don't leak context for Blessed*Context or RecoveryQueueAsync
This has always been a bug, but until
68defc2b0561414711d4dd0a76bc5d0f46f8a3f8, nothing deleted those contexts
without calling complete().
Fixes: http://tracker.ceph.com/issues/18809
Bug shadowed until: 68defc2b0561414711d4dd0a76bc5d0f46f8a3f8
Signed-off-by: Samuel Just <sjust@redhat.com>
(cherry picked from commit 91b74235027c8a4872dcab6b37767b12c3267061)
Nathan Cutler [Wed, 5 Jul 2017 20:04:41 +0000 (22:04 +0200)]
Merge pull request #15792 from liewegas/wip-bluestore-leaks-kraken
kraken: os/bluestore: deep decode onode value
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Nathan Cutler [Wed, 5 Jul 2017 20:01:58 +0000 (22:01 +0200)]
Merge pull request #15729 from asheplyakov/20315-bp-kraken
kraken: mon: fail to form large quorum; msg/async busy loop
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Nathan Cutler [Wed, 5 Jul 2017 20:00:17 +0000 (22:00 +0200)]
Merge pull request #15084 from asheplyakov/19928-bp-kraken
kraken: mon crash on shutdown, lease_ack_timeout event
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Nathan Cutler [Wed, 5 Jul 2017 19:59:03 +0000 (21:59 +0200)]
Merge pull request #14735 from smithfarm/wip-19685-kraken
kraken: osd: Give requested scrubs a higher priority
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Nathan Cutler [Wed, 5 Jul 2017 19:57:53 +0000 (21:57 +0200)]
Merge pull request #14733 from smithfarm/wip-19561-kraken
kraken: api_misc: [ FAILED ] LibRadosMiscConnectFailure.ConnectFailure
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Nathan Cutler [Wed, 5 Jul 2017 19:51:26 +0000 (21:51 +0200)]
Merge pull request #13883 from shinobu-x/wip-19119-kraken
kraken: pre-jewel "osd rm" incrementals are misinterpreted
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Nathan Cutler [Wed, 5 Jul 2017 17:39:54 +0000 (19:39 +0200)]
Merge pull request #14646 from gregsfortytwo/wip-17916-kraken
kraken: osd: pglog: with config, don't assert in the presence of stale diverg…
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Nathan Cutler [Wed, 5 Jul 2017 16:41:59 +0000 (18:41 +0200)]
Merge pull request #15237 from asheplyakov/20035-bp-kraken
kraken: mon/PGMap: factor mon_osd_full_ratio into MAX AVAIL calc
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Nathan Cutler [Wed, 5 Jul 2017 16:19:05 +0000 (18:19 +0200)]
Merge pull request #14331 from shinobu-x/wip-19396-kraken
kraken: Objecter::epoch_barrier isn't respected in _op_submit()
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Nathan Cutler [Wed, 5 Jul 2017 15:54:30 +0000 (17:54 +0200)]
Merge pull request #16083 from smithfarm/wip-18677-kraken
kraken: osd: metadata reports filestore when using bluestore
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Nathan Cutler [Wed, 5 Jul 2017 15:51:36 +0000 (17:51 +0200)]
Merge pull request #16098 from smithfarm/wip-19621-kraken
kraken: rbd-nbd: add signal handler
Reviewed-by: Jason Dillaman <dillaman@redhat.com>
Nathan Cutler [Wed, 5 Jul 2017 15:50:21 +0000 (17:50 +0200)]
Merge pull request #15612 from smithfarm/wip-19611-kraken
kraken: rbd: Issues with C API image metadata retrieval functions
Reviewed-by: Jason Dillaman <dillaman@redhat.com>
Reviewed-by: Mykola Golub <mgolub@mirantis.com>
Nathan Cutler [Wed, 5 Jul 2017 15:48:24 +0000 (17:48 +0200)]
Merge pull request #14833 from smithfarm/wip-19794-kraken
kraken: tests: test_notify.py: assert(not image.is_exclusive_lock_owner()) on line 147
Reviewed-by: Jason Dillaman <dillaman@redhat.com>
Nathan Cutler [Wed, 5 Jul 2017 15:26:58 +0000 (17:26 +0200)]
Merge pull request #14543 from smithfarm/wip-19173-kraken
kraken: rbd: rbd_clone_copy_on_read ineffective with exclusive-lock
Reviewed-by: Jason Dillaman <dillaman@redhat.com>
Nathan Cutler [Wed, 5 Jul 2017 07:37:01 +0000 (09:37 +0200)]
Merge pull request #14974 from dillaman/wip-19872-kraken
kraken: rbd-mirror: failover and failback of unmodified image results in split-brain
Reviewed-by: Mykola Golub <mgolub@mirantis.com>
Nathan Cutler [Wed, 5 Jul 2017 07:36:14 +0000 (09:36 +0200)]
Merge pull request #14934 from dillaman/wip-19833-kraken
kraken: cls_rbd: default initialize snapshot namespace for legacy clients
Reviewed-by: Mykola Golub <mgolub@mirantis.com>
Nathan Cutler [Wed, 5 Jul 2017 07:30:07 +0000 (09:30 +0200)]
Merge pull request #14539 from smithfarm/wip-18771-kraken
kraken: rbd: Improve compatibility between librbd + krbd for the data pool
Reviewed-by: Jason Dillaman <dillaman@redhat.com>
Jason Dillaman [Thu, 27 Apr 2017 20:21:33 +0000 (16:21 -0400)]
test: remove hard-coded image name from RBD metadata test
Fixes: http://tracker.ceph.com/issues/19798
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
(cherry picked from commit 8f72e745e157cc12d76ca6babe956c5698ee297f)
Mykola Golub [Thu, 2 Mar 2017 16:18:18 +0000 (17:18 +0100)]
librbd: relax "is parent mirrored" check when enabling mirroring for pool
If the parent is in the same pool and has the journaling feature enabled
we can assume the mirroring will eventually be enabled for it.
Fixes: http://tracker.ceph.com/issues/19130
Signed-off-by: Mykola Golub <mgolub@mirantis.com>
(cherry picked from commit fe31bca22f90ce02f461d6421a4f66539db888d3)
Mykola Golub [Mon, 9 Jan 2017 09:40:46 +0000 (10:40 +0100)]
rbd-mirror: check image mirroring state when bootstrapping
Fixes: http://tracker.ceph.com/issues/18447
Signed-off-by: Mykola Golub <mgolub@mirantis.com>
(cherry picked from commit 5fc5a8ac895524f05eed6e7db20b0dda3a8cb60f)
Mykola Golub [Mon, 9 Jan 2017 08:23:19 +0000 (09:23 +0100)]
rbd-mirror: async request to test if image is primary
Signed-off-by: Mykola Golub <mgolub@mirantis.com>
(cherry picked from commit 0a1cb35caacdf85029f31a0364dc07a5d7462f5f)
Mykola Golub [Mon, 9 Jan 2017 14:02:02 +0000 (15:02 +0100)]
rbd-mirror: hold owner lock when testing if lock owner
Signed-off-by: Mykola Golub <mgolub@mirantis.com>
(cherry picked from commit 82aa89668d666f434cd19ff444223017b5512c6a)
xie xingguo [Tue, 27 Dec 2016 11:39:51 +0000 (19:39 +0800)]
os/bluestore: fix OnodeSizeTracking testing
[ FAILED ] 1 test, listed below:
[ FAILED ] ObjectStore/StoreTest.OnodeSizeTracking/2, where GetParam() = "bluestore"
1 FAILED TEST
The above test failure happens as the bluestore mount() process
will try to load all collections and put them into the coll_map,
which will be also tracked as mempool::bluestore_meta_other.
So total_bytes from mempool won't be equal to zero.
Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>
(cherry picked from commit 1d97862f716ce24bfe227b192ecc5e1eb3f76757)
Sage Weil [Fri, 24 Mar 2017 16:15:50 +0000 (12:15 -0400)]
os/bluestore/KernelDevice: fix uninit value
Signed-off-by: Sage Weil <sage@redhat.com>
(cherry picked from commit 42db0c70bc7ef595f0925657c043ce081799b2b9)
Conflicts:
src/os/bluestore/KernelDevice.h
- flush_lock mutex is still used in kraken, so keep it
Sage Weil [Thu, 9 Mar 2017 21:51:21 +0000 (16:51 -0500)]
os/bluestore/BlueFS: fix flush_bdev placement
We need to flush any new writes on any fsync(). Notably, this includes
the rocksdb log. However, previously _fsync was only doing a bdev flush if
we also had a dirty bluefs journal and called into _sync_and_flush_journal.
If we didn't, we weren't doing a flush() at all, which could lead to
corrupted data.
Fix this by moving the first flush_bdev *out* of _sync_and_flush_log. (The
second one is there to flush the bluefs journal; the first one was to
ensure prior writes are stable.) Instead, flush prior writes in all of the
callers prior to calling _sync_and_flush_log. This includes _fsync (and
fixes the bug by covering the non-journal-flush path) as well as several
other callers.
Fixes: http://tracker.ceph.com/issues/19250
Signed-off-by: Sage Weil <sage@redhat.com>
(cherry picked from commit 2924a96493d8570317e55854a25fc64911ecf151)
Sage Weil [Thu, 9 Mar 2017 21:51:05 +0000 (16:51 -0500)]
os/bluestore/KernelDevice: make flush() thread safe
flush() may be called from multiple racing threads (notably, rocksdb can call fsync via
bluefs at any time), and we need to make sure that if one thread sees the io_since_flush
command and does an actual flush, that other racing threads also wait until that flush is
complete. This is accomplished with a simple mutex!
Also, set the flag on IO *completion*, since flush is only a promise about
completed IOs, not submitted IOs.
Document.
Fixes: http://tracker.ceph.com/issues/19251
Signed-off-by: Sage Weil <sage@redhat.com>
(cherry picked from commit 6b3c52643c8e5fa820c53d96608862b7649c3fd0)
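The locking scheme described in this commit can be illustrated with a toy model. This is a Python sketch under stated assumptions, not KernelDevice itself (which is C++): the class Device and its methods are hypothetical, and only the io_since_flush name comes from the commit message.

```python
import threading


class Device:
    """Toy block device: flush() is serialized and skipped when idle."""

    def __init__(self):
        self._flush_lock = threading.Lock()
        self._io_since_flush = False
        self.flush_count = 0

    def on_io_complete(self):
        # Set the flag on completion, not submission: a flush only
        # promises durability for IOs that have already completed.
        self._io_since_flush = True

    def flush(self):
        # Holding the lock for the whole flush means a racing caller
        # cannot return before an in-progress flush has finished.
        with self._flush_lock:
            if not self._io_since_flush:
                return False
            # Clear the flag before flushing so that an IO completing
            # during the flush triggers another flush later.
            self._io_since_flush = False
            self._do_flush()
            return True

    def _do_flush(self):
        # Stand-in for the real sync call on the device.
        self.flush_count += 1
```

A second thread that calls flush() while the first is still flushing blocks on the lock, then finds the flag cleared and returns, which is the "wait until that flush is complete" behavior the message describes.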
Nathan Cutler [Tue, 4 Jul 2017 11:32:46 +0000 (13:32 +0200)]
Merge pull request #14065 from asheplyakov/19212-bp-kraken
kraken: rgw: "cluster [WRN] bad locator @X on object @X...." in cluster log
Reviewed-by: Orit Wasserman <owasserm@redhat.com>
Nathan Cutler [Tue, 4 Jul 2017 11:18:48 +0000 (13:18 +0200)]
Merge pull request #16092 from smithfarm/wip-20150-kraken
kraken: ceph-disk: fails if OSD udev rule triggers prior to mount of /var
Reviewed-by: Loic Dachary <ldachary@redhat.com>
Nathan Cutler [Tue, 4 Jul 2017 09:37:29 +0000 (11:37 +0200)]
Merge pull request #14604 from asheplyakov/19647-kraken
kraken: ceph-disk: enable directory backed OSD at boot time
Reviewed-by: Loic Dachary <ldachary@redhat.com>
Pan Liu [Thu, 30 Mar 2017 16:23:12 +0000 (00:23 +0800)]
rbd-nbd: remove debug message from do_unmap
Global context is not initialized when do_unmap is called.
Signed-off-by: Pan Liu <liupan1111@gmail.com>
(cherry picked from commit cd748f9b260a3c388b45091ff0bfc041257acd0c)
Kefu Chai [Wed, 29 Mar 2017 11:07:34 +0000 (19:07 +0800)]
rbd-nbd: s/cpp_error/cpp_strerror/ to fix FTBFS
the build failure was introduced by ff4dcf0
Signed-off-by: Kefu Chai <kchai@redhat.com>
(cherry picked from commit c12ecd984df0f9ac0d7de81b9d689f6182a3646a)
Pan Liu [Tue, 28 Mar 2017 08:48:21 +0000 (16:48 +0800)]
rbd-nbd: polish the output info before and after ioctl NBD_DISCONNECT.
Signed-off-by: Pan Liu <liupan1111@gmail.com>
(cherry picked from commit ff4dcf029028e8a3636ac71a6c5ac5380bf274e4)
Pan Liu [Tue, 28 Mar 2017 08:33:25 +0000 (16:33 +0800)]
rbd-nbd: support signal handle for SIGHUP, SIGINT, and SIGTERM.
Fixes: http://tracker.ceph.com/issues/19349
Signed-off-by: Pan Liu <liupan1111@gmail.com>
(cherry picked from commit 3ba01aa6ce052d1afa42132feffc2353d73caae6)
Nathan Cutler [Tue, 4 Jul 2017 08:26:43 +0000 (10:26 +0200)]
Merge pull request #14616 from smithfarm/wip-18723-kraken
kraken: osd: calc_clone_subsets misuses try_read_lock vs missing
Reviewed-by: Kefu Chai <kchai@redhat.com>
Loic Dachary [Thu, 1 Jun 2017 09:37:20 +0000 (11:37 +0200)]
ceph-disk: do not setup_statedir on trigger
trigger may run when statedir is unavailable and does not use it.
Fixes: http://tracker.ceph.com/issues/19941
Signed-off-by: Loic Dachary <loic@dachary.org>
(cherry picked from commit 16bfbdd3d9988523bba31aace516c303057daa58)
Nathan Cutler [Mon, 3 Jul 2017 22:02:05 +0000 (00:02 +0200)]
Merge pull request #14852 from yehudasa/wip-rgw-support-ragweed-kraken
kraken: rgw: add apis to support ragweed suite
Reviewed-by: Nathan Cutler <ncutler@suse.com>
Nathan Cutler [Mon, 3 Jul 2017 20:28:08 +0000 (22:28 +0200)]
Merge pull request #13514 from rzarzynski/wip-rgw-controllable-204-kraken
kraken: rgw: make sending Content-Length in 204 and 304 controllable
Reviewed-by: Yehuda Sadeh <yehuda@redhat.com>
Nathan Cutler [Mon, 3 Jul 2017 15:05:34 +0000 (17:05 +0200)]
Merge pull request #14403 from shinobu-x/wip-17331-kraken
kraken: ceph-disk list reports mount error for OSD having mount options with SELinux context
Reviewed-by: Loic Dachary <ldachary@redhat.com>
Wido den Hollander [Mon, 23 Jan 2017 20:06:05 +0000 (21:06 +0100)]
osd: Return correct osd_objectstore in OSD metadata
Do not simply read the configuration value as it might have changed
during OSD startup by reading the type from disk.
Fixes: http://tracker.ceph.com/issues/18638
Signed-off-by: Wido den Hollander <wido@42on.com>
(cherry picked from commit 8fe6a0303b02ac1033f5bfced9f94350fe3e33de)
Conflicts:
src/osd/OSD.cc
- g_conf->osd_objectstore was changed to cct->_conf->osd_objectstore by
1d5e967a05ddbcceb10efe3b57e242b3b6b7eb8c which is not in kraken
Yehuda Sadeh [Fri, 27 Jan 2017 00:26:42 +0000 (16:26 -0800)]
rgw: fix use of marker in List::list_objects()
Fixes: http://tracker.ceph.com/issues/18331
List marker is an index key, so treat it as such. This
fixes infinite loop in orphans find command.
Signed-off-by: Yehuda Sadeh <yehuda@redhat.com>
(cherry picked from commit a5d1fa0587184f43c69d8e03114b58d43f320781)
Radoslaw Zarzynski [Tue, 25 Apr 2017 10:22:54 +0000 (12:22 +0200)]
rgw: fix RadosGW hang during multi-chunk upload of AWSv4.
Fixes: http://tracker.ceph.com/issues/19754
Signed-off-by: Radoslaw Zarzynski <rzarzynski@mirantis.com>
(cherry picked from commit 72c1e2e351d984d0425a20f2c772951cbc36f13e)
liuchang0812 [Mon, 27 Mar 2017 05:08:12 +0000 (13:08 +0800)]
rgw/lifecycle: do not send lifecycle rules when GetLifeCycle failed
Currently, RGW sends two HTTP responses when GetLifeCycle fails. The first one is
an error response such as 404, and the second is the lifecycle rules. This breaks
S3 SDKs and S3 utilities.
Fixes: http://tracker.ceph.com/issues/19363
Signed-off-by: liuchang0812 <liuchang0812@gmail.com>
(cherry picked from commit c3c0c828da5a64ca896475c1b0c369fde1bbd76a)
tone-zhang [Thu, 6 Apr 2017 09:56:05 +0000 (17:56 +0800)]
rgw: fix response header of Swift API
Response header of Swift API returned by radosgw does not contain
"x-openstack-request-id", but Swift returns it. Enhance the
compatibility of radosgw.
Fixes: http://tracker.ceph.com/issues/19443
Signed-off-by: tone-zhang <tone.zhang@linaro.org>
(cherry picked from commit e96db213079ab5e026156ab4b38418d1d4c23d27)
Yehuda Sadeh [Wed, 8 Mar 2017 22:52:34 +0000 (14:52 -0800)]
rgw: fix crash when listing objects via swift
Fixes: http://tracker.ceph.com/issues/19249
Signed-off-by: Yehuda Sadeh <yehuda@redhat.com>
(cherry picked from commit a9ec5e8ce184e19c009863db4d3519f9d8af91bd)
Conflicts:
src/rgw/rgw_rest_swift.cc ("key" element of RGWObjEnt struct
is not a reference; fix)
Jason Dillaman [Thu, 26 Jan 2017 19:02:11 +0000 (14:02 -0500)]
librbd: introduce new constants for tracking max block name prefix
Fixes: http://tracker.ceph.com/issues/18653
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
(cherry picked from commit 2c08629c99d90aa7676b59263c055c9f1f577039)
Pan Liu [Thu, 16 Feb 2017 14:17:52 +0000 (22:17 +0800)]
rbd-nbd: no need create asok file for unmap and list-mapped commands.
Fixes: http://tracker.ceph.com/issues/17951
Signed-off-by: Pan Liu <liupan1111@gmail.com>
(cherry picked from commit 72352653d585ef89043a4ece371b5c0cb3f6f32a)
Mykola Golub [Thu, 2 Feb 2017 10:11:35 +0000 (11:11 +0100)]
rbd-nbd: check /sys/block/nbdX/size to ensure kernel mapped correctly
Fixes: http://tracker.ceph.com/issues/18335
Signed-off-by: Mykola Golub <mgolub@mirantis.com>
(cherry picked from commit 596e5ea8a5df72002672eef0a6d20572ca6f60f0)
Venky Shankar [Mon, 20 Feb 2017 06:34:10 +0000 (12:04 +0530)]
librbd: acquire exclusive-lock during copy on read
Fixes: http://tracker.ceph.com/issues/18888
Signed-off-by: Venky Shankar <vshankar@redhat.com>
(cherry picked from commit 7dba5311b12011a4a6e8564e68150e54c5af5ddd)
Conflicts:
src/librbd/AioImageRequestWQ.h:
- in master this file has morphed into src/librbd/io/ImageRequestWQ.h
- kraken has AioImageRequest<ImageCtx> instead of ImageRequest<ImageCtx>
src/librbd/image/RefreshRequest.cc:
- rename image context element to "aio_work_queue" (from "io_work_queue")
because kraken doesn't have
de95d862f57b56738e04d77f2351622f83f17f4a
src/test/librbd/image/test_mock_RefreshRequest.cc:
- rename image context element to "aio_work_queue" (from "io_work_queue")
because kraken doesn't have
de95d862f57b56738e04d77f2351622f83f17f4a
Samuel Just [Wed, 18 Jan 2017 18:24:13 +0000 (10:24 -0800)]
PrimaryLogPG::try_lock_for_read: give up if missing
The only users, calc_*_subsets, might try to read_lock an object which is
missing on the primary. Returning false in those cases is perfectly
reasonable and avoids the problem.
Fixes: http://tracker.ceph.com/issues/18583
Signed-off-by: Samuel Just <sjust@redhat.com>
(cherry picked from commit 3833440adea6f8bcb0093603c3a9d16360ed57ec)
Samuel Just [Wed, 23 Nov 2016 23:41:13 +0000 (15:41 -0800)]
ReplicatedBackend: take read locks for clone sources during recovery
Otherwise, we run the risk of a clone source which hasn't actually
come into existence yet being used if we grab a clone which *just*
got added to the ssc, but has not yet actually had time to be
created (can't rely on message ordering here since recovery messages
don't necessarily order with client IO!).
Fixes: http://tracker.ceph.com/issues/17831
Signed-off-by: Samuel Just <sjust@redhat.com>
(cherry picked from commit 68defc2b0561414711d4dd0a76bc5d0f46f8a3f8)
Sage Weil [Thu, 30 Mar 2017 17:50:41 +0000 (13:50 -0400)]
osd/PrimaryLogPG: do not expect FULL_TRY ops to get resent
The objecter will not resend FULL_TRY requests that were sent prior to
becoming full, so we should not discard them.
Fixes: http://tracker.ceph.com/issues/19430
Signed-off-by: Sage Weil <sage@redhat.com>
(cherry picked from commit 3f7acdbc9a942fd18937dbcf07fbc7b752c50ba3)
Sage Weil [Fri, 31 Mar 2017 14:06:42 +0000 (10:06 -0400)]
ceph_test_librados_api_misc: fix stupid LibRadosMiscConnectFailure.ConnectFailure test
Sometimes the cond doesn't time out and it wakes up instead. Just repeat
the test many times to ensure that at least once it times out (usually
it does; it's pretty infrequent that it doesn't).
Fixes: http://tracker.ceph.com/issues/15368
Signed-off-by: Sage Weil <sage@redhat.com>
(cherry picked from commit 8bc197400d94ee2716d3f2fa454247379a676cf9)
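The "repeat until it times out at least once" idea from this commit can be sketched with a condition variable. This is an illustrative Python sketch only; the function name and parameters are hypothetical, and the real test is C++ code in ceph_test_rados_api_misc.

```python
import threading


def timed_out_at_least_once(cond, timeout, attempts):
    """Wait on cond up to `attempts` times; True if any wait timed out."""
    for _ in range(attempts):
        with cond:
            # Condition.wait returns False when the timeout expires
            # and True when the waiter was notified.
            if not cond.wait(timeout=timeout):
                return True
    return False
```

A single wait can be woken before its deadline; repeating it makes the test depend on at least one genuine timeout rather than on every wait timing out.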
David Zafman [Mon, 17 Apr 2017 21:58:02 +0000 (14:58 -0700)]
osd: Give requested scrub work a higher priority
Once started we now queue scrub work at higher priority than
scheduled scrubs.
Fixes: http://tracker.ceph.com/issues/15789
Signed-off-by: David Zafman <dzafman@redhat.com>
(cherry picked from commit ebab8b1f4f67fbdec1e147c580329c1e2b5cf7cd)
Conflicts:
src/osd/OSD.h - in kraken, the PGScrub() call is enclosed within
op_wq.queue(make_pair(...)) instead of enqueue_back()
Nathan Cutler [Mon, 3 Jul 2017 09:39:42 +0000 (11:39 +0200)]
Merge pull request #16069 from smithfarm/wip-20345-kraken
kraken: make check fails with Error EIO: load dlopen(build/lib/libec_FAKE.so): build/lib/libec_FAKE.so: cannot open shared object file: No such file or directory
Reviewed-by: Kefu Chai <kchai@redhat.com>
Kyr Shatskyy [Wed, 8 Feb 2017 15:02:52 +0000 (16:02 +0100)]
ceph.spec.in, debian/control: Add bc to build dependencies
bc is needed by ceph-helpers.sh but was missing from the build dependencies.
Fixes: http://tracker.ceph.com/issues/18876
Signed-off-by: Kyr Shatskyy <kyrylo.shatskyy@gmail.com>
(cherry picked from commit 3ff4be6d6896346b7b9ec7f158fcde8866faeb38)
Kefu Chai [Wed, 12 Apr 2017 09:31:52 +0000 (17:31 +0800)]
qa/workunits/ceph-helpers: do not error out if is_clean
it would be a race otherwise, because we cannot be sure whether the cluster
pgs are all clean when run_osd() returns, but we can be sure
that they are expected to become active+clean after a while. that's what
wait_for_clean() does.
Signed-off-by: Kefu Chai <kchai@redhat.com>
(cherry picked from commit 6cb4503a40ae4ebee5690fda024cae8d1a506bce)
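The reasoning in this commit (do not assert cleanliness at one instant, poll until it is reached) can be sketched as follows. The real wait_for_clean() is a shell helper in ceph-helpers.sh; this Python version only illustrates the idea, and the is_clean callback, timeout and interval parameters are assumptions.

```python
import time


def wait_for_clean(is_clean, timeout=300.0, interval=1.0):
    """Poll is_clean() until it returns True or timeout seconds elapse.

    Returns True once the cluster is reported clean, False on timeout.
    """
    deadline = time.monotonic() + timeout
    while True:
        if is_clean():
            return True
        if time.monotonic() >= deadline:
            return False
        time.sleep(interval)
```

A caller that checks is_clean() once right after starting an OSD races with peering; polling with a deadline removes that race while still failing if the cluster never settles.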