]> git.apps.os.sepia.ceph.com Git - ceph.git/log
ceph.git
7 years agomon: osd crush set crushmap need sanity check 16144/head
Loic Dachary [Sat, 18 Mar 2017 09:04:20 +0000 (10:04 +0100)]
mon: osd crush set crushmap need sanity check

The sanity check verifying the new crushmap does not remove crush rules
that are in use is not exclusive to ceph setcrushmap.

Fixes: http://tracker.ceph.com/issues/19302
Signed-off-by: Loic Dachary <loic@dachary.org>
(cherry picked from commit ed760457bf154c10adf75c6df046eecab7eb8e4b)

7 years ago crush: when take place the crush map should consider the rule is in used
songbaisen [Thu, 31 Mar 2016 03:39:44 +0000 (11:39 +0800)]
 crush: when take place the crush map should consider the rule is in used

Signed-off-by: song baisen song.baisen@zte.com.cn
(cherry picked from commit 09bf6f2858d09c8c314a8c242a9c70d2834718dc)

7 years agoMerge pull request #16059 from vumrao/wip-vumrao-20353
Nathan Cutler [Thu, 31 Aug 2017 08:24:21 +0000 (10:24 +0200)]
Merge pull request #16059 from vumrao/wip-vumrao-20353

jewel: osd: Add heartbeat message for Jumbo Frames (MTU 9000)

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
7 years agoMerge pull request #15726 from asheplyakov/20314-bp-jewel
Nathan Cutler [Thu, 31 Aug 2017 08:23:00 +0000 (10:23 +0200)]
Merge pull request #15726 from asheplyakov/20314-bp-jewel

jewel: mon: fail to form large quorum; msg/async busy loop

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Haomai Wang <haomai@xsky.com>
7 years agoMerge pull request #15719 from Vicente-Cheng/wip-20146-jewel
Nathan Cutler [Tue, 29 Aug 2017 16:51:27 +0000 (18:51 +0200)]
Merge pull request #15719 from Vicente-Cheng/wip-20146-jewel

jewel: rgw: 'gc list --include-all' command infinite loop the first 1000 items

Reviewed-by: Casey Bodley <cbodley@redhat.com>
7 years agoMerge pull request #15477 from smithfarm/wip-19767-jewel
Nathan Cutler [Tue, 29 Aug 2017 16:50:21 +0000 (18:50 +0200)]
Merge pull request #15477 from smithfarm/wip-19767-jewel

jewel: rgw: Delete non-empty bucket in slave zonegroup

Reviewed-by: Casey Bodley <cbodley@redhat.com>
7 years agoMerge pull request #16015 from asheplyakov/19208-bp-jewel
Nathan Cutler [Tue, 29 Aug 2017 11:32:21 +0000 (13:32 +0200)]
Merge pull request #16015 from asheplyakov/19208-bp-jewel

jewel: osd: osd_internal_types: wake snaptrimmer on put_read lock, too

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
7 years agoMerge pull request #15762 from smithfarm/wip-17385-jewel
Nathan Cutler [Tue, 29 Aug 2017 11:31:05 +0000 (13:31 +0200)]
Merge pull request #15762 from smithfarm/wip-17385-jewel

jewel: ceph cli: Rados object in state configuring race

Reviewed-by: Kefu Chai <kchai@redhat.com>
7 years agoMerge pull request #15474 from smithfarm/wip-19559-jewel
Nathan Cutler [Tue, 29 Aug 2017 11:28:29 +0000 (13:28 +0200)]
Merge pull request #15474 from smithfarm/wip-19559-jewel

jewel: osd: objecter full_try behavior not consistent with osd

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
7 years agoMerge pull request #15947 from asheplyakov/20428-bp-jewel
Nathan Cutler [Tue, 29 Aug 2017 07:15:37 +0000 (09:15 +0200)]
Merge pull request #15947 from asheplyakov/20428-bp-jewel

jewel: osd: unlock sdata_op_ordering_lock with sdata_lock hold to avoid missing wakeup signal

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Liu-Chunmei <chunmei.liu@intel.com>
Reviewed-by: Xie Xingguo <xie.xingguo@zte.com.cn>
7 years agoMerge pull request #15475 from tchaikov/wip-16239-jewel
Nathan Cutler [Tue, 29 Aug 2017 07:04:24 +0000 (09:04 +0200)]
Merge pull request #15475 from tchaikov/wip-16239-jewel

qa: add a sleep after restarting osd before "tell"ing it

Reviewed-by: Nathan Cutler <ncutler@suse.com>
7 years agoMerge pull request #15473 from smithfarm/wip-20080-jewel
Nathan Cutler [Tue, 29 Aug 2017 06:56:01 +0000 (08:56 +0200)]
Merge pull request #15473 from smithfarm/wip-20080-jewel

jewel: build/ops: rpm: set subman cron attributes in spec file

Reviewed-by: Ken Dreyer <kdreyer@redhat.com>
Reviewed-by: Thomas Serlin <tserlin@redhat.com>
7 years agoMerge pull request #15460 from smithfarm/wip-17843-jewel
Nathan Cutler [Mon, 28 Aug 2017 20:34:52 +0000 (22:34 +0200)]
Merge pull request #15460 from smithfarm/wip-17843-jewel

jewel: rbd: object-map: batch updates during trim operation

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
Reviewed-by: Mykola Golub <mgolub@mirantis.com>
7 years agoMerge pull request #15488 from dillaman/wip-20023-jewel
Nathan Cutler [Mon, 28 Aug 2017 20:33:13 +0000 (22:33 +0200)]
Merge pull request #15488 from dillaman/wip-20023-jewel

jewel: rbd: rbd-mirror: ensure missing images are re-synced when detected

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
Reviewed-by: Mykola Golub <mgolub@mirantis.com>
7 years agoMerge pull request #15602 from tchaikov/wip-20175-jewel
Nathan Cutler [Mon, 28 Aug 2017 20:32:04 +0000 (22:32 +0200)]
Merge pull request #15602 from tchaikov/wip-20175-jewel

jewel: tests: decouple ceph_test_librbd_api from libceph-common

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
7 years agoMerge pull request #16124 from smithfarm/wip-19174-jewel
Nathan Cutler [Mon, 28 Aug 2017 20:31:09 +0000 (22:31 +0200)]
Merge pull request #16124 from smithfarm/wip-19174-jewel

jewel: rbd: rbd_clone_copy_on_read ineffective with exclusive-lock

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
7 years agoMerge pull request #15760 from smithfarm/wip-20325-jewel
Nathan Cutler [Mon, 28 Aug 2017 09:03:32 +0000 (11:03 +0200)]
Merge pull request #15760 from smithfarm/wip-20325-jewel

libradosstriper silently fails to delete empty objects in jewel

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
7 years agoMerge pull request #16061 from weiqiaomiao/wqm-wip-ceph-disk-jewel
Nathan Cutler [Mon, 28 Aug 2017 08:50:27 +0000 (10:50 +0200)]
Merge pull request #16061 from weiqiaomiao/wqm-wip-ceph-disk-jewel

jewel: ceph-disk: remove the special check to bcache devices

Reviewed-by: Loic Dachary <ldachary@redhat.com>
Reviewed-by: Zhu Shangzhong <zhu.shangzhong@zte.com.cn>
7 years agoMerge pull request #15503 from Vicente-Cheng/wip-20011-jewel
Nathan Cutler [Mon, 28 Aug 2017 08:47:59 +0000 (10:47 +0200)]
Merge pull request #15503 from Vicente-Cheng/wip-20011-jewel

jewel: tools: ceph-disk: separate ceph-osd --check-needs-* logs

Reviewed-by: Nathan Cutler <ncutler@suse.com>
7 years agoMerge pull request #14874 from ceph/wip-19805-jewel
Nathan Cutler [Sun, 27 Aug 2017 16:10:39 +0000 (18:10 +0200)]
Merge pull request #14874 from ceph/wip-19805-jewel

jewel: rbd: default features should be negotiated with the OSD

Reviewed-by: Mykola Golub <mgolub@mirantis.com>
Reviewed-by: Jason Dillaman <dillaman@redhat.com>
7 years agoMerge pull request #14977 from dillaman/wip-19873-jewel
Nathan Cutler [Sun, 27 Aug 2017 16:09:03 +0000 (18:09 +0200)]
Merge pull request #14977 from dillaman/wip-19873-jewel

jewel: rbd: rbd-mirror: failover and failback of unmodified image results in split-brain

Reviewed-by: Mykola Golub <mgolub@mirantis.com>
7 years agoMerge pull request #15461 from smithfarm/wip-19795-jewel
Nathan Cutler [Sun, 27 Aug 2017 15:53:15 +0000 (17:53 +0200)]
Merge pull request #15461 from smithfarm/wip-19795-jewel

jewel: tests: test_notify.py: assert(not image.is_exclusive_lock_owner()) on line 147

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
Reviewed-by: Mykola Golub <mgolub@mirantis.com>
7 years agoMerge pull request #15463 from smithfarm/wip-20017-jewel
Nathan Cutler [Sun, 27 Aug 2017 15:51:35 +0000 (17:51 +0200)]
Merge pull request #15463 from smithfarm/wip-20017-jewel

jewel: rbd: rbd-nbd: kernel reported invalid device size (0, expected 1073741824)

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
Reviewed-by: Mykola Golub <mgolub@mirantis.com>
7 years agoMerge pull request #15464 from smithfarm/wip-20153-jewel
Nathan Cutler [Sun, 27 Aug 2017 15:49:36 +0000 (17:49 +0200)]
Merge pull request #15464 from smithfarm/wip-20153-jewel

jewel: rbd: Potential IO hang if image is flattened while read request is in-flight

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
Reviewed-by: Mykola Golub <mgolub@mirantis.com>
7 years agoMerge pull request #15455 from smithfarm/wip-19838-jewel
Nathan Cutler [Sat, 26 Aug 2017 17:43:55 +0000 (19:43 +0200)]
Merge pull request #15455 from smithfarm/wip-19838-jewel

jewel: rgw: reduce log level of 'storing entry at' in cls_log

Reviewed-by: Casey Bodley <cbodley@redhat.com>
7 years agoMerge pull request #15456 from smithfarm/wip-20003-jewel
Nathan Cutler [Sat, 26 Aug 2017 17:42:23 +0000 (19:42 +0200)]
Merge pull request #15456 from smithfarm/wip-20003-jewel

jewel: rgw: rgw_file:  v3 write timer does not close open handles

Reviewed-by: Matt Benjamin <mbenjamin@redhat.com>
7 years agoMerge pull request #15457 from smithfarm/wip-20018-jewel
Nathan Cutler [Sat, 26 Aug 2017 17:38:29 +0000 (19:38 +0200)]
Merge pull request #15457 from smithfarm/wip-20018-jewel

jewel: rgw: multisite: memory leak on failed lease in RGWDataSyncShardCR

Reviewed-by: Casey Bodley <cbodley@redhat.com>
7 years agoMerge pull request #15459 from smithfarm/wip-20144-jewel
Nathan Cutler [Sat, 26 Aug 2017 17:33:42 +0000 (19:33 +0200)]
Merge pull request #15459 from smithfarm/wip-20144-jewel

jewel: rgw: rgw_file: cannot delete bucket w/uxattrs

Reviewed-by: Matt Benjamin <mbenjamin@redhat.com>
7 years agoMerge pull request #15465 from smithfarm/wip-20155-jewel
Nathan Cutler [Sat, 26 Aug 2017 17:32:40 +0000 (19:32 +0200)]
Merge pull request #15465 from smithfarm/wip-20155-jewel

jewel: rgw: fix crash caused by shard id out of range when listing data log

Reviewed-by: Casey Bodley <cbodley@redhat.com>
7 years agolibrbd: clean up object map update interface, revisited 15460/head
Nathan Cutler [Sat, 26 Aug 2017 08:06:25 +0000 (10:06 +0200)]
librbd: clean up object map update interface, revisited

In master, the "batch update" change [1] was merged before the "order
concurrent updates" [2], while in jewel the latter is already
backported [3]. A backport of [1] to jewel was attempted, and was
necessarily applied on top of [3] - i.e. in the reverse order compared
to how the commits went into master. This reverse ordering caused the
automated cherry-pick to miss some parts of [1] which this commit is
adding manually.

[1] https://github.com/ceph/ceph/pull/11510
[2] https://github.com/ceph/ceph/pull/12420
[3] https://github.com/ceph/ceph/pull/12909

Signed-off-by: Mykola Golub <mgolub@mirantis.com>
Signed-off-by: Nathan Cutler <ncutler@suse.com>
7 years agolibrbd: batch ObjectMap updations upon trim
Venky Shankar [Sat, 15 Oct 2016 11:48:30 +0000 (17:18 +0530)]
librbd: batch ObjectMap updations upon trim

Shrinking a clone which has snapshots and does not share
majority of objects with its parent (i.e., there are less
objects to be copied up) involves huge number of object
map updates -- two (pre, post) per object. This results
in lots of requests to be send to OSDs especially when
trimming a gigantus image. This situation can be optimized
by sending batch ObjectMap updates for an object range
thereby significantly cutting down OSD traffic resulting
in faster trim times.

Fixes: http://tracker.ceph.com/issues/17356
Signed-off-by: Venky Shankar <vshankar@redhat.com>
(cherry picked from commit 05653b7c512334533b801013f7e426363237301b)

7 years agoMerge PR #16248 into jewel
Patrick Donnelly [Fri, 25 Aug 2017 21:26:21 +0000 (14:26 -0700)]
Merge PR #16248 into jewel

* refs/remotes/upstream/pull/16248/head:
client: don't re-send interrupted flock request
mds/flock: properly remove item from global_waiting_locks
mds/flock: properly update ceph_lock_state_t::client_waiting_lock_counts
mds/flock: don't import ceph_lock_state_t::waiting_locks

Reviewed-by: Zheng Yan <zyan@redhat.com>
7 years agoMerge pull request #15454 from smithfarm/wip-19775-jewel
Nathan Cutler [Fri, 25 Aug 2017 18:09:58 +0000 (20:09 +0200)]
Merge pull request #15454 from smithfarm/wip-19775-jewel

jewel: rgw: multisite: realm rename does not propagate to other clusters

Reviewed-by: Orit Wasserman <owasserm@redhat.com>
7 years agoMerge pull request #15453 from smithfarm/wip-19768-jewel
Nathan Cutler [Fri, 25 Aug 2017 18:09:15 +0000 (20:09 +0200)]
Merge pull request #15453 from smithfarm/wip-19768-jewel

jewel: rgw: multisite: operating bucket's acl&cors is not restricted on slave zone

Reviewed-by: Casey Bodley <cbodley@redhat.com>
7 years agoMerge pull request #15452 from smithfarm/wip-19765-jewel
Nathan Cutler [Fri, 25 Aug 2017 18:08:26 +0000 (20:08 +0200)]
Merge pull request #15452 from smithfarm/wip-19765-jewel

jewel: rgw: when uploading objects continuously into a versioned bucket, some objects will not sync

Reviewed-by: Casey Bodley <cbodley@redhat.com>
7 years agoMerge pull request #15451 from smithfarm/wip-19764-jewel
Nathan Cutler [Fri, 25 Aug 2017 18:06:37 +0000 (20:06 +0200)]
Merge pull request #15451 from smithfarm/wip-19764-jewel

jewel: rgw: set latest object's acl failed

Reviewed-by: Casey Bodley <cbodley@redhat.com>
7 years agoMerge pull request #15450 from smithfarm/wip-19758-jewel
Nathan Cutler [Fri, 25 Aug 2017 18:05:54 +0000 (20:05 +0200)]
Merge pull request #15450 from smithfarm/wip-19758-jewel

jewel: rgw: multisite: after CreateBucket is forwarded to master, local bucket may use different value for bucket index shards

Reviewed-by: Casey Bodley <cbodley@redhat.com>
7 years agoqa/workunits/rados/test-upgrade-v11.0.0: skip ListObjectsError
Sage Weil [Thu, 24 Aug 2017 14:27:22 +0000 (10:27 -0400)]
qa/workunits/rados/test-upgrade-v11.0.0: skip ListObjectsError

Signed-off-by: Sage Weil <sage@redhat.com>
7 years agoqa/workunits/rados/test-upgrade-v11.0.0: skip more tests
Sage Weil [Thu, 24 Aug 2017 14:20:00 +0000 (10:20 -0400)]
qa/workunits/rados/test-upgrade-v11.0.0: skip more tests

Signed-off-by: Sage Weil <sage@redhat.com>
7 years agoMerge pull request #16870 from liewegas/wip-20929-jewel
Sage Weil [Thu, 24 Aug 2017 14:16:19 +0000 (09:16 -0500)]
Merge pull request #16870 from liewegas/wip-20929-jewel

jewel: ceph-disk: dmcrypt cluster must default to ceph

Reviewed-by: Loic Dachary <ldachary@redhat.com>
Reviewed-by: Alfredo Deza <adeza@redhat.com>
7 years agoMerge pull request #14659 from asheplyakov/19473-jewel
Nathan Cutler [Thu, 24 Aug 2017 08:15:57 +0000 (10:15 +0200)]
Merge pull request #14659 from asheplyakov/19473-jewel

jewel: rgw: add the remove-x-delete feature to cancel swift object expiration

Reviewed-by: Casey Bodley <cbodley@redhat.com>
7 years agoMerge pull request #15322 from gregsfortytwo/jewel-19931-snaptrim-shutdown
Nathan Cutler [Thu, 24 Aug 2017 07:13:23 +0000 (09:13 +0200)]
Merge pull request #15322 from gregsfortytwo/jewel-19931-snaptrim-shutdown

jewel: osd: Reset() snaptrimmer on shutdown and do not default-abort on leaked pg refs

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
7 years agoMerge pull request #15236 from asheplyakov/20036-bp-jewel
Nathan Cutler [Thu, 24 Aug 2017 07:11:52 +0000 (09:11 +0200)]
Merge pull request #15236 from asheplyakov/20036-bp-jewel

jewel: mon: factor mon_osd_full_ratio into MAX AVAIL calc

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
7 years agoMerge pull request #14346 from ceph/wip-ceph-disk-fix-jewel
Nathan Cutler [Thu, 24 Aug 2017 07:07:08 +0000 (09:07 +0200)]
Merge pull request #14346 from ceph/wip-ceph-disk-fix-jewel

jewel: build/ops: Add fix subcommand to ceph-disk, fix SELinux denials, and speed up upgrade from non-SELinux enabled ceph to an SELinux enabled one

Reviewed-by: Kefu Chai <kchai@redhat.com>
7 years agoMerge pull request #15083 from asheplyakov/19926-bp-jewel
Nathan Cutler [Thu, 24 Aug 2017 07:04:49 +0000 (09:04 +0200)]
Merge pull request #15083 from asheplyakov/19926-bp-jewel

jewel: mon: crash on shutdown, lease_ack_timeout event

Reviewed-by: Kefu Chai <kchai@redhat.com>
7 years agoMerge pull request #14663 from smithfarm/wip-19228-jewel
Nathan Cutler [Wed, 23 Aug 2017 18:56:38 +0000 (20:56 +0200)]
Merge pull request #14663 from smithfarm/wip-19228-jewel

jewel: rbd: Enabling mirroring for a pool with clones may fail

Reviewed-by: Mykola Golub <mgolub@mirantis.com>
7 years agoMerge pull request #15050 from asheplyakov/19265-bp-jewel
Nathan Cutler [Wed, 23 Aug 2017 18:52:04 +0000 (20:52 +0200)]
Merge pull request #15050 from asheplyakov/19265-bp-jewel

jewel: core: an OSD was seen getting ENOSPC even with osd_failsafe_full_ratio passed

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: David Zafman <dzafman@redhat.com>
7 years agoMerge pull request #15065 from asheplyakov/19915-bp-jewel
Nathan Cutler [Wed, 23 Aug 2017 18:50:01 +0000 (20:50 +0200)]
Merge pull request #15065 from asheplyakov/19915-bp-jewel

jewel: osd: PrimaryLogPG: do not call on_shutdown() if (pg.deleting)

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Brad Hubbard <bhubbard@redhat.com>
7 years agoMerge pull request #14943 from shinobu-x/wip-18293-jewel
Nathan Cutler [Wed, 23 Aug 2017 18:48:38 +0000 (20:48 +0200)]
Merge pull request #14943 from shinobu-x/wip-18293-jewel

jewel: osd: leaked MOSDMap

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Sage Weil <sage@redhat.com>
7 years agoMerge pull request #14699 from smithfarm/wip-19679-jewel
Nathan Cutler [Wed, 23 Aug 2017 18:12:01 +0000 (20:12 +0200)]
Merge pull request #14699 from smithfarm/wip-19679-jewel

jewel: mds: damage reporting by ino number is useless

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
7 years agoMerge pull request #14673 from smithfarm/wip-19466-jewel
Nathan Cutler [Wed, 23 Aug 2017 18:10:07 +0000 (20:10 +0200)]
Merge pull request #14673 from smithfarm/wip-19466-jewel

jewel: mds: log rotation doesn't work if mds has respawned

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
7 years agoqa/workunits/rados/test-upgrade-v11.0.0.sh: more tests skipped
Sage Weil [Wed, 23 Aug 2017 17:53:48 +0000 (13:53 -0400)]
qa/workunits/rados/test-upgrade-v11.0.0.sh: more tests skipped

Signed-off-by: Sage Weil <sage@redhat.com>
7 years agoMerge pull request #15442 from Vicente-Cheng/wip-20025-jewel
Nathan Cutler [Wed, 23 Aug 2017 06:41:55 +0000 (08:41 +0200)]
Merge pull request #15442 from Vicente-Cheng/wip-20025-jewel

jewel: cephfs: osdc/Filer: truncate large file party by party

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Yan, Zheng <zyan@redhat.com>
7 years agoMerge pull request #14691 from smithfarm/wip-19571-jewel
Nathan Cutler [Wed, 23 Aug 2017 06:40:20 +0000 (08:40 +0200)]
Merge pull request #14691 from smithfarm/wip-19571-jewel

jewel: tests: upgrade:client-upgrade/firefly-client-x: drop CentOS

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
7 years agolibrbd: default features should be negotiated with the OSD 14874/head
Jason Dillaman [Fri, 28 Apr 2017 16:52:18 +0000 (12:52 -0400)]
librbd: default features should be negotiated with the OSD

Derived from f066ce8f80bce929edc209590efd47cce2196ae2

Signed-off-by: Mykola Golub <mgolub@mirantis.com>
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
7 years agocls/rbd: add get_all_features on client side
Mykola Golub [Sat, 5 Nov 2016 20:44:33 +0000 (22:44 +0200)]
cls/rbd: add get_all_features on client side

Signed-off-by: Mykola Golub <mgolub@mirantis.com>
(cherry picked from commit 84f6d5c109911923c25414de639308921983e438)

Conflicts:
src/cls/rbd/cls_rbd_client.h: trivial resolution

7 years agoMerge pull request #15448 from smithfarm/wip-19615-jewel
Nathan Cutler [Tue, 22 Aug 2017 14:25:14 +0000 (16:25 +0200)]
Merge pull request #15448 from smithfarm/wip-19615-jewel

jewel: rgw: multisite: bucket zonegroup redirect not working

Reviewed-by: Casey Bodley <cbodley@redhat.com>
7 years agotests: upgrade:client-upgrade/firefly-client-x: drop CentOS 14691/head
Nathan Cutler [Thu, 20 Apr 2017 17:15:29 +0000 (19:15 +0200)]
tests: upgrade:client-upgrade/firefly-client-x: drop CentOS

The RBD suite needs ceph-cm-ansible to install qemu-kvm on CentOS, but doing
that breaks the firefly install on CentOS because:

1. the qemu-kvm that gets installed is from hammer (0.94.5)
2. qemu-kvm brings in librados2, librbd1 as dependencies

As a result, the hammer librados2 and librbd1 are installed on the test nodes
even before the teuthology install task starts. When it does start and tries
to install firefly, it fails because firefly librados2 and librbd1 cannot be
installed over their hammer versions.

Fixes: http://tracker.ceph.com/issues/19571
Signed-off-by: Nathan Cutler <ncutler@suse.com>
7 years agoMerge pull request #15197 from smithfarm/wip-20007-jewel
Nathan Cutler [Tue, 22 Aug 2017 11:47:21 +0000 (13:47 +0200)]
Merge pull request #15197 from smithfarm/wip-20007-jewel

jewel: rgw: remove unnecessary 'error in read_id for object name: default'

Reviewed-by: Jos Collin <jcollin@redhat.com>
Reviewed-by: Orit Wasserman <owasserm@redhat.com>
7 years agoMerge pull request #15051 from asheplyakov/19910-bp-jewel
Nathan Cutler [Tue, 22 Aug 2017 08:00:07 +0000 (10:00 +0200)]
Merge pull request #15051 from asheplyakov/19910-bp-jewel

jewel: build/ops: extended, customizable systemd ceph-disk timeout

Reviewed-by: David Disseldorp <ddiss@suse.com>
Reviewed-by: Nathan Cutler <ncutler@suse.com>
7 years agoMerge pull request #15428 from smithfarm/wip-20162-jewel
Nathan Cutler [Tue, 22 Aug 2017 07:54:12 +0000 (09:54 +0200)]
Merge pull request #15428 from smithfarm/wip-20162-jewel

jewel: build/ops: deb: Fix logrotate packaging

Reviewed-by: Kefu Chai <kchai@redhat.com>
7 years agoMerge pull request #15447 from smithfarm/wip-19613-jewel
Nathan Cutler [Tue, 22 Aug 2017 07:52:49 +0000 (09:52 +0200)]
Merge pull request #15447 from smithfarm/wip-19613-jewel

jewel: rgw: multisite: rest api fails to decode large period on "period commit"

Reviewed-by: Casey Bodley <cbodley@redhat.com>
7 years agotest: remove hard-coded image name from RBD metadata test 14663/head
Jason Dillaman [Thu, 27 Apr 2017 20:21:33 +0000 (16:21 -0400)]
test: remove hard-coded image name from RBD metadata test

Fixes: http://tracker.ceph.com/issues/19798
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
(cherry picked from commit 8f72e745e157cc12d76ca6babe956c5698ee297f)

7 years agolibrbd: relax "is parent mirrored" check when enabling mirroring for pool
Mykola Golub [Thu, 2 Mar 2017 16:18:18 +0000 (17:18 +0100)]
librbd: relax "is parent mirrored" check when enabling mirroring for pool

If the parent is in the same pool and has the journaling feature enabled
we can assume the mirroring will eventually be enabled for it.

Fixes: http://tracker.ceph.com/issues/19130
Signed-off-by: Mykola Golub <mgolub@mirantis.com>
(cherry picked from commit fe31bca22f90ce02f461d6421a4f66539db888d3)

7 years agoMerge pull request #15196 from smithfarm/wip-20006-jewel
Nathan Cutler [Tue, 22 Aug 2017 07:31:04 +0000 (09:31 +0200)]
Merge pull request #15196 from smithfarm/wip-20006-jewel

jewel: build/ops: rpm: fix python-Sphinx package name for SUSE

Reviewed-by: Kefu Chai <kchai@redhat.com>
Reviewed-by: Ken Dreyer <kdreyer@redhat.com>
Reviewed-by: Abhishek Lekshmanan <abhishek@suse.com>
7 years agoqa/workunits/rados/test/test-upgrade-v11.0.0: skip Quota test
Sage Weil [Sat, 12 Aug 2017 13:00:22 +0000 (09:00 -0400)]
qa/workunits/rados/test/test-upgrade-v11.0.0: skip Quota test

This makes a pool go full which confounds the wait for healthy check
during restarts.

Signed-off-by: Sage Weil <sage@redhat.com>
7 years agoqa/workunits/rados/test-upgrade-v11.0.0: skip api_io *CmpExt*
Sage Weil [Wed, 9 Aug 2017 13:56:21 +0000 (09:56 -0400)]
qa/workunits/rados/test-upgrade-v11.0.0: skip api_io *CmpExt*

Signed-off-by: Sage Weil <sage@redhat.com>
7 years agoqa/workunits/rados/test-upgrade-11.0.0: skip RoundTrip3
Sage Weil [Mon, 7 Aug 2017 18:01:58 +0000 (14:01 -0400)]
qa/workunits/rados/test-upgrade-11.0.0: skip RoundTrip3

This does the crc op, which isn't in jewel.

Signed-off-by: Sage Weil <sage@redhat.com>
7 years agoMerge pull request #16343 from dillaman/wip-20630-jewel
Jason Dillaman [Mon, 7 Aug 2017 14:07:13 +0000 (10:07 -0400)]
Merge pull request #16343 from dillaman/wip-20630-jewel

jewel: tests: qa/tasks: rbd-mirror daemon not properly run in foreground mode

Reviewed-by: Mykola Golub <mgolub@mirantis.com>
7 years agoceph-disk: dmcrypt cluster must default to ceph 16870/head
Loic Dachary [Thu, 3 Aug 2017 08:56:24 +0000 (10:56 +0200)]
ceph-disk: dmcrypt cluster must default to ceph

If ceph_fsid is not found, which is the case for legacy dmcrypted OSD,
the cluster must default to ceph, as it was before support for non
standard cluster names was introduced.

Fixes: http://tracker.ceph.com/issues/20893
Signed-off-by: Loic Dachary <loic@dachary.org>
(cherry picked from commit ed22159fdd7071a110e81cd7b63d7e0f2296761b)

7 years agoMerge pull request #13507 from shinobu-x/wip-18468-jewel
Nathan Cutler [Tue, 1 Aug 2017 11:06:54 +0000 (13:06 +0200)]
Merge pull request #13507 from shinobu-x/wip-18468-jewel

jewel: mon: Disallow enabling 'hashpspool' option to a pool without some kind of --i-understand-this-will-remap-all-pgs flag

Reviewed-by: Kefu Chai <kchai@redhat.com>
7 years agoosdc/Filer: truncate large file party by party 15442/head
Yan, Zheng [Tue, 25 Apr 2017 08:21:24 +0000 (16:21 +0800)]
osdc/Filer: truncate large file party by party

Fixes: http://tracker.ceph.com/issues/19755
Signed-off-by: "Yan, Zheng" <zyan@redhat.com>
(cherry picked from commit 5fab215e461e5ecc36c0f9d9ea867f6c45e80263)

Conflicts:
src/osdc/Filer.h
src/osdc/Filer.cc
    - add parameter to fit _modify (need onack parameter)
src/mds/MDCache.cc
    - make truncate() consistency

7 years agojewel:ceph-disk:remove the special check to bcache devices 16061/head
weiqiaomiao [Mon, 3 Jul 2017 01:34:08 +0000 (09:34 +0800)]
jewel:ceph-disk:remove the special check to bcache devices

if we skip partition when coming cross the bcache device in ceph-disk,
the udev can't detect the osd in such devices when we reboot the osd's host,
so the osd who deploy in bcache device can't auto start when the host restart.

And partition is now supported by bcache devices (https://git.kernel.org/pub/scm/linux/kernel/git/stable/linux-stable.git/commit/drivers/md/bcache?id=b8c0d911ac5285e6be8967713271a51bdc5a936a),
we can remove this special check now.

This cannot be cherry-picked from master because 'accept bcache devices as data disks' [21f0216](https://github.com/ceph/ceph/commit/21f0216287485e6cce8811f324fee804ef733524)
is a jewel-only feature not present in master.

Signed-off-by: Wei Qiaomiao <wei.qiaomiao@zte.com.cn>
8 years agojewel: mon: Fix status output warning for mon_warn_osd_usage_min_max_delta 15050/head
David Zafman [Fri, 7 Jul 2017 17:53:41 +0000 (10:53 -0700)]
jewel: mon: Fix status output warning for mon_warn_osd_usage_min_max_delta

Fixes: http://tracker.ceph.com/issues/20544
Caued by: 489e810c37ed6fb9d32d1015634041a577501ee4

Signed-off-by: David Zafman <dzafman@redhat.com>
(cherry picked from commit 56f9808016ab289bdc0ce7cfbb0503c78b509593)

Conflicts:
apply changes to src/mon/PGMonitor.cc instead of src/mon/PGMap.cc

8 years agoMerge pull request #16289 from cbodley/wip-20513
Matt Benjamin [Thu, 20 Jul 2017 19:00:32 +0000 (15:00 -0400)]
Merge pull request #16289 from cbodley/wip-20513

jewel: rgw: cls: ceph::timespan tag_timeout wrong units

8 years agomessages/MOSDPing: optimize encode and decode of dummy payload 16059/head
Piotr Dałek [Fri, 16 Jun 2017 11:34:19 +0000 (13:34 +0200)]
messages/MOSDPing: optimize encode and decode of dummy payload

The dummy payload doesn't need to be processed, we can just skip over
it when decoding and we can use a single bufferptr instead of entire
bufferlist to encode it.

Signed-off-by: Piotr Dałek <piotr.dalek@corp.ovh.com>
(cherry picked from commit 15ce0772841f913d1eef1daebad0834e5f862383)

8 years agomessages/MOSDPing: fix the inflation amount calculation
Piotr Dałek [Fri, 16 Jun 2017 11:10:36 +0000 (13:10 +0200)]
messages/MOSDPing: fix the inflation amount calculation

If user specifies a min_message_size small enough (or zero to disable
it altogether), OSDs will crash and burn while trying to allocate
almost 4GB of payload (both min_message_size and payload.length() are
unsigned, so it'll roll over back to 4GB and MAX(4GB, 0) will use 4GB).
If the size of dummy payload is 0, don't bother constructing bufferptr
and bufferlist, then encoding that.

Signed-off-by: Piotr Dałek <piotr.dalek@corp.ovh.com>
(cherry picked from commit d959735a7d684018df564573a2ff88092d870fc1)

8 years agoOSD: mark two heartbeat config opts as observed
Piotr Dałek [Fri, 16 Jun 2017 11:05:10 +0000 (13:05 +0200)]
OSD: mark two heartbeat config opts as observed

"osd heartbeat min size" and "osd heartbeat interval" can be changed
at runtime, because their values, when used, are always taken from
global Ceph configuration. Mark them as observed, so the message
the user sees once they're changed doesn't confuse them.

Signed-off-by: Piotr Dałek <piotr.dalek@corp.ovh.com>
(cherry picked from commit 44b7839c788b20e0b9da76bc3838ae10d3ad2f89)

8 years agomessages/MOSDPing: initialize MOSDPing padding
Sage Weil [Fri, 16 Jun 2017 02:18:08 +0000 (22:18 -0400)]
messages/MOSDPing: initialize MOSDPing padding

This memory must be initialized or else valgrind will be very unhappy.

Avoid the cost of zeroing (or even allocating) the buffer for normal
padding values by (re)using a static zero buffer.

Fixes: http://tracker.ceph.com/issues/20323
Signed-off-by: Sage Weil <sage@redhat.com>
(cherry picked from commit 9beaf5efd74daa8c15e50b42583264d1252a85f5)

8 years agoosd: heartbeat with packets large enough to require working jumbo frames.
Greg Farnum [Mon, 5 Jun 2017 20:33:14 +0000 (13:33 -0700)]
osd: heartbeat with packets large enough to require working jumbo frames.

We get periodic reports that users somehow misconfigure one of their switches
so that it drops jumbo frames, yet the servers are still passing them along. In
that case, MOSDOp messages generally don't get through because they are much
larger than the 1500-byte non-jumbo limit, but the MOSDPing messages have kept
going (as they are very small and dispatched independently, even when the
server is willing to make jumbo frames). This means peer OSDs won't mark down
the ones behind the broken switch, despite all IO hanging.
Push the MOSDPing message size over the 1500-byte limit so that anybody in
this scenario will see the OSDs stuck behind a bad switch get marked down.

Fixes: http://tracker.ceph.com/issues/20087
Signed-off-by: Greg Farnum <gfarnum@redhat.com>
(cherry picked from commit 2096113d9e1589c571d96e34dd9cd841308a2567)

 Conflicts:
src/messages/MOSDPing.h
         - Changed HEAD_VERSION to 3 and kept COMPAT_VERSION to 1.
         - In class MOSDPing removed following line:
               if (header.version >= 2)
         - To keep ::decode(stamp, p) without condition because HEAD_Version
           is already 3 now and this condition is removed in the backport commit.

8 years agoqa/tasks: rbd-mirror daemon not properly run in foreground mode 16343/head
Jason Dillaman [Fri, 14 Jul 2017 14:32:28 +0000 (10:32 -0400)]
qa/tasks: rbd-mirror daemon not properly run in foreground mode

Fixes: http://tracker.ceph.com/issues/20630
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
(cherry picked from commit 4fa1918717b25a5ffafbf649eedcfe7d5ab829c2)

8 years ago10.2.9 v10.2.9
Jenkins Build Slave User [Thu, 13 Jul 2017 13:04:57 +0000 (13:04 +0000)]
10.2.9

8 years agoMerge pull request #16282 from smithfarm/wip-20599-jewel
Nathan Cutler [Thu, 13 Jul 2017 08:31:31 +0000 (10:31 +0200)]
Merge pull request #16282 from smithfarm/wip-20599-jewel

jewel: cephfs: Damaged MDS with 10.2.8

Reviewed-by: Yan, Zheng <zyan@redhat.com>
8 years agorgw: allow larger payload for period commit 15447/head
Casey Bodley [Wed, 5 Apr 2017 18:20:20 +0000 (14:20 -0400)]
rgw: allow larger payload for period commit

testing with 3 zonegroups and 3 zones each, the period json grew larger
than 4k and caused decode failures on period commit

updated to use the new config variable rgw_max_put_param_size

Fixes: http://tracker.ceph.com/issues/19505
Signed-off-by: Casey Bodley <cbodley@redhat.com>
(cherry picked from commit 7f2871fe59d933b03f37fde40f1781b2320d0d50)

Conflicts:
    src/rgw/rgw_rest_realm.cc - in RGWOp_Period_Post::execute(), set max_size
        explicitly instead of backporting rgw_max_put_param_size

8 years agorgw: when create_bucket use the same num_shards with info.num_shards 15450/head
lu.shasha [Tue, 9 May 2017 07:05:03 +0000 (15:05 +0800)]
rgw: when create_bucket use the same num_shards with info.num_shards

pr #14388 only fix the num_shards in BucketInfo, "init_bucket_index" function still use local num_shards

Fixes: http://tracker.ceph.com/issues/19745
Signed-off-by: Shasha Lu <lu.shasha@eisoo.com>
(cherry picked from commit 4ce64a190b4ff36985e785e574c077d39796feea)

Conflicts:
src/rgw/rgw_rados.cc: init_bucket_index() called earlier

8 years agorgw: using the same bucket num_shards as master zg when create bucket in secondary zg
lu.shasha [Fri, 7 Apr 2017 07:34:27 +0000 (15:34 +0800)]
rgw: using the same bucket num_shards as master zg when create bucket in secondary zg

create bucket in secondary zonegroup will forward to master. The master may have different num_shards option.
So when create bucket in local, should use master's num_shards instead of local num_shards option.

Fixes: http://tracker.ceph.com/issues/19745
Signed-off-by: Shasha Lu <lu.shasha@eisoo.com>
(cherry picked from commit a34c4b8fb13dd5590eb3c6ecb5e55207ed8e3ee8)

Conflicts:
    src/rgw/rgw_op.cc
    - different argument list of store->create_bucket() in jewel, compared to
      master
    - no RGWBulkUploadOp:: in jewel; modifications to
      RGWBulkUploadOp::handle_dir() omitted

8 years agorgw: cls: ceph::timespan tag_timeout wrong units 16289/head
Matt Benjamin [Thu, 29 Jun 2017 19:19:36 +0000 (15:19 -0400)]
rgw: cls: ceph::timespan tag_timeout wrong units

In rgw_dir_suggest(), the ceph::timespan tag_timeout is intended
to be a value in seconds, but has been taken as something much
smaller (millis?).  The incorrect time scale likely induces a race
condition with object deletes.

Fixes: http://tracker.ceph.com/issues/20380
Signed-off-by: Matt Benjamin <mbenjamin@redhat.com>
(cherry picked from commit e6763ebd6124c1b4736c5c7850077df7644bc8f5)

8 years agoRevert "osdc/Journaler: make header write_pos align to boundary of flushed entry" 16282/head
Nathan Cutler [Wed, 12 Jul 2017 06:40:20 +0000 (08:40 +0200)]
Revert "osdc/Journaler: make header write_pos align to boundary of flushed entry"

This reverts commit 2e299b50de4a297fee2aec21290632336d239857.

Signed-off-by: Nathan Cutler <ncutler@suse.com>
8 years agoRevert "osdc/Journaler: avoid executing on_safe contexts prematurely"
Nathan Cutler [Wed, 12 Jul 2017 06:40:13 +0000 (08:40 +0200)]
Revert "osdc/Journaler: avoid executing on_safe contexts prematurely"

This reverts commit 06cf9f365033f7913051bdf4060f0bc6fc0444d7.

Signed-off-by: Nathan Cutler <ncutler@suse.com>
8 years agoMerge pull request #16273 from smithfarm/wip-jewel-pending-release-notes
Sage Weil [Tue, 11 Jul 2017 20:58:04 +0000 (15:58 -0500)]
Merge pull request #16273 from smithfarm/wip-jewel-pending-release-notes

jewel: doc: clarify status of jewel PendingReleaseNotes

8 years agodoc: zero PendingReleaseNotes in preparation for v10.2.9 16273/head
Nathan Cutler [Tue, 11 Jul 2017 20:53:56 +0000 (22:53 +0200)]
doc: zero PendingReleaseNotes in preparation for v10.2.9

Signed-off-by: Nathan Cutler <ncutler@suse.com>
8 years agodoc: clarify status of jewel PendingReleaseNotes
Nathan Cutler [Tue, 11 Jul 2017 20:27:33 +0000 (22:27 +0200)]
doc: clarify status of jewel PendingReleaseNotes

Status as of 10.2.8 release

Signed-off-by: Nathan Cutler <ncutler@suse.com>
8 years agoclient: don't re-send interrupted flock request 16248/head
Yan, Zheng [Sat, 3 Jun 2017 04:33:36 +0000 (12:33 +0800)]
client: don't re-send interrupted flock request

don't re-send interrupted flock request in cases of mds failover
and receiving request forward.

Signed-off-by: "Yan, Zheng" <zyan@redhat.com>
(cherry picked from commit 044fabd669fdc8287f5cab0f38a82dfbfe41f0cd)

 Conflicts:
src/client/Client.cc: UserPerms not in jewel

8 years agomds/flock: properly remove item from global_waiting_locks
Yan, Zheng [Sat, 3 Jun 2017 04:06:10 +0000 (12:06 +0800)]
mds/flock: properly remove item from global_waiting_locks

ceph_lock_state_t::remove_waiting() uses wrong key to search
global_waiting_locks. It should use item in waiting_locks as
key.

Signed-off-by: "Yan, Zheng" <zyan@redhat.com>
(cherry picked from commit 0d71c6120e61f31b803c3fb6488fc7e97134e348)

8 years agomds/flock: properly update ceph_lock_state_t::client_waiting_lock_counts
Yan, Zheng [Sat, 3 Jun 2017 04:01:27 +0000 (12:01 +0800)]
mds/flock: properly update ceph_lock_state_t::client_waiting_lock_counts

no item is added to waiting_locks when deadlock check fails

Signed-off-by: "Yan, Zheng" <zyan@redhat.com>
(cherry picked from commit a6b01939a199833568961ae4d30d8540a25a7740)

8 years agomds/flock: don't import ceph_lock_state_t::waiting_locks
Yan, Zheng [Sat, 3 Jun 2017 03:42:12 +0000 (11:42 +0800)]
mds/flock: don't import ceph_lock_state_t::waiting_locks

Item in waiting_locks is associated with flock mds request in
exporter mds. If client re-sends the flock mds request to the
importer, the importer starts a new MDRequest. The new one's
'flock_was_waiting' is false.

Signed-off-by: "Yan, Zheng" <zyan@redhat.com>
(cherry picked from commit 8d777c4855b0c8e58a8d6966371698b8b7721a42)

8 years agorpm: Fix undefined FIRST_ARG 14346/head
Boris Ranto [Fri, 7 Jul 2017 10:37:55 +0000 (12:37 +0200)]
rpm: Fix undefined FIRST_ARG

If FIRST_ARG is undefined, the rpms will show an error on upgrade
because the condition in the postun script gets broken.

This was a regression introduced by commit for issue 20077 that moved
ceph-disk unit files to ceph-base.

Fixes: http://tracker.ceph.com/issues/20077
Signed-off-by: Boris Ranto <branto@redhat.com>
(cherry picked from commit 562816914ccca8e4e7d9c31f333db2f0da6f7c99)

8 years agoselinux: Install ceph-base before ceph-selinux
Boris Ranto [Mon, 5 Jun 2017 16:44:18 +0000 (18:44 +0200)]
selinux: Install ceph-base before ceph-selinux

We need to have ceph-base installed before ceph-selinux to use ceph-disk
in %post script. The default ordering is random and so the installation
randomly failed to relabel the files.

Fixes: http://tracker.ceph.com/issues/20184
Signed-off-by: Boris Ranto <branto@redhat.com>
(cherry picked from commit e69086645d3411a2ed781609b670eb5f16ac4810)
Conflicts:
ceph.spec.in: No _epoch_prefix in jewel

8 years agotest/librbd: decouple ceph_test_librbd_api from libceph-common 15602/head
Kefu Chai [Mon, 12 Jun 2017 03:55:36 +0000 (11:55 +0800)]
test/librbd: decouple ceph_test_librbd_api from libceph-common

Fixes: http://tracker.ceph.com/issues/20175
Signed-off-by: Kefu Chai <kchai@redhat.com>
(cherry picked from commit b7287fdc4d70c5ecedda78ae367b98e5d8f61c5b)

Conflicts: master has switched from autotools to cmake, so this is
  somewhat an equivalent in automake to the cherry-picked change on
  CMakeLists.txt

8 years agotest/librados: extract functions using libcommon in test.cc into test_common.cc
Kefu Chai [Mon, 12 Jun 2017 03:55:04 +0000 (11:55 +0800)]
test/librados: extract functions using libcommon in test.cc into test_common.cc

Fixes: http://tracker.ceph.com/issues/20175
Signed-off-by: Kefu Chai <kchai@redhat.com>
(cherry picked from commit 82a848c2053ea69ebc6d3ec1003e18921e2d08d2)

Conflicts: since master has switched to cmake. the changes in this
  commit on .am files are ported from counterpart of CMakeLists.txt.