]> git.apps.os.sepia.ceph.com Git - ceph.git/log
ceph.git
6 years agoceph-volume: enable device discards 25748/head
Jonas Jelten [Fri, 19 Oct 2018 13:15:03 +0000 (15:15 +0200)]
ceph-volume: enable device discards

When using SSDs as encrypted OSD device, discards do not pass the
encryption layer. This option activates discard requests.

Signed-off-by: Jonas Jelten <jj@stusta.net>
(cherry picked from commit 333330414730bf57c68efe6f1f13841b0a014dae)

6 years agoMerge pull request #25500 from joscollin/wip-37600-luminous
Nathan Cutler [Thu, 13 Dec 2018 11:25:11 +0000 (12:25 +0100)]
Merge pull request #25500 from joscollin/wip-37600-luminous

luminous: doc: broken link on troubleshooting-mon page

Reviewed-by: Nathan Cutler <ncutler@suse.com>
6 years agoMerge pull request #25418 from xiexingguo/wip-luminous-upmap-fixes
Xie Xingguo [Thu, 13 Dec 2018 05:54:32 +0000 (13:54 +0800)]
Merge pull request #25418 from xiexingguo/wip-luminous-upmap-fixes

luminous: osd: backport recent upmap fixes

Reviewed-by: Neha Ojha <nojha@redhat.com>
6 years agoMerge pull request #25286 from smithfarm/wip-37397-luminous
Nathan Cutler [Wed, 12 Dec 2018 23:58:12 +0000 (00:58 +0100)]
Merge pull request #25286 from smithfarm/wip-37397-luminous

luminous: tests: /usr/bin/ld: cannot find -lradospp in rados mimic

Reviewed-by: Yuri Weinstein <yweinste@redhat.com>
6 years agoMerge pull request #25041 from smithfarm/wip-36750-luminous
Yuri Weinstein [Wed, 12 Dec 2018 23:39:47 +0000 (15:39 -0800)]
Merge pull request #25041 from smithfarm/wip-36750-luminous

luminous: mgr: [restful] deep_scrub is not a valid OSD command

Reviewed-by: Boris Ranto <branto@redhat.com>
6 years agoMerge pull request #25047 from swinds24/fix_36764
Yuri Weinstein [Wed, 12 Dec 2018 23:39:11 +0000 (15:39 -0800)]
Merge pull request #25047 from swinds24/fix_36764

luminous: mgr: hold lock while accessing the request list and submitting request

Reviewed-by: Boris Ranto <branto@redhat.com>
6 years agoMerge pull request #25216 from k0ste/luminous_backports
Yuri Weinstein [Wed, 12 Dec 2018 23:38:20 +0000 (15:38 -0800)]
Merge pull request #25216 from k0ste/luminous_backports

luminous: mgr: prometheus: added bluestore db and wal devices to ceph_disk_occupation metric.//

Reviewed-by: Jan Fajerski <jfajerski@suse.com>
6 years agoMerge pull request #25259 from k0ste/luminous_backports4
Yuri Weinstein [Wed, 12 Dec 2018 23:35:40 +0000 (15:35 -0800)]
Merge pull request #25259 from k0ste/luminous_backports4

luminous: mgr/balancer: add cmd to list all plans

Reviewed-by: Sage Weil <sage@redhat.com>
6 years agoMerge pull request #25326 from theanalyst/wip-luminous-reshard-fixes-1
Casey Bodley [Wed, 12 Dec 2018 19:07:20 +0000 (14:07 -0500)]
Merge pull request #25326 from theanalyst/wip-luminous-reshard-fixes-1

luminous: bucket resharding fixes

Reviewed-by: J. Eric Ivancich <ivancich@redhat.com>
6 years agoMerge pull request #24547 from linuxbox2/luminous-24595
Yuri Weinstein [Wed, 12 Dec 2018 16:49:40 +0000 (08:49 -0800)]
Merge pull request #24547 from linuxbox2/luminous-24595

luminous: rgw: apply quota config to users created via external auth

Reviewed-by: Casey Bodley <cbodley@redhat.com>
6 years agoMerge pull request #24621 from cbodley/wip-24358
Yuri Weinstein [Wed, 12 Dec 2018 16:49:12 +0000 (08:49 -0800)]
Merge pull request #24621 from cbodley/wip-24358

luminous: rgw: add ssl support to beast frontend

Reviewed-by: Casey Bodley <cbodley@redhat.com>
6 years agoMerge pull request #24909 from smithfarm/wip-36414-luminous
Yuri Weinstein [Wed, 12 Dec 2018 16:48:31 +0000 (08:48 -0800)]
Merge pull request #24909 from smithfarm/wip-36414-luminous

luminous: rgw: librgw: crashes in multisite configuration

Reviewed-by: Casey Bodley <cbodley@redhat.com>
6 years agoMerge pull request #24944 from jonsger/luminous-backport-pr#24700
Yuri Weinstein [Wed, 12 Dec 2018 16:48:02 +0000 (08:48 -0800)]
Merge pull request #24944 from jonsger/luminous-backport-pr#24700

luminous: rgw: SSE encryption does not detect ssl termination in proxy

Reviewed-by: Casey Bodley <cbodley@redhat.com>
6 years agoMerge pull request #25088 from jonsger/wip-36757-luminous
Yuri Weinstein [Wed, 12 Dec 2018 16:47:33 +0000 (08:47 -0800)]
Merge pull request #25088 from jonsger/wip-36757-luminous

luminous: rgw: rgw-admin: reshard add can add a non-existent bucket

Reviewed-by: Casey Bodley <cbodley@redhat.com>
6 years agoMerge pull request #25195 from pdvian/wip-37284-luminous
Yuri Weinstein [Wed, 12 Dec 2018 16:47:08 +0000 (08:47 -0800)]
Merge pull request #25195 from pdvian/wip-37284-luminous

luminous: rgw: radosgw-admin: translate reshard status codes (trivial)

Reviewed-by: Casey Bodley <cbodley@redhat.com>
6 years agoMerge pull request #25320 from theanalyst/wip-luminous-zone-deletion
Yuri Weinstein [Wed, 12 Dec 2018 16:46:41 +0000 (08:46 -0800)]
Merge pull request #25320 from theanalyst/wip-luminous-zone-deletion

luminous: rgw: fixes for zone deletion

Reviewed-by: Casey Bodley <cbodley@redhat.com>
6 years agodoc: broken link on troubleshooting-mon page 25500/head
James McClune [Thu, 29 Nov 2018 03:10:33 +0000 (22:10 -0500)]
doc: broken link on troubleshooting-mon page
Fixes: https://tracker.ceph.com/issues/37451
This fix involves adding :ref: labels to the add-or-rm-mons.rst
page. I also added :ref: labels for other headings within
add-or-rm-mons.rst (for future reference).

Signed-off-by: James McClune <jmcclune@mcclunetechnologies.net>
(cherry picked from commit ca05e456969a721631f5e9882c896cab76a4a3dc)

Conflicts:
doc/rados/troubleshooting/troubleshooting-mon.rst
Fixed the :ref: sentence and removed the _replace line.

6 years agoMerge pull request #25284 from jdurgin/wip-osd-memory-target-docs-luminous
Josh Durgin [Mon, 10 Dec 2018 23:54:02 +0000 (15:54 -0800)]
Merge pull request #25284 from jdurgin/wip-osd-memory-target-docs-luminous

luminous: doc/rados: update bluestore provisioning and autotuning docs

Reviewed-by: Mark Nelson <mnelson@redhat.com>
6 years agorgw: fix typo 25326/head
J. Eric Ivancich [Wed, 28 Nov 2018 17:55:35 +0000 (12:55 -0500)]
rgw: fix typo

This fixes a typo in a log message. It's a separate commit so
downstream commits point to the right upstream commits via
cherry-pick.

Signed-off-by: J. Eric Ivancich <ivancich@redhat.com>
(cherry picked from commit 7d1768f2a332c5ac389cbb4bbeda28752e4f17bc)

6 years agorgw: perf -- remove bucket shards asynchronously rather than synchronously
J. Eric Ivancich [Tue, 20 Nov 2018 18:32:54 +0000 (13:32 -0500)]
rgw: perf -- remove bucket shards asynchronously rather than synchronously

We can now take advantage of the new asynchronous bucket shard removal
code and where we used to remove each shard synchronously now remove
them asynchronously. This would be a huge win when we have tens of
thousands of shards.

Signed-off-by: J. Eric Ivancich <ivancich@redhat.com>
(cherry picked from commit cb0da45c5d6e4dff6514919b3b894df0eea7c1e4)

 Conflicts:
src/rgw/rgw_rados.cc
conflicts with placement set and rgw rados refactor

6 years agorgw: clean up unused bucket index shards
J. Eric Ivancich [Tue, 20 Nov 2018 14:52:39 +0000 (09:52 -0500)]
rgw: clean up unused bucket index shards

Clean up old bucket index shards when a resharding is complete. Also,
when a resharding fails, clean up unfinished bucket index shards. Do
both clean-ups asynchronously.

Signed-off-by: J. Eric Ivancich <ivancich@redhat.com>
(cherry picked from commit f84f70d451036ee99011b8e2c7b974f15d3a005a)

 Conflicts:
src/rgw/rgw_rados.h
merge conflict as bucket_placement functions were moved after the rgw rados
refactor

6 years agorgw: a little code clean-up by adding const
J. Eric Ivancich [Mon, 19 Nov 2018 21:10:52 +0000 (16:10 -0500)]
rgw: a little code clean-up by adding const

Signed-off-by: J. Eric Ivancich <ivancich@redhat.com>
(cherry picked from commit 48e22fb76feffb551074f274986bc4125a02003f)

6 years agorgw: remove never-called RGWRados member function
J. Eric Ivancich [Fri, 16 Nov 2018 22:08:46 +0000 (17:08 -0500)]
rgw: remove never-called RGWRados member function

Because RGWRados::cls_rgw_init_index is never called, remove it.

Signed-off-by: J. Eric Ivancich <ivancich@redhat.com>
(cherry picked from commit 4593778c6d830a1a0d33528507f1795aee019ce4)

6 years agorgw: rename function for clarity
J. Eric Ivancich [Wed, 7 Nov 2018 20:33:52 +0000 (15:33 -0500)]
rgw: rename function for clarity

The function cls_rgw_bucket_init was renamed to
cls_rgw_bucket_init_index in order to better describe its
functionality.

Signed-off-by: J. Eric Ivancich <ivancich@redhat.com>
(cherry picked from commit 20868bd3b893ba2e17d2bfdc63d6544dbe677a9b)

6 years agorgw: fix bug in error codes returned
J. Eric Ivancich [Fri, 16 Nov 2018 15:47:48 +0000 (10:47 -0500)]
rgw: fix bug in error codes returned

Fixes two bugs where positive error code was returned rather than
negative error code, which is the RGW norm.

Signed-off-by: J. Eric Ivancich <ivancich@redhat.com>
(cherry picked from commit dc0732c524c2d8fb5b423f8cce8c6cc9aa7ca44a)

6 years agoosd/OSDMap: fix upmap mis-killing for erasure-coded PGs 25418/head
xie xingguo [Sat, 1 Dec 2018 09:42:01 +0000 (17:42 +0800)]
osd/OSDMap: fix upmap mis-killing for erasure-coded PGs

The up-set of erasure-coded PGs may include CRUSH_ITEM_NONE,
which as a result causes mis-killing of valid upmap items.

Fixes: https://tracker.ceph.com/issues/37493
Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>
Signed-off-by: ningtao <ningtao@sangfor.com.cn>
(cherry picked from commit f043dcc6d8663ed960a354b2441fd3f8dd66c62d)

Conflicts:
        - *maybe_remove_pg_upmaps* input changed, in master
          we now have a passed in *nextmap* parameter

6 years agomon: auto clean up stale upmap items
xie xingguo [Wed, 21 Nov 2018 06:25:45 +0000 (14:25 +0800)]
mon: auto clean up stale upmap items

See https://www.spinics.net/lists/ceph-users/msg49518.html

Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>
(cherry picked from commit 4181aa86906886423af80e478e45554606fc7cf6)

Conflicts:
        - *nextmap* has been renamed to *tmpmap*
        - add the *have_pg_upmaps* wrapper which is necessary for unittest

6 years agoosd/OSDMap: add pg-existence sanity check
xie xingguo [Wed, 21 Nov 2018 01:36:21 +0000 (09:36 +0800)]
osd/OSDMap: add pg-existence sanity check

The reason why __get_pg_pool_size(pg)__ or __get_pg_pool_crush_rule(pg)__ fails is
that the pg does not exist anymore. So it generally makes sense to check __pg_exists(pg)__
before moving further.

Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>
(cherry picked from commit 79ef3bea7550559b7ea4e2b68dc407c7eb25defa)

Conflicts:
- *nextmap* has been renamed to *tmpmap*

6 years agoosd/OSDMap.cc: move sanity checks to the proper place
xie xingguo [Wed, 20 Jun 2018 01:27:46 +0000 (09:27 +0800)]
osd/OSDMap.cc: move sanity checks to the proper place

Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>
(cherry picked from commit 3dcd4eaf867369437720a21daf3c344cc4b9844e)

6 years agoosd/OSDMap.cc: remove pg_upmap/pg_upmap_items too if osd is gone
xie xingguo [Wed, 20 Jun 2018 01:04:19 +0000 (09:04 +0800)]
osd/OSDMap.cc: remove pg_upmap/pg_upmap_items too if osd is gone

If an osd is gone or moved out from the specific crush rule,
we should cancel any pg_upmap/pg_upmap_items still bound to
that osd too.

The original code does not work for the above case because
get_parent_of_type() will fail if that osd does not belong
to the crush_rule passed in and hence hits the assert below:

```
src/osd/OSDMap.cc: 4078: FAILED assert(target > 0)
```

Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>
(cherry picked from commit 8a11ed479d7ea95f76170e74123ca2c48e3a3963)

6 years agoMerge pull request #25350 from alfredodeza/luminous-c-v-inventory-add-device_id
Alfredo Deza [Tue, 4 Dec 2018 12:06:40 +0000 (07:06 -0500)]
Merge pull request #25350 from alfredodeza/luminous-c-v-inventory-add-device_id

luminous ceph-volume add device_id to inventory listing

Reviewed-by: Jan Fajerski <jfajerski@suse.com>
6 years agoceph-volume: add test for disk.udevadm_property 25350/head
Jan Fajerski [Wed, 28 Nov 2018 09:21:36 +0000 (10:21 +0100)]
ceph-volume: add test for disk.udevadm_property

Signed-off-by: Jan Fajerski <jfajerski@suse.com>
(cherry picked from commit 58316e3f826f8bb91b375589aa3676b664ce3984)

6 years agoceph-volume: add device_id field to inventory output
Jan Fajerski [Wed, 21 Nov 2018 12:20:45 +0000 (13:20 +0100)]
ceph-volume: add device_id field to inventory output

This intends to mimic the C++ implementation in src/common/blkdev.cc.

Signed-off-by: Jan Fajerski <jfajerski@suse.com>
(cherry picked from commit 41da5e9a4334628db09e3166eaceff79b9b6c7e0)

6 years agoMerge pull request #24914 from smithfarm/wip-36429-luminous
Yuri Weinstein [Fri, 30 Nov 2018 16:14:46 +0000 (08:14 -0800)]
Merge pull request #24914 from smithfarm/wip-36429-luminous

luminous: tests: rbd: move OpenStack devstack test to rocky release

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
6 years agoMerge pull request #24917 from smithfarm/wip-36554-luminous
Yuri Weinstein [Fri, 30 Nov 2018 16:14:17 +0000 (08:14 -0800)]
Merge pull request #24917 from smithfarm/wip-36554-luminous

luminous: rbd: [rbd-mirror] periodic mirror status timer might fail to be scheduled

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
6 years agoMerge pull request #24975 from dillaman/wip-36568-luminous
Yuri Weinstein [Fri, 30 Nov 2018 16:13:29 +0000 (08:13 -0800)]
Merge pull request #24975 from dillaman/wip-36568-luminous

luminous: qa/tasks/qemu: use unique clone directory to avoid race with workunit

Reviewed-by: Mykola Golub <mgolub@mirantis.com>
6 years agoMerge pull request #25006 from dillaman/wip-36407-luminous
Yuri Weinstein [Fri, 30 Nov 2018 16:12:33 +0000 (08:12 -0800)]
Merge pull request #25006 from dillaman/wip-36407-luminous

luminous: rbd: pybind: added missing RBD_FLAG_FAST_DIFF_INVALID constant

Reviewed-by: Mykola Golub <mgolub@mirantis.com>
Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
6 years agoMerge pull request #25253 from pdvian/wip-37364-luminous
Yuri Weinstein [Fri, 30 Nov 2018 16:10:34 +0000 (08:10 -0800)]
Merge pull request #25253 from pdvian/wip-37364-luminous

luminous:librbd: fix missing unblock_writes if shrink is not allowed

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
6 years agorgw: batch and process bucket instances
Abhishek Lekshmanan [Wed, 17 Oct 2018 22:44:51 +0000 (00:44 +0200)]
rgw: batch and process bucket instances

Sort through and batch bucket instances so that multiple calls to reading
current bucket info and locking can be avoided. For the most trivial case when
the bucket is already deleted we exit early with all the stale instances. When
the bucket reshard is in progress we only process the stale entries with status
done, if the bucket is available for locking then we lock down and mark the
other instances as well.

Signed-off-by: Abhishek Lekshmanan <abhishek@suse.com>
(cherry picked from commit fb9c0497621fce34283094cfe260480f9122573f)

 Conflicts:
src/rgw/rgw_bucket.cc
Get rid of the following c++17isms:
- split_tenant auto return type -> trailing return type
- tuple destructuring bind for split tenant with std::tie

6 years agocli: use rgw-admin reshard stale-instances list/delete
Abhishek Lekshmanan [Mon, 15 Oct 2018 17:34:28 +0000 (19:34 +0200)]
cli: use rgw-admin reshard stale-instances list/delete

Add a delete command as well that clears the resharded instances. We print out
the json status to indicate success or error state

Signed-off-by: Abhishek Lekshmanan <abhishek@suse.com>
(cherry picked from commit bf8f885016a588130f05c25fdef3d3b6e1216c40)

 Conflicts:
src/rgw/rgw_admin.cc:
  ceph::util::match_str and the related rm/delete normalizations introduced in
  mimic are not backported

6 years agorgw: implement listing stale instances from a reshard
Abhishek Lekshmanan [Fri, 12 Oct 2018 12:55:48 +0000 (14:55 +0200)]
rgw: implement listing stale instances from a reshard

Dynamic resharding used to leave behind stale bucket instances; walk through the
metadata pool and identify these instances by comparing the reshard status. If
the reshard status is done, these instances are ok to be cleared. For reshard
status of none we compare against the bucket entry point to ensure that we don't
match the current entry point.

Signed-off-by: Abhishek Lekshmanan <abhishek@suse.com>
(cherry picked from commit 0c35a6fab250a817b2752f319b5c9f1da13bb3df)

6 years agorgw: recover from incomplete reshard attempt
J. Eric Ivancich [Wed, 17 Oct 2018 17:43:24 +0000 (13:43 -0400)]
rgw: recover from incomplete reshard attempt

In case a reshard attempt is left in an incomplete state, i.e., flags
still show resharding even though the bucket reshard lock isn't being
held, try to recover by taking the bucket reshard lock and clearing
flags associated with resharding.

This change requires access to an RGWBucketInfo object. So call stack
into this function should provide that to prevent unnecessary
work. Changes were made to provide this object.

Signed-off-by: J. Eric Ivancich <ivancich@redhat.com>
(cherry picked from commit 4891ae59314041802da0f6dc249ccbeb761616dc)

6 years agorgw: move RGWReshardBucket lock to its own separate class
J. Eric Ivancich [Tue, 16 Oct 2018 20:40:03 +0000 (16:40 -0400)]
rgw: move RGWReshardBucket lock to its own separate class

There are other processes beyond resharding that would need to take a
bucket reshard lock (e.g., correcting bucet resharding flags in event
of crash, tools to remove bucket shard information from earlier
versions of ceph). Pulling this logic outside of RGWReshardBucket
allows this code to be re-used.

Signed-off-by: J. Eric Ivancich <ivancich@redhat.com>
(cherry picked from commit 18ab99cd542cc253d62719a18a9d13e7a312eb15)

6 years agorgw: failed resharding clears resharding status from shard heads
J. Eric Ivancich [Fri, 12 Oct 2018 22:07:24 +0000 (18:07 -0400)]
rgw: failed resharding clears resharding status from shard heads

Previously, when resharding failed, we restored the shard status on
the bucket info object. However the status on each of the shards was
left indicating a reshard was underway. This prevented some write
operations from taking place, as they would wait for resharding to
complete. This adds the missing functionality. It also makes the
functionality available to other classes via static functions in
RGWBucketReshard.

Signed-off-by: J. Eric Ivancich <ivancich@redhat.com>
(cherry picked from commit 4577801271454b147cb3b1f3169d268d1c14948b)

6 years agorgw: change the bucket reshard lock to exclusive-ephemeral
J. Eric Ivancich [Fri, 12 Oct 2018 14:24:32 +0000 (10:24 -0400)]
rgw: change the bucket reshard lock to exclusive-ephemeral

The bucket reshard lock was simply an exclusive lock that existed on
an object solely for the purpose of representing the lock. This is now
changed to exclusvie-ephemeral lock, so as not to leave these objects
behind.

Signed-off-by: J. Eric Ivancich <ivancich@redhat.com>
(cherry picked from commit bc0a5ff95247b41b5a8336a8c4ed83c4315c1966)

6 years agocls: add exclusive ephemeral locks that auto-clean
J. Eric Ivancich [Fri, 12 Oct 2018 14:23:57 +0000 (10:23 -0400)]
cls: add exclusive ephemeral locks that auto-clean

Add a new type of cls lock -- exclusive ephemeral for which the
object only exists to represent the lock and for which the object
should be deleted at unlock. This is to prevent the accumulation of
unneeded objects in the cluster by automatically cleaning them up.

Signed-off-by: J. Eric Ivancich <ivancich@redhat.com>
(cherry picked from commit a289f2d8654cf4b865430465b87299d3618b41c8)

6 years agorgw: renew resharding locks to prevent expiration
J. Eric Ivancich [Thu, 27 Sep 2018 17:31:57 +0000 (13:31 -0400)]
rgw: renew resharding locks to prevent expiration

Fix lock expiration problem with resharding. The resharding process
will renew its bucket lock (and logshard lock if necessary) when half
the remaining time is left on the lock. If the lock is expired and
cannot renew the process fails and errors out appropriately.

Signed-off-by: J. Eric Ivancich <ivancich@redhat.com>
(cherry picked from commit 8cebffa1d8ad4df6fdae4e10e782aad0753545ce)

6 years agocls: add semantics for cls locks to require renewal without expiring
J. Eric Ivancich [Mon, 1 Oct 2018 19:18:39 +0000 (15:18 -0400)]
cls: add semantics for cls locks to require renewal without expiring

Add ability to *require* renewal of an existing lock in addition
toexisting ability to *allow* renewal of an existing lock. The key
difference is that a MUST_RENEW will fail if the lock has expired
(where a MAY_RENEW) will succeed. This provides calling code with the
ability to verify that a lock is held continually and that it was
never lost/expired.

Signed-off-by: J. Eric Ivancich <ivancich@redhat.com>
(cherry picked from commit 479c90993cc80b140af010a72ff9a34ecb7b3b32)

6 years agorgw: use the same lock when resharding
Orit Wasserman [Fri, 21 Sep 2018 13:26:41 +0000 (16:26 +0300)]
rgw: use the same lock when resharding

Signed-off-by: Orit Wasserman <owasserm@redhat.com>
(cherry picked from commit 173bfc87544e4f349e2d51e8e34acb83c6067897)

6 years agorgw: renew resharding lock during bucket resharding
Orit Wasserman [Fri, 21 Sep 2018 07:48:21 +0000 (10:48 +0300)]
rgw: renew resharding lock during bucket resharding

Signed-off-by: Orit Wasserman <owasserm@owasserm.redhat.com>
(cherry picked from commit 32d85970d4d635a31c35b48fb0fb62c2db11e40a)

6 years agoMerge pull request #25030 from smithfarm/wip-36464-luminous
Yuri Weinstein [Thu, 29 Nov 2018 02:45:51 +0000 (18:45 -0800)]
Merge pull request #25030 from smithfarm/wip-36464-luminous

luminous: core: mgr crash on scrub of unconnected osd

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
6 years agoMerge pull request #24648 from liewegas/wip-warning-fixes-luminous
Yuri Weinstein [Thu, 29 Nov 2018 02:45:06 +0000 (18:45 -0800)]
Merge pull request #24648 from liewegas/wip-warning-fixes-luminous

luminous: common: shut up some warnings

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
6 years agoMerge pull request #25025 from smithfarm/wip-36556-luminous
Yuri Weinstein [Thu, 29 Nov 2018 01:16:57 +0000 (17:16 -0800)]
Merge pull request #25025 from smithfarm/wip-36556-luminous

luminous: osd: RBD client IOPS pool stats are incorrect (2x higher; includes IO hints as an op)

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
6 years agoMerge pull request #25028 from smithfarm/wip-36646-luminous
Yuri Weinstein [Thu, 29 Nov 2018 01:16:18 +0000 (17:16 -0800)]
Merge pull request #25028 from smithfarm/wip-36646-luminous

luminous: tests: librados api aio tests race condition

Reviewed-by: Kefu Chai <kchai@redhat.com>
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
6 years agoMerge pull request #25032 from smithfarm/wip-36575-luminous
Yuri Weinstein [Thu, 29 Nov 2018 01:15:12 +0000 (17:15 -0800)]
Merge pull request #25032 from smithfarm/wip-36575-luminous

luminous: cephfs: mgr/status: fix fs status subcommand did not show standby-replay MDS' perf info

Reviewed-by: Yan, Zheng <zyan@redhat.com>
6 years agoMerge pull request #25035 from smithfarm/wip-36636-luminous
Yuri Weinstein [Thu, 29 Nov 2018 01:14:35 +0000 (17:14 -0800)]
Merge pull request #25035 from smithfarm/wip-36636-luminous

luminous: osd: race condition opening heartbeat connection

Reviewed-by: Greg Farnum <gfarnum@redhat.com>
6 years agoMerge pull request #25074 from jonsger/wip-36657-luminous
Yuri Weinstein [Thu, 29 Nov 2018 01:14:06 +0000 (17:14 -0800)]
Merge pull request #25074 from jonsger/wip-36657-luminous

luminous: core: Objecter: add ignore cache flag if got redirect reply

Reviewed-by: Greg Farnum <gfarnum@redhat.com>
6 years agoMerge pull request #25184 from smithfarm/wip-25201-luminous
Yuri Weinstein [Thu, 29 Nov 2018 01:13:03 +0000 (17:13 -0800)]
Merge pull request #25184 from smithfarm/wip-25201-luminous

luminous: mgr: Module 'influx' has failed

Reviewed-by: Wido den Hollander <wido@widodh.nl>
6 years agoMerge pull request #24501 from dzafman/wip-remove-luminous
Yuri Weinstein [Thu, 29 Nov 2018 01:09:24 +0000 (17:09 -0800)]
Merge pull request #24501 from dzafman/wip-remove-luminous

luminous: core: enable the pg deletion process to be throttled

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
6 years agoMerge pull request #24649 from liewegas/wip-pr-23273-luminous
Yuri Weinstein [Thu, 29 Nov 2018 01:07:42 +0000 (17:07 -0800)]
Merge pull request #24649 from liewegas/wip-pr-23273-luminous

luminous: os/bluestore: handle spurious read errors

Reviewed-by: Igor Fedotov <ifedotov@suse.com>
6 years agoMerge pull request #24705 from smithfarm/wip-36145-luminous
Yuri Weinstein [Thu, 29 Nov 2018 01:06:53 +0000 (17:06 -0800)]
Merge pull request #24705 from smithfarm/wip-36145-luminous

luminous: fsck: cid is improperly matched to oid

Reviewed-by: Kefu Chai <kchai@redhat.com>
Reviewed-by: xie xingguo <xie.xingguo@zte.com.cn>
6 years agoMerge pull request #24833 from trociny/wip-36630-luminous
Yuri Weinstein [Thu, 29 Nov 2018 01:05:31 +0000 (17:05 -0800)]
Merge pull request #24833 from trociny/wip-36630-luminous

luminous: osd: potential deadlock in PG::_scan_snaps when repairing snap mapper

Reviewed-by: Sage Weil <sage@redhat.com>
Reviewed-by: Nathan Cutler <ncutler@suse.com>
6 years agoMerge pull request #24989 from jonsger/wip-36638-luminous
Yuri Weinstein [Thu, 29 Nov 2018 01:04:49 +0000 (17:04 -0800)]
Merge pull request #24989 from jonsger/wip-36638-luminous

luminous: core: bluestore: rename does not old ref to replacement onode at old name

Reviewed-by: Sage Weil <sage@redhat.com>
Reviewed-by: Jianpeng Ma <jianpeng.ma@intel.com>
6 years agoMerge pull request #24992 from jonsger/wip-26943-luminous
Yuri Weinstein [Thu, 29 Nov 2018 01:03:53 +0000 (17:03 -0800)]
Merge pull request #24992 from jonsger/wip-26943-luminous

luminous: os/bluestore/BlueStore.cc: 1025: FAILED assert(buffer_bytes >= b->length) from ObjectStore/StoreTest.ColSplitTest2/2

Reviewed-by: Igor Fedotov <ifedotov@suse.com>
6 years agoMerge pull request #25018 from smithfarm/wip-36436-luminous
Yuri Weinstein [Thu, 29 Nov 2018 01:03:00 +0000 (17:03 -0800)]
Merge pull request #25018 from smithfarm/wip-36436-luminous

luminous: core: rados rm --force-full is blocked when cluster is in full status

Reviewed-by: xie xingguo <xie.xingguo@zte.com.cn>
6 years agoMerge pull request #25267 from noahdesu/luminous-docs
Noah Watkins [Wed, 28 Nov 2018 18:59:43 +0000 (10:59 -0800)]
Merge pull request #25267 from noahdesu/luminous-docs

luminous: doc: show edit on github links and version warnings

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
6 years agodoc/_templates/page.html: add link to report doc bugs 25267/head
Neha Ojha [Fri, 19 Oct 2018 20:26:53 +0000 (13:26 -0700)]
doc/_templates/page.html: add link to report doc bugs

Signed-off-by: Neha Ojha <nojha@redhat.com>
(cherry picked from commit de30c4d3c5927bb58b49b44b391647e397e1fe4f)

6 years agoluminous: doc: show edit on github links and version warnings
Noah Watkins [Mon, 1 Oct 2018 23:54:19 +0000 (16:54 -0700)]
luminous: doc: show edit on github links and version warnings

backport of #24452 that adds edit on
github links to documentation and notification banners that display
warnings when old documentation is being viewed.

this is not a cherry-pick: it removes from the original patch the
dynamic generation of the releases schedule from a yaml database file.
backporting this portion requires modifying the patch to deal with a
different file / directory structure [in luminous] with no real added value.

Signed-off-by: Noah Watkins <nwatkins@redhat.com>
6 years agoMerge pull request #25049 from gregsfortytwo/luminous-readable-failures
Nathan Cutler [Tue, 27 Nov 2018 21:32:10 +0000 (22:32 +0100)]
Merge pull request #25049 from gregsfortytwo/luminous-readable-failures

luminous: test: make readable.sh fail if it doesn't run anything

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Nathan Cutler <ncutler@suse.com>
6 years agoMerge pull request #25001 from MrStupnikov/luminous-docfix-1358
Nathan Cutler [Tue, 27 Nov 2018 21:03:56 +0000 (22:03 +0100)]
Merge pull request #25001 from MrStupnikov/luminous-docfix-1358

luminous: doc: Put command template into literal block

Reviewed-by: Nathan Cutler <ncutler@suse.com>
6 years agoMerge pull request #24752 from joscollin/wip-36309-luminous
Nathan Cutler [Tue, 27 Nov 2018 20:55:22 +0000 (21:55 +0100)]
Merge pull request #24752 from joscollin/wip-36309-luminous

luminous: doc: Fix typo error on cephfs/fuse/

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Jeff Layton <jlayton@redhat.com>
Reviewed-by: Xie Xingguo <xie.xingguo@zte.com.cn>
6 years agoMerge pull request #24434 from joscollin/wip-36312-luminous
Nathan Cutler [Tue, 27 Nov 2018 20:31:42 +0000 (21:31 +0100)]
Merge pull request #24434 from joscollin/wip-36312-luminous

luminous: doc: fix broken fstab url in cephfs/fuse

Reviewed-by: Yan, Zheng <zyan@redhat.com>
6 years agoqa/workunits/rados/test_librados_build.sh: grab files from explicit git branch 25286/head
Nathan Cutler [Mon, 26 Nov 2018 23:33:36 +0000 (00:33 +0100)]
qa/workunits/rados/test_librados_build.sh: grab files from explicit git branch

Fixes: https://tracker.ceph.com/issues/37392
Signed-off-by: Nathan Cutler <ncutler@suse.com>
(cherry picked from commit 6c6ae9e30d02a0844289f249269e7dcc0058ab88)

Conflicts:
    qa/workunits/rados/test_librados_build.sh
- use luminous as the branch in luminous
- similar change in the comment line, below

6 years agodoc/rados: Add bluestore memory autotuning docs. 25284/head
Mark Nelson [Mon, 12 Nov 2018 23:17:42 +0000 (17:17 -0600)]
doc/rados: Add bluestore memory autotuning docs.

Signed-off-by: Mark Nelson <mnelson@redhat.com>
(cherry picked from commit 9e2d1a52abacbab2b4dab24bd209dc12d26fe2db)

Conflicts:
doc/rados/configuration/bluestore-config-ref.rst

Deployment docs updated in master, pulled in those here too

6 years ago12.2.10 v12.2.10
Jenkins Build Slave User [Mon, 26 Nov 2018 19:35:58 +0000 (19:35 +0000)]
12.2.10

6 years agomgr/balancer: add cmd to list all plans 25259/head
Yang Honggang [Fri, 11 May 2018 02:30:09 +0000 (02:30 +0000)]
mgr/balancer: add cmd to list all plans

Signed-off-by: Yang Honggang <yanghonggang@umcloud.com>
(cherry picked from commit 99245c69829c53dab6790da5502420c7183c6bcd)

6 years agolibrbd: fix missing unblock_writes if shrink is not allowed 25253/head
runsisi [Mon, 12 Nov 2018 12:01:32 +0000 (20:01 +0800)]
librbd: fix missing unblock_writes if shrink is not allowed

Fixes: http://tracker.ceph.com/issues/36778
Signed-off-by: runsisi <luo.runbing@zte.com.cn>
(cherry picked from commit 3899bee9f5ea2c4b19fb1266a8b59f6e04e99926)

6 years agotests: rgw: test_multi: python 2-3 compatibility 25320/head
Abhishek Lekshmanan [Tue, 20 Nov 2018 18:29:42 +0000 (19:29 +0100)]
tests: rgw: test_multi: python 2-3 compatibility

Adding the usual six libraries for python3 compat

Signed-off-by: Abhishek Lekshmanan <abhishek@suse.com>
(cherry picked from commit 0510d47d47d32f627059a8728042d447d62e8d81)

Conflicts:
 src/test/rgw/rgw_multi/zone_cloud.py
- this file isn't a part of luminous

6 years agotests: tests for master zone deletion
Abhishek Lekshmanan [Tue, 20 Nov 2018 13:30:32 +0000 (14:30 +0100)]
tests: tests for master zone deletion

Adding a basic test in test_multi that creates a new zonegroup and zone and
removes them, period update after zone deletion will fail now lessening the
chance for the period referring to a non existant master_zone. Subsequent
zonegroup deletion will allow things to pass.

Signed-off-by: Abhishek Lekshmanan <abhishek@suse.com>
(cherry picked from commit e3d60c053316c1bc108ca3b3c38f8354db3690be)

6 years agorgw: period update: check for dangling master zone references
Abhishek Lekshmanan [Mon, 19 Nov 2018 16:47:51 +0000 (17:47 +0100)]
rgw: period update: check for dangling master zone references

If we are deleting a master zone of a zonegroup fail on period update, if this
was intentional, either creating / modifying a zone as master or in case of
deletions, deletion of the zonegroup itself will correct the period update to
work correctly. Without the check, while period commit will be successful a
subsequent RGWRados::init_complete() will fail.

Signed-off-by: Abhishek Lekshmanan <abhishek@suse.com>
(cherry picked from commit 3bea89096857230ecd6137ec6fe551c4ae20ae37)

src/rgw/rgw_zone.cc
Changes made to corresponding function rgw_rados

6 years agorgw: allow init complete to proceed in case of erroneus zone deletes
Abhishek Lekshmanan [Mon, 19 Nov 2018 16:21:15 +0000 (17:21 +0100)]
rgw: allow init complete to proceed in case of erroneus zone deletes

Currently a master zone delete in a zonegroup followed by a period commit would
render RGWRados to be unusable, check if the zonegroup is empty and continue
initialization in these cases so that removal can proceed.

Fixes: http://tracker.ceph.com/issues/37328
Signed-off-by: Abhishek Lekshmanan <abhishek@suse.com>
(cherry picked from commit 58eb3545e4db2c9208fbc09789b89dd9d2ff7a71)

* src/rgw/services/svc_zone.cc
Changes are made to the same function in  rgw_rados.cc
as the rgw_rados refactor is post mimic

6 years agomgr: prometheus: added bluestore db and wal devices to ceph_disk_occupation metric. 25216/head
Konstantin Shalygin [Tue, 30 Oct 2018 06:43:02 +0000 (13:43 +0700)]
mgr: prometheus: added bluestore db and wal devices to ceph_disk_occupation metric.

Signed-off-by: Konstantin Shalygin <k0ste@k0ste.ru>
(cherry picked from commit 12fbf48e637ca807d4d489eebdfe64bb8cd1909d)

6 years agoradosgw-admin: translate reshard status codes (trivial) 25195/head
Matt Benjamin [Wed, 17 Oct 2018 14:43:01 +0000 (10:43 -0400)]
radosgw-admin: translate reshard status codes (trivial)

Fixes: http://tracker.ceph.com/issues/36486
Signed-off-by: Matt Benjamin <mbenjamin@redhat.com>
(cherry picked from commit 731fca4f921e8227e907b204dec9f1016d66b8c3)

6 years agoMerge pull request #25170 from badone/wip-luminous-ceph_ansible-specify-branch-stable-3.2
Yuri Weinstein [Tue, 20 Nov 2018 17:19:43 +0000 (09:19 -0800)]
Merge pull request #25170 from badone/wip-luminous-ceph_ansible-specify-branch-stable-3.2

luminous: qa/ceph-ansible: Specify stable-3.2 branch

Reviewed-by: Yuri Weinstein <yweinste@redhat.com>
6 years agomgr: Ignore daemon if no metadata was returned 25184/head
Nathan Cutler [Tue, 20 Nov 2018 12:45:11 +0000 (13:45 +0100)]
mgr: Ignore daemon if no metadata was returned

It can happen that the Mgr does not return any metadata for a given
daemon as it might not be available at that moment.

None is returned by the get_metadata() method at that moment and both
the Influx and Telegraf module should then ignore the daemon in their
statistics and continue on to the next daemon.

Signed-off-by: Wido den Hollander <wido@42on.com>
(manual cherry-pick of 02569c88341a6dcc8aee5626f10ac2927c2cf064)

Signed-off-by: Nathan Cutler <ncutler@suse.com>
6 years agoqa/ceph-ansible: Specify stable-3.2 branch 25170/head
Brad Hubbard [Sun, 18 Nov 2018 23:37:02 +0000 (09:37 +1000)]
qa/ceph-ansible: Specify stable-3.2 branch

Ceph-ansible no longer supports luminous post stable-3.2.

Fixes: https://tracker.ceph.com/issues/37331
Signed-off-by: Brad Hubbard <bhubbard@redhat.com>
6 years agoMerge pull request #25162 from yuriw/wip-yuriw-ceph-ansible-ver-luminous
Yuri Weinstein [Tue, 20 Nov 2018 02:00:59 +0000 (18:00 -0800)]
Merge pull request #25162 from yuriw/wip-yuriw-ceph-ansible-ver-luminous

qa/tests: clean up

Reviewed-by: Brad Hubbard <bhubbard@redhat.com>
6 years agoqa/tests: clean up 25162/head
Yuri Weinstein [Tue, 20 Nov 2018 01:53:50 +0000 (17:53 -0800)]
qa/tests: clean up

Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
6 years agoos/bluestore: handle spurious read errors 24649/head
Sage Weil [Wed, 17 Oct 2018 22:12:34 +0000 (17:12 -0500)]
os/bluestore: handle spurious read errors

Some kernels (4.9+) sometime fail to return data when reading
from a block device under memory pressure. This patch retries
the read if the checksum verification fails, tests show that
the first retried read succeeds in ~99.5% of the cases, so
3 attempts are made by default before giving up on the data.

Works-around: http://tracker.ceph.com/issues/22464
Signed-off-by: Paul Emmerich <paul.emmerich@croit.io>
(cherry picked from commit cffcbc73aaaa874829d5fc9091af3042b887f9a7)

 Conflicts:
src/common/legacy_config_opts.h
- adjacent options
src/common/options.cc
- no RUNTIME flag in luminous
src/os/bluestore/BlueStore.cc
src/os/bluestore/BlueStore.h
- adjacent perfcounter
src/test/objectstore/store_test.cc
- adjacent tests, no #ifdef
- g_conf, not g_conf()
- no create_new_collection
- queue_transaction etc take osr, not ch

6 years agoluminous: doc: put command template into literal block 25001/head
Alexey Stupnikov [Thu, 8 Nov 2018 19:44:33 +0000 (20:44 +0100)]
luminous: doc: put command template into literal block

``ceph-deploy osd create --data {device} {ceph-node}`` command
is not displayed properly because of missing double colons.

Signed-off-by: Alexey Stupnikov <aleksey.stupnikov@gmail.com>
(cherry picked from commit bff5a2122fd2a059c6576f36d7d1505c8d747776)

6 years agoMerge pull request #25134 from alfredodeza/luminous-c-v-dont-duplicated-rejected...
Alfredo Deza [Fri, 16 Nov 2018 18:02:56 +0000 (13:02 -0500)]
Merge pull request #25134 from alfredodeza/luminous-c-v-dont-duplicated-rejected-reasons

luminous ceph-volume: rename Device property valid to available

Reviewed-by: Jan Fajerski <jfajerski@suse.com>
6 years agoc-v device test: rename valid to available 25134/head
Jan Fajerski [Fri, 16 Nov 2018 08:27:34 +0000 (09:27 +0100)]
c-v device test: rename valid to available

Update test to reflect 8a80990471108b0920d1d8aa1239733ae2b20e9c

Signed-off-by: Jan Fajerski <jfajerski@suse.com>
(cherry picked from commit 047dc1c157e44642fa83873f02674b146e6d9028)

6 years agoinventory test: rename valid to available
Jan Fajerski [Fri, 16 Nov 2018 08:24:37 +0000 (09:24 +0100)]
inventory test: rename valid to available

Fixes test to reflect 8a80990471108b0920d1d8aa1239733ae2b20e9c

Signed-off-by: Jan Fajerski <jfajerski@suse.com>
(cherry picked from commit d895f97bb021c765f3c4d2d091b23bd0eef19f71)

6 years agoceph-volume: rename Device property valid to available
Jan Fajerski [Fri, 16 Nov 2018 08:22:06 +0000 (09:22 +0100)]
ceph-volume: rename Device property valid to available

This flag is used in the inventory reporting and available is deemed more
appropriate. Furthermore this fixes a bug where rejected_reasons
accumulated duplicate entries.

Fixes: http://tracker.ceph.com/issues/36701
Signed-off-by: Jan Fajerski <jfajerski@suse.com>
(cherry picked from commit 8a80990471108b0920d1d8aa1239733ae2b20e9c)

6 years agoMerge pull request #25131 from alfredodeza/luminous-c-v-inventory-doc
Alfredo Deza [Fri, 16 Nov 2018 15:22:15 +0000 (10:22 -0500)]
Merge pull request #25131 from alfredodeza/luminous-c-v-inventory-doc

luminous doc: add ceph-volume inventory sections

Reviewed-by: Jan Fajerski <jfajerski@suse.com>
6 years agodoc: add ceph-volume inventory sections 25131/head
Jan Fajerski [Wed, 14 Nov 2018 13:09:37 +0000 (14:09 +0100)]
doc: add ceph-volume inventory sections

Signed-off-by: Jan Fajerski <jfajerski@suse.com>
(cherry picked from commit dfc0e1d9a69f6f9ffb1d3e0b0ec33a75827c36e0)

6 years agoMerge pull request #25117 from alfredodeza/luminous-rm26953 25124/head
Andrew Schoen [Thu, 15 Nov 2018 17:23:29 +0000 (11:23 -0600)]
Merge pull request #25117 from alfredodeza/luminous-rm26953

luminous ceph-volume: allow to specify --cluster-fsid instead of reading from ceph.conf

Reviewed-by: Andrew Schoen <aschoen@redhat.com>
6 years agoceph-volume tests.lvm verify get_cluster_fsid can read from args or ceph.conf 25117/head
Alfredo Deza [Wed, 3 Oct 2018 19:21:03 +0000 (15:21 -0400)]
ceph-volume tests.lvm verify get_cluster_fsid can read from args or ceph.conf

Signed-off-by: Alfredo Deza <adeza@redhat.com>
(cherry picked from commit 02ea6dc0663cdaf764fdb1a1a5b31a736b854e6e)

6 years agoceph-volume lvm.prepare consume the value from --cluster-fsid if available
Alfredo Deza [Wed, 3 Oct 2018 17:32:05 +0000 (13:32 -0400)]
ceph-volume lvm.prepare consume the value from --cluster-fsid if available

Signed-off-by: Alfredo Deza <adeza@redhat.com>
(cherry picked from commit a92883e60f3f2206a11b43762b3afcb84b05f07e)

6 years agoceph-volume lvm.common add the --cluster-fsid flag
Alfredo Deza [Wed, 3 Oct 2018 17:31:43 +0000 (13:31 -0400)]
ceph-volume lvm.common add the --cluster-fsid flag

Signed-off-by: Alfredo Deza <adeza@redhat.com>
(cherry picked from commit 15546ae6991c244197537d81092875b12613e20b)