]> git.apps.os.sepia.ceph.com Git - ceph-ci.git/log
ceph-ci.git
6 years agocrush/CrushWrapper: add update_weight_sets arg to adjust_item_weight_*
Sage Weil [Thu, 14 Mar 2019 16:29:10 +0000 (11:29 -0500)]
crush/CrushWrapper: add update_weight_sets arg to adjust_item_weight_*

- Make it optional whether the weight-set weights are adjusted to
  match the weight.

- Fix the adjustment of the parent bucket(s) so that the
  summations in weight-sets are correctly maintained.  Prior to
  this change, if I adjust any weight, all parent buckets'
  weight-set weights are reset to the bucket's primary weight.

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agocrush/CrushWrapper: refactor adjust_weight_* into per-bucket helper
Sage Weil [Wed, 13 Mar 2019 21:22:00 +0000 (16:22 -0500)]
crush/CrushWrapper: refactor adjust_weight_* into per-bucket helper

This unifies the regular and _in_loc variants.

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agocrush/CrushWrapper: pass cct down into more places
Sage Weil [Fri, 8 Mar 2019 23:23:29 +0000 (17:23 -0600)]
crush/CrushWrapper: pass cct down into more places

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agocrush/CrushWrapper: ensure crush_choose_arg_map.size == max_buckets
Sage Weil [Fri, 8 Mar 2019 22:54:53 +0000 (16:54 -0600)]
crush/CrushWrapper: ensure crush_choose_arg_map.size == max_buckets

The crush/builder.c crush_add_bucket method resizes the max_buckets array
but a power of 2 when it has to expand, but the code in CrushWrapper was
assuming that if the array grew the pos for the new bucket would be the
last position in the new array.  This led to a situation where the
crush_choose_arg_map args array size didn't match max_buckets, and
eventually caused a crash.

Fixes: http://tracker.ceph.com/issues/38664
Signed-off-by: Sage Weil <sage@redhat.com>
6 years agoMerge PR #26838 into master
Ilya Dryomov [Fri, 8 Mar 2019 17:44:21 +0000 (18:44 +0100)]
Merge PR #26838 into master

* refs/pull/26838/head:
librbd: avoid aggregate-initializing IsWriteOpVisitor

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
6 years agoMerge pull request #26826 from ceph/wip-simple-scan-all
Alfredo Deza [Fri, 8 Mar 2019 16:49:07 +0000 (11:49 -0500)]
Merge pull request #26826 from ceph/wip-simple-scan-all

ceph-volume: `simple scan` will now scan all running ceph-disk OSDs

Reviewed-by: Alfredo Deza <adeza@redhat.com>
6 years agoceph-volume: make systemctl.get_running_osds resilient to garbage output
Andrew Schoen [Fri, 8 Mar 2019 15:53:08 +0000 (09:53 -0600)]
ceph-volume: make systemctl.get_running_osds resilient to garbage output

Makes systemd.systemctl.get_running_osds more resilient to output
that doesn't match what we're expecting to parse. This also handles
the case of a nonzero exit status by retuning an empty list.

Signed-off-by: Andrew Schoen <aschoen@redhat.com>
6 years agoceph-volume: remove obsoleted simple unit test
Andrew Schoen [Thu, 7 Mar 2019 18:38:02 +0000 (12:38 -0600)]
ceph-volume: remove obsoleted simple unit test

This test is no longer valid because `simple scan`
without any arguments will now attempt to scan all
running OSDs.

Signed-off-by: Andrew Schoen <aschoen@redhat.com>
6 years agoceph-volume: tox tests should ignore the tests/functional directory
Andrew Schoen [Thu, 7 Mar 2019 18:37:00 +0000 (12:37 -0600)]
ceph-volume: tox tests should ignore the tests/functional directory

Without this the functional testinfra tests try to run when we
are intending to only run the unit tests.

Signed-off-by: Andrew Schoen <aschoen@redhat.com>
6 years agoceph-volume: use `simple scan` and `simple activate --all` in tests
Andrew Schoen [Thu, 7 Mar 2019 17:57:03 +0000 (11:57 -0600)]
ceph-volume: use `simple scan` and `simple activate --all` in tests

In a couple of the functional tests for simple we should run `simple
scan` without any arguments and `simple activate --all` to increase
the test coverage. We want to test pointing at a diretory and scanning
all running OSDs.

Signed-off-by: Andrew Schoen <aschoen@redhat.com>
6 years agoceph-volume: update docs and help menu for `simple scan`
Andrew Schoen [Thu, 7 Mar 2019 17:10:19 +0000 (11:10 -0600)]
ceph-volume: update docs and help menu for `simple scan`

Signed-off-by: Andrew Schoen <aschoen@redhat.com>
6 years agoceph-volume: adds unit tests for systemd.systemctl.get_running_osd_ids
Andrew Schoen [Thu, 7 Mar 2019 16:01:40 +0000 (10:01 -0600)]
ceph-volume: adds unit tests for systemd.systemctl.get_running_osd_ids

Signed-off-by: Andrew Schoen <aschoen@redhat.com>
6 years agoceph-volume: update `simple scan` to scan all running OSDs
Andrew Schoen [Wed, 6 Mar 2019 22:52:33 +0000 (16:52 -0600)]
ceph-volume: update `simple scan` to scan all running OSDs

If no argument is passed to `ceph-volume simple scan` it will
inspect any running osds and scan them if they were created by
ceph-disk.

Signed-off-by: Andrew Schoen <aschoen@redhat.com>
6 years agoceph-volume: add systemd.systemctl.get_running_osd_ids
Andrew Schoen [Wed, 6 Mar 2019 22:51:06 +0000 (16:51 -0600)]
ceph-volume: add systemd.systemctl.get_running_osd_ids

This is used to retrieve a list of osd ids from the currently
running ceph-osd systemd units.

Signed-off-by: Andrew Schoen <aschoen@redhat.com>
6 years agoMerge pull request #26815 from tchaikov/wip-38584
Patrick Nawracay [Fri, 8 Mar 2019 15:24:18 +0000 (16:24 +0100)]
Merge pull request #26815 from tchaikov/wip-38584

doc/releases/nautilus: dashboard package notes

Reviewed-by: Laura Paduano <lpaduano@suse.com>
Reviewed-by: Sage Weil <sage@redhat.com>
Reviewed-by: Sebastian Wagner <sebastian.wagner@suse.com>
6 years agoMerge PR #26823 into master
Sage Weil [Fri, 8 Mar 2019 15:04:21 +0000 (09:04 -0600)]
Merge PR #26823 into master

* refs/pull/26823/head:
qa/suites: disable valgrind leak checks on ceph-mgr
mgr: skip shutdown and exit

Reviewed-by: Tim Serong <tserong@suse.com>
Reviewed-by: xie xingguo <xie.xingguo@zte.com.cn>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
6 years agolibrbd: avoid aggregate-initializing IsWriteOpVisitor
Willem Jan Withagen [Fri, 8 Mar 2019 13:59:09 +0000 (14:59 +0100)]
librbd: avoid aggregate-initializing IsWriteOpVisitor

Clang 8 complains about missing dtor, and this circumvents this problem.
```
/home/jenkins/workspace/ceph-master/src/librbd/io/ImageDispatchSpec.cc:136:48:
error: temporary of type 'boost::static_visitor<bool>' has protected
destructor
   return boost::apply_visitor(IsWriteOpVisitor{}, m_request);
                                                ^
/home/jenkins/workspace/ceph-master/src/librbd/io/ImageDispatchSpec.cc:152:28:
note: in instantiation of member function
'librbd::io::ImageDispatchSpec<librbd::ImageCtx>::is_write_op' requested
here
template class librbd::io::ImageDispatchSpec<librbd::ImageCtx>;
                            ^
/usr/local/include/boost/variant/static_visitor.hpp:53:5: note: declared
protected here
     ~static_visitor() = default;
     ^
1 error generated.
gmake[2]: *** [src/librbd/CMakeFiles/rbd_internal.dir/build.make:895:
src/librbd/CMakeFiles/rbd_internal.dir/io/ImageDispatchSpec.cc.o] Error 1
```

The details of the problem are explained in more detail in:
https://github.com/ceph/ceph/pull/26301/commits/09870d7a083913b5a3830ba8b0be04d0b0996aca

Signed-off-by: Willem Jan Withagen <wjw@digiware.nl>
6 years agoMerge PR #26796 into master
Sage Weil [Fri, 8 Mar 2019 14:11:47 +0000 (08:11 -0600)]
Merge PR #26796 into master

* refs/pull/26796/head:
common/options: osd_memory_target et al are TYPE_SIZE

Reviewed-by: xie xingguo <xie.xingguo@zte.com.cn>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
6 years agoMerge pull request #26827 from adamemerson/wip-doc-s3-op-acl
Casey Bodley [Fri, 8 Mar 2019 13:54:57 +0000 (08:54 -0500)]
Merge pull request #26827 from adamemerson/wip-doc-s3-op-acl

Document mappings of S3 Operations to ACL grants

Reviewed-by: Casey Bodley <cbodley@redhat.com>
6 years agoMerge PR #26832 into master
Sage Weil [Fri, 8 Mar 2019 13:04:35 +0000 (07:04 -0600)]
Merge PR #26832 into master

* refs/pull/26832/head:
msg/async/ProtocolV2: end scope of std::hex properly

Reviewed-by: Sage Weil <sage@redhat.com>
6 years agoMerge PR #26794 into master
Sage Weil [Fri, 8 Mar 2019 04:12:52 +0000 (22:12 -0600)]
Merge PR #26794 into master

* refs/pull/26794/head:
mon/MgrMonitor: only try to update always_on_modules if >= NAUTILUS
qa/standalone/mon/msgr-v2-transition: add some tests for enabling msgr v2
mon/MonmapMonitor: add 'ceph mon set-addrs <name> <addrvec>' command
Revert "mon/MonClient: disable ms_bind_msgr2 if NAUTILUS feature not set"
mon/OSDMonitor: use legacy_equals to compare osd addrs
msg/msg_types: make legacy_equals() symmetrical
mon/MDSMonitor: stop using get_orig_source_inst()

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
6 years agoMerge PR #26770 into master
Sage Weil [Fri, 8 Mar 2019 04:10:12 +0000 (22:10 -0600)]
Merge PR #26770 into master

* refs/pull/26770/head:
qa/standalone/osd/osd-force-create-pg: create more pgs
qa/standalone: make sure an osd is running before create_rbd_pool

Reviewed-by: Mykola Golub <mgolub@suse.com>
6 years agoMerge PR #26504 into master
Sage Weil [Fri, 8 Mar 2019 03:58:01 +0000 (21:58 -0600)]
Merge PR #26504 into master

* refs/pull/26504/head:
osd: fixup OpTracker destruct assert, waiting_for_osdmap take ref

Reviewed-by: xie xingguo <xie.xingguo@zte.com.cn>
Reviewed-by: Sage Weil <sage@redhat.com>
6 years agoMerge PR #26803 into master
Sage Weil [Thu, 7 Mar 2019 22:59:24 +0000 (16:59 -0600)]
Merge PR #26803 into master

* refs/pull/26803/head:
msg/async/ProtocolV1: fix locking around authorizer_buf

Reviewed-by: Ricardo Dias <rdias@suse.com>
6 years agoMerge PR #26781 into master
Patrick Donnelly [Thu, 7 Mar 2019 22:40:16 +0000 (14:40 -0800)]
Merge PR #26781 into master

* refs/pull/26781/head:
mds: avoid calling clientreplay_done() prematurely

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
6 years agoMerge PR #26773 into master
Patrick Donnelly [Thu, 7 Mar 2019 22:38:48 +0000 (14:38 -0800)]
Merge PR #26773 into master

* refs/pull/26773/head:
mds: refactor subtree accessors

Reviewed-by: Zheng Yan <zyan@redhat.com>
6 years agomon/MgrMonitor: only try to update always_on_modules if >= NAUTILUS
Sage Weil [Thu, 7 Mar 2019 19:42:34 +0000 (13:42 -0600)]
mon/MgrMonitor: only try to update always_on_modules if >= NAUTILUS

Pre-nautilus mgrmap doesn't encode have this field, so we'll just loop.

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agoqa/standalone/mon/msgr-v2-transition: add some tests for enabling msgr v2
Sage Weil [Wed, 6 Mar 2019 22:23:08 +0000 (16:23 -0600)]
qa/standalone/mon/msgr-v2-transition: add some tests for enabling msgr v2

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomon/MonmapMonitor: add 'ceph mon set-addrs <name> <addrvec>' command
Sage Weil [Wed, 6 Mar 2019 22:21:31 +0000 (16:21 -0600)]
mon/MonmapMonitor: add 'ceph mon set-addrs <name> <addrvec>' command

This lets us explicitly adjust the IPs/ports a monitor binds to.

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agoRevert "mon/MonClient: disable ms_bind_msgr2 if NAUTILUS feature not set"
Sage Weil [Wed, 6 Mar 2019 17:54:43 +0000 (11:54 -0600)]
Revert "mon/MonClient: disable ms_bind_msgr2 if NAUTILUS feature not set"

This reverts commit f3ddb1c9b88067b1a4b54d23d825ca4d62e10586.

That behavior complicated the process of moving to v1 -> v2+v1 (extra
daemon restart, for example).  Also, it wasn't sufficient: there are lots
of other reasons besides the mon version that could lead to a mismatch
between what we bind to locally and what is published in the cluster
maps.  Better to embrace that reality and address those other issues.

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomon/OSDMonitor: use legacy_equals to compare osd addrs
Sage Weil [Wed, 6 Mar 2019 17:22:18 +0000 (11:22 -0600)]
mon/OSDMonitor: use legacy_equals to compare osd addrs

The OSDs may bind to v2+v1 due to ms_bind_msgr2, but we may only use the
v1 address in the OSDMap due to

 - the osd->mon connection being v1 (mon isn't using v2 yet)
 - the osd require_osd_release < nautilus
 - ???

Conversely, the OSD may end up with a v2+v1 address in the osdmap, but
may connect to the current monitor via v1 only, due to the mon only being
bound to a v1 address.

If/when this happens, make sure we are doing a loose comparison of the
osdmap addrs vs the osd's addr they are connecting from.

Fixes: http://tracker.ceph.com/issues/38598
Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomsg/msg_types: make legacy_equals() symmetrical
Sage Weil [Wed, 6 Mar 2019 17:05:07 +0000 (11:05 -0600)]
msg/msg_types: make legacy_equals() symmetrical

The prior version would return true for

 [v2,v1] vs [v2,v1]
 [v2,v1] vs [v1]

but not

 [v1] vs [v2,v1]

Now that final case is also true, which means a.legacy_equals(b) ==
b.legacy_equals(a) for any combination.  This is what you would expect
for an equality operator... even a fuzzy one.

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomon/MDSMonitor: stop using get_orig_source_inst()
Sage Weil [Wed, 6 Mar 2019 17:04:51 +0000 (11:04 -0600)]
mon/MDSMonitor: stop using get_orig_source_inst()

Use get_orig_source() + get_orig_source_addrs()

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agodoc/radosgw: Document mappings of S3 Operations to ACL grants
Adam C. Emerson [Thu, 7 Mar 2019 20:18:42 +0000 (15:18 -0500)]
doc/radosgw: Document mappings of S3 Operations to ACL grants

Since the ACL documentation is a bit sparse, let people know exactly
what operations they can expect to enable with a given ACL grant.

Fixes: https://tracker.ceph.com/issues/38523
Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
6 years agoMerge PR #26802 into master
Sage Weil [Thu, 7 Mar 2019 19:49:58 +0000 (13:49 -0600)]
Merge PR #26802 into master

* refs/pull/26802/head:
qa/suites/upgrade/mimic-x/parallel: run master rados/test.sh

Reviewed-by: Yuri Weinstein <yweins@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
6 years agoqa/suites: disable valgrind leak checks on ceph-mgr
Sage Weil [Thu, 7 Mar 2019 17:23:19 +0000 (11:23 -0600)]
qa/suites: disable valgrind leak checks on ceph-mgr

We've disabled the "clean" shutdown in ceph-mgr due to
https://tracker.ceph.com/issues/38621

Until then, no valgrind leak checks!

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomgr: skip shutdown and exit
Sage Weil [Thu, 7 Mar 2019 17:22:27 +0000 (11:22 -0600)]
mgr: skip shutdown and exit

The python subinterpreter teardown is broken; see
https://tracker.ceph.com/issues/38621

Instead, just exit with the appropriate error code.  That way we don't
crash and leave a core and ugly stack trace behind.

Users won't care, although our leak checking will.

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agoMerge PR #26821 into master
Patrick Donnelly [Thu, 7 Mar 2019 16:48:05 +0000 (08:48 -0800)]
Merge PR #26821 into master

* refs/pull/26821/head:
mds,mgr,mon: warn future respawn() copy-pasters to set thread name

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
6 years agoMerge PR #26764 into master
Sage Weil [Thu, 7 Mar 2019 14:52:56 +0000 (08:52 -0600)]
Merge PR #26764 into master

* refs/pull/26764/head:
mgr: 'osd df' by specified class or (crush) name
mon/OSDMonitor: add 'osd crush get-device-class' command

6 years agomds,mgr,mon: warn future respawn() copy-pasters to set thread name
Sage Weil [Thu, 7 Mar 2019 14:50:53 +0000 (08:50 -0600)]
mds,mgr,mon: warn future respawn() copy-pasters to set thread name

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agocommon/options: osd_memory_target et al are TYPE_SIZE
Sage Weil [Wed, 6 Mar 2019 19:28:57 +0000 (13:28 -0600)]
common/options: osd_memory_target et al are TYPE_SIZE

This makes us interpret "2G" and 2 GiB, not 2 billion bytes.

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agodoc/mgr/dashboard: note dashboard as a separate package
Kefu Chai [Thu, 7 Mar 2019 14:31:17 +0000 (22:31 +0800)]
doc/mgr/dashboard: note dashboard as a separate package

Fixes: https://tracker.ceph.com/issues/38584
Signed-off-by: Kefu Chai <kchai@redhat.com>
6 years agodoc/releases/nautilus: dashboard package notes
Kefu Chai [Thu, 7 Mar 2019 12:08:44 +0000 (20:08 +0800)]
doc/releases/nautilus: dashboard package notes

Fixes: https://tracker.ceph.com/issues/38584
Signed-off-by: Kefu Chai <kchai@redhat.com>
6 years agoMerge pull request #26743 from votdev/cleanup
Sebastian Wagner [Thu, 7 Mar 2019 13:14:51 +0000 (14:14 +0100)]
Merge pull request #26743 from votdev/cleanup

mgr/dashboard: Cleanup Python code

Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Sebastian Wagner <sebastian.wagner@suse.com>
Reviewed-by: Tatjana Dehler <tdehler@suse.com>
6 years agoMerge PR #26797 into master
Sage Weil [Thu, 7 Mar 2019 12:36:04 +0000 (06:36 -0600)]
Merge PR #26797 into master

* refs/pull/26797/head:
ceph-mon: reset process name to 'ceph-mon' on startup

Reviewed-by: Kefu Chai <kchai@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: João Eduardo Luís <joao@suse.de>
6 years agoMerge PR #26795 into master
Sage Weil [Thu, 7 Mar 2019 12:35:49 +0000 (06:35 -0600)]
Merge PR #26795 into master

* refs/pull/26795/head:
mon/MgrStatMonitor: fix typo

Reviewed-by: Neha Ojha <nojha@redhat.com>
6 years agoMerge PR #26027 into master
Sage Weil [Thu, 7 Mar 2019 12:32:30 +0000 (06:32 -0600)]
Merge PR #26027 into master

* refs/pull/26027/head:
osd,mon,mds: support get|set tcmalloc release rate

Reviewed-by: Sage Weil <sage@redhat.com>
6 years agoMerge PR #26674 into master
Sage Weil [Thu, 7 Mar 2019 12:31:03 +0000 (06:31 -0600)]
Merge PR #26674 into master

* refs/pull/26674/head:
os/bluestore: log long aio operations in KernelDevice
os/bluestore: treat bdev_debug_aio_suicide_timeout==0 as no suicide
os/bluestore: track omap iterator latencies.
os/bluestore: log slow operations/long lasting states

Reviewed-by: Sage Weil <sage@redhat.com>
6 years agoMerge PR #26718 into master
Sage Weil [Thu, 7 Mar 2019 12:30:46 +0000 (06:30 -0600)]
Merge PR #26718 into master

* refs/pull/26718/head:
rpm: put numpy and scipy deps in distro conditionals

Reviewed-by: Kefu Chai <kchai@redhat.com>
6 years agoMerge PR #26725 into master
Sage Weil [Thu, 7 Mar 2019 12:30:29 +0000 (06:30 -0600)]
Merge PR #26725 into master

* refs/pull/26725/head:
doc/releases/nautilus: ask users to opt in to telemetry
doc/mgr/telemtry: update docs
mgr/telemetry: drop config-set and config-show; add just show
mgr/telemetry: make 'telemetry show' readable by a human
mgr/telemetry: add 'telemetry on' and 'telemetry off' commands
mgr/telemetry: off by default

Reviewed-by: Wido den Hollander <wido@42on.com>
Reviewed-by: Dan Mick <dmick@redhat.com>
6 years agoMerge pull request #26716 from rjfd/wip-dashboard-ganesha-docs
Ricardo Dias [Thu, 7 Mar 2019 12:10:37 +0000 (12:10 +0000)]
Merge pull request #26716 from rjfd/wip-dashboard-ganesha-docs

mgr/dashboard: nfs-ganesha: controller API documentation

Reviewed-by: Jeff Layton <jlayton@redhat.com>
Reviewed-by: Lenz Grimmer <lgrimmer@suse.com>
6 years agomsg/async/ProtocolV2: end scope of std::hex properly
xie xingguo [Thu, 7 Mar 2019 11:49:29 +0000 (19:49 +0800)]
msg/async/ProtocolV2: end scope of std::hex properly

Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>
6 years agoMerge PR #26775 into master
Ilya Dryomov [Thu, 7 Mar 2019 11:19:36 +0000 (12:19 +0100)]
Merge PR #26775 into master

* refs/pull/26775/head:
qa/suites/krbd/wac: bluestore snippet is placed incorrectly

Reviewed-by: Mike Christie <mchristi@redhat.com>
6 years agoMerge pull request #26778 from xiexingguo/wip-fix-wrong-lpr
Xie Xingguo [Thu, 7 Mar 2019 08:40:59 +0000 (16:40 +0800)]
Merge pull request #26778 from xiexingguo/wip-fix-wrong-lpr

osd/PrimaryLogPG: fix last_peering_reset checking on manifest flushing

Reviewed-by: Myoungwon Oh <ommw@sk.com>
6 years agomsg/async/ProtocolV1: fix locking around authorizer_buf
Sage Weil [Wed, 6 Mar 2019 23:40:48 +0000 (17:40 -0600)]
msg/async/ProtocolV1: fix locking around authorizer_buf

Fix two problems:

- we are accessing authorizer_buf without the connection lock, and
  under the lock we are modifying it (in connect()).
- if we receive two connect_msg's with a different length, we won't
  have a buffer that's large enough.

Fixes: http://tracker.ceph.com/issues/38524
Signed-off-by: Sage Weil <sage@redhat.com>
6 years agoqa/suites/upgrade/mimic-x/parallel: run master rados/test.sh
Sage Weil [Wed, 6 Mar 2019 22:50:12 +0000 (16:50 -0600)]
qa/suites/upgrade/mimic-x/parallel: run master rados/test.sh

We rename ceph_test_rados_api_tier to add _pp, so the mimic version doesn't
work.  And in any case, at this stage the client host has master installed.

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agoqa/standalone/osd/osd-force-create-pg: create more pgs
Sage Weil [Wed, 6 Mar 2019 18:15:42 +0000 (12:15 -0600)]
qa/standalone/osd/osd-force-create-pg: create more pgs

Avoid warnings about too few pgs.

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agoqa/standalone: make sure an osd is running before create_rbd_pool
Sage Weil [Tue, 5 Mar 2019 15:19:38 +0000 (09:19 -0600)]
qa/standalone: make sure an osd is running before create_rbd_pool

'rbd pool init' now does IO.  Drop the pool, or change the pool size to 1.

Fixes: http://tracker.ceph.com/issues/38585
Signed-off-by: Sage Weil <sage@redhat.com>
6 years agoceph-mon: reset process name to 'ceph-mon' on startup
Sage Weil [Wed, 6 Mar 2019 19:49:50 +0000 (13:49 -0600)]
ceph-mon: reset process name to 'ceph-mon' on startup

After a respawn the process name ends up as 'exe' (because we do execve
on /proc/$pid/exe).  Reset to 'ceph-mon' so that killall works.

Fixes: http://tracker.ceph.com/issues/38604
Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomon/MgrStatMonitor: fix typo
Sage Weil [Wed, 6 Mar 2019 19:27:41 +0000 (13:27 -0600)]
mon/MgrStatMonitor: fix typo

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agoMerge pull request #26633 from jtlayton/wip-nfs-scale
Sebastian Wagner [Wed, 6 Mar 2019 18:08:48 +0000 (19:08 +0100)]
Merge pull request #26633 from jtlayton/wip-nfs-scale

mgr/orchestrator: Allow the orchestrator to scale the NFS server count

Reviewed-by: Sebastian Wagner <sebastian.wagner@suse.com>
6 years agoMerge PR #26748 into master
Sage Weil [Wed, 6 Mar 2019 15:22:42 +0000 (09:22 -0600)]
Merge PR #26748 into master

* refs/pull/26748/head:
ceph_test_rados_[a]io_pp: add test verifying read-into-bufferlist behavior

Reviewed-by: Neha Ojha <nojha@redhat.com>
6 years agorpm: put numpy and scipy deps in distro conditionals
Nathan Cutler [Fri, 1 Mar 2019 12:54:07 +0000 (13:54 +0100)]
rpm: put numpy and scipy deps in distro conditionals

Without this patch, attempts to install the ceph-mgr-diskprediction-local RPM
fail on SUSE platforms with the following error:

can't install ceph-mgr-diskprediction-local-14.1.0.402+ga396e8bf3b-3742.1.noarch:
  nothing provides numpy needed by ceph-mgr-diskprediction-local-14.1.0.402+ga396e8bf3b-3742.1.noarch
  nothing provides scipy needed by ceph-mgr-diskprediction-local-14.1.0.402+ga396e8bf3b-3742.1.noarch

Also take into account package naming differences between Fedora and
RHEL/CentOS.

Signed-off-by: Kefu Chai <kchai@redhat.com>
Signed-off-by: Nathan Cutler <ncutler@suse.com>
6 years agomgr/orchestrator: allow scaling the NFS server count up and down
Jeff Layton [Mon, 25 Feb 2019 14:21:08 +0000 (09:21 -0500)]
mgr/orchestrator: allow scaling the NFS server count up and down

Add a new 'ceph orchestrator nfs update' command that will take the
NFS clustername and a new count as arguments. That will get translated
to a StatelessServiceSpec and passed to update_stateless_service.

Also, add the necessary stubs to the test_orchestrator and the CLI
QA test.

Signed-off-by: Jeff Layton <jlayton@redhat.com>
6 years agomgr/rook: allow scaling nfs count
Jeff Layton [Mon, 25 Feb 2019 14:27:02 +0000 (09:27 -0500)]
mgr/rook: allow scaling nfs count

Allow rook to handle scaling the NFS server count up and down in an NFS
cluster. We just manifest these changes as change to the
spec.server.active field in the CRD.

Signed-off-by: Jeff Layton <jlayton@redhat.com>
6 years agoorchestrator: drop id_ argument from update_stateless_service
Jeff Layton [Thu, 28 Feb 2019 17:33:04 +0000 (12:33 -0500)]
orchestrator: drop id_ argument from update_stateless_service

...we have a "name" field in StatelessServiceSpec.

Signed-off-by: Jeff Layton <jlayton@redhat.com>
6 years agomgr/orchestrator: just keep a single count value in StatelessServiceSpec
Jeff Layton [Tue, 26 Feb 2019 19:53:45 +0000 (14:53 -0500)]
mgr/orchestrator: just keep a single count value in StatelessServiceSpec

We currently have a min_size/max_size values in here, but we don't have
any orchestrators that can take advantage of two values. Let's just keep
a simple count for now, until we do.

Signed-off-by: Jeff Layton <jlayton@redhat.com>
6 years agoMerge pull request #26388 from tspmelo/wip-iscsi-no-auth-ui
Ricardo Marques [Wed, 6 Mar 2019 12:11:05 +0000 (12:11 +0000)]
Merge pull request #26388 from tspmelo/wip-iscsi-no-auth-ui

mgr/dashboard: Add UI for disabling ACL authentication

Reviewed-by: Ricardo Marques <rimarques@suse.com>
6 years agoMerge pull request #26382 from ricardoasmarques/no-auth
Volker Theile [Wed, 6 Mar 2019 11:45:33 +0000 (12:45 +0100)]
Merge pull request #26382 from ricardoasmarques/no-auth

mgr/dashboard: iSCSI - Add support for disabling ACL authentication (backend)

Reviewed-by: Volker Theile <vtheile@suse.com>
6 years agoMerge pull request #26783 from cyx1231st/wip-fix-async-echo
Kefu Chai [Wed, 6 Mar 2019 10:15:09 +0000 (18:15 +0800)]
Merge pull request #26783 from cyx1231st/wip-fix-async-echo

test/crimson: add dummy_auth to test_async_echo

Reviewed-by: Kefu Chai <kchai@redhat.com>
6 years agotest/crimson: add dummy_auth to test_async_echo
Yingxin Cheng [Wed, 6 Mar 2019 09:29:04 +0000 (17:29 +0800)]
test/crimson: add dummy_auth to test_async_echo

fix echo between async-msgrs

Signed-off-by: Yingxin Cheng <yingxincheng@gmail.com>
6 years agomds: avoid calling clientreplay_done() prematurely
Yan, Zheng [Wed, 6 Mar 2019 08:08:05 +0000 (16:08 +0800)]
mds: avoid calling clientreplay_done() prematurely

maybe_clientreplay_done() does not correctly handle the case that
replayed request is in the finished_queue (hasn't been dispatched)

Fixes: https://tracker.ceph.com/issues/38597
Signed-off-by: "Yan, Zheng" <zyan@redhat.com>
6 years agoosd/PrimaryLogPG: fix last_peering_reset checking on manifest flushing
xie xingguo [Wed, 6 Mar 2019 06:11:16 +0000 (14:11 +0800)]
osd/PrimaryLogPG: fix last_peering_reset checking on manifest flushing

```handle_manifest_flush``` is obviously using the wrong
**last_peering_reset** to check whether a new peering procedure
has been re-initialized by then.

Fix by using a different alias of the local copy of the
pg-wide **last_peering_reset** variable, which is less confusing and
error-prone.

Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>
6 years agoMerge PR #26749 into master
Sage Weil [Wed, 6 Mar 2019 03:12:19 +0000 (21:12 -0600)]
Merge PR #26749 into master

* refs/pull/26749/head:
common/util: report CONTAINER_IMAGE in daemon metadata

Reviewed-by: Sébastien Han <seb@redhat.com>
6 years agomgr: 'osd df' by specified class or (crush) name
xie xingguo [Tue, 5 Mar 2019 06:28:59 +0000 (14:28 +0800)]
mgr: 'osd df' by specified class or (crush) name

For large clusters, we use device classes to isolate storage pools.
The existing 'osd df' output turns out to be too nosiy, say, if
you care about only single storage pool with osds possibly spanning over
all hosts.

With this change you are now being able to do 'osd df' by class (or by pool,
if you simply use classes to separate different pools), or by a specified
crush bucket name you are currently interested in, which is much more
convenient.

Some examples:
```
$ bin/ceph osd df tree
ID CLASS WEIGHT  REWEIGHT SIZE   RAW USE DATA    OMAP META  AVAIL   %USE  VAR  PGS STATUS TYPE NAME
-1       0.05878        - 60 GiB 6.4 GiB  23 MiB  0 B 6 GiB  54 GiB 10.60 1.00   -        root default
-3       0.02939        - 30 GiB 3.2 GiB  12 MiB  0 B 3 GiB  27 GiB 10.60 1.00   -            host ceph11
 3   aaa 0.00980  1.00000 10 GiB 1.1 GiB 3.9 MiB  0 B 1 GiB 9.0 GiB 10.60 1.00  56     up         osd.3
 4   bbb 0.00980  1.00000 10 GiB 1.1 GiB 3.9 MiB  0 B 1 GiB 9.0 GiB 10.60 1.00  58     up         osd.4
 5   ccc 0.00980  1.00000 10 GiB 1.1 GiB 3.9 MiB  0 B 1 GiB 9.0 GiB 10.60 1.00  60     up         osd.5
-5       0.02939        - 30 GiB 3.2 GiB  12 MiB  0 B 3 GiB  27 GiB 10.60 1.00   -            host ceph12
 0   aaa 0.00980  1.00000 10 GiB 1.1 GiB 3.9 MiB  0 B 1 GiB 9.0 GiB 10.60 1.00  50     up         osd.0
 1   bbb 0.00980  1.00000 10 GiB 1.1 GiB 3.9 MiB  0 B 1 GiB 9.0 GiB 10.60 1.00  61     up         osd.1
 2   ccc 0.00980  1.00000 10 GiB 1.1 GiB 3.9 MiB  0 B 1 GiB 9.0 GiB 10.60 1.00  51     up         osd.2
                    TOTAL 60 GiB 6.4 GiB  23 MiB  0 B 6 GiB  54 GiB 10.60
MIN/MAX VAR: 1.00/1.00  STDDEV: 0

$ bin/ceph osd df tree class aaa
ID CLASS WEIGHT  REWEIGHT SIZE   RAW USE DATA    OMAP META  AVAIL   %USE  VAR  PGS STATUS TYPE NAME
-1       0.05878        - 20 GiB 2.1 GiB 7.8 MiB  0 B 2 GiB  18 GiB 10.60 1.00   -        root default
-3       0.02939        - 10 GiB 1.1 GiB 3.9 MiB  0 B 1 GiB 9.0 GiB 10.60 1.00   -            host ceph11
 3   aaa 0.00980  1.00000 10 GiB 1.1 GiB 3.9 MiB  0 B 1 GiB 9.0 GiB 10.60 1.00  56     up         osd.3
-5       0.02939        - 10 GiB 1.1 GiB 3.9 MiB  0 B 1 GiB 9.0 GiB 10.60 1.00   -            host ceph12
 0   aaa 0.00980  1.00000 10 GiB 1.1 GiB 3.9 MiB  0 B 1 GiB 9.0 GiB 10.60 1.00  50     up         osd.0
                    TOTAL 20 GiB 2.1 GiB 7.8 MiB  0 B 2 GiB  18 GiB 10.60
MIN/MAX VAR: 1.00/1.00  STDDEV: 0

$ bin/ceph osd df tree name ceph11
ID CLASS WEIGHT  REWEIGHT SIZE   RAW USE DATA    OMAP META  AVAIL   %USE  VAR  PGS STATUS TYPE NAME
-3       0.02939        - 30 GiB 3.2 GiB  12 MiB  0 B 3 GiB  27 GiB 10.60 1.00   -            host ceph11
 3   aaa 0.00980  1.00000 10 GiB 1.1 GiB 3.9 MiB  0 B 1 GiB 9.0 GiB 10.60 1.00  56     up         osd.3
 4   bbb 0.00980  1.00000 10 GiB 1.1 GiB 3.9 MiB  0 B 1 GiB 9.0 GiB 10.60 1.00  58     up         osd.4
 5   ccc 0.00980  1.00000 10 GiB 1.1 GiB 3.9 MiB  0 B 1 GiB 9.0 GiB 10.60 1.00  60     up         osd.5
                    TOTAL 30 GiB 3.2 GiB  12 MiB  0 B 3 GiB  27 GiB 10.60
MIN/MAX VAR: 1.00/1.00  STDDEV: 0

```

Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>
6 years agoMerge pull request #26774 from liu-chunmei/ceph_seastar_osd_preboot
Kefu Chai [Wed, 6 Mar 2019 02:42:55 +0000 (10:42 +0800)]
Merge pull request #26774 from liu-chunmei/ceph_seastar_osd_preboot

crimson/osd: correct the order of parameters passed to OSD::_preboot()

Reviewed-by: Kefu Chai <kchai@redhat.com>
6 years agocrimson/osd: correct the order of parameters passed to OSD::_preboot()
chunmei Liu [Wed, 6 Mar 2019 01:34:28 +0000 (17:34 -0800)]
crimson/osd: correct the order of parameters passed to OSD::_preboot()

Signed-off-by: chunmei Liu <chunmei.liu@intel.com>
6 years agoqa/suites/krbd/wac: bluestore snippet is placed incorrectly
Ilya Dryomov [Tue, 5 Mar 2019 22:07:27 +0000 (23:07 +0100)]
qa/suites/krbd/wac: bluestore snippet is placed incorrectly

Instead of generating three tests, each with bluestore-bitmap.yaml, it
generates four tests: one consisting of just bluestore-bitmap.yaml and
the other three without any trace of bluestore.  This was introduced in
commit 711df71790fa ("qa: objectstore snippets for krbd").

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
6 years agomds: refactor subtree accessors
Patrick Donnelly [Tue, 5 Mar 2019 15:51:04 +0000 (07:51 -0800)]
mds: refactor subtree accessors

Mostly avoiding inefficient use of std::set.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
6 years agoMerge PR #26740 into master
Sage Weil [Tue, 5 Mar 2019 20:05:54 +0000 (14:05 -0600)]
Merge PR #26740 into master

* refs/pull/26740/head:
rpm: advance spec file copyright date

Reviewed-by: Ken Dreyer <kdreyer@redhat.com>
6 years agoMerge PR #26767 into master
Sage Weil [Tue, 5 Mar 2019 17:08:03 +0000 (11:08 -0600)]
Merge PR #26767 into master

* refs/pull/26767/head:
Revert "msg/async/Protocol*: send keep alive if existing wins"

Reviewed-by: Sage Weil <sage@redhat.com>
6 years agoMerge PR #26719 into master
Sage Weil [Tue, 5 Mar 2019 15:12:43 +0000 (09:12 -0600)]
Merge PR #26719 into master

* refs/pull/26719/head:
rpm: re-add pyOpenSSL as ceph-mgr runtime dependency

Reviewed-by: Kefu Chai <kchai@redhat.com>
6 years agoMerge PR #25540 into master
Sage Weil [Tue, 5 Mar 2019 15:12:26 +0000 (09:12 -0600)]
Merge PR #25540 into master

* refs/pull/25540/head:
mgr/ServiceMap: show service's id on summary

Reviewed-by: Sage Weil <sage@redhat.com>
6 years agoMerge PR #26567 into master
Sage Weil [Tue, 5 Mar 2019 15:11:53 +0000 (09:11 -0600)]
Merge PR #26567 into master

* refs/pull/26567/head:
os/bluestore: fix bluefs inability to reclaim space

Reviewed-by: Igor Fedotov <ifedotov@suse.com>
6 years agoMerge PR #26752 into master
Sage Weil [Tue, 5 Mar 2019 15:10:33 +0000 (09:10 -0600)]
Merge PR #26752 into master

* refs/pull/26752/head:
msg/async, v2: round up segments to CRYPTO_BLOCK_SIZE in secure mode.

Reviewed-by: Sage Weil <sage@redhat.com>
6 years agoMerge PR #26742 into master
Sage Weil [Tue, 5 Mar 2019 15:09:29 +0000 (09:09 -0600)]
Merge PR #26742 into master

* refs/pull/26742/head:
osd/PG: do not touch this->cct after PG is destroyed

Reviewed-by: Kefu Chai <kchai@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
6 years agoMerge PR #26460 into master
Patrick Donnelly [Tue, 5 Mar 2019 14:25:19 +0000 (06:25 -0800)]
Merge PR #26460 into master

* refs/pull/26460/head:
client: parameter "cap" is not used

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Jeff Layton <jlayton@redhat.com>
6 years agoMerge pull request #26729 from xiexingguo/wip-recovery-priority-restrictions
Xie Xingguo [Tue, 5 Mar 2019 12:16:18 +0000 (20:16 +0800)]
Merge pull request #26729 from xiexingguo/wip-recovery-priority-restrictions

mon/OSDMonitor: add boundary check for pool recovery_priority

Reviewed-by: David Zafman <dzafman@redhat.com>
6 years agoRevert "msg/async/Protocol*: send keep alive if existing wins"
xie xingguo [Tue, 5 Mar 2019 11:46:09 +0000 (19:46 +0800)]
Revert "msg/async/Protocol*: send keep alive if existing wins"

This reverts commit 61b9432ef9a3847eceb96f8d5a854567c49bbf61.

If we are in the middle of replacing, we can not queue any further
write events into the old center because we may end up replacing
existing connection's center with a new one, and hence executing
the newly queued write events in the old thread.

See **transfer_existing** for a detailed description.

Also the patch does not make a lot of sense for the original issue
it tried to resolve, because **send_keepalive** is a pure noop if the
underlying connection is not ready, which is obviously true for the
case demonstrated in http://tracker.ceph.com/issues/38493..

Fixes: http://tracker.ceph.com/issues/38569
Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>
6 years agoclient: parameter "cap" is not used
huanwen ren [Fri, 15 Feb 2019 18:58:01 +0000 (02:58 +0800)]
client: parameter "cap" is not used

Signed-off-by: huanwen ren <ren.huanwen@zte.com.cn>
6 years agomon/OSDMonitor: add 'osd crush get-device-class' command
xie xingguo [Tue, 5 Mar 2019 09:10:41 +0000 (17:10 +0800)]
mon/OSDMonitor: add 'osd crush get-device-class' command

which should work as a good complementation of
the existing **set-device-class** and "rm-device-class"
command family.

Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>
6 years agoMerge pull request #26715 from sebastian-philipp/script-pypy
Sebastian Wagner [Tue, 5 Mar 2019 09:18:18 +0000 (10:18 +0100)]
Merge pull request #26715 from sebastian-philipp/script-pypy

src/script: add run_mypy to run static type checking on Python code

Reviewed-by: Brad Hubbard <bhubbard@redhat.com>
Reviewed-by: Volker Theile <vtheile@suse.com>
6 years agomon/OSDMonitor: add boundary check for pool recovery_priority
xie xingguo [Sat, 2 Mar 2019 01:43:58 +0000 (09:43 +0800)]
mon/OSDMonitor: add boundary check for pool recovery_priority

See https://github.com/ceph/ceph/pull/26705

Fixes: http://tracker.ceph.com/issues/38578
Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>
6 years agoMerge PR #26444 into master
Sage Weil [Mon, 4 Mar 2019 23:26:12 +0000 (17:26 -0600)]
Merge PR #26444 into master

* refs/pull/26444/head:
ceph_features: deprecate MON_SINGLE_PAXOS
osdmap: map versions to releases
mon: retire some `osd set/unset` flags

Reviewed-by: Sage Weil <sage@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
6 years agoMerge PR #26754 into master
Sage Weil [Mon, 4 Mar 2019 23:15:41 +0000 (17:15 -0600)]
Merge PR #26754 into master

* refs/pull/26754/head:
osdc: remove lderr() hexdump from handle_osd_op_reply

Reviewed-by: Sage Weil <sage@redhat.com>
6 years agoMerge PR #26744 into master
Sage Weil [Mon, 4 Mar 2019 23:13:30 +0000 (17:13 -0600)]
Merge PR #26744 into master

* refs/pull/26744/head:
mds: fix encoded version of mds_info_t

Reviewed-by: Sage Weil <sage@redhat.com>
6 years agoMerge PR #26694 into master
Sage Weil [Mon, 4 Mar 2019 22:25:18 +0000 (16:25 -0600)]
Merge PR #26694 into master

* refs/pull/26694/head:
rpm: drop use of $FIRST_ARG

Reviewed-by: Boris Ranto <branto@redhat.com>
Reviewed-by: Tim Serong <tserong@suse.com>
Reviewed-by: Ken Dreyer <kdreyer@redhat.com>
6 years agoMerge pull request #26753 from ceph/ceph-volume-osd-ports
Alfredo Deza [Mon, 4 Mar 2019 21:58:36 +0000 (16:58 -0500)]
Merge pull request #26753 from ceph/ceph-volume-osd-ports

ceph-volume: set number of osd ports in the tests

Reviewed-by: Alfredo Deza <adeza@redhat.com>
6 years agoosdc: remove lderr() hexdump from handle_osd_op_reply
Casey Bodley [Mon, 4 Mar 2019 21:29:51 +0000 (16:29 -0500)]
osdc: remove lderr() hexdump from handle_osd_op_reply

Signed-off-by: Casey Bodley <cbodley@redhat.com>
6 years agoMerge PR #26631 into master
Sage Weil [Mon, 4 Mar 2019 21:00:09 +0000 (15:00 -0600)]
Merge PR #26631 into master

* refs/pull/26631/head:
msg/async, v2: fix unused variable warning in ::discard_out_queue().

Reviewed-by: Sage Weil <sage@redhat.com>