git-server-git.apps.pok.os.sepia.ceph.com Git - ceph.git/log

]> git-server-git.apps.pok.os.sepia.ceph.com Git - ceph.git/log

projects / ceph.git / log

summary | shortlog | log | commit | commitdiff | tree
first ⋅ prev ⋅ next

commit | commitdiff | tree

Sage Weil [Tue, 29 Oct 2019 14:28:18 +0000 (09:28 -0500)]

mgr/telemetry: send device telemetry via per-host POST to device endpoint

We do not want to associate devices with clusters because that may
communicate unnecessary information about the association between vendors
and clusters (which, when large, are potentially identifying).

Instead, do a POST per host with all of the devices on that host only.

The devices endpoint does not log the POST time, so these per-host
records won't be associated with each other.

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Ilya Dryomov [Tue, 29 Oct 2019 10:40:45 +0000 (11:40 +0100)]

Merge pull request #31023 from idryomov/wip-krbd-udev-enumerate-retry

krbd: retry on transient errors from udev_enumerate_scan_devices()

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
Reviewed-by: Dongsheng Yang <dongsheng.yang@easystack.cn>

commit | commitdiff | tree

Jan Fajerski [Tue, 29 Oct 2019 07:16:14 +0000 (08:16 +0100)]

Merge pull request #31072 from sebastian-philipp/ceph-volume-inventory-device-id

ceph-volume: add Ceph's device id to inventory

commit | commitdiff | tree

Patrick Donnelly [Tue, 29 Oct 2019 03:37:31 +0000 (20:37 -0700)]

Merge PR #31063 into master

* refs/pull/31063/head:
qa: disable too few PG warning during Mimic deploy

Reviewed-by: Nathan Cutler <ncutler@suse.com>
Reviewed-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Kefu Chai [Mon, 28 Oct 2019 23:57:36 +0000 (07:57 +0800)]

Merge pull request #31178 from tchaikov/wip/rpm/pyyaml

ceph.spec.in: add missing python-yaml dependency for mgr-k8sevents

Reviewed-by: Tim Serong <tserong@suse.com>
Reviewed-by: Ken Dreyer <kdreyer@redhat.com>

commit | commitdiff | tree

Sage Weil [Mon, 28 Oct 2019 19:59:43 +0000 (14:59 -0500)]

Merge PR #31168 into master

* refs/pull/31168/head:
ceph-daemon: try py2 import before py3
qa/suites/rados/singleton-nomsgr/ceph-daemon: make sure python3 is installed
qa/standalone/test_ceph_damon.sh: test with python2 and python3
mgr/ssh: python, not python3
ceph-daemon: python, not python3
ceph-daemon: os.makedirs
ceph-daemon: configparser is ConfigParser on py2
ceph-daemon: avoid py3-isms

Reviewed-by: Sebastian Wagner <swagner@suse.com>
Reviewed-by: Alfredo Deza <adeza@redhat.com>

commit | commitdiff | tree

Sage Weil [Mon, 28 Oct 2019 17:17:07 +0000 (12:17 -0500)]

ceph-daemon: try py2 import before py3

See https://github.com/ceph/ceph-medic/commit/f499a89b15b72a5bb842e1ebbc7cafc4daf58c1c

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Fri, 25 Oct 2019 21:34:15 +0000 (16:34 -0500)]

qa/suites/rados/singleton-nomsgr/ceph-daemon: make sure python3 is installed

Centos7 doesn't have it by default.

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Fri, 25 Oct 2019 21:20:46 +0000 (16:20 -0500)]

qa/standalone/test_ceph_damon.sh: test with python2 and python3

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Fri, 25 Oct 2019 21:03:24 +0000 (16:03 -0500)]

mgr/ssh: python, not python3

python 2.x is okay too.

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Fri, 25 Oct 2019 21:02:35 +0000 (16:02 -0500)]

ceph-daemon: python, not python3

python2 is okay too.

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Fri, 25 Oct 2019 21:24:40 +0000 (16:24 -0500)]

ceph-daemon: os.makedirs

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Fri, 25 Oct 2019 21:16:48 +0000 (16:16 -0500)]

ceph-daemon: configparser is ConfigParser on py2

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Wed, 23 Oct 2019 12:45:41 +0000 (07:45 -0500)]

ceph-daemon: avoid py3-isms

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Kefu Chai [Mon, 28 Oct 2019 15:18:02 +0000 (23:18 +0800)]

Merge pull request #31180 from liewegas/revert-bluestore-pinned-list

os/bluestore: revert cache pinned list

Reviewed-by: Mark Nelson <mnelson@redhat.com>

commit | commitdiff | tree

Sebastian Wagner [Fri, 18 Oct 2019 11:59:44 +0000 (13:59 +0200)]

ceph-volume: add Ceph's device id to inventory

This will benefit the orchestrator and dashboard to show a unified view of devices with SMART data

Signed-off-by: Sebastian Wagner <sebastian.wagner@suse.com>

commit | commitdiff | tree

Sebastian Wagner [Mon, 28 Oct 2019 14:07:08 +0000 (15:07 +0100)]

Merge pull request #31071 from sebastian-philipp/python-common-mypy

python-common: Add mypy testing

Reviewed-by: Jan Fajerski <jfajerski@suse.com>

commit | commitdiff | tree

Jan Fajerski [Mon, 28 Oct 2019 13:57:37 +0000 (14:57 +0100)]

Merge pull request #30995 from wjwithagen/wjw-wip-volume-zfs-inventory

ceph-volume-zfs: add the inventory command

commit | commitdiff | tree

Kefu Chai [Mon, 28 Oct 2019 12:53:07 +0000 (20:53 +0800)]

Merge pull request #31014 from tchaikov/wip-38378

mgr/dashboard: accept expected exception when SSL handshaking

Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Sebastian Wagner <swagner@suse.com>

commit | commitdiff | tree

Kefu Chai [Mon, 28 Oct 2019 12:36:13 +0000 (20:36 +0800)]

Merge pull request #31092 from liewegas/wip-revert-42026-test

Revert "test: librados startup/shutdown racer test"

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
Reviewed-by: Jeff Layton <jlayton@redhat.com>

commit | commitdiff | tree

Kefu Chai [Mon, 28 Oct 2019 12:29:20 +0000 (20:29 +0800)]

Merge pull request #31153 from ceph/wip-restful-node-items

restful: Query nodes_by_id for items

Reviewed-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Kefu Chai [Mon, 28 Oct 2019 12:27:22 +0000 (20:27 +0800)]

Merge pull request #31176 from tchaikov/wip/mgr/diskprediction_cloud/cleanup

mgr/diskprediction_cloud: refactor timeout() decorator

Reviewed-by: Rick Chen <rick.chen@prophetstor.com>
Reviewed-by: Ernesto Puerta <epuertat@redhat.com>

commit | commitdiff | tree

Kefu Chai [Mon, 28 Oct 2019 12:24:48 +0000 (20:24 +0800)]

Merge pull request #31175 from tchaikov/wip/mon/OSDMonitor/cleanup

mon/OSDMonitor: use initializer_list<> for {si,iec}_options

Reviewed-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Kefu Chai [Mon, 28 Oct 2019 12:11:14 +0000 (20:11 +0800)]

Merge pull request #30279 from runsisi/wip-fix-strerror_r

common: fix compat of strerror_r

Reviewed-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Kefu Chai [Mon, 28 Oct 2019 12:09:58 +0000 (20:09 +0800)]

Merge pull request #29836 from majianpeng/fix-async-msgr-perfcounter-bug

msg/async: Don't dec(msgr_active_connections) if conn still in accept…

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Ricardo Dias <rdias@suse.com>

commit | commitdiff | tree

Kefu Chai [Mon, 28 Oct 2019 04:27:11 +0000 (12:27 +0800)]

Merge pull request #31156 from cyx1231st/wip-seastar-msgr-testpeer

test/crimson: implement a remote async TestPeer for crimson msgr tests

Reviewed-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Yingxin Cheng [Fri, 25 Oct 2019 02:51:08 +0000 (10:51 +0800)]

test/crimson: add nonstop option in async TestPeer

If set, TestPeer won't shutdown if all tests are done, and will wait for
another round of tests.

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Yingxin Cheng [Fri, 25 Oct 2019 02:15:04 +0000 (10:15 +0800)]

test/crimson: add delay to markdown_peer()

Let test sleep awhile for peer markdown propagated.

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Yingxin Cheng [Wed, 23 Oct 2019 04:42:11 +0000 (12:42 +0800)]

test/crimson: async msgr TestPeer for v2 failover tests

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Yingxin Cheng [Wed, 23 Oct 2019 02:48:54 +0000 (10:48 +0800)]

test/crimson: allow v2 failover tests to connect to remote TestPeer

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Yingxin Cheng [Tue, 22 Oct 2019 02:24:08 +0000 (10:24 +0800)]

test/crimson: extract msgr test commands to header

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

J. Eric Ivancich [Sun, 27 Oct 2019 18:47:39 +0000 (14:47 -0400)]

Merge pull request #30610 from ivancich/wip-no-schedule-reshard-during-reshard

rgw: prevent bucket reshard scheduling if bucket is resharding

Reviewed-by: Matt Benjamin <mbenjamin@redhat.com>
Reviewed-by: Casey Bodley <cbodley@redhat.com>
Reviewed-by: Abhishek Lekshmanan <abhishek@suse.com>

commit | commitdiff | tree

J. Eric Ivancich [Sun, 27 Oct 2019 18:46:01 +0000 (14:46 -0400)]

Merge pull request #30795 from ivancich/wip-dynamic-reshard-adjustments

rgw: adjust allowable bucket index shard counts for dynamic resharding

Reviewed-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

J. Eric Ivancich [Sun, 27 Oct 2019 18:44:47 +0000 (14:44 -0400)]

Merge pull request #30845 from croit/rgw-bucket-stats-add-num-shards

rgw: add num_shards to radosgw-admin bucket stats

Reviewed-by: Casey Bodley <cbodley@redhat.com>
Reviewed-by: Abhishek Lekshmanan <abhishek@suse.com>
Reviewed-by: J. Eric Ivancich <ivancich@redhat.com>

commit | commitdiff | tree

J. Eric Ivancich [Sun, 27 Oct 2019 18:42:27 +0000 (14:42 -0400)]

Merge pull request #30956 from joke-lee/rgw-storage-class-post-obj

rgw: set correct storage class for post object upload

Reviewed-by: Abhishek Lekshmanan <abhishek@suse.com>

commit | commitdiff | tree

J. Eric Ivancich [Sun, 27 Oct 2019 18:39:46 +0000 (14:39 -0400)]

Merge pull request #30970 from theanalyst/rgw/list-common-prefix

rgw: url encode common prefixes for List Objects response

Reviewed-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

Kefu Chai [Sun, 27 Oct 2019 15:58:59 +0000 (23:58 +0800)]

Merge pull request #31179 from liewegas/cleanup-exec-bit

os/bluestore,comon,erasure-code: chmod -x source files

Reviewed-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Kefu Chai [Sun, 27 Oct 2019 15:57:57 +0000 (23:57 +0800)]

Merge pull request #31171 from liewegas/bug-42496

qa/tasks/cbt: run stop-all.sh while shutting down

Reviewed-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Sage Weil [Sun, 27 Oct 2019 14:59:10 +0000 (09:59 -0500)]

Revert "Merge pull request #30964 from markhpc/wip-bs-cache-trim-pinned"

This reverts commit 304c37f52165363cad4aaf31386d62ea5a573425, reversing
changes made to 7b4f9a083f0c8cc015f7b0cc12a3bc88db632610.

This causes some bluestore test failures due to an ENOENT right after a
new object is created. The simplest reproducer is the
ObjectStore/StoreTest.FiemapEmpty/2 test.

Fixes: https://tracker.ceph.com/issues/42495
Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Sun, 27 Oct 2019 14:25:53 +0000 (09:25 -0500)]

os/bluestore,comon,erasure-code: chmod -x source files

Fixes

*** WARNING: ./usr/src/debug/ceph-15.0.0-6512.g62bd825.el8.x86_64/src/common/obj_bencher.cc is executable but has empty or no shebang, removing executable bit
*** WARNING: ./usr/src/debug/ceph-15.0.0-6512.g62bd825.el8.x86_64/src/erasure-code/shec/determinant.c is executable but has empty or no shebang, removing executable bit
*** WARNING: ./usr/src/debug/ceph-15.0.0-6512.g62bd825.el8.x86_64/src/os/bluestore/AvlAllocator.cc is executable but has empty or no shebang, removing executable bit
*** WARNING: ./usr/src/debug/ceph-15.0.0-6512.g62bd825.el8.x86_64/src/os/bluestore/AvlAllocator.h is executable but has empty or no shebang, removing executable bit

in rpm build.

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Sun, 27 Oct 2019 14:21:56 +0000 (09:21 -0500)]

Merge PR #31170 into master

* refs/pull/31170/head:
src/telemetry: remove, now lives in ceph-telemetry.git

Reviewed-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Sun, 27 Oct 2019 14:20:51 +0000 (09:20 -0500)]

Merge PR #31131 into master

* refs/pull/31131/head:
crush: remove invalid upmap items

Reviewed-by: xie xingguo <xie.xingguo@zte.com.cn>

commit | commitdiff | tree

Sage Weil [Sun, 27 Oct 2019 14:19:56 +0000 (09:19 -0500)]

Merge PR #31154 into master

* refs/pull/31154/head:
auth/cephx: always initialize local variables

Reviewed-by: Neha Ojha <nojha@redhat.com>

commit | commitdiff | tree

Sage Weil [Sun, 27 Oct 2019 14:19:40 +0000 (09:19 -0500)]

Merge PR #31167 into master

* refs/pull/31167/head:
os/bluestore: do not mark per_pool_omap updated unless we fixed it

Reviewed-by: Igor Fedotov <ifedotov@suse.com>

commit | commitdiff | tree

Kefu Chai [Sun, 27 Oct 2019 03:10:22 +0000 (11:10 +0800)]

mgr/dashboard: accept socket error 0

see
https://github.com/cherrypy/cheroot/commit/9d73226ab28a07a38cbb6816edf9ff46be4f9de3

Fixes: https://tracker.ceph.com/issues/38378
Signed-off-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Kefu Chai [Sun, 27 Oct 2019 03:03:20 +0000 (11:03 +0800)]

mgr/dashboard: accept exceptions from builtin SSL

see also https://github.com/cherrypy/cheroot/pull/4

so we don't panic when client is trying to talk with us with an
unsupported protocol, the exception should be accepted, and the
client can fallback to supported protocol.

Fixes: https://tracker.ceph.com/issues/38378
Signed-off-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Kefu Chai [Sun, 27 Oct 2019 02:57:20 +0000 (10:57 +0800)]

mgr/dashboard: extract monkey patches for cherrypy out

so they are less distracting to the dashboard developers. and probably
less scaring.

Signed-off-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Kefu Chai [Sun, 27 Oct 2019 04:44:45 +0000 (12:44 +0800)]

ceph.spec.in: add missing python-yaml dependency for mgr-k8sevents

otherwise we might have:

```
ceph/src/pybind/mgr/k8sevents/__init__.py", line 1, in <module>
    from .module import Module
  File "/home/kchai/ceph/src/pybind/mgr/k8sevents/module.py", line 28, in <module>
    import yaml
ImportError: No module named yaml
```

Signed-off-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Kefu Chai [Sun, 27 Oct 2019 02:00:01 +0000 (10:00 +0800)]

mgr/diskprediction_cloud: refactor timeout() decorator

* timeout() is never passed any parameter when being called, so let's
  remove the parameters list of "seconds" and "error_message"
* use `getattr()` instead of `hasattr()` for retrieving the
  member variable of `self`
* pass `self` to wrapper function explicitly.
* return `func()` right away.
* hardwire the error message of `TimeoutError` to "Timer expired",
  because
  - as neither errno.ETIME nor errno.ETIMEOUT is portable
  - the only caller of `TimeoutError` is `timeout()`, so there is
    no need to have the flexibility to pass a different error message
* use `wraps()` as a decorator, simpler this way.

Signed-off-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Kefu Chai [Sun, 27 Oct 2019 01:43:39 +0000 (09:43 +0800)]

mon/OSDMonitor: use initializer_list<> for {si,iec}_options

* use initializer_list<> for {si,iec}_options, no need to uset set<>
* remove the comments, the variable names are self-documented.

Signed-off-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Kefu Chai [Sun, 27 Oct 2019 01:38:23 +0000 (09:38 +0800)]

Merge pull request #31010 from pdvian/wip-fix-target-max

mon/OSDMonitor.cc: Allow pool set target_max_(objects/bytes) with SI/IEC units

Reviewed-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Kefu Chai [Sun, 27 Oct 2019 01:18:16 +0000 (09:18 +0800)]

Merge pull request #30434 from smithfarm/wip-41820

qa: enable dashboard tests to be run with "--suite rados/dashboard"

Reviewed-by: Laura Paduano <lpaduano@suse.com>
Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Kefu Chai [Sun, 27 Oct 2019 01:16:37 +0000 (09:16 +0800)]

Merge pull request #31005 from tchaikov/wip/qa/tasks/ceph/cleanup

qa/tasks/ceph.py: remove unused variables

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>

commit | commitdiff | tree

Sage Weil [Sat, 26 Oct 2019 00:20:05 +0000 (19:20 -0500)]

qa/tasks/cbt: run stop-all.sh when finishing up

stop-all.sh will work if the right deps are there (currently we lack 'nc')

also killall -9 java to be sure.

Fixes: https://tracker.ceph.com/issues/42496
Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Sat, 26 Oct 2019 16:42:31 +0000 (11:42 -0500)]

Merge PR #31086 into master

* refs/pull/31086/head:
doc/bootstrap.rst: fix github's url

Reviewed-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Sat, 26 Oct 2019 15:55:19 +0000 (10:55 -0500)]

Merge PR #31169 into master

* refs/pull/31169/head:
mgr/ssh: implement (synchronous) describe_service
ceph-daemon: ls: replace 'active' bool with 'state' enum
ceph-daemon: include container_id and version in 'ls' output
ceph-daemon: tolerate systemctl is-* exit codes
mgr/ssh: modernize timeout config option

Reviewed-by: Sebastian Wagner <swagner@suse.com>

commit | commitdiff | tree

Alexandre Bruyelles [Wed, 23 Oct 2019 13:31:38 +0000 (15:31 +0200)]

doc/bootstrap.rst: fix github's url

Signed-off-by: Alexandre Bruyelles <jack@jack.fr.eu.org>

commit | commitdiff | tree

Kefu Chai [Sat, 26 Oct 2019 01:30:54 +0000 (09:30 +0800)]

Merge pull request #31126 from wjwithagen/wjw-fix-std-bitset

rgw: Select the std::bitset to resolv ambiguity

Reviewed-by: Adam C. Emerson <aemerson@redhat.com>
Reviewed-by: Casey Bodley <cbodley@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Dan Mick [Fri, 25 Oct 2019 23:44:05 +0000 (16:44 -0700)]

src/telemetry: remove, now lives in ceph-telemetry.git

Signed-off-by: Dan Mick <dan.mick@redhat.com>

commit | commitdiff | tree

Sage Weil [Fri, 25 Oct 2019 17:50:50 +0000 (12:50 -0500)]

mgr/ssh: implement (synchronous) describe_service

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Fri, 25 Oct 2019 23:43:18 +0000 (18:43 -0500)]

ceph-daemon: ls: replace 'active' bool with 'state' enum

('running', 'inactive', 'error', 'unknown')

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Fri, 25 Oct 2019 22:50:26 +0000 (17:50 -0500)]

ceph-daemon: include container_id and version in 'ls' output

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Fri, 25 Oct 2019 21:46:09 +0000 (16:46 -0500)]

ceph-daemon: tolerate systemctl is-* exit codes

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Thu, 24 Oct 2019 13:21:39 +0000 (08:21 -0500)]

mgr/ssh: modernize timeout config option

(and rename it)

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Ilya Dryomov [Tue, 8 Oct 2019 18:12:30 +0000 (20:12 +0200)]

qa: add script to stress udev_enumerate_scan_devices()

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>

commit | commitdiff | tree

Ilya Dryomov [Thu, 24 Oct 2019 15:35:23 +0000 (17:35 +0200)]

krbd: retry on an empty list from udev_enumerate_scan_devices()

systemd 219 doesn't have the issue that is worked around in the
previous commit, but has a different one: udev_enumerate_scan_devices()
always succeeds, but sometimes returns an empty list when the device is
actually there.  This happens rarely and at random so I haven't been
able to get to the bottom of it yet, but it looks like another similar
race condition in libudev.

Since an empty list is expected if the device isn't there, retry just
twice with a small sleep in-between.  This appears to be enough: I got
7 occurrences per 600000 "rbd unmap" invocations, all of which needed
a single retry:

  rbd: udev enumerate missed a device, tries = 1

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>

commit | commitdiff | tree

Sage Weil [Fri, 25 Oct 2019 19:30:33 +0000 (14:30 -0500)]

Merge PR #31141 into master

* refs/pull/31141/head:
ceph-daemon: fix warning message
ceph-daemon: fix enable with conflicting ceph.target files
test_ceph_daemon.sh: use same fsid as qa/standalone/test_ceph_daemon.sh
test_ceph_daemon.sh: use other latest-master image
ceph-daemon: fix pathify (fix shell --config/--keyring)
ceph-daemon: /var/run/ceph -> /var/run/ceph/$fsid

Reviewed-by: Sebastian Wagner <swagner@suse.com>

commit | commitdiff | tree

Sage Weil [Fri, 25 Oct 2019 17:46:52 +0000 (12:46 -0500)]

ceph-daemon: fix warning message

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Josh Durgin [Fri, 25 Oct 2019 17:41:42 +0000 (10:41 -0700)]

Merge pull request #31164 from tchaikov/wip/doc/rados/fix-typo

doc/rados/deployment/ceph-deploy-mon: fix typo

Reviewed-by: Josh Durgin <jdurgin@redhat.com>

commit | commitdiff | tree

Sage Weil [Fri, 25 Oct 2019 16:26:32 +0000 (11:26 -0500)]

ceph-daemon: fix enable with conflicting ceph.target files

+ make description better

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Fri, 25 Oct 2019 16:02:28 +0000 (11:02 -0500)]

os/bluestore: do not mark per_pool_omap updated unless we fixed it

We only fix the per-pool-omap issues if we do a non-shallow fsck.

Fixes: https://tracker.ceph.com/issues/42490
Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Kefu Chai [Fri, 25 Oct 2019 14:38:11 +0000 (22:38 +0800)]

doc/rados/deployment/ceph-deploy-mon: fix typo

s/comingling/commingling/

Signed-off-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Kefu Chai [Fri, 25 Oct 2019 14:24:42 +0000 (22:24 +0800)]

Merge pull request #31113 from sungjunyoung/do_cmake-warning

do_cmake: Warn user about slow debug performance only for not set

Reviewed-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Sage Weil [Fri, 25 Oct 2019 14:18:20 +0000 (09:18 -0500)]

Merge PR #31130 into master

* refs/pull/31130/head:
ceph-daemon: only set up crash dir mount if it exists

Reviewed-by: Sebastian Wagner <swagner@suse.com>

commit | commitdiff | tree

Sage Weil [Fri, 25 Oct 2019 14:11:36 +0000 (09:11 -0500)]

Merge PR #31138 into master

* refs/pull/31138/head:
mon: fix tell to hybrid octopus/pre-octopus mons

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>

commit | commitdiff | tree

Boris Ranto [Fri, 25 Oct 2019 12:24:19 +0000 (14:24 +0200)]

restful: Use node_id for _gather_leaf_ids

The _gather_leaf_ids function doesn't need the node structure, it only
needs the id.

Signed-off-by: Boris Ranto <branto@redhat.com>

commit | commitdiff | tree

Boris Ranto [Thu, 24 Oct 2019 14:54:05 +0000 (16:54 +0200)]

restful: Query nodes_by_id for items

The node dict that is passed to the _gather_leaf_ids function from the
_gather_osds function does not have 'items' in it. We also can't use
buckets at this point since those only exist for leaf nodes, not all
nodes.

We need to query the nodes_by_id dict to get 'items' for a node inside
the _gather_leaf_ids function instead.

Signed-off-by: Boris Ranto <branto@redhat.com>

commit | commitdiff | tree

Ilya Dryomov [Mon, 7 Oct 2019 13:32:39 +0000 (15:32 +0200)]

krbd: retry on transient errors from udev_enumerate_scan_devices()

udev_enumerate_scan_devices() doesn't handle disappearing devices well.
If called while some devices are being removed, it sometimes propagates
ENOENT and ENODEV errors encountered operating on directory entries in
/sys that no longer exist.  Some of these errors are suppressed, but
this isn't reliable and varies across versions.  In particular, systemd
239 suppresses ENODEV from sd_device_new_from_syspath() but doesn't
suppress ENODEV from sd_device_get_devnum().  In systemd 243 the call
to sd_device_get_devnum() has been moved, but it still leaks ENOENT
from sd_device_get_is_initialized() (referring to the body of
FOREACH_DIRENT_ALL loop in enumerator_scan_dir_and_add_devices()).

Assume that all ENOENT and ENODEV errors are transient and retry the
call to udev_enumerate_scan_devices().  Don't limit the number, but log
each retry.

Fixes: https://tracker.ceph.com/issues/41036
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>

commit | commitdiff | tree

Sebastian Wagner [Wed, 16 Oct 2019 11:14:36 +0000 (13:14 +0200)]

python-common: Add mypy testing

Signed-off-by: Sebastian Wagner <sebastian.wagner@suse.com>

commit | commitdiff | tree

Jason Dillaman [Fri, 25 Oct 2019 12:34:25 +0000 (08:34 -0400)]

Merge pull request #31006 from zy751713126/fix_features

librbd: fix rbd_features_to_string output

Reviewed-by: Jason Dillaman <dillaman@redhat.com>

commit | commitdiff | tree

Jason Dillaman [Fri, 25 Oct 2019 12:33:51 +0000 (08:33 -0400)]

Merge pull request #30955 from runsisi/wip-fix-dup-lock

librbd: force reacquire lock if blacklist is disabled

Reviewed-by: Jason Dillaman <dillaman@redhat.com>

commit | commitdiff | tree

Jason Dillaman [Fri, 25 Oct 2019 12:33:12 +0000 (08:33 -0400)]

Merge pull request #30954 from mxdInspur/thickimg_create_progress

rbd: creating thick-provision image progress percent info exceeds 100%

Reviewed-by: Jason Dillaman <dillaman@redhat.com>

commit | commitdiff | tree

Kefu Chai [Fri, 25 Oct 2019 11:15:43 +0000 (19:15 +0800)]

Merge pull request #31143 from badone/wip-tracker-38466-enable-librabbitmq-devel

ceph.spec.in: enable amqp_endpoint on RHEL8 by default

Reviewed-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Kefu Chai [Fri, 25 Oct 2019 10:00:03 +0000 (18:00 +0800)]

Merge pull request #31110 from tchaikov/wip-cmake/enable-seastar-with-dpdk

cmake: support `Seastar_DPDK=ON` option

Reviewed-by: Casey Bodley <cbodley@redhat.com>
Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Kefu Chai [Fri, 25 Oct 2019 08:42:55 +0000 (16:42 +0800)]

auth/cephx: always initialize local variables

to silence GCC warnings like:

rc/auth/cephx/CephxProtocol.h:309:5: warning: 'type' may be used uninitialized in this function [-Wmaybe-uninitialized]
if (i != tickets_map.end())
^~

Signed-off-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Brad Hubbard [Fri, 25 Oct 2019 05:29:37 +0000 (15:29 +1000)]

ceph.spec.in: Enable amqp_endpoint on RHEL8 by default

RHEL/CentOS 8 now provide librabbitmq-devel so we can enable it as a
build requirement.

Fixes: https://tracker.ceph.com/issues/38466
Signed-off-by: Brad Hubbard <bhubbard@redhat.com>

commit | commitdiff | tree

Sage Weil [Fri, 25 Oct 2019 02:27:27 +0000 (21:27 -0500)]

test_ceph_daemon.sh: use same fsid as qa/standalone/test_ceph_daemon.sh

That way these scripts both clean up for each other (and are obviously
bogus clusters).

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Fri, 25 Oct 2019 02:26:21 +0000 (21:26 -0500)]

test_ceph_daemon.sh: use other latest-master image

This one is newer?

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Fri, 25 Oct 2019 02:25:59 +0000 (21:25 -0500)]

ceph-daemon: fix pathify (fix shell --config/--keyring)

The relative path works on one of my machines but not my laptop; the
full CWD works everywhere. This is presumably better anyway!

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Fri, 25 Oct 2019 02:03:34 +0000 (21:03 -0500)]

ceph-daemon: /var/run/ceph -> /var/run/ceph/$fsid

This is better than having a single /var/run/ceph on the host with a
weird naming scheme. Among other things, it means that we can access
the asok for any daemon for a given fsid from any container on the same
host with the same fsid (notably, a shell).

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Thu, 24 Oct 2019 23:56:04 +0000 (18:56 -0500)]

mon: fix tell to hybrid octopus/pre-octopus mons

We can't decide whether to use the new tell command style based on the
monmap.min_mon_release alone because some mons may be octopus even though
that hasn't updated yet. The same goes for if we look at the combined
features for the cluster--the underlying problem is the monmap doesn't
tell us which mons are octopus and which ones aren't, so we don't know
how to behave.

Instead, allow octopus+ mons to advertise the converted tell commands
going forward, for compatibility with pre-octopus clients (who do the old
style of tell) and for octopus+ clients talking to a min_mon_release <
octopus cluster.

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Thu, 24 Oct 2019 13:41:33 +0000 (08:41 -0500)]

ceph-daemon: only set up crash dir mount if it exists

Sometimes we run containers on a host that doesn't have a crash dir set
up (becuase no daemon has been deployed). Examples include shell and
ceph-volume.

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

David Zafman [Fri, 25 Oct 2019 00:23:48 +0000 (17:23 -0700)]

Merge pull request #31133 from dzafman/wip-42476

ceph-objectstore-tool: call collection_bits() crashes on the meta col…

Reviewed-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Kefu Chai [Thu, 24 Oct 2019 23:50:32 +0000 (07:50 +0800)]

Merge pull request #31135 from athanatos/sjust/wip-alien-echo-fix

test_alien_echo: update to use crimson:: namespace

Reviewed-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Sage Weil [Thu, 24 Oct 2019 23:29:52 +0000 (18:29 -0500)]

Merge PR #31121 into master

* refs/pull/31121/head:
mon/MonClient: fix mon tell to older mons

Reviewed-by: Neha Ojha <nojha@redhat.com>

commit | commitdiff | tree

Patrick Donnelly [Thu, 24 Oct 2019 22:17:48 +0000 (15:17 -0700)]

Merge PR #31095 into master

* refs/pull/31095/head:
qa: do not check pg count for new data_isolated volume

Reviewed-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Patrick Donnelly [Tue, 22 Oct 2019 23:04:57 +0000 (16:04 -0700)]

qa: disable too few PG warning during Mimic deploy

Mimic will raise this warning when we use 8 PGs for CephFS metadata/data
pools.

Fixes: fc88e6c6c55402120a432ea47f05f321ba4c9bb1
Fixes: https://tracker.ceph.com/issues/42434
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>

commit | commitdiff | tree

Samuel Just [Thu, 24 Oct 2019 20:18:29 +0000 (13:18 -0700)]

test/crimson/test_alien_echo: update to use crimson:: namespace

See: bd023211d999f1e2b70f48254507524d0763bd33
Signed-off-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Mark Nelson [Thu, 24 Oct 2019 19:50:11 +0000 (14:50 -0500)]

Merge pull request #30964 from markhpc/wip-bs-cache-trim-pinned

os/bluestore: Keep separate onode cache pinned list.

commit | commitdiff | tree

David Zafman [Thu, 24 Oct 2019 18:31:52 +0000 (11:31 -0700)]

ceph-objectstore-tool: call collection_bits() crashes on the meta collection

Skip new check for meta collection
test:
Turn off osd_pool_default_pg_autoscale_mode just like bash tests do
Fix test by checking for new error message

Caused by: f88b3534549bb2ede76e9f3e8b6b641b5886a9f4

Fixes: https://tracker.ceph.com/issues/42476
Signed-off-by: David Zafman <dzafman@redhat.com>

Unnamed repository; edit this file 'description' to name the repository.