git.apps.os.sepia.ceph.com Git

]> git.apps.os.sepia.ceph.com Git - ceph.git/log

Lenz Grimmer [Mon, 23 Mar 2020 12:11:50 +0000 (13:11 +0100)]

Merge pull request #34063 from s0nea/wip-dashboard-crush-rule-suite

mgr/dashboard: add crush rule test suite

Reviewed-by: Laura Paduano <lpaduano@suse.com>
Reviewed-by: Stephan Müller <smueller@suse.com>

commit | commitdiff | tree

Lenz Grimmer [Mon, 23 Mar 2020 12:07:17 +0000 (13:07 +0100)]

Merge pull request #34113 from s0nea/wip-dashboard-orch-docu-link

mgr/dashboard: correct Orchestrator documentation link

Reviewed-by: Laura Paduano <lpaduano@suse.com>
Reviewed-by: Sebastian Wagner <swagner@suse.com>

commit | commitdiff | tree

Tatjana Dehler [Mon, 23 Mar 2020 10:45:53 +0000 (11:45 +0100)]

mgr/dashboard: correct Orchestrator documentation link

Fixes: https://tracker.ceph.com/issues/44708
Signed-off-by: Tatjana Dehler <tdehler@suse.com>

commit | commitdiff | tree

Kefu Chai [Mon, 23 Mar 2020 09:10:40 +0000 (17:10 +0800)]

Merge pull request #34104 from tchaikov/crimson-admin-close

crimson/admin: do not reset connected_sock before closing

Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Kefu Chai [Mon, 23 Mar 2020 09:05:57 +0000 (17:05 +0800)]

Merge pull request #33909 from cyx1231st/wip-seastar-msgr-fix-reset

crimson: misc fixes for writes to multiple-osd cluster

Reviewed-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Yingxin Cheng [Tue, 17 Mar 2020 07:14:06 +0000 (15:14 +0800)]

crimson/net: add critical info logs to track and debug racing

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Yingxin Cheng [Mon, 16 Mar 2020 03:38:58 +0000 (11:38 +0800)]

crimson/net: fix incorrect SocketConnection::print()

The informaction about SocketConnection::side and
SocketConnection::ephemeral_port are not up-to-date in the log, because
they are not moved with Socket during connection replacement. They are
actually socket-level information.

Also take the chance to reorder Socket members.

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Yingxin Cheng [Sat, 14 Mar 2020 10:34:19 +0000 (18:34 +0800)]

crimson/osd: make send_heartbeat() atomic

The item in Heartbeat:peers could be removed/re-added during the
asynchronous operation.

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Kefu Chai [Mon, 23 Mar 2020 04:46:39 +0000 (12:46 +0800)]

Merge pull request #32171 from rosinL/wip-ec-isla-aarch64

erasure-code: enable isa-l EC for aarch64 platform

Reviewed-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Kefu Chai [Sat, 21 Mar 2020 12:18:50 +0000 (20:18 +0800)]

crimson/admin: do not reset connected_sock before closing

* no need to discard_result(). as `output_stream::close()` returns an
  empty future<> already
* free the connected socket after the background task finishes, because:

we should not free the connected socket before the promise referencing it is fulfilled.

otherwise we have error messages from ASan, like

==287182==ERROR: AddressSanitizer: heap-use-after-free on address 0x611000019aa0 at pc 0x55e2ae2de882 bp 0x7fff7e2bf080 sp 0x7fff7e2bf078
READ of size 8 at 0x611000019aa0 thread T0
    #0 0x55e2ae2de881 in seastar::reactor_backend_aio::await_events(int, __sigset_t const*) ../src/seastar/src/core/reactor_backend.cc:396
    #1 0x55e2ae2dfb59 in seastar::reactor_backend_aio::reap_kernel_completions() ../src/seastar/src/core/reactor_backend.cc:428
    #2 0x55e2adbea397 in seastar::reactor::reap_kernel_completions_pollfn::poll() (/var/ssd/ceph/build/bin/crimson-osd+0x155e9397)
    #3 0x55e2adaec6d0 in seastar::reactor::poll_once() ../src/seastar/src/core/reactor.cc:2789
    #4 0x55e2adae7cf7 in operator() ../src/seastar/src/core/reactor.cc:2687
    #5 0x55e2adb7c595 in __invoke_impl<bool, seastar::reactor::run()::<lambda()>&> /usr/include/c++/10/bits/invoke.h:60
    #6 0x55e2adb699b0 in __invoke_r<bool, seastar::reactor::run()::<lambda()>&> /usr/include/c++/10/bits/invoke.h:113
    #7 0x55e2adb50222 in _M_invoke /usr/include/c++/10/bits/std_function.h:291
    #8 0x55e2adc2ba00 in std::function<bool ()>::operator()() const /usr/include/c++/10/bits/std_function.h:622
    #9 0x55e2adaea491 in seastar::reactor::run() ../src/seastar/src/core/reactor.cc:2713
    #10 0x55e2ad98f1c7 in seastar::app_template::run_deprecated(int, char**, std::function<void ()>&&) ../src/seastar/src/core/app-template.cc:199
    #11 0x55e2a9e57538 in main ../src/crimson/osd/main.cc:148
    #12 0x7fae7f20de0a in __libc_start_main ../csu/libc-start.c:308
    #13 0x55e2a9d431e9 in _start (/var/ssd/ceph/build/bin/crimson-osd+0x117421e9)

0x611000019aa0 is located 96 bytes inside of 240-byte region [0x611000019a40,0x611000019b30)
freed by thread T0 here:
    #0 0x7fae80a4e487 in operator delete(void*, unsigned long) (/usr/lib/x86_64-linux-gnu/libasan.so.6+0xac487)
    #1 0x55e2ae302a0a in seastar::aio_pollable_fd_state::~aio_pollable_fd_state() ../src/seastar/src/core/reactor_backend.cc:458
    #2 0x55e2ae2e1059 in seastar::reactor_backend_aio::forget(seastar::pollable_fd_state&) ../src/seastar/src/core/reactor_backend.cc:524
    #3 0x55e2adab9b9a in seastar::pollable_fd_state::forget() ../src/seastar/src/core/reactor.cc:1396
    #4 0x55e2adab9d05 in seastar::intrusive_ptr_release(seastar::pollable_fd_state*) ../src/seastar/src/core/reactor.cc:1401
    #5 0x55e2ace1b72b in boost::intrusive_ptr<seastar::pollable_fd_state>::~intrusive_ptr() /opt/ceph/include/boost/smart_ptr/intrusive_ptr.hpp:98
    #6 0x55e2ace115a5 in seastar::pollable_fd::~pollable_fd() ../src/seastar/include/seastar/core/internal/pollable_fd.hh:109
    #7 0x55e2ae0ed35c in seastar::net::posix_server_socket_impl::~posix_server_socket_impl() ../src/seastar/include/seastar/net/posix-stack.hh:161
    #8 0x55e2ae0ed3cf in seastar::net::posix_server_socket_impl::~posix_server_socket_impl() ../src/seastar/include/seastar/net/posix-stack.hh:161
    #9 0x55e2ae0ed943 in std::default_delete<seastar::net::api_v2::server_socket_impl>::operator()(seastar::net::api_v2::server_socket_impl*) const /usr/include/c++/10/bits/unique_ptr.h:81
    #10 0x55e2ae0db357 in std::unique_ptr<seastar::net::api_v2::server_socket_impl, std::default_delete<seastar::net::api_v2::server_socket_impl> >::~unique_ptr()
/usr/include/c++/10/bits/unique_ptr.h:357    #11 0x55e2ae1438b7 in seastar::api_v2::server_socket::~server_socket() ../src/seastar/src/net/stack.cc:195
    #12 0x55e2aa1c7656 in std::_Optional_payload_base<seastar::api_v2::server_socket>::_M_destroy() /usr/include/c++/10/optional:260
    #13 0x55e2aa16c84b in std::_Optional_payload_base<seastar::api_v2::server_socket>::_M_reset() /usr/include/c++/10/optional:280
    #14 0x55e2ac24b2b7 in std::_Optional_base_impl<seastar::api_v2::server_socket, std::_Optional_base<seastar::api_v2::server_socket, false, false> >::_M_reset() /usr/include/c++/10/optional:432
    #15 0x55e2ac23f37b in std::optional<seastar::api_v2::server_socket>::reset() /usr/include/c++/10/optional:975
    #16 0x55e2ac21a2e7 in crimson::admin::AdminSocket::stop() ../src/crimson/admin/admin_socket.cc:265
    #17 0x55e2aa099825 in operator() ../src/crimson/osd/osd.cc:450
    #18 0x55e2aa0d4e3e in apply ../src/seastar/include/seastar/core/apply.hh:36

Signed-off-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Kefu Chai [Sun, 22 Mar 2020 02:41:23 +0000 (10:41 +0800)]

Merge pull request #34071 from badone/wip-docker-test-helper-use-podman-by-default

tests: Use podman if available

Reviewed-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Kefu Chai [Sun, 22 Mar 2020 02:40:16 +0000 (10:40 +0800)]

Merge pull request #34048 from tchaikov/wip-test-docker-fc31

tests: update Dockerfile to support fc-31

Reviewed-by: Brad Hubbard <bhubbard@redhat.com>

commit | commitdiff | tree

Brad Hubbard [Fri, 20 Mar 2020 04:15:14 +0000 (14:15 +1000)]

tests: Use podman if available

Signed-off-by: Brad Hubbard <bhubbard@redhat.com>

commit | commitdiff | tree

Kefu Chai [Sat, 21 Mar 2020 06:34:28 +0000 (14:34 +0800)]

Merge pull request #34066 from mgfritch/cephadm-mon-b-test

qa/workunits/cephadm/test_cephadm.sh: fix mon.b failure

Reviewed-by: Sebastian Wagner <sebastian.wagner@suse.com>
Reviewed-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Kefu Chai [Sat, 21 Mar 2020 06:32:15 +0000 (14:32 +0800)]

Merge pull request #34022 from ifed01/wip-ifed-fix-leak-in-expand

os/bluestore: fix extent leak after main device expand.

Reviewed-by: Adam Kupczyk <akucpzyk@redhat.com>

commit | commitdiff | tree

Kefu Chai [Sat, 21 Mar 2020 06:31:16 +0000 (14:31 +0800)]

Merge pull request #33883 from dragonylffly/wip-fix-comments

msg/async: fix log information

Reviewed-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Kefu Chai [Sat, 21 Mar 2020 06:30:23 +0000 (14:30 +0800)]

Merge pull request #33869 from mgfritch/cephadm-osd-create-test

qa/workunits/cephadm/test_cephadm.sh: move osd test to ceph-volume

Reviewed-by: Sebastian Wagner <sebastian.wagner@suse.com>

commit | commitdiff | tree

Kefu Chai [Sat, 21 Mar 2020 06:29:22 +0000 (14:29 +0800)]

Merge pull request #34097 from adamemerson/wip-boost-use-valgrind-fix

cmake: Don't enable BOOST_USE_VALGRIND when not requested

Reviewed-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Kefu Chai [Sat, 21 Mar 2020 06:27:51 +0000 (14:27 +0800)]

Merge pull request #34056 from xiexingguo/wip-44662

qa/*/osd-markdown.sh: propagate map to osd before testing its reaction

Reviewed-by: Neha Ojha <nojha@redhat.com>

commit | commitdiff | tree

Kefu Chai [Sat, 21 Mar 2020 02:33:16 +0000 (10:33 +0800)]

Merge pull request #33796 from adamemerson/wip-using-namespace-common

Build the target 'common' without relying on using namespace in headers

Reviewed-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Kefu Chai [Sat, 21 Mar 2020 02:30:08 +0000 (10:30 +0800)]

Merge pull request #33903 from tchaikov/wip-rados-object-locator

tools/rados: use object-locator in user-visible outputs

Reviewed-by: Neha Ojha <nojha@redhat.com>

commit | commitdiff | tree

Kefu Chai [Sat, 21 Mar 2020 01:27:09 +0000 (09:27 +0800)]

Merge pull request #33976 from tchaikov/wip-build-doc-on-darwin

admin/build-doc, pybind/*/setup.py: support Darwin

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>

commit | commitdiff | tree

Xie Xingguo [Sat, 21 Mar 2020 01:01:29 +0000 (09:01 +0800)]

Merge pull request #34070 from bangmingcheng/wip-doc-ceph-chenbm

doc: fix a spelling error at /doc/radosgw/dynamicresharding.rst

Reviewed-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Adam C. Emerson [Fri, 20 Mar 2020 18:10:14 +0000 (14:10 -0400)]

cmake: Don't enable BOOST_USE_VALGRIND when not requested

We were adding the define without support in the library if
WITH_BOOST_VALGRIND was turned off.

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Fri, 20 Mar 2020 15:23:14 +0000 (08:23 -0700)]

Merge pull request #33841 from xiexingguo/wip-44507

osd/PeeringState: fix pending want_acting vs osd offline race

Reviewed-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Sebastian Wagner [Fri, 20 Mar 2020 15:07:01 +0000 (16:07 +0100)]

Merge pull request #32319 from jan--f/pybind-mgr-util-cephfs-con-pool

mgr_util: add CephfsClient implementation

Reviewed-by: Sebastian Wagner <sebastian.wagner@suse.com>
Reviewed-by: Venky Shankar vshankar@redhat.com

commit | commitdiff | tree

Yingxin Cheng [Sat, 14 Mar 2020 10:32:54 +0000 (18:32 +0800)]

crimson/osd: log Heartbeat::remove_peer before assertion failure

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Avan [Fri, 20 Mar 2020 14:32:52 +0000 (20:02 +0530)]

Merge pull request #33529 from rhcs-dashboard/wip-42979-fix-hover-issues

mgr/dashboard: add popover list of Stand-by Managers & Metadata Servers (MDS) in landing page

commit | commitdiff | tree

Kefu Chai [Fri, 20 Mar 2020 14:17:46 +0000 (22:17 +0800)]

Merge pull request #34043 from adamemerson/wip-boost-valgrind

cmake: Add WITH_BOOST_VALGRIND option

Reviewed-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Michael Fritch [Wed, 11 Mar 2020 00:40:07 +0000 (18:40 -0600)]

qa/workunits/cephadm/test_cephadm.sh: move osd test to ceph-volume

instead of via the orchestrator

Signed-off-by: Michael Fritch <mfritch@suse.com>

commit | commitdiff | tree

Michael Fritch [Wed, 11 Mar 2020 00:38:29 +0000 (18:38 -0600)]

cephadm: allow config and keyring args for ceph-volume

easier to script via the cephadm tests

Signed-off-by: Michael Fritch <mfritch@suse.com>

commit | commitdiff | tree

Sebastian Wagner [Fri, 20 Mar 2020 12:13:00 +0000 (13:13 +0100)]

Merge pull request #34080 from sebastian-philipp/doc-osd-create-osd-apply

doc/cephadm: replace `osd create` with `apply osd`

Reviewed-by: Nathan Cutler <ncutler@suse.com>

commit | commitdiff | tree

Sebastian Wagner [Fri, 20 Mar 2020 11:33:27 +0000 (12:33 +0100)]

doc/cephadm: replace `osd create` with `apply osd`

Fixes: https://tracker.ceph.com/issues/44692
Signed-off-by: Sebastian Wagner <sebastian.wagner@suse.com>

commit | commitdiff | tree

Kefu Chai [Thu, 19 Mar 2020 03:21:48 +0000 (11:21 +0800)]

tests: update Dockerfile to support fc-31

Signed-off-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Yingxin Cheng [Fri, 13 Mar 2020 08:58:31 +0000 (16:58 +0800)]

crimson/net: allow mark_down() inside ms_handle_reset()

Although it is not necessary to mark_down the connection in its
ms_handle_reset() event, but it can be more convenient to allow it.
And Heartbeat already encounters this assertion failure.

So move the assertion to close_clean() which will help identify problems
if we happen to make ms_handle_reset() wait for messenger shutdown.

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Yingxin Cheng [Fri, 13 Mar 2020 06:22:40 +0000 (14:22 +0800)]

crimson/net: change close() to mark_down()

* be explicit that mark_down() won't trigger reset event;
* return void so no deadlock is possible and memory is still safe
guarded by Messenger::shutdown();
* related changes in crimson/osd;

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Yingxin Cheng [Thu, 12 Mar 2020 15:38:22 +0000 (23:38 +0800)]

crimson/net: cleanup dispatches with connection gate

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Yingxin Cheng [Thu, 12 Mar 2020 07:59:53 +0000 (15:59 +0800)]

crimson/net: enforce strict reset/accept order

When a new connection tries to replace the old one, the event order
should be like:
1. reset(old);
2. accept(new);

This means we cannot just reschedule the reset event asynchronously. And
we still need to make sure the internal state is integral when reset.

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Yingxin Cheng [Thu, 12 Mar 2020 06:28:56 +0000 (14:28 +0800)]

crimson/net: fix incorrect reset events according to async-msgr

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Yingxin Cheng [Thu, 12 Mar 2020 04:45:38 +0000 (12:45 +0800)]

crimson/net: close() with ms_handle_reset()

* ms_handle_reset() should not be able to contaminate the internal
atomic messenger status, so make it an asynchronous event along
with close();

* add is_closed_clean() for messenger unit test, because the reset event
now happens after connection closed.

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Adam C. Emerson [Wed, 18 Mar 2020 20:26:30 +0000 (16:26 -0400)]

build: Add WITH_BOOST_VALGRIND option

To build Boost.Context (and other libraries) with support to allow
them to be valground usefully, and to include the define to link
against them.

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>

commit | commitdiff | tree

bangmingcheng [Fri, 20 Mar 2020 02:23:01 +0000 (10:23 +0800)]

doc: fix a spelling error at /doc/radosgw/dynamicresharding.rst

Signed-off-by: bangmingcheng <bangmingcheng@gmail.com>

commit | commitdiff | tree

Michael Fritch [Thu, 19 Mar 2020 19:43:56 +0000 (13:43 -0600)]

qa/workunits/cephadm/test_cephadm.sh: fix mon.b failure

and is_available check to ensure quorum

Signed-off-by: Michael Fritch <mfritch@suse.com>

commit | commitdiff | tree

Sage Weil [Thu, 19 Mar 2020 16:07:16 +0000 (11:07 -0500)]

Merge PR #34060 into master

* refs/pull/34060/head:
Merge PR #34027 into octopus
Merge PR #34045 into octopus
Merge pull request #34035 from dillaman/wip-rbd-permissions
mgr/progress: fix duration strings
Merge PR #34014 into octopus
Merge PR #34001 into octopus
Merge PR #34011 into octopus
qa/workunits/rbd: use context managers to control Rados lifespan
Merge pull request #34032 from dillaman/wip-rbd-octopus-docs
doc/releases/octopus: add additional RBD improvements
qa/workunits/cephadm/test_cephadm: mark services unmanaged for test
mgr/cephadm: do not reconfig unmanaged services
Merge PR #33981 into octopus
Merge pull request #34018 from ajarr/octopus-subvolume-clone-cancel
qa/workunits/cephadm/test_cephadm: output file for pub key
Merge PR #33866 into octopus
Merge PR #34005 into octopus
Merge PR #34013 into octopus
mgr/cephadm: pytest: Enable SpecStore
mgr/orchestrator: add test for default implementation for apply()
python-common: validate ServiceSpec.service_type
fixup mgr/cephadm: Fix ceph orch apply -i
mgr/dashbaord: orchestrator service: Revert wait_api_result to a single completion
mgr/orchestrator: `orch daemon add` accepts a yaml
mgr/cephadm: apply_drivegroups() returns a single Completion
mgr/cephadm: remove `trivial_result()`
mgr/cephadm: Fix `ceph orch apply -i`
Merge pull request #33994 from dillaman/wip-librbd-poll-event-race
doc: document `clone cancel` command
test: add `clone cancel` tests
mgr/volumes: introduce "clone cancel" volume command
mgr/volumes: allow canceling a single asynchronous job for a volume
mgr/volumes: helper for looking up a clone entry index
mgr/volumes: periodically check if clone operations should be canceled
mgr/volumes: periodically check if copy operations should be canceled
mgr/volumes: introduce 'canceled' state in clone op state machine
qa/suites/rados/verify/validater/valgrind: tolerate SLOW_OPS
qa/suites/rados/verify/validater/valgrind: less bluestore logging
qa/suites/rados/verify/validater: increase heartbeat grace
Revert "qa/suites/rados/verify: debug_ms = 1, osd_heartbeat_grace = 60"
Revert "qa/suites/rados/verify/validator/valgrind: debug refs = 5"
ceph_test_watch_notify: try notify 10x if ALLOW_TIMEOUTS is set
ceph_test_rados_api_misc: ShutdownRace timeout if ALLOW_TIMEOUTS is set
qa/suites/rados/verify: set ALLOW_TIMEOUTS for workunits
doc/install: edits
doc/cephadm: more edits
doc/cephadm/install: edits
doc/cephadm/adoption: improvements
doc/cephadm/install: a few edits
doc/cephadm/install: do not install ceph-common on host (by default)
doc/cephadm: drop os recs link
doc/cephadm/upgrade: improvements
doc/cephadm/upgrade: document upgrade
doc/cephadm/install: revamp install docs
doc: reorganize cephadm docs
doc/cephadm/administration: update docs on customizing SSH config
doc/cephadm/administration: add a note about the 'removed' dir
mgr/balancer: tolerate pgs outside of target weight map
qa/workunits/cephadm/test_cephadm: --skip-monitoring-stack
Merge PR #33974 into octopus
Merge PR #33442 into octopus
Merge PR #33997 into octopus
Merge PR #34000 into octopus
use quay octopus tip until 15.2 tag is available
python-common: reduce output of ServiceSpec.to_json()
python-common,mgr/cephadm: move assert_valid_host to service_spec
mgr/cephadm: add HostAssignment.validate()
mgr/dashboard: adapt create_osds interface change
mon/MgrMonitor: make 'mgr fail' work with no arguments
cephadm: add allow_ptrace option to enable SYS_PTRACE
update default container images
mgr/cephadm: limit number of times check host is performed in the serve loop
Merge PR #33961 into octopus
Merge PR #33952 into octopus
Merge PR #33990 into octopus
Merge PR #33955 into octopus
Merge PR #33936 into octopus
mgr/orch: add --all-available-devices to 'orch apply osd'
qa/workunits/cephadm: --skip-mon-network when using 127.0.0.1
cephadm: add tests
qa/tasks/cephadm: pass -v to bootstrap
mgr/cephadm: only try to place mons on hosts matching public_network
mgr/cephadm: keep track of host networks, ips
cephadm: automatically infer mon public_network, if we can
cephadm: add list-networks command
cephadm: bootstrap: deploy monitoring stack by default
librbd: defer event socket completion until after callback issued
cephadm: add-repo: add --version
mgr/cephadm: respect 'unmanaged' flag in spec
mgr/orch: orch ls: show <no spec> or <unmanaged> as appropriate
mgr/orch: orch ls: rename SPEC -> PLACEMENT
mgr/orch: add 'unmanaged' property to ServiceSpec
cephadm: rename distro args in repo methods
mgr/orch: combine 'orch daemon add <type> ...' into one command
mgr/orch: combine 'orch apply <type> [<placement>]' into one command

Reviewed-by: Laura Paduano <lpaduano@suse.com>

commit | commitdiff | tree

Tatjana Dehler [Thu, 19 Mar 2020 15:33:22 +0000 (16:33 +0100)]

mgr/dashboard: add crush rule test suite

https://tracker.ceph.com/issues/44679
Signed-off-by: Tatjana Dehler <tdehler@suse.com>

commit | commitdiff | tree

Lenz Grimmer [Thu, 19 Mar 2020 14:45:09 +0000 (15:45 +0100)]

Merge pull request #33925 from tspmelo/wip-update-npm

mgr/dashboard: Update Angular and fix npm audit

Reviewed-by: Kiefer Chang <kiefer.chang@suse.com>

commit | commitdiff | tree

Sage Weil [Thu, 19 Mar 2020 13:22:40 +0000 (08:22 -0500)]

Merge PR #34027 into octopus

* refs/pull/34027/head:
qa/workunits/cephadm/test_cephadm: mark services unmanaged for test
mgr/cephadm: do not reconfig unmanaged services
qa/workunits/cephadm/test_cephadm: output file for pub key

Reviewed-by: Sebastian Wagner <swagner@suse.com>

commit | commitdiff | tree

Sage Weil [Thu, 19 Mar 2020 13:07:05 +0000 (08:07 -0500)]

Merge PR #34045 into octopus

* refs/pull/34045/head:
mgr/progress: fix duration strings

Reviewed-by: xie xingguo <xie.xingguo@zte.com.cn>

commit | commitdiff | tree

Sage Weil [Thu, 19 Mar 2020 13:02:06 +0000 (08:02 -0500)]

Merge PR #34047 into master

* refs/pull/34047/head:
qa/standalone/osd/osd-backfill-stats.sh: get_latest_osdmap to propagate map change

Reviewed-by: xie xingguo <xie.xingguo@zte.com.cn>
Reviewed-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Kefu Chai [Thu, 19 Mar 2020 10:32:44 +0000 (18:32 +0800)]

Merge pull request #34017 from xxhdx1985126/wip-crimson-osd-report-failure

crimson/os/heartbeat: make Heartbeat::send_failures() safe

Reviewed-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

xie xingguo [Thu, 19 Mar 2020 09:19:46 +0000 (17:19 +0800)]

qa/*/osd-markdown.sh: propagate map to osd before testing its reaction

Mon might fail to share the newest map with any of up osds, e.g.,
due to an injected broken pipe. Since we don't have any client
activities during the osd-markdown tests, osds might be unaware of
the map changes made through CLI. Make sure osds have pulled the
newest map down before we can test its reaction correctly.

Fixes: https://tracker.ceph.com/issues/44662
Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>

commit | commitdiff | tree

Mykola Golub [Thu, 19 Mar 2020 08:43:57 +0000 (10:43 +0200)]

Merge pull request #34035 from dillaman/wip-rbd-permissions

qa/workunits/rbd: use context managers to control Rados lifespan

Reviewed-by: Mykola Golub <mgolub@suse.com>

commit | commitdiff | tree

Xuehan Xu [Wed, 18 Mar 2020 02:32:02 +0000 (10:32 +0800)]

crimson/os/heartbeat: make Heartbeat::send_failures() safe

Currently, Heartbeat::send_failures() invokes monc.send_message() in a
continuation which may be run asynchronously, risking involving a daggling
"monc" reference when OSD shuts down and MonClient is destroyed.

Signed-off-by: Xuehan Xu <xxhdx1985126@163.com>

commit | commitdiff | tree

Kefu Chai [Thu, 19 Mar 2020 02:10:28 +0000 (10:10 +0800)]

Merge pull request #34016 from tchaikov/wip-44658

ceph.spec.in: add build deps for Debug build of seastar

Reviewed-by: Greg Farnum <gfarnum@redhat.com>

commit | commitdiff | tree

Sage Weil [Thu, 19 Mar 2020 00:04:14 +0000 (19:04 -0500)]

mgr/progress: fix duration strings

- simplify the code to just calculate the durations when we need them
(I'm not sure why we had those temporary strings!)
- use a nicer time delta format

Fixes: https://tracker.ceph.com/issues/44672
Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Neha [Wed, 18 Mar 2020 22:57:41 +0000 (22:57 +0000)]

qa/standalone/osd/osd-backfill-stats.sh: get_latest_osdmap to propagate map change

Fixes: https://tracker.ceph.com/issues/44518
Signed-off-by: Neha Ojha <nojha@redhat.com>

commit | commitdiff | tree

Sage Weil [Wed, 18 Mar 2020 22:42:48 +0000 (17:42 -0500)]

Merge PR #34014 into octopus

* refs/pull/34014/head:
mgr/balancer: tolerate pgs outside of target weight map

Reviewed-by: xie xingguo <xie.xingguo@zte.com.cn>

commit | commitdiff | tree

Sage Weil [Wed, 18 Mar 2020 22:42:10 +0000 (17:42 -0500)]

Merge PR #34001 into octopus

* refs/pull/34001/head:
mgr/cephadm: pytest: Enable SpecStore
mgr/orchestrator: add test for default implementation for apply()
python-common: validate ServiceSpec.service_type
fixup mgr/cephadm: Fix ceph orch apply -i
mgr/dashbaord: orchestrator service: Revert wait_api_result to a single completion
mgr/orchestrator: `orch daemon add` accepts a yaml
mgr/cephadm: apply_drivegroups() returns a single Completion
mgr/cephadm: remove `trivial_result()`
mgr/cephadm: Fix `ceph orch apply -i`

Reviewed-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Tiago Melo [Thu, 12 Mar 2020 14:34:56 +0000 (13:34 -0100)]

mgr/dashboard: Update Angular and fix npm audits

This fixes a new audit problem found in older versions of "acorn"
and updates Angular to latest v8 version.

Fixes: https://tracker.ceph.com/issues/44580
Signed-off-by: Tiago Melo <tmelo@suse.com>

commit | commitdiff | tree

Sage Weil [Wed, 18 Mar 2020 17:12:54 +0000 (12:12 -0500)]

Merge PR #34011 into octopus

* refs/pull/34011/head:
qa/suites/rados/verify/validater/valgrind: tolerate SLOW_OPS
qa/suites/rados/verify/validater/valgrind: less bluestore logging
qa/suites/rados/verify/validater: increase heartbeat grace
Revert "qa/suites/rados/verify: debug_ms = 1, osd_heartbeat_grace = 60"
Revert "qa/suites/rados/verify/validator/valgrind: debug refs = 5"
ceph_test_watch_notify: try notify 10x if ALLOW_TIMEOUTS is set
ceph_test_rados_api_misc: ShutdownRace timeout if ALLOW_TIMEOUTS is set
qa/suites/rados/verify: set ALLOW_TIMEOUTS for workunits

Reviewed-by: Brad Hubbard <bhubbard@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>

commit | commitdiff | tree

Jason Dillaman [Wed, 18 Mar 2020 16:54:16 +0000 (12:54 -0400)]

qa/workunits/rbd: use context managers to control Rados lifespan

There is a potential race between the expected exceptions being
thrown and Python shutting down racing with librados background
threads. Ensure that librados is properly shut down prior to
exiting Python.

Signed-off-by: Jason Dillaman <dillaman@redhat.com>

commit | commitdiff | tree

Mykola Golub [Wed, 18 Mar 2020 16:45:43 +0000 (18:45 +0200)]

Merge pull request #34032 from dillaman/wip-rbd-octopus-docs

doc/releases/octopus: add additional RBD improvements

Reviewed-by: Mykola Golub <mgolub@suse.com>

commit | commitdiff | tree

Jason Dillaman [Wed, 18 Mar 2020 16:16:28 +0000 (12:16 -0400)]

doc/releases/octopus: add additional RBD improvements

Signed-off-by: Jason Dillaman <dillaman@redhat.com>

commit | commitdiff | tree

Sage Weil [Wed, 18 Mar 2020 15:12:59 +0000 (10:12 -0500)]

qa/workunits/cephadm/test_cephadm: mark services unmanaged for test

We are deploying containers manually. Mark them unmanaged so that we
do not fight against mgr/cephadm cleaning up orphan daemons.

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Wed, 18 Mar 2020 15:09:00 +0000 (10:09 -0500)]

mgr/cephadm: do not reconfig unmanaged services

Fixes: https://tracker.ceph.com/issues/44615
Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Wed, 18 Mar 2020 14:45:16 +0000 (09:45 -0500)]

Merge PR #33981 into octopus

* refs/pull/33981/head:
doc/install: edits
doc/cephadm: more edits
doc/cephadm/install: edits
doc/cephadm/adoption: improvements
doc/cephadm/install: a few edits
doc/cephadm/install: do not install ceph-common on host (by default)
doc/cephadm: drop os recs link
doc/cephadm/upgrade: improvements
doc/cephadm/upgrade: document upgrade
doc/cephadm/install: revamp install docs
doc: reorganize cephadm docs
doc/cephadm/administration: update docs on customizing SSH config
doc/cephadm/administration: add a note about the 'removed' dir

Reviewed-by: Sebastian Wagner <swagner@suse.com>

commit | commitdiff | tree

Ramana Raja [Wed, 18 Mar 2020 13:52:06 +0000 (19:22 +0530)]

Merge pull request #34018 from ajarr/octopus-subvolume-clone-cancel

mgr/volumes: add FS subvolume clone cancel

Reviewed-by: Ramana Raja <rraja@redhat.com>

commit | commitdiff | tree

Sage Weil [Wed, 18 Mar 2020 13:44:10 +0000 (08:44 -0500)]

qa/workunits/cephadm/test_cephadm: output file for pub key

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Wed, 18 Mar 2020 13:14:19 +0000 (08:14 -0500)]

Merge PR #33866 into octopus

* refs/pull/33866/head:
mgr/cephadm: limit number of times check host is performed in the serve loop

Reviewed-by: Sebastian Wagner <swagner@suse.com>

commit | commitdiff | tree

Sage Weil [Wed, 18 Mar 2020 13:14:06 +0000 (08:14 -0500)]

Merge PR #34005 into octopus

* refs/pull/34005/head:
python-common: reduce output of ServiceSpec.to_json()
python-common,mgr/cephadm: move assert_valid_host to service_spec
mgr/cephadm: add HostAssignment.validate()

Reviewed-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Wed, 18 Mar 2020 13:13:56 +0000 (08:13 -0500)]

Merge PR #34013 into octopus

* refs/pull/34013/head:
qa/workunits/cephadm/test_cephadm: --skip-monitoring-stack

Reviewed-by: Michael Fritch <mfritch@suse.com>

commit | commitdiff | tree

Igor Fedotov [Wed, 18 Mar 2020 11:35:55 +0000 (14:35 +0300)]

os/bluestore: fix extent leak after main device expand.

To reproduce the issue one can expand device of 3,147,480,064 bytes to
4,147,480,064 using bluefs-bdev-expand command.
Subsequent fsck will detect the leakage.

Signed-off-by: Igor Fedotov <ifedotov@suse.com>

commit | commitdiff | tree

Sebastian Wagner [Wed, 18 Mar 2020 12:18:53 +0000 (13:18 +0100)]

mgr/cephadm: pytest: Enable SpecStore

Signed-off-by: Sebastian Wagner <sebastian.wagner@suse.com>

commit | commitdiff | tree

Sebastian Wagner [Wed, 18 Mar 2020 12:02:12 +0000 (13:02 +0100)]

mgr/orchestrator: add test for default implementation for apply()

Signed-off-by: Sebastian Wagner <sebastian.wagner@suse.com>

commit | commitdiff | tree

Sebastian Wagner [Wed, 18 Mar 2020 10:25:50 +0000 (11:25 +0100)]

python-common: validate ServiceSpec.service_type

Signed-off-by: Sebastian Wagner <sebastian.wagner@suse.com>

commit | commitdiff | tree

Sebastian Wagner [Wed, 18 Mar 2020 10:25:11 +0000 (11:25 +0100)]

fixup mgr/cephadm: Fix ceph orch apply -i

Signed-off-by: Sebastian Wagner <sebastian.wagner@suse.com>

commit | commitdiff | tree

Jan Fajerski [Wed, 18 Mar 2020 12:04:16 +0000 (13:04 +0100)]

Merge pull request #33787 from jan--f/c-v-prepare-slots

ceph-volume: prepare: use *-slots arguments for implicit sizing

commit | commitdiff | tree

Lenz Grimmer [Wed, 18 Mar 2020 11:24:00 +0000 (12:24 +0100)]

Merge pull request #33760 from tspmelo/wip-update-prettier

mgr/dashboard: Update prettier

Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Stephan Müller <smueller@suse.com>

commit | commitdiff | tree

Sebastian Wagner [Wed, 18 Mar 2020 09:52:38 +0000 (10:52 +0100)]

mgr/dashbaord: orchestrator service: Revert wait_api_result to a single completion

Signed-off-by: Sebastian Wagner <sebastian.wagner@suse.com>

commit | commitdiff | tree

Sebastian Wagner [Tue, 17 Mar 2020 11:11:36 +0000 (12:11 +0100)]

mgr/orchestrator: `orch daemon add` accepts a yaml

Fixes: https://tracker.ceph.com/issues/44622
Signed-off-by: Sebastian Wagner <sebastian.wagner@suse.com>

commit | commitdiff | tree

Sebastian Wagner [Tue, 17 Mar 2020 10:52:54 +0000 (11:52 +0100)]

mgr/cephadm: apply_drivegroups() returns a single Completion

Signed-off-by: Sebastian Wagner <sebastian.wagner@suse.com>

commit | commitdiff | tree

Sebastian Wagner [Tue, 17 Mar 2020 10:44:02 +0000 (11:44 +0100)]

mgr/cephadm: remove `trivial_result()`

Replaced with `@trivial_completion`, otherwise
`self._apply()` would have needed to accept a list
of specs.

Signed-off-by: Sebastian Wagner <sebastian.wagner@suse.com>

commit | commitdiff | tree

Sebastian Wagner [Tue, 17 Mar 2020 10:33:20 +0000 (11:33 +0100)]

mgr/cephadm: Fix `ceph orch apply -i`

* We had two implementations for `ceph orcha apply`
* Remove yaml pasing in `mgr/cephadm`

Signed-off-by: Sebastian Wagner <sebastian.wagner@suse.com>

commit | commitdiff | tree

Kefu Chai [Wed, 18 Mar 2020 08:27:30 +0000 (16:27 +0800)]

Merge pull request #34010 from grinapo/patch-1

doc/rbd: s/use use/use/

Reviewed-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Mykola Golub [Wed, 18 Mar 2020 08:08:25 +0000 (10:08 +0200)]

Merge pull request #33994 from dillaman/wip-librbd-poll-event-race

librbd: defer event socket completion until after callback issued

Reviewed-by: Dehao Shang <dehao.shang@intel.com>
Reviewed-by: Mykola Golub <mgolub@suse.com>

commit | commitdiff | tree

Peter Gervai [Tue, 17 Mar 2020 17:00:51 +0000 (18:00 +0100)]

doc/rbd: s/use use/use/

just a typo, really.

Signed-off-by: Peter Gervai <grin@grin.hu>

commit | commitdiff | tree

Venky Shankar [Tue, 25 Feb 2020 07:29:54 +0000 (02:29 -0500)]

doc: document `clone cancel` command

Fixes: http://tracker.ceph.com/issues/44208
Signed-off-by: Venky Shankar <vshankar@redhat.com>

commit | commitdiff | tree

Venky Shankar [Tue, 4 Feb 2020 10:02:56 +0000 (05:02 -0500)]

test: add `clone cancel` tests

Signed-off-by: Venky Shankar <vshankar@redhat.com>

commit | commitdiff | tree

Venky Shankar [Tue, 14 Jan 2020 09:20:50 +0000 (04:20 -0500)]

mgr/volumes: introduce "clone cancel" volume command

Signed-off-by: Venky Shankar <vshankar@redhat.com>

commit | commitdiff | tree

Venky Shankar [Tue, 14 Jan 2020 09:19:42 +0000 (04:19 -0500)]

mgr/volumes: allow canceling a single asynchronous job for a volume

Signed-off-by: Venky Shankar <vshankar@redhat.com>

commit | commitdiff | tree

Venky Shankar [Tue, 14 Jan 2020 09:10:22 +0000 (04:10 -0500)]

mgr/volumes: helper for looking up a clone entry index

... this would be used when canceling a clone operation to
find the clone index for a given clone.

Signed-off-by: Venky Shankar <vshankar@redhat.com>

commit | commitdiff | tree

Venky Shankar [Wed, 8 Jan 2020 06:07:03 +0000 (01:07 -0500)]

mgr/volumes: periodically check if clone operations should be canceled

Signed-off-by: Venky Shankar <vshankar@redhat.com>

commit | commitdiff | tree

Venky Shankar [Wed, 8 Jan 2020 06:00:20 +0000 (01:00 -0500)]

mgr/volumes: periodically check if copy operations should be canceled

Signed-off-by: Venky Shankar <vshankar@redhat.com>

commit | commitdiff | tree

Venky Shankar [Tue, 14 Jan 2020 09:13:16 +0000 (04:13 -0500)]

mgr/volumes: introduce 'canceled' state in clone op state machine

When fetching the next execution state, -EINTR jumps to 'canceled'
state signifying a canceled (interrupted) operation. Also include
a helper routine to check if a given state machine is in initial
state.

Signed-off-by: Venky Shankar <vshankar@redhat.com>

commit | commitdiff | tree

Kefu Chai [Wed, 18 Mar 2020 02:29:54 +0000 (10:29 +0800)]

ceph.spec.in: add build deps for Debug build of seastar

it's a regression introduced by
6158bcfdef91cc2930c57ff2bbe2bfae37da7363, which dropped the change to
make Sanitizers optional

since we've switched from xenial to bionic. there is no need to disable
this anymore. we ran into an issue caused by the ancient linker shipped
by xenial before.

Fixes: https://tracker.ceph.com/issues/44658
Signed-off-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Sage Weil [Tue, 17 Mar 2020 23:10:37 +0000 (18:10 -0500)]

qa/suites/rados/verify/validater/valgrind: tolerate SLOW_OPS

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Tue, 17 Mar 2020 21:36:46 +0000 (16:36 -0500)]

qa/suites/rados/verify/validater/valgrind: less bluestore logging

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Tue, 17 Mar 2020 21:35:21 +0000 (16:35 -0500)]

qa/suites/rados/verify/validater: increase heartbeat grace

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Tue, 17 Mar 2020 21:34:49 +0000 (16:34 -0500)]

Revert "qa/suites/rados/verify: debug_ms = 1, osd_heartbeat_grace = 60"

This reverts commit 4f742f200df6c91db87bfee1109c37fad3c0548b.

This was in the wrong file.. see valgrind.yaml

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Tue, 17 Mar 2020 21:34:08 +0000 (16:34 -0500)]

Revert "qa/suites/rados/verify/validator/valgrind: debug refs = 5"

This reverts commit 65e81e6eb4f136bf21b67e5de10ab49f028f9e95.

This slows things down too much with valgrind.

Signed-off-by: Sage Weil <sage@redhat.com>

Unnamed repository; edit this file 'description' to name the repository.

RSS Atom