git.apps.os.sepia.ceph.com Git - ceph.git/log

Sage Weil [Tue, 3 Mar 2020 15:05:12 +0000 (09:05 -0600)]

Merge PR #33679 into master

* refs/pull/33679/head:
cephadm: fix failure when getting keyring for deploying daemons

Reviewed-by: Sage Weil <sage@redhat.com>

Mykola Golub [Tue, 3 Mar 2020 13:19:55 +0000 (15:19 +0200)]

Merge pull request #33588 from dillaman/wip-rbd-snapshot-mirror

rbd-mirror: initial end-to-end test and associated bug fixes

Reviewed-by: Mykola Golub <mgolub@suse.com>

Ramana Raja [Tue, 3 Mar 2020 10:43:34 +0000 (16:13 +0530)]

Merge pull request #33120 from vshankar/wip-mgr-volumes-clone-sync-attrs

mgr/volumes: sync inode attributes for cloned subvolumes

Reviewed-by: Ramana Raja <rraja@redhat.com>

Lenz Grimmer [Tue, 3 Mar 2020 09:30:16 +0000 (09:30 +0000)]

Merge pull request #32423 from bk201/wip-43420

mgr/dashboard: allow refreshing inventory page

Reviewed-by: Tiago Melo <tmelo@suse.com>
Reviewed-by: Volker Theile <vtheile@suse.com>

Kefu Chai [Tue, 3 Mar 2020 08:23:23 +0000 (16:23 +0800)]

Merge pull request #33680 from tchaikov/wip-crimson-cleanups

crimson: clang related cleanups

Reviewed-by: Ronen Friedman <rfriedma@redhat.com>

Kefu Chai [Tue, 3 Mar 2020 06:34:22 +0000 (14:34 +0800)]

Merge pull request #31910 from rosinL/wip-fix-dpdk-test

test/msgr: fix ComplexTest fail when using DPDK protocol stack

Reviewed-by: Kefu Chai <kchai@redhat.com>

Kefu Chai [Tue, 3 Mar 2020 05:52:41 +0000 (13:52 +0800)]

Merge pull request #33661 from bk203/patch-1

doc: update Zabbix template reference

Reviewed-by: Kefu Chai <kchai@redhat.com>

Kefu Chai [Tue, 3 Mar 2020 03:46:03 +0000 (11:46 +0800)]

cmake: build used source

to address the link failures when compiling using clang

Signed-off-by: Kefu Chai <kchai@redhat.com>

Kefu Chai [Tue, 3 Mar 2020 03:10:00 +0000 (11:10 +0800)]

crimson: do not capture unused variable

this silences the warning of:
```
../src/crimson/osd/osdmap_gate.cc:48:38: warning: lambda capture 'this' is not used [-Wunused-lambda-capture]
  std::for_each(first, last, [epoch, this](auto& blocked_requests) {
                                   ~~^~~~
```

Signed-off-by: Kefu Chai <kchai@redhat.com>

Kiefer Chang [Tue, 3 Mar 2020 03:28:53 +0000 (11:28 +0800)]

cephadm: fix failure when getting keyring for deploying daemons

Fixes: https://tracker.ceph.com/issues/44390
Signed-off-by: Kiefer Chang <kiefer.chang@suse.com>

bk203 [Mon, 2 Mar 2020 13:19:56 +0000 (14:19 +0100)]

doc: update Zabbix template reference

The old link references a 2017 version of the template. I experienced problems using that version with the latest version of Ceph: Ceph would report "Failed to send data to Zabbix". After importing the newer 2019 version of the template into Zabbix, Ceph could send data again (the Zabbix Trapper item keys had changed). I propose replacing the link with one that references the master branch of the template, so that the docs always point to the newest version.

Signed-off-by: Mathijs Smit <m.smit@goldenvalue.nl>

Kefu Chai [Tue, 3 Mar 2020 02:09:49 +0000 (10:09 +0800)]

Merge pull request #33621 from aclamk/fix_mempool_btree

test/test_mempool: test accounting for btree_map

Reviewed-by: Kefu Chai <kchai@redhat.com>

Patrick Donnelly [Tue, 3 Mar 2020 01:01:48 +0000 (17:01 -0800)]

Merge PR #33363 into master

* refs/pull/33363/head:
mds: add perf counters for openfiletable

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>

Patrick Donnelly [Tue, 3 Mar 2020 00:59:51 +0000 (16:59 -0800)]

Merge PR #33491 into master

* refs/pull/33491/head:
mount.ceph: add "fs=<fs_name>" mount options support

Reviewed-by: Jeff Layton <jlayton@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>

Patrick Donnelly [Tue, 3 Mar 2020 00:58:02 +0000 (16:58 -0800)]

Merge PR #33506 into master

* refs/pull/33506/head:
client: add client_fs mount option support

Reviewed-by: Ramana Raja <rraja@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>

Patrick Donnelly [Tue, 3 Mar 2020 00:55:09 +0000 (16:55 -0800)]

Merge PR #33538 into master

* refs/pull/33538/head:
mds: just delete MDSIOContextBase during shutdown
mds: add comment explaining unusual locking for MDSIOContextBase

Reviewed-by: Zheng Yan <zyan@redhat.com>

Jason Dillaman [Mon, 2 Mar 2020 20:34:22 +0000 (15:34 -0500)]

rbd-mirror: move resetting of snapshot replayer rescan variable

The `m_image_updated` boolean should be reset at the start of the
state checking loop now that we scan the local image meta and check
for forced-promotion of the local image.

Signed-off-by: Jason Dillaman <dillaman@redhat.com>

Sage Weil [Mon, 2 Mar 2020 20:27:58 +0000 (14:27 -0600)]

Merge PR #33601 into master

* refs/pull/33601/head:
qa/tasks/thrashosds-health: disable osd_max_markdown behavior
osd: derr on do_shutdown cases

Reviewed-by: Neha Ojha <nojha@redhat.com>

Sage Weil [Mon, 2 Mar 2020 20:14:52 +0000 (14:14 -0600)]

Merge PR #33647 into master

* refs/pull/33647/head:
vstart.sh: fix install of cephadm ssh keys from ~/.ssh

Reviewed-by: Michael Fritch <mfritch@suse.com>

Sage Weil [Mon, 2 Mar 2020 20:14:40 +0000 (14:14 -0600)]

Merge PR #33648 into master

* refs/pull/33648/head:
qa/suites/cephadm: move orchestrator_cli test into rados/cephadm

Reviewed-by: Michael Fritch <mfritch@suse.com>

Sage Weil [Mon, 2 Mar 2020 20:14:28 +0000 (14:14 -0600)]

Merge PR #33658 into master

* refs/pull/33658/head:
cephadm: remove remaining crash_keyring cruft
vstart.sh: deploy crash containers
cephadm: bootstrap: deploy crash on every host
mgr/cephadm: deploy crash daemons
mgr/orch: add crash hooks
cephadm: do not magically deploy 'crash' container

Reviewed-by: Michael Fritch <mfritch@suse.com>

Sage Weil [Mon, 2 Mar 2020 19:23:56 +0000 (13:23 -0600)]

cephadm: remove remaining crash_keyring cruft

Signed-off-by: Sage Weil <sage@redhat.com>

Sage Weil [Mon, 2 Mar 2020 18:04:29 +0000 (12:04 -0600)]

Merge PR #33633 into master

* refs/pull/33633/head:
msg: add get_{pid,random}_nonce() helpers
msg/Policy: make stateless_server default to anon (again)
osd: drop broken 'poison pill'

Reviewed-by: Neha Ojha <nojha@redhat.com>

Jason Dillaman [Mon, 2 Mar 2020 17:17:09 +0000 (12:17 -0500)]

Merge pull request #33642 from trociny/wip-run-rbd-tests-pool-init

test/run-rbd-tests: properly initialize newly created rbd pool

Reviewed-by: Jason Dillaman <dillaman@redhat.com>

Sage Weil [Sun, 1 Mar 2020 22:50:28 +0000 (16:50 -0600)]

qa/suites/cephadm: move orchestrator_cli test into rados/cephadm

Signed-off-by: Sage Weil <sage@redhat.com>

Sage Weil [Mon, 2 Mar 2020 16:30:48 +0000 (10:30 -0600)]

vstart.sh: deploy crash containers

Signed-off-by: Sage Weil <sage@redhat.com>

Sage Weil [Sun, 1 Mar 2020 18:11:41 +0000 (12:11 -0600)]

cephadm: bootstrap: deploy crash on every host

Signed-off-by: Sage Weil <sage@redhat.com>

Sage Weil [Sun, 1 Mar 2020 17:38:51 +0000 (11:38 -0600)]

mgr/cephadm: deploy crash daemons

Signed-off-by: Sage Weil <sage@redhat.com>

Sage Weil [Sun, 1 Mar 2020 17:38:42 +0000 (11:38 -0600)]

mgr/orch: add crash hooks

Signed-off-by: Sage Weil <sage@redhat.com>

Sage Weil [Sat, 29 Feb 2020 16:50:42 +0000 (10:50 -0600)]

cephadm: do not magically deploy 'crash' container

We'll soon do this explicitly with a persistent spec instead.

Signed-off-by: Sage Weil <sage@redhat.com>

Sage Weil [Mon, 2 Mar 2020 16:30:03 +0000 (10:30 -0600)]

Merge PR #33523 into master

* refs/pull/33523/head:
mgr/orch: ServiceSpec: drop 'count'
mgr/rook: use spec.placement.count (instead of spec.count)
mgr/cephadm: make HostAssignment make sense
mgr/orch: PlacementSpec: do not combine all_hosts with anything else
mgr/orch: use PlacementSpec.from_strings() for all CLI commands

Reviewed-by: Joshua Schmid <jschmid@suse.de>

Jason Dillaman [Mon, 2 Mar 2020 13:36:23 +0000 (08:36 -0500)]

librbd: clean-up special casing for enabling mirroring on creating image

Signed-off-by: Jason Dillaman <dillaman@redhat.com>

Jason Dillaman [Thu, 27 Feb 2020 03:25:54 +0000 (22:25 -0500)]

qa: test cases for snapshot-based mirroring

Signed-off-by: Jason Dillaman <dillaman@redhat.com>

Jason Dillaman [Thu, 27 Feb 2020 20:58:59 +0000 (15:58 -0500)]

rbd-mirror: simple snapshot split-brain detection

If the local image has a primary demotion and the remote does not
have a matching non-primary demotion snapshot, we should throw a
split-brain error.

Signed-off-by: Jason Dillaman <dillaman@redhat.com>

Jason Dillaman [Thu, 27 Feb 2020 19:52:06 +0000 (14:52 -0500)]

librbd: refresh image after creating primary mirror snapshot

If RPC was used to create the snapshot, the local image context will
not yet have the snapshot id and will therefore return CEPH_NOSNAP.

Signed-off-by: Jason Dillaman <dillaman@redhat.com>

Jason Dillaman [Thu, 27 Feb 2020 19:50:59 +0000 (14:50 -0500)]

librbd: acquire exclusive lock from peer when removing

This solves an issue with snapshot-based mirroring when the
rbd-mirror daemon is the exclusive lock owner. For other cases,
it still checks for watchers before proceeding.

Signed-off-by: Jason Dillaman <dillaman@redhat.com>

Jason Dillaman [Thu, 27 Feb 2020 19:49:11 +0000 (14:49 -0500)]

librbd: fixed issue with deep-copy incorrectly setting parent spec

When not syncing the full image, the destination image would receive
the incorrect source parent spec instead of the expected destination
parent spec.

Signed-off-by: Jason Dillaman <dillaman@redhat.com>

Jason Dillaman [Thu, 27 Feb 2020 17:44:48 +0000 (12:44 -0500)]

rbd: do not show mirroring mode if not enabled

The 'info' action was incorrectly showing the mirroring mode
when mirroring was not enabled.

Signed-off-by: Jason Dillaman <dillaman@redhat.com>

Jason Dillaman [Thu, 27 Feb 2020 15:44:21 +0000 (10:44 -0500)]

librbd: allow force promotions to create necessary orphan snapshot

The rbd-mirror daemon needs to accept the RPC request to create an
orphan snapshot so that it knows to shut-down the replayer.

Signed-off-by: Jason Dillaman <dillaman@redhat.com>

Jason Dillaman [Thu, 27 Feb 2020 13:53:37 +0000 (08:53 -0500)]

librbd: forced promotion of snapshot mirrored image needs to create orphan

Tweak the state machine to create an orphan record if the current
state is not demoted.

Signed-off-by: Jason Dillaman <dillaman@redhat.com>

Jason Dillaman [Thu, 27 Feb 2020 13:50:18 +0000 (08:50 -0500)]

rbd-mirror: properly handle promoted/demoted snapshot-based images

We should return -EREMOTEIO if we don't have any primary images to sync to
since we want to display a warning. Additionally, don't attempt to sync a
remote snapshot against a primary (demoted) local snapshot since it would
have an invalid primary snapshot id.

Signed-off-by: Jason Dillaman <dillaman@redhat.com>

Jason Dillaman [Thu, 27 Feb 2020 04:30:08 +0000 (23:30 -0500)]

librbd: improve debug logging on primary mirror snapshot creation

Signed-off-by: Jason Dillaman <dillaman@redhat.com>

Jason Dillaman [Thu, 27 Feb 2020 04:29:12 +0000 (23:29 -0500)]

cls/rbd: fixed typo in mirror image state ostream operator

The non-primary demoted state was incorrectly outputting demoted
twice.

Signed-off-by: Jason Dillaman <dillaman@redhat.com>

Jason Dillaman [Thu, 27 Feb 2020 03:25:03 +0000 (22:25 -0500)]

rbd-mirror: do not re-register the ASOK if the image replayer isn't running

Avoid a possible race during shutdown in which a status update arriving
after the unregister causes the ASOK to be re-registered.

Signed-off-by: Jason Dillaman <dillaman@redhat.com>

Casey Bodley [Mon, 2 Mar 2020 14:38:04 +0000 (09:38 -0500)]

Merge pull request #32660 from cbodley/wip-rgw-admin-zone-shards

rgw: Increase the default number of RGW bucket shards

Reviewed-by: J. Eric Ivancich <ivancich@redhat.com>

Casey Bodley [Mon, 2 Mar 2020 14:12:34 +0000 (09:12 -0500)]

Merge pull request #33628 from cbodley/wip-44353

rgw: fix sharded bucket listing with prefix/delimiter

Reviewed-by: J. Eric Ivancich <ivancich@redhat.com>

Lenz Grimmer [Mon, 2 Mar 2020 13:55:52 +0000 (13:55 +0000)]

Merge pull request #31922 from rhcs-dashboard/42094-bucket-mfa-delete

mgr/dashboard: enable/disable MFA Delete on RGW bucket

Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Laura Paduano <lpaduano@suse.com>

Lenz Grimmer [Mon, 2 Mar 2020 13:54:31 +0000 (13:54 +0000)]

Merge pull request #33178 from rhcs-dashboard/42475-rgw-api-keys-allowed

mgr/dashboard: do not show RGW API keys if only read-only privileges

Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Laura Paduano <lpaduano@suse.com>
Reviewed-by: Tatjana Dehler <tdehler@suse.com>

Adam Kupczyk [Mon, 2 Mar 2020 08:08:45 +0000 (09:08 +0100)]

common/mempool: Add test for mempool in btree_map

Signed-off-by: Adam Kupczyk <akupczyk@redhat.com>

Kefu Chai [Mon, 2 Mar 2020 11:16:09 +0000 (19:16 +0800)]

Merge pull request #33638 from tchaikov/wip-btree

include/cpp-btree: use the same type when allocate/deallocate

Reviewed-by: Adam Kupczyk <akupczyk@redhat.com>

Kefu Chai [Mon, 2 Mar 2020 11:11:42 +0000 (19:11 +0800)]

Merge pull request #33645 from liewegas/fix-mon-down-cephtool-test

qa/suites/rados/singleton-bluestore/cephtool: whitelist MON_DOWN

Reviewed-by: Kefu Chai <kchai@redhat.com>

Kefu Chai [Mon, 2 Mar 2020 11:10:29 +0000 (19:10 +0800)]

Merge pull request #33536 from yuvalif/sync_object_write_with_return_value_op

librados: allow passing flags to operate sync APIs

Reviewed-by: Casey Bodley <cbodley@redhat.com>
Reviewed-by: Sage Weil <sage@redhat.com>

Kefu Chai [Mon, 2 Mar 2020 11:01:29 +0000 (19:01 +0800)]

Merge pull request #33583 from rzarzynski/wip-recovery-cleanup1

osd: tiny clean-ups around the backfill

Reviewed-by: David Zafman <dzafman@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>

Kefu Chai [Mon, 2 Mar 2020 10:58:19 +0000 (18:58 +0800)]

Merge pull request #33593 from dzafman/wip-cot-fix

test: Fix failing ceph_objectstore_tool.py test

Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>

Kefu Chai [Mon, 2 Mar 2020 10:25:21 +0000 (18:25 +0800)]

Merge pull request #33373 from rzarzynski/wip-bl-bumpup_api_verion

common, include: bump the version of ceph::buffer's C++ API.

Reviewed-by: Kefu Chai <kchai@redhat.com>

Lenz Grimmer [Mon, 2 Mar 2020 10:24:30 +0000 (10:24 +0000)]

Merge pull request #33572 from p-se/wip-pse-grafana-unit-issue

mgr/dashboard: fix stray requests/error in Grafana unit test

Reviewed-by: Kiefer Chang <kiefer.chang@suse.com>
Reviewed-by: Tatjana Dehler <tdehler@suse.com>

Kefu Chai [Mon, 2 Mar 2020 10:15:24 +0000 (18:15 +0800)]

Merge pull request #31041 from liu-chunmei/ceph_seastar_alien_blue_store

crimson:: add alien blue store

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>

Kefu Chai [Mon, 2 Mar 2020 09:36:10 +0000 (17:36 +0800)]

Merge pull request #33595 from cyx1231st/wip-crimson-osd-fixes

crimson/osd: misc fixes for OSD reboot-ability

Reviewed-by: Kefu Chai <kchai@redhat.com>

Kefu Chai [Mon, 2 Mar 2020 09:30:34 +0000 (17:30 +0800)]

Merge pull request #33597 from optimistyzy/228_improve_sync_read

NVMEDevice: Remove the unnecessary aio_wait in sync read

Reviewed-by: Kefu Chai <kchai@redhat.com>

Kefu Chai [Sat, 29 Feb 2020 06:51:28 +0000 (14:51 +0800)]

include/cpp-btree: use the same type when allocate/deallocate

btree_set<> by default uses `std::allocator<Key>`, and btree_map by
default uses `std::allocator<std::pair<Key, Value>>`.

before this change, btree uses the allocator directly for allocating n
elements where element is `Key` or `std::pair<Key, Value>` respectively,
while "n" is actually supposed to be the number of bytes used by each
node which is being allocated.
but, what we need to allocate is actually a "node_type" for holding
multiple slots, and each slot holds an element. in addition to the
slots, a node also keeps track of metadata for btree itself. in short,
what we allocate now is (in bytes):

  alignof(sizeof(node_type)) * sizeof(element)

but what we should allocate is (in bytes):

  alignof(sizeof(node_type))

in this change:

* always rebind the allocator to the correct aligned type with given
  alignment
* extract the allocator related helpers into a template class

Signed-off-by: Kefu Chai <kchai@redhat.com>

Kefu Chai [Mon, 2 Mar 2020 05:26:00 +0000 (13:26 +0800)]

Merge pull request #33650 from joscollin/wip-unused-nvme

test: silence warning unused variable nvme

Reviewed-by: Kefu Chai <kchai@redhat.com>

Jos Collin [Mon, 2 Mar 2020 03:34:39 +0000 (09:04 +0530)]

test: silence warning unused variable nvme

Signed-off-by: Jos Collin <jcollin@redhat.com>

Sage Weil [Mon, 2 Mar 2020 01:53:35 +0000 (19:53 -0600)]

Merge PR #33512 into master

* refs/pull/33512/head:
os/bluestore/BlueFS: less verbose about alloc adjustments

Reviewed-by: Igor Fedotov <ifedotov@suse.com>

Sage Weil [Mon, 2 Mar 2020 01:53:25 +0000 (19:53 -0600)]

Merge PR #33575 into master

* refs/pull/33575/head:
mgr/telemetry: fix and document proxy usage

Reviewed-by: Lenz Grimmer <lgrimmer@suse.com>
Reviewed-by: Sage Weil <sage@redhat.com>
Reviewed-by: Yaarit Hatuka <yaarithatuka@gmail.com>

Sage Weil [Sun, 1 Mar 2020 23:15:29 +0000 (17:15 -0600)]

Merge PR #33644 into master

* refs/pull/33644/head:
qa/suites/rados/verity/tasks/mon_recovery: whitelist SLOW_OPS

Reviewed-by: Kefu Chai <kchai@redhat.com>

Sage Weil [Sun, 1 Mar 2020 22:55:06 +0000 (16:55 -0600)]

Merge PR #33541 into master

* refs/pull/33541/head:
mgr/test_orchestrator: add daemon_action
qa/tasksmgr/test_orchestrator_cli: fix test_service_action
qa/tasks/test_orchestrator_cli: disable test_osd_create exception check
mgr/test_orchestrator: update_foo -> apply_foo
mgr/test_orchestrator: add force flag to remove_daemons
qa/tasks/mgr/test_orchestrator_cli: update
qa/tasks/mgr/test_orchestrator_cli: fix test_load_data
mgr/test_orchestrator: fix list_daemons

Reviewed-by: Joshua Schmid <jschmid@suse.de>
Reviewed-by: Sebastian Wagner <swagner@suse.com>

Sage Weil [Sun, 1 Mar 2020 20:26:09 +0000 (14:26 -0600)]

Merge PR #33627 into master

* refs/pull/33627/head:
doc/cephadm/administration: clarify log gathering

Reviewed-by: Sage Weil <sage@redhat.com>

Sage Weil [Sun, 1 Mar 2020 17:30:46 +0000 (11:30 -0600)]

mgr/orch: ServiceSpec: drop 'count'

Signed-off-by: Sage Weil <sage@redhat.com>

Sage Weil [Sun, 1 Mar 2020 17:30:38 +0000 (11:30 -0600)]

mgr/rook: use spec.placement.count (instead of spec.count)

Signed-off-by: Sage Weil <sage@redhat.com>

Sage Weil [Tue, 25 Feb 2020 00:29:12 +0000 (18:29 -0600)]

mgr/cephadm: make HostAssignment make sense

- if hosts are passed, use those.
- if all_hosts=true, use all hosts.

Otherwise, build a set of hosts based on the labels--either explicit or
implied. If there's no label, use all hosts.

If there is a count, use a subset of candidate hosts. If there was no
label and there is no count, fail.

If count and hosts are both provided, then we either (1) use the hosts
as the candidate list and place among them (if len(hosts) >= count), or
(2) ensure that any result includes *at least* the provided hosts.

Add tests that (I'm pretty sure) cover cases!

Signed-off-by: Sage Weil <sage@redhat.com>
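
The placement rules described in this commit message can be sketched roughly as follows. This is an illustrative Python sketch, not the code from the commit: the function name `place`, its parameters, the shape of the inventory (host name mapped to a set of labels), and the deterministic "first N in sorted order" selection are all assumptions made for the example. The real HostAssignment class may select or order hosts differently.

```python
from typing import Dict, List, Optional, Set


def place(inventory: Dict[str, Set[str]],
          hosts: Optional[List[str]] = None,
          use_all_hosts: bool = False,
          label: Optional[str] = None,
          count: Optional[int] = None) -> List[str]:
    """Return the hosts a service should run on (hypothetical helper)."""
    known = sorted(inventory)

    # all_hosts=true: use every known host.
    if use_all_hosts:
        return known

    # Explicit hosts without a count: use exactly those.
    if hosts and count is None:
        return list(hosts)

    # Explicit hosts plus a count.
    if hosts:
        if len(hosts) >= count:
            # (1) The hosts are the candidate list; place among them.
            return list(hosts)[:count]
        # (2) Keep all provided hosts and top up from the rest.
        extra = [h for h in known if h not in hosts]
        return list(hosts) + extra[:count - len(hosts)]

    # No label and no count: nothing to base a placement on.
    if label is None and count is None:
        raise ValueError("need hosts, all_hosts, a label, or a count")

    # Candidates come from the label; with no label, all hosts qualify.
    candidates = [h for h in known if label is None or label in inventory[h]]
    return candidates if count is None else candidates[:count]
```

With an inventory of `{"a": {"mon"}, "b": set(), "c": {"mon"}}`, a call with `label="mon"` yields hosts `a` and `c`, while `hosts=["c"], count=2` keeps `c` and adds one more host to reach the count.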

Sage Weil [Fri, 28 Feb 2020 16:17:50 +0000 (10:17 -0600)]

mgr/orch: PlacementSpec: do not combine all_hosts with anything else

Signed-off-by: Sage Weil <sage@redhat.com>

Sage Weil [Thu, 27 Feb 2020 19:32:24 +0000 (13:32 -0600)]

mgr/orch: use PlacementSpec.from_strings() for all CLI commands

Signed-off-by: Sage Weil <sage@redhat.com>

Sage Weil [Sun, 1 Mar 2020 20:18:39 +0000 (14:18 -0600)]

Merge PR #33553 into master

* refs/pull/33553/head:
mgr/cephadm: orch ls: include specs with no daemons
mgr/cephadm: orch ls: show spec size
mgr/orch: remove unused fields in RGWSpec
mgr/orch: fix ServiceSpec fields
mgr/cephadm: simplify spec apply
pybind/mgr/mgr_module: revert PersistentStoreDict separator
mgr/cephadm: apply services after refreshing inventory
mgr/cephadm: catch and log exceptions from apply
mgr/orch: no extra whitespace in stored json specs
mgr/cephadm: drop daemon_type arg to _apply_service
mgr/cephadm: use _apply() helper for all apply_ methods
mgr/cephadm: replace PersistentStoreDict with SpecStore
mgr/cephadm: do not remove service spec when removing a daemon
mgr/cephadm: rename completion variables&cleanup
mgr/cephadm: leverage service specs

Reviewed-by: Sage Weil <sage@redhat.com>

Sage Weil [Sun, 1 Mar 2020 20:18:16 +0000 (14:18 -0600)]

mgr/test_orchestrator: add daemon_action

Signed-off-by: Sage Weil <sage@redhat.com>

Sage Weil [Sun, 1 Mar 2020 17:52:36 +0000 (11:52 -0600)]

qa/tasksmgr/test_orchestrator_cli: fix test_service_action

Signed-off-by: Sage Weil <sage@redhat.com>

Sage Weil [Sun, 1 Mar 2020 18:06:19 +0000 (12:06 -0600)]

vstart.sh: fix install of cephadm ssh keys from ~/.ssh

Signed-off-by: Sage Weil <sage@redhat.com>

Sage Weil [Sun, 1 Mar 2020 16:48:42 +0000 (10:48 -0600)]

qa/tasks/test_orchestrator_cli: disable test_osd_create exception check

I'm not sure what behavior we want here...

Signed-off-by: Sage Weil <sage@redhat.com>

Sage Weil [Sun, 1 Mar 2020 14:27:11 +0000 (08:27 -0600)]

mgr/cephadm: orch ls: include specs with no daemons

This way you see entries like

  NAME     RUNNING  REFRESHED  IMAGE NAME  IMAGE ID   SPEC
  mds.foo  0/1      -          <unknown>   <unknown>  present

before any daemons have been created.

Signed-off-by: Sage Weil <sage@redhat.com>

Sage Weil [Sun, 1 Mar 2020 14:23:40 +0000 (08:23 -0600)]

mgr/cephadm: orch ls: show spec size

Signed-off-by: Sage Weil <sage@redhat.com>

Sage Weil [Sun, 1 Mar 2020 15:05:50 +0000 (09:05 -0600)]

mgr/orch: remove unused fields in RGWSpec

Signed-off-by: Sage Weil <sage@redhat.com>

Sage Weil [Sun, 1 Mar 2020 13:48:17 +0000 (07:48 -0600)]

mgr/orch: fix ServiceSpec fields

- service_type is required. Make it the first position arg to the ctor.
- service_id is the id *only* and optional.
- service_name() is the full service name (no change)

The old 'name' was previously used as the id only, so it was poorly named,
and optional, but in this series was changed to include the type, breaking
naming for a bunch of things (e.g., daemons called mds.mds.fsname.xyz).

Signed-off-by: Sage Weil <sage@redhat.com>
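
The naming rules in this commit message can be illustrated with a small sketch. The class name `ServiceSpecSketch` is hypothetical and the `type.id` join is inferred from the `mds.mds.fsname.xyz` example above; the real ServiceSpec class carries more fields and may differ in detail.

```python
from typing import Optional


class ServiceSpecSketch:
    """Hypothetical stand-in for ServiceSpec, showing only the naming rules."""

    def __init__(self, service_type: str, service_id: Optional[str] = None):
        # service_type is required and comes first.
        self.service_type = service_type
        # service_id is the id only (no type prefix) and may be omitted.
        self.service_id = service_id

    def service_name(self) -> str:
        # The full service name is the type, plus ".<id>" when an id is set.
        if self.service_id:
            return f"{self.service_type}.{self.service_id}"
        return self.service_type


spec = ServiceSpecSketch("mds", "fsname")
daemon_name = f"{spec.service_name()}.xyz"  # one type prefix, not "mds.mds..."
```

Keeping the type out of `service_id` is what prevents the doubled prefix: the type is added exactly once, inside `service_name()`.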

Sage Weil [Sun, 1 Mar 2020 03:09:57 +0000 (21:09 -0600)]

mgr/cephadm: simplify spec apply

- Teach _apply_service how to pick the create (and config) functions, so
that we don't need any weird wrappers in the callers.
- Replace trigger_deploy() and _apply_services() with a simpler
_apply_all_services()
- Drop all of the per-type _apply_foo() methods.

Signed-off-by: Sage Weil <sage@redhat.com>

Sage Weil [Sun, 1 Mar 2020 02:47:38 +0000 (20:47 -0600)]

pybind/mgr/mgr_module: revert PersistentStoreDict separator

This fixes the self-test.

Signed-off-by: Sage Weil <sage@redhat.com>

Sage Weil [Sat, 29 Feb 2020 21:59:25 +0000 (15:59 -0600)]

mgr/cephadm: apply services after refreshing inventory

Ensure we have an inventory to apply against.

Signed-off-by: Sage Weil <sage@redhat.com>

Sage Weil [Sat, 29 Feb 2020 21:55:40 +0000 (15:55 -0600)]

mgr/cephadm: catch and log exceptions from apply

Signed-off-by: Sage Weil <sage@redhat.com>

Sage Weil [Fri, 28 Feb 2020 23:05:26 +0000 (17:05 -0600)]

mgr/orch: no extra whitespace in stored json specs

Signed-off-by: Sage Weil <sage@redhat.com>

Sage Weil [Fri, 28 Feb 2020 23:05:05 +0000 (17:05 -0600)]

mgr/cephadm: drop daemon_type arg to _apply_service

This is part of ServiceSpec now.

Signed-off-by: Sage Weil <sage@redhat.com>

Sage Weil [Fri, 28 Feb 2020 22:55:31 +0000 (16:55 -0600)]

mgr/cephadm: use _apply() helper for all apply_ methods

Clean up a few that were missed (due to racing PRs, presumably).

'creation' -> 'update', since an apply doesn't necessarily create (it may
destroy!).

Signed-off-by: Sage Weil <sage@redhat.com>

Sage Weil [Fri, 28 Feb 2020 22:38:26 +0000 (16:38 -0600)]

mgr/cephadm: replace PersistentStoreDict with SpecStore

Explicit implementation of the dict of specs.

Signed-off-by: Sage Weil <sage@redhat.com>

Sage Weil [Fri, 28 Feb 2020 21:32:04 +0000 (15:32 -0600)]

mgr/cephadm: do not remove service spec when removing a daemon

This makes our behavior similar to kube: if you kill a pod, the operator
or controller will come along and create a new one (probably somewhere
else).

Signed-off-by: Sage Weil <sage@redhat.com>

Joshua Schmid [Thu, 27 Feb 2020 12:30:56 +0000 (13:30 +0100)]

mgr/cephadm: rename completion variables&cleanup

Signed-off-by: Joshua Schmid <jschmid@suse.de>

Joshua Schmid [Wed, 26 Feb 2020 13:26:42 +0000 (14:26 +0100)]

mgr/cephadm: leverage service specs

Fixes: https://tracker.ceph.com/issues/44205
This does a couple of things:

* Change the way apply_$service() works:

Instead of triggering the deployment mechanism it will rather
transform the already passed ServiceSpec into a json representation
and save it in a persistent mon_store section.

`mgr/cephadm/service_spec/$service|daemon_type/service_name`

These locations will be periodically checked in the serve() thread.
This works since all the apply_$service_type functions are idempotent.

* Allow saving a config-like specification in the mon_store.

`ceph orch apply -i <service_spec_file.yaml>`

will read the specified services and save them in the mon store
section as mentioned above. The same serve() mechanism described above
also applies to deployment.

Signed-off-by: Joshua Schmid <jschmid@suse.de>
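
The "persist the spec, then reconcile periodically" idea from this commit message might look like the sketch below. Everything here is an assumption made for illustration: a plain dict stands in for the mon store, and the key prefix `SPEC_PREFIX`, the helpers `save_spec` and `apply_all`, and the `placement.count` field layout are hypothetical rather than taken from the cephadm module.

```python
import json
from typing import Callable, Dict

SPEC_PREFIX = "service_spec/"  # hypothetical key prefix in the store


def save_spec(store: Dict[str, str], service_name: str, spec: dict) -> None:
    """Persist a spec as JSON instead of deploying anything right away."""
    store[SPEC_PREFIX + service_name] = json.dumps(spec, sort_keys=True)


def apply_all(store: Dict[str, str],
              running: Dict[str, int],
              create_daemon: Callable[[str], None]) -> int:
    """One pass of a serve()-style loop; returns how many daemons it created."""
    created = 0
    for key, raw in sorted(store.items()):
        if not key.startswith(SPEC_PREFIX):
            continue
        name = key[len(SPEC_PREFIX):]
        wanted = json.loads(raw)["placement"]["count"]
        # Only create what is missing, so repeated passes are harmless.
        for _ in range(max(wanted - running.get(name, 0), 0)):
            create_daemon(name)
            running[name] = running.get(name, 0) + 1
            created += 1
    return created
```

Because each pass only closes the gap between the stored spec and what is running, calling `apply_all` again after the daemons exist creates nothing, which is the idempotence the periodic check relies on.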

Sage Weil [Sun, 1 Mar 2020 14:03:21 +0000 (08:03 -0600)]

qa/suites/rados/singleton-bluestore/cephtool: whitelist MON_DOWN

cephtool/test.sh now includes a test that disallows mon from the quorum
for a short period.

Signed-off-by: Sage Weil <sage@redhat.com>

Sage Weil [Sun, 1 Mar 2020 13:58:11 +0000 (07:58 -0600)]

qa/suites/rados/verity/tasks/mon_recovery: whitelist SLOW_OPS

The mon can see slow ops when thrashing.

Signed-off-by: Sage Weil <sage@redhat.com>

Sage Weil [Sun, 1 Mar 2020 13:23:54 +0000 (07:23 -0600)]

mgr/test_orchestrator: update_foo -> apply_foo

Signed-off-by: Sage Weil <sage@redhat.com>

Sage Weil [Sun, 1 Mar 2020 13:22:45 +0000 (07:22 -0600)]

Merge PR #33639 into master

* refs/pull/33639/head:
pybind/mgr/mgr_module: fix standby module logging options

Reviewed-by: Kefu Chai <kchai@redhat.com>

Sage Weil [Fri, 28 Feb 2020 21:11:37 +0000 (15:11 -0600)]

msg: add get_{pid,random}_nonce() helpers

In cases where we normally use a pid for a nonce, fall back to a random
value when the pid == 1 (i.e., we're in a container). For the cases where
we use a random value, use the helper.

Signed-off-by: Sage Weil <sage@redhat.com>
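
The fallback described in this commit message is small enough to sketch. Ceph's messenger code is C++; Python is used here only to illustrate the logic, and the optional `pid` parameter and the 64-bit width of the random value are assumptions made for the example.

```python
import os
import random
from typing import Optional


def get_random_nonce() -> int:
    """Return a random 64-bit value to use as a nonce."""
    return random.getrandbits(64)


def get_pid_nonce(pid: Optional[int] = None) -> int:
    """Use the pid as the nonce, unless the pid is 1 (likely a container)."""
    if pid is None:
        pid = os.getpid()
    if pid == 1:
        # Every containerized daemon sees pid 1, so the pid is not unique.
        return get_random_nonce()
    return pid
```

The point of the fallback is that an address made unique by a nonce stays unique even when every daemon on a host reports the same pid.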

Sage Weil [Fri, 28 Feb 2020 20:52:02 +0000 (14:52 -0600)]

msg/Policy: make stateless_server default to anon (again)

Midway through the octopus cycle, we made stateless server more stateless
in the sense that it would not register incoming client connections.  And,
in so doing, it would not enforce that client connections came from
unique addresses, by closing an existing connection from the same addr
when a new connection was accepted.

This turned out to cause out of order OSD ops because the OSD needed that
behavior.  See https://tracker.ceph.com/issues/42328.  We fixed that by
reverting to the old behavior for all but monitor connections, where we
needed it, in 507d213cc453ed86ab38619590f710f33245c652.

This, in turn, breaks most OSD <-> OSD communication (and probably lots
of other things) with cephadm, because we make entity_addr_t unique with
a nonce that is populated by getpid()... and the containerized daemons
all have pid 1.  Once we finally merged the follow-on fixes for the change
above, cephadm OSDs could no longer ping each other.

In my view, the 'anon' connection handling is a good idea in the general
case.  So, let's adjust our fix for #42328 so that it is only the OSD
client-side interface that registers client connections and makes them
unique.

Fixes: https://tracker.ceph.com/issues/44358
Signed-off-by: Sage Weil <sage@redhat.com>

Sage Weil [Fri, 28 Feb 2020 20:46:22 +0000 (14:46 -0600)]

osd: drop broken 'poison pill'

This was effectively a no-op, since the default policy was *also*
stateless_server.

This line originates from v0.24 (2010) when we added the cluster msgr.

Signed-off-by: Sage Weil <sage@redhat.com>

Mykola Golub [Sun, 1 Mar 2020 08:47:21 +0000 (08:47 +0000)]

test/run-rbd-tests: properly initialize newly created rbd pool

The scheduler tests from cli_generic expect the 'rbd' pool to be marked as
an rbd application pool.

Signed-off-by: Mykola Golub <mgolub@suse.com>
