git.apps.os.sepia.ceph.com Git - ceph.git/log
ceph.git
5 years agocephadm: fix failure when getting keyring for deploying daemons 33679/head
Kiefer Chang [Tue, 3 Mar 2020 03:28:53 +0000 (11:28 +0800)]
cephadm: fix failure when getting keyring for deploying daemons

Fixes: https://tracker.ceph.com/issues/44390
Signed-off-by: Kiefer Chang <kiefer.chang@suse.com>
5 years agoMerge PR #33363 into master
Patrick Donnelly [Tue, 3 Mar 2020 01:01:48 +0000 (17:01 -0800)]
Merge PR #33363 into master

* refs/pull/33363/head:
mds: add perf counters for openfiletable

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
5 years agoMerge PR #33491 into master
Patrick Donnelly [Tue, 3 Mar 2020 00:59:51 +0000 (16:59 -0800)]
Merge PR #33491 into master

* refs/pull/33491/head:
mount.ceph: add "fs=<fs_name>" mount options support

Reviewed-by: Jeff Layton <jlayton@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
5 years agoMerge PR #33506 into master
Patrick Donnelly [Tue, 3 Mar 2020 00:58:02 +0000 (16:58 -0800)]
Merge PR #33506 into master

* refs/pull/33506/head:
client: add client_fs mount option support

Reviewed-by: Ramana Raja <rraja@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
5 years agoMerge PR #33538 into master
Patrick Donnelly [Tue, 3 Mar 2020 00:55:09 +0000 (16:55 -0800)]
Merge PR #33538 into master

* refs/pull/33538/head:
mds: just delete MDSIOContextBase during shutdown
mds: add comment explaining unusual locking for MDSIOContextBase

Reviewed-by: Zheng Yan <zyan@redhat.com>
5 years agoMerge PR #33601 into master
Sage Weil [Mon, 2 Mar 2020 20:27:58 +0000 (14:27 -0600)]
Merge PR #33601 into master

* refs/pull/33601/head:
qa/tasks/thrashosds-health: disable osd_max_markdown behavior
osd: derr on do_shutdown cases

Reviewed-by: Neha Ojha <nojha@redhat.com>
5 years agoMerge PR #33647 into master
Sage Weil [Mon, 2 Mar 2020 20:14:52 +0000 (14:14 -0600)]
Merge PR #33647 into master

* refs/pull/33647/head:
vstart.sh: fix install of cephadm ssh keys from ~/.ssh

Reviewed-by: Michael Fritch <mfritch@suse.com>
5 years agoMerge PR #33648 into master
Sage Weil [Mon, 2 Mar 2020 20:14:40 +0000 (14:14 -0600)]
Merge PR #33648 into master

* refs/pull/33648/head:
qa/suites/cephadm: move orchestrator_cli test into rados/cephadm

Reviewed-by: Michael Fritch <mfritch@suse.com>
5 years agoMerge PR #33658 into master
Sage Weil [Mon, 2 Mar 2020 20:14:28 +0000 (14:14 -0600)]
Merge PR #33658 into master

* refs/pull/33658/head:
cephadm: remove remaining crash_keyring cruft
vstart.sh: deploy crash containers
cephadm: bootstrap: deploy crash on every host
mgr/cephadm: deploy crash daemons
mgr/orch: add crash hooks
cephadm: do not magically deploy 'crash' container

Reviewed-by: Michael Fritch <mfritch@suse.com>
5 years agocephadm: remove remaining crash_keyring cruft 33658/head
Sage Weil [Mon, 2 Mar 2020 19:23:56 +0000 (13:23 -0600)]
cephadm: remove remaining crash_keyring cruft

Signed-off-by: Sage Weil <sage@redhat.com>
5 years agoMerge PR #33633 into master
Sage Weil [Mon, 2 Mar 2020 18:04:29 +0000 (12:04 -0600)]
Merge PR #33633 into master

* refs/pull/33633/head:
msg: add get_{pid,random}_nonce() helpers
msg/Policy: make stateless_server default to anon (again)
osd: drop broken 'poison pill'

Reviewed-by: Neha Ojha <nojha@redhat.com>
5 years agoMerge pull request #33642 from trociny/wip-run-rbd-tests-pool-init
Jason Dillaman [Mon, 2 Mar 2020 17:17:09 +0000 (12:17 -0500)]
Merge pull request #33642 from trociny/wip-run-rbd-tests-pool-init

test/run-rbd-tests: properly initialize newly created rbd pool

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
5 years agoqa/suites/cephadm: move orchestrator_cli test into rados/cephadm 33648/head
Sage Weil [Sun, 1 Mar 2020 22:50:28 +0000 (16:50 -0600)]
qa/suites/cephadm: move orchestrator_cli test into rados/cephadm

Signed-off-by: Sage Weil <sage@redhat.com>
5 years agovstart.sh: deploy crash containers
Sage Weil [Mon, 2 Mar 2020 16:30:48 +0000 (10:30 -0600)]
vstart.sh: deploy crash containers

Signed-off-by: Sage Weil <sage@redhat.com>
5 years agocephadm: bootstrap: deploy crash on every host
Sage Weil [Sun, 1 Mar 2020 18:11:41 +0000 (12:11 -0600)]
cephadm: bootstrap: deploy crash on every host

Signed-off-by: Sage Weil <sage@redhat.com>
5 years agomgr/cephadm: deploy crash daemons
Sage Weil [Sun, 1 Mar 2020 17:38:51 +0000 (11:38 -0600)]
mgr/cephadm: deploy crash daemons

Signed-off-by: Sage Weil <sage@redhat.com>
5 years agomgr/orch: add crash hooks
Sage Weil [Sun, 1 Mar 2020 17:38:42 +0000 (11:38 -0600)]
mgr/orch: add crash hooks

Signed-off-by: Sage Weil <sage@redhat.com>
5 years agocephadm: do not magically deploy 'crash' container
Sage Weil [Sat, 29 Feb 2020 16:50:42 +0000 (10:50 -0600)]
cephadm: do not magically deploy 'crash' container

We'll soon do this explicitly with a persistent spec instead.

Signed-off-by: Sage Weil <sage@redhat.com>
5 years agoMerge PR #33523 into master
Sage Weil [Mon, 2 Mar 2020 16:30:03 +0000 (10:30 -0600)]
Merge PR #33523 into master

* refs/pull/33523/head:
mgr/orch: ServiceSpec: drop 'count'
mgr/rook: use spec.placement.count (instead of spec.count)
mgr/cephadm: make HostAssignment make sense
mgr/orch: PlacementSpec: do not combine all_hosts with anything else
mgr/orch: use PlacementSpec.from_strings() for all CLI commands

Reviewed-by: Joshua Schmid <jschmid@suse.de>
5 years agoMerge pull request #32660 from cbodley/wip-rgw-admin-zone-shards
Casey Bodley [Mon, 2 Mar 2020 14:38:04 +0000 (09:38 -0500)]
Merge pull request #32660 from cbodley/wip-rgw-admin-zone-shards

rgw: Increase the default number of RGW bucket shards

Reviewed-by: J. Eric Ivancich <ivancich@redhat.com>
5 years agoMerge pull request #33628 from cbodley/wip-44353
Casey Bodley [Mon, 2 Mar 2020 14:12:34 +0000 (09:12 -0500)]
Merge pull request #33628 from cbodley/wip-44353

rgw: fix sharded bucket listing with prefix/delimiter

Reviewed-by: J. Eric Ivancich <ivancich@redhat.com>
5 years agoMerge pull request #31922 from rhcs-dashboard/42094-bucket-mfa-delete
Lenz Grimmer [Mon, 2 Mar 2020 13:55:52 +0000 (13:55 +0000)]
Merge pull request #31922 from rhcs-dashboard/42094-bucket-mfa-delete

mgr/dashboard: enable/disable MFA Delete on RGW bucket

Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Laura Paduano <lpaduano@suse.com>
5 years agoMerge pull request #33178 from rhcs-dashboard/42475-rgw-api-keys-allowed
Lenz Grimmer [Mon, 2 Mar 2020 13:54:31 +0000 (13:54 +0000)]
Merge pull request #33178 from rhcs-dashboard/42475-rgw-api-keys-allowed

mgr/dashboard: do not show RGW API keys if only read-only privileges

Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Laura Paduano <lpaduano@suse.com>
Reviewed-by: Tatjana Dehler <tdehler@suse.com>
5 years agoMerge pull request #33638 from tchaikov/wip-btree
Kefu Chai [Mon, 2 Mar 2020 11:16:09 +0000 (19:16 +0800)]
Merge pull request #33638 from tchaikov/wip-btree

include/cpp-btree: use the same type when allocate/deallocate

Reviewed-by: Adam Kupczyk <akupczyk@redhat.com>
5 years agoMerge pull request #33645 from liewegas/fix-mon-down-cephtool-test
Kefu Chai [Mon, 2 Mar 2020 11:11:42 +0000 (19:11 +0800)]
Merge pull request #33645 from liewegas/fix-mon-down-cephtool-test

qa/suites/rados/singleton-bluestore/cephtool: whitelist MON_DOWN

Reviewed-by: Kefu Chai <kchai@redhat.com>
5 years agoMerge pull request #33536 from yuvalif/sync_object_write_with_return_value_op
Kefu Chai [Mon, 2 Mar 2020 11:10:29 +0000 (19:10 +0800)]
Merge pull request #33536 from yuvalif/sync_object_write_with_return_value_op

librados: allow passing flags to operate sync APIs

Reviewed-By: Casey Bodley <cbodley@redhat.com>
Reviewed-by: Sage Weil <sage@redhat.com>
5 years agoMerge pull request #33583 from rzarzynski/wip-recovery-cleanup1
Kefu Chai [Mon, 2 Mar 2020 11:01:29 +0000 (19:01 +0800)]
Merge pull request #33583 from rzarzynski/wip-recovery-cleanup1

osd: tiny clean-ups around the backfill

Reviewed-by: David Zafman <dzafman@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
5 years agoMerge pull request #33593 from dzafman/wip-cot-fix
Kefu Chai [Mon, 2 Mar 2020 10:58:19 +0000 (18:58 +0800)]
Merge pull request #33593 from dzafman/wip-cot-fix

test: Fix failing ceph_objectstore_tool.py test

Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
5 years agoMerge pull request #33373 from rzarzynski/wip-bl-bumpup_api_verion
Kefu Chai [Mon, 2 Mar 2020 10:25:21 +0000 (18:25 +0800)]
Merge pull request #33373 from rzarzynski/wip-bl-bumpup_api_verion

common, include: bump the version of ceph::buffer's C++ API.

Reviewed-by: Kefu Chai <kchai@redhat.com>
5 years agoMerge pull request #33572 from p-se/wip-pse-grafana-unit-issue
Lenz Grimmer [Mon, 2 Mar 2020 10:24:30 +0000 (10:24 +0000)]
Merge pull request #33572 from p-se/wip-pse-grafana-unit-issue

mgr/dashboard: fix stray requests/error in Grafana unit test

Reviewed-by: Kiefer Chang <kiefer.chang@suse.com>
Reviewed-by: Tatjana Dehler <tdehler@suse.com>
5 years agoMerge pull request #31041 from liu-chunmei/ceph_seastar_alien_blue_store
Kefu Chai [Mon, 2 Mar 2020 10:15:24 +0000 (18:15 +0800)]
Merge pull request #31041 from liu-chunmei/ceph_seastar_alien_blue_store

crimson:: add alien blue store

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
5 years agoMerge pull request #33595 from cyx1231st/wip-crimson-osd-fixes
Kefu Chai [Mon, 2 Mar 2020 09:36:10 +0000 (17:36 +0800)]
Merge pull request #33595 from cyx1231st/wip-crimson-osd-fixes

crimson/osd: misc fixes for OSD reboot-ability

Reviewed-by: Kefu Chai <kchai@redhat.com>
5 years agoMerge pull request #33597 from optimistyzy/228_improve_sync_read
Kefu Chai [Mon, 2 Mar 2020 09:30:34 +0000 (17:30 +0800)]
Merge pull request #33597 from optimistyzy/228_improve_sync_read

NVMEDevice: Remove the unnecessary aio_wait in sync read

Reviewed-by: Kefu Chai <kchai@redhat.com>
5 years agoinclude/cpp-btree: use the same type when allocate/deallocate 33638/head
Kefu Chai [Sat, 29 Feb 2020 06:51:28 +0000 (14:51 +0800)]
include/cpp-btree: use the same type when allocate/deallocate

btree_set<> by default uses `std::allocator<Key>`, and btree_map by
default uses `std::allocator<std::pair<Key, Value>>`.

before this change, btree used the allocator directly to allocate "n"
elements, where an element is `Key` or `std::pair<Key, Value>`
respectively, although "n" is actually the number of bytes used by the
node being allocated.
what we need to allocate, however, is a "node_type" that holds
multiple slots, each of which holds an element. in addition to the
slots, a node also keeps track of metadata for the btree itself. in short,
what we allocated before this change was (in bytes):

  alignof(sizeof(node_type)) * sizeof(element)

but what we should allocate is (in bytes):

  alignof(sizeof(node_type))

in this change:

* always rebind the allocator to the correct aligned type with given
  alignment
* extract the allocator related helpers into a template class
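
To make the size mismatch concrete, here is a rough arithmetic sketch in Python (not the C++ from the change). It assumes that allocating "n" objects through an allocator bound to the element type reserves n * sizeof(element) bytes, and that the rebound allocator hands out whole units of the alignment size. The function names and the numbers in the usage note are hypothetical.

```python
def bytes_before(node_bytes: int, element_size: int) -> int:
    """Bytes reserved when the node size in bytes is passed as an element count."""
    # An allocator bound to the element type multiplies the count by sizeof(element).
    return node_bytes * element_size


def bytes_after(node_bytes: int, alignment: int) -> int:
    """Bytes reserved when allocating whole units of an aligned type (assumed scheme)."""
    # Round the node size up to the next multiple of the alignment.
    units = -(-node_bytes // alignment)
    return units * alignment
```

For a hypothetical 256-byte node holding 16-byte elements, `bytes_before(256, 16)` is 4096 while `bytes_after(256, 8)` is 256, so the old path would over-allocate by a factor of sizeof(element).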

Signed-off-by: Kefu Chai <kchai@redhat.com>
5 years agoMerge pull request #33650 from joscollin/wip-unused-nvme
Kefu Chai [Mon, 2 Mar 2020 05:26:00 +0000 (13:26 +0800)]
Merge pull request #33650 from joscollin/wip-unused-nvme

test: silence warning unused variable nvme

Reviewed-by: Kefu Chai <kchai@redhat.com>
5 years agotest: silence warning unused variable nvme 33650/head
Jos Collin [Mon, 2 Mar 2020 03:34:39 +0000 (09:04 +0530)]
test: silence warning unused variable nvme

Signed-off-by: Jos Collin <jcollin@redhat.com>
5 years agoMerge PR #33512 into master
Sage Weil [Mon, 2 Mar 2020 01:53:35 +0000 (19:53 -0600)]
Merge PR #33512 into master

* refs/pull/33512/head:
os/bluestore/BlueFS: less verbose about alloc adjustments

Reviewed-by: Igor Fedotov <ifedotov@suse.com>
5 years agoMerge PR #33575 into master
Sage Weil [Mon, 2 Mar 2020 01:53:25 +0000 (19:53 -0600)]
Merge PR #33575 into master

* refs/pull/33575/head:
mgr/telemetry: fix and document proxy usage

Reviewed-by: Lenz Grimmer <lgrimmer@suse.com>
Reviewed-by: Sage Weil <sage@redhat.com>
Reviewed-by: Yaarit Hatuka <yaarithatuka@gmail.com>
5 years agoMerge PR #33644 into master
Sage Weil [Sun, 1 Mar 2020 23:15:29 +0000 (17:15 -0600)]
Merge PR #33644 into master

* refs/pull/33644/head:
qa/suites/rados/verity/tasks/mon_recovery: whitelist SLOW_OPS

Reviewed-by: Kefu Chai <kchai@redhat.com>
5 years agoMerge PR #33541 into master
Sage Weil [Sun, 1 Mar 2020 22:55:06 +0000 (16:55 -0600)]
Merge PR #33541 into master

* refs/pull/33541/head:
mgr/test_orchestrator: add daemon_action
qa/tasksmgr/test_orchestrator_cli: fix test_service_action
qa/tasks/test_orchestrator_cli: disable test_osd_create exception check
mgr/test_orchestrator: update_foo -> apply_foo
mgr/test_orchestrator: add force flag to remove_daemons
qa/tasks/mgr/test_orchestrator_cli: update
qa/tasks/mgr/test_orchestrator_cli: fix test_load_data
mgr/test_orchestrator: fix list_daemons

Reviewed-by: Joshua Schmid <jschmid@suse.de>
Reviewed-by: Sebastian Wagner <swagner@suse.com>
5 years agoMerge PR #33627 into master
Sage Weil [Sun, 1 Mar 2020 20:26:09 +0000 (14:26 -0600)]
Merge PR #33627 into master

* refs/pull/33627/head:
doc/cephadm/administration: clarify log gathering

Reviewed-by: Sage Weil <sage@redhat.com>
5 years agomgr/orch: ServiceSpec: drop 'count' 33523/head
Sage Weil [Sun, 1 Mar 2020 17:30:46 +0000 (11:30 -0600)]
mgr/orch: ServiceSpec: drop 'count'

Signed-off-by: Sage Weil <sage@redhat.com>
5 years agomgr/rook: use spec.placement.count (instead of spec.count)
Sage Weil [Sun, 1 Mar 2020 17:30:38 +0000 (11:30 -0600)]
mgr/rook: use spec.placement.count (instead of spec.count)

Signed-off-by: Sage Weil <sage@redhat.com>
5 years agomgr/cephadm: make HostAssignment make sense
Sage Weil [Tue, 25 Feb 2020 00:29:12 +0000 (18:29 -0600)]
mgr/cephadm: make HostAssignment make sense

- if hosts are passed, use those.
- if all_hosts=true, use all hosts.

Otherwise, build a set of hosts based on the labels--either explicit or
implied.  If there's no label, use all hosts.

If there is a count, use a subset of candidate hosts.  If there was no
label and there is no count, fail.

If count and hosts are both provided, then we either (1) use the hosts
as the candidate list and place among them (if len(hosts) >= count), or
(2) ensure that any result includes *at least* the provided hosts.

Add tests that (I'm pretty sure) cover cases!
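
The rules above can be sketched roughly as follows. This is an illustrative Python function, not the HostAssignment class from the change: the name `assign_hosts`, its parameters, and the choice to take the first candidates (rather than whatever subset the real code picks) are all assumptions.

```python
from typing import Dict, Iterable, List, Optional, Set


def assign_hosts(
    inventory: List[str],
    host_labels: Dict[str, Set[str]],
    hosts: Optional[Iterable[str]] = None,
    all_hosts: bool = False,
    label: Optional[str] = None,
    count: Optional[int] = None,
) -> List[str]:
    explicit = list(hosts or [])
    # Explicit hosts with no count: use exactly those hosts.
    if explicit and count is None:
        return explicit
    # all_hosts is not combined with anything else.
    if all_hosts:
        return list(inventory)
    if explicit:
        # Both hosts and count were given.
        if len(explicit) >= count:
            # (1) Place among the provided hosts.
            return explicit[:count]
        # (2) Keep every provided host and top up from the rest.
        extra = [h for h in inventory if h not in explicit]
        return explicit + extra[: count - len(explicit)]
    # No label and no count: nothing to go on, so fail.
    if label is None and count is None:
        raise ValueError("placement needs hosts, all_hosts, a label, or a count")
    # Candidates come from the label; with no label, every host is a candidate.
    candidates = [
        h for h in inventory if label is None or label in host_labels.get(h, set())
    ]
    # A count selects a subset (here simply the first ones, for determinism).
    return candidates if count is None else candidates[:count]
```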

Signed-off-by: Sage Weil <sage@redhat.com>
5 years agomgr/orch: PlacementSpec: do not combine all_hosts with anything else
Sage Weil [Fri, 28 Feb 2020 16:17:50 +0000 (10:17 -0600)]
mgr/orch: PlacementSpec: do not combine all_hosts with anything else

Signed-off-by: Sage Weil <sage@redhat.com>
5 years agomgr/orch: use PlacementSpec.from_strings() for all CLI commands
Sage Weil [Thu, 27 Feb 2020 19:32:24 +0000 (13:32 -0600)]
mgr/orch: use PlacementSpec.from_strings() for all CLI commands

Signed-off-by: Sage Weil <sage@redhat.com>
5 years agoMerge PR #33553 into master
Sage Weil [Sun, 1 Mar 2020 20:18:39 +0000 (14:18 -0600)]
Merge PR #33553 into master

* refs/pull/33553/head:
mgr/cephadm: orch ls: include specs with no daemons
mgr/cephadm: orch ls: show spec size
mgr/orch: remove unused fields in RGWSpec
mgr/orch: fix ServiceSpec fields
mgr/cephadm: simplify spec apply
pybind/mgr/mgr_module: revert PersistentStoreDict seperator
mgr/cephadm: apply services after refreshing inventory
mgr/cephadm: catch and log exceptions from apply
mgr/orch: no extra whitespace in stored json specs
mgr/cephadm: drop daemon_type arg to _apply_service
mgr/cephadm: use _apply() helper for all apply_ methods
mgr/cephadm: replace PersistentStoreDict with SpecStore
mgr/cephadm: do not remove service spec when removing a daemon
mgr/cephadm: rename completion variables&cleanup
mgr/cephadm: leverage service specs

Reviewed-by: Sage Weil <sage@redhat.com>
5 years agomgr/test_orchestrator: add daemon_action 33541/head
Sage Weil [Sun, 1 Mar 2020 20:18:16 +0000 (14:18 -0600)]
mgr/test_orchestrator: add daemon_action

Signed-off-by: Sage Weil <sage@redhat.com>
5 years agoqa/tasksmgr/test_orchestrator_cli: fix test_service_action
Sage Weil [Sun, 1 Mar 2020 17:52:36 +0000 (11:52 -0600)]
qa/tasksmgr/test_orchestrator_cli: fix test_service_action

Signed-off-by: Sage Weil <sage@redhat.com>
5 years agovstart.sh: fix install of cephadm ssh keys from ~/.ssh 33647/head
Sage Weil [Sun, 1 Mar 2020 18:06:19 +0000 (12:06 -0600)]
vstart.sh: fix install of cephadm ssh keys from ~/.ssh

Signed-off-by: Sage Weil <sage@redhat.com>
5 years agoqa/tasks/test_orchestrator_cli: disable test_osd_create exception check
Sage Weil [Sun, 1 Mar 2020 16:48:42 +0000 (10:48 -0600)]
qa/tasks/test_orchestrator_cli: disable test_osd_create exception check

I'm not sure what behavior we want here...

Signed-off-by: Sage Weil <sage@redhat.com>
5 years agomgr/cephadm: orch ls: include specs with no daemons 33553/head
Sage Weil [Sun, 1 Mar 2020 14:27:11 +0000 (08:27 -0600)]
mgr/cephadm: orch ls: include specs with no daemons

This way you see entries like

NAME           RUNNING  REFRESHED  IMAGE NAME                                      IMAGE ID        SPEC
mds.foo            0/1  -          <unknown>                                       <unknown>     present

before any daemons have been created.

Signed-off-by: Sage Weil <sage@redhat.com>
5 years agomgr/cephadm: orch ls: show spec size
Sage Weil [Sun, 1 Mar 2020 14:23:40 +0000 (08:23 -0600)]
mgr/cephadm: orch ls: show spec size

Signed-off-by: Sage Weil <sage@redhat.com>
5 years agomgr/orch: remove unused fields in RGWSpec
Sage Weil [Sun, 1 Mar 2020 15:05:50 +0000 (09:05 -0600)]
mgr/orch: remove unused fields in RGWSpec

Signed-off-by: Sage Weil <sage@redhat.com>
5 years agomgr/orch: fix ServiceSpec fields
Sage Weil [Sun, 1 Mar 2020 13:48:17 +0000 (07:48 -0600)]
mgr/orch: fix ServiceSpec fields

- service_type is required.  Make it the first position arg to the ctor.
- service_id is the id *only* and optional.
- service_name() is the full service name (no change)

The old 'name' was previously used as the id only, so it was poorly named,
and optional, but in this series was changed to include the type, breaking
naming for a bunch of things (e.g., daemons called mds.mds.fsname.xyz).
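
A minimal sketch of the field layout described above, assuming a plain Python class. `ServiceSpecSketch` is a hypothetical stand-in, not the real ServiceSpec, and the "type.id" join is inferred from the daemon-name example in this message.

```python
from typing import Optional


class ServiceSpecSketch:
    """Hypothetical stand-in illustrating service_type / service_id / service_name()."""

    def __init__(self, service_type: str, service_id: Optional[str] = None):
        # service_type is required and comes first; service_id is the id only.
        self.service_type = service_type
        self.service_id = service_id

    def service_name(self) -> str:
        # Full service name: the type, plus ".<id>" when an id is set.
        if self.service_id:
            return f"{self.service_type}.{self.service_id}"
        return self.service_type
```

With this split, `ServiceSpecSketch("mds", "fsname").service_name()` gives "mds.fsname", so a daemon name built from it avoids the doubled "mds.mds." prefix.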

Signed-off-by: Sage Weil <sage@redhat.com>
5 years agomgr/cephadm: simplify spec apply
Sage Weil [Sun, 1 Mar 2020 03:09:57 +0000 (21:09 -0600)]
mgr/cephadm: simplify spec apply

- Teach _apply_service how to pick the create (and config) functions, so
  that we don't need any weird wrappers in the callers.
- Replace trigger_deploy() and _apply_services() with a simpler
  _apply_all_services()
- Drop all of the per-type _apply_foo() methods.

Signed-off-by: Sage Weil <sage@redhat.com>
5 years agopybind/mgr/mgr_module: revert PersistentStoreDict seperator
Sage Weil [Sun, 1 Mar 2020 02:47:38 +0000 (20:47 -0600)]
pybind/mgr/mgr_module: revert PersistentStoreDict seperator

This fixes the self-test.

Signed-off-by: Sage Weil <sage@redhat.com>
5 years agomgr/cephadm: apply services after refreshing inventory
Sage Weil [Sat, 29 Feb 2020 21:59:25 +0000 (15:59 -0600)]
mgr/cephadm: apply services after refreshing inventory

Ensure we have an inventory to apply against.

Signed-off-by: Sage Weil <sage@redhat.com>
5 years agomgr/cephadm: catch and log exceptions from apply
Sage Weil [Sat, 29 Feb 2020 21:55:40 +0000 (15:55 -0600)]
mgr/cephadm: catch and log exceptions from apply

Signed-off-by: Sage Weil <sage@redhat.com>
5 years agomgr/orch: no extra whitespace in stored json specs
Sage Weil [Fri, 28 Feb 2020 23:05:26 +0000 (17:05 -0600)]
mgr/orch: no extra whitespace in stored json specs

Signed-off-by: Sage Weil <sage@redhat.com>
5 years agomgr/cephadm: drop daemon_type arg to _apply_service
Sage Weil [Fri, 28 Feb 2020 23:05:05 +0000 (17:05 -0600)]
mgr/cephadm: drop daemon_type arg to _apply_service

This is part of ServiceSpec now.

Signed-off-by: Sage Weil <sage@redhat.com>
5 years agomgr/cephadm: use _apply() helper for all apply_ methods
Sage Weil [Fri, 28 Feb 2020 22:55:31 +0000 (16:55 -0600)]
mgr/cephadm: use _apply() helper for all apply_ methods

Clean up a few that were missed (due to racing PRs, presumably).

'creation' -> 'update', since an apply doesn't necessarily create (it may
destroy!).

Signed-off-by: Sage Weil <sage@redhat.com>
5 years agomgr/cephadm: replace PersistentStoreDict with SpecStore
Sage Weil [Fri, 28 Feb 2020 22:38:26 +0000 (16:38 -0600)]
mgr/cephadm: replace PersistentStoreDict with SpecStore

Explicit implementation of the dict of specs.

Signed-off-by: Sage Weil <sage@redhat.com>
5 years agomgr/cephadm: do not remove service spec when removing a daemon
Sage Weil [Fri, 28 Feb 2020 21:32:04 +0000 (15:32 -0600)]
mgr/cephadm: do not remove service spec when removing a daemon

This makes our behavior similar to kube: if you kill a pod, the operator
or controller will come along and create a new one (probably somewhere
else).

Signed-off-by: Sage Weil <sage@redhat.com>
5 years agomgr/cephadm: rename completion variables&cleanup
Joshua Schmid [Thu, 27 Feb 2020 12:30:56 +0000 (13:30 +0100)]
mgr/cephadm: rename completion variables&cleanup

Signed-off-by: Joshua Schmid <jschmid@suse.de>
5 years agomgr/cephadm: leverage service specs
Joshua Schmid [Wed, 26 Feb 2020 13:26:42 +0000 (14:26 +0100)]
mgr/cephadm: leverage service specs

Fixes: https://tracker.ceph.com/issues/44205
This does a couple of things:

* Change the way apply_$service() works:

Instead of triggering the deployment mechanism directly, it transforms
the ServiceSpec it was passed into a JSON representation and saves it
in a persistent mon_store section.

`mgr/cephadm/service_spec/$service|daemon_type/service_name`

These locations will be periodically checked in the serve() thread.
This works since all the apply_$service_type functions are idempotent.

* Allow saving a config-like specification in the mon_store.

`ceph orch apply -i <service_spec_file.yaml>`

will read the specified services and save them in the mon_store
section mentioned above. The serve() mechanism described above
also handles their deployment.
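
The store-then-reconcile flow can be sketched roughly like this. It is a simplified illustration, not the cephadm code: the dict standing in for the mon store, the flattened key prefix, the function names, and the use of a `placement.count` field to decide how many daemons to run are all assumptions.

```python
import json
from typing import Dict, List, Tuple

# Hypothetical, simplified key prefix modeled on the path in the message.
SPEC_PREFIX = "mgr/cephadm/service_spec/"


def apply_spec(store: Dict[str, str], service_name: str, spec: dict) -> None:
    """Persist the spec as JSON instead of deploying anything directly."""
    store[SPEC_PREFIX + service_name] = json.dumps(spec, sort_keys=True)


def serve_once(
    store: Dict[str, str], running: Dict[str, int]
) -> List[Tuple[str, int]]:
    """One pass of a serve()-style loop: reconcile daemons against stored specs.

    Returns the (service_name, delta) adjustments made in this pass.
    """
    changes = []
    for key in sorted(store):
        if not key.startswith(SPEC_PREFIX):
            continue
        name = key[len(SPEC_PREFIX):]
        wanted = json.loads(store[key])["placement"]["count"]
        delta = wanted - running.get(name, 0)
        if delta:
            # Bring the running count in line with the stored spec.
            running[name] = wanted
            changes.append((name, delta))
    return changes
```

Because each pass only acts on the difference between the stored spec and the running state, calling `serve_once` repeatedly with nothing changed makes no further adjustments, which is what makes periodic checking safe.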

Signed-off-by: Joshua Schmid <jschmid@suse.de>
5 years agoqa/suites/rados/singleton-bluestore/cephtool: whitelist MON_DOWN 33645/head
Sage Weil [Sun, 1 Mar 2020 14:03:21 +0000 (08:03 -0600)]
qa/suites/rados/singleton-bluestore/cephtool: whitelist MON_DOWN

cephtool/test.sh now includes a test that disallows mon from the quorum
for a short period.

Signed-off-by: Sage Weil <sage@redhat.com>
5 years agoqa/suites/rados/verity/tasks/mon_recovery: whitelist SLOW_OPS 33644/head
Sage Weil [Sun, 1 Mar 2020 13:58:11 +0000 (07:58 -0600)]
qa/suites/rados/verity/tasks/mon_recovery: whitelist SLOW_OPS

The mon can see slow ops when thrashing.

Signed-off-by: Sage Weil <sage@redhat.com>
5 years agomgr/test_orchestrator: update_foo -> apply_foo
Sage Weil [Sun, 1 Mar 2020 13:23:54 +0000 (07:23 -0600)]
mgr/test_orchestrator: update_foo -> apply_foo

Signed-off-by: Sage Weil <sage@redhat.com>
5 years agoMerge PR #33639 into master
Sage Weil [Sun, 1 Mar 2020 13:22:45 +0000 (07:22 -0600)]
Merge PR #33639 into master

* refs/pull/33639/head:
pybind/mgr/mgr_module: fix standby module logging options

Reviewed-by: Kefu Chai <kchai@redhat.com>
5 years agomsg: add get_{pid,random}_nonce() helpers 33633/head
Sage Weil [Fri, 28 Feb 2020 21:11:37 +0000 (15:11 -0600)]
msg: add get_{pid,random}_nonce() helpers

In cases where we normally use a pid for a nonce, fall back to a random
value when the pid == 1 (i.e., we're in a container).  For the cases where
we use a random value, use the helper.
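
As a rough illustration of the fallback (the real helpers live in the C++ messenger code), a Python sketch might look like this. The 64-bit width of the random value and the optional `pid` parameter are assumptions made for the example.

```python
import os
import secrets
from typing import Optional


def get_random_nonce() -> int:
    """Return a random 64-bit nonce (the width is an assumption)."""
    return secrets.randbits(64)


def get_pid_nonce(pid: Optional[int] = None) -> int:
    """Use the pid as the nonce unless it is 1, as it is inside a container."""
    pid = os.getpid() if pid is None else pid
    if pid == 1:
        # Every containerized daemon sees pid 1, so the pid is not unique.
        return get_random_nonce()
    return pid
```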

Signed-off-by: Sage Weil <sage@redhat.com>
5 years agomsg/Policy: make stateless_server default to anon (again)
Sage Weil [Fri, 28 Feb 2020 20:52:02 +0000 (14:52 -0600)]
msg/Policy: make stateless_server default to anon (again)

Midway through the octopus cycle, we made stateless server more stateless
in the sense that it would not register incoming client connections.  And,
in so doing, it would not enforce that client connections came from
unique addresses, by closing an existing connection from the same addr
when a new connection was accepted.

This turned out to cause out of order OSD ops because the OSD needed that
behavior.  See https://tracker.ceph.com/issues/42328.  We fixed that by
reverting to the old behavior for all but monitor connections, where we
needed it, in 507d213cc453ed86ab38619590f710f33245c652.

This, in turn, breaks most OSD <-> OSD communication (and probably lots
of other things) with cephadm, because we make entity_addr_t unique with
a nonce that is populated by getpid()... and the containerized daemons
all have pid 1.  Once we finally merged the follow-on fixes for the
change above, cephadm OSDs could no longer ping each other.

In my view, the 'anon' connection handling is a good idea in the general
case.  So, let's adjust our fix for #42328 so that it is only the OSD
client-side interface that registers client connections and makes them
unique.

Fixes: https://tracker.ceph.com/issues/44358
Signed-off-by: Sage Weil <sage@redhat.com>
5 years agoosd: drop broken 'poison pill'
Sage Weil [Fri, 28 Feb 2020 20:46:22 +0000 (14:46 -0600)]
osd: drop broken 'poison pill'

This was effectively a no-op, since the default policy was *also*
stateless_server.

This line originates from v0.24 (2010) when we added the cluster msgr.

Signed-off-by: Sage Weil <sage@redhat.com>
5 years agotest/run-rbd-tests: properly initialize newly created rbd pool 33642/head
Mykola Golub [Sun, 1 Mar 2020 08:47:21 +0000 (08:47 +0000)]
test/run-rbd-tests: properly initialize newly created rbd pool

The scheduler tests from cli_generic expect the 'rbd' pool to be marked
as an rbd application pool.

Signed-off-by: Mykola Golub <mgolub@suse.com>
5 years agoMerge pull request #31109 from liupengs/wip-msg-async-fix-event-center-block
Kefu Chai [Sun, 1 Mar 2020 07:47:19 +0000 (15:47 +0800)]
Merge pull request #31109 from liupengs/wip-msg-async-fix-event-center-block

msg/async/rdma: unblock event center if the peer is down when connecting

Reviewed-by: Kefu Chai <kchai@redhat.com>
5 years agomsg/async/rdma: move C_handle_connection to RDMAConnectionSocketImpl.cc 31109/head
liupengs [Sun, 1 Mar 2020 05:04:37 +0000 (13:04 +0800)]
msg/async/rdma: move C_handle_connection to RDMAConnectionSocketImpl.cc

Signed-off-by: Peng Liu <liupeng37@baidu.com>
5 years agoMerge pull request #33591 from badone/wip-install-deps-set-gpgcheck-for-reals
Kefu Chai [Sun, 1 Mar 2020 05:43:26 +0000 (13:43 +0800)]
Merge pull request #33591 from badone/wip-install-deps-set-gpgcheck-for-reals

install-deps.sh: Actually set gpgcheck to false

Reviewed-by: Dan Mick <dan.mick@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
5 years agomsg/async/rdma: fix bug event center is blocked by rdma construct connection for...
liupengs [Sun, 17 Nov 2019 15:03:07 +0000 (23:03 +0800)]
msg/async/rdma: fix bug event center is blocked by rdma construct connection for transport ib sync msg

We construct a TCP connection to transport the ib sync msg. If the
remote node is shut down (e.g., by accident), net.connect blocks until
the timeout is reached, which in turn blocks the event center.

This bug may cause mon probe timeouts, OSDs failing to reply, and so on.

Signed-off-by: Peng Liu <liupeng37@baidu.com>
5 years agomgr/test_orchestrator: add force flag to remove_daemons
Sage Weil [Sun, 1 Mar 2020 01:29:38 +0000 (19:29 -0600)]
mgr/test_orchestrator: add force flag to remove_daemons

Signed-off-by: Sage Weil <sage@redhat.com>
5 years agoqa/tasks/mgr/test_orchestrator_cli: update
Sage Weil [Sat, 29 Feb 2020 23:37:18 +0000 (17:37 -0600)]
qa/tasks/mgr/test_orchestrator_cli: update

Most of these were broken due to CLI changes weeks ago.

Signed-off-by: Sage Weil <sage@redhat.com>
5 years agoMerge PR #33634 into master
Sage Weil [Sat, 29 Feb 2020 22:13:07 +0000 (16:13 -0600)]
Merge PR #33634 into master

* refs/pull/33634/head:
qa/workunits/cephadm/test_cephadm.sh: dump logs on exit
qa/workunits/cephadm/test_cephadm.sh: add `cleanup` function

Reviewed-by: Sage Weil <sage@redhat.com>
5 years agopybind/mgr/mgr_module: fix standby module logging options 33639/head
Sage Weil [Sat, 29 Feb 2020 14:58:59 +0000 (08:58 -0600)]
pybind/mgr/mgr_module: fix standby module logging options

We need to define the module options and their defaults so that
_configure_logging can succeed.

Broken by 8ec3b3d3cc7d4c2d2678f460de27ecc8cb07d8fe

Signed-off-by: Sage Weil <sage@redhat.com>
5 years agoMerge PR #33438 into master
Sage Weil [Sat, 29 Feb 2020 03:32:47 +0000 (21:32 -0600)]
Merge PR #33438 into master

* refs/pull/33438/head:
cephadm: add prometheus adopt

Reviewed-by: Sage Weil <sage@redhat.com>
5 years agoMerge PR #33433 into master
Sage Weil [Sat, 29 Feb 2020 03:32:36 +0000 (21:32 -0600)]
Merge PR #33433 into master

* refs/pull/33433/head:
cephadm: also return JSON decode error.

Reviewed-by: Sage Weil <sage@redhat.com>
5 years agoMerge pull request #33596 from badone/wip-serve-doc-python3
Kefu Chai [Sat, 29 Feb 2020 01:58:01 +0000 (09:58 +0800)]
Merge pull request #33596 from badone/wip-serve-doc-python3

admin/serve-doc: Switch to python3 only

Reviewed-by: Kefu Chai <kchai@redhat.com>
5 years agoadmin/serve-doc: Switch to python3 only 33596/head
Brad Hubbard [Fri, 28 Feb 2020 05:25:17 +0000 (15:25 +1000)]
admin/serve-doc: Switch to python3 only

Signed-off-by: Brad Hubbard <bhubbard@redhat.com>
5 years agoqa/workunits/cephadm/test_cephadm.sh: dump logs on exit 33634/head
Michael Fritch [Fri, 28 Feb 2020 21:46:59 +0000 (14:46 -0700)]
qa/workunits/cephadm/test_cephadm.sh: dump logs on exit

dumps the last few lines from each of the surviving daemon logs

Signed-off-by: Michael Fritch <mfritch@suse.com>
5 years agoqa/workunits/cephadm/test_cephadm.sh: add `cleanup` function
Michael Fritch [Fri, 28 Feb 2020 21:45:42 +0000 (14:45 -0700)]
qa/workunits/cephadm/test_cephadm.sh: add `cleanup` function

moves logic for clean-up during `trap EXIT` into a function

Signed-off-by: Michael Fritch <mfritch@suse.com>
5 years agoMerge pull request #33624 from trociny/wip-dateutil-dep
Jason Dillaman [Fri, 28 Feb 2020 20:40:01 +0000 (15:40 -0500)]
Merge pull request #33624 from trociny/wip-dateutil-dep

rpm,deb: fix python dateutil module dependency

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
Reviewed-by: Nathan Cutler <ncutler@suse.com>
5 years agoqa/rgw: verify suite selects a random bucket sharding configuration 32660/head
Casey Bodley [Fri, 28 Feb 2020 19:38:13 +0000 (14:38 -0500)]
qa/rgw: verify suite selects a random bucket sharding configuration

Signed-off-by: Casey Bodley <cbodley@redhat.com>
5 years agoqa/rgw: add different bucket sharding overrides
Casey Bodley [Fri, 28 Feb 2020 19:36:59 +0000 (14:36 -0500)]
qa/rgw: add different bucket sharding overrides

Signed-off-by: Casey Bodley <cbodley@redhat.com>
5 years agorgw: move ShardTracker::next_candidate() into lamdba 33628/head
Casey Bodley [Fri, 28 Feb 2020 18:58:53 +0000 (13:58 -0500)]
rgw: move ShardTracker::next_candidate() into lamdba

Signed-off-by: Casey Bodley <cbodley@redhat.com>
5 years agorgw: bucket_list_ordered loops until it gets a unique candidate
Casey Bodley [Fri, 28 Feb 2020 17:49:58 +0000 (12:49 -0500)]
rgw: bucket_list_ordered loops until it gets a unique candidate

when we detect a duplicate common prefix, we need to loop until we get
the next unique candidate. we must add a new candidate for each shard,
or we won't visit it again and would miss later entries

Fixes: https://tracker.ceph.com/issues/44353
Signed-off-by: Casey Bodley <cbodley@redhat.com>
5 years agorgw: bucket_list_ordered advances past duplicate common prefixes
Casey Bodley [Fri, 28 Feb 2020 16:06:32 +0000 (11:06 -0500)]
rgw: bucket_list_ordered advances past duplicate common prefixes

we may see the same common prefix from more than one shard. when we
detect a duplicate, we need to advance past it. otherwise, we may make
the wrong decision about is_truncated because the shards with
duplicates won't be at_end()
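
The effect of skipping duplicate common prefixes on is_truncated can be sketched as below. This is a simplified Python model, not the RGW implementation: it assumes each shard is a sorted list of keys, a single-character delimiter, and that prefixes are collapsed while merging. The function name and signature are hypothetical.

```python
import heapq
from typing import List, Tuple


def list_ordered(
    shards: List[List[str]], delimiter: str, max_entries: int
) -> Tuple[List[str], bool]:
    """Merge sorted shard listings; return (entries, is_truncated)."""

    def collapse(key: str) -> str:
        # Keys containing the delimiter collapse into a common prefix.
        idx = key.find(delimiter)
        return key if idx < 0 else key[: idx + len(delimiter)]

    iters = [iter(s) for s in shards]
    heap: List[Tuple[str, int]] = []

    def push_next(i: int) -> None:
        # Add the next candidate from shard i, if it has one left.
        key = next(iters[i], None)
        if key is not None:
            heapq.heappush(heap, (collapse(key), i))

    for i in range(len(shards)):
        push_next(i)

    results: List[str] = []
    while heap:
        name, i = heapq.heappop(heap)
        # Always refill from the same shard, or its later entries are missed.
        push_next(i)
        if results and name == results[-1]:
            # Duplicate common prefix from another shard: advance past it.
            continue
        if len(results) == max_entries:
            # A genuinely new entry remains, so the listing is truncated.
            return results, True
        results.append(name)
    return results, False
```

For example, `list_ordered([["a/1"], ["a/2"]], "/", 1)` returns `(["a/"], False)`: the second shard only holds a duplicate of the prefix already listed, so the result is not truncated even though that shard had an unread entry when the limit was reached.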

Fixes: https://tracker.ceph.com/issues/44353
Signed-off-by: Casey Bodley <cbodley@redhat.com>
5 years agoNVMEDevice: Remove the unnecessary aio_wait in sync read 33597/head
Ziye Yang [Fri, 28 Feb 2020 17:12:57 +0000 (01:12 +0800)]
NVMEDevice: Remove the unnecessary aio_wait in sync read

Using aio_wait is unnecessary, since all async read submission and
completion happen in the same thread.

Signed-off-by: Ziye Yang <ziye.yang@intel.com>
5 years agodoc/cephadm/administration: clarify log gathering 33627/head
Nathan Cutler [Fri, 28 Feb 2020 17:16:17 +0000 (18:16 +0100)]
doc/cephadm/administration: clarify log gathering

This is an attempt to bring the current state of the documentation more
into line with the current state of the cephadm code.

However, when I try to grab logs from a daemon on a host other than the
one where the daemon is running, I get an empty log...

References: https://tracker.ceph.com/issues/44354
Signed-off-by: Nathan Cutler <ncutler@suse.com>
5 years agorpm,deb: fix python dateutil module dependency 33624/head
Mykola Golub [Fri, 28 Feb 2020 16:24:01 +0000 (16:24 +0000)]
rpm,deb: fix python dateutil module dependency

(needed for mgr/rbd_support)

Signed-off-by: Mykola Golub <mgolub@suse.com>
5 years agoMerge pull request #33063 from theanalyst/boto3-ext-append
Casey Bodley [Fri, 28 Feb 2020 14:31:14 +0000 (09:31 -0500)]
Merge pull request #33063 from theanalyst/boto3-ext-append

examples: rgw: add boto3 append & get usage api extensions

Reviewed-by: Yuval Lifshitz <yuvalif@yahoo.com>
5 years agoMerge pull request #33527 from alimaredia/wip-rgw-realm-pull-log-msg
Casey Bodley [Fri, 28 Feb 2020 14:29:18 +0000 (09:29 -0500)]
Merge pull request #33527 from alimaredia/wip-rgw-realm-pull-log-msg

rgw: increase log level for same or older period pull msg

Reviewed-by: Yuval Lifshitz <yuvalif@yahoo.com>
5 years agoMerge pull request #33600 from ceph/doc_config_set
Sebastian Wagner [Fri, 28 Feb 2020 12:58:49 +0000 (13:58 +0100)]
Merge pull request #33600 from ceph/doc_config_set

doc/cephadm: adjust syntax for config set

Reviewed-by: Sage Weil <sage@redhat.com>
Reviewed-by: Sebastian Wagner <sebastian.wagner@suse.com>