git.apps.os.sepia.ceph.com Git - ceph.git/log

]> git.apps.os.sepia.ceph.com Git - ceph.git/log

projects / ceph.git / log

summary | shortlog | log | commit | commitdiff | tree
first ⋅ prev ⋅ next

commit | commitdiff | tree

Sage Weil [Thu, 15 Apr 2021 22:55:00 +0000 (17:55 -0500)]

qa/tasks/cephadm: tear down clsuter before gathering logs

We dont' always stop all services, because teuthology doesn't know about
things it didn't start. Use rm-cluster to tear things down, but do not
remove the logs themselves. After we get logs, we'll clean up completely.

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit deec9074bb2fc42d29d6fa14c22b6b14b97c352f)

commit | commitdiff | tree

Sage Weil [Thu, 15 Apr 2021 22:45:21 +0000 (17:45 -0500)]

qa/suites/rados/cephadm/smoke-roleless: test rgw-ingress

Test this properly by downing each rgw and haproxy in turn and ensuring
that things remain up.

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 3ff3f697b474c9669bc4f51c472a9cad35e72266)

commit | commitdiff | tree

Sage Weil [Thu, 15 Apr 2021 22:22:26 +0000 (17:22 -0500)]

mgr/cephadm: remove virtual_ip check during scheduling

In 2f33c6ebbc8e2a6c3844a6921c857fb0796a1552 we made the keepalived task
set the necessary sysctls to add a virtual_ip, so we don't need this
check anymore.

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 2382603162ec6785681700134e3c5764bd5aa99f)

commit | commitdiff | tree

Sage Weil [Thu, 15 Apr 2021 21:59:09 +0000 (16:59 -0500)]

mgr/orchestrator: orch ls: leave off virtual_ip prefixlen

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 4498bbe77f58c59583bcb8b9ca1aae33296b329f)

commit | commitdiff | tree

Sage Weil [Tue, 13 Apr 2021 16:52:49 +0000 (12:52 -0400)]

qa/tasks/cephadm: add wait_for_service

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit ced2f7fe4a04ebaa09896376c342b7b866ab5bc7)

commit | commitdiff | tree

Sage Weil [Thu, 15 Apr 2021 19:01:48 +0000 (14:01 -0500)]

qa/tasks/cephadm: allow skip_monitor_stack=true

(Useful for roleless when we want to go faster)

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit bb825157dccd5dee65fc75e63fde856c8bcc12e6)

commit | commitdiff | tree

Sage Weil [Thu, 15 Apr 2021 19:01:18 +0000 (14:01 -0500)]

qa/tasks/cephadm: do subst_vip for cephadm.shell and .apply

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 0d561e2741718498efa319e0c2b5ec3a902c67ca)

commit | commitdiff | tree

Sage Weil [Thu, 15 Apr 2021 19:00:57 +0000 (14:00 -0500)]

qa/tasks/vip: add vip task to allocate virtual IPs

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 2a8ab2d2b87f76dad9b8ecd7f8ce8370f8004b3c)

commit | commitdiff | tree

Sage Weil [Tue, 13 Apr 2021 14:21:41 +0000 (10:21 -0400)]

qa/suites/rados/cephadm/smoke-roleless: add rgw-ingress test case

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 7e16bf3468b53d218ac02a81c01fdbbc002b5f1b)

commit | commitdiff | tree

Sage Weil [Tue, 13 Apr 2021 14:58:09 +0000 (10:58 -0400)]

qa/tasks/cephadm: shell: take 'all-roles' or 'all-hosts'

'all' is ambiguous

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 08039576950697e7b9dd55c6e44068440d2a1553)

commit | commitdiff | tree

Sage Weil [Mon, 12 Apr 2021 20:12:01 +0000 (16:12 -0400)]

qa/tasks/cephadm: let cephadm.shell take string or list

Make it a bit more forgiving.

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 779af3da6fd6405e13a2e522c57fa1d1512595a9)

commit | commitdiff | tree

Adam King [Thu, 8 Apr 2021 19:43:08 +0000 (15:43 -0400)]

doc/cephadm: wrong command for single daemon events

Fixes: https://tracker.ceph.com/issues/50257
Signed-off-by: Adam King <adking@redhat.com>
(cherry picked from commit 31959dd6d361273cb125420338a47a6fcbf3998e)

commit | commitdiff | tree

Adam King [Wed, 24 Mar 2021 16:29:20 +0000 (12:29 -0400)]

mgr/cephadm: place maximum on placement count based on host count

Fixes: https://tracker.ceph.com/issues/49960
Signed-off-by: Adam King <adking@redhat.com>
(cherry picked from commit 73532088915b8e7daf4a45d8b7968cdab55de9d1)

commit | commitdiff | tree

Daniel Pivonka [Thu, 8 Apr 2021 19:20:18 +0000 (15:20 -0400)]

mgr/cephadm: fix nfs-rgw stray daemon

nfs-rgw registers under a gid cephadm needs covert that to its known name during the stray daemon check

Signed-off-by: Daniel Pivonka <dpivonka@redhat.com>
(cherry picked from commit f94e0baf9e1897f803160eff8ba36df57aa433ac)

commit | commitdiff | tree

cypherean [Sun, 21 Mar 2021 22:13:46 +0000 (03:43 +0530)]

mgr/cephadm: skip-ssh flag enables cephadm mgr module

This commit fixes the use of skip-ssh flag. It disables ssh config and enables the cephadm mgr module.

Fixes: http://tracker.ceph.com/issues/49737
Signed-off-by: Shreyaa Sharma <shreyasharma.ss305@gmail.com>
(cherry picked from commit 777f236ad885b03b551dd820f41a00b9c89761eb)

commit | commitdiff | tree

Adam King [Wed, 14 Apr 2021 15:39:10 +0000 (11:39 -0400)]

mgr/cephadm: report exception during upgrade in upgrade status

Fixes: https://tracker.ceph.com/issues/50361
Signed-off-by: Adam King <adking@redhat.com>
(cherry picked from commit 6119294b2871977e0a70b138d48dc5afc8abd45d)

commit | commitdiff | tree

Sage Weil [Tue, 13 Apr 2021 22:42:21 +0000 (17:42 -0500)]

qa/suites/rados/thrash: shorten radosbench

This is the longest of the thrash workloads; reducing it will bring
this test in line with the others (<= 45 min).

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit babbab14f4ed8b000741fb92b76b97459510c689)

commit | commitdiff | tree

Sage Weil [Wed, 14 Apr 2021 16:47:54 +0000 (11:47 -0500)]

mgr/cephadm: remove old haproxy and keepalived templates

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 0536364645ff06f285c3dd698df02898a920f1b9)

commit | commitdiff | tree

Daniel Pivonka [Thu, 1 Apr 2021 17:56:48 +0000 (13:56 -0400)]

mgr/orchestrator: validate lists in spec jsons

Signed-off-by: Daniel Pivonka <dpivonka@redhat.com>
(cherry picked from commit 7844ce0785595c951f5822d2c38d1381dc13c8c1)

commit | commitdiff | tree

Sebastian Wagner [Thu, 11 Feb 2021 11:23:56 +0000 (12:23 +0100)]

python-common: Verify service spec is not None

Fixes: https://tracker.ceph.com/issues/48325
Signed-off-by: Sebastian Wagner <sebastian.wagner@suse.com>
(cherry picked from commit 518edfae7522ada8c74b413cef6e1ae1f08a244b)

commit | commitdiff | tree

Sebastian Wagner [Thu, 11 Feb 2021 10:05:12 +0000 (11:05 +0100)]

python-common: Verify data_devices is not None

Add validation to verify that `data_devices` is not None

Fixes: https://tracker.ceph.com/issues/49191
Signed-off-by: Sebastian Wagner <sebastian.wagner@suse.com>
(cherry picked from commit 55e9ecbc88bf6a33fe185e8b54491b9048d66adb)

commit | commitdiff | tree

Juan Miguel Olmo Martínez [Mon, 15 Mar 2021 13:19:33 +0000 (14:19 +0100)]

mgr/orchestrator: DG loads properly the unmanaged attribute

Fixes: https://tracker.ceph.com/issues/49805
Signed-off-by: Juan Miguel Olmo Martínez <jolmomar@redhat.com>
(cherry picked from commit 0af4ad8614e426adf60eec32bd4b36974c5cb30b)

commit | commitdiff | tree

Daniel Pivonka [Fri, 9 Apr 2021 19:25:21 +0000 (15:25 -0400)]

mgr/orchestractor: rgw realm and zone flags must both be provided

Signed-off-by: Daniel Pivonka <dpivonka@redhat.com>
(cherry picked from commit c0803f8f271ea6b2c653b3a443f7807185303912)

commit | commitdiff | tree

Sage Weil [Tue, 13 Apr 2021 22:20:21 +0000 (18:20 -0400)]

mgr/cephadm: make prometheus scrape ingress haproxy

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 7a30e656b9c719ec0141e19bfc629c0f7ae89c9f)

commit | commitdiff | tree

Sage Weil [Tue, 13 Apr 2021 14:20:48 +0000 (10:20 -0400)]

doc/cephadm: remove big warning about stability

It's the first item on the toctree that follows.

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit d72c61b850a6a1102a58eaa731759afc41d73181)

commit | commitdiff | tree

Sage Weil [Tue, 13 Apr 2021 14:20:27 +0000 (10:20 -0400)]

doc/cepham/compatibility: rgw-ha -> ingress; note possibility of breaking changes

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 53477261194b50b6e5af0573f2aa82486092df8c)

commit | commitdiff | tree

Zac Dover [Wed, 24 Mar 2021 15:47:17 +0000 (01:47 +1000)]

doc/cephadm: rewrite "dry run" section in osd.rst

This rewrites the "dry run" section of the "OSD Service"
chapter of the Cephdam documentation. This commit makes
minor changes that reduce the cognitive load of the
reader.

Signed-off-by: Zac Dover <zac.dover@gmail.com>
(cherry picked from commit e61237f3a1f83b90d4ab22396c6e8291620a60fa)

commit | commitdiff | tree

Zac Dover [Wed, 24 Mar 2021 14:39:01 +0000 (00:39 +1000)]

doc/cephadm: rewrite part of "deploy osds"

This reorganizes the section "Deploy OSDs"
in the "OSD Service" chapter of the Cephadm
Guide. Two new sections, "Listing Storage
Devices" and "Creating New OSDs" gather
information under headings in a sensible way,
making the information more accessible to someone
skimming this Guide.

Signed-off-by: Zac Dover <zac.dover@gmail.com>
(cherry picked from commit 5f1ce2f6e8df185673613df9a31bac2395a46438)

commit | commitdiff | tree

Zac Dover [Sun, 28 Mar 2021 19:23:08 +0000 (05:23 +1000)]

doc/cephadm: rewrite osd.rst "Remove an OSD"

This commit rewrites the entire "Remove an OSD"
section of the "OSD Service" chapter of the
cephadm book.

I got carried away and didn't break this one into
four smaller PRs, and I'm sorry in advance to
whomever ends up reviewing this. I'll break "Advanced
OSD Service Specifications", the next section in the
queue, into multiple sections.

Signed-off-by: Zac Dover <zac.dover@gmail.com>
(cherry picked from commit 577e45c78b7fbb93e9d4cacf213f89f5d6a0abe4)

commit | commitdiff | tree

Zac Dover [Tue, 23 Mar 2021 16:23:46 +0000 (02:23 +1000)]

doc/cephadm: rewrite osd.rst - list devices

This PR rewrites the "List Devices" section of
the OSD chapter of the Cephadm guide. This PR
is a simple grammar-and-elegance improvement.

Signed-off-by: Zac Dover <zac.dover@gmail.com>
(cherry picked from commit 49352a3150b3cf19c5a6a65c270d69e81536990e)

commit | commitdiff | tree

Zac Dover [Mon, 15 Mar 2021 15:03:06 +0000 (01:03 +1000)]

doc/cephadm: break mon section into sections

This PR breaks the "Deploy Additional Monitors" section
of the cephadm documentation into several subsections
whose titles spotlight the matter under discussion in
those respective subsections.

inb4: Another PR is on deck that rewrites the sentences
in this chapter of the cephadm documentation. I'd like
to get this chapter broken up into these subsections before
I rewrite those sentences. So I'm hoping for no grammatical
mission creep on this one. The grammar and clarity updates
are coming.

Signed-off-by: Zac Dover <zac.dover@gmail.com>
(cherry picked from commit 25d9429d66a6edf446fd8bc3b7903b30de2aa31b)

commit | commitdiff | tree

Zac Dover [Mon, 15 Mar 2021 15:03:06 +0000 (01:03 +1000)]

doc/cephadm: rewrite "deploying add. mons"

This rewrites the section "Deploying Additional
Monitors (Beyond the Default Three)" for elegance
and clarity.

Signed-off-by: Zac Dover <zac.dover@gmail.com>
(cherry picked from commit c605750db22e0807b887beae36648131805ede3c)

commit | commitdiff | tree

Jeff Layton [Fri, 29 Jan 2021 19:15:26 +0000 (14:15 -0500)]

doc: fixes for cephadm documentation

Be sure to note that python 3 is a prerequisite. Minimal centos 8
installs don't have it, for instance.

Also, we probably don't want to hardcode an octopus URL into the
suggested curl command. Change it to fill that in with
"|stable-release|", which should always point to the latest released
version name.

Fixes: https://tracker.ceph.com/issues/49806
Signed-off-by: Jeff Layton <jlayton@redhat.com>
(cherry picked from commit bf69cdc68970789a7410928bd8a1af34d0d9b6a2)

commit | commitdiff | tree

Sebastian Wagner [Wed, 3 Mar 2021 13:26:23 +0000 (14:26 +0100)]

doc/cephadm: remove warning about cephadm in production

Signed-off-by: Sebastian Wagner <sebastian.wagner@suse.com>
(cherry picked from commit 230b78f35395c2c8b21cba6d2d1631971ebc752a)

commit | commitdiff | tree

Sebastian Wagner [Wed, 3 Mar 2021 13:00:51 +0000 (14:00 +0100)]

doc/cephadm: Add Compatibility with Podman Versions

Signed-off-by: Sebastian Wagner <sebastian.wagner@suse.com>
(cherry picked from commit f15f0deccb93d0ec79beb4f7ba32c843e6e07e63)

commit | commitdiff | tree

Zac Dover [Tue, 23 Mar 2021 15:19:11 +0000 (01:19 +1000)]

doc/cephadm: rewrite "index.rst"

This PR rewrites the three paragraphs at the
front of the cephadm guide, increasing their
elegance and removing ambiguities.

Signed-off-by: Zac Dover <zac.dover@gmail.com>
(cherry picked from commit dfd205dca7889e325a6ec22892d3a9e058ad89d2)

commit | commitdiff | tree

Daniel Pivonka [Tue, 23 Mar 2021 17:50:33 +0000 (13:50 -0400)]

doc/cephadm: explicitly show host requirments in adding host section

Signed-off-by: Daniel Pivonka <dpivonka@redhat.com>
(cherry picked from commit b28fd9838ec3ad5b47a7b5e14015d986348f31e5)

commit | commitdiff | tree

Sage Weil [Mon, 12 Apr 2021 21:21:33 +0000 (17:21 -0400)]

mgr/cephadm: ingress: add optional virtual_interface_networks

It may be that the virtual IP we want to use is not in the same network
as any existing IPs on the host. In that case, allow the spec to specify
a list of networks to match against existing IPs so that a match will
identify an ethernet interface to use.

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit bbf6a12752092b406abbec1e600533366ac59548)

commit | commitdiff | tree

Sage Weil [Mon, 12 Apr 2021 21:21:22 +0000 (17:21 -0400)]

doc/cephadm/rgw: clean up example spec

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 99b9f032de1d3611058caa748b5256ea2134446f)

commit | commitdiff | tree

Sage Weil [Mon, 12 Apr 2021 19:55:19 +0000 (15:55 -0400)]

mgr/cephadm/services/ingress: less verbose about prepare_create

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit f7977c551db86dc46a1f6ceb6c25497aebd15c16)

commit | commitdiff | tree

Sage Weil [Mon, 12 Apr 2021 19:53:50 +0000 (15:53 -0400)]

doc/cephadm/rgw: add note about which ethernet interface is used

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 702829f7df462e10e1999a47fa587001fc61de1b)

commit | commitdiff | tree

Sage Weil [Mon, 12 Apr 2021 17:50:12 +0000 (13:50 -0400)]

cephadm: make keepalived unit fiddle sysctl settings

No need to make the user adjust these manually.

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 2f33c6ebbc8e2a6c3844a6921c857fb0796a1552)

commit | commitdiff | tree

Sage Weil [Fri, 9 Apr 2021 22:47:52 +0000 (18:47 -0400)]

mgr/orchestrator: report external endpoints from 'orch ls'

Add a PORTS column and report the external/virtual IP (and port(s)) from
'orch ls' output.

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 3f55c708b367da7b8890a00e48eb7c0498ef5d97)

commit | commitdiff | tree

Sage Weil [Sat, 10 Apr 2021 16:53:24 +0000 (12:53 -0400)]

mgr/orchestrator: drop - when no ports

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 60562414e10dfd29f0d42f2047da9bddc64b7d34)

commit | commitdiff | tree

Sage Weil [Fri, 9 Apr 2021 19:10:49 +0000 (15:10 -0400)]

doc/cephadm/rgw: update docs for ingress service

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit ef2d92aab2df45e34d85d3db3c5a7fcb9f96eb4f)

commit | commitdiff | tree

Sage Weil [Fri, 9 Apr 2021 18:43:59 +0000 (14:43 -0400)]

mgr/cephadm: use per_host_daemon feature in scheduler

This only affects ingress, at least for now.

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 6dcd0597bfea115eb7ec59817255bf1217a0b97e)

commit | commitdiff | tree

Guillaume Abrioux [Tue, 30 Mar 2021 12:23:25 +0000 (14:23 +0200)]

cephadm: fix a typo

this adds a space in order to avoid displaying this:

```
"2021-03-29T10:51:32.595782Z service:rgw.default [ERROR] \"Failed while placing rgw.default.ceph-vasi-node5-osd-rgw-iscsi-gw.hpuesfon ceph-vasi-node5-osd-rgw-iscsi-gw
```

instead of:

```
"2021-03-29T10:51:32.595782Z service:rgw.default [ERROR] \"Failed while placing rgw.default.ceph-vasi-node5-osd-rgw-iscsi-gw.hpuesf on ceph-vasi-node5-osd-rgw-iscsi-rgw
```

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
(cherry picked from commit a3cb119a7a5134e7a3e4006381da14ea2a927136)

commit | commitdiff | tree

Sage Weil [Fri, 9 Apr 2021 18:43:45 +0000 (14:43 -0400)]

mgr/cephadm/schedule: add per_host_daemon_type support

This will be used to schedule a per-host keepalived alongside other
services.

Implement this as a final stage for place() that puts one per host and
also takes existing/stray daemons into consideration.

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit db9f1930fee942c345a171508cb7250d6260c94b)

commit | commitdiff | tree

Sage Weil [Fri, 9 Apr 2021 16:34:20 +0000 (12:34 -0400)]

mgr/cephadm: HA_RGW -> Ingress

This is mostly a rename, with some simplification and cleanup.

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 0894773e95bd0d10ba85768ba8b9116fcd375f94)

commit | commitdiff | tree

Sage Weil [Fri, 9 Apr 2021 18:19:37 +0000 (14:19 -0400)]

mgr/cephadm: include daemon_type in DaemonPlacement

Initially, this will always match the service_type.

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit d7b4a51a520d5d720fe784d64ac7864e828dea4a)

commit | commitdiff | tree

Sage Weil [Thu, 1 Apr 2021 18:14:13 +0000 (14:14 -0400)]

mgr/cephadm: update list-networks to report interface names too

Also, minor fix in the ipv6 addr reporting: ignore networks that aren't in CIDR
form (no /).

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 1897d1cd15af385bd888da0a9ee944cd3a68af07)

commit | commitdiff | tree

Sage Weil [Tue, 23 Mar 2021 21:18:56 +0000 (17:18 -0400)]

mgr/orchestrator: streamline 'orch ps' PORTS formatting

"*:8000 *:8100" -> "*:8000,8100"

FWIW this matches the internal rendering used by DaemonPlacement

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit f93c555c24336003770c182a4e3ccaae392c2d47)

commit | commitdiff | tree

Sage Weil [Tue, 23 Mar 2021 20:37:20 +0000 (16:37 -0400)]

mgr/cephadm/schedule: handle multiple ports per daemon

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 9256f1c374ab1e1e6d45f3a912048ab486357606)

commit | commitdiff | tree

Sage Weil [Tue, 23 Mar 2021 20:09:15 +0000 (16:09 -0400)]

mgr/cephadm/utils: resolve_ip(): prefer IPv4

On my system the first item in hte list is
'fe80::408d:35e7:510:e9fe%eno1np0'.

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 2e7808ccf80c78d424e1fc3db330ce0a6db1cb5d)

commit | commitdiff | tree

胡玮文 [Wed, 7 Apr 2021 13:18:52 +0000 (21:18 +0800)]

cephadm: cleanup extra slash in runtime dir

%t already contains a slash, no need to add an extra one

Signed-off-by: 胡玮文 <huww98@outlook.com>
(cherry picked from commit 9a864f086bedb7098060637ffb27ccc5dc92a88b)

commit | commitdiff | tree

胡玮文 [Thu, 11 Mar 2021 04:43:34 +0000 (12:43 +0800)]

cephadm: use split cgroup strategy for podman

Since systemd will create a cgroup for each service, we can instruct podman to
just split the current cgroup into sub-cgroups. This enables system admins to
use resource control features from systemd.

Signed-off-by: 胡玮文 <huww98@outlook.com>
(cherry picked from commit 1a76f4793ec96045b0fed5cd85b1a6b3dbcd732c)

commit | commitdiff | tree

胡玮文 [Thu, 11 Mar 2021 16:51:33 +0000 (00:51 +0800)]

cephadm: use class to represent container engine

This allow us to store additional information about engine apart from it's
path.

Signed-off-by: 胡玮文 <huww98@outlook.com>
(cherry picked from commit ca6a8fc90b1ad567ad4d777eaab402219d5d7ffb)

commit | commitdiff | tree

Melissa Li [Mon, 29 Mar 2021 04:34:42 +0000 (00:34 -0400)]

mgr/cephadm: don't cleanup the daemon keyring on failed redeploy

Fixes: https://tracker.ceph.com/issues/49872
Signed-off-by: Melissa Li <li.melissa.kun@gmail.com>
(cherry picked from commit 9b6ae808c68feec672c6e55a65bcde22b7085ee4)

commit | commitdiff | tree

Daniel Pivonka [Tue, 30 Mar 2021 20:17:46 +0000 (16:17 -0400)]

mgr/cephadm: fix orch host add with multiple labels and no addr

Signed-off-by: Daniel Pivonka <dpivonka@redhat.com>
(cherry picked from commit 92ad1420c848dd5406685e0d78d3b56356ed9455)

commit | commitdiff | tree

Daniel Pivonka [Tue, 30 Mar 2021 14:13:02 +0000 (10:13 -0400)]

doc/cephadm: remove keepalived_user from haproxy docs

keepalived_user is not used and not required

Signed-off-by: Daniel Pivonka <dpivonka@redhat.com>
(cherry picked from commit d4630eaab43c14d388918003c771f01b64bdd42e)

commit | commitdiff | tree

Nathan Cutler [Thu, 25 Feb 2021 18:01:18 +0000 (19:01 +0100)]

rpm: re-disable SUSE lttng build on z390x

This partially reverts 2b1e646f7aade3135a98c505111ac7ebef5e93a6 which
mistakenly changed a line inside an "%if 0%{?suse_version}" conditional.

Fixes: 2b1e646f7aade3135a98c505111ac7ebef5e93a6
Signed-off-by: Nathan Cutler <ncutler@suse.com>
(cherry picked from commit ffd202a08619fc535df593eb41c0769577a1586a)

commit | commitdiff | tree

Yaakov Selkowitz [Tue, 9 Feb 2021 16:03:42 +0000 (11:03 -0500)]

ceph.spec.in: enable tcmalloc and lttng on s390x

The necessary prerequisites are already in RHEL+EPEL 8.

Signed-off-by: Yaakov Selkowitz <yselkowi@redhat.com>
(cherry picked from commit 2b1e646f7aade3135a98c505111ac7ebef5e93a6)

commit | commitdiff | tree

Sage Weil [Fri, 23 Apr 2021 12:21:50 +0000 (07:21 -0500)]

Merge PR #40746 into pacific

* refs/pull/40746/head:
doc/cephadm: fix a typo
mgr/cephadm: rewrite/simplify describe_service
mgr/orchestrator: report osds as osd.unmanaged as appropriate
mgr/orchestrator: remove IMAGE ID from 'orch ls'
cephadm: normalize unqualified repo digests to docker.io
mgr/cephadm/upgrade: normalize unqualified target image
cephadm:persist the grafana.db file
qa/tasks/cephadm: add apply() method/task
cephadm: pass '-i' to docker|podman run for shell|enter

Reviewed-by: Juan Miguel Olmo <jolmomar@redhat.com>

commit | commitdiff | tree

Sage Weil [Thu, 22 Apr 2021 16:23:50 +0000 (11:23 -0500)]

Merge PR #40985 into pacific

* refs/pull/40985/head:
ceph-volume: fix raw listing when finding OSDs from different clusters

Reviewed-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sébastien Han [Thu, 22 Apr 2021 10:52:09 +0000 (12:52 +0200)]

ceph-volume: fix raw listing when finding OSDs from different clusters

When listing OSDs on host with 2 OSDs with the same ID, the output gets
overwritten with the last listed device. So a single OSD will show up.
See the ceph-volume.log which correctly parsed both disks:

```
[2021-04-22 09:44:21,391][ceph_volume.devices.raw.list][DEBUG ] Examining /dev/sda1
[2021-04-22 09:44:21,391][ceph_volume.process][INFO  ] Running command: /usr/bin/ceph-bluestore-tool show-label --dev /dev/sda1
[2021-04-22 09:44:21,418][ceph_volume.process][INFO  ] stdout {
[2021-04-22 09:44:21,418][ceph_volume.process][INFO  ] stdout "/dev/sda1": {
[2021-04-22 09:44:21,418][ceph_volume.process][INFO  ] stdout "osd_uuid": "423bf64d-f241-4f4b-a589-25a66fc836d1",
[2021-04-22 09:44:21,418][ceph_volume.process][INFO  ] stdout "size": 6442450944,
[2021-04-22 09:44:21,418][ceph_volume.process][INFO  ] stdout "btime": "2021-04-22T09:32:55.894961+0000",
[2021-04-22 09:44:21,418][ceph_volume.process][INFO  ] stdout "description": "main",
[2021-04-22 09:44:21,418][ceph_volume.process][INFO  ] stdout "bfm_blocks": "1572864",
[2021-04-22 09:44:21,418][ceph_volume.process][INFO  ] stdout "bfm_blocks_per_key": "128",
[2021-04-22 09:44:21,418][ceph_volume.process][INFO  ] stdout "bfm_bytes_per_block": "4096",
[2021-04-22 09:44:21,418][ceph_volume.process][INFO  ] stdout "bfm_size": "6442450944",
[2021-04-22 09:44:21,418][ceph_volume.process][INFO  ] stdout "bluefs": "1",
[2021-04-22 09:44:21,419][ceph_volume.process][INFO  ] stdout "ceph_fsid": "d3cd4b72-5342-4fd3-96ec-a6e581261eab",
[2021-04-22 09:44:21,419][ceph_volume.process][INFO  ] stdout "kv_backend": "rocksdb",
[2021-04-22 09:44:21,419][ceph_volume.process][INFO  ] stdout "magic": "ceph osd volume v026",
[2021-04-22 09:44:21,419][ceph_volume.process][INFO  ] stdout "mkfs_done": "yes",
[2021-04-22 09:44:21,419][ceph_volume.process][INFO  ] stdout "osd_key": "AQDGQoFg+XHqJBAAw9ZQmtrnotHCLI0Nc2to6A==",
[2021-04-22 09:44:21,419][ceph_volume.process][INFO  ] stdout "ready": "ready",
[2021-04-22 09:44:21,419][ceph_volume.process][INFO  ] stdout "whoami": "0"
[2021-04-22 09:44:21,419][ceph_volume.process][INFO  ] stdout }
[2021-04-22 09:44:21,419][ceph_volume.process][INFO  ] stdout }
[2021-04-22 09:44:21,419][ceph_volume.devices.raw.list][DEBUG ] Examining /dev/sda2
[2021-04-22 09:44:21,419][ceph_volume.process][INFO  ] Running command: /usr/bin/ceph-bluestore-tool show-label --dev /dev/sda2
[2021-04-22 09:44:21,445][ceph_volume.process][INFO  ] stdout {
[2021-04-22 09:44:21,445][ceph_volume.process][INFO  ] stdout "/dev/sda2": {
[2021-04-22 09:44:21,445][ceph_volume.process][INFO  ] stdout "osd_uuid": "c7c66bbd-7b38-4dcd-ad6d-3769c516f2fe",
[2021-04-22 09:44:21,446][ceph_volume.process][INFO  ] stdout "size": 6442450944,
[2021-04-22 09:44:21,446][ceph_volume.process][INFO  ] stdout "btime": "2021-04-22T09:32:21.814768+0000",
[2021-04-22 09:44:21,446][ceph_volume.process][INFO  ] stdout "description": "main",
[2021-04-22 09:44:21,446][ceph_volume.process][INFO  ] stdout "bfm_blocks": "1572864",
[2021-04-22 09:44:21,446][ceph_volume.process][INFO  ] stdout "bfm_blocks_per_key": "128",
[2021-04-22 09:44:21,446][ceph_volume.process][INFO  ] stdout "bfm_bytes_per_block": "4096",
[2021-04-22 09:44:21,446][ceph_volume.process][INFO  ] stdout "bfm_size": "6442450944",
[2021-04-22 09:44:21,446][ceph_volume.process][INFO  ] stdout "bluefs": "1",
[2021-04-22 09:44:21,446][ceph_volume.process][INFO  ] stdout "ceph_fsid": "69c40cb1-22af-42e4-9d59-4a4468a2f58f",
[2021-04-22 09:44:21,446][ceph_volume.process][INFO  ] stdout "kv_backend": "rocksdb",
[2021-04-22 09:44:21,446][ceph_volume.process][INFO  ] stdout "magic": "ceph osd volume v026",
[2021-04-22 09:44:21,446][ceph_volume.process][INFO  ] stdout "mkfs_done": "yes",
[2021-04-22 09:44:21,446][ceph_volume.process][INFO  ] stdout "osd_key": "AQCkQoFgre9SKBAANgHH6scIb+IiyKxh6MhY0A==",
[2021-04-22 09:44:21,446][ceph_volume.process][INFO  ] stdout "ready": "ready",
[2021-04-22 09:44:21,446][ceph_volume.process][INFO  ] stdout "require_osd_release": "16",
[2021-04-22 09:44:21,446][ceph_volume.process][INFO  ] stdout "whoami": "0"
[2021-04-22 09:44:21,446][ceph_volume.process][INFO  ] stdout }
[2021-04-22 09:44:21,446][ceph_volume.process][INFO  ] stdout }
```

However, a single OSD gets listed by `ceph-volume raw list`:

```
[root@2b5a3b8bf31c /]# ceph-volume raw list
{
    "0": {
        "ceph_fsid": "69c40cb1-22af-42e4-9d59-4a4468a2f58f",
        "device": "/dev/sda2",
        "osd_id": 0,
        "osd_uuid": "c7c66bbd-7b38-4dcd-ad6d-3769c516f2fe",
        "type": "bluestore"
    }
}
```

We now use the osd_uuid so the output will never conflict:

```
[root@2b5a3b8bf31c /]# ceph-volume raw list
{
    "423bf64d-f241-4f4b-a589-25a66fc836d1": {
        "ceph_fsid": "d3cd4b72-5342-4fd3-96ec-a6e581261eab",
        "dev": "/dev/sda1",
        "osd_id": 0,
        "osd_uuid": "423bf64d-f241-4f4b-a589-25a66fc836d1",
        "type": "bluestore"
    },
    "c7c66bbd-7b38-4dcd-ad6d-3769c516f2fe": {
        "ceph_fsid": "69c40cb1-22af-42e4-9d59-4a4468a2f58f",
        "dev": "/dev/sda2",
        "osd_id": 0,
        "osd_uuid": "c7c66bbd-7b38-4dcd-ad6d-3769c516f2fe",
        "type": "bluestore"
    }
}
```

Fixes: https://tracker.ceph.com/issues/50478
Signed-off-by: Sébastien Han <seb@redhat.com>
(cherry picked from commit ec0f5f3b22d24754c16131a1996e42b787e4255f)

commit | commitdiff | tree

Ernesto Puerta [Thu, 22 Apr 2021 09:01:44 +0000 (11:01 +0200)]

Merge pull request #40822 from rhcs-dashboard/wip-50303-pacific

pacific: mgr/dashboard: fix errors when creating NFS export.

Reviewed-by: Avan Thakkar <athakkar@redhat.com>
Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>

commit | commitdiff | tree

Ilya Dryomov [Wed, 21 Apr 2021 16:00:46 +0000 (18:00 +0200)]

Merge pull request #40957 from rhcs-dashboard/wip-50458-pacific

pacific: vstart.sh: disable "auth_allow_insecure_global_id_reclaim"

Reviewed-by: Kefu Chai <kchai@redhat.com>
Reviewed-by: Ilya Dryomov <idryomov@gmail.com>

commit | commitdiff | tree

Alfonso Martínez [Fri, 9 Apr 2021 08:51:21 +0000 (10:51 +0200)]

mgr/dashboard: fix errors when creating NFS export.

- Fix daemon raw config parsing.
- Handle error when no rgw daemons found.

Fixes: https://tracker.ceph.com/issues/49925
Signed-off-by: Alfonso Martínez <almartin@redhat.com>
(cherry picked from commit 8bad7360ef23fa154622d0bee7823092b9440ca6)

commit | commitdiff | tree

Kefu Chai [Thu, 15 Apr 2021 13:07:53 +0000 (21:07 +0800)]

vstart.sh: disable "auth_allow_insecure_global_id_reclaim"

to silence the health warning of "mons are allowing insecure global_id
reclaim", which prevents the cluster from being active+clean. couple
tests are expecting a warning free cluster before they starts.

as this option is enabled by default for appeasing the old clients, but when it
comes to most of upstream testing, we can just disable it.

Fixes: https://tracker.ceph.com/issues/50374
Signed-off-by: Kefu Chai <kchai@redhat.com>
(cherry picked from commit 77a8376d0731c24e7bbf24523d3d7450e9f978af)

commit | commitdiff | tree

Patrick Donnelly [Wed, 21 Apr 2021 00:01:13 +0000 (17:01 -0700)]

Merge PR #40687 into pacific

* refs/pull/40687/head:
doc/cephfs/nfs: add user id, fs name and key to FSAL block

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>

commit | commitdiff | tree

Ilya Dryomov [Tue, 20 Apr 2021 09:01:15 +0000 (11:01 +0200)]

Merge branch 'pacific-saved' into pacific

Conflicts:
qa/tasks/ceph.conf.template [ commit 94df76244798
  ("qa/tasks/ceph.conf: shorten cephx TTL for testing") was
  cherry-picked to 16.2.0 separately and so exists both in
  16.2.0 and pacific-saved ]
qa/tasks/cephadm.conf [ ditto ]

commit | commitdiff | tree

Jenkins Build Slave User [Mon, 19 Apr 2021 13:50:07 +0000 (13:50 +0000)]

16.2.1

commit | commitdiff | tree

Ilya Dryomov [Thu, 15 Apr 2021 13:18:58 +0000 (15:18 +0200)]

auth/cephx: make KeyServer::build_session_auth_info() less confusing

The second KeyServer::build_session_auth_info() overload is used only
by the monitor, for mon <-> mon authentication.  The monitor passes in
service_secret (mon secret) and secret_id (-1).  The TTL is irrelevant
because there is no rotation.

However the signature doesn't make it obvious.  Clarify that
service_secret and secret_id are input parameters and info is the only
output parameter.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit 6f12cd3688b753633c8ff29fb3bd64758f960b2b)

commit | commitdiff | tree

Ilya Dryomov [Thu, 15 Apr 2021 07:48:13 +0000 (09:48 +0200)]

auth/cephx: cap ticket validity by expiration of "next" key

If auth_mon_ticket_ttl is increased by several times as done in
commit 522a52e6c258 ("auth/cephx: rotate auth tickets less often"),
active clients eventually get stuck because the monitor sends out an
auth ticket with a bogus validity.  The ticket is secured with the
"current" secret that is scheduled to expire according to the old TTL,
but the validity of the ticket is set to the new TTL.  As a result,
the client simply doesn't attempt to renew, letting the secrets rotate
potentially more than once.  When that happens, the client first hits
auth authorizer errors as it tries to renew service tickets and when
it finally gets to renewing the auth ticket, it hits the insecure
global_id reclaim wall.

Cap TTL by expiration of "next" key -- the "current" key may be
milliseconds away from expiration and still be used, legitimately.
Do it in KeyServerData alongside key rotation code and propagate the
capped TTL to the upper layer.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit 370c9b13970d47a55b1b20ef983c6f01236c9565)

commit | commitdiff | tree

Ilya Dryomov [Thu, 15 Apr 2021 07:47:50 +0000 (09:47 +0200)]

auth/cephx: drop redundant KeyServerData::get_service_secret() overload

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit 3078af716505ae754723864786a41a6d6af0534c)

commit | commitdiff | tree

Guillaume Abrioux [Wed, 7 Apr 2021 15:00:13 +0000 (17:00 +0200)]

doc/cephadm: fix a typo

This fixes a small typo in the cephadm/iscsi documentation

s/iSCSI Ganesha gateway/iSCSI gateway/

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
(cherry picked from commit 6602eb7e7cd127509c4967c84d92574ccd7296c4)

commit | commitdiff | tree

Sage Weil [Fri, 9 Apr 2021 20:26:00 +0000 (16:26 -0400)]

mgr/cephadm: rewrite/simplify describe_service

The prior implementation first tried to fabricate services based on the
running daemons, and then filled in defined services on top. This led
to duplication and a range of small errors.

Instead, flip this around: start with the services that are defined,
and only fill in 'unmanaged' services where we need to.

Drop the osd kludges and instead rely on DaemonDescription.service_id to
return the right thing.

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 58d9e90425679fd715aa31d7c8f1044f4582388e)

commit | commitdiff | tree

Sage Weil [Fri, 9 Apr 2021 20:22:49 +0000 (16:22 -0400)]

mgr/orchestrator: report osds as osd.unmanaged as appropriate

If there is no osdspec_affinity or service_name (from unit.meta), then
report as 'osd.unmanaged'.

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 5adef5f7663e25dc946a9a44d5a1ac33e8452ccf)

commit | commitdiff | tree

Sage Weil [Fri, 9 Apr 2021 19:35:17 +0000 (15:35 -0400)]

mgr/orchestrator: remove IMAGE ID from 'orch ls'

This is not very useful at this level:
- we see it from 'orch ps'
- it can be a mix of ids during upgrade
- some services may have multiple images at steady state (e.g., ingress)

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 2b63ae25c9af576b8cbab80b17af013b2868f7a2)

commit | commitdiff | tree

Ilya Dryomov [Tue, 13 Apr 2021 16:41:54 +0000 (18:41 +0200)]

Merge pull request #40826 from idryomov/wip-no-cephxv2-for-unmap-pacific

pacific: qa/suites/krbd: don't require CEPHX_V2 for unmap subsuite

Reviewed-by: Greg Farnum <gfarnum@redhat.com>

commit | commitdiff | tree

Ilya Dryomov [Tue, 13 Apr 2021 09:44:48 +0000 (11:44 +0200)]

Merge pull request #40665 from idryomov/wip-require-ceph-common-for-ioc-pacific

pacific: packaging: require ceph-common for immutable object cache daemon

Reviewed-by: Greg Farnum <gfarnum@redhat.com>

commit | commitdiff | tree

Ilya Dryomov [Sat, 3 Apr 2021 09:13:56 +0000 (11:13 +0200)]

qa/suites/krbd: don't require CEPHX_V2 for unmap subsuite

Starting with pacific, CEPHX_V2 is required by default but
pre-single-major.yaml kernel doesn't support it.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit 4027eb864efeb8b85f3d459048aabdffb894b150)

commit | commitdiff | tree

Sage Weil [Sun, 28 Mar 2021 22:07:57 +0000 (18:07 -0400)]

qa/standalone: default to disable insecure global id reclaim

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 72c4fc75ad301980baebc7789ed6391444057e5b)

commit | commitdiff | tree

Sage Weil [Thu, 25 Mar 2021 17:36:56 +0000 (13:36 -0400)]

qa/suites/upgrade/octopus-x: disable insecure global_id reclaim health warnings

These will trigger on upgrade; suppress them so that our health gates
will still work.

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 3e80f61efeafc186ea8130984d64c05b2707d6ba)

Conflicts:
qa/suites/upgrade/octopus-x/rgw-multisite/overrides.yaml [
commit b6773dd3f197 ("qa/rgw: add octopus-x upgrade suite for
multisite") not in pacific ]

commit | commitdiff | tree

Sage Weil [Fri, 26 Mar 2021 22:08:46 +0000 (18:08 -0400)]

qa/tasks/ceph[adm].conf[.template]: disable insecure global_id reclaim health alerts

Turn these off everywhere for our tests so they don't interfere with our health checks.

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 9f6fd4fe563c9cd4cf65316921d511b677c972e4)

commit | commitdiff | tree

Sage Weil [Fri, 26 Mar 2021 16:02:50 +0000 (12:02 -0400)]

cephadm: set auth_allow_insecure_global_id_reclaim for mon on bootstrap

If this is a fresh pacific cluster, let's assume that there won't be
legacy clients connecting. (And if there are, let's put the burden on
the user to enable them to do so insecurely.)

This is in contrast to upgrades, where our focus is on not breaking
anything.

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 7ca74183226b1125b29f4ea8f324ae9e38b46795)

commit | commitdiff | tree

Sage Weil [Thu, 25 Mar 2021 22:07:53 +0000 (18:07 -0400)]

mon/HealthMonitor: raise AUTH_INSECURE_GLOBAL_ID_RENEWAL[_ALLOWED]

Two new alerts:

- AUTH_INSECURE_GLOBAL_ID_RENEWAL_ALLOWED if we are allowing clients to reclaim
global_ids in an insecure manner (for backwards compatibility until
clients are upgraded)

- AUTH_INSECURE_GLBOAL_ID_RENEWAL if there are currently clients connected that
do not know how to securely renew their global_id, as exposed by
auth_expose_insecure_global_id_reclaim=true. The client auth names and IPs
are listed the alert details (up to a limit, at least).

The docs recommend operators mute these alerts instead of silencing, but
we still include option that allow the alerts to be disabled entirely.

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 18b343b06e5dd904af425dc99e2c848e12f3b552)

commit | commitdiff | tree

Ilya Dryomov [Tue, 2 Mar 2021 14:09:26 +0000 (15:09 +0100)]

auth/cephx: ignore CEPH_ENTITY_TYPE_AUTH in requested keys

When handling CEPHX_GET_AUTH_SESSION_KEY requests from nautilus+
clients, ignore CEPH_ENTITY_TYPE_AUTH in CephXAuthenticate::other_keys.
Similarly, when handling CEPHX_GET_PRINCIPAL_SESSION_KEY requests,
ignore CEPH_ENTITY_TYPE_AUTH in CephXServiceTicketRequest::keys.
These fields are intended for requesting service tickets, the auth
ticket (which is really a ticket granting ticket) must not be shared
this way.

Otherwise we end up sharing an auth ticket that a) isn't encrypted
with the old session key even if needed (should_enc_ticket == true)
and b) has the wrong validity, namely auth_service_ticket_ttl instead
of auth_mon_ticket_ttl. In the CEPHX_GET_AUTH_SESSION_KEY case, this
undue ticket immediately supersedes the actual auth ticket already
encoded in the same reply (the reply frame ends up containing two auth
tickets).

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit 05772ab6127bdd9ed2f63fceef840f197ecd9ea8)

commit | commitdiff | tree

Ilya Dryomov [Mon, 22 Mar 2021 18:16:32 +0000 (19:16 +0100)]

auth/cephx: rotate auth tickets less often

If unauthorized global_id (re)use is disallowed, a client that has
been disconnected from the network long enough for keys to rotate
and its auth ticket to expire (i.e. become invalid/unverifiable)
would not be able to reconnect.

The default TTL is 12 hours, resulting in a 12-24 hour reconnect
window (the previous key is kept around, so the actual window can be
up to double the TTL). The setting has stayed the same since 2009,
but it also hasn't been enforced. Bump it to get a 72 hour reconnect
window to cover for something breaking on Friday and not getting fixed
until Monday.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit 522a52e6c258932274f0753feb623ce008519216)

commit | commitdiff | tree

Ilya Dryomov [Thu, 25 Mar 2021 19:59:13 +0000 (20:59 +0100)]

mon: fail fast when unauthorized global_id (re)use is disallowed

When unauthorized global_id (re)use is disallowed, we don't want to
let unpatched clients in because they wouldn't be able to reestablish
their monitor session later, resulting in subtle hangs and disrupted
user workloads.

Denying the initial connect for all legacy (CephXAuthenticate < v3)
clients is not feasible because a large subset of them never stopped
presenting their ticket on reconnects and are therefore compatible with
enforcing mode: most notably all kernel clients but also pre-luminous
userspace clients.  They don't need to be patched and excluding them
would significantly hamper the adoption of enforcing mode.

Instead, force clients that we are not sure about to reconnect shortly
after they go through authentication and obtain global_id.  This is
done in Monitor::dispatch_op() to capture both msgr1 and msgr2, most
likely instead of dispatching mon_subscribe.

We need to let mon_getmap through for "ceph ping" and "ceph tell" to
work.  This does mean that we share the monmap, which lets the client
return from MonClient::authenticate() considering authentication to be
finished and causing the potential reconnect error to not propagate to
the user -- the client would hang waiting for remaining cluster maps.
For msgr1, this is unavoidable because the monmap is sent immediately
after the final MAuthReply.  But for msgr2 this is rare: most of the
time we get to their mon_subscribe and cut the connection before they
process the monmap!

Regardless, the user doesn't get a chance to start a workload since
there is no proper higher-level session at that point.

To help with identifying clients that need patching, add global_id and
global_id_status to "sessions" output.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit 08766a17edebb7450cd9b17cc2dc01efc068bb94)

commit | commitdiff | tree

Ilya Dryomov [Sat, 13 Mar 2021 13:53:52 +0000 (14:53 +0100)]

auth/cephx: option to disallow unauthorized global_id (re)use

global_id is a cluster-wide unique id that must remain stable for the
lifetime of the client instance.  The cephx protocol has a facility to
allow clients to preserve their global_id across reconnects:

(1) the client should provide its global_id in the initial handshake
    message/frame and later include its auth ticket proving previous
    possession of that global_id in CEPHX_GET_AUTH_SESSION_KEY request

(2) the monitor should verify that the included auth ticket is valid
    and has the same global_id and, if so, allow the reclaim

(3) if the reclaim is allowed, the new auth ticket should be
    encrypted with the session key of the included auth ticket to
    ensure authenticity of the client performing reclaim.  (The
    included auth ticket could have been snooped when the monitor
    originally shared it with the client or any time the client
    provided it back to the monitor as part of requesting service
    tickets, but only the genuine client would have its session key
    and be able to decrypt.)

Unfortunately, all (1), (2) and (3) have been broken for a while:

- (1) was broken in 2016 by commit a2eb6ae3fb57 ("mon/monclient:
  hunt for multiple monitor in parallel") and is addressed in patch
  "mon/MonClient: preserve auth state on reconnects"

- it turns out that (2) has never been enforced.  When cephx was
  being designed and implemented in 2009, two changes to the protocol
  raced with each other pulling it in different directions: commits
  0669ca21f4f7 ("auth: reuse global_id when requesting tickets")
  and fec31964a12b ("auth: when renewing session, encrypt ticket")
  added the reclaim mechanism based strictly on auth tickets, while
  commit 5eeb711b6b2b ("auth: change server side negotiation a bit")
  allowed the client to provide global_id in the initial handshake.
  These changes didn't get reconciled and as a result a malicious
  client can assign itself any global_id of its choosing by simply
  passing something other than 0 in MAuth message or AUTH_REQUEST
  frame and not even bother supplying any ticket.  This includes
  getting a global_id that is being used by another client.

- (3) was broken in 2019 with addition of support for msgr2, where
  the new auth ticket ends up being shared unencrypted.  However the
  root cause is deeper and a malicious client can coerce msgr1 into
  the same.  This also goes back to 2009 and is addressed in patch
  "auth/cephx: ignore CEPH_ENTITY_TYPE_AUTH in requested keys".

Because (2) has never been enforced, no one noticed when (1) got
broken and we began to rely on this flaw for normal operation in
the face of reconnects due to network hiccups or otherwise.  As of
today, only pre-luminous userspace clients and kernel clients are
not exercising it on a daily basis.

Bump CephXAuthenticate version and use a dummy v3 to distinguish
between legacy clients that don't (may not) include their auth ticket
and new clients.  For new clients, unconditionally disallow claiming
global_id without a corresponding auth ticket.  For legacy clients,
introduce a choice between permissive (current behavior, default for
the foreseeable future) and enforcing mode.

If the reclaim is disallowed, return EACCES.  While MonClient does
have some provision for global_id changes and we could conceivably
implement enforcement by handing out a fresh global_id instead of
the provided one, those code paths have never been tested and there
are too many ways a sudden global_id change could go wrong.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit abebd643cc60fa8a7cb82dc29a9d5041fb3c3d36)

commit | commitdiff | tree

Ilya Dryomov [Tue, 30 Mar 2021 09:10:17 +0000 (11:10 +0200)]

auth/cephx: make cephx_decode_ticket() take a const ticket_blob

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit 6b860684c6e59b11c727206819805f89f0518575)

commit | commitdiff | tree

Ilya Dryomov [Tue, 9 Mar 2021 15:33:55 +0000 (16:33 +0100)]

auth/AuthServiceHandler: keep track of global_id and whether it is new

AuthServiceHandler already has global_id field, but it is unused.
Revive it and let the handler know whether global_id is newly assigned
by the monitor or provided by the client.

Lift the setting of entity_name into AuthServiceHandler.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit b50b6abd60e730176a7ef602bdd25d789a3c467d)

commit | commitdiff | tree

Ilya Dryomov [Tue, 9 Mar 2021 13:36:39 +0000 (14:36 +0100)]

auth/AuthServiceHandler: build_cephx_response_header() is cephx-specific

Make the one in CephxServiceHandler private and drop the stub in
AuthNoneServiceHandler.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit 49cba02a750d4c1ab68399401f0c04f9c9be5b9e)

commit | commitdiff | tree

Ilya Dryomov [Tue, 9 Mar 2021 13:25:39 +0000 (14:25 +0100)]

auth/AuthServiceHandler: drop unused start_session() args

session_key, connection_secret and connection_secret_required_length
aren't material for start_session() across all three implementations.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit c151c9659bdb71f30b520bbd62f91cc009ec51cd)

commit | commitdiff | tree

Ilya Dryomov [Tue, 30 Mar 2021 13:19:41 +0000 (15:19 +0200)]

mon/MonClient: drop global_id arg from _add_conn() and _add_conns()

Passing anything but MonClient instance's global_id doesn't make
sense.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit a71f6e90d43cca5a79db92ca6a640598796ae7ee)

commit | commitdiff | tree

Ilya Dryomov [Thu, 1 Apr 2021 08:55:36 +0000 (10:55 +0200)]

mon/MonClient: reset auth state in shutdown()

Destroying AuthClientHandler and not resetting global_id is another
way to get MonClient to send CEPHX_GET_AUTH_SESSION_KEY requests with
CephXAuthenticate::old_ticket not populated. This is particularly
pertinent to get_monmap_and_config() which shuts down the bootstrap
MonClient between retry attempts.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit c9b022e07392979e7f9ea6c11484a7dd872cc235)

commit | commitdiff | tree

Ilya Dryomov [Mon, 8 Mar 2021 14:37:02 +0000 (15:37 +0100)]

mon/MonClient: preserve auth state on reconnects

Commit a2eb6ae3fb57 ("mon/monclient: hunt for multiple monitor in
parallel") introduced a regression where auth state (global_id and
AuthClientHandler) was no longer preserved on reconnects.  The ensuing
breakage was quickly noticed and prompted a follow-on fix 8bb6193c8f53
("mon/MonClient: persist global_id across re-connecting").

However, as evident from the subject, the follow-on fix only took
care of the global_id part.  AuthClientHandler is still destroyed
and all cephx tickets are discarded.  A new from-scratch instance
is created for each MonConnection and CEPHX_GET_AUTH_SESSION_KEY
requests end up with CephXAuthenticate::old_ticket not populated.
The bug is in MonClient, so both msgr1 and msgr2 are affected.

This should have resulted in a similar sort of breakage but didn't
because of a much larger bug.  The monitor should have denied the
attempt to reclaim global_id with no valid ticket proving previous
possession of that global_id presented.  Alas, it appears that this
aspect of the cephx protocol has never been enforced.  This is dealt
with in the next patch.

To fix the issue at hand, clone AuthClientHandler into each
MonConnection so that each respective CEPHX_GET_AUTH_SESSION_KEY
request gets a copy of the current auth ticket.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit 236b536b28482ec9d8b872de03da7d702ce4787b)

commit | commitdiff | tree

Ilya Dryomov [Sat, 6 Mar 2021 10:15:40 +0000 (11:15 +0100)]

mon/MonClient: claim active_con's auth explicitly

Eliminate confusion by moving auth from active_con into MonClient
instead of swapping them.

The existing MonClient::auth can be destroyed right away -- I don't
see why active_con would need it or a reason to delay its destruction
(which is what stashing in active_con effectively does).

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit eec24e4d119c57c7eb5119dc0083616a61b33b89)

commit | commitdiff | tree

Ilya Dryomov [Thu, 1 Apr 2021 08:07:00 +0000 (10:07 +0200)]

mon/MonClient: resurrect "waiting for monmap|config" timeouts

This fixes a regression introduced in commit 85157d5aae3d ("mon:
s/Mutex/ceph::mutex/"). Waiting for monmap and config indefinitely
is not just bad UX, it actually masks other more serious bugs.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit 6faa18e0a8e8efba6bd2978942eb9909b6568d5c)

Unnamed repository; edit this file 'description' to name the repository.