git.apps.os.sepia.ceph.com Git

]> git.apps.os.sepia.ceph.com Git - ceph.git/log

Venky Shankar [Wed, 10 Mar 2021 13:37:47 +0000 (08:37 -0500)]

cephfs-mirror: null terminate buffer before synchronizing symbolc link

Signed-off-by: Venky Shankar <vshankar@redhat.com>
(cherry picked from commit 30f2066cfdb359c06167068253de0627b15aff91)

commit | commitdiff | tree

Venky Shankar [Thu, 11 Mar 2021 04:31:45 +0000 (23:31 -0500)]

doc: clarify mirror daemon user capability requirements

Fixes: http://tracker.ceph.com/issues/49619
Signed-off-by: Venky Shankar <vshankar@redhat.com>
(cherry picked from commit 943ea38678ee6b3bc1c329c3cc56d0e61d87088e)

commit | commitdiff | tree

Venky Shankar [Thu, 4 Mar 2021 04:29:03 +0000 (23:29 -0500)]

doc: doc changes for additional mirroring interfaces

Signed-off-by: Venky Shankar <vshankar@redhat.com>
(cherry picked from commit f766e8297ce1eba06e25891a22428a63ae386f60)

commit | commitdiff | tree

Venky Shankar [Tue, 9 Mar 2021 04:44:29 +0000 (23:44 -0500)]

pybind/mirroring: add interface to list file system mirror peers

Signed-off-by: Venky Shankar <vshankar@redhat.com>
(cherry picked from commit 8ee88f57355954977320d9967b4cadfe18150f8f)

commit | commitdiff | tree

Venky Shankar [Tue, 9 Mar 2021 11:18:20 +0000 (16:48 +0530)]

pybind/mirroring: set libcephfs handle root uid/gid as 0:0

Signed-off-by: Venky Shankar <vshankar@redhat.com>
(cherry picked from commit 928e3c49338078ce1b28e66b318a39dc9e2cc724)

commit | commitdiff | tree

Venky Shankar [Thu, 4 Mar 2021 05:01:48 +0000 (00:01 -0500)]

test: add tests for mirroring bootstrap interfaces

Signed-off-by: Venky Shankar <vshankar@redhat.com>
(cherry picked from commit 3e13f489371201488e8d80ddaf77976559e3c6df)

commit | commitdiff | tree

Venky Shankar [Thu, 4 Mar 2021 05:07:43 +0000 (00:07 -0500)]

pybind/mirroring: introduce peer_bootstrap {create|import} commands

Signed-off-by: Venky Shankar <vshankar@redhat.com>
(cherry picked from commit 8d2c726e3c1953b1b21dd365d3963078f551747b)

commit | commitdiff | tree

Venky Shankar [Tue, 23 Feb 2021 04:21:47 +0000 (23:21 -0500)]

cephfs-mirror: use peer cluster monitor address (and key) if available

This allows connecting to the peer cluster without having the cluster
configuration file on the primary cluster.

Signed-off-by: Venky Shankar <vshankar@redhat.com>
(cherry picked from commit ab410213fedc0247161809258966575688fa8cb9)

commit | commitdiff | tree

Venky Shankar [Tue, 23 Feb 2021 04:06:19 +0000 (23:06 -0500)]

mon: peer_add should accept Ceph file system UUID

Signed-off-by: Venky Shankar <vshankar@redhat.com>
(cherry picked from commit a04010e9490aa726d219c41139c27417dac836e2)

commit | commitdiff | tree

Venky Shankar [Thu, 4 Mar 2021 05:01:11 +0000 (00:01 -0500)]

mon: introduce "profile cephfs-mirror" cap constrained to "config-get cephfs/mirror/peer"

Signed-off-by: Venky Shankar <vshankar@redhat.com>
(cherry picked from commit f1858bf650ef0d23dbf2166ea2acb80bf9962d81)

commit | commitdiff | tree

Venky Shankar [Mon, 8 Mar 2021 09:49:52 +0000 (04:49 -0500)]

test: add test for failed filesystem mirror instances

Signed-off-by: Venky Shankar <vshankar@redhat.com>
(cherry picked from commit d1585af77b184ad6e684902002ecdcc28f85adae)

commit | commitdiff | tree

Venky Shankar [Mon, 8 Mar 2021 09:48:56 +0000 (04:48 -0500)]

cephfs-mirror: restart failed mirror filesystem instances

Signed-off-by: Venky Shankar <vshankar@redhat.com>
(cherry picked from commit 158884820ee2ab7982ae4a75f571730ba5c3b439)

commit | commitdiff | tree

Sage Weil [Tue, 23 Mar 2021 00:20:48 +0000 (19:20 -0500)]

Merge PR #40184 into pacific

* refs/pull/40184/head:
qa/suites/rados/cephadm/orchestrator_cli: random-distro$ -> 0-random-distro$
qa/suites/rados/cephadm/smoke-roleless: distro -> 0-distro
qa/distros/podman: install kubic once per host, in parallel
qa/suites/fs/multiclient: use clients: not all: for pexec

Reviewed-by: Josh Durgin <jdurgin@redhat.com>

commit | commitdiff | tree

Sage Weil [Tue, 23 Mar 2021 00:20:39 +0000 (19:20 -0500)]

Merge PR #40202 into pacific

* refs/pull/40202/head:
qa/suites/rados/cephadm/upgrade: wait for rgw servicemap entries to refresh
mgr/cephadm: identify iscsi service by the pool
qa/distros/podman: install containernetworking-plugins along with podman
python-common: Validate characters in service_id for container names
qa/suites/rados/cephadm/smoke-roleless: deploy additional daemon types
cephadm: fix a minor typo in logging message
qa/suites/rados/cephadm/dashboard: test on centos
cephadm: use debug verbosity during container exec
mgr/cephadm/upgrade: do not repeat crash message
mgr/cephadm/upgrade: a little less verbose
mgr/cephadm: don't log not-ok-to-stop at ERR level
mgr/cephadm: is presumed -> appears
mgr/cephadm: don't double-log ok-to-stop results
mgr/cephadm/upgrade: include upgrade progress in ceph -s
mgr/cephadm: clean up misc messages
mgr/cephadm/configcheck: do not spam info every minute
mgr/cephadm: stop conflicting daemon when deploying to a specific port
mgr/cephadm: make DaemonPlacement print nicer
mgr/cephadm: fix --force remove comment
mgr/cephadm/schedule: choose an IP from a subnet list
mgr/cephadm: rgw: clean up config and config-key values on removal
mgr/cephadm: rgw: drop .crt extension when storing cert in config-key
mgr/cephadm/services: allow beast/civetweb to bind to a particular IP
python-common: add 'networks' property to ServiceSpec
mgr/cephadm/schedule: match placement ip only combination with port
mgr/cephadm: less noise about refreshing hosts
mgr/cephadm: fall back to service spec port if none on DaemonDescription
mgr/cephadm: fix redeploy when daemons have ip:port
mgr/cephadm/schedule: add test case
qa/suites/rados/cephadm/smoke-roleless: add rgw test on many ports
doc/cephadm/rgw: update docs to show count-per-host
mgr/cephadm: add support for rgw_frontend_type (beast or civetweb)
mgr/cephadm: remove ssl_frontend_ssl_key from RGWSpec
mgr/cephadm: fix beast private key config option
mgr/cephadm: fix rgw ssl cert/key config-key path
mgr/cephadm/schedule: dynamically assign ports for rgw
mgr/cephadm/schedule: only 1 port in DaemonPlacement
mgr/cephadm: move rgw frontend logic into RgwService
mgr/cephadm/schedule: return DaemonPlacement instead of HostPlacementSpec
mgr/cephadm/schedule: remove unused methods
mgr/cephadm: propagate ip:port from CephadmDaemoNDeploySpec to deployment
cephadm: populate ports if known and not included in unit.meta
mgr/cephadm: gather and report ports in 'orch ps' output
qa/suites/rados/cephadm/orchestrator_cli: random-distro$ -> 0-random-distro$
qa/suites/rados/cephadm/smoke-roleless: distro -> 0-distro
qa/distros/podman: install kubic once per host, in parallel
qa/suites/fs/multiclient: use clients: not all: for pexec
mgr/cephadm: add info to 'ceph orch upgrade status' in cephadm

Reviewed-by: Michael Fritch <mfritch@suse.com>
Reviewed-by: Juan Miguel Olmo <jolmomar@redhat.com>

commit | commitdiff | tree

Sage Weil [Tue, 23 Mar 2021 00:20:30 +0000 (19:20 -0500)]

Merge PR #40279 into pacific

* refs/pull/40279/head:
mgr/cephadm: identify rgw, cepfs-mirror in servicemap
mgr/ServiceMap: adjust 'ceph -s' summary
rgw: register daemons in servicemap by gid; include id
cephadm: fix rbd-mirror auth name

Reviewed-by: Josh Durgin <jdurgin@redhat.com>

commit | commitdiff | tree

Sage Weil [Sun, 21 Mar 2021 14:07:49 +0000 (09:07 -0500)]

qa/suites/rados/cephadm/upgrade: wait for rgw servicemap entries to refresh

rgw changed the way it registered in the service map. Wait a bit for
the old entries to be flushed out.

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 3f3d955b19a1dcadbe8137b32eb695e8be6b496a)

commit | commitdiff | tree

Sage Weil [Fri, 19 Mar 2021 15:59:46 +0000 (10:59 -0500)]

mgr/cephadm: identify iscsi service by the pool

Since we deploy one of these per pool, name the service by the pool.

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit d8fff6e522d91a56f93c613710e45c232f4450fd)

commit | commitdiff | tree

Kefu Chai [Mon, 22 Mar 2021 06:49:13 +0000 (14:49 +0800)]

qa/distros/podman: install containernetworking-plugins along with podman

/etc/cni/net.d/87-podman-bridge.conflist tries to load "bridge",
"firewall", "tuning" and "portmap" plugins, which are provided by
containernetworking-plugins package.

Fixes: https://tracker.ceph.com/issues/49909
Signed-off-by: Kefu Chai <kchai@redhat.com>
(cherry picked from commit 325c8fce46bee0b5046b2c5ae732678aeeec6629)

commit | commitdiff | tree

Yuri Weinstein [Mon, 22 Mar 2021 15:22:43 +0000 (08:22 -0700)]

Merge pull request #40254 from singuliere/wip-49767-pacific

pacific: librbd: allow interrupted trash move request to be restarted

Reviewed-by: Jason Dillaman <dillaman@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Mon, 22 Mar 2021 15:22:13 +0000 (08:22 -0700)]

Merge pull request #40253 from singuliere/wip-49773-pacific

pacific: librbd/io: send alloc_hint when compression hint is set

Reviewed-by: Jason Dillaman <dillaman@redhat.com>

commit | commitdiff | tree

Sage Weil [Sun, 21 Mar 2021 18:25:06 +0000 (13:25 -0500)]

Merge PR #40247 into pacific

* refs/pull/40247/head:
common: reset last_log_sent when clog_to_monitors is updated
logclient: move LogChannel::set_log_to_monitors(bool v) to LogClient.cc

Reviewed-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Sun, 21 Mar 2021 18:24:25 +0000 (13:24 -0500)]

Merge PR #40246 into pacific

* refs/pull/40246/head:
osd: fix potential null pointer dereference when sending ping

Reviewed-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Sun, 21 Mar 2021 18:23:41 +0000 (13:23 -0500)]

Merge PR #40126 into pacific

* refs/pull/40126/head:
pybind/mgr/balancer/module.py: assign weight-sets to all buckets before balancing

Reviewed-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Sun, 21 Mar 2021 18:22:56 +0000 (13:22 -0500)]

Merge PR #40249 into pacific

* refs/pull/40249/head:
osd: ignore already dumped osd in dump_item()

Reviewed-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Sun, 21 Mar 2021 18:22:08 +0000 (13:22 -0500)]

Merge PR #40248 into pacific

* refs/pull/40248/head:
debian/ceph-common.postinst: do not chown cephadm log dirs

Reviewed-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Kefu Chai [Sun, 21 Mar 2021 17:19:49 +0000 (01:19 +0800)]

Merge pull request #40285 from tchaikov/pacific-pr-40272

pacific: install-deps.sh: remove existing ceph-libboost of different version

Reviewed-by: David Galloway <dgallowa@redhat.com>

commit | commitdiff | tree

Sage Weil [Sun, 21 Mar 2021 14:39:20 +0000 (09:39 -0500)]

Merge PR #40231 into pacific

* refs/pull/40231/head:
mgr/dashboard: check .badge instead of text for expected label
mgr/dashboard: Add badge to the Label column in Host List

Reviewed-by: Avan Thakkar <athakkar@redhat.com>

commit | commitdiff | tree

Sage Weil [Sun, 21 Mar 2021 14:38:58 +0000 (09:38 -0500)]

Merge PR #40209 into pacific

* refs/pull/40209/head:
mgr/dashboard: select any object gateway on local cluster.

Reviewed-by: Avan Thakkar <athakkar@redhat.com>

commit | commitdiff | tree

Sage Weil [Sun, 21 Mar 2021 14:38:49 +0000 (09:38 -0500)]

Merge PR #40129 into pacific

* refs/pull/40129/head:
osd: PeeringState: implement an acting_set_writeable() function
osd: PeeringState: fix a boolean conditional direction
osd: PeeringState: fix stretch peering so PGs can go peered but not active
osd: PeeringState: don't add acting-set OSDs to candidate set in stretch mode
osd: PeeringState: fix calc_replicated_acting_stretch() syntax/logic
osd: PeeringState: respect stretch peering constraints for async recovery
osd: PeeringState: add a comment about using size as a proxy for activateable
osd: check for is_stretch_pool() in stretch_set_can_peer()
scripts: some additions to help with local testing
script: set_up_stretch_mode: include OSDs in root=default so pg creation works

Reviewed-by: Josh Durgin <jdurgin@redhat.com>

commit | commitdiff | tree

Kefu Chai [Sat, 20 Mar 2021 05:00:01 +0000 (13:00 +0800)]

install-deps.sh: remove existing ceph-libboost of different version

we install different versions of precompiled ceph-libboost packages
for different branches when building and testing them on ubuntu test
nodes. for instance,

- nautilus: v1.72
- octopus, pacific: v1.73

they share the same set of test nodes. and these ceph-libboost packages
conflict with each other, because they install files to the same places.

in order to avoid the confliction, we should uninstall existing packages
before installing a different version of ceph-libboost packages.

ceph-libboost${version}-dev is a package providing the shared headers of
boost library, so, in this change we check if it is installed before
returning or removing the existing packages.

Signed-off-by: Kefu Chai <kchai@redhat.com>
(cherry picked from commit 939b147a55192c21e98d21cb380d0ec0b2ca84d5)

Conflicts:
install-deps.sh: trivial resolution

commit | commitdiff | tree

Kefu Chai [Sun, 21 Mar 2021 05:45:47 +0000 (13:45 +0800)]

Merge pull request #40273 from singuliere/wip-49907-pacific

pacific: pybind/mgr/dashboard: bump flake8 to 3.9.0

Reviewed-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Melissa Li [Tue, 16 Mar 2021 05:07:31 +0000 (01:07 -0400)]

python-common: Validate characters in service_id for container names

Service_ids need to be valid docker and podman container names.

Fixes: https://tracker.ceph.com/issues/46497
Signed-off-by: Melissa Li <li.melissa.kun@gmail.com>
(cherry picked from commit 8dd2bf85e759072b4af6546e93ef3768ef9b2db8)

commit | commitdiff | tree

Sage Weil [Fri, 19 Mar 2021 15:57:58 +0000 (10:57 -0500)]

qa/suites/rados/cephadm/smoke-roleless: deploy additional daemon types

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit e48d80671a876ee1fae6b1ddb9f542987e3ce215)

commit | commitdiff | tree

Sage Weil [Thu, 18 Mar 2021 21:27:08 +0000 (17:27 -0400)]

mgr/cephadm: identify rgw, cepfs-mirror in servicemap

Like rbd-mirror, cephfs-mirror and rgw daemons register under their gid.

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 2bd11c4ceb156a398423e4f7ee3131624a86f810)

commit | commitdiff | tree

Sage Weil [Fri, 19 Mar 2021 12:21:18 +0000 (08:21 -0400)]

mgr/ServiceMap: adjust 'ceph -s' summary

- Do not list individual daemon ids as this won't scale for larger
  clusters
- Do not contemplate multile daemons of the same type that register with
  different "daemon_type" -- not until we actually have any that do that.
- Present counts by various groupings: distinct hosts and rgw zones to
  start.

  services:
    mon:           1 daemons, quorum a (age 4m)
    mgr:           x(active, since 3m)
    osd:           1 osds: 1 up (since 3m), 1 in (since 3m)
    cephfs-mirror: 1 daemon active (1 hosts)
    rbd-mirror:    2 daemons active (1 hosts)
    rgw:           2 daemons active (1 hosts, 1 zones)

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit ab0d8f2ae9f551e15a4c7bacbf69161e91263785)

commit | commitdiff | tree

Sage Weil [Fri, 19 Mar 2021 12:25:23 +0000 (08:25 -0400)]

rgw: register daemons in servicemap by gid; include id

Registering by gid allows multiple radosgw instances to share an auth
key/identity. Including the id in the metadata allows them to still be
identified by name (even if not uniquely).

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit afc33758e076761b8d4ec004c8f9c49b80a48770)

commit | commitdiff | tree

Sage Weil [Thu, 18 Mar 2021 20:30:52 +0000 (16:30 -0400)]

cephadm: fix rbd-mirror auth name

Broken by 8fa941b35d89db6a40f7d2912b69eadf40c5004c

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit aa13b25c80aec42fbe8e56107c813123907e218c)

commit | commitdiff | tree

Matthew Cengia [Sun, 14 Mar 2021 23:58:21 +0000 (10:58 +1100)]

cephadm: fix a minor typo in logging message

remove duplicated "to"

Signed-off-by: Matthew Cengia <mattcen@mattcen.com>
(cherry picked from commit bd500e88e225135d24d3aec78cfb0b6db7481dae)

commit | commitdiff | tree

Sage Weil [Fri, 19 Mar 2021 13:16:55 +0000 (08:16 -0500)]

qa/suites/rados/cephadm/dashboard: test on centos

Fixes: https://tracker.ceph.com/issues/49638
Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 04e89d57e73dc240f9c5de438ec63b2b0b4d35a5)

commit | commitdiff | tree

Michael Fritch [Thu, 18 Mar 2021 21:41:06 +0000 (15:41 -0600)]

cephadm: use debug verbosity during container exec

avoid failures from appearing on the consle when exec'ing within the
container during the `ls` command

Signed-off-by: Michael Fritch <mfritch@suse.com>
(cherry picked from commit 46f00a7bd7f38a1e2f3b301cd94c5d22b60bcc5a)

commit | commitdiff | tree

Sage Weil [Fri, 19 Mar 2021 14:46:09 +0000 (10:46 -0400)]

mgr/cephadm/upgrade: do not repeat crash message

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 217ddfeb22ad52f1443bc585c044140d9bf07328)

commit | commitdiff | tree

Sage Weil [Fri, 19 Mar 2021 14:44:19 +0000 (10:44 -0400)]

mgr/cephadm/upgrade: a little less verbose

The _do_upgrade() method runs a zillion times; try to report fewer
repetitive messages on every iteration.

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit e03fffe6489278892b35fc6ae7d2a1d5e8e42844)

commit | commitdiff | tree

Sage Weil [Fri, 19 Mar 2021 14:38:06 +0000 (10:38 -0400)]

mgr/cephadm: don't log not-ok-to-stop at ERR level

This is normal during the upgrade; INF is fine.

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 0d5787c0d1eb16fdbc50ecbb19096d531f9ae0f8)

commit | commitdiff | tree

Sage Weil [Fri, 19 Mar 2021 14:37:37 +0000 (10:37 -0400)]

mgr/cephadm: is presumed -> appears

The old wording was weird.

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 3ea3ee5c09d107b3b51c1a92f1733829b7dd4d1f)

commit | commitdiff | tree

Sage Weil [Fri, 19 Mar 2021 14:37:16 +0000 (10:37 -0400)]

mgr/cephadm: don't double-log ok-to-stop results

The calling upgrade code also reports this.

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit df7af90b89359a0ef0df4eeeec00ce5758d2b738)

commit | commitdiff | tree

Sage Weil [Fri, 19 Mar 2021 14:31:24 +0000 (10:31 -0400)]

mgr/cephadm/upgrade: include upgrade progress in ceph -s

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit efb7ab22a426d648aea0562701c2d77b0df0d119)

commit | commitdiff | tree

Kefu Chai [Fri, 19 Mar 2021 04:24:28 +0000 (12:24 +0800)]

pybind/mgr/dashboard: remove "python_version >= 3'

remove "python_version >= '3'" from requirements-lint.txt, as we've
dropped the Python2 support.

Signed-off-by: Kefu Chai <kchai@redhat.com>
(cherry picked from commit de9a6a4d6c6e20f6ba6ee7798e0a29431d04def9)

commit | commitdiff | tree

Kefu Chai [Fri, 19 Mar 2021 04:05:45 +0000 (12:05 +0800)]

pybind/mgr/dashboard: bump flake8 to 3.9.0

to address the failure of

ERROR: Cannot install -r requirements-lint.txt (line 2) and -r requirements-lint.txt (line 8) because these package versions have conflicting dependencies.

The conflict is caused by:
flake8 3.8.4 depends on pycodestyle<2.7.0 and >=2.6.0a1
autopep8 1.5.6 depends on pycodestyle>=2.7.0

To fix this you could try to:
1. loosen the range of package versions you've specified
2. remove package versions to allow pip attempt to solve the dependency conflict

Signed-off-by: Kefu Chai <kchai@redhat.com>
(cherry picked from commit 152964ca360293d9accd18f435efcd66d145063e)

commit | commitdiff | tree

Yuri Weinstein [Fri, 19 Mar 2021 21:13:45 +0000 (14:13 -0700)]

Merge pull request #40226 from neha-ojha/wip-49895-pacific

pacific: osd: remove a ceph_assert() from a legitimate path

Reviewed-by: Josh Durgin <jdurgin@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Fri, 19 Mar 2021 21:13:14 +0000 (14:13 -0700)]

Merge pull request #40221 from sseshasa/wip-49886-pacific

pacific: qa/tasks: Add additional wait_for_clean() check in lost_unfound tasks.

Reviewed-by: Neha Ojha <nojha@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Fri, 19 Mar 2021 21:12:51 +0000 (14:12 -0700)]

Merge pull request #40197 from neha-ojha/wip-39757-pacific

pacific: qa: Add bluestore resharding test

Reviewed-by: Josh Durgin <jdurgin@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Fri, 19 Mar 2021 21:12:19 +0000 (14:12 -0700)]

Merge pull request #39997 from sseshasa/wip-49699-pacific

pacific: osd: Refinements to mclock built-in profiles implementation.

Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: Josh Durgin <jdurgin@redhat.com>

commit | commitdiff | tree

Jason Dillaman [Wed, 10 Mar 2021 20:31:22 +0000 (15:31 -0500)]

rbd: clarify trash remove error code from interrupted move

Fixes: https://tracker.ceph.com/issues/49716
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
(cherry picked from commit 138d71fb0635682510cadda8e4ad5aaab3f39e44)

commit | commitdiff | tree

Jason Dillaman [Wed, 10 Mar 2021 20:37:39 +0000 (15:37 -0500)]

librbd/trash: don't return -ENOENT error from move state machine

Signed-off-by: Jason Dillaman <dillaman@redhat.com>
(cherry picked from commit f6ed98d682e562de1cad301696e918c52a4dba5d)

commit | commitdiff | tree

Jason Dillaman [Wed, 10 Mar 2021 20:29:11 +0000 (15:29 -0500)]

librbd/api: trash remove/purge should indicate interrupted move

This will help the user self-diagnose that a trash move operation
was interrupted and therefore the state is invalid.

Signed-off-by: Jason Dillaman <dillaman@redhat.com>
(cherry picked from commit c808abea64f00e25c6fd3bcaa7ebf9bc763e7ca0)

commit | commitdiff | tree

Jason Dillaman [Wed, 10 Mar 2021 20:15:26 +0000 (15:15 -0500)]

librbd/api: allow an interrupted trash move to be restarted

Search the trash entries for a matching image name that is
still in the moving state and allow the operation to be
restarted.

Signed-off-by: Jason Dillaman <dillaman@redhat.com>
(cherry picked from commit ed2d696e1eafaa59d29ce6fac952e4e5f4f1e920)

commit | commitdiff | tree

Jason Dillaman [Wed, 10 Mar 2021 19:44:36 +0000 (14:44 -0500)]

librbd/api: helper method for natively listing the trash

The existing list method converts the native TrashImageSpec to the
API's rbd_trash_image_info_t which is missing the source field.

Signed-off-by: Jason Dillaman <dillaman@redhat.com>
(cherry picked from commit 21adc927fe50ae37069d77482edd4c4e098433c9)

commit | commitdiff | tree

Jason Dillaman [Fri, 12 Mar 2021 00:44:15 +0000 (19:44 -0500)]

librbd/io: send alloc_hint when compression hint is set

Previously the hint would not be set if the object map indicated the
object may exist.

Fixes: https://tracker.ceph.com/issues/49690
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
(cherry picked from commit b52b5fe06d1f88130b72b8357dbf5630c7cf1cbd)

commit | commitdiff | tree

jhonxue [Fri, 5 Mar 2021 15:33:10 +0000 (23:33 +0800)]

osd: ignore already dumped osd in dump_item()

Fixes: https://tracker.ceph.com/issues/49627
Signed-off-by: Xue Yantao <jhonxue@tencent.com>
(cherry picked from commit 7813819445e73d1e7f333bd9aaaf42624cd781ec)

commit | commitdiff | tree

Sage Weil [Tue, 9 Mar 2021 17:56:42 +0000 (11:56 -0600)]

debian/ceph-common.postinst: do not chown cephadm log dirs

The container uid/gid is different than the debian uid/gid (because the
container is centos-based and we got a different uid/gid allocation there).

Fixes: https://tracker.ceph.com/issues/49677
Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit b89ffdcae51303f185e1b423a948df353497250f)

commit | commitdiff | tree

Gerald Yang [Wed, 3 Mar 2021 04:37:15 +0000 (04:37 +0000)]

common: reset last_log_sent when clog_to_monitors is updated

When clog_to_monitors is disabled, "last_log" still keeps increasing by
get_next_seq() if OSD writes info to clog

But "last_log_sent" doesn't increase, if we disable clog_to_monitors for
a bit longer and then re-enabling it, the num_unsent could be bigger than
log_queue_size(), it will trigger an assertion in _get_mon_log_message

We need to reset last_log_sent to last_log before updating clog_to_monitors

Signed-off-by: Gerald Yang <gerald.yang@canonical.com>
(cherry picked from commit 294ddf9ba779d40b0bc859e55f5287379c75624f)

commit | commitdiff | tree

Gerald Yang [Thu, 21 Jan 2021 08:16:48 +0000 (08:16 +0000)]

logclient: move LogChannel::set_log_to_monitors(bool v) to LogClient.cc

Signed-off-by: Gerald Yang <gerald.yang@canonical.com>
(cherry picked from commit faf2e099ca58868e0b35e5b6f9639c1ecabb4e16)

commit | commitdiff | tree

Mykola Golub [Sat, 16 Jan 2021 05:00:09 +0000 (05:00 +0000)]

osd: fix potential null pointer dereference when sending ping

Fixes: https://tracker.ceph.com/issues/48821
Signed-off-by: Mykola Golub <mgolub@suse.com>
(cherry picked from commit 86576b09973b857ec2fe8195069e21812992db26)

commit | commitdiff | tree

Jenkins Build Slave User [Fri, 19 Mar 2021 16:54:22 +0000 (16:54 +0000)]

16.1.0

commit | commitdiff | tree

Sage Weil [Wed, 17 Mar 2021 19:49:47 +0000 (15:49 -0400)]

mgr/cephadm: clean up misc messages

- join list with ' '
- key, not keyring
- -ing, not ': '

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit f8c32b0fcc2f56d5ec66623a3c9539f2dc8db3d6)

commit | commitdiff | tree

Sage Weil [Wed, 17 Mar 2021 19:43:33 +0000 (15:43 -0400)]

mgr/cephadm/configcheck: do not spam info every minute

It doesn't make to spam INF every minute. Reducing this to DBG means
it'll never be seen. Just remove it.

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit b828e627d644b91bf49d0f140c4450b9c566164a)

commit | commitdiff | tree

Sage Weil [Wed, 17 Mar 2021 19:39:15 +0000 (15:39 -0400)]

mgr/cephadm: stop conflicting daemon when deploying to a specific port

If we are deploying a daemon to bind to a specific port and there is
an existing daemon we are removing that also binds to that port, stop
it first. Unless we are both binding to different IPs.

This resolves the case where daemons bind to * and we redeploy with a
subnet to bind to. It would eventually converge before, but would
throw a bind error in the process and take longer.

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit a2b7587e04651fd6e3409c421ee9c6cbaa020479)

commit | commitdiff | tree

Sage Weil [Wed, 17 Mar 2021 19:38:57 +0000 (15:38 -0400)]

mgr/cephadm: make DaemonPlacement print nicer

'host(ip:port)' or 'host(*:port)' so we can show it to a user.

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 98fa727cad7b90b0325c51d75fac657ac6aa456c)

commit | commitdiff | tree

Sage Weil [Wed, 17 Mar 2021 18:42:34 +0000 (14:42 -0400)]

mgr/cephadm: fix --force remove comment

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit a40c96d793fb22df8f17a4463c5ef5a9e5fa818f)

commit | commitdiff | tree

Sage Weil [Thu, 11 Mar 2021 23:47:24 +0000 (18:47 -0500)]

mgr/cephadm/schedule: choose an IP from a subnet list

Choose an IP from the subnet list provided by the ServiceSpec.

A few caveats:
- we ignore hosts that don't have IPs in the given subnet
- the subnet matching is STRICT.  That is, the CIDR name has to exactly
match what is configured on the host.  That means you can't just say 10/8
to match any 10.whatever addres--you need the exactly network on the host
(e.g, 10.1.2.0/24).
- If you modify a servicespec and change the networks when there are
already deployed daemons, we will try to deploy the new instances on
the same ports but bound to a specific IP instead of *.  Which will fail.
You need to remove the service first, or remove the old daemons manually
so that creating new ones will succeed.

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 48d33f8a1b30a5d0b88c5e50f9a7d22c7e07a1d2)

commit | commitdiff | tree

Sage Weil [Tue, 16 Mar 2021 16:58:52 +0000 (12:58 -0400)]

mgr/cephadm: rgw: clean up config and config-key values on removal

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 4841093c7643c907934c684800a44e85ce873990)

commit | commitdiff | tree

Sage Weil [Tue, 16 Mar 2021 16:58:03 +0000 (12:58 -0400)]

mgr/cephadm: rgw: drop .crt extension when storing cert in config-key

This will no affect upgrades since we will run the config() method before
prepare_create() any time we deploy a new daemon on this service, which
means we'll re-store the cert in the new key location before we generate
a new rgw_frontends option that references it.

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit f81a4a2278881123649ef50991ab29537b68c225)

commit | commitdiff | tree

Sage Weil [Thu, 11 Mar 2021 23:42:33 +0000 (18:42 -0500)]

mgr/cephadm/services: allow beast/civetweb to bind to a particular IP

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit db5439250726b62b1b5eaad1f0277de302fa73aa)

commit | commitdiff | tree

Sage Weil [Tue, 16 Mar 2021 22:59:56 +0000 (18:59 -0400)]

python-common: add 'networks' property to ServiceSpec

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 6d59d43dd42878ed7e8f2cea9acc13d6822f6360)

commit | commitdiff | tree

Sage Weil [Thu, 11 Mar 2021 23:40:22 +0000 (18:40 -0500)]

mgr/cephadm/schedule: match placement ip only combination with port

1- We only have an IP to bind to if we also have a port, and
2- If we do, we want an exact match: if the DaemonPlacement has ip of
None, then the DaemonDescription should have None too.

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 5db8ff54864f5cb2d72108d00324e22ea3867ff5)

commit | commitdiff | tree

Jason Dillaman [Fri, 19 Mar 2021 12:40:43 +0000 (08:40 -0400)]

Merge pull request #40165 from dillaman/wip-librbd-backports-pacific-9

pacific: librbd: miscellaneous backports

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
Reviewed-by: Ilya Dryomov <idryomov@redhat.com>
Reviewed-by: Mykola Golub <mgolub@suse.com>

commit | commitdiff | tree

Nizamudeen A [Mon, 8 Feb 2021 20:21:25 +0000 (01:51 +0530)]

mgr/dashboard: check .badge instead of text for expected label

this change fixes a regression introduced by
8c5e31ec1a13bc53394eb2cb6880d74db169fac4 which broke the 01-hosts.e2e-spec.ts test
driven by test_dashboard_e2e.sh

Fixes: https://tracker.ceph.com/issues/49205
Signed-off-by: Nizamudeen A <nia@redhat.com>
(cherry picked from commit 6156055a78e63cef0eede0670816a24c3a097b4c)

commit | commitdiff | tree

Nizamudeen A [Tue, 2 Feb 2021 15:12:02 +0000 (20:42 +0530)]

mgr/dashboard: Add badge to the Label column in Host List

Fixes: https://tracker.ceph.com/issues/49105
Signed-off-by: Nizamudeen A <nia@redhat.com>
(cherry picked from commit 8c5e31ec1a13bc53394eb2cb6880d74db169fac4)

commit | commitdiff | tree

Neha Ojha [Fri, 19 Mar 2021 01:40:04 +0000 (18:40 -0700)]

Merge pull request #40228 from neha-ojha/wip-revert-39637

pacific: Revert "PendingReleaseNotes: mgr/pg_autoscaler"

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Kamoltat (Junior) Sirivadhna <ksirivad@redhat.com>

commit | commitdiff | tree

Neha Ojha [Thu, 18 Mar 2021 23:50:46 +0000 (23:50 +0000)]

Revert "PendingReleaseNotes: mgr/pg_autoscaler"

This reverts commit ce45584800f81d1d70d39a76d78778f0ccd73bb2.

Needs reverting since the corresponding code changes were reverted in
https://github.com/ceph/ceph/pull/39921.

Signed-off-by: Neha Ojha <nojha@redhat.com>

commit | commitdiff | tree

Ronen Friedman [Wed, 17 Mar 2021 15:21:10 +0000 (17:21 +0200)]

osd: remove a ceph_assert() from a legitimate path

on_replica_init() might be legitimately called twice,
if the replica was waiting for updates to complete
before servicing the request.

Fixes: https://tracker.ceph.com/issues/49867
Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
(cherry picked from commit 437456ecf9429dd5623cda105e1399234fcc86de)

commit | commitdiff | tree

Matt Benjamin [Thu, 18 Mar 2021 20:04:40 +0000 (16:04 -0400)]

Merge pull request #40180 from linuxbox2/wip-pacific-lcloop

rgw: lc: fix infinite loop in bucket_lc_prepare

commit | commitdiff | tree

Jason Dillaman [Wed, 17 Mar 2021 19:29:37 +0000 (15:29 -0400)]

test: ignore failures to force-enable lockdep

PR #40062 tweaked the behavior of lockdep to compile it out
of the code entirely for release builds. This fixes several
gtests where lockdep was force-enabled.

Signed-off-by: Jason Dillaman <dillaman@redhat.com>
(cherry picked from commit bdc1178bd8a722233743a1b6ad63f79dccb3f8f8)

commit | commitdiff | tree

Jason Dillaman [Wed, 17 Mar 2021 18:14:48 +0000 (14:14 -0400)]

test/pybind/rbd: fixed functional change in encryption API

The encryption format API now also implicitly loads the encryption
layer. This tweaks the tests to account for this functional
difference.

Fixes: https://tracker.ceph.com/issues/49848
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
(cherry picked from commit 625244f999a5ecaf908220d7bc68c81bab01cc6a)

commit | commitdiff | tree

Yin Congmin [Mon, 15 Mar 2021 07:34:35 +0000 (15:34 +0800)]

rbd/cache/pwl: update wait_buffer state and add wake_up

Signed-off-by: Yin Congmin <congmin.yin@intel.com>
(cherry picked from commit 21cc46bb3aaf3315ceeef786710f6874c1ab6e86)

commit | commitdiff | tree

Yin Congmin [Mon, 8 Mar 2021 16:26:04 +0000 (00:26 +0800)]

librbd/cache/pwl: set max size of continuous data

Signed-off-by: Yin Congmin <congmin.yin@intel.com>
(cherry picked from commit bcad92c126526be7ba249322ac3ead0d83b4d188)

commit | commitdiff | tree

Ilya Dryomov [Wed, 17 Mar 2021 10:00:33 +0000 (11:00 +0100)]

qa: krbd_blkroset.t: update for separate hw and user read-only flags

Since kernel 5.12, hardware read-only state and user read-only
policy (BLKROGET/SET ioctls) are tracked separately in the block
layer. As the purpose of our ->set_read_only() method was exactly
that, it was removed.

As a side effect, BLKROSET no longer returns EROFS on an attempt
to make a read-only mapping read-write with "blockdev --setrw".
The policy gets updated, but the device remains read-only as before
because the hardware (== mapping) state is controlled by the driver.

Fixes: https://tracker.ceph.com/issues/49858
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit d72fca26edcff49d203ed6fb940e0cf331e943dd)

commit | commitdiff | tree

Ilya Dryomov [Mon, 15 Mar 2021 19:30:07 +0000 (20:30 +0100)]

krbd: check device node accessibility only if we actually mapped

Fix a braino that came with commit f6854ac65d2a ("krbd: make sure the
device node is accessible after the mapping").

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit 8330c9fa4e27204c768777afe45af0eeb273c835)

commit | commitdiff | tree

Xiubo Li [Thu, 4 Feb 2021 06:14:13 +0000 (14:14 +0800)]

mgr: enhance the rados service

For some use cases, like the tcmu-runner, there maybe handreds or
thousands of LUNs, and then for each LUN it will register one service
daemon, then in the `ceph -s` output will be full of useless info.

This will allow to classify the sevices service daemons in one
specified format by adding two pairs in metadata:

  "daemon_type"   : "${TYPE}"
  "daemon_prefix" : "${PREFIX}"

TYPE: will be used to replace the default "daemon(s)"
showed in `ceph -s`. If absent, the "daemon" will be used.
PREFIX: if present the active members will be classified
by the prefix instead of "daemon_name".

For exmaple for iscsi gateways, it will be something likes:
  "daemon_type"   : "portal"
  "daemon_prefix" : "gw${N}"

Then the `ceph -s` output will be:

  ...
  services:
    mon:   3 daemons, quorum a,b,c (age 50m)
    mgr:   x(active, since 49m)
    mds:   a:1 {0=c=up:active} 2 up:standby
    osd:   3 osds: 3 up (since 49m), 3 in (since 49m)
    iscsi: 8 portals active (gw0, gw1, gw2, gw3, gw4, gw5, gw6, gw7)
  ...

Fixes: https://tracker.ceph.com/issues/49057
Signed-off-by: Xiubo Li <xiubli@redhat.com>
(cherry picked from commit a968f65d784b3d6c6a172929aa293f09e6917fa6)

commit | commitdiff | tree

Rachanaben Patel [Tue, 16 Mar 2021 22:37:46 +0000 (15:37 -0700)]

doc/RBD:fixes for ceph-immutable-object-cache daemon enable command

Document for rbd-persistent-read-only-cache show how to manage
ceph-immutable-object-cache daemon using systemd.
command example needs fixing.It should be

systemctl enable ceph-immutable-object-cache@ceph-immutable-object-cache.{unique id}

Fixes: https://tracker.ceph.com/issues/49849
Signed-off-by: Rachanaben Patel <racpatel@redhat.com>
(cherry picked from commit f000ecb64e6e10c9525cc303e15df477b5670570)

commit | commitdiff | tree

Sridhar Seshasayee [Thu, 4 Mar 2021 13:02:01 +0000 (18:32 +0530)]

osd: Disable sleep times for all best effort clients of mclock

If mClockScheduler is scheduling IOs then the various sleep options
for the best effort clients of mclock viz. pg_delete, snaptrim and
scrub are disabled so as to not affect the QoS being applied.

Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>
(cherry picked from commit 18fab9054ae730ce68dfad1a7e1f4f7da3eb5e01)

commit | commitdiff | tree

Sridhar Seshasayee [Thu, 4 Mar 2021 11:50:27 +0000 (17:20 +0530)]

osd: handle config change for cost per io and cost per byte options

Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>
(cherry picked from commit 33c258a973c9b284194678c5332f918e2ea827b4)

commit | commitdiff | tree

Sridhar Seshasayee [Thu, 4 Mar 2021 11:38:58 +0000 (17:08 +0530)]

osd: Add config options for cost per io & byte for the mclock scheduler

The cost per io and cost per byte options for hdd and ssd are specified
and set to default values determined using experiments on hdds and ssds
using a cost model. The values are used in calc_scaled_cost() to
determine the scaled cost for every OpSchedulerItem that is enqueued
within the mClockScheduler.

Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>
(cherry picked from commit 2da091229bd3a9c4d81fecacb60b918a614aeb84)

commit | commitdiff | tree

Sridhar Seshasayee [Tue, 16 Mar 2021 19:48:40 +0000 (01:18 +0530)]

qa/tasks: Add additional wait_for_clean() check in lost_unfound tasks.

At the end of the lost_unfound tests add an additional wait_for_clean()
check to ensure that recoveries get enough time to complete before
proceeding and avoid failures down the line. For e.g. failure like
"Scrubbing terminated -- not all pgs were active and clean." is because
recoveries on the PGs did not get sufficient time to complete even though
they were bound to eventually complete.

Fixes: https://tracker.ceph.com/issues/49844
Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>
(cherry picked from commit 88df47230b5ad85e95b0be2eca6f5763914b175c)

commit | commitdiff | tree

Sage Weil [Thu, 18 Mar 2021 16:47:14 +0000 (11:47 -0500)]

Merge PR #40119 into pacific

* refs/pull/40119/head:
osd: propagate base pool application_metadata to tiers

Reviewed-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Sage Weil [Thu, 18 Mar 2021 15:30:58 +0000 (10:30 -0500)]

Merge PR #40195 into pacific

* refs/pull/40195/head:
Revert "osd: Try other PGs when reservation failures occur"
Revert "test: Add test for scrub parallelism"

Reviewed-by: Josh Durgin <jdurgin@redhat.com>

commit | commitdiff | tree

Sage Weil [Thu, 18 Mar 2021 15:16:37 +0000 (10:16 -0500)]

Merge PR #40156 into pacific

* refs/pull/40156/head:
qa/tests: changed image path to 'quay.ceph.io/ceph-ci/ceph:octopus'

Reviewed-by: Alfonso Martínez <almartin@redhat.com>
Reviewed-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Thu, 18 Mar 2021 15:16:22 +0000 (10:16 -0500)]

Merge PR #40181 into pacific

* refs/pull/40181/head:
mgr/prometheus: fix typo in get_collect_time_metrics

Reviewed-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Alfonso Martínez [Wed, 24 Feb 2021 07:20:53 +0000 (08:20 +0100)]

mgr/dashboard: select any object gateway on local cluster.

Dashboard backend settings:
- Refactoring: now accepting more than 1 type of value.
- RGW_API_ACCESS_KEY & RGW_API_SECRET_KEY accept string (backward compatibility: legacy behavior) as well as dictionary of strings for connecting multiple daemons.
- Ease of use: deprecated: mgr/dashboard/RGW_API_USER_ID: not useful anymore (kept for backward compatibility).

UI/UX:
- Created context component (to be shown only on rgw-related routes) for selecting operating daemon.
- Daemon selector only shown if there is more than 1 daemon running on a local cluster (to reduce cognitive load).

Fixes: https://tracker.ceph.com/issues/47375
Signed-off-by: Alfonso Martínez <almartin@redhat.com>
(cherry picked from commit 94fe271b06f1e87d37850ac20dd31fa2314e8dfe)

commit | commitdiff | tree

Sage Weil [Mon, 15 Mar 2021 18:47:39 +0000 (13:47 -0500)]

mgr/cephadm: less noise about refreshing hosts

These happen every ~10 minutes and will obscure any real messages of
interest.

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit a4fa7b8cce58eb7a43a80a7fd0ea79703a4ab60f)

Unnamed repository; edit this file 'description' to name the repository.

RSS Atom