git.apps.os.sepia.ceph.com Git

]> git.apps.os.sepia.ceph.com Git - ceph.git/log

Or Ozeri [Tue, 9 Mar 2021 20:14:49 +0000 (22:14 +0200)]

librbd: crypto format api semantics change

This commit alters the semantics of the encryption format api
to also load the encryption after format completes.
Additionally, several other small changes in librbd crypto are included,
in preparation of supporting clone formatting.

Signed-off-by: Or Ozeri <oro@il.ibm.com>
(cherry picked from commit bb8c66772abaa5890f4caa23bda0eb2ad3279221)

commit | commitdiff | tree

Ilya Dryomov [Wed, 24 Feb 2021 10:41:24 +0000 (11:41 +0100)]

qa/suites/krbd: bump scratch image size to 15G

Allow generic/038 and generic/048 to run.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit 37d56e13544fa06b88196b1a4d6edd1b3425325e)

commit | commitdiff | tree

Ilya Dryomov [Fri, 26 Feb 2021 12:54:49 +0000 (13:54 +0100)]

qa/suites/krbd: exclude ext4/002

ext4/002 exercises obsolete EXT4_EOFBLOCKS_FL feature that was removed
in kernel 5.7 and therefore always fails.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit d2bdf0ac43a35f755ea70103f497625d882a4592)

commit | commitdiff | tree

Ilya Dryomov [Sat, 27 Feb 2021 13:58:45 +0000 (14:58 +0100)]

qa/tasks/rbd: fix xfstests_branch/url example

- don't mention xfstests_branch together with xfstests_url as
xfstests_branch gets superseded by xfstests_url
- make xfstests_url example more clear

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit ec36d501ed5f875d26d1362ec097187697170a4d)

commit | commitdiff | tree

Mahati Chamarthy [Thu, 11 Feb 2021 10:24:30 +0000 (15:54 +0530)]

librbd/cache/pwl: Fix IO completion

- Fix deep copy tests and discard I/O completion

- Change read extents to a pointer so that data
persists when multiple extents are pushed into the
vector of bufferlists

- Cleanup tests

Signed-off-by: Mahati Chamarthy <mahati.chamarthy@intel.com>
(cherry picked from commit eeb0b405044bfaade49da0cc635bc05fb0286c0f)

commit | commitdiff | tree

Jason Dillaman [Wed, 3 Mar 2021 19:38:35 +0000 (14:38 -0500)]

qa/objectstore: reduce debug log levels for bluestore

This will help speed up teuthology jobs for non-RADOS suites
where previously tests were IO bound due to excessive logging
and the artifact collection was slowed due to very large OSD
logs.

Signed-off-by: Jason Dillaman <dillaman@redhat.com>
(cherry picked from commit 8889feaaec2d94620fc3944e2f6d7a30dbcc44b1)

commit | commitdiff | tree

Jason Dillaman [Wed, 3 Mar 2021 19:26:38 +0000 (14:26 -0500)]

qa/suites: move RADOS tests to use new debug log objectstores

This will retain the debug log settings for all RADOS suites
that were previously symlinked to the 'objectstore'
directory. The next commit will reduce the debug log level
for the original 'objectstore' directory for the remainder
of tests.

Signed-off-by: Jason Dillaman <dillaman@redhat.com>
(cherry picked from commit 103243725f28cf69252d35cdb7a5985ed260697c)

commit | commitdiff | tree

Kefu Chai [Tue, 9 Mar 2021 09:21:03 +0000 (17:21 +0800)]

Merge pull request #39925 from tchaikov/pacific-github-workflow-milestone

pacific: .github: add workflow for adding milestone

Reviewed-by: Ernesto Puerta <epuertat@redhat.com>

commit | commitdiff | tree

Kefu Chai [Tue, 9 Mar 2021 04:16:26 +0000 (12:16 +0800)]

.github/workflows: use a released sha1

otherwise the action cannot be found

Signed-off-by: Kefu Chai <kchai@redhat.com>
(cherry picked from commit ae8ea10f415ec170be4a7be48567dbac83221dc0)

commit | commitdiff | tree

Kefu Chai [Mon, 8 Mar 2021 18:46:46 +0000 (02:46 +0800)]

.github/workflows: rename labeler.yml to pr-triage.yml

to reflect the change in the actions in it.

Signed-off-by: Kefu Chai <kchai@redhat.com>
(cherry picked from commit 453cd5fd602c54809dc091f0c3a498f903366c8f)

commit | commitdiff | tree

Kefu Chai [Mon, 8 Mar 2021 18:26:34 +0000 (02:26 +0800)]

.github/workflows: use @{sha1} for actions

more secure this way. see also https://julienrenaux.fr/2019/12/20/github-actions-security-risk/

point the sha1 to

* labeler@v3
* milestone@main HEAD

Signed-off-by: Kefu Chai <kchai@redhat.com>
(cherry picked from commit 79e8038046092053b0c0b120e0d7ca07a33a1c00)

commit | commitdiff | tree

Kefu Chai [Sun, 7 Mar 2021 11:21:25 +0000 (19:21 +0800)]

.github/workflows: move milestone.yml into labeler.yml

no need to have two actions triggered by the same event.

we could backport this labeler.yml to LTS branches.

Signed-off-by: Kefu Chai <kchai@redhat.com>
(cherry picked from commit 98b90189791d3f4d2524fd829edb616d0ffd94f7)

commit | commitdiff | tree

Kefu Chai [Sat, 6 Mar 2021 16:32:42 +0000 (00:32 +0800)]

.github: correct the regex in mileston workflow

also use pull_request_target event so the action is run in the
context of the base of the pull request. this helps us to overcome
the "Resource not accessible by integration" issue where the action
is run in the context of the pull request.

Signed-off-by: Kefu Chai <kchai@redhat.com>
(cherry picked from commit c5f6d15e36fdd019196a1520525382f73f276f14)

commit | commitdiff | tree

Kefu Chai [Sat, 6 Mar 2021 13:18:13 +0000 (21:18 +0800)]

.github: add workflow for adding milestone

Signed-off-by: Kefu Chai <kchai@redhat.com>
(cherry picked from commit 1657a44750442ff2582d601fb028aa89e3a95999)

commit | commitdiff | tree

Yuri Weinstein [Mon, 8 Mar 2021 16:08:30 +0000 (08:08 -0800)]

Merge pull request #39741 from tchaikov/pacific-49502

pacific: mgr/rbd_support: bail out if snapshot mirroring is not enabled

Reviewed-by: Mykola Golub <mgolub@mirantis.com>

commit | commitdiff | tree

Sage Weil [Mon, 8 Mar 2021 15:42:59 +0000 (09:42 -0600)]

Merge PR #39856 into pacific

* refs/pull/39856/head:
qa/distro/ubuntu_20.04_podman: Avoid getting asked
qa/suites/rados/cephadm: drop centos/rhel cephadm tests for the moment
qa/sites/rados/cephadm/thrash: rename 3-tasks.yaml/ -> 3-tasks/
qa/suites/rados/cephadm: adjust distros
qa/suites/upgrade: use kubic; test all distros
qa/suites/rados/cephadm/upgrade: use kubic on centos
qa: new kubic distro files; use kubic podman for centos/rhel
qa/suites/rados/cephadm: Add 20.04 podman:testing

Reviewed-by: Sebastian Wagner <swagner@suse.com>

commit | commitdiff | tree

Sebastian Wagner [Mon, 22 Feb 2021 13:58:38 +0000 (14:58 +0100)]

qa/distro/ubuntu_20.04_podman: Avoid getting asked

... for confic changes interactively

Signed-off-by: Sebastian Wagner <sebastian.wagner@suse.com>
(cherry picked from commit 0defae4e65dee123ee0b8474bf3cb417ddff7eb1)

commit | commitdiff | tree

Sage Weil [Fri, 5 Mar 2021 20:35:26 +0000 (14:35 -0600)]

Merge PR #39628 into pacific

* refs/pull/39628/head:
mgr/dashboard: set security headers

Reviewed-by: Nizamudeen A <nia@redhat.com>

commit | commitdiff | tree

Sage Weil [Fri, 5 Mar 2021 20:35:15 +0000 (14:35 -0600)]

Merge PR #39702 into pacific

* refs/pull/39702/head:
mon: simplify fs 'services' view of 'ceph status'
mds: add volumes + status to data: section of 'ceph status'

Reviewed-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Fri, 5 Mar 2021 20:35:05 +0000 (14:35 -0600)]

Merge PR #39727 into pacific

* refs/pull/39727/head:
mgr/telemetry: check if 'ident' channel is active
qa/tasks/mgr/dashboard: identify cluster in telemetry test

Reviewed-by: Yaarit Hatuka <yaarithatuka@gmail.com>

commit | commitdiff | tree

Sage Weil [Fri, 5 Mar 2021 20:34:51 +0000 (14:34 -0600)]

Merge PR #39788 into pacific

* refs/pull/39788/head:
mon/ConfigMonitor: make config changes via KVMonitor's pending set

Reviewed-by: Neha Ojha <nojha@redhat.com>

commit | commitdiff | tree

Sage Weil [Fri, 5 Mar 2021 20:34:28 +0000 (14:34 -0600)]

Merge PR #39805 into pacific

* refs/pull/39805/head:
src/global/signal_handler.h: fix preprocessor logic for alpine

Reviewed-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Sage Weil [Fri, 5 Mar 2021 20:34:16 +0000 (14:34 -0600)]

Merge PR #39811 into pacific

* refs/pull/39811/head:
doc/rbd: clarify QCOW2 migration support

Reviewed-by: Mykola Golub <mgolub@suse.com>

commit | commitdiff | tree

Yuri Weinstein [Fri, 5 Mar 2021 17:16:46 +0000 (09:16 -0800)]

Merge pull request #39806 from liewegas/pr-39665-pacific

pacific: qa/rados/dashbord use "random" objectore

Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: Sebastian Wagner <sebastian.wagner@suse.com>

commit | commitdiff | tree

Yuri Weinstein [Fri, 5 Mar 2021 17:16:03 +0000 (09:16 -0800)]

Merge pull request #39796 from dzafman/wip-48984-pacific

pacific: osd: Don't clear m_planned_scrub until scrub finishes successfully

Reviewed-by: Neha Ojha <nojha@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Fri, 5 Mar 2021 17:15:32 +0000 (09:15 -0800)]

Merge pull request #39773 from myoungwon/pacific-backport-49482

pacific: osd, test: fix to avoid fail notification when testing manifest refcount

Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Fri, 5 Mar 2021 17:14:46 +0000 (09:14 -0800)]

Merge pull request #39753 from ifed01/wip-ifed-bluefs-zero-read-retry-pac

pacific: os/bluestore: Add option to check BlueFS reads

Reviewed-by: Adam Kupczyk <akupczyk@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Fri, 5 Mar 2021 17:14:20 +0000 (09:14 -0800)]

Merge pull request #39748 from liewegas/pr-39631-pacific

pacific: mon/OSDMonitor: do not mark newly created OSDs OUT

Reviewed-by: Neha Ojha <nojha@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Fri, 5 Mar 2021 17:13:23 +0000 (09:13 -0800)]

Merge pull request #39688 from ifed01/wip-ifed-fix-huge-bluefs-writes-pac

pacific: os/bluestore: fix huge(>4GB) writes from RocksDB to BlueFS.

Reviewed-by: Neha Ojha <nojha@redhat.com>

commit | commitdiff | tree

Sage Weil [Thu, 4 Mar 2021 18:33:12 +0000 (12:33 -0600)]

qa/suites/rados/cephadm: drop centos/rhel cephadm tests for the moment

...until we sort out the podman problems! :( :(

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit e42223c4fc8ca42e03a5721ae5b0bc5362f144ce)

commit | commitdiff | tree

Sage Weil [Thu, 4 Mar 2021 18:32:27 +0000 (12:32 -0600)]

qa/sites/rados/cephadm/thrash: rename 3-tasks.yaml/ -> 3-tasks/

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 5f5472b7daede39a7bf35bf9135fc4fc75ac5a86)

commit | commitdiff | tree

Sage Weil [Thu, 4 Mar 2021 18:32:00 +0000 (12:32 -0600)]

qa/suites/rados/cephadm: adjust distros

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 93e77ac3c81b5e6fc363786429b7eee0ac8567c3)

commit | commitdiff | tree

Sage Weil [Wed, 3 Mar 2021 14:23:56 +0000 (08:23 -0600)]

qa/suites/upgrade: use kubic; test all distros

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit b378c042296858ec4d1c63a42cd257580ca11c96)

commit | commitdiff | tree

Sage Weil [Wed, 3 Mar 2021 14:18:19 +0000 (08:18 -0600)]

qa/suites/rados/cephadm/upgrade: use kubic on centos

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit f4de8738ca86b7b31998718b53c8b7659fcdd27f)

commit | commitdiff | tree

Sage Weil [Wed, 3 Mar 2021 14:14:29 +0000 (08:14 -0600)]

qa: new kubic distro files; use kubic podman for centos/rhel

The current centos/rhel version of podman (2.2.1) is broken.

- create new qa/distros/podman/* files that install kubic podman
- include centos/rhel variants
- adjust cephadm jobs to use new yaml files
- remove old qa/distros/all/*_podman.yaml files

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 11beb7287084b56ac7cbd45ac3d28b3d9fa4cc5b)

commit | commitdiff | tree

Sebastian Wagner [Thu, 11 Feb 2021 15:01:57 +0000 (16:01 +0100)]

qa/suites/rados/cephadm: Add 20.04 podman:testing

make sure next podman versions aren't breaking cephadm

Signed-off-by: Sebastian Wagner <sebastian.wagner@suse.com>
(cherry picked from commit 2216350b567102db4ab80dc42dfff9dced2b9740)

commit | commitdiff | tree

Ernesto Puerta [Fri, 5 Mar 2021 09:20:43 +0000 (10:20 +0100)]

Merge pull request #39808 from liewegas/pr-39801-pacific

pacific: mgr/dashboard: fix issues related with PyJWT versions >=2.0.0

Reviewed-by: Laura Paduano <lpaduano@suse.com>
Reviewed-by: Ernesto Puerta <epuertat@redhat.com>

commit | commitdiff | tree

Sage Weil [Thu, 4 Mar 2021 21:08:22 +0000 (15:08 -0600)]

Merge PR #39737 into pacific

* refs/pull/39737/head:
mgr/DaemonServer: osd ok-to-stop: return json when there are unknown PGs
doc/man/8/ceph: document --max option
src/test/osd/safe-to-destroy: adjust test
ceph: print command output to stdout even on error
mgr/DaemonServer: include details in 'osd ok-to-stop' output
mgr: add --max <n> to 'osd ok-to-stop' command
mgr: relax osd ok-to-stop condition on degraded pgs

Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: Josh Durgin <jdurgin@redhat.com>

commit | commitdiff | tree

Ernesto Puerta [Thu, 4 Mar 2021 19:54:58 +0000 (20:54 +0100)]

Merge pull request #39517 from k0ste/wip-49325-pacific

pacific: mgr/dashboard: fix MTU Mismatch alert

Reviewed-by: Aashish Sharma <aasharma@redhat.com>
Reviewed-by: Avan Thakkar <athakkar@redhat.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>

commit | commitdiff | tree

Sage Weil [Thu, 4 Mar 2021 18:49:37 +0000 (12:49 -0600)]

Merge PR #39736 into pacific

* refs/pull/39736/head:
crush/CrushWrapper: rebuild shadow tree on 'osd crush reweight-subtree'
crush/CrushWrapper: update shadow trees on update_item()

Reviewed-by: Josh Durgin <jdurgin@redhat.com>

commit | commitdiff | tree

Sage Weil [Thu, 4 Mar 2021 18:49:12 +0000 (12:49 -0600)]

Merge PR #39734 into pacific

* refs/pull/39734/head:
mon/PGMap: include the per-pool usage breakdown in pg dump

Reviewed-by: Josh Durgin <jdurgin@redhat.com>

commit | commitdiff | tree

Sage Weil [Thu, 4 Mar 2021 18:46:41 +0000 (12:46 -0600)]

Merge PR #39735 into pacific

* refs/pull/39735/head:
mon/OSDMonitor: fix safety/idempotency of {set,rm}-device-class

Reviewed-by: xie xingguo <xie.xingguo@zte.com.cn>

commit | commitdiff | tree

Sage Weil [Thu, 4 Mar 2021 18:46:18 +0000 (12:46 -0600)]

Merge PR #39597 into pacific

* refs/pull/39597/head:
osd: Don't clear m_planned_scrub until scrub finishes successfully

Reviewed-by: Brad Hubbard <bhubbard@redhat.com>

commit | commitdiff | tree

Sage Weil [Thu, 4 Mar 2021 13:35:24 +0000 (08:35 -0500)]

mgr/DaemonServer: osd ok-to-stop: return json when there are unknown PGs

In 791952cc01201010f298033003ba52374cc0159f we switched to return JSON
both on success and fail to describe which PGs are affected or are blocking
the ability to stop/restart OSDs. Do the same for the case where
some PG states are unknown (i.e., just after a mgr restart) so that
the cephadm upgrade process can unconditionally expect a JSON result.

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 2cce16537c9f9e92bf0a8fce746d0364cbb2d15d)

commit | commitdiff | tree

Ernesto Puerta [Thu, 4 Mar 2021 12:54:36 +0000 (13:54 +0100)]

Merge pull request #39524 from rhcs-dashboard/wip-49326-pacific

pacific: mgr/dashboard: avoid using document.write()

Reviewed-by: Laura Paduano <lpaduano@suse.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>

commit | commitdiff | tree

Ernesto Puerta [Thu, 4 Mar 2021 12:53:54 +0000 (13:53 +0100)]

Merge pull request #39647 from rhcs-dashboard/wip-49390-pacific

pacific: mgr/dashboard: fix tooltip for Provisioned/Total Provisioned fields

Reviewed-by: Laura Paduano <lpaduano@suse.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>

commit | commitdiff | tree

Casey Bodley [Wed, 3 Mar 2021 20:59:01 +0000 (15:59 -0500)]

Merge pull request #39816 from cbodley/wip-49585

pacific: valgrind: suppress operator delete[] for tcmalloc

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>

commit | commitdiff | tree

Casey Bodley [Tue, 2 Mar 2021 14:31:08 +0000 (09:31 -0500)]

valgrind: suppress operator delete[] for tcmalloc

suppresses a mismatched operator delete[] from the librados client's
destructor:

<error>
  <unique>0xeef7eb</unique>
  <tid>1</tid>
  <kind>MismatchedFree</kind>
  <what>Mismatched free() / delete / delete []</what>
  <stack>
    <frame>
      <ip>0x4C32EA0</ip>
      <obj>/usr/libexec/valgrind/vgpreload_memcheck-amd64-linux.so</obj>
      <fn>operator delete[](void*, unsigned long)</fn>
      <dir>/builddir/build/BUILD/valgrind-3.16.0/coregrind/m_replacemalloc</dir>
      <file>vg_replace_malloc.c</file>
      <line>660</line>
    </frame>
    <frame>
      <ip>0x5FBEB86</ip>
      <obj>/usr/lib64/librados.so.2.0.0</obj>
    </frame>
    <frame>
      <ip>0x5FBEF2C</ip>
      <obj>/usr/lib64/librados.so.2.0.0</obj>
    </frame>
    <frame>
      <ip>0x5F7F7C5</ip>
      <obj>/usr/lib64/librados.so.2.0.0</obj>
      <fn>librados::v14_2_0::Rados::shutdown()</fn>
    </frame>

Fixes: https://tracker.ceph.com/issues/49501
Signed-off-by: Casey Bodley <cbodley@redhat.com>
(cherry picked from commit bf087c3f840216a704600bb2436c643159421ea2)

commit | commitdiff | tree

Alfonso Martínez [Wed, 3 Mar 2021 14:36:06 +0000 (15:36 +0100)]

mgr/dashboard: fix issues related with PyJWT versions >=2.0.0

Fixes: https://tracker.ceph.com/issues/49574
Signed-off-by: Alfonso Martínez <almartin@redhat.com>
(cherry picked from commit 4b96bb51e8f133badd7bc651bcb4dcd755b43d75)

commit | commitdiff | tree

Jason Dillaman [Tue, 2 Mar 2021 22:29:29 +0000 (17:29 -0500)]

doc/rbd: clarify QCOW2 migration support

Signed-off-by: Jason Dillaman <dillaman@redhat.com>
(cherry picked from commit 68fda590cf88e6b232dfb573236fe3ae3949db28)

commit | commitdiff | tree

Kefu Chai [Wed, 3 Mar 2021 14:07:08 +0000 (22:07 +0800)]

Merge pull request #39568 from tchaikov/pr-39365-pacific

pacific: mgr/devicehaelth: extract and present SSD wear

Reviewed-by: Alfonso Martínez <almartin@redhat.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>
Reviewed-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sebastian Wagner [Wed, 24 Feb 2021 12:01:54 +0000 (13:01 +0100)]

qa/rados/dashbord use "random" objectore

1. removed symlink to `.qa/objectstore`
2. added `random-objectstore&` to `.qa/objectstore`

Signed-off-by: Sebastian Wagner <sebastian.wagner@suse.com>
(cherry picked from commit 72229a098b878036e2738147814898bc65efcaea)

commit | commitdiff | tree

Duncan Bellamy [Thu, 25 Feb 2021 12:05:56 +0000 (12:05 +0000)]

src/global/signal_handler.h: fix preprocessor logic for alpine
Fixes: https://tracker.ceph.com/issues/49494
Co-authored-by: Kefu Chai <tchaikov@gmail.com>
Signed-off-by: Duncan Bellamy <dunk@denkimushi.com>
(cherry picked from commit 387f33e7265825ae4cdd3580289f5154e250b05b)

commit | commitdiff | tree

Jan Fajerski [Wed, 3 Mar 2021 08:40:13 +0000 (09:40 +0100)]

Merge PR #39525 into pacific

* refs/pull/39525/head:
ceph-volume: pass --filter-for-batch from drive-group subcommand

Reviewed-by: Rishabh Dave <ridave@redhat.com>

commit | commitdiff | tree

Jan Fajerski [Wed, 3 Mar 2021 08:31:24 +0000 (09:31 +0100)]

Merge PR #39468 into pacific

* refs/pull/39468/head:
Update batch.py

Reviewed-by: Rishabh Dave <ridave@redhat.com>

commit | commitdiff | tree

Sage Weil [Sun, 21 Feb 2021 16:40:06 +0000 (10:40 -0600)]

qa/tasks/mgr/dashboard: add wear_level to devices schema

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 6f6792cf51f9042452bc79df30b3a4d02dd5ef96)

commit | commitdiff | tree

Sage Weil [Thu, 18 Feb 2021 14:27:49 +0000 (08:27 -0600)]

mgr/devicehealth: extract+store wear level from metrics scraping

When we scrape and store health metrics for a device, extract the wear
level from the JSON. If present, also store it in the config-key
per-device metadata.

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 4840507cfcdd5182003671994d0bc9604d072e3e)

Conflicts:
src/pybind/mgr/devicehealth/module.py
- type annotations for _get_device_metrics

commit | commitdiff | tree

Sage Weil [Mon, 8 Feb 2021 18:54:04 +0000 (12:54 -0600)]

mgr/DaemonServer: show wear level in 'device ls' output

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 8f93e3b55351df0bc8308e0cace482aabb15e8c8)

commit | commitdiff | tree

Sage Weil [Mon, 8 Feb 2021 18:53:51 +0000 (12:53 -0600)]

mgr/DaemonState: store optional wear_level for devices

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit c52fcb51afb6da2dccd6954a40f0f1c616324855)

commit | commitdiff | tree

Sage Weil [Mon, 8 Feb 2021 18:53:24 +0000 (12:53 -0600)]

common/blkdev: collect non-SMART data too

Call smartctl with -x instead of -a:

       -a, --all
              Prints  all SMART information about the disk, or TapeAlert infor‐
              mation about the tape drive or changer.  For ATA devices this  is
              equivalent to
              '-H -i -c -A -l error -l selftest -l selective'
              and for SCSI, this is equivalent to
              '-H -i -A -l error -l selftest'.
              For NVMe, this is equivalent to
              '-H -i -c -A -l error'.
              Note  that  for  ATA  disks  this  does  not enable the non-SMART
              options and the SMART options which require  support  for  48-bit
              ATA commands.

vs

       -x, --xall
              Prints all SMART and non-SMART information about the device.  For
              ATA devices this is equivalent to
              '-H -i -g all -g wcreorder -c -A  -f  brief  -l  xerror,error  -l
              xselftest,selftest -l selective -l directory -l scttemp -l scterc
              -l devstat -l defects -l sataphy'.
              and for SCSI, this is equivalent to
              '-H -i -g all -A -l error -l selftest -l background -l sasphy'.
              For NVMe, this is equivalent to
              '-H -i -c -A -l error'.

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 176bb54da90e404646c0e80e0c3ea804bb8a58d6)

commit | commitdiff | tree

Sage Weil [Mon, 8 Feb 2021 16:41:48 +0000 (10:41 -0600)]

mgr/DaemonState: fix serialization/storage of device life expectancy min/max

These weren't encoded as strings, which means they were effectively
forgotton on each mgr restart. Sigh.

Fixes: https://tracker.ceph.com/issues/49215
Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 78b72b0be752fce855203b28b558133a0d47eba8)

commit | commitdiff | tree

Kefu Chai [Fri, 29 Jan 2021 02:30:19 +0000 (10:30 +0800)]

pybind/mgr: silence flake8 warnings

silence following warnings from flake8

* line too long (107 > 100 characters) [E501]

Signed-off-by: Kefu Chai <kchai@redhat.com>
(cherry picked from commit 367f264100778717890b7fb69668e44f0a22d8ce)

commit | commitdiff | tree

Kefu Chai [Wed, 27 Jan 2021 04:07:07 +0000 (12:07 +0800)]

pybind/mgr/mgr_module: correct annotation

Signed-off-by: Kefu Chai <kchai@redhat.com>
(cherry picked from commit 19cdc4325ce47bef951fdab41fa15e6acadea1f7)

commit | commitdiff | tree

Kefu Chai [Wed, 27 Jan 2021 06:34:35 +0000 (14:34 +0800)]

mgr/devicehealth: remove handle_command()

as commands are collected by COMMANDS already.

Signed-off-by: Kefu Chai <kchai@redhat.com>
(cherry picked from commit 1361b3060e7643a3a0f2f59b72eb5d9201c21bcd)

commit | commitdiff | tree

Kefu Chai [Wed, 27 Jan 2021 06:58:01 +0000 (14:58 +0800)]

mgr/devicehealth: return -1 if prediction plugin is not available

Signed-off-by: Kefu Chai <kchai@redhat.com>
(cherry picked from commit bb611c64ee5120272215683b4b5ac4397f9c5ea6)

commit | commitdiff | tree

Kefu Chai [Wed, 27 Jan 2021 04:08:11 +0000 (12:08 +0800)]

mgr/devicehealth: add more annotations

Signed-off-by: Kefu Chai <kchai@redhat.com>
(cherry picked from commit 48438cccedb383e19cab743dd0222d6ec6265588)

commit | commitdiff | tree

David Zafman [Wed, 3 Mar 2021 00:24:53 +0000 (16:24 -0800)]

osd: Don't clear m_planned_scrub until scrub finishes successfully

Fixes: https://tracker.ceph.com/issues/48984
Signed-off-by: David Zafman <dzafman@redhat.com>
(cherry picked from commit 0b63c850a251f75bcb7436efefe4f24480c4ed7e)

commit | commitdiff | tree

J. Eric Ivancich [Tue, 2 Mar 2021 21:07:17 +0000 (16:07 -0500)]

Merge pull request #39769 from ivancich/wip-49185-pacific

pacific: rgw: tooling to locate rgw objects with missing rados components

Reviewed-by: Michael Kidd <linuxkidd@gmail.com>

commit | commitdiff | tree

J. Eric Ivancich [Tue, 2 Mar 2021 21:06:31 +0000 (16:06 -0500)]

Merge pull request #39764 from ivancich/wip-49194-pacific

pacific: rgw: allow rgw-orphan-list to handle intermediate files w/ binary data

Reviewed-by: Thomas Serlin <tserlin@redhat.com>

commit | commitdiff | tree

Jason Dillaman [Tue, 2 Mar 2021 20:06:04 +0000 (15:06 -0500)]

Merge pull request #39675 from dillaman/wip-librbd-backports-pacific-6

pacific: librbd: miscellaneous backports

Reviewed-by: Mykola Golub <mgolub@suse.com>

commit | commitdiff | tree

Yuri Weinstein [Tue, 2 Mar 2021 19:32:13 +0000 (11:32 -0800)]

Merge pull request #39635 from vshankar/wip-49425

pacific: cephfs-mirror: run mirror daemon with valgrind

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Tue, 2 Mar 2021 19:01:19 +0000 (11:01 -0800)]

Merge pull request #39508 from k0ste/wip-49315-pacific

pacific: mgr/dashboard: prometheus alerting: add some leeway for package drops and errors

Reviewed-by: Avan Thakkar <athakkar@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Tue, 2 Mar 2021 19:00:20 +0000 (11:00 -0800)]

Merge pull request #39466 from petrutlucian94/wip-49250-pacific

pacific: common: Add Windows ETW support

Reviewed-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Sage Weil [Fri, 26 Feb 2021 16:42:52 +0000 (11:42 -0500)]

mon/ConfigMonitor: make config changes via KVMonitor's pending set

We need to ensure that changes we make to the kv store (config/...)
are proposed via KVMonitor so that they are properly versioned there
and shared with subscribers (notably, the mgr).

Fixes: bb7ebc41532aeb23cff2241ab07b3f01c2f57ddd
Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit dab72abd0ae8a3038f73dbe0983b2eaef3937ef6)

commit | commitdiff | tree

Ernesto Puerta [Tue, 2 Mar 2021 10:37:18 +0000 (11:37 +0100)]

Merge pull request #39752 from rhcs-dashboard/wip-49546-pacific

pacific: mgr/dashboard: Splitting tenant$user when creating rgw user

Reviewed-by: Aashish Sharma <aasharma@redhat.com>
Reviewed-by: Alfonso Martínez <almartin@redhat.com>
Reviewed-by: Avan Thakkar <athakkar@redhat.com>

commit | commitdiff | tree

myoungwon oh [Tue, 16 Feb 2021 05:42:44 +0000 (14:42 +0900)]

osd, test: wait if the snapshot is deleting

After calling selfmanaged_snap_remove, we don't know
when trimming snapshot is finished.
So, we make the OSD to return EBUSY if the snapshot in removed_snap_queue,
then the unit test waits the completion

Signed-off-by: Myoungwon Oh <myoungwon.oh@samsung.com>
(cherry-picked from commit d6f9f23012b9ac133e767bd07a804707ba2207ef)

commit | commitdiff | tree

myoungwon oh [Mon, 18 Jan 2021 03:16:32 +0000 (12:16 +0900)]

src/test: fix to avoid fail notification when testing manifest refcount

Due to false-positive design on manifest snap refcounting,
a message to decrement the refcount can be missing.
This commit checks whether the manifest object's state is correct
when such mismatch happens to prevent aborting unit test.

Signed-off-by: Myoungwon Oh <myoungwon.oh@samsung.com>
(cherry-picked from commit d0369dc73ee9a911e39642f3b63c4d0f17b04ce3)

commit | commitdiff | tree

myoungwon oh [Fri, 15 Jan 2021 09:38:12 +0000 (18:38 +0900)]

osd: remove unnecessary check

Signed-off-by: Myoungwon Oh <myoungwon.oh@samsung.com>
(cherry-picked from commit 1109a59e2941cf4b16798cd04941daf350ec0d22)

commit | commitdiff | tree

myoungwon oh [Fri, 8 Jan 2021 06:17:16 +0000 (15:17 +0900)]

osd, test: return the number of references of chunked object

Signed-off-by: Myoungwon Oh <myoungwon.oh@samsung.com>
(cherry-picked from commit cb7cf5ba545386e1bf010d655f0736d0ef5a29ae)

commit | commitdiff | tree

myoungwon oh [Tue, 5 Jan 2021 12:47:37 +0000 (21:47 +0900)]

test: add cls_has_chunk test to count chunk in snapshot

Signed-off-by: Myoungwon Oh <myoungwon.oh@samsung.com>
(cherry-picked from commit 11b52b4d00c7d56f0a571fd9208740932b0d00dd)

commit | commitdiff | tree

myoungwon oh [Tue, 5 Jan 2021 12:40:36 +0000 (21:40 +0900)]

osd: add has_manifest_chunk() to check chunks in snapshot

cls_has_chunk does not cover snapshotted manifest object.
This leads to unexpected behavior during chunk scrub.

Signed-off-by: Myoungwon Oh <myoungwon.oh@samsung.com>
(cherry-picked from commit bdcedd4e9f2efed59b1e5c2d40171e6da595a259)

commit | commitdiff | tree

J. Eric Ivancich [Fri, 29 Jan 2021 17:03:50 +0000 (12:03 -0500)]

rgw: add rgw-gap-list-comparator tool

The rgw-gap-list tool can produce a number of false positives when the
cluster is being used during its run. One technique to minimize the
number of false positives is to run the tool twice and look for the
objects that appear in both lists. The rgw-gap-list-comparator tool is
designed to do this comparison.

Signed-off-by: Michael Kidd <linuxkidd@gmail.com>
(cherry picked from commit 3c5154f010a34f9845b40e49a7870771e7f5372e)

commit | commitdiff | tree

J. Eric Ivancich [Thu, 17 Dec 2020 23:21:36 +0000 (18:21 -0500)]

rgw: add rgw-gap-list tool

Due to a prior bug (pr: 38228) tail rados objects of some RGW objects
could have been incorrectly deleted. This tool is designed to look for
such cases. It essentially does the opposite of rgw-orphan-list,
looking for rados objects that RGW expects to be there, but which are
not to be found.

IMPORTANT: This is very experimental at this point in time, and any
"results" produced should be verified by other means.

Signed-off-by: J. Eric Ivancich <ivancich@redhat.com>
Signed-off-by: Michael Kidd <linuxkidd@gmail.com>
(cherry picked from commit 07b42195fbbcd27e330cb1daa35e77e0952f8a3c)

commit | commitdiff | tree

J. Eric Ivancich [Thu, 17 Dec 2020 23:32:20 +0000 (18:32 -0500)]

rgw: add radosgw-admin bucket radoslist bucket/obj mapping

A new command-line option "--rgw-obj-fs" is added to
radosgw-admin. When used with the "bucket radoslist" subcommand, will
output lines with a rados object, bucket name, and object name,
separated by the field separator specified. Without this command-line
option, only the rados object is output, which is the previous
behavior.

Signed-off-by: J. Eric Ivancich <ivancich@redhat.com>
(cherry picked from commit 81d6260c1a42400bfa8c677c419e2c3f76d7a920)

commit | commitdiff | tree

J. Eric Ivancich [Thu, 4 Feb 2021 21:19:17 +0000 (16:19 -0500)]

rgw: allow rgw-orphan-list to handle intermediate files w/ binary data

Updates the grep commands with the --text command-line option since
there are cases where rados object names contain NUL characters.

Signed-off-by: J. Eric Ivancich <ivancich@redhat.com>
(cherry picked from commit b520cc5a910ea4fba0d2023b13e22b4166f767c7)

commit | commitdiff | tree

YuanXin [Sat, 27 Feb 2021 04:01:40 +0000 (12:01 +0800)]

test/librbd: resolve compile error on centos

Signed-off-by: YuanXin <yuanxin@didiglobal.com>
(cherry picked from commit 49a63a594038f96db235d20e82bcda9963f8ded8)

commit | commitdiff | tree

YuanXin [Sat, 27 Feb 2021 03:37:19 +0000 (11:37 +0800)]

rbd: use portable zero-ing memory function

Signed-off-by: YuanXin <yuanxin@didiglobal.com>
(cherry picked from commit 72c04a9d319520c29dfdf952408beb12ed73a88e)

commit | commitdiff | tree

Yuri Weinstein [Mon, 1 Mar 2021 18:19:59 +0000 (10:19 -0800)]

Merge pull request #39188 from liewegas/pr-39173-pacific

pacific: mgr/devicehealth: do not create pool until there are enough osds

Reviewed-by: Neha Ojha <nojha@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Mon, 1 Mar 2021 17:37:56 +0000 (09:37 -0800)]

Merge pull request #39189 from liewegas/pr-39172-pacific

pacific: mgr/orchestrator: remove 'host' arg for 'orch ls'

Reviewed-by: Neha Ojha <nojha@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Mon, 1 Mar 2021 17:36:25 +0000 (09:36 -0800)]

Merge pull request #39236 from k0ste/wip-49108-pacific

pacific: mgr/zabbix: format ceph.[{#POOL},percent_used as float

Reviewed-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Mon, 1 Mar 2021 17:21:12 +0000 (09:21 -0800)]

Merge pull request #39663 from pponnuvel/wip-49442-pacific

pacific: rgw: radosgw-admin: clarify error when email address already in use

Reviewed-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Mon, 1 Mar 2021 17:20:33 +0000 (09:20 -0800)]

Merge pull request #39048 from smithfarm/wip-48971-pacific

pacific: rgw: Initialize Layout Logs

Reviewed-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

Adam Kupczyk [Sat, 30 Jan 2021 11:57:05 +0000 (12:57 +0100)]

os/bluestore: Add option to check BlueFS reads

Add option "bluefs_check_for_zeros" to check if there are any zero-filled page.
If so, reread data. It is known that sometimes BlueStore gets such pages.
See "bluestore_retry_disk_reads".

Signed-off-by: Adam Kupczyk <akupczyk@redhat.com>
(cherry picked from commit 7e495c57fe0ec8d8bdb1dbff90b177e656a22cbe)

commit | commitdiff | tree

Nizamudeen A [Sat, 16 Jan 2021 14:40:36 +0000 (20:10 +0530)]

mgr/dashboard: Splitting tenant$user when creating rgw user

Fixes: https://tracker.ceph.com/issues/47378
Signed-off-by: Nizamudeen A <nia@redhat.com>
(cherry picked from commit 7f4387d34fd073a3b0d8c828fecdc5df4b498122)

commit | commitdiff | tree

Sage Weil [Sun, 28 Feb 2021 16:29:53 +0000 (11:29 -0500)]

osd/OSDMap: don't warn on NEW osd ids

If we allocate a new OSD, don't raise a health alert about it.

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 7aba1840fb08db3f4f5b17eb96780a038c01349a)

commit | commitdiff | tree

Sage Weil [Wed, 24 Feb 2021 22:05:55 +0000 (17:05 -0500)]

mon/OSDMonitor: behave if inc map sets weight on not-yet-existing OSD

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit 678dc40cb2772340f6e3764825f5a3ee17cac9ec)

commit | commitdiff | tree

Sage Weil [Mon, 22 Feb 2021 23:48:14 +0000 (18:48 -0500)]

mon/OSDMonitor: do not mark newly created OSDs OUT

Currently, new OSDs are marked OUT.

This behavior appears to date back all the way to the 'osd new' command
in c9e6cac1cfae73cc4bce4a995ed2bba5dc85ae4b, and for 'osd create' from
118f081b3c64bc36a8b9825e3373587a334c672b.  The first commit has no
real explanation, but it presumably inherited it from the second. That
second commit, though, says

    if we are creating an osd which has the same id as a previously
    removed 'in' osd, we should not mark this newly created osd as 'in'

This isn't actually a good idea, however.  If we are creating (or reusing)
a new OSD id, the OSD that starts up will have no data.  So no matter what
there will be a data migration from the before state to the final state.
If we mark the osd OUT when the osd id is allocated but before the OSD
starts up, we'll create a middle state where PGs are mapped to the id (by
virtue of the CRUSH weight) and then remapped away (due to out), creating
a middle state where a bunch of PGs will repeer and maybe data will move.

Instead, we have two cases:

1) If we are reusing a DESTROYED osd id, we should leave the in/out
state the way it was.  This way we still go straight from the before
state to the after state (the osd will mark itself in when it starts up).

2) If we are allocating a new id in do_osd_create(), we want the OSD
to be IN, so there is no middle state.  Unfortunately, we have to work
around apply_incremental() being obnoxious here: it's sloppy implementation
will implicitly set EXISTS by virtue of new_osd_weight (the mark IN part)
before applying the osd_state XOR, so be careful!  (This behavior is
mirrored by the Linux kernel implementation too, thankfully.)

Signed-off-by: Sage Weil <sage@newdream.net>
(cherry picked from commit c8f021cb85b4375faf91af1d580dc8e2e8669395)

commit | commitdiff | tree

Sage Weil [Sun, 28 Feb 2021 16:07:12 +0000 (10:07 -0600)]

Merge PR #39483 into pacific

* refs/pull/39483/head:
mgr/cephadm: based sleep interval on configured intervals/timeouts

Reviewed-by: Juan Miguel Olmo <jolmomar@redhat.com>

commit | commitdiff | tree

Sage Weil [Sun, 28 Feb 2021 16:06:56 +0000 (10:06 -0600)]

Merge PR #39595 into pacific

* refs/pull/39595/head:
mon/KVMonitor: fix 'osd new' cross-service commit
mgr/ActivePyModules.cc: use wrappers for acquiring/releasing GIL
mgr: rename config_prefix -> mgr_store_prefix
mgr: fix config option prefix
mgr/MgrStandby: simplify weird code
mgr_module: get_foreign_ceph_option(entity, name)
mgr: maintain a ConfigMap copy
mon/ConfigMap: factor out parse_key helper
mon/ConfigMap: fix stray option leak
mgr: use new kv subscription for mgr/, device/, config/
mon: allow subscription to kv/config-key data
mon: convert ConfigKeyService -> KVMonitor

Reviewed-by: Neha Ojha <nojha@redhat.com>

commit | commitdiff | tree

Kefu Chai [Wed, 10 Feb 2021 08:30:49 +0000 (16:30 +0800)]

mgr/rbd_support: bail out when snapshot mirroring is not enabled

before this change, we continue on and try to get mirror info of
specified image after calling close_image(), even the snapshot mirroring
is not enabled.

after this change, we bail out after calling close_image(). this
behavior is consistent with other places where we handle errors.

Fixes: https://tracker.ceph.com/issues/49355
Signed-off-by: Kefu Chai <kchai@redhat.com>
(cherry picked from commit 8372d9370b530f953acd5bd6502c46fd11f43109)

Unnamed repository; edit this file 'description' to name the repository.

RSS Atom