]> git.apps.os.sepia.ceph.com Git - ceph.git/log
ceph.git
6 years agoosd: remove orphan include after PGLSParentFilter. 29709/head
Radoslaw Zarzynski [Fri, 16 Aug 2019 14:15:34 +0000 (16:15 +0200)]
osd: remove orphan include after PGLSParentFilter.

Signed-off-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
6 years agoMerge pull request #28824 from dang/zipper
Casey Bodley [Fri, 16 Aug 2019 13:49:28 +0000 (09:49 -0400)]
Merge pull request #28824 from dang/zipper

rgw: Project Zipper part 1

Reviewed-by: Casey Bodley <cbodley@redhat.com>
6 years agoMerge pull request #29621 from xiexingguo/wip-build-push-segv
Kefu Chai [Fri, 16 Aug 2019 12:25:55 +0000 (20:25 +0800)]
Merge pull request #29621 from xiexingguo/wip-build-push-segv

osd/ReplicatedBackend: check against empty data_included before enabling crc

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
6 years agoMerge pull request #29579 from liewegas/wip-big-vs-bluestore
Kefu Chai [Fri, 16 Aug 2019 12:24:43 +0000 (20:24 +0800)]
Merge pull request #29579 from liewegas/wip-big-vs-bluestore

osd: scrub error on big objects; make bluestore refuse to start on big objects

Reviewed-by: David Zafman <dzafman@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
6 years agoMerge pull request #29663 from cxytt/bugfix
Kefu Chai [Fri, 16 Aug 2019 12:22:32 +0000 (20:22 +0800)]
Merge pull request #29663 from cxytt/bugfix

osd/PrimaryLogPG: Avoid accessing destroyed references in finish_degr…

Reviewed-by: Kefu Chai <kchai@redhat.com>
6 years agoMerge pull request #29646 from zy751713126/get_pool_name_and_id
Kefu Chai [Fri, 16 Aug 2019 12:21:34 +0000 (20:21 +0800)]
Merge pull request #29646 from zy751713126/get_pool_name_and_id

pybind/rados: add Ioctx.get_pool_id() and Ioctx.get_pool_name()

Reviewed-by: Kefu Chai <kchai@redhat.com>
6 years agoMerge pull request #29675 from tchaikov/wip-remove-parent-pgls-filter
Kefu Chai [Fri, 16 Aug 2019 12:05:00 +0000 (20:05 +0800)]
Merge pull request #29675 from tchaikov/wip-remove-parent-pgls-filter

osd/PrimaryLogPG: remove unused "parent" pgls-filter

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
6 years agoMerge pull request #29597 from tchaikov/wip-qa/tasks/cbt
Kefu Chai [Fri, 16 Aug 2019 11:17:58 +0000 (19:17 +0800)]
Merge pull request #29597 from tchaikov/wip-qa/tasks/cbt

qa/tasks/cbt.py: use "git --depth 1 for" faster clone

Reviewed-by: Neha Ojha <nojha@redhat.com>
6 years agoMerge pull request #29683 from jan--f/c-v-keep-device-list-as-lists
Alfredo Deza [Fri, 16 Aug 2019 11:00:50 +0000 (07:00 -0400)]
Merge pull request #29683 from jan--f/c-v-keep-device-list-as-lists

ceph-volume: don't keep device lists as sets

Reviewed-by: Alfredo Deza <adeza@redhat.com>
6 years agoMerge PR #29676 into master
Sage Weil [Thu, 15 Aug 2019 19:03:37 +0000 (14:03 -0500)]
Merge PR #29676 into master

* refs/pull/29676/head:
test/unittest_bluefs: always remove temp bdev file

Reviewed-by: Sage Weil <sage@redhat.com>
6 years agoMerge PR #29581 into master
Sage Weil [Thu, 15 Aug 2019 19:02:34 +0000 (14:02 -0500)]
Merge PR #29581 into master

* refs/pull/29581/head:
os/bluestore: do not set osd_memory_target default from cgroup limit

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Mark Nelson <mnelson@redhat.com>
6 years agoMerge PR #29577 into master
Sage Weil [Thu, 15 Aug 2019 19:02:00 +0000 (14:02 -0500)]
Merge PR #29577 into master

* refs/pull/29577/head:
os/bluestore/KernelDevice: fix RW_IO_MAX constant
os/bluestore/KernelDevice: print aio error extent in hex

Reviewed-by: Kefu Chai <kchai@redhat.com>
6 years agoMerge pull request #29578 from theanalyst/rgw-user-policy-urlencode
Casey Bodley [Thu, 15 Aug 2019 18:02:03 +0000 (14:02 -0400)]
Merge pull request #29578 from theanalyst/rgw-user-policy-urlencode

rgw: url decode PutUserPolicy params

Reviewed-by: Casey Bodley <cbodley@redhat.com>
Reviewed-by: Pritha Srivastava <prsrivas@redhat.com>
Reviewed-by: Matt Benjamin <mbenjamin@redhat.com>
6 years agoMerge pull request #29670 from linuxbox2/wip-rgwfile-marker
Matt Benjamin [Thu, 15 Aug 2019 17:33:27 +0000 (13:33 -0400)]
Merge pull request #29670 from linuxbox2/wip-rgwfile-marker

rgw_file: readdir: do not construct markers w/leading '/'

6 years agoMerge PR #29422 into master
Sage Weil [Thu, 15 Aug 2019 17:28:26 +0000 (12:28 -0500)]
Merge PR #29422 into master

* refs/pull/29422/head:
qa/tasks/mgr/dashboard/test_health: update schema
doc/rados/operations/monitoring: document muting health alerts
qa/standalone/mon/health-mutes: add tests
doc/rados/operations/health-checks: document MON_DISK_{LOW,CRIT,BIG}
doc/rados/operations/health-checks: document OSD_NO_DOWN_OUT_INTERVAL
doc/rados/operations/health-checks: document AUTH_BAD_CAPS
doc/reados/operations/health-checks: document PG_SLOW_SNAP_TRIMMING
doc/rados/operations/health-checks: document MGR_DOWN
mon/HealthCheck: check mutes based on count, not parsing the summary string
mon/health_checks: associate a count with health_alert_t
mon/HealthMonitor: simplify health alert dump
mon/PGMap: use nice timespan for PG stuck warnings
mon/HealthMonitor: allow muted alert counts to decrease but not increase
mon/PGMap: fix summary form for bluestore health alerts
doc/rados/operations/health-alerts: document BLUESTORE_NO_COMPRESSION
mon/PGMap: fix summary form for POOL_APP_NOT_ENABLED
mon/HealthMonitor: persist summary for non-sticky mutes
mon/HealthMonitor: move get_health_status()
mon/HealthMonitor: automatically clear non-sticky mutes when alert clears
mon/HealthMonitor: add gather_all_health_checks helper
mon/HealthMonitor: add sticky flag to mutes
mon/HealthMonitor: expire mutes based on ttl
mon: apply mutes to health [detail]
mon/HealthMonitor: implement mute and unmount commands
mon/HealthMonitor: maintain list of mutes
mon: refactor/simplify health [detail]
mon/health_checks: format 'health summary' with a colon
mon/health_checks: drop dump_summary_compat

Reviewed-by: Neha Ojha <nojha@redhat.com>
6 years agoMerge PR #29537 into master
Sage Weil [Thu, 15 Aug 2019 17:26:29 +0000 (12:26 -0500)]
Merge PR #29537 into master

* refs/pull/29537/head:
os/bluestore/BlueFS: fix device_migrate_to_* to handle varying alloc sizes
os/bluestore/BlueFS: apply shared_alloc_size to shared device
os/bluestore: whitespace
os/bluestore/BlueFS: add bluefs_shared_alloc_size
os/bluestore/BlueStore.cc: start should be >= _get_ondisk_reserved()

Reviewed-by: Igor Fedotov <ifedotov@suse.com>
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
6 years agoMerge pull request #29686 from tchaikov/wip-osdc-wait-for-osdmap
Kefu Chai [Thu, 15 Aug 2019 16:10:25 +0000 (00:10 +0800)]
Merge pull request #29686 from tchaikov/wip-osdc-wait-for-osdmap

osdc: should release the rwlock before waiting

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
6 years agoMerge pull request #26953 from Exotelis/ceph-dashboard-i18ntool
Lenz Grimmer [Thu, 15 Aug 2019 14:45:51 +0000 (14:45 +0000)]
Merge pull request #26953 from Exotelis/ceph-dashboard-i18ntool

mgr/dashboard: ceph dashboard i18ntool

Reviewed-by: Tatjana Dehler <tdehler@suse.com>
Reviewed-by: Tiago Melo <tmelo@suse.com>
6 years agoMerge pull request #29568 from votdev/ignore_dirs
Kefu Chai [Thu, 15 Aug 2019 14:39:05 +0000 (22:39 +0800)]
Merge pull request #29568 from votdev/ignore_dirs

.gitignore: add more stuff

Reviewed-by: Stephan Müller <smueller@suse.com>
Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
6 years agoceph-volume: don't keep device lists as sets 29683/head
Jan Fajerski [Thu, 15 Aug 2019 10:20:00 +0000 (12:20 +0200)]
ceph-volume: don't keep device lists as sets

This was introduced by #27754. The explicit device lists were cast to
sets but other parts of the code where not updated accordingly. To avoid
touching all code places, only cast to sets for disjoint test and keep
lists otherwise.

Fixes: https://tracker.ceph.com/issues/41292
Signed-off-by: Jan Fajerski <jfajerski@suse.com>
6 years agoMerge pull request #29684 from jan--f/c-v-batch-functional-check-stderr
Jan Fajerski [Thu, 15 Aug 2019 13:19:23 +0000 (15:19 +0200)]
Merge pull request #29684 from jan--f/c-v-batch-functional-check-stderr

ceph-volume: fix batch functional tests, idempotent test must check s…

6 years agomgr/dashboard: Daemons Page Tables Test (#29469)
Lenz Grimmer [Thu, 15 Aug 2019 13:18:33 +0000 (13:18 +0000)]
mgr/dashboard: Daemons Page Tables Test (#29469)

mgr/dashboard: Daemons Page Tables Test

Reviewed-by: Alfonso Martínez <almartin@redhat.com>
Reviewed-by: Stephan Müller <smueller@suse.com>
Reviewed-by: Tiago Melo <tmelo@suse.com>
Reviewed-by: Volker Theile <vtheile@suse.com>
6 years agomgr/dashboard: Logs Page E2E Tests (#29434)
Lenz Grimmer [Thu, 15 Aug 2019 13:17:53 +0000 (13:17 +0000)]
mgr/dashboard: Logs Page E2E Tests (#29434)

mgr/dashboard: Logs Page E2E Tests

Reviewed-by: Alfonso Martínez <almartin@redhat.com>
Reviewed-by: Tiago Melo <tmelo@suse.com>
6 years agoMerge pull request #29420 from ricardoasmarques/fix-default-builder-is-not-a-function
Lenz Grimmer [Thu, 15 Aug 2019 13:16:48 +0000 (13:16 +0000)]
Merge pull request #29420 from ricardoasmarques/fix-default-builder-is-not-a-function

mgr/dashboard: Fixes 'defaultBuilder' is not a function

Reviewed-by: Ricardo Dias <rdias@suse.com>
Reviewed-by: Stephan Müller <smueller@suse.com>
Reviewed-by: Tatjana Dehler <tdehler@suse.com>
Reviewed-by: Tiago Melo <tmelo@suse.com>
6 years agoProject Zipper Part 1 - Framework and RGWRadosStore 28824/head
Daniel Gryniewicz [Thu, 6 Jun 2019 18:41:55 +0000 (14:41 -0400)]
Project Zipper Part 1 - Framework and RGWRadosStore

This is the first part of Project Zipper, the Store Abstraction Layer.
It introduces the basic framework, and wraps RGWRados in RGWRadosStore.
The goal over the next few weeks is to do the same for user, bucket, and
object.  This will make most of the remaining users of RGWRados wrapped
in SAL classes, allowing it to be completely absorbed into the private
RGWRadosStore.  This will also expose all the APIs that need to be
pusheg up to higher layers in the SAL.

Signed-off-by: Daniel Gryniewicz <dang@redhat.com>
6 years agoosdc: should release the rwlock before waiting 29686/head
Kefu Chai [Thu, 15 Aug 2019 11:33:15 +0000 (19:33 +0800)]
osdc: should release the rwlock before waiting

this addresses a regresssion introduced by 20b1ac6e

Signed-off-by: Kefu Chai <kchai@redhat.com>
6 years agoceph-volume: fix batch functional tests, idempotent test must check stderr 29684/head
Jan Fajerski [Thu, 15 Aug 2019 11:28:42 +0000 (13:28 +0200)]
ceph-volume: fix batch functional tests, idempotent test must check stderr

Fixes: https://tracker.ceph.com/issues/41295
Signed-off-by: Jan Fajerski <jfajerski@suse.com>
6 years agoImprove .gitignore 29568/head
Volker Theile [Fri, 9 Aug 2019 07:54:23 +0000 (09:54 +0200)]
Improve .gitignore

Ignore some Python related caching dirs.

Signed-off-by: Volker Theile <vtheile@suse.com>
6 years agoMerge pull request #29536 from batrick/backport-https
Nathan Cutler [Thu, 15 Aug 2019 09:08:01 +0000 (11:08 +0200)]
Merge pull request #29536 from batrick/backport-https

scripts: use https url for redmine

Reviewed-by: Willem Jan Withagen <wjw@digiware.nl>
Reviewed-by: Nathan Cutler <ncutler@suse.com>
6 years agoMerge pull request #28997 from b-ranto/wip-push-dash
Boris Ranto [Thu, 15 Aug 2019 09:02:43 +0000 (11:02 +0200)]
Merge pull request #28997 from b-ranto/wip-push-dash

Make ceph-dashboard require grafana dashboards

Reviewed-by: Zack Cerza <zcerza@redhat.com>
Reviewed-by: Nathan Cutler <ncutler@suse.com>
6 years agotest/unittest_bluefs: always remove temp bdev file 29676/head
Kefu Chai [Tue, 13 Aug 2019 02:29:24 +0000 (10:29 +0800)]
test/unittest_bluefs: always remove temp bdev file

we leave files in build directory if the test fails. better off
removing them.

Signed-off-by: Kefu Chai <kchai@redhat.com>
6 years agoosd, osdc: drop the unused outdata feature of PGLSFilter. 29675/head
Radoslaw Zarzynski [Sat, 10 Aug 2019 08:38:20 +0000 (10:38 +0200)]
osd, osdc: drop the unused outdata feature of PGLSFilter.

Before this commit PGLSFilter interface was offering the outdata
parameter in its filter() method:

  filter(..., bufferlist& outdata)

OSD was serializing and appending the bufferlist to response to
CEPH_OSD_OP_PGLS_FILTER and CEPH_OSD_OP_PGNLS_FILTER operations.
At the Objecter's side these extra bits were being parsed and
finally stored in NListContext::extra_info. However, it really
looks this member is not used anywhere.

The commit removes the outdata handling on multiple layers: from
PGLSFilter implementations, through OSD till Objecter.

Signed-off-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
6 years agoosd/PrimaryLogPG: remove unused "parent" pgls-filter
Kefu Chai [Thu, 15 Aug 2019 02:34:19 +0000 (10:34 +0800)]
osd/PrimaryLogPG: remove unused "parent" pgls-filter

it's implemented using `PGLSParentFilter`, and this filter has never been
used. the only possible user would be `cephfs-data-scan`, but it's using
`PGLSCephFSFilter` which is referened with "cephfs.inode_tag".

Signed-off-by: Kefu Chai <kchai@redhat.com>
6 years agoosd/PrimaryLogPG: Avoid accessing destroyed references in finish_degraded_object 29663/head
Tao Ning [Wed, 14 Aug 2019 13:07:46 +0000 (21:07 +0800)]
osd/PrimaryLogPG: Avoid accessing destroyed references in finish_degraded_object

As follows:
for (auto i = callbacks_for_degraded_object.begin(); i != callbacks_for_degraded_object.end();) {
    finish_degraded_object((i++)->first);
}

void PrimaryLogPG::finish_degraded_object(const hobject_t oid)
{
  if (callbacks_for_degraded_object.count(oid)) {
    contexts.swap(callbacks_for_degraded_object[oid]);
    callbacks_for_degraded_object.erase(oid);   // Release
  }

  map<hobject_t, snapid_t>::iterator i = objects_blocked_on_degraded_snap.find(
    oid.get_head());  // Access
  ...
}

Fixes: https://tracker.ceph.com/issues/41250
Signed-off-by: Tao Ning <ningtao@sangfor.com.cn>
6 years agoos/bluestore/KernelDevice: fix RW_IO_MAX constant 29577/head
Sage Weil [Fri, 9 Aug 2019 14:21:17 +0000 (09:21 -0500)]
os/bluestore/KernelDevice: fix RW_IO_MAX constant

This depends on the page size.  See:

https://github.com/torvalds/linux/blob/6e6d05360b80f196ed07061327f03346b204abea/include/linux/fs.h#L2305

https://github.com/torvalds/linux/blob/30d1d92a888d03681b927c76a35181b4eed7071f/tools/virtio/linux/kernel.h#L23

Fixes 4d33114a40d5ae0d541c36175977ca22789a3b88

Fixes: https://tracker.ceph.com/issues/41188
Signed-off-by: Sage Weil <sage@redhat.com>
6 years agoqa/tasks/mgr/dashboard/test_health: update schema 29422/head
Sage Weil [Thu, 8 Aug 2019 16:21:42 +0000 (11:21 -0500)]
qa/tasks/mgr/dashboard/test_health: update schema

Also fix the 'checks' field, which is a list of objects, not strings.  (The
test doesn't notice because it's empty.)

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agodoc/rados/operations/monitoring: document muting health alerts
Sage Weil [Thu, 1 Aug 2019 20:00:10 +0000 (15:00 -0500)]
doc/rados/operations/monitoring: document muting health alerts

I think someday the docs for how health alerts work (here) and the
enumeration of all actual alerts should be restructured.  For now this
si the simplest placde to fit this!

Signed-off-by: Sage Weil <sage@redhat.com>t
6 years agoqa/standalone/mon/health-mutes: add tests
Sage Weil [Wed, 31 Jul 2019 22:28:18 +0000 (17:28 -0500)]
qa/standalone/mon/health-mutes: add tests

Make sure mute and unmute work.  Make sure stick is sticky. Mkae sure
counts can go down bupt if they go upt hte mute clears.

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agodoc/rados/operations/health-checks: document MON_DISK_{LOW,CRIT,BIG}
Sage Weil [Wed, 31 Jul 2019 10:24:20 +0000 (05:24 -0500)]
doc/rados/operations/health-checks: document MON_DISK_{LOW,CRIT,BIG}

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agodoc/rados/operations/health-checks: document OSD_NO_DOWN_OUT_INTERVAL
Sage Weil [Wed, 31 Jul 2019 10:15:14 +0000 (05:15 -0500)]
doc/rados/operations/health-checks: document OSD_NO_DOWN_OUT_INTERVAL

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agodoc/rados/operations/health-checks: document AUTH_BAD_CAPS
Sage Weil [Wed, 31 Jul 2019 10:10:50 +0000 (05:10 -0500)]
doc/rados/operations/health-checks: document AUTH_BAD_CAPS

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agodoc/reados/operations/health-checks: document PG_SLOW_SNAP_TRIMMING
Sage Weil [Wed, 31 Jul 2019 10:04:37 +0000 (05:04 -0500)]
doc/reados/operations/health-checks: document PG_SLOW_SNAP_TRIMMING

The mitigation steps are weak, but it's not clear concrete guidance to
provide.

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agodoc/rados/operations/health-checks: document MGR_DOWN
Sage Weil [Wed, 31 Jul 2019 09:57:49 +0000 (04:57 -0500)]
doc/rados/operations/health-checks: document MGR_DOWN

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomon/HealthCheck: check mutes based on count, not parsing the summary string
Sage Weil [Wed, 31 Jul 2019 09:51:38 +0000 (04:51 -0500)]
mon/HealthCheck: check mutes based on count, not parsing the summary string

This is more explicit and robust, and works with the PG warnings, which
don't conform to the "%d ..." form that the other messages do.

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomon/health_checks: associate a count with health_alert_t
Sage Weil [Wed, 31 Jul 2019 09:14:01 +0000 (04:14 -0500)]
mon/health_checks: associate a count with health_alert_t

0 means this is a singleton.  Otherwise, we can sum this up, either
via merge() or get_or_add().

We always structure this so the count goes toward zero (more healthy), so
if a value is too low, then we count how much too low it is.

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomon/HealthMonitor: simplify health alert dump
Sage Weil [Wed, 31 Jul 2019 08:24:45 +0000 (03:24 -0500)]
mon/HealthMonitor: simplify health alert dump

Use dump() member instead of duplicating!  The only reason we had this
before was because the detail portion was optinoal

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomon/PGMap: use nice timespan for PG stuck warnings
Sage Weil [Wed, 31 Jul 2019 08:22:38 +0000 (03:22 -0500)]
mon/PGMap: use nice timespan for PG stuck warnings

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomon/HealthMonitor: allow muted alert counts to decrease but not increase
Sage Weil [Wed, 31 Jul 2019 07:04:20 +0000 (02:04 -0500)]
mon/HealthMonitor: allow muted alert counts to decrease but not increase

If the summary starts with a digit, parse a count.

If the count goes up, clear the mute.

If the count goes down, update the mute so that we ratchet the threshold
down.

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomon/PGMap: fix summary form for bluestore health alerts
Sage Weil [Wed, 31 Jul 2019 06:39:28 +0000 (01:39 -0500)]
mon/PGMap: fix summary form for bluestore health alerts

Count goes first.

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agodoc/rados/operations/health-alerts: document BLUESTORE_NO_COMPRESSION
Sage Weil [Wed, 31 Jul 2019 06:39:03 +0000 (01:39 -0500)]
doc/rados/operations/health-alerts: document BLUESTORE_NO_COMPRESSION

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomon/PGMap: fix summary form for POOL_APP_NOT_ENABLED
Sage Weil [Wed, 31 Jul 2019 06:31:39 +0000 (01:31 -0500)]
mon/PGMap: fix summary form for POOL_APP_NOT_ENABLED

Count goes first.

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomon/HealthMonitor: persist summary for non-sticky mutes
Sage Weil [Wed, 31 Jul 2019 06:31:22 +0000 (01:31 -0500)]
mon/HealthMonitor: persist summary for non-sticky mutes

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomon/HealthMonitor: move get_health_status()
Sage Weil [Wed, 31 Jul 2019 05:27:26 +0000 (00:27 -0500)]
mon/HealthMonitor: move get_health_status()

This operates exclusively on HealthMonitor members.  Make public member
private again.

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomon/HealthMonitor: automatically clear non-sticky mutes when alert clears
Sage Weil [Wed, 31 Jul 2019 05:19:39 +0000 (00:19 -0500)]
mon/HealthMonitor: automatically clear non-sticky mutes when alert clears

If the alert goes away, drop the mute.

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomon/HealthMonitor: add gather_all_health_checks helper
Sage Weil [Wed, 31 Jul 2019 05:05:04 +0000 (00:05 -0500)]
mon/HealthMonitor: add gather_all_health_checks helper

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomon/HealthMonitor: add sticky flag to mutes
Sage Weil [Wed, 31 Jul 2019 03:54:25 +0000 (22:54 -0500)]
mon/HealthMonitor: add sticky flag to mutes

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomon/HealthMonitor: expire mutes based on ttl
Sage Weil [Wed, 31 Jul 2019 00:44:59 +0000 (19:44 -0500)]
mon/HealthMonitor: expire mutes based on ttl

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomon: apply mutes to health [detail]
Sage Weil [Wed, 31 Jul 2019 00:44:48 +0000 (19:44 -0500)]
mon: apply mutes to health [detail]

- de-escalate severity
- mark mutes in structured output
- note mutes in summary text output
- mark mutes in detail text output

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomon/HealthMonitor: implement mute and unmount commands
Sage Weil [Tue, 30 Jul 2019 16:30:15 +0000 (11:30 -0500)]
mon/HealthMonitor: implement mute and unmount commands

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomon/HealthMonitor: maintain list of mutes
Sage Weil [Tue, 30 Jul 2019 16:28:55 +0000 (11:28 -0500)]
mon/HealthMonitor: maintain list of mutes

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomon: refactor/simplify health [detail]
Sage Weil [Tue, 30 Jul 2019 16:23:57 +0000 (11:23 -0500)]
mon: refactor/simplify health [detail]

Get rid of single caller helpers.  Instead, assimilate all the checks
together at once, and have two separate blocks, one for formatted, and
one for plaintext output.  Much easier to follow!

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomon/health_checks: format 'health summary' with a colon
Sage Weil [Tue, 30 Jul 2019 15:01:28 +0000 (10:01 -0500)]
mon/health_checks: format 'health summary' with a colon

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomon/health_checks: drop dump_summary_compat
Sage Weil [Tue, 30 Jul 2019 15:01:15 +0000 (10:01 -0500)]
mon/health_checks: drop dump_summary_compat

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agoosd/PG: scrub error when objects are larger than osd_max_object_size 29579/head
David Zafman [Fri, 9 Aug 2019 20:07:22 +0000 (20:07 +0000)]
osd/PG: scrub error when objects are larger than osd_max_object_size

Signed-off-by: David Zafman <dzafman@redhat.com>
6 years agoMerge PR #29431 into master
Patrick Donnelly [Wed, 14 Aug 2019 22:21:51 +0000 (15:21 -0700)]
Merge PR #29431 into master

* refs/pull/29431/head:
qa: fix malformed suite config

Reviewed-by: Zheng Yan <zyan@redhat.com>
6 years agoMerge PR #28652 into master
Patrick Donnelly [Wed, 14 Aug 2019 22:05:29 +0000 (15:05 -0700)]
Merge PR #28652 into master

* refs/pull/28652/head:
cephfs-shell: Add error message for invalid ls commands

Reviewed-by: Rishabh Dave <ridave@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
6 years agoMerge PR #29554 into master
Patrick Donnelly [Wed, 14 Aug 2019 22:04:04 +0000 (15:04 -0700)]
Merge PR #29554 into master

* refs/pull/29554/head:
cephfs-shell: Fix onecmd TypeError

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
6 years agoMerge PR #29552 into master
Patrick Donnelly [Wed, 14 Aug 2019 22:01:35 +0000 (15:01 -0700)]
Merge PR #29552 into master

* refs/pull/29552/head:
cephfs-shell: Convert paths type from string to bytes

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Rishabh Dave <ridave@redhat.com>
6 years agorgw_file: readdir: do not construct markers w/leading '/' 29670/head
Matt Benjamin [Wed, 14 Aug 2019 18:11:16 +0000 (14:11 -0400)]
rgw_file: readdir: do not construct markers w/leading '/'

This case arises when listing the top directory of a bucket, and,
with proper continued enumeration, would generate a non-terminating
loop if a directory contained names which sort lexically before '/'.

Fixes: https://tracker.ceph.com/issues/41252
Signed-off-by: Matt Benjamin <mbenjamin@redhat.com>
6 years agoMerge pull request #29666 from yuriw/wip-yuriw-crontab-master
Yuri Weinstein [Wed, 14 Aug 2019 16:49:39 +0000 (09:49 -0700)]
Merge pull request #29666 from yuriw/wip-yuriw-crontab-master

qa/tests - upped priority for upgrades on master, otherwise they neve…

6 years agoqa/tests - upped priority for upgrades on master, otherwise they never lock nodes... 29666/head
Yuri Weinstein [Wed, 14 Aug 2019 16:43:02 +0000 (09:43 -0700)]
qa/tests - upped priority for upgrades on master, otherwise they never lock nodes for testing and fail

Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
6 years agopybind/rados: add get_id and get_name method in class Ioctx 29646/head
zhengyin [Wed, 14 Aug 2019 02:27:15 +0000 (22:27 -0400)]
pybind/rados: add get_id and get_name method in class Ioctx

Signed-off-by: Zheng Yin <zhengyin@cmss.chinamobile.com>
6 years agomgr/dashboard: Fix e2e issue in HACKING.rst (#29640)
Lenz Grimmer [Wed, 14 Aug 2019 09:05:07 +0000 (09:05 +0000)]
mgr/dashboard: Fix e2e issue in HACKING.rst (#29640)

mgr/dashboard: Fix e2e issue in HACKING.rst

Reviewed-by: Stephan Müller <smueller@suse.com>
Reviewed-by: Tatjana Dehler <tdehler@suse.com>
Reviewed-by: Tiago Melo <tmelo@suse.com>
6 years agoMerge pull request #29570 from rhcs-dashboard/new-bucket-utilities-adaptation
Lenz Grimmer [Wed, 14 Aug 2019 08:19:32 +0000 (08:19 +0000)]
Merge pull request #29570 from rhcs-dashboard/new-bucket-utilities-adaptation

mgr/dashboard: adapt bucket tenant API tests to new behaviour

Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
Reviewed-by: Tatjana Dehler <tdehler@suse.com>
Reviewed-by: Volker Theile <vtheile@suse.com>
6 years agoMerge pull request #29634 from rhcs-dashboard/mgr-module-fixes
Lenz Grimmer [Wed, 14 Aug 2019 08:18:18 +0000 (08:18 +0000)]
Merge pull request #29634 from rhcs-dashboard/mgr-module-fixes

mgr/dashboard: fix mgr module API tests

Reviewed-by: Kefu Chai <kchai@redhat.com>
Reviewed-by: Tatjana Dehler <tdehler@suse.com>
Reviewed-by: Volker Theile <vtheile@suse.com>
6 years agoMerge pull request #29612 from tchaikov/wip-crimson-perf-test
Kefu Chai [Wed, 14 Aug 2019 05:48:21 +0000 (13:48 +0800)]
Merge pull request #29612 from tchaikov/wip-crimson-perf-test

crimson/test: add CBT based perf tests

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
6 years agoMerge pull request #29644 from anthonyeleven/patch-3
Kefu Chai [Wed, 14 Aug 2019 03:24:45 +0000 (11:24 +0800)]
Merge pull request #29644 from anthonyeleven/patch-3

doc: operations: correct 'comma-delimited'

Reviewed-by: Kefu Chai <kchai@redhat.com>
6 years agosrc/script: add run-cbt.sh 29612/head
Kefu Chai [Mon, 12 Aug 2019 14:38:25 +0000 (22:38 +0800)]
src/script: add run-cbt.sh

this script will be used by jenkins to drive the CBT based test

also, this test can be used by developer for testing crimson or classic
OSD.

Signed-off-by: Kefu Chai <kchai@redhat.com>
6 years agocrimson/test: add perf tests for crimson
Kefu Chai [Mon, 12 Aug 2019 12:10:35 +0000 (20:10 +0800)]
crimson/test: add perf tests for crimson

Signed-off-by: Kefu Chai <kchai@redhat.com>
6 years agocrimson/test: add script to convert teuthology task config to cbt config
Kefu Chai [Mon, 12 Aug 2019 12:08:49 +0000 (20:08 +0800)]
crimson/test: add script to convert teuthology task config to cbt config

Signed-off-by: Kefu Chai <kchai@redhat.com>
6 years agodoc: operations: correct 'comma-delimited' 29644/head
Anthony D'Atri [Tue, 13 Aug 2019 19:50:39 +0000 (12:50 -0700)]
doc: operations: correct 'comma-delimited'

CIDR blocks are comma-separated, not comma-delimited.

Signed-off-by: Anthony D'Atri <anthony.datri@gmail.com>
6 years agoMerge pull request #29118 from cbodley/wip-rgw-metadata-servicification
Casey Bodley [Tue, 13 Aug 2019 16:56:52 +0000 (12:56 -0400)]
Merge pull request #29118 from cbodley/wip-rgw-metadata-servicification

rgw: metadata refactoring

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>
6 years agoMerge branch 'wip-rgw-metadata-servicification' 29118/head
Casey Bodley [Mon, 12 Aug 2019 19:47:29 +0000 (15:47 -0400)]
Merge branch 'wip-rgw-metadata-servicification'

 Conflicts:
src/rgw/rgw_auth.cc
src/rgw/rgw_auth_registry.h
src/rgw/rgw_auth_s3.h
src/rgw/rgw_bucket.cc
src/rgw/rgw_bucket.h
src/rgw/rgw_data_sync.h
src/rgw/rgw_frontend.h
src/rgw/rgw_log.h
src/rgw/rgw_main.cc
src/rgw/rgw_rados.cc
src/rgw/rgw_rados.h
src/rgw/rgw_rest_s3.h
src/rgw/rgw_rest_sts.h
src/rgw/rgw_swift_auth.h
src/rgw/rgw_user.cc
src/rgw/rgw_user.h
src/rgw/services/svc_sys_obj_core.h

6 years agomgr/dashboard: Fix e2e issue in HACKING.rst 29640/head
Volker Theile [Tue, 13 Aug 2019 13:20:56 +0000 (15:20 +0200)]
mgr/dashboard: Fix e2e issue in HACKING.rst

Signed-off-by: Volker Theile <vtheile@suse.com>
6 years agomgr/dashboard: Daemons Page Tables Test 29469/head
Adam King [Fri, 2 Aug 2019 18:53:31 +0000 (14:53 -0400)]
mgr/dashboard: Daemons Page Tables Test

Selects first daemon from list and checks details table is displayed
Clicks performance counters tab and checks performance counters table is displayed

Fixes: https://tracker.ceph.com/issues/41063
Signed-off-by: Adam King <adking@redhat.com>
Signed-off-by: Rafael Quintero <rquinter@redhat.com>
6 years agoMerge pull request #29633 from hanfengzhe-hi/Fix-decompression-logprint
Casey Bodley [Tue, 13 Aug 2019 12:15:17 +0000 (08:15 -0400)]
Merge pull request #29633 from hanfengzhe-hi/Fix-decompression-logprint

rgw:Fix rgw decompression log-print

Reviewed-by: Casey Bodley <cbodley@redhat.com>
6 years agomgr/dashboard: fix mgr module API tests 29634/head
alfonsomthd [Tue, 13 Aug 2019 10:15:38 +0000 (12:15 +0200)]
mgr/dashboard: fix mgr module API tests

Signed-off-by: alfonsomthd <almartin@redhat.com>
6 years agorgw:Fix rgw decompression log-print 29633/head
Han Fengzhe [Tue, 13 Aug 2019 09:08:23 +0000 (17:08 +0800)]
rgw:Fix rgw decompression log-print

The zlib compression takes effect in RGW。
When getting objects failed because of decompress-failed。“ceph-client.rgw” log printed “Compression failed with exit code......”,it should be “deCompression failed with exit code......”。

Signed-off-by: Han Fengzhe <hanfengzhe@hisilicon.com>
6 years agoMerge pull request #29587 from yuvalif/wip-yuvali-fix-issue-41169
Yuval Lifshitz [Tue, 13 Aug 2019 08:11:34 +0000 (17:11 +0900)]
Merge pull request #29587 from yuvalif/wip-yuvali-fix-issue-41169

rgw: don't throw when accept errors are happening on frontend

6 years agoMerge pull request #29615 from tchaikov/wip-qa/tasks/mgr/dashboard/test_health
Kefu Chai [Tue, 13 Aug 2019 05:59:28 +0000 (13:59 +0800)]
Merge pull request #29615 from tchaikov/wip-qa/tasks/mgr/dashboard/test_health

qa/tasks/mgr/dashboard/test_health: add missing field for test_full_health

Reviewed-by: Kiefer Chang <kiefer.chang@suse.com>
6 years agoqa/tasks/mgr/dashboard/test_health: add missing field for test_full_health 29615/head
Kefu Chai [Tue, 13 Aug 2019 00:43:34 +0000 (08:43 +0800)]
qa/tasks/mgr/dashboard/test_health: add missing field for test_full_health

fix regressions introduced by a076260e and d6ff61ed

Signed-off-by: Kefu Chai <kchai@redhat.com>
6 years agoosd/ReplicatedBackend: check against empty data_included before enabling crc 29621/head
xie xingguo [Tue, 13 Aug 2019 02:03:27 +0000 (10:03 +0800)]
osd/ReplicatedBackend: check against empty data_included before enabling crc

The data_included portion could be empty, e.g., due to there are
too many omap entries and can not be sent in one shot, which as
a result may end up crashing the OSD.

Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>
6 years agoqa/tasks/mgr/dashboard/test_mgr_module: remove enable/disable test from MgrModuleTele... 29619/head
Kefu Chai [Tue, 13 Aug 2019 01:54:24 +0000 (09:54 +0800)]
qa/tasks/mgr/dashboard/test_mgr_module: remove enable/disable test from MgrModuleTelemetryTest

telemetry is always enabled since 2d62d71cd46d791294b3803e034c0995fef08b33

Fixes: https://tracker.ceph.com/issues/41186
Signed-off-by: Kefu Chai <kchai@redhat.com>
6 years agoMerge pull request #29342 from Jeegn-Chen/wip-scrub-extended-sleep
Kefu Chai [Tue, 13 Aug 2019 01:09:52 +0000 (09:09 +0800)]
Merge pull request #29342 from Jeegn-Chen/wip-scrub-extended-sleep

osd: support osd_scrub_extended_sleep

Reviewed-by: David Zafman <dzafman@redhat.com>
6 years agoMerge pull request #29522 from majianpeng/bluestore-optimization
Kefu Chai [Tue, 13 Aug 2019 01:08:00 +0000 (09:08 +0800)]
Merge pull request #29522 from majianpeng/bluestore-optimization

os/bluestore: deferred IO notify and locking optimization

Reviewed-by: Sage Weil <sage@redhat.com>
6 years agoMerge pull request #29418 from junxfl/test1
Kefu Chai [Tue, 13 Aug 2019 01:06:52 +0000 (09:06 +0800)]
Merge pull request #29418 from junxfl/test1

OSD: modify n.cookie to op.notify.cookie

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
6 years agoMerge pull request #29571 from tchaikov/wip-41177
Kefu Chai [Tue, 13 Aug 2019 01:04:58 +0000 (09:04 +0800)]
Merge pull request #29571 from tchaikov/wip-41177

ceph-objectstore-tool: update-mon-db: do not fail if incmap is missing

Reviewed-by: huangjun <huangjun@xsky.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
6 years agorgw: RGWUserCtl::PutParams::attrs defaults to nullptr
Casey Bodley [Mon, 12 Aug 2019 14:18:34 +0000 (10:18 -0400)]
rgw: RGWUserCtl::PutParams::attrs defaults to nullptr

Signed-off-by: Casey Bodley <cbodley@redhat.com>
6 years agovstart: pass "--smp 1 --cpuset $osd" to crimson-osd
Kefu Chai [Mon, 12 Aug 2019 12:04:17 +0000 (20:04 +0800)]
vstart: pass "--smp 1 --cpuset $osd" to crimson-osd

for optimal performance when testing multiple crimson-osd in a vstart cluster

Signed-off-by: Kefu Chai <kchai@redhat.com>
6 years agoMerge pull request #29605 from yaozongyou/remove-unused-marco
Casey Bodley [Mon, 12 Aug 2019 14:14:52 +0000 (10:14 -0400)]
Merge pull request #29605 from yaozongyou/remove-unused-marco

common: remove unused _STR and STRINGIFY macro

Reviewed-by: Casey Bodley <cbodley@redhat.com>