]>
git.apps.os.sepia.ceph.com Git - ceph.git/log
Patrick Donnelly [Mon, 17 Jul 2017 04:27:00 +0000 (21:27 -0700)]
script: add docker core dump debugger
This script sets up a docker container with all the packages used for your Ceph
(test) branch. From there, you can gdb the core dump if you have access to the
test cluster.
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Kefu Chai [Sat, 15 Jul 2017 08:31:35 +0000 (16:31 +0800)]
Merge pull request #15587 from wjwithagen/wip-wjw-ceph-disk-is_diskdevice
ceph-disk/ceph_disk/main.py: Replace ST_ISBLK() test by is_diskdevice()
Reviewed-by: Loic Dachary <ldachary@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
Kefu Chai [Sat, 15 Jul 2017 08:24:23 +0000 (16:24 +0800)]
Merge pull request #16347 from tchaikov/wip-test-ceph-disk
tests: ceph-disk: use communicate() instead of wait() for output
Reviewed-by: Loic Dachary <ldachary@redhat.com>
Jos Collin [Sat, 15 Jul 2017 05:21:23 +0000 (05:21 +0000)]
Merge pull request #16334 from wjwithagen/wjw-bug-stringyfy
core:" Stringify needs access to << before reference" src/include/stringify.h
Reviewed-by: Jos Collin <jcollin@redhat.com>
Kefu Chai [Sat, 15 Jul 2017 02:40:31 +0000 (10:40 +0800)]
qa/suites/ceph-disk: whitelist health warnings
Signed-off-by: Kefu Chai <kchai@redhat.com>
Kefu Chai [Fri, 14 Jul 2017 16:07:03 +0000 (00:07 +0800)]
tests: ceph-disk: use communicate() instead of wait() for output
to avoid possible deadlock. quote from doc of Popen.wait()
> This will deadlock when using stdout=PIPE and/or stderr=PIPE and the
child process generates enough output to a pipe such that it blocks
waiting for the OS pipe buffer to accept more data. Use communicate() to
avoid that.
and print out the stdout and stderr using LOG.warn() if the command
fails.
Signed-off-by: Kefu Chai <kchai@redhat.com>
Kefu Chai [Sat, 15 Jul 2017 02:57:13 +0000 (10:57 +0800)]
Merge pull request #16045 from Liuchang0812/wip-compact-osd-feature
osd: compact osd feature
Reviewed-by: Sage Weil <sage@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
Sage Weil [Fri, 14 Jul 2017 20:09:22 +0000 (15:09 -0500)]
Merge pull request #16221 from liewegas/wip-20546
crush/CrushWrapper: make get_immediate_parent[_id] ignore per-class shadow hierarchy
Reviewed-by: Neha Ojha <nojha@redhat.com>
Yehuda Sadeh [Fri, 14 Jul 2017 16:13:10 +0000 (09:13 -0700)]
Merge pull request #12139 from tianshan/fix_zone_set_realm_id
rgw: fix zone did't update realm_id when added to zonegroup
Reviewed-by: Orit Wasserman <owasserm@redhat.com>
Yuri Weinstein [Fri, 14 Jul 2017 15:52:52 +0000 (08:52 -0700)]
Merge pull request #14915 from cbodley/wip-19817
rgw: add missing RGWPeriod::reflect() based on new atomic update_latest_epoch()
Reviewed-by: Orit Wasserman <owasserm@redhat.com>
Yuri Weinstein [Fri, 14 Jul 2017 15:51:42 +0000 (08:51 -0700)]
Merge pull request #15953 from rzarzynski/wip-rgw-20418
rgw: reject request if decoded URI contains \0 in the middle.
Reviewed-by: Orit Wasserman <owasserm@redhat.com>
Sage Weil [Fri, 14 Jul 2017 15:19:25 +0000 (10:19 -0500)]
Merge pull request #16340 from dillaman/wip-20630
qa/tasks: rbd-mirror daemon not properly run in foreground mode
Reviewed-by: Sage Weil <sage@redhat.com>
Casey Bodley [Fri, 14 Jul 2017 15:12:57 +0000 (11:12 -0400)]
Merge pull request #16331 from zhangsw/cleanup-zonegroup-list
rgw: remove the useless output when listing zonegroups.
Reviewed-by: Casey Bodley <cbodley@redhat.com>
Jason Dillaman [Fri, 14 Jul 2017 14:32:28 +0000 (10:32 -0400)]
qa/tasks: rbd-mirror daemon not properly run in foreground mode
Fixes: http://tracker.ceph.com/issues/20630
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
Sage Weil [Fri, 14 Jul 2017 14:16:33 +0000 (09:16 -0500)]
Merge pull request #16043 from jcsp/wip-dashboard-updates
mgr: dashboard improvements
Reviewed-by: Kefu Chai <kchai@redhat.com>
John Spray [Thu, 22 Jun 2017 00:34:27 +0000 (20:34 -0400)]
mgr/dashboard: update health display
This takes account of the new health format, also
expands and visually cleans up the frontpage
where we put the health information.
Dark backgrounds make it much easier to use
red/amber/green colours to grab attention.
Signed-off-by: John Spray <john.spray@redhat.com>
John Spray [Thu, 22 Jun 2017 00:12:58 +0000 (20:12 -0400)]
mgr: expose a MgrMap in PyModules
Signed-off-by: John Spray <john.spray@redhat.com>
Sage Weil [Fri, 14 Jul 2017 14:04:16 +0000 (09:04 -0500)]
Merge pull request #16020 from jcsp/wip-20383
mgr: clean up daemon start process
Reviewed-by: Sage Weil <sage@redhat.com>
Jos Collin [Fri, 14 Jul 2017 13:58:15 +0000 (13:58 +0000)]
Merge pull request #16338 from scienceluo/wip-doc-branch
doc/release-notes: Luminous release notes typo fixes
Reviewed-by: Jos Collin <jcollin@redhat.com>
Luo Kexue [Fri, 14 Jul 2017 10:17:07 +0000 (18:17 +0800)]
doc/release-notes: Luminous release notes typo fixes "systemctl ceph-osd.target"->"systemctl restart ceph-osd.target" and "systemctl ceph-mgr.target"->"systemctl restart ceph-mgr.target"
Signed-off-by: Luo Kexue <luo.kexue@zte.com.cn>
Nathan Cutler [Fri, 14 Jul 2017 10:07:18 +0000 (12:07 +0200)]
Merge pull request #16318 from smithfarm/wip-jewel-10-2-9
doc: Jewel v10.2.9 release notes
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Willem Jan Withagen [Fri, 14 Jul 2017 08:54:32 +0000 (10:54 +0200)]
ceph_disk/main.py: fixed bugs from theutology report
Signed-off-by: Willem Jan Withagen <wjw@digiware.nl>
Willem Jan Withagen [Thu, 6 Jul 2017 08:40:23 +0000 (10:40 +0200)]
ceph-disk/main.py: Replace ST_ISBLK() test by is_diskdevice()
- FreeBSD does not have blockdevices any more (since 2002)
So disk are just Character special devices, so test on ISCHR
Signed-off-by: Willem Jan Withagen <wjw@digiware.nl>
Willem Jan Withagen [Fri, 14 Jul 2017 08:44:44 +0000 (10:44 +0200)]
core:" Stringify needs access to << before reference" src/include/stringify.h
Clang complains:
In file included from /home/jenkins/workspace/ceph-master/src/mon/HealthMonitor.cc:21:
/home/jenkins/workspace/ceph-master/src/include/stringify.h:15:6: error: call to function 'operator<<' that is neither visible in the template definition nor found by argument-dependent lookup
ss << a;
^
/home/jenkins/workspace/ceph-master/src/mon/HealthMonitor.cc:129:32: note: in instantiation of function template specialization 'stringify<std::__1::set<std::__1::basic_string<char>, std::__1::less<std::__1::basic_string<char> >, std::__1::allocator<std::__1::basic_string<char> > > >' requested here
boost::regex("%names%"), stringify(names[p.first]));
^
/home/jenkins/workspace/ceph-master/src/include/types.h:160:17: note: 'operator<<' should be declared prior to the call site
inline ostream& operator<<(ostream& out, const set<A, Comp, Alloc>& iset) {
Signed-off-by: Willem Jan Withagen <wjw@digiware.nl>
Kefu Chai [Fri, 14 Jul 2017 08:31:42 +0000 (16:31 +0800)]
Merge pull request #16330 from scienceluo/master
doc/release-notes: Luminous release notes typo fixes "ceph config-key ls"->"ceph config-key list"
Reviewed-by: xie xingguo <xie.xingguo@zte.com.cn>
Reviewed-by: Kefu Chai <kchai@redhat.com>
scienceluo [Thu, 13 Jul 2017 13:05:58 +0000 (21:05 +0800)]
doc:fix release-notes
Signed-off-by: luo.kexue@zte.com.cn
Sage Weil [Fri, 14 Jul 2017 03:16:39 +0000 (22:16 -0500)]
Merge pull request #16300 from liewegas/wip-20600
mon: fix hang on deprecated/removed 'pg set_*full_ratio' commands
Reviewed-by: Greg Farnum <gfarnum@redhat.com>
Sage Weil [Fri, 14 Jul 2017 03:16:16 +0000 (22:16 -0500)]
Merge pull request #16321 from Yan-waller/wip-walle-0712cephosd
common: misc cleanups in common, global, os, osd submodules
Reviewed-by: Jos Collin <jcollin@redhat.com>
Sage Weil [Fri, 14 Jul 2017 03:15:03 +0000 (22:15 -0500)]
Merge pull request #16322 from liewegas/wip-20617
qa/tasks/ceph_manager: wait longer for pg stats to flush
Reviewed-by: Kefu Chai <kchai@redhat.com>
Sage Weil [Fri, 14 Jul 2017 03:14:35 +0000 (22:14 -0500)]
Merge pull request #16323 from ceph/revert-15897-wip-20390
Revert "msg/async: increase worker reference with local listen table enabled backend"
Kefu Chai [Fri, 14 Jul 2017 03:13:57 +0000 (11:13 +0800)]
Merge pull request #16319 from tchaikov/wip-ceph-helper-with-exp-features
qa/workunits/ceph-helpers: enable experimental features for osd
Reviewed-by: Loic Dachary <ldachary@redhat.com>
Kefu Chai [Fri, 14 Jul 2017 03:10:52 +0000 (11:10 +0800)]
Merge pull request #16320 from tchaikov/wip-clang-analyzer-warnings
test,mon,msg: kill clang analyzer warnings
Reviewed-by: Haomai Wang <haomai@xsky.com>
Reviewed-by: Brad Hubbard <bhubbard@redhat.com>
Zhang Shaowen [Fri, 14 Jul 2017 02:47:34 +0000 (10:47 +0800)]
rgw: remove the useless output when listing zonegroups.
Signed-off-by: Zhang Shaowen <zhangshaowen@cmss.chinamobile.com>
Nathan Cutler [Thu, 13 Jul 2017 21:13:46 +0000 (23:13 +0200)]
doc: Jewel v10.2.9 changelog
Signed-off-by: Nathan Cutler <ncutler@suse.com>
Sage Weil [Thu, 13 Jul 2017 17:59:26 +0000 (13:59 -0400)]
mon/PGMonitor: EOPNOTSUPP for old pgmon commands
This includes 'pg set_full_ratio', which we have only for the upgrade, but
goes away afterwards.
Also, return true to either swallow the request or indicate it has been
processed.
Fixes: http://tracker.ceph.com/issues/20600
Signed-off-by: Sage Weil <sage@redhat.com>
Kefu Chai [Thu, 13 Jul 2017 10:44:45 +0000 (18:44 +0800)]
mon: do not assign to never-read variable
this silences clang analyzer's warning of
Value stored to 'err' is never read
Signed-off-by: Kefu Chai <kchai@redhat.com>
Kefu Chai [Thu, 13 Jul 2017 10:42:11 +0000 (18:42 +0800)]
msg/async/rdma: return stored errno on error
otherwise the errno would be overwritten, and we are returning 0 or the
errno set by ::close()
Signed-off-by: Kefu Chai <kchai@redhat.com>
Kefu Chai [Thu, 13 Jul 2017 10:38:36 +0000 (18:38 +0800)]
test: test_c_headers: silence clang analyzer warnings
this silences clang analyzer's warnings like:
Value stored to 'ret' is never read
Signed-off-by: Kefu Chai <kchai@redhat.com>
Gregory Farnum [Thu, 13 Jul 2017 16:37:42 +0000 (09:37 -0700)]
Merge pull request #16262 from liewegas/wip-20208
mgr/ClusterState: do not mangle PGMap outside of Incremental
Yehuda Sadeh [Thu, 13 Jul 2017 16:36:12 +0000 (09:36 -0700)]
Merge pull request #9974 from weiqiaomiao/wqm-wip-copy_obj
rgw: fix memory leak in copy_obj_to_remote_dest
Reviewed-by: Orit Wasserman <owasserm@redhat.com>
Yehuda Sadeh [Thu, 13 Jul 2017 16:22:59 +0000 (09:22 -0700)]
Merge pull request #11124 from zhangsw/cleanup-rgwrados-deleteobj
rgw: remove a redundant judgement in rgw_rados.cc:delete_obj.
Reviewed-by: Yehuda Sadeh <yehuda@redhat.com>
Yehuda Sadeh [Thu, 13 Jul 2017 16:19:36 +0000 (09:19 -0700)]
Merge pull request #12010 from zhangsw/fix-rgw-multipart-bug
rgw: Fix a bug that multipart upload may exceed the quota.
Reviewed-by: Yehuda Sadeh <yehuda@redhat.com>
Sage Weil [Mon, 10 Jul 2017 22:31:49 +0000 (18:31 -0400)]
osdmaptool: add --health option
Generate and dump health checks.
Signed-off-by: Sage Weil <sage@redhat.com>
Sage Weil [Mon, 10 Jul 2017 18:18:37 +0000 (14:18 -0400)]
qa/workunits/rados/test_health_warnings: test down msgs vs crush classes
Signed-off-by: Sage Weil <sage@redhat.com>
Yehuda Sadeh [Thu, 13 Jul 2017 16:14:45 +0000 (09:14 -0700)]
Merge pull request #12197 from zhangsw/fix-rgw-metasync-lock-bug
rgw: lock is not released when set sync marker is failed.
Reviewed-by: Casey Bodley <cbodley@redhat.com>
Sage Weil [Tue, 11 Jul 2017 01:21:59 +0000 (21:21 -0400)]
osd/OSDMap: remove assumption about type ids
The code is assuming type==1 is in use, but it might not be. (It is
usually 'chassis' by default, which is rarely used; 'host' is type usually
type 2.) Remove the type check entirely and identify leaves by a child
>= 0.
Signed-off-by: Sage Weil <sage@redhat.com>
Sage Weil [Tue, 11 Jul 2017 01:20:43 +0000 (21:20 -0400)]
osd/OSDMap: whitespace
Signed-off-by: Sage Weil <sage@redhat.com>
Sage Weil [Tue, 11 Jul 2017 15:45:03 +0000 (11:45 -0400)]
crush/CrushWrapper: make get_immediate_parent[_id] ignore per-class shadow hierarchy
Fixes: http://tracker.ceph.com/issues/20546
Signed-off-by: Sage Weil <sage@redhat.com>
Sage Weil [Thu, 13 Jul 2017 16:13:45 +0000 (12:13 -0400)]
qa/tasks/ceph_manager: wait longer for pg stats to flush
An ill-timed mgr restart could blow the current 15s wait.
Signed-off-by: Sage Weil <sage@redhat.com>
Yehuda Sadeh [Thu, 13 Jul 2017 16:12:06 +0000 (09:12 -0700)]
Merge pull request #16021 from joscollin/wip-uninitialized-pointer-fields-1
rgw: Initialize pointer fields
Reviewed-by: Yehuda Sadeh <yehuda@redhat.com>
Sage Weil [Thu, 13 Jul 2017 15:50:45 +0000 (10:50 -0500)]
Merge pull request #16014 from xiexingguo/wip-autoclass
osd/OSD: auto class on osd start up
Reviewed-by: Sage Weil <sage@redhat.com>
Yuri Weinstein [Thu, 13 Jul 2017 15:41:19 +0000 (08:41 -0700)]
Merge pull request #15774 from shashalu/drop-temp-var
rgw: using RGW_OBJ_NS_MULTIPART in check_bad_index_multipart
Reviewed-by: Casey Bodley <cbodley@redhat.com>
Nathan Cutler [Thu, 13 Jul 2017 09:59:37 +0000 (11:59 +0200)]
doc: Jewel v10.2.9 release notes
Signed-off-by: Nathan Cutler <ncutler@suse.com>
Haomai Wang [Thu, 13 Jul 2017 15:19:11 +0000 (23:19 +0800)]
Revert "msg/async: increase worker reference with local listen table enabled backend"
Jason Dillaman [Thu, 13 Jul 2017 15:18:09 +0000 (11:18 -0400)]
Merge pull request #16255 from trociny/wip-test-librbd-internals
test/librbd: re-enable internal tests in ceph_test_librbd
Reviewed-by: Jason Dillaman <dillaman@redhat.com>
Sage Weil [Thu, 13 Jul 2017 14:46:10 +0000 (09:46 -0500)]
Merge pull request #16130 from fullerdj/wip-djf-ceph-connect-timeout
ceph.in: Check return value when connecting
Reviewed-by: Kefu Chai <kchai@redhat.com>
Reviewed-by: Alfredo Deza <adeza@redhat.com>
Sage Weil [Thu, 13 Jul 2017 14:43:46 +0000 (09:43 -0500)]
Merge pull request #16243 from markhpc/wip-bluestore-freelist-iterator
os/bluestore: Make BitmapFreelistManager kv itereator short lived.
Reviewed-by: Sage Weil <sage@redhat.com>
Sage Weil [Thu, 13 Jul 2017 14:37:40 +0000 (09:37 -0500)]
Merge pull request #16269 from liewegas/wip-bluestore-deferred-pending
os/bluestore: only submit deferred if there is any
Reviewed-by: xie xingguo <xie.xingguo@zte.com.cn>
Reviewed-by: Mark Nelson <mnelson@redhat.com>
Sage Weil [Thu, 13 Jul 2017 14:33:27 +0000 (09:33 -0500)]
Merge pull request #16306 from liewegas/wip-reg11184-health
qa/suites/rados/singleton/all/reg11184: whitelist health warnings
Yan Jun [Wed, 12 Jul 2017 06:20:33 +0000 (14:20 +0800)]
osd: cleanups
Signed-off-by: Yan Jun <yan.jun8@zte.com.cn>
liuchang0812 [Mon, 3 Jul 2017 08:31:31 +0000 (16:31 +0800)]
test: add test of ceph compact command
Signed-off-by: liuchang0812 <liuchang0812@gmail.com>
liuchang0812 [Fri, 30 Jun 2017 12:56:04 +0000 (20:56 +0800)]
osd: new command compact via tell/daemon
user could manual compact OSD's omap as following:
1. ceph tell osd.id compact
2. ceph daemon osd.id compact
user's requests will be impacted during compaction.
Fixes: http://tracker.ceph.com/issues/19592
Signed-off-by: liuchang0812 <liuchang0812@gmail.com>
liuchang0812 [Fri, 30 Jun 2017 08:50:53 +0000 (16:50 +0800)]
os: export compact interface in ObjectStore and ObjectMap
Signed-off-by: liuchang0812 <liuchang0812@gmail.com>
Kefu Chai [Thu, 13 Jul 2017 13:59:40 +0000 (21:59 +0800)]
Merge pull request #16317 from tchaikov/wip-0-osd-is-not-an-error
qa/workunits/ceph-helpers: test wait_for_health_ok differently
Reviewed-by: Loic Dachary <ldachary@redhat.com>
Kefu Chai [Thu, 13 Jul 2017 13:01:40 +0000 (21:01 +0800)]
Merge pull request #16274 from smithfarm/wip-jewel-10-2-8
doc: Jewel v10.2.8 release notes
Reviewed-by: Sage Weil <sage@redhat.com>
Reviewed-by: Gregory Farnum <gfarnum@redhat.com>
Boris Ranto [Thu, 13 Jul 2017 12:04:01 +0000 (14:04 +0200)]
Merge pull request #16267 from liewegas/wip-restful-defaults
mgr/restful: bind to :: and update docs
Reviewed-by: John Spray <john.spray@redhat.com>
Reviewed-by: Boris Ranto <branto@redhat.com>
Kefu Chai [Thu, 13 Jul 2017 11:19:55 +0000 (19:19 +0800)]
Merge pull request #16311 from Songweibin/wip-update-info-affi
.mailmap, .organizationmap: Update Song Weibin information and affiliation
Reviewed-by: Abhishek Lekshmanan <abhishek@suse.com>
Kefu Chai [Thu, 13 Jul 2017 11:14:30 +0000 (19:14 +0800)]
Merge pull request #16303 from bstillwell/releases-doc-update-201707
doc/releases: Update releases from Feb 2017 to July 2017
Reviewed-by: Kefu Chai <kchai@redhat.com>
Nathan Cutler [Thu, 13 Jul 2017 10:18:17 +0000 (12:18 +0200)]
doc: add v10.2.8 changelog
Signed-off-by: Nathan Cutler <ncutler@suse.com>
Kefu Chai [Thu, 13 Jul 2017 09:57:07 +0000 (17:57 +0800)]
qa/workunits/ceph-helpers: enable experimental features for osd
it matches the settings in vstart.sh, also it would be handy for those
who are still developing on btrfs, which is now marked as an experimental
features now.
Signed-off-by: Kefu Chai <kchai@redhat.com>
Kefu Chai [Thu, 13 Jul 2017 09:43:39 +0000 (17:43 +0800)]
qa/workunits/ceph-helpers: test wait_for_health_ok differently
0 OSDs is not an error anymore in the new health checking implemented by
OSDMap::check_health(). this case was treated as an error before, see
OSDMonitor::get_health(). but an osdmap without any OSD is fine, i
think. but an osdmap with 3 OSDs, but all of them are down and out, this
is an error. and we do report this as an error. so, let's update the
test instead.
Signed-off-by: Kefu Chai <kchai@redhat.com>
Kefu Chai [Thu, 13 Jul 2017 08:48:52 +0000 (16:48 +0800)]
Merge pull request #16277 from smithfarm/wip-rn-script
tools: ceph-release-notes: handle an edge case
Reviewed-by: Kefu Chai <kchai@redhat.com>
Nathan Cutler [Wed, 12 Jul 2017 07:05:12 +0000 (09:05 +0200)]
doc: release notes: note MDS regression in 10.2.8
See the discussion in https://github.com/ceph/ceph/pull/16192
Signed-off-by: Nathan Cutler <ncutler@suse.com>
Mykola Golub [Thu, 13 Jul 2017 07:57:42 +0000 (10:57 +0300)]
Merge pull request #16264 from dillaman/wip-20571
rbd-mirror: ignore permission errors on rbd_mirroring object
Reviewed-by: Mykola Golub <mgolub@mirantis.com>
songweibin [Thu, 13 Jul 2017 03:42:17 +0000 (11:42 +0800)]
.mailmap, .organizationmap: Update Song Weibin information and affiliation
Signed-off-by: songweibin <song.weibin@zte.com.cn>
Sage Weil [Wed, 12 Jul 2017 22:39:24 +0000 (18:39 -0400)]
qa/suites/rados/singleton/all/reg11184: whitelist health warnings
Signed-off-by: Sage Weil <sage@redhat.com>
Bryan Stillwell [Wed, 12 Jul 2017 21:38:59 +0000 (15:38 -0600)]
Update releases from Feb 2017 to July 2017
Pull in the latest releases from the past 5 months and fix some of the
links so they jump to the correct sections in the release notes.
Signed-off-by: Bryan Stillwell <bstillwell@godaddy.com>
Sage Weil [Wed, 12 Jul 2017 19:55:56 +0000 (14:55 -0500)]
Merge pull request #16301 from dmick/master
mgr: increase debug level for ticks 0 -> 10
Dan Mick [Wed, 12 Jul 2017 19:40:01 +0000 (15:40 -0400)]
Mgr: increase debug level for ticks 0 -> 10
Signed-off-by: Dan Mick <dmick@redhat.com>
Sage Weil [Wed, 12 Jul 2017 19:09:00 +0000 (15:09 -0400)]
mon/MonCommands: mark 'pg set_*_ratio' deprecated
Signed-off-by: Sage Weil <sage@redhat.com>
Sage Weil [Wed, 12 Jul 2017 18:21:18 +0000 (13:21 -0500)]
Merge pull request #16263 from liupan1111/wip-fix-fio
test/fio: remove experimental option for bluestore & rocksdb.
Reviewed-by: Casey Bodley <cbodley@redhat.com>
Sage Weil [Wed, 12 Jul 2017 17:10:47 +0000 (12:10 -0500)]
Merge pull request #15643 from liewegas/wip-health
mon: revamp health check/warning system
Sage Weil [Wed, 12 Jul 2017 13:17:55 +0000 (09:17 -0400)]
mon/PGMap: adjust scrub checks to avoid overflow for future stamps
Avoid an overflow (and false warning) when scrub stamps are in the future.
Signed-off-by: Sage Weil <sage@redhat.com>
Sage Weil [Wed, 12 Jul 2017 12:10:47 +0000 (08:10 -0400)]
qa/workunits/cephtool/test.sh: adjust full tests to avoid races
OSDs may report fullness in any order.
Signed-off-by: Sage Weil <sage@redhat.com>
Sage Weil [Tue, 11 Jul 2017 03:48:47 +0000 (23:48 -0400)]
qa/tasks/ceph: wait for osds to come up before creating pool
Avoid health warnings.
Signed-off-by: Sage Weil <sage@redhat.com>
Sage Weil [Tue, 11 Jul 2017 03:39:31 +0000 (23:39 -0400)]
qa/tasks/ceph_test_case.py: update health check helpers
Signed-off-by: Sage Weil <sage@redhat.com>
Sage Weil [Mon, 10 Jul 2017 16:40:01 +0000 (12:40 -0400)]
qa/suites/fs: whitelist health warnings
Signed-off-by: Sage Weil <sage@redhat.com>
Sage Weil [Mon, 10 Jul 2017 16:39:50 +0000 (12:39 -0400)]
qa/suites/rgw/thrash: whitelist
Signed-off-by: Sage Weil <sage@redhat.com>
Sage Weil [Mon, 10 Jul 2017 16:25:23 +0000 (12:25 -0400)]
qa/suites/rbd: whitelist health messages
Signed-off-by: Sage Weil <sage@redhat.com>
Sage Weil [Thu, 6 Jul 2017 21:58:16 +0000 (17:58 -0400)]
qa: whitelist health warnings
Signed-off-by: Sage Weil <sage@redhat.com>
Sage Weil [Fri, 7 Jul 2017 03:24:52 +0000 (23:24 -0400)]
qa/workunits/cephtool/test.sh: adjust for new health error codes
Signed-off-by: Sage Weil <sage@redhat.com>
Sage Weil [Thu, 6 Jul 2017 21:53:34 +0000 (17:53 -0400)]
mon/MgrMonitor: clear last_beacon after mon election
The last_beacon map is local to an election interval; if there is a new
election completed we should reset it or else we may kill an apparently
laggy mgr that hasn't been able to get a beacon processed due to the mon
quorum changing, or had its beacon processed on a different leader.
Signed-off-by: Sage Weil <sage@redhat.com>
John Spray [Tue, 4 Jul 2017 22:37:25 +0000 (18:37 -0400)]
mon: clean up `osd out` messages
Cleaner prose for the auto-out case, and add
a cluster log message for OSDs that go out
at the behest of the administrator.
Signed-off-by: John Spray <john.spray@redhat.com>
John Spray [Tue, 4 Jul 2017 17:29:38 +0000 (13:29 -0400)]
osd: don't log per-PG backfill messages at INFO level
This behaviour led to way too many messages going to
the cluster log when an OSD is marked in. Retain
the messages at debug level.
Signed-off-by: John Spray <john.spray@redhat.com>
John Spray [Tue, 4 Jul 2017 13:52:59 +0000 (09:52 -0400)]
mon: simplify PG health checks
Instead of a distinct health check for each possible
PG state, group the states into categories for availability,
degraded, damage, and report on that.
That way, while a PG/pool is suffering from one of those
bad PG states, health conditions don't keep toggling on and
off as we transition from one unavailable state to another
unavailable state.
Signed-off-by: John Spray <john.spray@redhat.com>
John Spray [Mon, 26 Jun 2017 18:27:38 +0000 (14:27 -0400)]
mgr/dashboard: update for new style health checks
Signed-off-by: John Spray <john.spray@redhat.com>
John Spray [Fri, 23 Jun 2017 10:39:56 +0000 (06:39 -0400)]
mon: demote cluster map prints to DEBUG level
The PaxosService subclasses should be writing out
informative log messages, and not relying on
a stream of map summary prints to communicate
changes.
Signed-off-by: John Spray <john.spray@redhat.com>
John Spray [Fri, 23 Jun 2017 10:37:53 +0000 (06:37 -0400)]
mon: prettify health check log messages
Add a "Cluster is now healthy" to give clarity
after a series of "health check cleared" that
they were the last ones.
Convert certain health check messages into
well formed sentences.
Don't print severity in the log string (it's already
expressed in the severity of the log entry.
Signed-off-by: John Spray <john.spray@redhat.com>
John Spray [Thu, 22 Jun 2017 21:41:35 +0000 (17:41 -0400)]
mgr: fix spurious PG health messages on mgr restart
Previously, the mgr would send MMonMgrReport indicating
a very unhappy PGMap to the mon right after startup.
This is a change to hold off on sending that report until
all the OSDs have reported in, or until some time has passed.
Signed-off-by: John Spray <john.spray@redhat.com>
John Spray [Thu, 22 Jun 2017 15:18:32 +0000 (11:18 -0400)]
mon: don't consider a starting mgr to be an error
The .available flag is there to tell MgrClients whether
to try and connect -- it isn't the right condition
for a health complaint.
Signed-off-by: John Spray <john.spray@redhat.com>