]> git-server-git.apps.pok.os.sepia.ceph.com Git - ceph.git/commit
mgr/devicehealth: warn on failing devices at 6 weeks
authorSage Weil <sage@redhat.com>
Tue, 9 Oct 2018 12:21:27 +0000 (07:21 -0500)
committerSage Weil <sage@redhat.com>
Tue, 9 Oct 2018 12:21:27 +0000 (07:21 -0500)
commitc1a9d02c9bc1b7f42b5ae82c18dbe3ca33e7b3bc
treea90ae876e6dd6296a27ae841ffd4c9d43fe76c03
parent4878508dcc7e9636ba092034bfaa9795241c47a2
mgr/devicehealth: warn on failing devices at 6 weeks

This gives us an interval where we warn before automatically marking an
OSD out.  That way the operator has an opportunity to preemptively replace
the device and incurring only a single rebalance/recovery event (vs two,
one to evacutate the failing the device, another to refill the
replacement).

Signed-off-by: Sage Weil <sage@redhat.com>
src/pybind/mgr/devicehealth/module.py