]> git-server-git.apps.pok.os.sepia.ceph.com Git - ceph.git/commitdiff
mgr/devicehealth: warn on failing devices at 6 weeks
authorSage Weil <sage@redhat.com>
Tue, 9 Oct 2018 12:21:27 +0000 (07:21 -0500)
committerSage Weil <sage@redhat.com>
Tue, 9 Oct 2018 12:21:27 +0000 (07:21 -0500)
This gives us an interval where we warn before automatically marking an
OSD out.  That way the operator has an opportunity to preemptively replace
the device and incurring only a single rebalance/recovery event (vs two,
one to evacutate the failing the device, another to refill the
replacement).

Signed-off-by: Sage Weil <sage@redhat.com>
src/pybind/mgr/devicehealth/module.py

index a7bea09854d9a1c343104129f7ce75cfdb62548a..6f069137881afbbada6aa1f8fc3a4506fd9099b8 100644 (file)
@@ -47,7 +47,7 @@ class Module(MgrModule):
         },
         {
             'name': 'warn_threshold',
-            'default': str(86400 * 14 * 2),
+            'default': str(86400 * 14 * 6),
         },
         {
             'name': 'self_heal',