From: Sage Weil Date: Sat, 18 Aug 2018 12:46:38 +0000 (-0500) Subject: doc/mgr/devicehealth: update docs X-Git-Tag: v14.0.1~451^2~1 X-Git-Url: http://git.apps.os.sepia.ceph.com/?a=commitdiff_plain;h=df71738d7923341aec069de072d2dd5094f906e1;p=ceph.git doc/mgr/devicehealth: update docs - several new commands weren't documented Signed-off-by: Sage Weil --- diff --git a/doc/mgr/devicehealth.rst b/doc/mgr/devicehealth.rst index 5e0d0012192b8..8b9106f5c431e 100644 --- a/doc/mgr/devicehealth.rst +++ b/doc/mgr/devicehealth.rst @@ -13,7 +13,16 @@ The *devicehealth* module is enabled with:: ceph mgr module enable devicehealth -(It is enabled by default.) +(The module is enabled by default.) + +To turn on automatic device health monitoring, including regular (daily) +scraping of device health metrics like SMART:: + + ceph device monitoring on + +To disable monitoring,:: + + ceph device monitoring off Scraping -------- @@ -30,6 +39,10 @@ Or a single daemon's devices can be scraped with:: ceph device scrape-daemon-health-metrics +The stored health metrics for a device can be retrieved (optionally +for a specific timestamp) with:: + + ceph device show-health-metrics [sample-timestamp] Health monitoring ----------------- @@ -50,3 +63,8 @@ will automatically migrate data away from them by marking the devices The ``mgr/devicehealth/mark_out_threshold`` controls how soon an expected device failure must be before we automatically mark an osd "out". + +The stored life expectancy of all devices can be checked, and any +appropriate health alerts generated, with:: + + ceph device check-health