From df71738d7923341aec069de072d2dd5094f906e1 Mon Sep 17 00:00:00 2001 From: Sage Weil Date: Sat, 18 Aug 2018 07:46:38 -0500 Subject: [PATCH] doc/mgr/devicehealth: update docs - several new commands weren't documented Signed-off-by: Sage Weil --- doc/mgr/devicehealth.rst | 20 +++++++++++++++++++- 1 file changed, 19 insertions(+), 1 deletion(-) diff --git a/doc/mgr/devicehealth.rst b/doc/mgr/devicehealth.rst index 5e0d0012192b8..8b9106f5c431e 100644 --- a/doc/mgr/devicehealth.rst +++ b/doc/mgr/devicehealth.rst @@ -13,7 +13,16 @@ The *devicehealth* module is enabled with:: ceph mgr module enable devicehealth -(It is enabled by default.) +(The module is enabled by default.) + +To turn on automatic device health monitoring, including regular (daily) +scraping of device health metrics like SMART:: + + ceph device monitoring on + +To disable monitoring,:: + + ceph device monitoring off Scraping -------- @@ -30,6 +39,10 @@ Or a single daemon's devices can be scraped with:: ceph device scrape-daemon-health-metrics +The stored health metrics for a device can be retrieved (optionally +for a specific timestamp) with:: + + ceph device show-health-metrics [sample-timestamp] Health monitoring ----------------- @@ -50,3 +63,8 @@ will automatically migrate data away from them by marking the devices The ``mgr/devicehealth/mark_out_threshold`` controls how soon an expected device failure must be before we automatically mark an osd "out". + +The stored life expectancy of all devices can be checked, and any +appropriate health alerts generated, with:: + + ceph device check-health -- 2.39.5