From deb0af6347ba596a941bd63fc778d48b467dac57 Mon Sep 17 00:00:00 2001 From: Igor Fedotov Date: Tue, 4 Feb 2020 19:19:03 +0300 Subject: [PATCH] doc/rados/operations/health-checks: document bluestore spurious read errors alert. Signed-off-by: Igor Fedotov --- doc/rados/operations/health-checks.rst | 22 ++++++++++++++++++++++ 1 file changed, 22 insertions(+) diff --git a/doc/rados/operations/health-checks.rst b/doc/rados/operations/health-checks.rst index bd71a2ee937..41502376649 100644 --- a/doc/rados/operations/health-checks.rst +++ b/doc/rados/operations/health-checks.rst @@ -530,6 +530,28 @@ are correctly installed and that the OSD daemon(s) have been restarted. If the problem persists, check the OSD log for any clues as to the source of the problem. +BLUESTORE_SPURIOUS_READ_ERRORS +______________________________ + +One or more OSDs using BlueStore detects spurious read errors at main device. +BlueStore has recovered from these errors by retrying disk reads. +Though this might show some issues with underlying hardware, I/O subsystem, +etc. +Which theoretically might cause permanent data corruption. +Some observations on the root cause can be found at +https://tracker.ceph.com/issues/22464 + +This alert doesn't require immediate response but corresponding host might need +additional attention, e.g. upgrading to the latest OS/kernel versions and +H/W resource utilization monitoring. + +This warning can be disabled on all OSDs with:: + + ceph config set osd bluestore_warn_on_spurious_read_errors false + +Alternatively, it can be disabled on a specific OSD with:: + + ceph config set osd.123 bluestore_warn_on_spurious_read_errors false Device health -- 2.47.3