From: Igor Fedotov
Date: Tue, 4 Feb 2020 16:19:03 +0000 (+0300)
Subject: doc/rados/operations/health-checks: document bluestore spurious read
X-Git-Tag: v16.1.0~2532^2
X-Git-Url: http://git-server-git.apps.pok.os.sepia.ceph.com/?a=commitdiff_plain;h=deb0af6347ba596a941bd63fc778d48b467dac57;p=ceph.git

doc/rados/operations/health-checks: document bluestore spurious read errors alert.

Signed-off-by: Igor Fedotov
---

diff --git a/doc/rados/operations/health-checks.rst b/doc/rados/operations/health-checks.rst
index bd71a2ee937f..41502376649d 100644
--- a/doc/rados/operations/health-checks.rst
+++ b/doc/rados/operations/health-checks.rst
@@ -530,6 +530,28 @@ are correctly installed and that the OSD daemon(s) have been restarted. If the
 problem persists, check the OSD log for any clues as to the source of the
 problem.
 
+BLUESTORE_SPURIOUS_READ_ERRORS
+______________________________
+
+One or more OSDs using BlueStore have detected spurious read errors on the
+main device.
+BlueStore recovered from these errors by retrying the disk reads.
+However, such errors might indicate problems with the underlying hardware or
+I/O subsystem, which could in theory lead to permanent data corruption.
+Some observations on the root cause can be found at
+https://tracker.ceph.com/issues/22464
+
+This alert does not require an immediate response, but the affected host might
+need additional attention, e.g. upgrading to the latest OS/kernel versions and
+monitoring hardware resource utilization.
+
+This warning can be disabled on all OSDs with::
+
+  ceph config set osd bluestore_warn_on_spurious_read_errors false
+
+Alternatively, it can be disabled on a specific OSD with::
+
+  ceph config set osd.123 bluestore_warn_on_spurious_read_errors false
 
 
 Device health