From: Zac Dover
Date: Sun, 15 Sep 2024 11:33:55 +0000 (+1000)
Subject: doc/rados: add osd_deep_scrub_interval setting operation
X-Git-Tag: v20.0.0~1066^2
X-Git-Url: http://git.apps.os.sepia.ceph.com/?a=commitdiff_plain;h=refs%2Fpull%2F59801%2Fhead;p=ceph.git

doc/rados: add osd_deep_scrub_interval setting operation

Add a second method of changing the value of osd_deep_scrub_interval to
remedy the condition indicated by the "PGs not deep-scrubbed in time"
warning.

This procedure was developed by Eugen Block, and is at the time of this
commit available on his blog at
https://heiterbiswolkig.blogs.nde.ag/2024/09/06/pgs-not-deep-scrubbed-in-time/

Co-authored-by: Eugen Block
Signed-off-by: Zac Dover
---

diff --git a/doc/rados/operations/health-checks.rst b/doc/rados/operations/health-checks.rst
index e6f56f9200554..94fa080d0f0f0 100644
--- a/doc/rados/operations/health-checks.rst
+++ b/doc/rados/operations/health-checks.rst
@@ -1507,6 +1507,14 @@ that they are to be cleaned, and not that they have been examined and found to
 be clean). Misplaced or degraded PGs might not be flagged as ``clean`` (see
 *PG_AVAILABILITY* and *PG_DEGRADED* above).
 
+This document offers two methods of setting the value of
+``osd_deep_scrub_interval``. The first method listed here changes the value of
+``osd_deep_scrub_interval`` globally. The second method listed here changes the
+value of ``osd_deep scrub interval`` for OSDs and for the Manager daemon.
+
+First Method
+~~~~~~~~~~~~
+
 To manually initiate a deep scrub of a clean PG, run the following command:
 
 .. prompt:: bash $
@@ -1516,7 +1524,7 @@ To manually initiate a deep scrub of a clean PG, run the following command:
 Under certain conditions, the warning ``X PGs not deep-scrubbed in time``
 appears. This might be because the cluster contains many large PGs, which take
 longer to deep-scrub. To remedy this situation, you must change the value of
-``osd_deep_scrub_interval`` either globally or for the Manager daemon.
+``osd_deep_scrub_interval`` globally.
 
 #. Confirm that ``ceph health detail`` returns a ``pgs not deep-scrubbed in
    time`` warning::
@@ -1538,6 +1546,46 @@ See `Eugen Block's blog post `_.
+Second Method
+~~~~~~~~~~~~~
+
+To manually initiate a deep scrub of a clean PG, run the following command:
+
+.. prompt:: bash $
+
+   ceph pg deep-scrub
+
+Under certain conditions, the warning ``X PGs not deep-scrubbed in time``
+appears. This might be because the cluster contains many large PGs, which take
+longer to deep-scrub. To remedy this situation, you must change the value of
+``osd_deep_scrub_interval`` for OSDs and for the Manager daemon.
+
+#. Confirm that ``ceph health detail`` returns a ``pgs not deep-scrubbed in
+   time`` warning::
+
+      # ceph health detail
+      HEALTH_WARN 1161 pgs not deep-scrubbed in time
+      [WRN] PG_NOT_DEEP_SCRUBBED: 1161 pgs not deep-scrubbed in time
+          pg 86.fff not deep-scrubbed since 2024-08-21T02:35:25.733187+0000
+
+#. Change the ``osd_deep_scrub_interval`` for OSDs:
+
+   .. prompt:: bash #
+
+      ceph config set osd osd_deep_scrub_interval 1209600
+
+#. Change the ``osd_deep_scrub_interval`` for Managers:
+
+   .. prompt:: bash #
+
+      ceph config set mgr osd_deep_scrub_interval 1209600
+
+The above procedure was developed by Eugen Block in September of 2024.
+
+See `Eugen Block's blog post `_ for much more detail.
+
+See `Redmine tracker issue #44959 `_.
+
 PG_SLOW_SNAP_TRIMMING