From: Zac Dover Date: Mon, 27 Nov 2023 00:17:17 +0000 (+1000) Subject: doc/rados: edit "Everything Failed! Now What?" X-Git-Tag: v18.2.4~429^2 X-Git-Url: http://git.apps.os.sepia.ceph.com/?a=commitdiff_plain;h=112120f7be9c5bf902573543e434523811c8f50c;p=ceph.git doc/rados: edit "Everything Failed! Now What?" Edit the section "Everything Failed! Now what?" in doc/rados/troubleshooting/troubleshooting-mon.rst. Co-authored-by: Anthoy D'Atri Signed-off-by: Zac Dover (cherry picked from commit d8cd49259d797304adfbdb8eb46110e8b749256e) --- diff --git a/doc/rados/troubleshooting/troubleshooting-mon.rst b/doc/rados/troubleshooting/troubleshooting-mon.rst index 4bb9d7b5b8601..443d6c443bee3 100644 --- a/doc/rados/troubleshooting/troubleshooting-mon.rst +++ b/doc/rados/troubleshooting/troubleshooting-mon.rst @@ -616,7 +616,6 @@ The above recovery tool is unable to recover the following information: - **MDS Maps**: The MDS maps are lost. - Everything Failed! Now What? ============================ @@ -628,16 +627,20 @@ irc.oftc.net), or at ``dev@ceph.io`` and ``ceph-users@lists.ceph.com``. Make sure that you have prepared your logs and that you have them ready upon request. -See https://ceph.io/en/community/connect/ for current (as of October 2023) -information on getting in contact with the upstream Ceph community. +The upstream Ceph Slack workspace can be joined at this address: +https://ceph-storage.slack.com/ +See https://ceph.io/en/community/connect/ for current (as of December 2023) +information on getting in contact with the upstream Ceph community. Preparing your logs ------------------- -The default location for monitor logs is ``/var/log/ceph/ceph-mon.FOO.log*``. -However, if they are not there, you can find their current location by running -the following command: +The default location for Monitor logs is ``/var/log/ceph/ceph-mon.FOO.log*``. +It is possible that the location of the Monitor logs has been changed from the +default. If the location of the Monitor logs has been changed from the default +location, find the location of the Monitor logs by running the following +command: .. prompt:: bash @@ -648,21 +651,21 @@ cluster's configuration files. If Ceph is using the default debug levels, then your logs might be missing important information that would help the upstream Ceph community address your issue. -To make sure your monitor logs contain relevant information, you can raise -debug levels. Here we are interested in information from the monitors. As with -other components, the monitors have different parts that output their debug +Raise debug levels to make sure that your Monitor logs contain relevant +information. Here we are interested in information from the Monitors. As with +other components, the Monitors have different parts that output their debug information on different subsystems. If you are an experienced Ceph troubleshooter, we recommend raising the debug -levels of the most relevant subsystems. Of course, this approach might not be -easy for beginners. In most cases, however, enough information to address the -issue will be secured if the following debug levels are entered:: +levels of the most relevant subsystems. This approach might not be easy for +beginners. In most cases, however, enough information to address the issue will +be logged if the following debug levels are entered:: debug_mon = 10 debug_ms = 1 Sometimes these debug levels do not yield enough information. In such cases, -members of the upstream Ceph community might ask you to make additional changes +members of the upstream Ceph community will ask you to make additional changes to these or to other debug levels. In any case, it is better for us to receive at least some useful information than to receive an empty log. @@ -670,10 +673,12 @@ at least some useful information than to receive an empty log. Do I need to restart a monitor to adjust debug levels? ------------------------------------------------------ -No, restarting a monitor is not necessary. Debug levels may be adjusted by -using two different methods, depending on whether or not there is a quorum: +No. It is not necessary to restart a Monitor when adjusting its debug levels. + +There are two different methods for adjusting debug levels. One method is used +when there is quorum. The other is used when there is no quorum. -**If there is a quorum** +**Adjusting debug levels when there is a quorum** Either inject the debug option into the specific monitor that needs to be debugged:: @@ -685,17 +690,19 @@ using two different methods, depending on whether or not there is a quorum: ceph tell mon.* config set debug_mon 10/10 -**If there is no quorum** +**Adjusting debug levels when there is no quorum** Use the admin socket of the specific monitor that needs to be debugged and directly adjust the monitor's configuration options:: ceph daemon mon.FOO config set debug_mon 10/10 +**Returning debug levels to their default values** To return the debug levels to their default values, run the above commands -using the debug level ``1/10`` rather than ``10/10``. To check a monitor's -current values, use the admin socket and run either of the following commands: +using the debug level ``1/10`` rather than the debug level ``10/10``. To check +a Monitor's current values, use the admin socket and run either of the +following commands: .. prompt:: bash @@ -712,17 +719,17 @@ or: I Reproduced the problem with appropriate debug levels. Now what? ----------------------------------------------------------------- -We prefer that you send us only the portions of your logs that are relevant to -your monitor problems. Of course, it might not be easy for you to determine -which portions are relevant so we are willing to accept complete and -unabridged logs. However, we request that you avoid sending logs containing -hundreds of thousands of lines with no additional clarifying information. One -common-sense way of making our task easier is to write down the current time -and date when you are reproducing the problem and then extract portions of your +Send the upstream Ceph community only the portions of your logs that are +relevant to your Monitor problems. Because it might not be easy for you to +determine which portions are relevant, the upstream Ceph community accepts +complete and unabridged logs. But don't send logs containing hundreds of +thousands of lines with no additional clarifying information. One common-sense +way to help the Ceph community help you is to write down the current time and +date when you are reproducing the problem and then extract portions of your logs based on that information. -Finally, reach out to us on the mailing lists or IRC or Slack, or by filing a -new issue on the `tracker`_. +Contact the upstream Ceph community on the mailing lists or IRC or Slack, or by +filing a new issue on the `tracker`_. .. _tracker: http://tracker.ceph.com/projects/ceph/issues/new