From: Sebastian Wagner
Date: Mon, 15 Feb 2021 12:01:01 +0000 (+0100)
Subject: doc/cephadm: Restoring the MON quorum
X-Git-Tag: v16.2.0~178^2~3
X-Git-Url: http://git-server-git.apps.pok.os.sepia.ceph.com/?a=commitdiff_plain;h=26bff47e48bea6d8dede3e428a9125d75aa3463e;p=ceph.git

doc/cephadm: Restoring the MON quorum

Fixes: https://tracker.ceph.com/issues/49214
Signed-off-by: Sebastian Wagner
(cherry picked from commit 02584d3ee772e4d5d8d7b4858dff192cd49c5281)
---

diff --git a/doc/cephadm/troubleshooting.rst b/doc/cephadm/troubleshooting.rst
index 5d7a29c3a06..b6f224e1f59 100644
--- a/doc/cephadm/troubleshooting.rst
+++ b/doc/cephadm/troubleshooting.rst
@@ -1,4 +1,3 @@
-
 Troubleshooting
 ===============
 
@@ -176,3 +175,28 @@ To access the admin socket, first enter the daemon container on the host::
 
     [root@mon1 ~]# cephadm enter --name
     [ceph: root@mon1 /]# ceph --admin-daemon /var/run/ceph/ceph-.asok config show
+
+
+Restoring the MON quorum
+------------------------
+
+In case the Ceph MONs cannot form a quorum, cephadm is not able
+to manage the cluster, until the quorum is restored.
+
+In order to restore the MON quorum, remove unhealthy MONs
+form the monmap by following these steps:
+
+1. Stop all MONs. For each MON host::
+
+    ssh {mon-host}
+    cephadm unit --name mon.`hostname` stop
+
+
+2. Identify a surviving monitor and log in to that host::
+
+    ssh {mon-host}
+    cephadm enter --name mon.`hostname`
+
+3. Follow the steps in :ref:`rados-mon-remove-from-unhealthy`
+
+
diff --git a/doc/rados/operations/add-or-rm-mons.rst b/doc/rados/operations/add-or-rm-mons.rst
index b0edb199bd8..124076d6e4a 100644
--- a/doc/rados/operations/add-or-rm-mons.rst
+++ b/doc/rados/operations/add-or-rm-mons.rst
@@ -164,6 +164,7 @@ quorum.
 
 #. Remove the monitor entry from ``ceph.conf``.
 
+.. _rados-mon-remove-from-unhealthy:
 
 Removing Monitors from an Unhealthy Cluster
 -------------------------------------------
@@ -176,7 +177,7 @@ quorum.
 #. Stop all ``ceph-mon`` daemons on all monitor hosts. ::
 
     ssh {mon-host}
-    service ceph stop mon || stop ceph-mon-all
+    systemctl stop ceph-mon.target
     # and repeat for all mons
 
 #. Identify a surviving monitor and log in to that host. ::
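
Step 1 of the new "Restoring the MON quorum" section (stop the MON on every MON host) can be sketched as a small loop. This is an illustration, not part of the patch. The host names `mon1 mon2 mon3`, the `DRY_RUN` switch, and the `stop_mon` helper are hypothetical. Like the patch, it assumes each MON daemon is named after its host's `hostname` output. By default the script only prints the commands it would run.

```shell
#!/bin/sh
# Sketch only: defaults to a dry run that prints the commands instead of running them.
set -eu

# Hypothetical host names; replace with the real MON hosts.
MON_HOSTS="mon1 mon2 mon3"
DRY_RUN="${DRY_RUN:-1}"

# Single quotes defer $(hostname) so that it expands on the remote MON host,
# mirroring the `hostname` substitution used in the patch.
REMOTE_CMD='cephadm unit --name "mon.$(hostname)" stop'

# stop_mon HOST: print (dry run) or execute the stop command for one host.
stop_mon() {
    if [ "$DRY_RUN" = "1" ]; then
        printf "ssh %s '%s'\n" "$1" "$REMOTE_CMD"
    else
        ssh "$1" "$REMOTE_CMD"
    fi
}

for host in $MON_HOSTS; do
    stop_mon "$host"
done
```

Running it with `DRY_RUN=0` would execute the commands over SSH. Steps 2 and 3 are interactive, since they involve choosing a surviving monitor and editing the monmap, so they are deliberately left out of the sketch.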