From 5f454a31f21b411d880dcaa2776e6d5220315cd4 Mon Sep 17 00:00:00 2001
From: Ville Ojamo <14869000+bluikko@users.noreply.github.com>
Date: Mon, 23 Oct 2023 14:27:57 +0700
Subject: [PATCH] doc/cephadm: Fix quorum restore process in troubleshooting.rst

The quorum restoring process as listed does not currently work:
cephadm enter will not work if the mon container is first stopped.
Fix the process by using cephadm shell and passing the monfs manually
as a volume.
Also fix a typo "form" to "from" and remove a double newline for
consistency.

Signed-off-by: Ville Ojamo <14869000+bluikko@users.noreply.github.com>
---
 doc/cephadm/troubleshooting.rst | 10 ++++++----
 1 file changed, 6 insertions(+), 4 deletions(-)

diff --git a/doc/cephadm/troubleshooting.rst b/doc/cephadm/troubleshooting.rst
index 5ec692881661a..d9735ee420b18 100644
--- a/doc/cephadm/troubleshooting.rst
+++ b/doc/cephadm/troubleshooting.rst
@@ -280,20 +280,22 @@ If the Ceph monitor daemons (mons) cannot form a quorum, cephadm will not be
 able to manage the cluster until quorum is restored.
 
 In order to restore the quorum, remove unhealthy monitors
-form the monmap by following these steps:
+from the monmap by following these steps:
 
 1. Stop all mons. For each mon host::
 
     ssh {mon-host}
     cephadm unit --name mon.`hostname` stop
 
-
 2. Identify a surviving monitor and log in to that host::
 
     ssh {mon-host}
-    cephadm enter --name mon.`hostname`
 
-3. Follow the steps in :ref:`rados-mon-remove-from-unhealthy`
+3. Start an interactive container and mount the surviving monitor data directory as a volume for it::
+
+    cephadm shell -v /var/lib/ceph/{fsid}/mon.{mon-host}:/var/lib/ceph/mon/ceph-{mon-host}:z
+
+4. Follow the steps in :ref:`rados-mon-remove-from-unhealthy` starting from step 3 ("extracting a copy of the monmap")
 
 .. _cephadm-manually-deploy-mgr:
 
-- 
2.39.5
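
Reviewer note (not part of the patch): as a hedged illustration of step 3, the volume argument could be assembled from the cluster fsid and the mon host name. The sketch below only prints the command rather than running it. The function name `build_mon_shell_cmd` and the sample fsid and host values are hypothetical; only the `cephadm shell -v` form itself is taken from the patch.

```shell
#!/bin/sh
# Dry-run sketch: print the "cephadm shell" command from step 3 instead of
# executing it. build_mon_shell_cmd is a hypothetical helper name.
build_mon_shell_cmd() {
    fsid="$1"       # cluster fsid (placeholder value is passed below)
    mon_host="$2"   # name of the surviving mon host
    printf 'cephadm shell -v /var/lib/ceph/%s/mon.%s:/var/lib/ceph/mon/ceph-%s:z\n' \
        "$fsid" "$mon_host" "$mon_host"
}

# Example invocation with made-up values; substitute the real fsid and host.
build_mon_shell_cmd "00000000-0000-0000-0000-000000000000" "mon1"
```

Printing the command first makes it easy to check the source and target paths before starting the container on the surviving monitor host.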