From: Sage Weil
Date: Tue, 20 Aug 2013 18:27:23 +0000 (-0700)
Subject: mon/Paxos: always refresh after any store_state
X-Git-Tag: v0.68~31
X-Git-Url: http://git-server-git.apps.pok.os.sepia.ceph.com/?a=commitdiff_plain;h=981eda9f7787c83dc457f061452685f499e7dd27;p=ceph.git

mon/Paxos: always refresh after any store_state

If we store any new state, we need to refresh the services, even if we
are still in the midst of Paxos recovery. This is because the
subscription path will share any committed state even when paxos is
still recovering. This prevents a race like:

 - we have maps 10..20
 - we drop out of quorum
 - we are elected leader, paxos recovery starts
 - we get one LAST with committed states that trim maps 10..15
 - we get a subscribe for map 10..20
 - we crash because 10 is no longer on disk because the PaxosService
   is out of sync with the on-disk state.

Fixes: #6045
Backport: dumpling
Signed-off-by: Sage Weil
Reviewed-by: Joao Eduardo Luis
---

diff --git a/src/mon/Paxos.cc b/src/mon/Paxos.cc
index 09b3391e182b..347810775c04 100644
--- a/src/mon/Paxos.cc
+++ b/src/mon/Paxos.cc
@@ -375,6 +375,8 @@ void Paxos::_sanity_check_store()
 // leader
 void Paxos::handle_last(MMonPaxos *last)
 {
+  bool need_refresh = false;
+
   dout(10) << "handle_last " << *last << dendl;
 
   if (!mon->is_leader()) {
@@ -401,7 +403,7 @@ void Paxos::handle_last(MMonPaxos *last)
   assert(g_conf->paxos_kill_at != 1);
 
   // store any committed values if any are specified in the message
-  store_state(last);
+  need_refresh = store_state(last);
 
   assert(g_conf->paxos_kill_at != 2);
 
@@ -477,6 +479,7 @@ void Paxos::handle_last(MMonPaxos *last)
         dout(10) << "that's everyone. active!" << dendl;
         extend_lease();
 
+        need_refresh = false;
         if (do_refresh()) {
           finish_round();
 
@@ -491,6 +494,9 @@ void Paxos::handle_last(MMonPaxos *last)
     dout(10) << "old pn, ignoring" << dendl;
   }
 
+  if (need_refresh)
+    do_refresh();
+
   last->put();
 }
 
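
The race in the commit message can be illustrated with a small standalone model. This is only a sketch under stated assumptions, not the monitor's actual code: `Store`, `Service`, `serve_subscribe`, and `simulate` are hypothetical names invented for this example, and `store_state` and `handle_last` here are simplified stand-ins that merely mirror the shape of the patched `Paxos::handle_last` (a trim applied mid-recovery, followed by an optional refresh).

```cpp
#include <cassert>
#include <map>
#include <string>

// Hypothetical stand-in for the on-disk store: map version -> blob.
struct Store {
  std::map<int, std::string> maps;
};

// Hypothetical stand-in for a PaxosService: it caches the version range
// it believes is on disk and only learns about changes via refresh().
struct Service {
  int first_committed = 0;
  int last_committed = 0;

  void refresh(const Store& s) {
    assert(!s.maps.empty());
    first_committed = s.maps.begin()->first;
    last_committed = s.maps.rbegin()->first;
  }

  // Answer a subscribe for versions from..last_committed using the cached
  // range. Returns false if a version the cache promises is missing from
  // disk (the point where the real monitor would crash).
  bool serve_subscribe(const Store& s, int from) const {
    int start = from < first_committed ? first_committed : from;
    for (int v = start; v <= last_committed; ++v)
      if (s.maps.count(v) == 0)
        return false;
    return true;
  }
};

// Simplified store_state: trim everything below trim_to and report
// whether the on-disk state changed.
bool store_state(Store& s, int trim_to) {
  bool changed = false;
  while (!s.maps.empty() && s.maps.begin()->first < trim_to) {
    s.maps.erase(s.maps.begin());
    changed = true;
  }
  return changed;
}

// Simplified handle_last for a LAST that arrives while recovery is still
// in progress. with_fix selects the behaviour added by this patch.
void handle_last(Store& s, Service& svc, int trim_to, bool with_fix) {
  bool need_refresh = store_state(s, trim_to);
  // Recovery is not finished here, so the "that's everyone" refresh
  // has not run yet.
  if (need_refresh && with_fix)
    svc.refresh(s);
}

// Replay the scenario: maps 10..20, one LAST trims 10..15, then a
// subscribe for 10..20 arrives. Returns true if the subscribe is served.
bool simulate(bool with_fix) {
  Store s;
  for (int v = 10; v <= 20; ++v)
    s.maps[v] = "map";
  Service svc;
  svc.refresh(s);                  // service view: 10..20
  handle_last(s, svc, 16, with_fix);  // disk now holds 16..20
  return svc.serve_subscribe(s, 10);
}
```

In this model `simulate(false)` returns false, because the stale service view still starts at version 10, while `simulate(true)` returns true, because the refresh moves the view to 16..20 before the subscribe is handled.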