From: Sage Weil Date: Tue, 3 May 2016 03:28:18 +0000 (-0400) Subject: osd: handle boot racing with NOUP set X-Git-Tag: v11.0.0~620^2 X-Git-Url: http://git-server-git.apps.pok.os.sepia.ceph.com/?a=commitdiff_plain;h=11e4242fbdb2f2f6f654d4cb3a7c95d5b38a88c2;p=ceph.git osd: handle boot racing with NOUP set This is a follow-on to 7139a232d26beef441ffbc13bc087baab3505ea8, which handled the NOUP set + clear case when the OSD found out about the flag being cleared. However, it's possible that the flag will get cleared but the OSD won't get a map update (because it hasn't subscribed and is not doing any work). This means that it is *more* likely than before that we will restart the boot process even though the OSD did successfully mark us up. However, as before, it is unavoidable because there is no notification of whether our boot request succeeds or not. And it is still mostly harmless (an extra mark down + up cycle). Fixes: http://tracker.ceph.com/issues/15678 Signed-off-by: Sage Weil --- diff --git a/src/osd/OSD.cc b/src/osd/OSD.cc index f7907f5c832..9680ef7f370 100644 --- a/src/osd/OSD.cc +++ b/src/osd/OSD.cc @@ -6794,9 +6794,9 @@ void OSD::_committed_osd_maps(epoch_t first, epoch_t last, MOSDMap *m) } } - if (osdmap->test_flag(CEPH_OSDMAP_NOUP) && - !newmap->test_flag(CEPH_OSDMAP_NOUP)) { - dout(10) << __func__ << " NOUP flag cleared in " << newmap->get_epoch() + if (osdmap->test_flag(CEPH_OSDMAP_NOUP) != + newmap->test_flag(CEPH_OSDMAP_NOUP)) { + dout(10) << __func__ << " NOUP flag changed in " << newmap->get_epoch() << dendl; if (is_booting()) { // this captures the case where we sent the boot message while