git.apps.os.sepia.ceph.com Git - ceph-ci.git/commit

author	root <penglaiyxy>
	Mon, 30 Jul 2018 01:29:48 +0000 (21:29 -0400)
committer	root <penglaiyxy>
	Thu, 2 Aug 2018 01:08:13 +0000 (21:08 -0400)
commit	00e0ab407b2e9659d9121be1217e95c8117c411e
tree	7620d67ebdc924a686894f1a15a18758d9c83113
parent	3f01b63888fc366b78b72049da8acda5f3e7f373

msg: ceph_abort() when there are enough accepter errors in msg server
In some extreme cases (we have hit one in our production cluster), when the Accepter thread breaks out of its accept loop, new clients cannot connect to the osd. Because the existing heartbeat connections are already established, other osds cannot detect the failure and therefore never notify the monitor to mark the failed osd down.
With this patch, when there are enough abnormal communication errors, we simply call ceph_abort() so that the osd goes down quickly and other osds can notify the monitor to mark the failed osd down.
Signed-off-by: penglaiyxy@gmail.com <penglaiyxy@gmail.com>
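
The behaviour described in the message can be sketched roughly as follows. This is a minimal illustration and not the code from this commit: the class name AcceptErrorBudget, its methods, and the choices to treat EAGAIN/EINTR as transient and to reset the count after a successful accept are all assumptions made for the example. In the osd the exhaustion handler would be ceph_abort(); here it defaults to std::abort() and can be replaced for testing.

```cpp
#include <cassert>
#include <cerrno>
#include <cstdlib>
#include <functional>
#include <utility>

// Hypothetical helper (not from the patch): counts consecutive accept()
// failures and invokes a fatal handler once the limit is exceeded.
class AcceptErrorBudget {
 public:
  explicit AcceptErrorBudget(
      int max_errors,
      std::function<void()> on_exhausted = [] { std::abort(); })
      : max_errors_(max_errors), on_exhausted_(std::move(on_exhausted)) {
    assert(max_errors_ >= 0);
  }

  // Call after a successful accept(): the listener is healthy again.
  // (Assumption: only consecutive failures count.)
  void on_success() { errors_ = 0; }

  // Call with errno after a failed accept().
  void on_error(int err) {
    // Assumption: these are transient and do not count against the budget.
    if (err == EAGAIN || err == EINTR) {
      return;
    }
    if (++errors_ > max_errors_) {
      // Going down loudly lets peers notice and report the failure,
      // instead of leaving a daemon that silently refuses new clients.
      on_exhausted_();
    }
  }

  int errors() const { return errors_; }

 private:
  int max_errors_;
  int errors_ = 0;
  std::function<void()> on_exhausted_;
};
```

An accept loop would call on_success() whenever accept() returns a valid descriptor and on_error(errno) otherwise. The limit itself would come from a configuration option, which is presumably why the commit touches options.cc and legacy_config_opts.h.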

src/common/legacy_config_opts.h
src/common/options.cc
src/msg/async/AsyncMessenger.cc
src/msg/simple/Accepter.cc