git.apps.os.sepia.ceph.com Git

author	root <penglaiyxy>
	Mon, 30 Jul 2018 01:29:48 +0000 (21:29 -0400)
committer	Prashant D <pdhange@redhat.com>
	Thu, 4 Oct 2018 01:20:39 +0000 (03:20 +0200)
commit	999980ac69d4f346ec3395e001f10abe1e763fa5
tree	8ce630fe6b36b38d39320f93f265eaad6fd03de3	tree \| snapshot
parent	2c73ec0b8069556a3aa7d26e2fa78fc3982d7778	commit \| diff

msg: ceph_abort() when there are enough accepter errors in msg server
In some extrem cases(we have met one in our production cluster), when Accepter thread break out , new client can not connect to the osd. Because the former heartbeat connections are already connected, other osd can not detect failure then notify monitor to mark the failed osd down.
In the patch, we there are abnormal communication errors ,we just ceph_abort so that osd can go down fastly and other osds can notify monitor to mark the failed osd down.
Signed-off-by: penglaiyxy@gmail.com <penglaiyxy@gmail.com>
(cherry picked from commit 00e0ab407b2e9659d9121be1217e95c8117c411e)

Conflicts:
src/common/legacy_config_opts.h : Resolved for ms_max_accept_failures
src/common/options.cc : Resolved for ms_max_accept_failures
src/msg/async/AsyncMessenger.cc : Resolved in accept
src/msg/simple/Accepter.cc : Resolved in entry

src/common/legacy_config_opts.h		diff \| blob \| history
src/common/options.cc		diff \| blob \| history
src/msg/async/AsyncMessenger.cc		diff \| blob \| history
src/msg/simple/Accepter.cc		diff \| blob \| history