From: xie xingguo Date: Wed, 26 Jun 2019 06:24:08 +0000 (+0800) Subject: osd/OSD: auto mark heartbeat sessions as stale and tear them down X-Git-Tag: v15.1.0~2319^2 X-Git-Url: http://git-server-git.apps.pok.os.sepia.ceph.com/?a=commitdiff_plain;h=refs%2Fpull%2F28752%2Fhead;p=ceph.git osd/OSD: auto mark heartbeat sessions as stale and tear them down The primary benefit is that the OSD doesn't need to keep a flood of blocked heartbeat messages around in memory. This prevents OSDs from accumulating heartbeat messages due to a broken switch and then exhausting the whole node's memory: Jun 11 04:19:26 host-192-168-9-12 kernel: [409881.137077] Out of memory: Kill process 1471476 (ceph-osd) score 47 or sacrifice child Jun 11 04:19:26 host-192-168-9-12 kernel: [409881.146054] Killed process 1471476 (ceph-osd) total-vm:4822548kB, anon-rss:3097860kB, file-rss:2556kB, shmem-rss:0kB Fixes: http://tracker.ceph.com/issues/40586 Signed-off-by: xie xingguo --- diff --git a/src/common/options.cc b/src/common/options.cc index d967c934bf6f..ac570b17e3df 100644 --- a/src/common/options.cc +++ b/src/common/options.cc @@ -3279,6 +3279,13 @@ std::vector