From: Jaya Prakash Date: Thu, 3 Jul 2025 07:15:52 +0000 (+0000) Subject: mon: Integrate discard queue overflow into pg health warnings X-Git-Tag: testing/wip-vshankar-testing-20250710.130520-debug~48^2~1 X-Git-Url: http://git.apps.os.sepia.ceph.com/?a=commitdiff_plain;h=2e121b360c729c2bcd85850b12508e7dd767d611;p=ceph-ci.git mon: Integrate discard queue overflow into pg health warnings Added a health warning mechanism to monitor the discard queue for potential overload Emits a warning if the accumulated discarded bytes in the queue exceed the configured threshold Introduced a debugging tool to simulate slow discard operations by adding a configurable delay Fixes : https://tracker.ceph.com/issues/69082 Signed-off-by: Jaya Prakash --- diff --git a/src/mon/PGMap.cc b/src/mon/PGMap.cc index 940b7ef9348..486e38de7f6 100644 --- a/src/mon/PGMap.cc +++ b/src/mon/PGMap.cc @@ -3313,6 +3313,10 @@ void PGMap::get_health_checks( summary += " experiencing stalled read in wal device of BlueFS"; } else if (asum.first == "DB_DEVICE_STALLED_READ_ALERT") { summary += " experiencing stalled read in db device of BlueFS"; + } else if (asum.first.find("_DISCARD_QUEUE") != std::string::npos) { + for (auto str : asum.second.second) { + summary += str; + } } auto& d = checks->add(asum.first, HEALTH_WARN, summary, asum.second.first);