From 8621c72feaf2b42583e3c8914ed52d07bc2ca148 Mon Sep 17 00:00:00 2001
From: Rishabh Dave
Date: Thu, 1 Aug 2024 16:17:28 +0530
Subject: [PATCH] qa/cephfs: update ignorelist

In the following two jobs from a Squid QA run, we see cluster
errors/warnings that are expected but are not in the ignorelist, which
causes these jobs to fail.

From https://pulpito.ceph.com/xiubli-2024-07-29_02:08:56-fs-wip-xiubli-testing-20240726.021939-squid-distro-default-smithi/7823678 -
"2024-07-29T05:10:00.000118+0000 mon.smithi057 (mon.0) 879 : cluster [WRN] Health detail: HEALTH_WARN insufficient standby MDS daemons available" in cluster log

From https://pulpito.ceph.com/xiubli-2024-07-29_02:08:56-fs-wip-xiubli-testing-20240726.021939-squid-distro-default-smithi/7823724 -
"2024-07-29T06:00:00.000144+0000 mon.smithi104 (mon.0) 600 : cluster [ERR] fs cephfs is degraded" in cluster log

"FS_DEGRADED" and "MDS_INSUFFICIENT_STANDBY" are already present in the
ignorelist for these jobs, but they are not sufficient to catch the
above cluster warnings/errors. Therefore, add "fs.*is degraded" and
"insufficient standby MDS daemons available" too.

Fixes: https://tracker.ceph.com/issues/67303
Signed-off-by: Rishabh Dave
(cherry picked from commit c30cdbd14dd8858772b9259804c257977bf973be)

Conflicts:
	qa/cephfs/overrides/ignorelist_health.yaml
	- A PR that was merged comparatively later in the main branch was
	  merged into the reef branch before this commit/PR. This caused
	  the file in reef to differ slightly from its main branch version,
	  which caused this conflict when the patch was applied.
---
 qa/cephfs/overrides/ignorelist_health.yaml | 2 ++
 1 file changed, 2 insertions(+)

diff --git a/qa/cephfs/overrides/ignorelist_health.yaml b/qa/cephfs/overrides/ignorelist_health.yaml
index a79ba1bbbdc..678548fe2cc 100644
--- a/qa/cephfs/overrides/ignorelist_health.yaml
+++ b/qa/cephfs/overrides/ignorelist_health.yaml
@@ -2,6 +2,7 @@ overrides:
   ceph:
     log-ignorelist:
       - FS_DEGRADED
+      - fs.*is degraded
       - filesystem is degraded
       - FS_INLINE_DATA_DEPRECATED
       - FS_WITH_FAILED_MDS
@@ -11,6 +12,7 @@ overrides:
       - MDS_DEGRADED
       - MDS_FAILED
       - MDS_INSUFFICIENT_STANDBY
+      - insufficient standby MDS daemons available
       - MDS_UP_LESS_THAN_MAX
       - online, but wants
       - filesystem is online with fewer MDS than max_mds
-- 
2.39.5
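
The commit message's claim that the existing entries do not catch the two quoted log lines can be checked with a small sketch. It assumes each ignorelist entry is treated as a regular expression searched anywhere in a cluster log line; that matching rule, and the helper name `unignored`, are illustrative assumptions and not the QA framework's actual code.

```python
import re

# Entries relevant to this patch, before and after the change.
OLD_IGNORELIST = [
    "FS_DEGRADED",
    "filesystem is degraded",
    "MDS_INSUFFICIENT_STANDBY",
]
NEW_IGNORELIST = OLD_IGNORELIST + [
    r"fs.*is degraded",
    r"insufficient standby MDS daemons available",
]

# The two cluster log lines quoted in the commit message.
LOG_LINES = [
    "2024-07-29T05:10:00.000118+0000 mon.smithi057 (mon.0) 879 : cluster [WRN] "
    "Health detail: HEALTH_WARN insufficient standby MDS daemons available",
    "2024-07-29T06:00:00.000144+0000 mon.smithi104 (mon.0) 600 : cluster [ERR] "
    "fs cephfs is degraded",
]


def unignored(lines, ignorelist):
    """Return the lines that no ignorelist pattern matches.

    Hypothetical helper: assumes regex search semantics for each entry.
    """
    patterns = [re.compile(entry) for entry in ignorelist]
    return [line for line in lines if not any(p.search(line) for p in patterns)]


if __name__ == "__main__":
    print("flagged with old list:", len(unignored(LOG_LINES, OLD_IGNORELIST)))
    print("flagged with new list:", len(unignored(LOG_LINES, NEW_IGNORELIST)))
```

Under that assumption, neither line contains the health-check codes, and "fs cephfs is degraded" does not contain "filesystem is degraded", so both lines stay flagged until the two new entries are added.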