In following 2 jobs from a Squid QA run we see cluster errors/warnings
which are expected but not added to ignorelist which causes these jobs
to fail.
From https://pulpito.ceph.com/xiubli-2024-07-29_02:08:56-fs-wip-xiubli-testing-
20240726.021939-squid-distro-default-smithi/
7823678 -
"2024-07-29T05:10:00.000118+0000 mon.smithi057 (mon.0) 879 : cluster [WRN] Health detail: HEALTH_WARN insufficient standby MDS daemons available" in cluster log
From https://pulpito.ceph.com/xiubli-2024-07-29_02:08:56-fs-wip-xiubli-testing-
20240726.021939-squid-distro-default-smithi/
7823724 -
"2024-07-29T06:00:00.000144+0000 mon.smithi104 (mon.0) 600 : cluster [ERR] fs cephfs is degraded" in cluster log
"FS_DEGRADED" and "MDS_INSUFFICIENT_STANDBY" are already present in the
ignorelist for these jobs bit these are not sufficient catch above
cluster warninings/errors. Therefore, add "fs.*is degraded" and
"insufficient standby MDS daemons available" too.
Fixes: https://tracker.ceph.com/issues/67303
Signed-off-by: Rishabh Dave <ridave@redhat.com>
ceph:
log-ignorelist:
- FS_DEGRADED
+ - fs.*is degraded
- FS_INLINE_DATA_DEPRECATED
- FS_WITH_FAILED_MDS
- MDS_ALL_DOWN
- MDS_DEGRADED
- MDS_FAILED
- MDS_INSUFFICIENT_STANDBY
+ - insufficient standby MDS daemons available
- MDS_UP_LESS_THAN_MAX
- filesystem is online with fewer MDS than max_mds
- POOL_APP_NOT_ENABLED