overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(FS_DEGRADED\)
- \(MDS_FAILED\)
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(OSD_DOWN\)
- \(OSD_
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- SLOW_OPS
- slow request
conf:
global:
osd_pool_default_size: 2
osd_pool_default_min_size: 2
- log-whitelist:
+ log-ignorelist:
- \(REQUEST_STUCK\)
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- but it is still running
conf:
mds:
conf:
mon:
mon warn on osd down out interval zero: false
- log-whitelist:
+ log-ignorelist:
- no active mgr
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- but it is still running
- objects unfound and apparently lost
tasks:
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- bad backtrace
- object missing on disk
- error reading table object
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- force file system read-only
- bad backtrace
- MDS in read-only mode
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- Replacing daemon mds
- Scrub error on inode
- Behind on trimming
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- responding to mclientcaps\(revoke\)
- not advance its oldest_client_tid
- failing to advance its oldest client/flush tid
# to permit OSDs to complain about that.
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- evicting unresponsive client
- but it is still running
- slow request
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- bad backtrace
- object missing on disk
- error reading table object
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- bad backtrace
- object missing on disk
- error reading table object
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- inode wrongly marked free
- bad backtrace on inode
- inode table repaired for inode
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- bad backtrace on directory inode
- error reading table object
- Metadata damage detected
ceph:
cephfs_ec_profile:
- disabled
- log-whitelist:
+ log-ignorelist:
- OSD full dropping all updates
- OSD near full
- pausewr flag
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- client session with non-allowable root
tasks:
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- MON_DOWN
tasks:
- cephfs_test_runner:
conf:
mgr:
debug client: 10
- log-whitelist:
+ log-ignorelist:
- OSD full dropping all updates
- OSD near full
- pausewr flag
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- MDS cache is too large
- \(MDS_CACHE_OVERSIZED\)
tasks:
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- evicting unresponsive client
- POOL_APP_NOT_ENABLED
- has not responded to cap revoke by MDS for over
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- Replacing daemon mds
- \(MDS_INSUFFICIENT_STANDBY\)
- \(MDS_ALL_DOWN\)
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- Replacing daemon mds
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(MON_DOWN\)
tasks:
ms inject delay probability: .005
ms inject delay max: 1
mon client directed command retry: 5
- log-whitelist:
+ log-ignorelist:
- \(OSD_SLOW_PING_TIME
- ceph:
mon_bind_addrvec: false
mon_bind_msgr2: false
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(FS_
- \(MDS_
ceph:
mon_bind_msgr2: false
mon_bind_addrvec: false
- log-whitelist:
+ log-ignorelist:
- scrub mismatch
- ScrubResult
- wrongly marked
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- missing required features
tasks:
- exec:
- ceph:
mon_bind_addrvec: false
mon_bind_msgr2: false
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(FS_
- \(MDS_
ceph:
mon_bind_msgr2: false
mon_bind_addrvec: false
- log-whitelist:
+ log-ignorelist:
- scrub mismatch
- ScrubResult
- wrongly marked
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- missing required features
tasks:
- exec:
- ceph:
mon_bind_addrvec: false
mon_bind_msgr2: false
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(FS_
- \(MDS_
ceph:
mon_bind_msgr2: false
mon_bind_addrvec: false
- log-whitelist:
+ log-ignorelist:
- scrub mismatch
- ScrubResult
- wrongly marked
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- missing required features
tasks:
- exec:
debuginfo: true
ceph:
# Valgrind makes everything slow, so ignore slow requests and extend heartbeat grace
- log-whitelist:
+ log-ignorelist:
- slow requests are blocked
conf:
global:
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- SLOW_OPS
- slow request
tasks:
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- force file system read-only
- bad backtrace
- MDS in read-only mode
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- responding to mclientcaps\(revoke\)
- not advance its oldest_client_tid
- failing to advance its oldest client/flush tid
# to permit OSDs to complain about that.
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- but it is still running
- slow request
- evicting unresponsive client
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- bad backtrace
- object missing on disk
- error reading table object
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- bad backtrace
- object missing on disk
- error reading table object
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- Replacing daemon mds
- \(MDS_INSUFFICIENT_STANDBY\)
- \(MDS_ALL_DOWN\)
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- inode wrongly marked free
- bad backtrace on inode
- inode table repaired for inode
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- bad backtrace on directory inode
- error reading table object
- Metadata damage detected
ceph:
cephfs_ec_profile:
- disabled
- log-whitelist:
+ log-ignorelist:
- OSD full dropping all updates
- OSD near full
- pausewr flag
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- client session with non-allowable root
tasks:
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- MON_DOWN
tasks:
- cephfs_test_runner:
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- but it is still running
- objects unfound and apparently lost
tasks:
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- Replacing daemon mds
tasks:
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- \(MON_DOWN\)
tasks:
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- SLOW_OPS
- slow request
conf:
global:
ms inject socket failures: 5000
mon client directed command retry: 5
- log-whitelist:
+ log-ignorelist:
- \(OSD_SLOW_PING_TIME
global:
ms inject socket failures: 500
mon client directed command retry: 5
- log-whitelist:
+ log-ignorelist:
- \(OSD_SLOW_PING_TIME
global:
ms inject socket failures: 5000
mon client directed command retry: 5
- log-whitelist:
+ log-ignorelist:
- \(OSD_SLOW_PING_TIME
global:
ms inject socket failures: 5000
mon client directed command retry: 5
- log-whitelist:
+ log-ignorelist:
- \(OSD_SLOW_PING_TIME
global:
ms inject socket failures: 500
mon client directed command retry: 5
- log-whitelist:
+ log-ignorelist:
- \(OSD_SLOW_PING_TIME
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- pauserd,pausewr flag\(s\) set
tasks:
global:
ms inject socket failures: 5000
mon client directed command retry: 5
- log-whitelist:
+ log-ignorelist:
- \(OSD_SLOW_PING_TIME
global:
ms inject socket failures: 500
mon client directed command retry: 5
- log-whitelist:
+ log-ignorelist:
- \(OSD_SLOW_PING_TIME
global:
ms inject socket failures: 5000
mon client directed command retry: 5
- log-whitelist:
+ log-ignorelist:
- \(OSD_SLOW_PING_TIME
global:
ms inject socket failures: 500
mon client directed command retry: 5
- log-whitelist:
+ log-ignorelist:
- \(OSD_SLOW_PING_TIME
osd:
osd backoff on peering: true
osd backoff on degraded: true
- log-whitelist:
+ log-ignorelist:
- wrongly marked me down
- objects unfound and apparently lost
tasks:
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- but it is still running
- objects unfound and apparently lost
tasks:
conf:
mon:
mon osd initial require min compat client: luminous
- log-whitelist:
+ log-ignorelist:
- wrongly marked me down
- objects unfound and apparently lost
tasks:
global:
ms inject socket failures: 500
mon client directed command retry: 5
- log-whitelist:
+ log-ignorelist:
- \(OSD_SLOW_PING_TIME
tasks:
- exec:
check-counter:
dry_run: true
ceph:
- log-whitelist:
+ log-ignorelist:
- evicting unresponsive client
- RECENT_CRASH
- ceph:
fs: xfs
wait-for-scrub: false
- log-whitelist:
+ log-ignorelist:
- \(PG_
- \(OSD_
- \(OBJECT_
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- reached quota
- \(POOL_APP_NOT_ENABLED\)
- \(PG_AVAILABILITY\)
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- \(MDS_TRIM\)
- \(MDS_SLOW_REQUEST\)
- MDS_SLOW_METADATA_IO
global:
ms inject socket failures: 5000
mon client directed command retry: 5
- log-whitelist:
+ log-ignorelist:
- \(OSD_SLOW_PING_TIME
global:
ms inject socket failures: 1500
mon client directed command retry: 5
- log-whitelist:
+ log-ignorelist:
- \(OSD_SLOW_PING_TIME
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- reached quota
- but it is still running
- overall HEALTH_
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- but it is still running
- overall HEALTH_
- \(OSDMAP_FLAGS\)
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(CACHE_POOL_NO_HIT_SET\)
- \(TOO_FEW_PGS\)
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- but it is still running
- overall HEALTH_
- \(POOL_APP_NOT_ENABLED\)
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- but it is still running
- overall HEALTH_
- \(POOL_APP_NOT_ENABLED\)
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- but it is still running
- overall HEALTH_
- \(POOL_APP_NOT_ENABLED\)
overrides:
ceph:
wait-for-scrub: false
- log-whitelist:
+ log-ignorelist:
- candidate had a stat error
- candidate had a read error
- deep-scrub 0 missing, 1 inconsistent objects
overrides:
ceph:
wait-for-scrub: false
- log-whitelist:
+ log-ignorelist:
- '!= data_digest'
- '!= omap_digest'
- '!= size'
# tests may leave mgrs broken, so don't try and call into them
# to invoke e.g. pg dump during teardown.
wait-for-scrub: false
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(MGR_DOWN\)
- \(DEVICE_IDENT_ON\)
# tests may leave mgrs broken, so don't try and call into them
# to invoke e.g. pg dump during teardown.
wait-for-scrub: false
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(MGR_DOWN\)
- \(PG_
# tests may leave mgrs broken, so don't try and call into them
# to invoke e.g. pg dump during teardown.
wait-for-scrub: false
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(MGR_DOWN\)
- \(PG_
# tests may leave mgrs broken, so don't try and call into them
# to invoke e.g. pg dump during teardown.
wait-for-scrub: false
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(MGR_DOWN\)
- \(PG_
# tests may leave mgrs broken, so don't try and call into them
# to invoke e.g. pg dump during teardown.
wait-for-scrub: false
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(MGR_DOWN\)
- \(MGR_INSIGHTS_WARNING\)
# tests may leave mgrs broken, so don't try and call into them
# to invoke e.g. pg dump during teardown.
wait-for-scrub: false
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(MGR_DOWN\)
- \(PG_
# tests may leave mgrs broken, so don't try and call into them
# to invoke e.g. pg dump during teardown.
wait-for-scrub: false
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(MGR_DOWN\)
- \(MDS_ALL_DOWN\)
# tests may leave mgrs broken, so don't try and call into them
# to invoke e.g. pg dump during teardown.
wait-for-scrub: false
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(MGR_DOWN\)
- \(PG_
# tests may leave mgrs broken, so don't try and call into them
# to invoke e.g. pg dump during teardown.
wait-for-scrub: false
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(MGR_DOWN\)
- \(PG_
mon scrub inject crc mismatch: 0.01
mon scrub inject missing keys: 0.05
# thrashing monitors may make mgr have trouble w/ its keepalive
- log-whitelist:
+ log-ignorelist:
- ScrubResult
- scrub mismatch
- overall HEALTH_
global:
ms inject socket failures: 5000
mon client directed command retry: 5
- log-whitelist:
+ log-ignorelist:
- \(OSD_SLOW_PING_TIME
mon client directed command retry: 5
mgr:
debug monc: 10
- log-whitelist:
+ log-ignorelist:
- \(OSD_SLOW_PING_TIME
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(MON_DOWN\)
- \(TOO_FEW_PGS\)
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(MON_DOWN\)
conf:
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(MON_DOWN\)
tasks:
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(MON_DOWN\)
conf:
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(MON_DOWN\)
conf:
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- slow request
- overall HEALTH_
- \(POOL_APP_NOT_ENABLED\)
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(POOL_APP_NOT_ENABLED\)
tasks:
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- reached quota
- overall HEALTH_
- \(CACHE_POOL_NO_HIT_SET\)
mon osdmap full prune txsize: 100
osd:
osd beacon report interval: 10
- log-whitelist:
+ log-ignorelist:
# setting/unsetting noup will trigger health warns,
# causing tests to fail due to health warns, even if
# the tests themselves are successful.
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- but it is still running
- overall HEALTH_
- \(PG_
global:
ms inject socket failures: 5000
mon client directed command retry: 5
- log-whitelist:
+ log-ignorelist:
- \(OSD_SLOW_PING_TIME
ms inject socket failures: 1000
mon client directed command retry: 5
mon mgr beacon grace: 90
- log-whitelist:
+ log-ignorelist:
- \(OSD_SLOW_PING_TIME
tasks:
- install:
- ceph:
- log-whitelist:
+ log-ignorelist:
- slow request
- .*clock.*skew.*
- clocks not synchronized
- date -u -s @$(expr $(date -u +%s) + 2)
- ceph:
wait-for-healthy: false
- log-whitelist:
+ log-ignorelist:
- .*clock.*skew.*
- clocks not synchronized
- overall HEALTH_
tasks:
- install:
- ceph:
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(MON_DOWN\)
- \(PG_AVAILABILITY\)
osd max object namespace len: 64
osd:
osd objectstore: filestore
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(OSDMAP_FLAGS\)
- \(OSD_
- ceph:
fs: xfs
wait-for-scrub: false
- log-whitelist:
+ log-ignorelist:
- \(PG_
- \(OSD_
- \(OBJECT_
tasks:
- install:
- ceph:
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(MGR_DOWN\)
- \(PG_
tasks:
- install:
- ceph:
- log-whitelist:
+ log-ignorelist:
- but it is still running
- had wrong client addr
- had wrong cluster addr
- [mon.a, mds.a, mgr.x, osd.0, osd.1, client.0]
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- MDS in read-only mode
- force file system read-only
- overall HEALTH_
pre-mgr-commands:
- sudo ceph config set mgr mgr/devicehealth/enable_monitoring false --force
fs: xfs
- log-whitelist:
+ log-ignorelist:
- \(PG_AVAILABILITY\)
- cram:
clients:
- ceph:
pre-mgr-commands:
- sudo ceph config set mgr mgr/devicehealth/enable_monitoring false --force
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(CACHE_POOL_NO_HIT_SET\)
conf:
ceph:
pre-mgr-commands:
- sudo ceph config set mgr mgr/devicehealth/enable_monitoring false --force
- log-whitelist:
+ log-ignorelist:
- but it is still running
- overall HEALTH_
- \(POOL_APP_NOT_ENABLED\)
- ceph:
pre-mgr-commands:
- sudo ceph config set mgr mgr/devicehealth/enable_monitoring false --force
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(CACHE_POOL_NO_HIT_SET\)
conf:
- [mon.a, mgr.x, osd.0, osd.1, osd.2, client.0]
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- is full
- overall HEALTH_
- \(POOL_FULL\)
# we may land on ext4
osd max object name len: 400
osd max object namespace len: 64
- log-whitelist:
+ log-ignorelist:
- but it is still running
- overall HEALTH_
- \(OSDMAP_FLAGS\)
ceph:
pre-mgr-commands:
- sudo ceph config set mgr mgr/devicehealth/enable_monitoring false --force
- log-whitelist:
+ log-ignorelist:
- \(OSDMAP_FLAGS\)
- \(OSD_FULL\)
- \(MDS_READ_ONLY\)
- [mon.a, mgr.x, osd.0, osd.1, osd.2, client.0]
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- \(POOL_APP_NOT_ENABLED\)
tasks:
- install:
- [mon.a, mds.a, mgr.x, osd.0, osd.1, client.0]
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- \(POOL_APP_NOT_ENABLED\)
tasks:
- install:
- ceph:
pre-mgr-commands:
- sudo ceph config set mgr mgr/devicehealth/enable_monitoring false --force
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(PG_
- \(OSD_
- [mon.a, mgr.x, osd.0, osd.1, osd.2, client.0]
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- \(OSD_DOWN\)
- \(POOL_APP_NOT_ENABLED\)
- \(SLOW_OPS\)
osd:
osd recovery sleep: .1
osd objectstore: filestore
- log-whitelist:
+ log-ignorelist:
- \(POOL_APP_NOT_ENABLED\)
- \(OSDMAP_FLAGS\)
- \(OSD_
- [mon.a, mds.a, mgr.x, osd.0, osd.1, client.0]
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- \(POOL_APP_NOT_ENABLED\)
tasks:
- install:
- ceph:
pre-mgr-commands:
- sudo ceph config set mgr mgr/devicehealth/enable_monitoring false --force
- log-whitelist:
+ log-ignorelist:
- but it is still running
- had wrong client addr
- had wrong cluster addr
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(OSDMAP_FLAGS\)
- \(OSD_
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(OSDMAP_FLAGS\)
- \(OSD_
- ceph:
pre-mgr-commands:
- sudo ceph config set mgr mgr/devicehealth/enable_monitoring false --force
- log-whitelist:
+ log-ignorelist:
- but it is still running
- overall HEALTH_
- \(OSDMAP_FLAGS\)
create_rbd_pool: false
pre-mgr-commands:
- sudo ceph config set mgr mgr/devicehealth/enable_monitoring false --force
- log-whitelist:
+ log-ignorelist:
- objects unfound and apparently lost
- overall HEALTH_
- \(OSDMAP_FLAGS\)
- ceph:
pre-mgr-commands:
- sudo ceph config set mgr mgr/devicehealth/enable_monitoring false --force
- log-whitelist:
+ log-ignorelist:
- objects unfound and apparently lost
- overall HEALTH_
- \(OSDMAP_FLAGS\)
- ceph:
pre-mgr-commands:
- sudo ceph config set mgr mgr/devicehealth/enable_monitoring false --force
- log-whitelist:
+ log-ignorelist:
- objects unfound and apparently lost
- overall HEALTH_
- \(OSDMAP_FLAGS\)
osd:
mon max pg per osd : 2
osd max pg per osd hard ratio : 1
- log-whitelist:
+ log-ignorelist:
- \(TOO_FEW_PGS\)
- \(PENDING_CREATING_PGS\)
tasks:
osd:
mon max pg per osd : 1
osd max pg per osd hard ratio : 1
- log-whitelist:
+ log-ignorelist:
- \(TOO_FEW_PGS\)
- \(PG_
- \(PENDING_CREATING_PGS\)
osd:
mon max pg per osd : 1
osd max pg per osd hard ratio : 1
- log-whitelist:
+ log-ignorelist:
- \(TOO_FEW_PGS\)
- \(PG_
- \(PENDING_CREATING_PGS\)
- ceph:
pre-mgr-commands:
- sudo ceph config set mgr mgr/devicehealth/enable_monitoring false --force
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(AUTH_BAD_CAPS\)
- workunit:
- ceph:
pre-mgr-commands:
- sudo ceph config set mgr mgr/devicehealth/enable_monitoring false --force
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(AUTH_BAD_CAPS\)
- workunit:
create_rbd_pool: false
pre-mgr-commands:
- sudo ceph config set mgr mgr/devicehealth/enable_monitoring false --force
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(OSDMAP_FLAGS\)
- \(OSD_
- ceph:
pre-mgr-commands:
- sudo ceph config set mgr mgr/devicehealth/enable_monitoring false --force
- log-whitelist:
+ log-ignorelist:
- but it is still running
- overall HEALTH_
- \(OSDMAP_FLAGS\)
- ceph:
pre-mgr-commands:
- sudo ceph config set mgr mgr/devicehealth/enable_monitoring false --force
- log-whitelist:
+ log-ignorelist:
- but it is still running
- overall HEALTH_
- \(OSDMAP_FLAGS\)
- ceph:
pre-mgr-commands:
- sudo ceph config set mgr mgr/devicehealth/enable_monitoring false --force
- log-whitelist:
+ log-ignorelist:
- but it is still running
- overall HEALTH_
- \(OSDMAP_FLAGS\)
config:
global:
osd pool default min size : 1
- log-whitelist:
+ log-ignorelist:
- objects unfound and apparently lost
- overall HEALTH_
- \(OSDMAP_FLAGS\)
create_rbd_pool: false
pre-mgr-commands:
- sudo ceph config set mgr mgr/devicehealth/enable_monitoring false --force
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(OSDMAP_FLAGS\)
- \(OSD_
- ceph:
pre-mgr-commands:
- sudo ceph config set mgr mgr/devicehealth/enable_monitoring false --force
- log-whitelist:
+ log-ignorelist:
- but it is still running
- slow request
- overall HEALTH_
- ceph:
pre-mgr-commands:
- sudo ceph config set mgr mgr/devicehealth/enable_monitoring false --force
- log-whitelist:
+ log-ignorelist:
- but it is still running
- had wrong client addr
- had wrong cluster addr
- ceph:
pre-mgr-commands:
- sudo ceph config set mgr mgr/devicehealth/enable_monitoring false --force
- log-whitelist:
+ log-ignorelist:
- missing primary copy of
- objects unfound and apparently lost
- had a read error
- ceph:
pre-mgr-commands:
- sudo ceph config set mgr mgr/devicehealth/enable_monitoring false --force
- log-whitelist:
+ log-ignorelist:
- no reply from
- overall HEALTH_
- \(MON_DOWN\)
osd max pg log entries: 1000
osd_target_pg_log_entries_per_osd: 0
osd pg log trim min: 10
- log-whitelist:
+ log-ignorelist:
- \(POOL_APP_NOT_ENABLED\)
- \(OSDMAP_FLAGS\)
- \(OSD_
pre-mgr-commands:
- sudo ceph config set mgr mgr/devicehealth/enable_monitoring false --force
fs: xfs
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(OSDMAP_FLAGS\)
- \(OSD_
- ceph:
pre-mgr-commands:
- sudo ceph config set mgr mgr/devicehealth/enable_monitoring false --force
- log-whitelist:
+ log-ignorelist:
- Reduced data availability
- OSD_.*DOWN
- \(RECENT_CRASH\)
- ceph:
pre-mgr-commands:
- sudo ceph config set mgr mgr/devicehealth/enable_monitoring false --force
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(POOL_APP_NOT_ENABLED\)
- workunit:
- ceph:
pre-mgr-commands:
- sudo ceph config set mgr mgr/devicehealth/enable_monitoring false --force
- log-whitelist:
+ log-ignorelist:
- but it is still running
- missing primary copy of
- objects unfound and apparently lost
- ceph:
pre-mgr-commands:
- sudo ceph config set mgr mgr/devicehealth/enable_monitoring false --force
- log-whitelist:
+ log-ignorelist:
- but it is still running
- missing primary copy of
- objects unfound and apparently lost
tasks:
- install:
- ceph:
- log-whitelist:
+ log-ignorelist:
- but it is still running
- thrashosds:
op_delay: 30
- ceph:
pre-mgr-commands:
- sudo ceph config set mgr mgr/devicehealth/enable_monitoring false --force
- log-whitelist:
+ log-ignorelist:
- but it is still running
- slow request
- overall HEALTH_
debug ms: 1
debug objecter: 20
debug rados: 20
- log-whitelist:
+ log-ignorelist:
- objects unfound and apparently lost
- overall HEALTH_
- \(OSDMAP_FLAGS\)
global:
ms inject socket failures: 5000
mon client directed command retry: 5
- log-whitelist:
+ log-ignorelist:
- \(OSD_SLOW_PING_TIME
mon client directed command retry: 5
mgr:
debug monc: 10
- log-whitelist:
+ log-ignorelist:
- \(OSD_SLOW_PING_TIME
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- but it is still running
- objects unfound and apparently lost
- slow request
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- but it is still running
- objects unfound and apparently lost
- slow request
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- but it is still running
- objects unfound and apparently lost
conf:
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- but it is still running
- objects unfound and apparently lost
- osd_map_cache_size
osd scrub min interval: 60
osd scrub max interval: 120
osd max backfills: 9
- log-whitelist:
+ log-ignorelist:
- but it is still running
- objects unfound and apparently lost
tasks:
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- but it is still running
- objects unfound and apparently lost
conf:
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- but it is still running
- objects unfound and apparently lost
- slow request
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- but it is still running
- objects unfound and apparently lost
- slow request
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- but it is still running
- objects unfound and apparently lost
conf:
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- but it is still running
- objects unfound and apparently lost
conf:
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- but it is still running
- objects unfound and apparently lost
conf:
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- but it is still running
- objects unfound and apparently lost
create_rbd_pool: False
osd scrub min interval: 60
osd scrub max interval: 120
osd max backfills: 9
- log-whitelist:
+ log-ignorelist:
- but it is still running
- objects unfound and apparently lost
tasks:
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- but it is still running
- objects unfound and apparently lost
conf:
overrides:
ceph:
mon_bind_msgr2: false
- log-whitelist:
+ log-ignorelist:
- \(MON_DOWN\)
conf:
global:
overrides:
ceph:
mon_bind_msgr2: false
- log-whitelist:
+ log-ignorelist:
- \(MON_DOWN\)
conf:
global:
overrides:
ceph:
mon_bind_msgr2: false
- log-whitelist:
+ log-ignorelist:
- \(MON_DOWN\)
conf:
global:
overrides:
ceph:
mon_bind_msgr2: false
- log-whitelist:
+ log-ignorelist:
- \(MON_DOWN\)
conf:
global:
overrides:
ceph:
mon_bind_msgr2: false
- log-whitelist:
+ log-ignorelist:
- \(MON_DOWN\)
conf:
global:
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- \(MON_DOWN\)
conf:
global:
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- \(MON_DOWN\)
tasks:
- install:
ms inject socket failures: 2500
ms tcp read timeout: 5
mon client directed command retry: 5
- log-whitelist:
+ log-ignorelist:
- \(OSD_SLOW_PING_TIME
mon client directed command retry: 5
osd:
osd heartbeat use min delay socket: true
- log-whitelist:
+ log-ignorelist:
- \(OSD_SLOW_PING_TIME
ms inject delay max: 1
ms inject internal delays: .002
mon client directed command retry: 5
- log-whitelist:
+ log-ignorelist:
- \(OSD_SLOW_PING_TIME
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- but it is still running
- objects unfound and apparently lost
conf:
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- but it is still running
- objects unfound and apparently lost
conf:
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- but it is still running
- objects unfound and apparently lost
- osd_map_cache_size
filestore queue throttle high multiple: 2
filestore queue throttle max multiple: 10
osd max backfills: 9
- log-whitelist:
+ log-ignorelist:
- but it is still running
- objects unfound and apparently lost
tasks:
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- but it is still running
- objects unfound and apparently lost
conf:
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- must scrub before tier agent can activate
tasks:
- exec:
ms inject socket failures: 2500
ms tcp read timeout: 5
mon client directed command retry: 5
- log-whitelist:
+ log-ignorelist:
- \(OSD_SLOW_PING_TIME
mon client directed command retry: 5
osd:
osd heartbeat use min delay socket: true
- log-whitelist:
+ log-ignorelist:
- \(OSD_SLOW_PING_TIME
ms inject delay max: 1
ms inject internal delays: .002
mon client directed command retry: 5
- log-whitelist:
+ log-ignorelist:
- \(OSD_SLOW_PING_TIME
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- but it is still running
- objects unfound and apparently lost
conf:
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- but it is still running
- objects unfound and apparently lost
conf:
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- but it is still running
- objects unfound and apparently lost
- osd_map_cache_size
filestore queue throttle high multiple: 2
filestore queue throttle max multiple: 10
osd max backfills: 9
- log-whitelist:
+ log-ignorelist:
- but it is still running
- objects unfound and apparently lost
tasks:
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- but it is still running
- objects unfound and apparently lost
conf:
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- must scrub before tier agent can activate
conf:
osd:
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- must scrub before tier agent can activate
conf:
osd:
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- must scrub before tier agent can activate
conf:
osd:
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- must scrub before tier agent can activate
conf:
osd:
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- must scrub before tier agent can activate
conf:
osd:
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- must scrub before tier agent can activate
conf:
osd:
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- must scrub before tier agent can activate
conf:
osd:
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- reached quota
- \(POOL_APP_NOT_ENABLED\)
- \(PG_AVAILABILITY\)
flavor: notcmalloc
debuginfo: true
ceph:
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(PG_
conf:
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- but it is still running
- objects unfound and apparently lost
tasks:
global:
ms inject socket failures: 5000
mon client directed command retry: 5
- log-whitelist:
+ log-ignorelist:
- \(OSD_SLOW_PING_TIME
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(MON_DOWN\)
- \(OSDMAP_FLAGS\)
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- reached quota
- overall HEALTH_
- \(CACHE_POOL_NO_HIT_SET\)
osd fast shutdown: false
debug bluestore: 1
debug bluefs: 1
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
# valgrind is slow.. we might get PGs stuck peering etc
- \(PG_
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(CACHE_POOL_NEAR_FULL\)
- \(CACHE_POOL_NO_HIT_SET\)
global:
ms inject socket failures: 5000
mon client directed command retry: 5
- log-whitelist:
+ log-ignorelist:
- \(OSD_SLOW_PING_TIME
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(CACHE_POOL_NO_HIT_SET\)
- \(POOL_APP_NOT_ENABLED\)
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- \(SLOW_OPS\)
- slow request
tasks:
global:
ms inject socket failures: 5000
mon client directed command retry: 5
- log-whitelist:
+ log-ignorelist:
- \(OSD_SLOW_PING_TIME
bdev_inject_crash_probability: .5
ceph:
fs: xfs
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(CACHE_POOL_NO_HIT_SET\)
conf:
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(CACHE_POOL_NEAR_FULL\)
- \(CACHE_POOL_NO_HIT_SET\)
global:
ms inject socket failures: 5000
mon client directed command retry: 5
- log-whitelist:
+ log-ignorelist:
- \(OSD_SLOW_PING_TIME
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(CACHE_POOL_NEAR_FULL\)
- \(CACHE_POOL_NO_HIT_SET\)
global:
ms inject socket failures: 5000
mon client directed command retry: 5
- log-whitelist:
+ log-ignorelist:
- but it is still running
- \(OSD_SLOW_PING_TIME
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(CACHE_POOL_NEAR_FULL\)
- \(CACHE_POOL_NO_HIT_SET\)
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(CACHE_POOL_NO_HIT_SET\)
- \(POOL_APP_NOT_ENABLED\)
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(CACHE_POOL_NO_HIT_SET\)
- \(POOL_APP_NOT_ENABLED\)
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(CACHE_POOL_NO_HIT_SET\)
- \(POOL_APP_NOT_ENABLED\)
global:
ms inject socket failures: 5000
mon client directed command retry: 5
- log-whitelist:
+ log-ignorelist:
- but it is still running
- \(OSD_SLOW_PING_TIME
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(CACHE_POOL_NEAR_FULL\)
- \(CACHE_POOL_NO_HIT_SET\)
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(CACHE_POOL_NEAR_FULL\)
- \(CACHE_POOL_NO_HIT_SET\)
tasks:
- install:
- ceph:
- log-whitelist:
+ log-ignorelist:
- 'application not enabled'
- workunit:
timeout: 30m
- install:
- ceph:
fs: xfs
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(CACHE_POOL_NO_HIT_SET\)
- \(POOL_APP_NOT_ENABLED\)
- install:
- ceph:
fs: xfs
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(CACHE_POOL_NO_HIT_SET\)
- \(POOL_APP_NOT_ENABLED\)
global:
ms inject socket failures: 5000
mon client directed command retry: 5
- log-whitelist:
+ log-ignorelist:
- \(OSD_SLOW_PING_TIME
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- but it is still running
- objects unfound and apparently lost
- overall HEALTH_
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- but it is still running
- objects unfound and apparently lost
tasks:
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(CACHE_POOL_NO_HIT_SET\)
- \(POOL_APP_NOT_ENABLED\)
RBD_FEATURES: "61"
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(CACHE_POOL_NO_HIT_SET\)
- \(POOL_APP_NOT_ENABLED\)
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(CACHE_POOL_NO_HIT_SET\)
- \(POOL_APP_NOT_ENABLED\)
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(CACHE_POOL_NO_HIT_SET\)
- \(POOL_APP_NOT_ENABLED\)
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(CACHE_POOL_NO_HIT_SET\)
- \(POOL_APP_NOT_ENABLED\)
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(CACHE_POOL_NO_HIT_SET\)
- \(POOL_APP_NOT_ENABLED\)
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(CACHE_POOL_NO_HIT_SET\)
- \(POOL_APP_NOT_ENABLED\)
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(CACHE_POOL_NO_HIT_SET\)
- \(POOL_APP_NOT_ENABLED\)
global:
ms inject socket failures: 5000
mon client directed command retry: 5
- log-whitelist:
+ log-ignorelist:
- \(OSD_SLOW_PING_TIME
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- reached quota
- mons down
- overall HEALTH_
- install: null
- ceph:
fs: ext4
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(OSDMAP_FLAGS\)
- \(OSD_
- install: null
- ceph:
fs: xfs
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(OSDMAP_FLAGS\)
- \(OSD_
tasks:
- install: null
- ceph:
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(OSDMAP_FLAGS\)
- \(OSD_
- install: null
- ceph:
fs: xfs
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(OSDMAP_FLAGS\)
- \(OSD_
tasks:
- install:
- ceph:
- log-whitelist:
+ log-ignorelist:
- but it is still running
- overall HEALTH_
- \(OSDMAP_FLAGS\)
- install:
- ceph:
fs: ext4
- log-whitelist:
+ log-ignorelist:
- but it is still running
- overall HEALTH_
- \(POOL_APP_NOT_ENABLED\)
tasks:
- install:
- ceph:
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(OSDMAP_FLAGS\)
- \(OSD_
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(OSDMAP_FLAGS\)
- \(OSD_
tasks:
- install:
- ceph:
- log-whitelist:
+ log-ignorelist:
- but it is still running
- objects unfound and apparently lost
- thrashosds:
tasks:
- install:
- ceph:
- log-whitelist:
+ log-ignorelist:
- but it is still running
- objects unfound and apparently lost
- thrashosds:
tasks:
- install:
- ceph:
- log-whitelist:
+ log-ignorelist:
- but it is still running
- objects unfound and apparently lost
- thrashosds:
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- failed to encode map
conf:
mon:
global:
ms inject socket failures: 5000
mon client directed command retry: 5
- log-whitelist:
+ log-ignorelist:
- \(OSD_SLOW_PING_TIME
global:
ms inject socket failures: 500
mon client directed command retry: 5
- log-whitelist:
+ log-ignorelist:
- \(OSD_SLOW_PING_TIME
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- \(MON_DOWN\)
- \(MGR_DOWN\)
- slow request
small chance to increase the number of pgs
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- but it is still running
- objects unfound and apparently lost
- log bound mismatch
restartin remaining osds
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(FS_DEGRADED\)
- \(MDS_
- client.3
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- scrub mismatch
- ScrubResult
- wrongly marked
branch: nautilus
- print: "**** done installing nautilus"
- ceph:
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(FS_
- \(MDS_
librados C and C++ api tests
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- reached quota
- \(REQUEST_SLOW\)
tasks:
small chance to increase the number of pgs
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- but it is still running
- wrongly marked me down
- objects unfound and apparently lost
mon_bind_msgr2: false
mon_bind_addrvec: false
fs: xfs
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(MON_DOWN\)
- \(MGR_DOWN\)
bluestore_warn_on_legacy_statfs: false
bluestore warn on no per pool omap: false
mon pg warn min per osd: 0
- log-whitelist:
+ log-ignorelist:
- evicting unresponsive client
- exec:
osd.0:
small chance to increase the number of pgs
overrides:
ceph:
- log-whitelist:
+ log-ignorelist:
- but it is still running
- wrongly marked me down
- objects unfound and apparently lost
return stdout or None
if first_in_ceph_log('\[ERR\]|\[WRN\]|\[SEC\]',
- config['log_whitelist']) is not None:
+ config['log_ignorelist']) is not None:
log.warning('Found errors (ERR|WRN|SEC) in cluster log')
ctx.summary['success'] = False
# use the most severe problem as the failure reason
if 'failure_reason' not in ctx.summary:
for pattern in ['\[SEC\]', '\[ERR\]', '\[WRN\]']:
- match = first_in_ceph_log(pattern, config['log_whitelist'])
+ match = first_in_ceph_log(pattern, config['log_ignorelist'])
if match is not None:
ctx.summary['failure_reason'] = \
'"{match}" in cluster log'.format(
tasks:
- ceph:
- log-whitelist: ['foo.*bar', 'bad message']
+ log-ignorelist: ['foo.*bar', 'bad message']
To run multiple ceph clusters, use multiple ceph tasks, and roles
with a cluster name prefix, e.g. cluster1.client.0. Roles with no
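
Reviewer note: the `log-ignorelist` entries renamed throughout this diff are regular expressions, and the cluster-log check fails a run only when a `[SEC]`, `[ERR]`, or `[WRN]` line matches none of them. The sketch below illustrates that filtering on an in-memory list of lines. It is not the task's implementation: `first_in_log`, `failure_reason`, and the sample log lines are hypothetical names and data introduced here, and the real `first_in_ceph_log` helper takes only a pattern and an exclusion list and reads the cluster log itself.

```python
import re


# Hypothetical stand-in for the helper referenced in the diff. It takes the
# log lines as an explicit argument so the sketch is self-contained.
def first_in_log(pattern, ignorelist, lines):
    """Return the first line matching `pattern` that no ignorelist regex matches."""
    severity = re.compile(pattern)
    ignores = [re.compile(p) for p in (ignorelist or [])]
    for line in lines:
        if not severity.search(line):
            continue  # not a line of the requested severity
        if any(rx.search(line) for rx in ignores):
            continue  # explicitly ignored by the test configuration
        return line
    return None


def failure_reason(lines, ignorelist):
    """Report the most severe unignored entry: SEC first, then ERR, then WRN."""
    for pattern in [r'\[SEC\]', r'\[ERR\]', r'\[WRN\]']:
        match = first_in_log(pattern, ignorelist, lines)
        if match is not None:
            return '"{match}" in cluster log'.format(match=match)
    return None


# Made-up log lines, for illustration only.
sample_log = [
    'cluster [WRN] overall HEALTH_WARN 1 osds down',
    'cluster [WRN] Health check failed: 1 osds down (OSD_DOWN)',
    'cluster [ERR] scrub mismatch on pg 1.0',
]
ignorelist = ['overall HEALTH_', r'\(OSD_DOWN\)']

print(failure_reason(sample_log, ignorelist))
```

With this ignorelist both warnings are suppressed and only the `[ERR]` line is reported. Because matching is a regex search, an entry such as `\(OSD_` ignores every health code with that prefix, which is why the suites in this diff can leave the pattern unterminated.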
mkfs_options=config.get('mkfs_options', None),
mount_options=config.get('mount_options', None),
skip_mgr_daemons=config.get('skip_mgr_daemons', False),
- log_whitelist=config.get('log-whitelist', []),
+ log_ignorelist=config.get('log-ignorelist', []),
cpu_profile=set(config.get('cpu_profile', []),),
cluster=config['cluster'],
mon_bind_msgr2=config.get('mon_bind_msgr2', True),
return None
if first_in_ceph_log('\[ERR\]|\[WRN\]|\[SEC\]',
- config.get('log-whitelist')) is not None:
+ config.get('log-ignorelist')) is not None:
log.warning('Found errors (ERR|WRN|SEC) in cluster log')
ctx.summary['success'] = False
# use the most severe problem as the failure reason
if 'failure_reason' not in ctx.summary:
for pattern in ['\[SEC\]', '\[ERR\]', '\[WRN\]']:
- match = first_in_ceph_log(pattern, config['log-whitelist'])
+ match = first_in_ceph_log(pattern, config['log-ignorelist'])
if match is not None:
ctx.summary['failure_reason'] = \
'"{match}" in cluster log'.format(
Test handling of osd_failsafe_nearfull_ratio and osd_failsafe_full_ratio
configuration settings
- In order for test to pass must use log-whitelist as follows
+ In order for test to pass must use log-ignorelist as follows
tasks:
- chef:
- install:
- ceph:
- log-whitelist: ['OSD near full', 'OSD full dropping all updates']
+ log-ignorelist: ['OSD near full', 'OSD full dropping all updates']
- osd_failsafe_enospc:
"""
The config should be as follows:
- Must include the log-whitelist below
+ Must include the log-ignorelist below
Must enable filestore_debug_inject_read_err config
example:
- chef:
- install:
- ceph:
- log-whitelist:
+ log-ignorelist:
- 'candidate had a stat error'
- 'candidate had a read error'
- 'deep-scrub 0 missing, 1 inconsistent objects'
- chef:
- install:
- ceph:
- log-whitelist:
+ log-ignorelist:
- '!= data_digest'
- '!= omap_digest'
- '!= size'
conf:
osd:
osd max markdown count: 1000
- log-whitelist:
+ log-ignorelist:
- overall HEALTH_
- \(OSDMAP_FLAGS\)
- \(OSD_