oid: 1.3.6.1.4.1.50495.15.1.2.4.4
annotations:
description: >
- OSD {{ $labels.ceph_daemon }} was marked down and back up at least once a
- minute for 5 minutes.
+ OSD {{ $labels.ceph_daemon }} was marked down and back up at least once a
+ minute for 5 minutes.
# alert on high deviation from average PG count
- alert: high pg count deviation
expr: abs(((ceph_osd_numpg > 0) - on (job) group_left avg(ceph_osd_numpg > 0) by (job)) / on (job) group_left avg(ceph_osd_numpg > 0) by (job)) > 0.35
oid: 1.3.6.1.4.1.50495.15.1.2.4.5
annotations:
description: >
- OSD {{ $labels.ceph_daemon }} deviates by more than 30% from
- average PG count.
+ OSD {{ $labels.ceph_daemon }} deviates by more than 30% from
+ average PG count.
# alert on high commit latency...but how high is too high
- name: mds
rules: