]> git-server-git.apps.pok.os.sepia.ceph.com Git - ceph.git/commitdiff
tests: rados: sleep before ceph tell osd.0 flush_pg_stats after restart 14710/head
authorNathan Cutler <ncutler@suse.com>
Fri, 21 Apr 2017 09:05:05 +0000 (11:05 +0200)
committerNathan Cutler <ncutler@suse.com>
Tue, 20 Jun 2017 06:11:39 +0000 (08:11 +0200)
Even though we wait for HEALTH_OK after restarting the daemons, they are not
ready to respond to flush_pg_stats.

The reason why the osd is not ready for "tell" command after "ceph health"
shows that the cluster is "HEALTH_OK" is that the monitor fails to be notified
that the osd in question is not up in "heatbeat_interval". Because infernalis
does not have the osd_fast_fail_on_connection_refused support, the monitor
needs longer to detect that an osd is down, and osd_heartbeat_grace is used to
determine if an osd is down.

References: http://tracker.ceph.com/issues/16239
Signed-off-by: Nathan Cutler <ncutler@suse.com>
Signed-off-by: Kefu Chai <kchai@redhat.com>
qa/suites/rados/singleton/all/ec-lost-unfound-upgrade.yaml

index 9228b30568b34e2ec3321baed56a71a72913f025..c9add9ff4e781a87ffe8e9a1afcd1b4428643b36 100644 (file)
@@ -26,5 +26,7 @@ tasks:
 - print: "upgraded mon.a and friends"
 - ceph.restart:
     daemons: [mon.a, mon.b, mon.c, osd.0, osd.1, osd.2]
+- sleep:
+    duration: 20 # http://tracker.ceph.com/issues/16239
 - ec_lost_unfound:
     parallel_bench: false