cephadm: run tcmu-runner through script to do restart on failure 53866/head
author Adam King <adking@redhat.com>
Tue, 13 Jun 2023 23:54:30 +0000 (19:54 -0400)
committer Adam King <adking@redhat.com>
Fri, 6 Oct 2023 19:16:39 +0000 (15:16 -0400)
commit 35b0511dd153e53e270752c8aa16984fbc23324b
tree bb8ab627eed62ed11e300f64e3a1730a61b81aa2
parent b4fa663fa77ba2efb5b9f24e2aeb12bf66bce1f0
cephadm: run tcmu-runner through script to do restart on failure

Currently, cephadm runs tcmu-runner as a background
process inside the unit file deployed for iscsi
(rbd-target-api is the primary process). This means
if tcmu-runner crashes for whatever reason, systemd
will not attempt to restart it. This commit sets
up a script to serve as the container entrypoint
for the tcmu-runner container that will run
tcmu-runner and also restart it on failure
(unless there are too many failures in a short
period, at which point it gives up).
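
The restart-with-give-up behavior described above can be sketched
roughly as follows. This is an illustration only, not the script this
commit adds: the function name run_with_restart, the threshold
max_failures, and the window window_seconds are all hypothetical, and
the values shown are assumptions rather than what the real entrypoint
uses.

```python
import subprocess
import time
from collections import deque
from typing import Callable, Deque, Sequence


def run_with_restart(
    cmd: Sequence[str],
    max_failures: int = 5,          # hypothetical threshold
    window_seconds: float = 60.0,   # hypothetical "short period"
    run: Callable[[Sequence[str]], int] = subprocess.call,
    clock: Callable[[], float] = time.monotonic,
) -> int:
    """Run cmd in the foreground, restarting it after each non-zero exit.

    Returns 0 if cmd exits cleanly. Gives up and returns the last exit
    code once max_failures failures have been seen within window_seconds.
    """
    failures: Deque[float] = deque()
    while True:
        rc = run(cmd)
        if rc == 0:
            return 0
        now = clock()
        failures.append(now)
        # Forget failures that have aged out of the sliding window.
        while failures and now - failures[0] > window_seconds:
            failures.popleft()
        if len(failures) >= max_failures:
            return rc
```

Under these assumptions, something like run_with_restart(["tcmu-runner"])
would keep the process in the foreground of its container and only
return once it exits cleanly or fails too often in a short period.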

The hope is to eventually drop use of this script
for a better solution in squid onward, but this
should be helpful on older releases (quincy and
pacific at least) where we won't be able to
bring that better solution.

Fixes: https://tracker.ceph.com/issues/61667
Signed-off-by: Adam King <adking@redhat.com>
(cherry picked from commit 47eb6b3f62afe993073429b02051ae0343d7aea3)

Conflicts:
src/cephadm/tests/test_cephadm.py
src/cephadm/cephadm.py