]>
git-server-git.apps.pok.os.sepia.ceph.com Git - cephmetrics.git/log
Zack Cerza [Wed, 2 May 2018 16:13:00 +0000 (10:13 -0600)]
ceph-grafana: Add notification channel
Signed-off-by: Zack Cerza <zack@redhat.com>
Boris Ranto [Thu, 19 Apr 2018 19:00:58 +0000 (21:00 +0200)]
Merge pull request #160 from zmc/wip-prometheus-role
Prometheus support via ceph-mgr
Reviewed-by: Boris Ranto <branto@redhat.com>
Zack Cerza [Wed, 18 Apr 2018 03:44:06 +0000 (21:44 -0600)]
ceph-prometheus: Rename container to 'prometheus'
This change requires some manual steps:
- docker rename cephmetrics-prometheus prometheus
- mv /etc/systemd/system/cephmetrics-prometheus.service
/etc/systemd/system/prometheus.service
- systemctl daemon-reload
And finally, rerun the playbook.
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Wed, 18 Apr 2018 18:59:24 +0000 (12:59 -0600)]
ceph-grafana: Use light theme
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Tue, 17 Apr 2018 20:26:48 +0000 (14:26 -0600)]
ceph-grafana: Ship a minimal grafana.ini
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Tue, 17 Apr 2018 16:38:57 +0000 (10:38 -0600)]
ceph-grafana: Use v5.0.4 for containers
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Fri, 13 Apr 2018 22:43:17 +0000 (16:43 -0600)]
ceph-grafana: Use ansible's to_json filter
'tojson' is part of jinja2 2.9, which isn't in RHEL.
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Thu, 12 Apr 2018 22:39:24 +0000 (16:39 -0600)]
ceph-grafana: Fix dashboard_dir in devel_mode
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Thu, 12 Apr 2018 16:35:49 +0000 (10:35 -0600)]
Move devel_packages out of defaults
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Thu, 29 Mar 2018 19:26:32 +0000 (13:26 -0600)]
tox: Use py.test -v for integration tests
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Thu, 29 Mar 2018 00:12:42 +0000 (18:12 -0600)]
ceph-graphite: Fix test_metrics_present
If a cluster changes over time, the test broke. When looking at stored
metrics, assert that our inventory hosts are a subset of the stored
metrics as opposed to doing an equality check.
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Wed, 28 Mar 2018 22:43:47 +0000 (16:43 -0600)]
ceph-graphite: Skip irrelevant integration tests
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Thu, 22 Feb 2018 21:52:58 +0000 (14:52 -0700)]
Merge pull request #154 from jeanchlopez/master
Fix a typo in the purge.yml Ansible playbook file
Boris Ranto [Mon, 15 Jan 2018 18:42:14 +0000 (19:42 +0100)]
Merge pull request #156 from ceph/wip-req-req
rpm: Require python-requests
Reviewed-by: Ken Dreyer <kdreyer@redhat.com>
Boris Ranto [Fri, 12 Jan 2018 10:34:28 +0000 (11:34 +0100)]
rpm: Require python-requests
Signed-off-by: Boris Ranto <branto@redhat.com>
Jean-Charles Lopez [Wed, 13 Dec 2017 13:10:34 +0000 (10:10 -0300)]
Merge pull request #1 from jeanchlopez/jeanchlopez-patch-purge-playbook-1
Fix a typo in purge.yml
Jean-Charles Lopez [Wed, 13 Dec 2017 13:00:24 +0000 (10:00 -0300)]
Fix a typo in purge.yml
Author JC Lopez
Fix. Change /var/lig/graphite-web with /var/lib/graphite-web
Boris Ranto [Wed, 8 Nov 2017 20:38:44 +0000 (21:38 +0100)]
Merge pull request #149 from ceph/wip-centos
ansible: Fixes for CentOS deployments
Reviewed-by: Boris Ranto <branto@redhat.com>
Zack Cerza [Thu, 26 Oct 2017 21:16:15 +0000 (15:16 -0600)]
Add python-requests to yum packages
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Thu, 26 Oct 2017 19:04:09 +0000 (13:04 -0600)]
ansible: Work with old Django versions
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Thu, 26 Oct 2017 15:28:29 +0000 (09:28 -0600)]
Merge pull request #148 from zmc/wip-fixes
Minor dashboard fixes
Zack Cerza [Wed, 25 Oct 2017 16:54:42 +0000 (10:54 -0600)]
ceph-rgw-workload.json: Set editable=False
BZ1506132
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Wed, 25 Oct 2017 16:54:18 +0000 (10:54 -0600)]
osd-node-detail.json: Don't hardcode cluster name
BZ1506200
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Tue, 24 Oct 2017 17:47:43 +0000 (11:47 -0600)]
Merge pull request #144 from ceph/wip-testinfra
Add some integration tests using testinfra
Boris Ranto [Tue, 24 Oct 2017 07:34:43 +0000 (09:34 +0200)]
Merge pull request #147 from ceph/wip-libsemanage-python
Require libsemanage-python on collector nodes
Reviewed-by: Boris Ranto <branto@redhat.com>
Zack Cerza [Fri, 20 Oct 2017 20:34:54 +0000 (14:34 -0600)]
Require libsemanage-python on collector nodes
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Tue, 17 Oct 2017 15:15:28 +0000 (09:15 -0600)]
Add some integration tests using testinfra
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Thu, 12 Oct 2017 02:55:13 +0000 (20:55 -0600)]
Merge pull request #141 from ceph/wip-fix-none-osd-colld
collectors: No update on fetch osd_stats failure
Boris Ranto [Wed, 11 Oct 2017 19:43:36 +0000 (21:43 +0200)]
Merge pull request #143 from ceph/wip-pyyaml
Ensure PyYAML is installed on the Grafana node
Reviewed-by: Boris Ranto <branto@redhat.com>
Zack Cerza [Wed, 11 Oct 2017 16:34:33 +0000 (10:34 -0600)]
Ensure PyYAML is installed on the Grafana node
It's needed by dashUpdater.py. BZ1497198
Signed-off-by: Zack Cerza <zack@redhat.com>
Boris Ranto [Wed, 11 Oct 2017 10:18:03 +0000 (12:18 +0200)]
Merge pull request #142 from ceph/ui-bz-fixes
Ui bz fixes
Reviewed-by: Boris Ranto <branto@redhat.com>
Paul Cuzner [Wed, 11 Oct 2017 04:16:42 +0000 (17:16 +1300)]
osd-information: disk size summary timings and metrics updated
Update to the query interval needed on larger configurations to make sure
all samples are present and OSD size table units updated. BZ1496186
Paul Cuzner [Wed, 11 Oct 2017 04:14:25 +0000 (17:14 +1300)]
osd-node-detail: updated to correct disk units
All disk units now showing as decimal not binary values. In addition
help text updated on the raw capacity panel to better explain how the
value is derived. BZ1496186
Paul Cuzner [Wed, 11 Oct 2017 04:12:53 +0000 (17:12 +1300)]
ceph-pools : sort and selection fixes
sortByMaxima added to queries and filter (pool_name) added to the
recovery pool overview panel to work the same as the others. BZ1499734
Paul Cuzner [Wed, 11 Oct 2017 04:11:02 +0000 (17:11 +1300)]
at-a-glance: updates to address OSD UP state issues
OSD Hosts and RGW Hosts panel changed to vonage status panel to better
reflect UP states BZ1498504
Zack Cerza [Tue, 10 Oct 2017 23:05:50 +0000 (17:05 -0600)]
Merge pull request #139 from ceph/wip-packaging
Finalize packaging changes
Boris Ranto [Tue, 10 Oct 2017 22:24:27 +0000 (00:24 +0200)]
rpm: Move the rpm patch off the gitroot
Signed-off-by: Boris Ranto <branto@redhat.com>
Zack Cerza [Mon, 9 Oct 2017 16:07:17 +0000 (10:07 -0600)]
Merge pull request #140 from ceph/wip-vonage-color
Tweak Vonage panel color in newer versions
Boris Ranto [Sat, 7 Oct 2017 09:28:09 +0000 (11:28 +0200)]
collectors: No update on fetch osd_stats failure
We currently do not check that we were able to successfully fetch the
osd_stas for a given osd_id which can result in collectors being down
for a while. We should check that we were able to fetch them before we
actually try to update them.
Signed-off-by: Boris Ranto <branto@redhat.com>
Zack Cerza [Fri, 6 Oct 2017 19:36:05 +0000 (13:36 -0600)]
Tweak Vonage panel color in newer versions
The plugin recently added a system for configurable colors, breaking our
earlier hack. Add a new one.
See: https://github.com/Vonage/Grafana_Status_panel/commit/
8633ba23bb2f336342ba3f7002a07be597b6e816
Signed-off-by: Zack Cerza <zack@redhat.com>
Boris Ranto [Fri, 6 Oct 2017 17:34:58 +0000 (19:34 +0200)]
rpm: ipaddr ansible filter requires python-netaddr
Signed-off-by: Boris Ranto <branto@redhat.com>
Boris Ranto [Thu, 5 Oct 2017 20:39:17 +0000 (22:39 +0200)]
ansible: Disable devel_mode
Signed-off-by: Boris Ranto <branto@redhat.com>
Boris Ranto [Fri, 6 Oct 2017 10:00:25 +0000 (12:00 +0200)]
ansible: Fix merge_vars.yml
Currently, we override the variables in merge_vars. However, if we run
the script several times (e.g. a host is a grafana and a collectd node)
then vars[item] is defined but it can be that it is not a mapping. In
this case we redefine the value to empty string breaking the actual
values of the variables.
We redefine e.g. devel_mode or use_epel to empty string this way making
it false for the grafana server node that is both grafana and the
collectd node.
Signed-off-by: Boris Ranto <branto@redhat.com>
Boris Ranto [Fri, 6 Oct 2017 07:16:40 +0000 (09:16 +0200)]
Merge pull request #138 from ceph/dboard-fixes
dashboard updates to address minor usability issues
Reviewed-by: Boris Ranto <branto@redhat.com>
Zack Cerza [Fri, 6 Oct 2017 01:54:45 +0000 (19:54 -0600)]
Merge pull request #137 from ceph/wip-graphite-migrate
Fix graphite migration
Paul Cuzner [Fri, 6 Oct 2017 01:37:40 +0000 (14:37 +1300)]
dashboard updates to address minor usability issues
ceph-at-a-glance: Capacity util panel now links directly to the capacity
chart in the ceph-cluster dashboard BZ
1496198
ceph-osd-information: query timespan changed from the default 1hr to 1m
aligning to the OSDs panel query on the 'at-a-glance' dashboard
BZ
1498504
Zack Cerza [Thu, 5 Oct 2017 19:43:57 +0000 (13:43 -0600)]
Use --fake-initial for migrations if necessary
Migrations won't work correctly if the db already exists. This manifests
in an error like:
django.db.utils.OperationalError: table "django_content_type" already exists
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Thu, 5 Oct 2017 19:42:50 +0000 (13:42 -0600)]
Use 'graphite-manage migrate' instead of 'syncdb'
syncdb is deprecated.
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Wed, 4 Oct 2017 22:17:21 +0000 (16:17 -0600)]
Merge pull request #136 from ceph/wip-db-wait
Add a note about waiting to collect data
Zack Cerza [Wed, 4 Oct 2017 21:23:41 +0000 (15:23 -0600)]
Add a note about waiting to collect data
Users might initially be confused that immediately after deployment, the
dashboard looks broken. This is because it doesn't yet have the data it
needs to function.
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Wed, 4 Oct 2017 22:09:36 +0000 (16:09 -0600)]
Merge pull request #135 from ceph/wip-sanity
Check that all hosts are referred to by FQDN
Zack Cerza [Wed, 4 Oct 2017 21:18:09 +0000 (15:18 -0600)]
Skip ansible-lint on 'systemctl show'
The linter complains about this, but ansible doesn't have the required
functionality here.
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Tue, 3 Oct 2017 20:25:14 +0000 (14:25 -0600)]
Check that all hosts are referred to by FQDN
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Wed, 4 Oct 2017 21:14:26 +0000 (15:14 -0600)]
Skip linting on sed call
lineinfile won't necessarily work well for this.
Signed-off-by: Zack Cerza <zack@redhat.com>
Boris Ranto [Tue, 3 Oct 2017 18:26:44 +0000 (20:26 +0200)]
Merge pull request #132 from ceph/wip-rhsm
ansible: Do not enable rhsm repos
Reviewed-by: Ken Dreyer <kdreyer@redhat.com>
Boris Ranto [Mon, 2 Oct 2017 09:29:01 +0000 (11:29 +0200)]
ansible: Do not enable rhsm repos
We are shipping as a regular product and as such, we cannot enable
additional repos via rhsm. The customers might even not have these repos
installed (especially the storage console repo might not be available to
them).
If we need any of the packages from these repos, we need to cross-ship
them in our product (as we already do downstream).
Signed-off-by: Boris Ranto <branto@redhat.com>
Zack Cerza [Fri, 29 Sep 2017 16:17:38 +0000 (10:17 -0600)]
Merge pull request #129 from ceph/wip-remove-refs-to-prod-repo
ansible: Remove references to old prod repo
Zack Cerza [Thu, 28 Sep 2017 16:08:53 +0000 (10:08 -0600)]
Merge pull request #125 from ceph/wip-collectd-vars
Fix undefined collectd vars
Zack Cerza [Thu, 28 Sep 2017 16:08:33 +0000 (10:08 -0600)]
Merge pull request #128 from ceph/wip-prod-repos
Add support for custom repos
Boris Ranto [Thu, 28 Sep 2017 07:56:17 +0000 (09:56 +0200)]
Merge pull request #126 from ceph/selinux-fix
selinux policy update
Boris Ranto [Wed, 27 Sep 2017 11:56:21 +0000 (13:56 +0200)]
selinux: Update the policy
This policy update should cover all the newly found avc denials.
Signed-off-by: Boris Ranto <branto@redhat.com>
Boris Ranto [Wed, 27 Sep 2017 13:31:07 +0000 (15:31 +0200)]
ansible: Remove references to old prod repo
This fixes the build failure after the patch that removed the original
repo file.
Signed-off-by: Boris Ranto <branto@redhat.com>
Zack Cerza [Mon, 25 Sep 2017 23:49:58 +0000 (17:49 -0600)]
Add support for custom repos
This is to enable testing packages built from non-master branches with
devel_mode=False
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Tue, 26 Sep 2017 00:03:32 +0000 (18:03 -0600)]
Merge pull request #124 from ceph/wip-remove-prod-repo
ansible: Remove old production repo
Zack Cerza [Fri, 22 Sep 2017 21:32:43 +0000 (15:32 -0600)]
Fix undefined collectd vars
We regressed by moving the definition of a couple of these vars. Set
them in a separate task file to avoid this in the future.
Signed-off-by: Zack Cerza <zack@redhat.com>
Boris Ranto [Fri, 22 Sep 2017 09:20:12 +0000 (11:20 +0200)]
ansible: Remove old production repo
We no longer use download.ceph.com as a production repo. We should
be removing instead of installing it.
Signed-off-by: Boris Ranto <branto@redhat.com>
pcuzner [Tue, 19 Sep 2017 00:15:31 +0000 (12:15 +1200)]
Merge pull request #121 from ceph/fix-rgw
rgw: update glob pattern used to detect rgw sockets (issue #115 )
Paul Cuzner [Tue, 19 Sep 2017 00:10:26 +0000 (12:10 +1200)]
rgw: update glob pattern used to detect rgw sockets
pcuzner [Thu, 14 Sep 2017 23:04:25 +0000 (11:04 +1200)]
Merge pull request #120 from ceph/wip-poolname
Mon._get_df_stats(): Sanitize pool names
Zack Cerza [Thu, 14 Sep 2017 22:03:00 +0000 (16:03 -0600)]
Mon._get_df_stats(): Sanitize pool names
_get_pool_stats() already replaces '.' with '_'; let's copy that
behavior
Signed-off-by: Zack Cerza <zack@redhat.com>
pcuzner [Thu, 14 Sep 2017 21:46:34 +0000 (09:46 +1200)]
Merge pull request #114 from ceph/wip-health-flap
alert-status: Use max() for health alert condition
Zack Cerza [Thu, 14 Sep 2017 21:10:38 +0000 (15:10 -0600)]
Merge pull request #119 from ceph/db-changes
dashboard changes
Paul Cuzner [Tue, 12 Sep 2017 23:59:04 +0000 (11:59 +1200)]
at-a-glance: MDS and OSDS panel changes
the MDS panel has been changed to report more detail using the status-panel
plugin (replacing singlestat).
The OSDs panel queries have been changed to better calculate the out/down
values of OSDs
Paul Cuzner [Tue, 12 Sep 2017 23:57:41 +0000 (11:57 +1200)]
iscsi-overview: multiple panel fixes
Values were shown in correctly in environments where the iscsi config had
been dropped and recreated. This update addresses issues in the following
panels; path summary, unused LUNs, defined capacity. In addition the
client charts only show entries for clients with i/o or load > 0.
Paul Cuzner [Tue, 12 Sep 2017 23:55:20 +0000 (11:55 +1200)]
network/backend: dashboards updated to use the network interfaces by prefix
Before hand the network stats uses *, and therefore included metrics for
'lo' etc, which was observed to skew reporting in some environments.
pcuzner [Tue, 12 Sep 2017 21:56:51 +0000 (09:56 +1200)]
Merge pull request #116 from ceph/wip-graphite-tz
Set timezone in Graphite's config
pcuzner [Tue, 12 Sep 2017 21:54:08 +0000 (09:54 +1200)]
Merge pull request #117 from ceph/wip-tb
Tell collectd to log any Python tracebacks
Zack Cerza [Thu, 27 Jul 2017 18:48:45 +0000 (11:48 -0700)]
Tell collectd to log any Python tracebacks
So that users can generate more useful bug reports.
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Thu, 7 Sep 2017 20:52:06 +0000 (14:52 -0600)]
Set timezone in Graphite's config
Oddly, graphite defaults to using a hardcoded timezone rather than the
system's. This throws off queries, so let's configure it.
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Thu, 7 Sep 2017 20:09:47 +0000 (14:09 -0600)]
alert-status: Use max() for health alert condition
last() appears to invite flapping.
Signed-off-by: Zack Cerza <zack@redhat.com>
pcuzner [Wed, 6 Sep 2017 20:16:22 +0000 (08:16 +1200)]
Merge pull request #111 from ceph/wip-health-db
Don't forget to deploy the new health dashboard
pcuzner [Wed, 6 Sep 2017 20:15:51 +0000 (08:15 +1200)]
Merge pull request #112 from ceph/wip-df-stats
Collect more pool stats, and display capacity in dashboard
Zack Cerza [Tue, 5 Sep 2017 22:31:45 +0000 (16:31 -0600)]
Add pool capacity chart to ceph-cluster.json
This chart will be helpful if the user wants to inspect pool capacity
over time, as opposed to the current state.
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Wed, 30 Aug 2017 19:25:29 +0000 (13:25 -0600)]
Add pool capacity to ceph-pools dashboard
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Fri, 1 Sep 2017 16:16:53 +0000 (10:16 -0600)]
ceph-grafana: Optionally update alert dashboard
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Thu, 31 Aug 2017 16:00:12 +0000 (10:00 -0600)]
dashUpdater.py: Allow updating alert dashboard
This may end up only being useful for testing, but it seems important to
have the option to update the alert dashboard.
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Fri, 1 Sep 2017 15:47:43 +0000 (09:47 -0600)]
Add alert for pool capacity
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Wed, 30 Aug 2017 18:06:08 +0000 (12:06 -0600)]
ceph-grafana: Add dashboards tag
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Mon, 28 Aug 2017 21:12:58 +0000 (15:12 -0600)]
mon: Add 'ceph df' pool stats
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Tue, 29 Aug 2017 18:18:47 +0000 (12:18 -0600)]
Merge pull request #110 from ceph/wip-eventurl
Set EventURL in cephmetrics.conf
Zack Cerza [Tue, 29 Aug 2017 18:17:53 +0000 (12:17 -0600)]
Don't forget to deploy the new health dashboard
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Fri, 25 Aug 2017 22:37:50 +0000 (16:37 -0600)]
Set EventURL in cephmetrics.conf
So that events get submitted to graphite
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Fri, 25 Aug 2017 22:36:41 +0000 (16:36 -0600)]
Move some defaults into a cephmetrics-common role
This is so that ceph-collectd and ceph-grafana can share some defaults
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Thu, 24 Aug 2017 22:37:28 +0000 (16:37 -0600)]
Merge pull request #108 from ceph/support-svc-restarts
Support svc restarts
Zack Cerza [Thu, 24 Aug 2017 22:36:57 +0000 (16:36 -0600)]
Merge pull request #109 from ceph/wip-pipelining
Use SSH pipelining
Zack Cerza [Thu, 24 Aug 2017 22:27:58 +0000 (16:27 -0600)]
Use SSH pipelining
See http://docs.ansible.com/ansible/latest/become.html#becoming-an-unprivileged-user
Signed-off-by: Zack Cerza <zack@redhat.com>
Paul Cuzner [Wed, 23 Aug 2017 21:14:44 +0000 (09:14 +1200)]
mon: simplify the admin_socket read logic
The initial commit placed logic in each area that called the admin
socket. This patch separates the admin socket call out to a separate
method, so it gets checked in one place.
Some tidy up and comments added too.
Paul Cuzner [Wed, 23 Aug 2017 21:11:52 +0000 (09:11 +1200)]
rgw: add warning when >1 rgw admin socket is detected
Paul Cuzner [Wed, 23 Aug 2017 02:58:34 +0000 (14:58 +1200)]
rgw: look for the admin_socket on each call
The admin_socket name for rgw is not fixed, unlike mon/osds. Therefore
to account for svc restarts and name changes the socket name is
determined at each get_stats cycle. If the socket isn't there, the
collector just passes back the version of radosgw to the caller and
will send stats again once a socket is detected on the host
Paul Cuzner [Wed, 23 Aug 2017 02:56:37 +0000 (14:56 +1200)]
mon: account for null dict from _admin_socket
the _admin_socket method could return a null dict if the
socket is not there (i.e. ceph-mon is down). By checking for the
empty dict, the collector can remain active while ceph-mon is
stopped and restarted during normal maintenance processes on a
host.