]> git-server-git.apps.pok.os.sepia.ceph.com Git - cephmetrics.git/log
cephmetrics.git
7 years agoAllow for custom dashboards wip-cleanup
Boris Ranto [Fri, 27 Jul 2018 21:52:49 +0000 (23:52 +0200)]
Allow for custom dashboards

Signed-off-by: Boris Ranto <branto@redhat.com>
7 years agoThe node_exporter role should go before grafana
Boris Ranto [Fri, 27 Jul 2018 22:06:49 +0000 (00:06 +0200)]
The node_exporter role should go before grafana

Signed-off-by: Boris Ranto <branto@redhat.com>
7 years agoRename the dashboards
Boris Ranto [Fri, 27 Jul 2018 21:27:44 +0000 (23:27 +0200)]
Rename the dashboards

Signed-off-by: Boris Ranto <branto@redhat.com>
7 years agoansible: Fix playbook.yml
Boris Ranto [Fri, 27 Jul 2018 21:22:06 +0000 (23:22 +0200)]
ansible: Fix playbook.yml

Signed-off-by: Boris Ranto <branto@redhat.com>
7 years agoAdd srpm script
Boris Ranto [Fri, 27 Jul 2018 21:19:07 +0000 (23:19 +0200)]
Add srpm script

Signed-off-by: Boris Ranto <branto@redhat.com>
7 years agorpm: Remove unused files
Boris Ranto [Fri, 27 Jul 2018 21:17:01 +0000 (23:17 +0200)]
rpm: Remove unused files

Signed-off-by: Boris Ranto <branto@redhat.com>
7 years agoStrip out graphite/collectd
Boris Ranto [Fri, 27 Jul 2018 21:14:37 +0000 (23:14 +0200)]
Strip out graphite/collectd

Signed-off-by: Boris Ranto <branto@redhat.com>
7 years agoInitial cleanup commit
Boris Ranto [Fri, 27 Jul 2018 20:06:14 +0000 (22:06 +0200)]
Initial cleanup commit

- file hierarchy cleanup
- remove unnecessary files
- move links to their proper places in ansible directory

Signed-off-by: Boris Ranto <branto@redhat.com>
7 years agoMerge pull request #209 from ceph/wip-branto
pcuzner [Thu, 26 Jul 2018 02:33:47 +0000 (14:33 +1200)]
Merge pull request #209 from ceph/wip-branto

Downstream fixes

7 years agoosd-node-detail: Fix value repetition 209/head
Boris Ranto [Wed, 25 Jul 2018 16:16:02 +0000 (18:16 +0200)]
osd-node-detail: Fix value repetition

We did not sum the values for RAM usage so we ended up with a couple of
entries being shown for each RAM usage query.

Signed-off-by: Boris Ranto <branto@redhat.com>
7 years agoansible: Fix service_name indentation
Boris Ranto [Wed, 25 Jul 2018 16:12:38 +0000 (18:12 +0200)]
ansible: Fix service_name indentation

The service_name for node_exporter was not indented properly.

Signed-off-by: Boris Ranto <branto@redhat.com>
7 years agoMerge pull request #203 from ceph/wip-rpm-patch
Zack Cerza [Mon, 16 Jul 2018 15:57:09 +0000 (09:57 -0600)]
Merge pull request #203 from ceph/wip-rpm-patch

rpm: use_epel is no longer defined

7 years agoMerge pull request #208 from ceph/fix-throughput-units
Boris Ranto [Mon, 16 Jul 2018 07:30:32 +0000 (09:30 +0200)]
Merge pull request #208 from ceph/fix-throughput-units

Fix units used for throughput

Reviewed-by: Boris Ranto <branto@redhat.com>
7 years agoFix units used on throughput charts 208/head
Paul Cuzner [Fri, 13 Jul 2018 00:24:35 +0000 (12:24 +1200)]
Fix units used on throughput charts

Applies the same change to the prometheus based
dashboards as commit 239afda4debc80544112fe98d31af3f692678f57 did for
the graphite based dashboards

Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
7 years agoMerge pull request #204 from ceph/osd-info-updates
pcuzner [Thu, 12 Jul 2018 23:51:24 +0000 (11:51 +1200)]
Merge pull request #204 from ceph/osd-info-updates

Multiple fixes to OSD information dashboard

7 years agoFix units used for throughput
Paul Cuzner [Thu, 12 Jul 2018 23:49:33 +0000 (11:49 +1200)]
Fix units used for throughput

Throughput units were using binary representation,
whereas the expected unit is decimal (i.e.
MB/s not MiB/s)

Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1496186
Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
7 years agoMultiple fixes to OSD information dashboard 204/head
Paul Cuzner [Tue, 10 Jul 2018 23:41:30 +0000 (11:41 +1200)]
Multiple fixes to OSD information dashboard

Bluestore tables and charts updated, including;
- switched units from ms to secs which shows us too
- changed metric from commit to KV latency
- updated thresholds in bluestore tables
- switched from rate to irate for bluestore metrics
- updated bluestore text box description

Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
7 years agorpm: use_epel is no longer defined 203/head
Boris Ranto [Fri, 29 Jun 2018 11:25:17 +0000 (13:25 +0200)]
rpm: use_epel is no longer defined

Signed-off-by: Boris Ranto <branto@redhat.com>
7 years agoMerge pull request #201 from zmc/wip-uneditable-dbs
pcuzner [Tue, 3 Jul 2018 21:51:47 +0000 (09:51 +1200)]
Merge pull request #201 from zmc/wip-uneditable-dbs

Make dashboards uneditable by default

7 years agoMerge pull request #200 from zmc/wip-osd-info-db
pcuzner [Tue, 3 Jul 2018 21:50:28 +0000 (09:50 +1200)]
Merge pull request #200 from zmc/wip-osd-info-db

ceph-osd-information: Fix numerous bugs

7 years agoceph-osd-information: Fix numerous bugs 200/head
Zack Cerza [Thu, 28 Jun 2018 19:31:41 +0000 (13:31 -0600)]
ceph-osd-information: Fix numerous bugs

Too many to list here; almost every panel was
broken in some regard.

Signed-off-by: Zack Cerza <zack@redhat.com>
7 years agoMerge pull request #197 from ceph/add-rgw-latencies
Zack Cerza [Tue, 3 Jul 2018 18:07:58 +0000 (12:07 -0600)]
Merge pull request #197 from ceph/add-rgw-latencies

Added RGW GET/PUT Latencies

7 years agoMerge pull request #196 from ceph/iscsi-db-updates
Zack Cerza [Tue, 3 Jul 2018 18:07:02 +0000 (12:07 -0600)]
Merge pull request #196 from ceph/iscsi-db-updates

Updated to use OS metrics from default scrape job

7 years agoMerge pull request #199 from zmc/wip-cluster-db
Zack Cerza [Tue, 3 Jul 2018 18:06:08 +0000 (12:06 -0600)]
Merge pull request #199 from zmc/wip-cluster-db

ceph-cluster: Fix column styles for version tables

7 years agoMerge pull request #198 from zmc/wip-iscsi-prom
Zack Cerza [Fri, 29 Jun 2018 15:23:10 +0000 (09:23 -0600)]
Merge pull request #198 from zmc/wip-iscsi-prom

Scrape iSCSI-related exporters

7 years agoMake dashboards uneditable by default 201/head
Zack Cerza [Thu, 28 Jun 2018 22:18:51 +0000 (16:18 -0600)]
Make dashboards uneditable by default

Signed-off-by: Zack Cerza <zack@redhat.com>
7 years agoceph-prometheus: Scrape iscsi gateway exporter 198/head
Zack Cerza [Wed, 27 Jun 2018 22:58:42 +0000 (16:58 -0600)]
ceph-prometheus: Scrape iscsi gateway exporter

Signed-off-by: Zack Cerza <zack@redhat.com>
7 years agoplaybook: Install node_exporter on iscsi gateways
Zack Cerza [Wed, 27 Jun 2018 22:57:44 +0000 (16:57 -0600)]
playbook: Install node_exporter on iscsi gateways

Signed-off-by: Zack Cerza <zack@redhat.com>
7 years agoAdded RGW GET/PUT Latencies 197/head
Paul Cuzner [Thu, 28 Jun 2018 21:58:35 +0000 (09:58 +1200)]
Added RGW GET/PUT Latencies

Added multiple charts showing GET/PUT latencies
at overview and RGW detail levels. In addition
the failed HTTP request panel has been changed
from a singlestat to a graph to visualize the
failure rates across all RGW instances.

Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
7 years agoUpdated to use OS metrics from default scrape job 196/head
Paul Cuzner [Thu, 28 Jun 2018 04:47:03 +0000 (16:47 +1200)]
Updated to use OS metrics from default scrape job

All node_exporter scrapes are now done under the
same job (called node) so the dashboard now uses
an updated template query to identify the correct
host to pull out the OS metrics by iscsi gateway

Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
7 years agoceph-cluster: Fix column styles for version tables 199/head
Zack Cerza [Tue, 26 Jun 2018 20:32:19 +0000 (14:32 -0600)]
ceph-cluster: Fix column styles for version tables

We were using 'id', not 'ceph_daemon' and as a result that column wasn't
showing up.

Signed-off-by: Zack Cerza <zack@redhat.com>
7 years agoMerge pull request #195 from zmc/wip-dashboard-fixes
Zack Cerza [Tue, 26 Jun 2018 18:50:30 +0000 (12:50 -0600)]
Merge pull request #195 from zmc/wip-dashboard-fixes

Fix mon_server queries in network-usage-by-node and set home db

7 years agoMerge pull request #187 from ceph/wip-rpm
Zack Cerza [Tue, 26 Jun 2018 18:45:09 +0000 (12:45 -0600)]
Merge pull request #187 from ceph/wip-rpm

rpm: Update spec file for recent changes

7 years agorpm: Modify node_exporter service name 187/head
Boris Ranto [Tue, 26 Jun 2018 18:20:43 +0000 (20:20 +0200)]
rpm: Modify node_exporter service name

Signed-off-by: Boris Ranto <branto@redhat.com>
7 years agoceph-grafana: Set the admin users's home db 195/head
Zack Cerza [Fri, 22 Jun 2018 21:58:16 +0000 (15:58 -0600)]
ceph-grafana: Set the admin users's home db

If a human (or API request) changes the home dashboard for the admin
account, our method of setting it at the org level will no longer be
effective. Let's keep the admin user's home dashboard set to
ceph-at-a-glance.

Signed-off-by: Zack Cerza <zack@redhat.com>
7 years agoceph-node-exporter: Fix inaccurate task name
Zack Cerza [Fri, 22 Jun 2018 18:57:00 +0000 (12:57 -0600)]
ceph-node-exporter: Fix inaccurate task name

Signed-off-by: Zack Cerza <zack@redhat.com>
7 years agonetwork-usage-by-node: Fix mon_server queries
Zack Cerza [Fri, 22 Jun 2018 18:55:21 +0000 (12:55 -0600)]
network-usage-by-node: Fix mon_server queries

The variable queries needed to drop 'mon.' from the mon names, and the
panel queries needed '[[mon_servers]]' to be wrapped in parentheses.

Signed-off-by: Zack Cerza <zack@redhat.com>
7 years agorpm: Modify container name/version
Boris Ranto [Wed, 6 Jun 2018 15:14:36 +0000 (17:14 +0200)]
rpm: Modify container name/version

Signed-off-by: Boris Ranto <branto@redhat.com>
7 years agorpm: Update spec file for recent changes
Boris Ranto [Wed, 23 May 2018 09:54:49 +0000 (11:54 +0200)]
rpm: Update spec file for recent changes

Signed-off-by: Boris Ranto <branto@redhat.com>
7 years agoMerge pull request #191 from zmc/wip-osp-fixes
Boris Ranto [Wed, 20 Jun 2018 07:44:32 +0000 (09:44 +0200)]
Merge pull request #191 from zmc/wip-osp-fixes

Fixes for OSP

Reviewed-by: Boris Ranto <branto@redhat.com>
7 years agoMerge pull request #192 from zmc/unpin-testinfra
Boris Ranto [Wed, 20 Jun 2018 07:31:23 +0000 (09:31 +0200)]
Merge pull request #192 from zmc/unpin-testinfra

Unpin testinfra

Reviewed-by: Boris Ranto <branto@redhat.com>
7 years agoUnpin testinfra 192/head
Zack Cerza [Wed, 13 Jun 2018 21:36:51 +0000 (15:36 -0600)]
Unpin testinfra

1.14.0 has been released with the fix we needed.

Signed-off-by: Zack Cerza <zack@redhat.com>
7 years agonode_exporter: Allow custom service name 191/head
Zack Cerza [Thu, 7 Jun 2018 22:40:24 +0000 (16:40 -0600)]
node_exporter: Allow custom service name

Signed-off-by: Zack Cerza <zack@redhat.com>
7 years agoceph-mgr: Cope with differently-named containers
Zack Cerza [Wed, 6 Jun 2018 19:23:22 +0000 (13:23 -0600)]
ceph-mgr: Cope with differently-named containers

We were expecting ceph-mgr@hostname, but let's also look for
ceph-mgr-hostname.

Signed-off-by: Zack Cerza <zack@redhat.com>
7 years agoMerge pull request #188 from zmc/wip-containers
Zack Cerza [Wed, 6 Jun 2018 18:55:59 +0000 (12:55 -0600)]
Merge pull request #188 from zmc/wip-containers

containers: grafana version; systemd unit comments

7 years agoMerge pull request #190 from zmc/wip-db-fixes
Zack Cerza [Wed, 6 Jun 2018 18:55:44 +0000 (12:55 -0600)]
Merge pull request #190 from zmc/wip-db-fixes

Add some dashboard tests, and make them pass

7 years agodashboards/network-usage-by-node: Fix mon_hosts 190/head
Zack Cerza [Thu, 31 May 2018 18:34:03 +0000 (12:34 -0600)]
dashboards/network-usage-by-node: Fix mon_hosts

Signed-off-by: Zack Cerza <zack@redhat.com>
7 years agodashboards: Remove cruft
Zack Cerza [Thu, 31 May 2018 18:27:12 +0000 (12:27 -0600)]
dashboards: Remove cruft

Specifically, references to collectd, influxdb,
and hostnames.

Signed-off-by: Zack Cerza <zack@redhat.com>
7 years agodashboards: Drop 'DS_LOCAL' templating
Zack Cerza [Wed, 30 May 2018 19:46:32 +0000 (13:46 -0600)]
dashboards: Drop 'DS_LOCAL' templating

Signed-off-by: Zack Cerza <zack@redhat.com>
7 years agoAdd some dashboard tests
Zack Cerza [Wed, 30 May 2018 21:01:50 +0000 (15:01 -0600)]
Add some dashboard tests

Signed-off-by: Zack Cerza <zack@redhat.com>
7 years agoMerge pull request #179 from zmc/wip-drop-use_epel
Boris Ranto [Thu, 24 May 2018 09:44:09 +0000 (11:44 +0200)]
Merge pull request #179 from zmc/wip-drop-use_epel

Drop use_epel flag

Reviewed-by: Boris Ranto <branto@redhat.com>
7 years agoWarn against modifying container systemd units 188/head
Zack Cerza [Wed, 23 May 2018 22:50:42 +0000 (16:50 -0600)]
Warn against modifying container systemd units

Signed-off-by: Zack Cerza <zack@redhat.com>
7 years agoceph-grafana: Default to 'latest' container version
Zack Cerza [Wed, 23 May 2018 22:48:47 +0000 (16:48 -0600)]
ceph-grafana: Default to 'latest' container version

Signed-off-by: Zack Cerza <zack@redhat.com>
7 years agoMerge pull request #183 from ceph/dashboard-refresh
Zack Cerza [Wed, 23 May 2018 20:05:45 +0000 (14:05 -0600)]
Merge pull request #183 from ceph/dashboard-refresh

Dashboard refresh

7 years agoDashboard uid metadata field removed 183/head
Paul Cuzner [Tue, 22 May 2018 23:51:36 +0000 (11:51 +1200)]
Dashboard uid metadata field removed

The presence of uid in the dashboard definition is causing
compatibility issues with moving to Grafana v5.1. By removing the
uid entry, the dashboards still work and can be migrated to 5.1

7 years agoinclude pg unknown in pg state table
Paul Cuzner [Tue, 22 May 2018 23:43:03 +0000 (11:43 +1200)]
include pg unknown in pg state table

7 years agoInclude pg_unknown in PG status pie chart
Paul Cuzner [Tue, 22 May 2018 23:42:18 +0000 (11:42 +1200)]
Include pg_unknown in PG status pie chart

7 years agoAdd RAM usage to OSD Node detail dashboard
Paul Cuzner [Tue, 22 May 2018 23:14:11 +0000 (11:14 +1200)]
Add RAM usage to OSD Node detail dashboard

7 years agoChanges to health history and inclusion of pg information
Paul Cuzner [Wed, 16 May 2018 08:41:16 +0000 (20:41 +1200)]
Changes to health history and inclusion of pg information

7 years agoAdded pg distribution and pg per osd tables
Paul Cuzner [Wed, 16 May 2018 07:54:04 +0000 (19:54 +1200)]
Added pg distribution and pg per osd tables

7 years agoadd alert for high pgnums on an OSD
Paul Cuzner [Wed, 16 May 2018 04:04:20 +0000 (16:04 +1200)]
add alert for high pgnums on an OSD

7 years agoUpdate prometheus source name (align with ansible deployment)
Paul Cuzner [Tue, 22 May 2018 23:21:06 +0000 (11:21 +1200)]
Update prometheus source name (align with ansible deployment)

7 years agocapacity units updated for consistency
Paul Cuzner [Mon, 14 May 2018 04:17:09 +0000 (16:17 +1200)]
capacity units updated for consistency

7 years agoDrop encrypted osd panel, add dev-class piechart
Paul Cuzner [Mon, 14 May 2018 04:15:40 +0000 (16:15 +1200)]
Drop encrypted osd panel, add dev-class piechart

7 years agoMerge pull request #181 from zmc/wip-177
Boris Ranto [Tue, 22 May 2018 12:32:56 +0000 (14:32 +0200)]
Merge pull request #181 from zmc/wip-177

Work around #177

Reviewed-by: Boris Ranto <branto@redhat.com>
7 years agoMerge pull request #185 from zmc/wip-grafana-fixes
Boris Ranto [Tue, 22 May 2018 12:25:39 +0000 (14:25 +0200)]
Merge pull request #185 from zmc/wip-grafana-fixes

Support Grafana 5.1.x

Reviewed-by: Boris Ranto <branto@redhat.com>
7 years agoceph-grafana: Restart container when creating 185/head
Zack Cerza [Mon, 21 May 2018 22:56:10 +0000 (16:56 -0600)]
ceph-grafana: Restart container when creating

This is necessary so we can move to newer versions
 of the container image.

Signed-off-by: Zack Cerza <zack@redhat.com>
7 years agoceph-grafana: Use the correct UID for 5.1.x
Zack Cerza [Mon, 21 May 2018 22:56:00 +0000 (16:56 -0600)]
ceph-grafana: Use the correct UID for 5.1.x

Signed-off-by: Zack Cerza <zack@redhat.com>
7 years agoceph-grafana: Bump container version
Zack Cerza [Mon, 21 May 2018 22:55:09 +0000 (16:55 -0600)]
ceph-grafana: Bump container version

Signed-off-by: Zack Cerza <zack@redhat.com>
7 years agoMerge pull request #167 from zmc/wip-mon-db-host
Zack Cerza [Thu, 17 May 2018 17:29:10 +0000 (11:29 -0600)]
Merge pull request #167 from zmc/wip-mon-db-host

Monitor the dashboard host

7 years agoMerge pull request #182 from zmc/wip-bluestore-selinux
Zack Cerza [Thu, 17 May 2018 17:28:56 +0000 (11:28 -0600)]
Merge pull request #182 from zmc/wip-bluestore-selinux

ceph-collectd: Only set SEL context on filestore

7 years agoceph-collectd: Set SEL context on bluestore 182/head
Zack Cerza [Thu, 10 May 2018 22:55:05 +0000 (16:55 -0600)]
ceph-collectd: Set SEL context on bluestore

... as well as filestore.

Signed-off-by: Zack Cerza <zack@redhat.com>
7 years agoMerge pull request #175 from zmc/wip-classic-fixes
Zack Cerza [Wed, 16 May 2018 22:35:22 +0000 (16:35 -0600)]
Merge pull request #175 from zmc/wip-classic-fixes

Fixes for classic mode

7 years agoMerge pull request #180 from zmc/fix-docker-tests
Zack Cerza [Mon, 14 May 2018 18:37:18 +0000 (12:37 -0600)]
Merge pull request #180 from zmc/fix-docker-tests

ceph-docker: Update tests for 'containerized' flag

7 years agoMerge pull request #173 from zmc/wip-cpu-quota
Zack Cerza [Mon, 14 May 2018 18:36:59 +0000 (12:36 -0600)]
Merge pull request #173 from zmc/wip-cpu-quota

ansible: Set CPU/RAM quotas on containers

7 years agoceph-docker: Update tests for 'containerized' flag 180/head
Zack Cerza [Thu, 10 May 2018 22:30:11 +0000 (16:30 -0600)]
ceph-docker: Update tests for 'containerized' flag

Signed-off-by: Zack Cerza <zack@redhat.com>
7 years agoWork around #177 181/head
Zack Cerza [Thu, 10 May 2018 19:16:41 +0000 (13:16 -0600)]
Work around #177

7 years agoDrop use_epel flag 179/head
Zack Cerza [Thu, 10 May 2018 22:26:53 +0000 (16:26 -0600)]
Drop use_epel flag

Signed-off-by: Zack Cerza <zack@redhat.com>
7 years agoMerge pull request #166 from zmc/wip-prom-pkg
Boris Ranto [Thu, 10 May 2018 11:55:02 +0000 (13:55 +0200)]
Merge pull request #166 from zmc/wip-prom-pkg

Allow installing prometheus and node_exporter from packages

Reviewed-by: Boris Ranto <branto@redhat.com>
7 years agoMerge pull request #176 from zmc/wip-fix-notification
Boris Ranto [Thu, 10 May 2018 08:57:19 +0000 (10:57 +0200)]
Merge pull request #176 from zmc/wip-fix-notification

ceph-grafana: Use 'admin_user', not 'user'

Reviewed-by: Boris Ranto <branto@redhat.com>
7 years agoceph-grafana: Use 'admin_user', not 'user' 176/head
Zack Cerza [Wed, 9 May 2018 19:15:42 +0000 (13:15 -0600)]
ceph-grafana: Use 'admin_user', not 'user'

We missed this when merging #165 & #170

Signed-off-by: Zack Cerza <zack@redhat.com>
7 years agotox.init: Unpin ansible for ansible-syntax 166/head
Zack Cerza [Mon, 30 Apr 2018 12:29:59 +0000 (14:29 +0200)]
tox.init: Unpin ansible for ansible-syntax

Signed-off-by: Zack Cerza <zack@redhat.com>
7 years agonode_exporter: Fix RPM name
Zack Cerza [Sat, 21 Apr 2018 02:20:52 +0000 (21:20 -0500)]
node_exporter: Fix RPM name

Signed-off-by: Zack Cerza <zack@redhat.com>
7 years agoceph-prometheus: Allow installation via packages
Zack Cerza [Fri, 20 Apr 2018 23:39:22 +0000 (18:39 -0500)]
ceph-prometheus: Allow installation via packages

Signed-off-by: Zack Cerza <zack@redhat.com>
7 years agoceph-docker: Fail if using classic mode 175/head
Zack Cerza [Tue, 8 May 2018 18:53:50 +0000 (12:53 -0600)]
ceph-docker: Fail if using classic mode

Signed-off-by: Zack Cerza <zack@redhat.com>
7 years agoclassic: Fix pie charts w/ grafana 5.0
Zack Cerza [Tue, 8 May 2018 18:48:08 +0000 (12:48 -0600)]
classic: Fix pie charts w/ grafana 5.0

When legends are off, we're seeing "Invalid dimensions for plot, width =
179, height = 0"

Signed-off-by: Zack Cerza <zack@redhat.com>
7 years agoMerge pull request #170 from zmc/wip-notification
Boris Ranto [Mon, 7 May 2018 08:37:48 +0000 (10:37 +0200)]
Merge pull request #170 from zmc/wip-notification

ceph-grafana: Add notification channel

Reviewed-by: Boris Ranto <branto@redhat.com>
7 years agoansible: Set RAM quotas on containers 173/head
Zack Cerza [Fri, 4 May 2018 18:21:06 +0000 (12:21 -0600)]
ansible: Set RAM quotas on containers

We default to 4GB RAM, and we use double that value for swap.

Signed-off-by: Zack Cerza <zack@redhat.com>
7 years agoansible: Set CPU quotas on containers
Zack Cerza [Fri, 4 May 2018 18:04:13 +0000 (12:04 -0600)]
ansible: Set CPU quotas on containers

We want to default to limiting each container to two cores. While docker
has a nicer way to do this, it isn't supported by ansible as of 2.5.2.
It's easy enough to use the slightly more awkward method, however.

Signed-off-by: Zack Cerza <zack@redhat.com>
7 years agoplaybook.yml: Add node_exporter tag
Zack Cerza [Fri, 20 Apr 2018 23:38:46 +0000 (18:38 -0500)]
playbook.yml: Add node_exporter tag

Signed-off-by: Zack Cerza <zack@redhat.com>
7 years agoMerge pull request #165 from ceph/wip-branto
Zack Cerza [Fri, 4 May 2018 17:48:26 +0000 (11:48 -0600)]
Merge pull request #165 from ceph/wip-branto

Add a couple of ansible fixes

7 years agoceph-grafana: Add notification channel 170/head
Zack Cerza [Wed, 2 May 2018 16:13:00 +0000 (10:13 -0600)]
ceph-grafana: Add notification channel

Signed-off-by: Zack Cerza <zack@redhat.com>
7 years agoceph-node-exporter: Open firewall ports 167/head
Zack Cerza [Tue, 1 May 2018 18:00:00 +0000 (12:00 -0600)]
ceph-node-exporter: Open firewall ports

Signed-off-by: Zack Cerza <zack@redhat.com>
7 years agoAdd dashboard for monitoring the db host itself
Zack Cerza [Tue, 1 May 2018 17:59:31 +0000 (11:59 -0600)]
Add dashboard for monitoring the db host itself

Signed-off-by: Zack Cerza <zack@redhat.com>
7 years agoceph-prometheus: Add separate job for db host
Zack Cerza [Tue, 1 May 2018 17:58:27 +0000 (11:58 -0600)]
ceph-prometheus: Add separate job for db host

So that we can monitor the db host's resources, without mixing that data
in with the ceph cluster's metrics.

Signed-off-by: Zack Cerza <zack@redhat.com>
7 years agoplaybook.yml: Deploy node_exporter on db host
Zack Cerza [Tue, 1 May 2018 14:52:23 +0000 (08:52 -0600)]
playbook.yml: Deploy node_exporter on db host

Signed-off-by: Zack Cerza <zack@redhat.com>
7 years agoAdd a sample group_vars file 165/head
Boris Ranto [Thu, 19 Apr 2018 22:03:22 +0000 (00:03 +0200)]
Add a sample group_vars file

Signed-off-by: Boris Ranto <branto@redhat.com>
7 years agoFix grafana admin user/password setting
Boris Ranto [Thu, 19 Apr 2018 21:16:16 +0000 (23:16 +0200)]
Fix grafana admin user/password setting

Signed-off-by: Boris Ranto <branto@redhat.com>
7 years agoExtend purge.yml
Boris Ranto [Thu, 19 Apr 2018 21:15:30 +0000 (23:15 +0200)]
Extend purge.yml

Signed-off-by: Boris Ranto <branto@redhat.com>
7 years agoSet home dashboard
Boris Ranto [Thu, 19 Apr 2018 20:36:42 +0000 (22:36 +0200)]
Set home dashboard

Signed-off-by: Boris Ranto <branto@redhat.com>