git-server-git.apps.pok.os.sepia.ceph.com Git - cephmetrics.git/log
cephmetrics.git
8 years agoiscsi.py: fix to defer the import of rtslib_fb 95/head
Paul Cuzner [Tue, 8 Aug 2017 21:48:55 +0000 (09:48 +1200)]
iscsi.py: fix to defer the import of rtslib_fb

The goal of the parent module, cephmetrics, is to be generic across the
different Ceph roles. Deferring the import of rtslib_fb until the first
(and only!) ISCSIGateway object is instantiated lets cephmetrics import this
iscsi module without a problem, regardless of the runtime environment.
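
The deferred import described above can be sketched as follows. This is a
minimal illustration, not the code from iscsi.py: the class body, the
module_name parameter and the make_gateway helper are assumptions made for
the example.

```python
import importlib


class ISCSIGateway:
    """Hypothetical collector that needs a module only once it is instantiated."""

    def __init__(self, module_name="rtslib_fb"):
        # The import happens here rather than at the top of the file, so
        # importing the file that defines this class cannot fail on hosts
        # where the module is not installed.
        self._rtslib = importlib.import_module(module_name)


def make_gateway(module_name="rtslib_fb"):
    """Return a gateway object, or None when the module is unavailable."""
    try:
        return ISCSIGateway(module_name)
    except ImportError:
        return None
```

With this layout only hosts that actually create an ISCSIGateway need the
library installed.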

8 years agoMerge pull request #92 from ceph/iscsi-support
Zack Cerza [Tue, 8 Aug 2017 18:10:13 +0000 (11:10 -0700)]
Merge pull request #92 from ceph/iscsi-support

Add iSCSI gateway support

8 years agoiscsi.py: updates to address code review comments 92/head
Paul Cuzner [Mon, 7 Aug 2017 23:44:16 +0000 (11:44 +1200)]
iscsi.py: updates to address code review comments

8 years agoiscsi-overview: minor fixes and rename of the iscsi_gateways variable
Paul Cuzner [Mon, 7 Aug 2017 23:43:44 +0000 (11:43 +1200)]
iscsi-overview: minor fixes and rename of the iscsi_gateways variable

The client configuration panel was not excluding null entries, so when
rbd images were unmasked from clients and not reused, they would still
show up in the table.

In addition, the templating variable iscsi_gateway was renamed to
iscsi_gateways, aligning with the naming of osd_servers and rgw_servers.

8 years agodashboard.yml : updated to show the variable needed by the iscsi dashboard
Paul Cuzner [Mon, 7 Aug 2017 23:41:50 +0000 (11:41 +1200)]
dashboard.yml : updated to show the variable needed by the iscsi dashboard

The iscsi_gateways templating variable is used to generate the graphs,
so for iscsi based deployments this variable needs to be defined to
ensure the queries work correctly in grafana.

8 years agoMerge pull request #88 from ceph/wip-centos
Zack Cerza [Mon, 7 Aug 2017 22:12:24 +0000 (15:12 -0700)]
Merge pull request #88 from ceph/wip-centos

Support CentOS 7 in devel_mode

8 years agocephmetrics.py: updated to detect and collect stats for iSCSI gateways
Paul Cuzner [Mon, 7 Aug 2017 00:20:10 +0000 (12:20 +1200)]
cephmetrics.py: updated to detect and collect stats for iSCSI gateways

The probe method now looks for the sysfs kernel entries that denote an
iscsi gateway is running on the node. When this dir is found an instance
of the iscsi collector (ISCSIGateway) is created and polled during
every read callback.
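
A rough sketch of this kind of probe is shown below. The commit does not name
the directory it checks, so the path is an assumption (the usual LIO configfs
location), and probe_roles is a hypothetical helper rather than the real probe
method.

```python
import os

# Assumed location of the LIO iSCSI entries; the commit message does not
# state the exact directory, so treat this path as a placeholder.
ISCSI_SYSFS_DIR = "/sys/kernel/config/target/iscsi"


def probe_roles(iscsi_dir=ISCSI_SYSFS_DIR):
    """Return the list of roles detected on this node (iSCSI only here)."""
    roles = []
    if os.path.isdir(iscsi_dir):
        # The directory exists, so an iSCSI gateway is taken to be running.
        roles.append("iscsi")
    return roles
```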

8 years agodashboard.yml: updated to include the iscsi-overview dashboard
Paul Cuzner [Mon, 7 Aug 2017 00:17:54 +0000 (12:17 +1200)]
dashboard.yml: updated to include the iscsi-overview dashboard

8 years agoiscsi.py : iscsi collector added to create stats for iSCSI gateways
Paul Cuzner [Mon, 7 Aug 2017 00:17:30 +0000 (12:17 +1200)]
iscsi.py : iscsi collector added to create stats for iSCSI gateways

The iscsi module interacts with rtslib_fb to extract LIO configuration
settings to create performance and configuration metrics.

8 years agoiscsi-overview: New dashboard to monitor iscsi gateway nodes
Paul Cuzner [Mon, 7 Aug 2017 00:15:59 +0000 (12:15 +1200)]
iscsi-overview: New dashboard to monitor iscsi gateway nodes

8 years agoMerge pull request #90 from ceph/dashboard-updates
Zack Cerza [Fri, 4 Aug 2017 04:18:26 +0000 (21:18 -0700)]
Merge pull request #90 from ceph/dashboard-updates

Dashboard updates

8 years agoosd-information: minor fixes for larger environments 90/head
Paul Cuzner [Thu, 3 Aug 2017 04:53:57 +0000 (16:53 +1200)]
osd-information: minor fixes for larger environments

In a 600+ OSD environment the charts were based on averageSeries, which
was taking a long time. This has now been changed, so the comparison
chart only shows current values for a given OSD.

8 years agonetwork-usage : fix yaxis units on mon/rgw and osd graphs
Paul Cuzner [Thu, 3 Aug 2017 04:20:46 +0000 (16:20 +1200)]
network-usage : fix yaxis units on mon/rgw and osd graphs

Initial commit did not use the right metric - missed due to low load, more
obvious on larger systems!

8 years agoat-a-glance: link updates to reflect dashboard name changes
Paul Cuzner [Wed, 2 Aug 2017 02:20:41 +0000 (14:20 +1200)]
at-a-glance: link updates to reflect dashboard name changes

In addition to the link updates
- scrub state reflects both scrub and deep-scrub status
- the scrub panel no longer turns to "warn" when scrub is active - it's
  a natural feature of the cluster and not a problem! However, it will
  turn red, if scrub or deep-scrub is disabled
- disk util panel changed to a graph and switched to average and 95%ile
  (95%ile on a busy cluster just shows too much variance)
- OSDs panel now links to the ceph-osd-information dashboard

8 years agoceph-backend-storage: rows reorganized and additional host tables added
Paul Cuzner [Wed, 2 Aug 2017 02:16:16 +0000 (14:16 +1200)]
ceph-backend-storage: rows reorganized and additional host tables added

OSD capacity by Host table added to the summary row, showing capacity by
host

Performance data split into two rows, allowing each one to be hidden
independently

8 years agoceph-osd-information : osd dashboard now provides summary and performance data
Paul Cuzner [Wed, 2 Aug 2017 02:13:00 +0000 (14:13 +1200)]
ceph-osd-information : osd dashboard now provides summary and performance data

Summary row shows:
- osd count
- osd up count
- osds down
- disk size summary (pie chart showing what sizes of disk are in the cluster)
- table of osd to disk size
- OSD encryption summary (how many of my OSDs are encrypted?)
- OSD type status (how many OSDs are filestore vs bluestore)

Panel includes an OSD id which is used as a filter for the filestore
performance row

The performance row now shows average OSD performance for a single OSD or
all OSDs. This can then be used for side-by-side comparison with OSD
performance across the cluster at the 95%ile.

8 years agoceph-pools: rename of the dashboard that provides pool performance data
Paul Cuzner [Wed, 2 Aug 2017 02:08:59 +0000 (14:08 +1200)]
ceph-pools: rename of the dashboard that provides pool performance data

8 years agoosd-node-detail : multiple changes to better show OSD level metrics
Paul Cuzner [Wed, 2 Aug 2017 02:08:24 +0000 (14:08 +1200)]
osd-node-detail : multiple changes to better show OSD level metrics

changes include;
- support for osd and disk name to provide filters to the graphs
- add osd overview row containing
  - raw capacity panel
  - host.disk -> disk size table
  - host.disk -> osd id
- shortcut links added to other overview type dashboards

8 years agoreplacement version for ceph-rados dashboard
Paul Cuzner [Wed, 2 Aug 2017 02:04:47 +0000 (14:04 +1200)]
replacement version for ceph-rados dashboard

Renamed to ceph-cluster to better reflect the metrics and data shown on
the dashboard

8 years agodiagram updated to reflect name changes
Paul Cuzner [Wed, 2 Aug 2017 02:03:48 +0000 (14:03 +1200)]
diagram updated to reflect name changes

8 years agoremoved dashboards
Paul Cuzner [Wed, 2 Aug 2017 02:03:31 +0000 (14:03 +1200)]
removed dashboards

8 years agoMerge pull request #89 from ceph/wip-branto
Zack Cerza [Wed, 2 Aug 2017 01:12:39 +0000 (18:12 -0700)]
Merge pull request #89 from ceph/wip-branto

rpm: Drop collectors dependency for main package

8 years agorpm: Drop collectors dependency for main package 89/head
Boris Ranto [Tue, 1 Aug 2017 20:29:39 +0000 (22:29 +0200)]
rpm: Drop collectors dependency for main package

Signed-off-by: Boris Ranto <branto@redhat.com>
8 years agoMerge pull request #87 from ceph/dmcrypt-support
Zack Cerza [Mon, 31 Jul 2017 22:08:12 +0000 (15:08 -0700)]
Merge pull request #87 from ceph/dmcrypt-support

Dmcrypt support

8 years agoMention CentOS 7 support 88/head
Zack Cerza [Fri, 28 Jul 2017 22:39:50 +0000 (15:39 -0700)]
Mention CentOS 7 support

Signed-off-by: Zack Cerza <zack@redhat.com>
8 years agoceph-grafana: Require PyYAML on yum systems
Zack Cerza [Fri, 28 Jul 2017 22:38:30 +0000 (15:38 -0700)]
ceph-grafana: Require PyYAML on yum systems

Signed-off-by: Zack Cerza <zack@redhat.com>
8 years agoosd: add % used to each OSD 87/head
Paul Cuzner [Fri, 28 Jul 2017 08:02:45 +0000 (20:02 +1200)]
osd: add % used to each OSD

Percent used was not available within the osd metric tree, only under the
physical disk. With its inclusion under the osd, percent_used can be
referenced by osd_id directly, which makes queries easier to write.

8 years agoosd: support device-mapper (dmcrypt) osd's/journals
Paul Cuzner [Fri, 28 Jul 2017 02:28:19 +0000 (14:28 +1200)]
osd: support device-mapper (dmcrypt) osd's/journals

dmcrypt osd/journals make use of /dev/mapper devices, so this change
supports the device mapper naming for the device links. In addition,
all disks (osd/jrnl metrics) have additional metrics, "osd_type" and
"encrypted", to help understand the status of the OSDs within the cluster.

8 years agocommon: Add osd type and encrypted state to disk objects
Paul Cuzner [Fri, 28 Jul 2017 02:23:24 +0000 (14:23 +1200)]
common: Add osd type and encrypted state to disk objects

osd_type is defined as; 0 - filestore, 1 - bluestore
encrypted is defined as; 0 - off, 1 - on
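
The numeric encoding above could be expressed as below. Only the 0/1 values
come from the commit message; the dictionary names and the disk_flags helper
are hypothetical.

```python
# Numeric encodings taken from the commit message.
OSD_TYPE = {"filestore": 0, "bluestore": 1}
ENCRYPTED = {False: 0, True: 1}


def disk_flags(osd_type, encrypted):
    """Return the two metrics as numbers suitable for a time-series store."""
    return {
        "osd_type": OSD_TYPE[osd_type],
        "encrypted": ENCRYPTED[bool(encrypted)],
    }
```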

8 years agoMerge pull request #81 from zmc/wip-passwd
pcuzner [Thu, 27 Jul 2017 19:59:06 +0000 (07:59 +1200)]
Merge pull request #81 from zmc/wip-passwd

ansible: Support non-default Grafana password

8 years agoMerge pull request #79 from ceph/wip-paulc
Zack Cerza [Mon, 24 Jul 2017 23:33:35 +0000 (16:33 -0700)]
Merge pull request #79 from ceph/wip-paulc

Latest updates covering feedback and RFE's

8 years agoUpdate SELinux policy 79/head
Zack Cerza [Mon, 24 Jul 2017 23:09:02 +0000 (16:09 -0700)]
Update SELinux policy

The collectors need to be able to determine whether an OSD uses
filestore or bluestore

Signed-off-by: Zack Cerza <zack@redhat.com>
8 years agoRestore SELinux context of OSD journals
Zack Cerza [Mon, 24 Jul 2017 23:07:58 +0000 (16:07 -0700)]
Restore SELinux context of OSD journals

So that our SELinux policy can properly allow collectors to detect
whether an OSD uses filestore or bluestore

Signed-off-by: Zack Cerza <zack@redhat.com>
8 years agoalert-status dashboard : Enable default alerts
Paul Cuzner [Mon, 24 Jul 2017 02:13:09 +0000 (14:13 +1200)]
alert-status dashboard : Enable default alerts

dashUpdater has been updated to automatically set up a cephmetrics
notifications channel (if it's not already there), and the alert-status
dashboard is loaded, which references the cephmetrics channel.

The ansible templates have been updated to reflect the introduction of the
alert-status dashboard

8 years agoosd: fix determination of osd type
Paul Cuzner [Fri, 21 Jul 2017 22:25:17 +0000 (10:25 +1200)]
osd: fix determination of osd type

The presence of the type file was being relied upon across versions.
However, not all versions provide this file (10.2.2 did, 10.2.7 didn't!), so
this fix looks for the type file and uses it if it's there. If not, it
looks for the presence of the journal link to determine whether the osd
is filestore. It is assumed that bluestore will 'always' use the type
file.
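
The fallback order described above might look like the sketch below. The
function name, the osd_path argument and the "unknown" return value are
assumptions; only the type-file-then-journal-link order comes from the commit
message.

```python
import os


def get_osd_type(osd_path):
    """Guess the object store type of the OSD data directory at osd_path."""
    type_file = os.path.join(osd_path, "type")
    if os.path.exists(type_file):
        # Preferred source: the type file, when this version provides it.
        with open(type_file) as f:
            return f.read().strip()
    # Fallback: a journal entry implies filestore. lexists() also accepts a
    # symlink whose target is currently missing.
    if os.path.lexists(os.path.join(osd_path, "journal")):
        return "filestore"
    # Neither marker found; this value is an assumption of the sketch.
    return "unknown"
```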

8 years agoOptionally use a different Grafana admin password 81/head
Zack Cerza [Thu, 20 Jul 2017 23:21:50 +0000 (16:21 -0700)]
Optionally use a different Grafana admin password

Signed-off-by: Zack Cerza <zack@redhat.com>
8 years agoosd-node-detail: fix to templating which caused charts to show no data
Paul Cuzner [Fri, 21 Jul 2017 21:43:39 +0000 (09:43 +1200)]
osd-node-detail: fix to templating which caused charts to show no data

templating had a reference to a test server hard coded - resulting in
failed queries. This fix replaces this with $domain

8 years agodashboard relationships updated to show the alert-status dashboard
Paul Cuzner [Fri, 21 Jul 2017 03:20:36 +0000 (15:20 +1200)]
dashboard relationships updated to show the alert-status dashboard

8 years agoceph-rgw-workload: updates to metric calculations and spark line colors
Paul Cuzner [Fri, 21 Jul 2017 02:57:49 +0000 (14:57 +1200)]
ceph-rgw-workload: updates to metric calculations and spark line colors

Spark lines now blue matching the at-a-glance view for consistency

8 years agoceph-rados: added the cluster flags
Paul Cuzner [Fri, 21 Jul 2017 02:57:04 +0000 (14:57 +1200)]
ceph-rados: added the cluster flags

Cluster flags/features are shown as singlestat panels alongside the monitor
state. The flag states are 0 - enabled/inactive, 1 - active and 2 - disabled.

Thresholds are used on these panels to indicate the above states.

8 years agoceph-frontend: added recovery by pool - since this is the pool dashboard!
Paul Cuzner [Fri, 21 Jul 2017 02:54:56 +0000 (14:54 +1200)]
ceph-frontend: added recovery by pool - since this is the pool dashboard!

8 years agoceph-backend: fixed bug in osd-down table query
Paul Cuzner [Fri, 21 Jul 2017 02:52:58 +0000 (14:52 +1200)]
ceph-backend: fixed bug in osd-down table query

Query was showing osd's that have been removed from the system

8 years agoAllow partial defaults overrides
Zack Cerza [Thu, 20 Jul 2017 23:18:58 +0000 (16:18 -0700)]
Allow partial defaults overrides

With this change, it becomes possible to override certain keys in each configuration dict while accepting the default values for other keys.
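
The idea of a partial override can be illustrated in Python as below. This is
only a sketch of the merge behaviour; the commit most likely implements it in
the ansible roles, and the DEFAULTS keys and the merge_defaults helper are
hypothetical.

```python
import copy

# Hypothetical defaults; the real keys live in the ansible roles.
DEFAULTS = {"grafana": {"admin_user": "admin", "admin_password": "admin"}}


def merge_defaults(defaults, overrides):
    """Overlay the supplied keys on each configuration dict, keeping the rest."""
    merged = copy.deepcopy(defaults)
    for section, values in overrides.items():
        # Only the keys present in the override replace the defaults.
        merged.setdefault(section, {}).update(values)
    return merged
```

Overriding only admin_password in this sketch leaves admin_user at its
default value.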

Signed-off-by: Zack Cerza <zack@redhat.com>
8 years agoMove Grafana configuration defaults
Zack Cerza [Thu, 20 Jul 2017 20:42:07 +0000 (13:42 -0700)]
Move Grafana configuration defaults

Signed-off-by: Zack Cerza <zack@redhat.com>
8 years agoMerge pull request #71 from ceph/wip-whisper
Zack Cerza [Thu, 20 Jul 2017 21:59:35 +0000 (14:59 -0700)]
Merge pull request #71 from ceph/wip-whisper

Make whisper retention settings configurable

8 years agoDocument whisper settings 71/head
Zack Cerza [Wed, 19 Jul 2017 20:31:37 +0000 (13:31 -0700)]
Document whisper settings

Signed-off-by: Zack Cerza <zack@redhat.com>
8 years agoMake whisper retention settings configurable
Zack Cerza [Thu, 13 Jul 2017 18:21:09 +0000 (12:21 -0600)]
Make whisper retention settings configurable

The first value must be '10s', still, for consistency with collectd
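
A check for this constraint could be sketched as below, assuming the
retentions are written in the carbon "precision:duration" style, separated by
commas. The helper name is hypothetical.

```python
def first_precision_is_10s(retentions):
    """Return True when the first archive in a retention string uses 10s."""
    # Assumed format: "10s:7d,1m:30d" (precision:duration, comma separated).
    first_archive = retentions.split(",")[0].strip()
    precision = first_archive.split(":")[0]
    return precision == "10s"
```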

Signed-off-by: Zack Cerza <zack@redhat.com>
8 years agoMerge pull request #69 from ceph/wip-ubuntu
pcuzner [Thu, 13 Jul 2017 22:11:34 +0000 (10:11 +1200)]
Merge pull request #69 from ceph/wip-ubuntu

ansible: devel_mode deployment for Ubuntu

8 years agoMention Ubuntu support in ansible/README.md 69/head
Zack Cerza [Thu, 13 Jul 2017 16:46:40 +0000 (10:46 -0600)]
Mention Ubuntu support in ansible/README.md

Signed-off-by: Zack Cerza <zack@redhat.com>
8 years agoReload systemd later in the process
Zack Cerza [Wed, 12 Jul 2017 18:30:09 +0000 (12:30 -0600)]
Reload systemd later in the process

Since we're shipping a .service file for graphite-api

Signed-off-by: Zack Cerza <zack@redhat.com>
8 years agoceph-grafana: Use graphite-api for Ubuntu
Zack Cerza [Wed, 12 Jul 2017 17:33:34 +0000 (11:33 -0600)]
ceph-grafana: Use graphite-api for Ubuntu

They ship it already, and it's a bit less involved to set up.

Signed-off-by: Zack Cerza <zack@redhat.com>
8 years agoceph-grafana: Initial Ubuntu support
Zack Cerza [Wed, 12 Jul 2017 17:35:25 +0000 (11:35 -0600)]
ceph-grafana: Initial Ubuntu support

This commit lets us grab upstream Grafana packages in devel_mode

Signed-off-by: Zack Cerza <zack@redhat.com>
8 years agoceph-collectd: Support Ubuntu
Zack Cerza [Wed, 12 Jul 2017 16:14:50 +0000 (10:14 -0600)]
ceph-collectd: Support Ubuntu

Signed-off-by: Zack Cerza <zack@redhat.com>
8 years agoMerge pull request #64 from ceph/wip-fixes
Zack Cerza [Tue, 11 Jul 2017 23:48:48 +0000 (17:48 -0600)]
Merge pull request #64 from ceph/wip-fixes

dashUpdater now removes $domain references if domain is not provided

8 years agoscreenshot updated to show current state @ 2017-07-12 (used in wiki) 64/head
Paul Cuzner [Tue, 11 Jul 2017 23:32:34 +0000 (11:32 +1200)]
screenshot updated to show current state @ 2017-07-12 (used in wiki)

8 years agodashUpdater: remove $domain from dashboards, if domain is not configured
Paul Cuzner [Fri, 7 Jul 2017 04:01:50 +0000 (16:01 +1200)]
dashUpdater: remove $domain from dashboards, if domain is not configured

For environments that don't use dns, collectd will not provide a FQDN
on the metric name. In these circumstances, the dashboards are empty.

This fix looks for the domain setting, and if it's not supplied the
$domain reference in all queries is removed before the dashboard is loaded
into grafana.
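
A simplified version of this step is sketched below. It assumes the reference
appears in queries as ".$domain" directly after the host part, which the
commit message does not spell out; strip_domain is a hypothetical helper, not
the dashUpdater function.

```python
import json


def strip_domain(dashboard, domain=None):
    """Return the dashboard with '.$domain' removed when no domain is set."""
    if domain:
        # A domain is configured, so the queries can stay as they are.
        return dashboard
    # Work on the serialized form so that every query in every panel is
    # covered, whatever the nesting of the dashboard definition.
    text = json.dumps(dashboard)
    return json.loads(text.replace(".$domain", ""))
```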

8 years agoMerge pull request #58 from ceph/wip-paulc
Zack Cerza [Tue, 11 Jul 2017 22:42:20 +0000 (16:42 -0600)]
Merge pull request #58 from ceph/wip-paulc

Dashboard improvements and addition of OSD latencies

8 years agocommon/osd: fixes to support intelcas and nvme OSD/journals 58/head
Paul Cuzner [Tue, 11 Jul 2017 01:17:52 +0000 (13:17 +1200)]
common/osd: fixes to support intelcas and nvme OSD/journals

8 years agodashboard added to ansible template
Paul Cuzner [Fri, 7 Jul 2017 01:14:58 +0000 (13:14 +1200)]
dashboard added to ansible template

Add new ceph-osd-latency dashboard to template file

8 years agoosd: remove unused import (flatten_dict)
Paul Cuzner [Fri, 7 Jul 2017 00:16:32 +0000 (12:16 +1200)]
osd: remove unused import (flatten_dict)

8 years agoosd: add support for osd related stats, and support journal devices
Paul Cuzner [Thu, 6 Jul 2017 23:31:48 +0000 (11:31 +1200)]
osd: add support for osd related stats, and support journal devices

OSD daemons are now asked for perf data, so latencies within ceph can be
loaded to graphite. In addition the journal device is detected. If it's
not collocated on the osd device, additional disk metrics under a journal
subtree are created within graphite

8 years agocommon: changes to the Disk class
Paul Cuzner [Thu, 6 Jul 2017 23:29:06 +0000 (11:29 +1200)]
common: changes to the Disk class

Two main things:
1. Disk instances are now initialized here, instead of by the caller,
simplifying the code in the osd class
2. get_real_dev function added to convert a device name of an OSD to the
name we'll use as a metric. This now provides initial support for nvme
and intelcas based OSDs

8 years agobase: _admin_socket function updated to allow easier reuse of the base class
Paul Cuzner [Thu, 6 Jul 2017 23:26:00 +0000 (11:26 +1200)]
base: _admin_socket function updated to allow easier reuse of the base class

8 years agomon: added debug messages to aid in diagnostics
Paul Cuzner [Thu, 6 Jul 2017 23:25:23 +0000 (11:25 +1200)]
mon: added debug messages to aid in diagnostics

8 years agoosd-node-detail: network chart updates for device names and visualisation
Paul Cuzner [Thu, 6 Jul 2017 23:23:07 +0000 (11:23 +1200)]
osd-node-detail: network chart updates for device names and visualisation

Device names are whitelisted as en, eth and bond, and the rates are now
stacked on the chart so you can see total throughput more easily

8 years agodashboard.yml : new dashboard entry added
Paul Cuzner [Thu, 6 Jul 2017 23:21:46 +0000 (11:21 +1200)]
dashboard.yml : new dashboard entry added

The entry is needed to ensure it gets updated by the dashUpdater.py
script

8 years agodoc update : dashboard relationships diagram updated
Paul Cuzner [Thu, 6 Jul 2017 23:20:38 +0000 (11:20 +1200)]
doc update : dashboard relationships diagram updated

new dashboard added, so the diagram now reflects where it fits in the
flow

8 years agoMerge pull request #62 from ceph/wip-docs
Zack Cerza [Tue, 11 Jul 2017 18:45:43 +0000 (12:45 -0600)]
Merge pull request #62 from ceph/wip-docs

Install enhancements and docs improvements

8 years agoEnsure subscription-manager repos are enabled 62/head
Zack Cerza [Mon, 10 Jul 2017 20:27:59 +0000 (14:27 -0600)]
Ensure subscription-manager repos are enabled

We need these for dependencies of python-carbon and ceph-ansible

Signed-off-by: Zack Cerza <zack@redhat.com>
8 years agoInstall cephmetrics repo in production mode
Zack Cerza [Mon, 10 Jul 2017 19:54:34 +0000 (13:54 -0600)]
Install cephmetrics repo in production mode

Signed-off-by: Zack Cerza <zack@redhat.com>
8 years agoPrint dashboard URL at playbook end
Zack Cerza [Fri, 7 Jul 2017 19:36:07 +0000 (13:36 -0600)]
Print dashboard URL at playbook end

Signed-off-by: Zack Cerza <zack@redhat.com>
8 years agoRecommend running -playbook on ceph-ansible host
Zack Cerza [Fri, 7 Jul 2017 19:06:22 +0000 (13:06 -0600)]
Recommend running -playbook on ceph-ansible host

Signed-off-by: Zack Cerza <zack@redhat.com>
8 years agoCorrect URL in spec file
Zack Cerza [Fri, 7 Jul 2017 18:33:53 +0000 (12:33 -0600)]
Correct URL in spec file

Signed-off-by: Zack Cerza <zack@redhat.com>
8 years agoInstall repo before package
Zack Cerza [Fri, 7 Jul 2017 15:57:47 +0000 (09:57 -0600)]
Install repo before package

Signed-off-by: Zack Cerza <zack@redhat.com>
8 years agoMerge pull request #57 from ceph/wip-docs
pcuzner [Thu, 6 Jul 2017 23:10:33 +0000 (11:10 +1200)]
Merge pull request #57 from ceph/wip-docs

Mention enabling rhel-7-server-optional-rpms

8 years agoMention enabling rhel-7-server-optional-rpms 57/head
Zack Cerza [Thu, 6 Jul 2017 21:59:44 +0000 (15:59 -0600)]
Mention enabling rhel-7-server-optional-rpms

It's needed for pyserial, python-twisted-core and python-zope-interface,
which are dependencies of python-carbon

Signed-off-by: Zack Cerza <zack@redhat.com>
8 years agoMerge pull request #51 from ceph/wip-metrics-wusui
Zack Cerza [Fri, 30 Jun 2017 23:20:31 +0000 (17:20 -0600)]
Merge pull request #51 from ceph/wip-metrics-wusui

Non-developer installation instructions

8 years agoNon-developer installation instructions 51/head
Warren Usui [Thu, 29 Jun 2017 21:28:02 +0000 (17:28 -0400)]
Non-developer installation instructions

Signed-off-by: Warren Usui <wusui@magna002.ceph.redhat.com>
8 years agoMerge pull request #53 from ceph/wip-paulc
Zack Cerza [Fri, 30 Jun 2017 20:36:05 +0000 (14:36 -0600)]
Merge pull request #53 from ceph/wip-paulc

Dashboard fixes for at-a-glance and rados

8 years agoMerge pull request #54 from ceph/wip-branto
Zack Cerza [Fri, 30 Jun 2017 19:51:41 +0000 (13:51 -0600)]
Merge pull request #54 from ceph/wip-branto

rpm: Support light theme better

8 years agorpm: Support light theme better 54/head
Boris Ranto [Fri, 30 Jun 2017 08:10:00 +0000 (10:10 +0200)]
rpm: Support light theme better

Signed-off-by: Boris Ranto <branto@redhat.com>
8 years agoat-a-glance: fix for calcs on growth and forecast 53/head
Paul Cuzner [Fri, 30 Jun 2017 05:49:18 +0000 (17:49 +1200)]
at-a-glance: fix for calcs on growth and forecast

8 years agoat-a-glance: multiple fixes to mon/osd/growth and forecast panels
Paul Cuzner [Fri, 30 Jun 2017 02:05:33 +0000 (14:05 +1200)]
at-a-glance: multiple fixes to mon/osd/growth and forecast panels

- MON/OSD panel queries updated to address the interpolation
  problem where floats were shown. OSD panel also now shows
  total OSDs
- Templating update for the disk_full_threshold (2->80)
- Growth/Forecast panel queries updated to account for data coming
  from multiple mons
- Health Panel updated to show as RED when the cluster is in an
  ERROR state

8 years agoceph-rados : fixes to capacity and health history charts
Paul Cuzner [Fri, 30 Jun 2017 02:01:19 +0000 (14:01 +1200)]
ceph-rados : fixes to capacity and health history charts

- Capacity chart has been extended to cover 7 days
- health history fixed - was showing an entry for each mon!

8 years agoMerge pull request #48 from ceph/wip-paulc
Zack Cerza [Thu, 29 Jun 2017 21:50:12 +0000 (15:50 -0600)]
Merge pull request #48 from ceph/wip-paulc

dashboard updates

8 years agoMerge pull request #50 from ceph/wip-branto
Zack Cerza [Thu, 29 Jun 2017 20:54:08 +0000 (14:54 -0600)]
Merge pull request #50 from ceph/wip-branto

ansible: Fix a typo in purge.yml

8 years agoansible: Fix a typo in purge.yml 50/head
Boris Ranto [Thu, 29 Jun 2017 20:36:44 +0000 (22:36 +0200)]
ansible: Fix a typo in purge.yml

Signed-off-by: Boris Ranto <branto@redhat.com>
8 years agoMerge pull request #49 from ceph/wip-branto
Zack Cerza [Thu, 29 Jun 2017 16:38:11 +0000 (10:38 -0600)]
Merge pull request #49 from ceph/wip-branto

ansible: Implement purge playbook

8 years agoMerge pull request #47 from ceph/develop
Zack Cerza [Thu, 29 Jun 2017 16:26:38 +0000 (10:26 -0600)]
Merge pull request #47 from ceph/develop

Set WHISPER_AUTOFLUSH to True

8 years agoansible: Implement purge playbook 49/head
Boris Ranto [Thu, 29 Jun 2017 14:07:17 +0000 (16:07 +0200)]
ansible: Implement purge playbook

This should be a good basis for a purge playbook. It should support devel
as well as production modes.

Signed-off-by: Boris Ranto <branto@redhat.com>
8 years agoscreenshot changed for wiki 48/head
Paul Cuzner [Thu, 29 Jun 2017 05:00:47 +0000 (17:00 +1200)]
screenshot changed for wiki

8 years agoat-a-glance: pg status pie chart changes
Paul Cuzner [Thu, 29 Jun 2017 04:50:30 +0000 (16:50 +1200)]
at-a-glance: pg status pie chart changes

a degraded state is now shown based on the diff of pg_active and
pg_active_clean. This intermediate metric has been added to the pie
chart so it shows; active+clean, degraded and peering.
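
The intermediate metric is a simple subtraction, sketched below. The function
name is hypothetical, and the peering count is passed in as-is because the
commit message does not say how it is derived.

```python
def pg_pie_slices(pg_active, pg_active_clean, pg_peering):
    """Return the three pie chart slices described in the commit message."""
    # Degraded is the difference between active and active+clean PGs.
    degraded = pg_active - pg_active_clean
    return {
        "active+clean": pg_active_clean,
        "degraded": degraded,
        "peering": pg_peering,
    }
```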

8 years agodashUpdater : Set the default Org's theme to light
Paul Cuzner [Thu, 29 Jun 2017 03:26:01 +0000 (15:26 +1200)]
dashUpdater : Set the default Org's theme to light

Most Ceph UIs use a light theme, so this change aligns with that
trend.

8 years agonetwork-usage: dashboard updated to track enX interface stats
Paul Cuzner [Thu, 29 Jun 2017 03:15:58 +0000 (15:15 +1200)]
network-usage: dashboard updated to track enX interface stats

graphite doesn't support blacklisting in queries, so interface names that
we're interested in have to be whitelisted. This fix now tracks enX, ethX and
bondX interface names.
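
The whitelist idea can be shown outside graphite with a small filter. The
three prefixes come from the commit message; the regular expression and the
helper name are assumptions for illustration.

```python
import re

# Prefixes named in the commit message: enX, ethX and bondX.
IFACE_WHITELIST = re.compile(r"^(en|eth|bond)")


def tracked_interfaces(names):
    """Keep only the interface names that match the whitelist."""
    return [name for name in names if IFACE_WHITELIST.match(name)]
```

Anything outside the whitelist (loopback, bridges, virtual interfaces) is
dropped without having to be listed.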

8 years agoceph-rados: display fixes to several charts
Paul Cuzner [Thu, 29 Jun 2017 03:12:54 +0000 (15:12 +1200)]
ceph-rados: display fixes to several charts

Health history was a mess with the light theme. The chart now uses threshold lines
(amber and red), and plots health against those lines. In addition, there are small fixes
to the capacity chart (it was stacking values!) and the monitor status table

8 years agobackend-storage: cosmetic changes to heatmap and graphs
Paul Cuzner [Thu, 29 Jun 2017 03:09:01 +0000 (15:09 +1200)]
backend-storage: cosmetic changes to heatmap and graphs

Heatmap 'spectrum' was not showing well in the light theme - this
change makes it more readable. In addition, 'info' has been added to
explain the heatmap and what it represents.

8 years agofrontend: layout of the pools is now by pool lining up iops/throughput charts across...
Paul Cuzner [Thu, 29 Jun 2017 03:06:15 +0000 (15:06 +1200)]
frontend: layout of the pools is now by pool lining up iops/throughput charts across rows

8 years agoat-a-glance : query and cosmetic changes
Paul Cuzner [Thu, 29 Jun 2017 03:04:17 +0000 (15:04 +1200)]
at-a-glance : query and cosmetic changes

Multiple changes as follows;
- dashboard links (top) hover over was mis-aligned. fixed
- forecast value if negative now shows N/A
- forecast and growth queries updated
- disks near full set to 0, if there aren't any issues (instead of no value)
- added descriptions on various panels

8 years agostatus-panel: Update the bkgnd color to support the light theme
Paul Cuzner [Thu, 29 Jun 2017 02:53:43 +0000 (14:53 +1200)]
status-panel: Update the bkgnd color to support the light theme

By default the panel just uses 'green', which with the light theme is too
dark, making the text on the panel difficult to read. This ansible step just
updates the color in the css to make the text more readable

8 years agoSet WHISPER_AUTOFLUSH to True 47/head
Zack Cerza [Wed, 28 Jun 2017 22:33:56 +0000 (16:33 -0600)]
Set WHISPER_AUTOFLUSH to True

Related to https://github.com/ceph/cephmetrics/issues/45

Signed-off-by: Zack Cerza <zack@redhat.com>