]>
git.apps.os.sepia.ceph.com Git - ceph-cm-ansible.git/log
Zack Cerza [Mon, 1 Aug 2016 22:49:49 +0000 (16:49 -0600)]
Retry SSH connections five times
This may help with some of the problems we've been seeing.
Related to http://tracker.ceph.com/issues/16826
Inspired by
https://github.com/ansible/ansible/issues/13401#issuecomment-
216768025
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Tue, 26 Jul 2016 15:52:11 +0000 (09:52 -0600)]
Merge pull request #271 from ceph/wip-no-fw
testnode: Disable firewalld and iptables regardless of OS version
David Galloway [Tue, 26 Jul 2016 15:08:57 +0000 (11:08 -0400)]
testnode: Disable firewalld and iptables regardless of OS version
iptables was recently found installed and running on a RHEL7 system.
Previous testnode playbook runs wouldn't catch this since it shouldn't
be installed in the first place. This change ensures firewalld and
iptables are stopped on all RPM-based distros.
Fixes: http://tracker.ceph.com/issues/16809
Signed-off-by: David Galloway <dgallowa@redhat.com>
David Galloway [Mon, 25 Jul 2016 18:06:09 +0000 (14:06 -0400)]
Merge pull request #270 from ceph/wip-keys-retry
Add a retry when cloning the keys repo
Zack Cerza [Mon, 25 Jul 2016 17:52:30 +0000 (11:52 -0600)]
Add a retry when cloning the keys repo
Signed-off-by: Zack Cerza <zack@redhat.com>
David Galloway [Wed, 20 Jul 2016 22:34:10 +0000 (18:34 -0400)]
Merge pull request #269 from ceph/wip-cobbler-gitrace
Use flock to avoid race conditions with git
Zack Cerza [Wed, 20 Jul 2016 22:29:39 +0000 (16:29 -0600)]
Use flock to avoid race conditions with git
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Wed, 20 Jul 2016 15:24:22 +0000 (09:24 -0600)]
Merge pull request #265 from ceph/wip-nagios-common
Move NRPE setup to common role
vasukulkarni [Tue, 19 Jul 2016 16:40:30 +0000 (09:40 -0700)]
Merge pull request #267 from ceph/wip-no-gperftools-devel
testnode: drop gperftools-devel
Dan Mick [Mon, 18 Jul 2016 23:42:02 +0000 (16:42 -0700)]
Merge pull request #268 from ceph/wip-failure-log-debug
If a YAML error is hit, log the raw object
Reviewed-by: Dan Mick <dmick@redhat.com>
Zack Cerza [Mon, 18 Jul 2016 22:50:01 +0000 (16:50 -0600)]
If a YAML error is hit, log the raw object
We are seeing: "RepresenterError: cannot represent an object: pcp"
Signed-off-by: Zack Cerza <zack@redhat.com>
Ken Dreyer [Mon, 18 Jul 2016 17:32:57 +0000 (11:32 -0600)]
testnode: drop gperftools-devel
Stop installing the gperftools-devel package on testnodes, for the
following reasons:
1. Ceph does not require gperftools-devel at runtime, so installing it
unnecessarily just slows everything down at this point.
2. When we want to test newer builds of gperftools (say, from RHEL 7.3),
if the corresponding "gperftools-devel" package is not available on
the test node, then yum will fail the transaction.
(Additionally, gperftools-devel is no longer in EPEL 7, and it has moved to
RHEL 7 Optional in RHEL 7.2 (https://bugzilla.redhat.com/
1213879 ,
https://access.redhat.com/errata/RHEA-2015:2293). The "epel" list is no
longer the correct list for this package on redhat_7 and centos_7.)
Zack Cerza [Wed, 13 Jul 2016 18:46:36 +0000 (12:46 -0600)]
Merge pull request #266 from ceph/wip-timeout
use 120s instead of default 12s for cmd timeout
David Galloway [Wed, 13 Jul 2016 00:59:13 +0000 (20:59 -0400)]
common: Update nrpe SELinux policy
This should've been done when smart.sh replaced smart.pl.
I just didn't notice smart.sh was getting denied by SELinux until I
started monitoring disks in Octo. Evidently the new script requires
much more permission to run.
Signed-off-by: David Galloway <dgallowa@redhat.com>
David Galloway [Wed, 13 Jul 2016 00:11:50 +0000 (20:11 -0400)]
common: Create README
Signed-off-by: David Galloway <dgallowa@redhat.com>
David Galloway [Tue, 12 Jul 2016 20:58:31 +0000 (16:58 -0400)]
cobbler: Install some ansible dependencies during kickstart
Signed-off-by: David Galloway <dgallowa@redhat.com>
David Galloway [Tue, 12 Jul 2016 20:48:46 +0000 (16:48 -0400)]
common: Combine some nrpe-selinux package installations
Signed-off-by: David Galloway <dgallowa@redhat.com>
David Galloway [Tue, 12 Jul 2016 19:34:13 +0000 (15:34 -0400)]
common: Always install disk_monitoring scripts when nagios tag is called
Signed-off-by: David Galloway <dgallowa@redhat.com>
David Galloway [Tue, 12 Jul 2016 19:31:52 +0000 (15:31 -0400)]
common: Move smartmontools to disk_monitoring task
Signed-off-by: David Galloway <dgallowa@redhat.com>
David Galloway [Tue, 12 Jul 2016 19:09:40 +0000 (15:09 -0400)]
common: Enable nrpe installation without epel repo on RHEL/CentOS
Signed-off-by: David Galloway <dgallowa@redhat.com>
David Galloway [Mon, 11 Jul 2016 22:55:15 +0000 (18:55 -0400)]
common: Move nrpe package install to common role
Signed-off-by: David Galloway <dgallowa@redhat.com>
David Galloway [Mon, 11 Jul 2016 22:43:38 +0000 (18:43 -0400)]
Move NRPE setup to common role
Signed-off-by: David Galloway <dgallowa@redhat.com>
David Galloway [Mon, 11 Jul 2016 22:10:17 +0000 (18:10 -0400)]
testnode: Configure firewalld (when enabled) for NRPE
Signed-off-by: David Galloway <dgallowa@redhat.com>
vasukulkarni [Fri, 8 Jul 2016 00:40:48 +0000 (17:40 -0700)]
use 120s instead of default 12s for cmd timeout
Increase timeout from 12s to 120s for ansible cmds
Signed-off-by: Vasu Kulkarni vasu@redhat.com
David Galloway [Thu, 7 Jul 2016 19:33:29 +0000 (15:33 -0400)]
Merge pull request #264 from ceph/wip-16615
Don't use a shallow copy of keys.git
Zack Cerza [Thu, 7 Jul 2016 18:44:06 +0000 (12:44 -0600)]
Don't use a shallow copy of keys.git
http://tracker.ceph.com/issues/16615
Fixes: 16615
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Wed, 6 Jul 2016 15:53:10 +0000 (09:53 -0600)]
Merge pull request #263 from ceph/wip-cpan
testnodes: Install perl-CPAN on yum systems
David Galloway [Fri, 1 Jul 2016 23:04:57 +0000 (19:04 -0400)]
testnodes: Install perl-CPAN on yum systems
perl-CPAN is required to install Amazon::S3 using the 'cpan' command
Signed-off-by: David Galloway <dgallowa@redhat.com>
Zack Cerza [Fri, 1 Jul 2016 22:47:01 +0000 (16:47 -0600)]
Merge pull request #262 from ceph/wip-become
More Ansible v2 fixes
David Galloway [Thu, 30 Jun 2016 23:03:23 +0000 (19:03 -0400)]
Bracket bare variables for Ansible v2
Signed-off-by: David Galloway <dgallowa@redhat.com>
David Galloway [Thu, 30 Jun 2016 23:00:30 +0000 (19:00 -0400)]
Add missing 'become' directive to a few more roles
Signed-off-by: David Galloway <dgallowa@redhat.com>
Dan Mick [Thu, 30 Jun 2016 22:31:59 +0000 (15:31 -0700)]
Merge pull request #261 from ceph/wip-rclocal-msg
cobbler: Output message indicating Ansible is running after firstboot
Reviewed-by: Dan Mick <dmick@redhat.com>
David Galloway [Thu, 30 Jun 2016 22:12:33 +0000 (18:12 -0400)]
cobbler: Output message indicating Ansible is running after firstboot
Fixes: http://tracker.ceph.com/issues/14297
Signed-off-by: David Galloway <dgallowa@redhat.com>
vasukulkarni [Thu, 30 Jun 2016 20:22:50 +0000 (13:22 -0700)]
Merge pull request #260 from ceph/wip-downstream-fixes
downstream_setup role fixes
David Galloway [Thu, 30 Jun 2016 19:43:18 +0000 (15:43 -0400)]
downstream_setup: Ensure role is only run on RHEL or CentOS
Signed-off-by: David Galloway <dgallowa@redhat.com>
David Galloway [Thu, 30 Jun 2016 19:32:40 +0000 (15:32 -0400)]
downstream_setup: Add 'become' privilege escalation directive
Signed-off-by: David Galloway <dgallowa@redhat.com>
vasukulkarni [Thu, 30 Jun 2016 18:57:15 +0000 (11:57 -0700)]
Merge pull request #259 from ceph/wip-entitlements-bools
common: Fix boolean expressions
Zack Cerza [Thu, 30 Jun 2016 18:56:37 +0000 (12:56 -0600)]
Fix boolean expressions
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Thu, 30 Jun 2016 18:25:35 +0000 (12:25 -0600)]
Revert "Use ansible_user instead of ansible_ssh_user"
This reverts commit
7cc43cf253fc0dceea381ff11ed1d3e546f70ff8 .
David Galloway [Thu, 30 Jun 2016 18:21:01 +0000 (14:21 -0400)]
Merge pull request #258 from ceph/wip-pip
Add missing pip.yml
Zack Cerza [Thu, 30 Jun 2016 18:20:05 +0000 (12:20 -0600)]
Add missing pip.yml
Signed-off-by: Zack Cerza <zack@redhat.com>
David Galloway [Thu, 30 Jun 2016 18:02:25 +0000 (14:02 -0400)]
Merge pull request #255 from ceph/wip-ansible-2
DNM: Move to ansible 2
Dan Mick [Thu, 30 Jun 2016 00:47:53 +0000 (17:47 -0700)]
Merge pull request #257 from ceph/wip-disable-firewalld
testnodes: Disable firewalld service after reboot on RPM-based distros
Reviewed-by: Dan Mick <dmick@redhat.com>
David Galloway [Wed, 29 Jun 2016 18:12:29 +0000 (14:12 -0400)]
testnodes: Disable firewalld service after reboot on RPM-based distros
Fixes: http://tracker.ceph.com/issues/16455
Signed-off-by: David Galloway <dgallowa@redhat.com>
Zack Cerza [Tue, 28 Jun 2016 15:35:10 +0000 (09:35 -0600)]
Merge pull request #256 from ceph/wip-global-cpan
testnodes: Configure cpan and install Amazon::S3 on all OSes
Zack Cerza [Mon, 27 Jun 2016 18:46:53 +0000 (12:46 -0600)]
Disable become for repo cloning (2.x method)
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Mon, 27 Jun 2016 18:50:38 +0000 (12:50 -0600)]
Remove sudo role; replace with 'become' setting
The old method of setting the default behavior for become/sudo has
changed in 2.x; fortunately the new way is a bit more elegant
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Thu, 18 Feb 2016 17:48:26 +0000 (10:48 -0700)]
Quote variables used by with_items
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Thu, 18 Feb 2016 17:31:52 +0000 (10:31 -0700)]
cobbler: Install ansible from PyPI
The distros we're using don't ship ansible 2.
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Tue, 16 Feb 2016 22:28:46 +0000 (15:28 -0700)]
Locate templates properly
See https://github.com/ansible/ansible/issues/14161
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Tue, 16 Feb 2016 21:55:19 +0000 (14:55 -0700)]
Use ansible_user instead of ansible_ssh_user
The option was renamed in 2.0
Signed-off-by: Zack Cerza <zack@redhat.com>
Dan Mick [Mon, 27 Jun 2016 22:09:47 +0000 (15:09 -0700)]
Merge pull request #249 from ceph/wip-gw-networking
Network configuration for gateway role
Reviewed-by: Dan Mick <dmick@redhat.com>
David Galloway [Fri, 17 Jun 2016 02:27:01 +0000 (22:27 -0400)]
gateway: Add logrotate functionality
Signed-off-by: David Galloway <dgallowa@redhat.com>
David Galloway [Fri, 17 Jun 2016 01:58:33 +0000 (21:58 -0400)]
gateway: Add README documentation for networking, firewall, fail2ban
Signed-off-by: David Galloway <dgallowa@redhat.com>
David Galloway [Fri, 17 Jun 2016 00:00:18 +0000 (20:00 -0400)]
gateway: Add fail2ban support
Signed-off-by: David Galloway <dgallowa@redhat.com>
David Galloway [Fri, 29 Apr 2016 20:52:05 +0000 (16:52 -0400)]
gateway: Configure firewalld
Signed-off-by: David Galloway <dgallowa@redhat.com>
David Galloway [Fri, 22 Apr 2016 22:24:16 +0000 (18:24 -0400)]
gateway: Enable server network config
Signed-off-by: David Galloway <dgallowa@redhat.com>
Dan Mick [Mon, 27 Jun 2016 20:32:03 +0000 (13:32 -0700)]
Merge pull request #252 from ceph/wip-cblr-wget
cobbler: testnode post-install rc.local fixes
Reviewed-by: Dan Mick <dmick@redhat.com>
Reviewed-by: Zack Cerza <zack@redhat.com>
David Galloway [Mon, 27 Jun 2016 18:07:07 +0000 (14:07 -0400)]
testnodes: Configure cpan and install Amazon::S3 on all OSes
Fixes: http://tracker.ceph.com/issues/15316
Signed-off-by: David Galloway <dgallowa@redhat.com>
Dan Mick [Fri, 24 Jun 2016 00:24:32 +0000 (17:24 -0700)]
Merge pull request #254 from ceph/wip-clone-once
Only clone the keys repo once per execution
Reviewed-by: Dan Mick <dmick@redhat.com>
Zack Cerza [Fri, 24 Jun 2016 00:22:23 +0000 (18:22 -0600)]
Only clone the keys repo once per execution
Signed-off-by: Zack Cerza <zack@redhat.com>
Dan Mick [Thu, 23 Jun 2016 23:48:22 +0000 (16:48 -0700)]
Merge pull request #253 from ceph/wip-fast-keys
users: Greatly speed up ssh pubkey deployment
Reviewed-by: Dan Mick <dmick@redhat.com>
Zack Cerza [Thu, 23 Jun 2016 23:26:02 +0000 (17:26 -0600)]
Put the keys repo in ~/.cache/src/ by default
To avoid multiple invocations by different users on the same host
stepping on each other.
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Thu, 23 Jun 2016 22:30:33 +0000 (16:30 -0600)]
Remove unnecessary retry
Since each key isn't being fetched from a remote server any longer, we
can drop the retries.
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Thu, 23 Jun 2016 18:57:36 +0000 (12:57 -0600)]
Speed up key deployment by using the git repo
Instead of downloading each key over HTTPS from github.com, we can
simply clone the entire repo (with depth 1) and lookup each key using
the username.
On my laptop, execution time went from 2m49s to 29s.
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Thu, 23 Jun 2016 18:27:08 +0000 (12:27 -0600)]
users: Add defaults for keys_repo
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Thu, 23 Jun 2016 18:23:57 +0000 (12:23 -0600)]
users: split tasks/main.yml into separate files
Signed-off-by: Zack Cerza <zack@redhat.com>
David Galloway [Wed, 22 Jun 2016 21:58:06 +0000 (17:58 -0400)]
cobbler: testnode post-install rc.local fixes
- Instead of sleeping for 30 secs and hoping network is up, wait until we can
reliably ping the Cobbler host.
- Give ansible more time to run before wget's built-in 15min timeout is reached.
Signed-off-by: David Galloway <dgallowa@redhat.com>
Dan Mick [Tue, 21 Jun 2016 21:39:55 +0000 (14:39 -0700)]
Merge pull request #251 from ceph/wip-edeploy
users: Set UID_MIN to 1001
Reviewed-by: Dan Mick <dmick@redhat.com>
Zack Cerza [Tue, 21 Jun 2016 19:11:12 +0000 (13:11 -0600)]
users: Set UID_MIN to 1001
When building images with edeploy, UID 1000 was being taken by a
"normal" user; we need to reserve this for the teuthology user.
Signed-off-by: Zack Cerza <zack@redhat.com>
Dan Mick [Mon, 20 Jun 2016 18:09:23 +0000 (11:09 -0700)]
Merge pull request #250 from ceph/wip-no-lsb
Don't rely on LSB when testing for distro version
Reviewed-by: Dan Mick <dmick@redhat.com>
Zack Cerza [Mon, 20 Jun 2016 18:02:47 +0000 (12:02 -0600)]
Don't rely on LSB when testing for distro version
Signed-off-by: Zack Cerza <zack@redhat.com>
vasukulkarni [Fri, 17 Jun 2016 17:15:56 +0000 (10:15 -0700)]
Merge pull request #248 from ceph/wip-xenial
testnodes: Fix resolvconf issues on xenial
Zack Cerza [Thu, 16 Jun 2016 23:34:07 +0000 (17:34 -0600)]
Improve network interface discovery
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Thu, 16 Jun 2016 20:26:17 +0000 (14:26 -0600)]
Always bounce the main interface on xenial
Signed-off-by: Zack Cerza <zack@redhat.com>
David Galloway [Mon, 6 Jun 2016 20:06:06 +0000 (16:06 -0400)]
Merge pull request #247 from ceph/wip-16119
pcp: Don't fail when setting permissions
Zack Cerza [Mon, 6 Jun 2016 20:02:43 +0000 (14:02 -0600)]
pcp: Don't fail when setting permissions
http://tracker.ceph.com/issues/16119
Fixes: 16119
Signed-off-by: Zack Cerza <zack@redhat.com>
Dan Mick [Sat, 4 Jun 2016 01:14:40 +0000 (18:14 -0700)]
Merge pull request #246 from dmick/master
smart.sh: default rc to 0 (saw false UNKNOWNs with non-raid)
Dan Mick [Sat, 4 Jun 2016 01:12:01 +0000 (18:12 -0700)]
smart.sh: default rc to 0 (saw false UNKNOWNs with non-raid)
Signed-off-by: Dan Mick <dan.mick@redhat.com>
Dan Mick [Fri, 3 Jun 2016 22:34:22 +0000 (15:34 -0700)]
Merge pull request #245 from ceph/wip-newsmart
testnode: Replace old smart.pl with rewritten smart.sh script
Reviewed-by: Dan Mick <dmick@redhat.com>
David Galloway [Tue, 31 May 2016 22:44:47 +0000 (18:44 -0400)]
testnode: Replace old smart.pl with rewritten smart.sh script
Fixes: http://tracker.ceph.com/issues/14682
Signed-off-by: David Galloway <dgallowa@redhat.com>
Zack Cerza [Fri, 27 May 2016 16:05:16 +0000 (10:05 -0600)]
Merge pull request #244 from cooboos/centos
testnode: replace hardcoded 'gitbuilder.ceph.com' with {{ gitbuilder_host }}
wangsongbo [Wed, 25 May 2016 04:10:20 +0000 (12:10 +0800)]
testnode: replace hardcoded 'gitbuilder.ceph.com' with {{ gitbuilder_host }}
Signed-off-by: wangsongbo <wangsongbo@unitedstack.com>
Zack Cerza [Tue, 17 May 2016 21:35:41 +0000 (15:35 -0600)]
Merge pull request #243 from ceph/wip-sam-ansible
testnode/tasks/apt/packages.yml: just force everything
Dan Mick [Thu, 12 May 2016 04:12:45 +0000 (21:12 -0700)]
Merge pull request #241 from ceph/wip-pcp-settings
pcp: update upstream packages
Reviewed-by: Dan Mick <dmick@redhat.com>
Zack Cerza [Wed, 11 May 2016 19:51:04 +0000 (13:51 -0600)]
pcp: update upstream packages
PCP's upstream doesn't version its dependencies in all cases. This leads
to situations where, when setting upstream_repo to True, a system ends
up with some old and some new packages. This results in a potentially
broken installation.
This commit causes all installed pcp packages to be upgraded when
upstream_repo is True.
Signed-off-by: Zack Cerza <zack@redhat.com>
Samuel Just [Wed, 11 May 2016 17:12:26 +0000 (10:12 -0700)]
testnode/tasks/apt/packages.yml: just force everything
Signed-off-by: Samuel Just <sjust@redhat.com>
Dan Mick [Mon, 9 May 2016 16:48:13 +0000 (09:48 -0700)]
Merge pull request #239 from ceph/wip-pcp-settings
pcp: retention settings for managers
Zack Cerza [Thu, 5 May 2016 22:27:23 +0000 (16:27 -0600)]
Make pmlogmerge_reduce configurable
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Thu, 5 May 2016 22:24:39 +0000 (16:24 -0600)]
Make pmlogmerge_retain configurable
Signed-off-by: Zack Cerza <zack@redhat.com>
Dan Mick [Mon, 9 May 2016 15:34:24 +0000 (08:34 -0700)]
Merge pull request #240 from ceph/wip-nrpe-load-limits
testnode: Update nagios CPU load thresholds to be less strict
David Galloway [Mon, 9 May 2016 14:47:46 +0000 (10:47 -0400)]
testnode: Update nagios CPU load thresholds to be less strict
Signed-off-by: David Galloway <dgallowa@redhat.com>
Dan Mick [Wed, 4 May 2016 22:14:26 +0000 (15:14 -0700)]
Merge pull request #238 from ceph/wip-pcp-js
pcp: A dynamic dashboard with host- and time-selection
Reviewed-by: Dan Mick <dmick@redhat.com>
Zack Cerza [Wed, 4 May 2016 21:20:05 +0000 (15:20 -0600)]
Merge pull request #237 from ceph/wip-check4-selinux
testnode: Skip NRPE selinux setup if selinux disabled
Zack Cerza [Mon, 25 Apr 2016 16:39:59 +0000 (10:39 -0600)]
index.js
David Galloway [Tue, 3 May 2016 20:32:36 +0000 (16:32 -0400)]
testnode: Skip NRPE selinux setup if selinux disabled
Fixes: http://tracker.ceph.com/issues/15675
Signed-off-by: David Galloway <dgallowa@redhat.com>
Dan Mick [Thu, 28 Apr 2016 23:33:34 +0000 (16:33 -0700)]
Merge pull request #236 from ceph/wip-xenial-nicnames
cobbler: Prevent systemd from renaming NICs
Reviewed-by: Dan Mick <dmick@redhat.com>
David Galloway [Thu, 28 Apr 2016 23:02:21 +0000 (19:02 -0400)]
cobbler: Prevent systemd from renaming NICs
Signed-off-by: David Galloway <dgallowa@redhat.com>
Dan Mick [Thu, 28 Apr 2016 19:49:52 +0000 (12:49 -0700)]
Merge pull request #235 from ceph/wip-no-getty
testnode: Remove manual configuration of getty
Reviewed-by: Dan Mick <dmick@redhat.com>
David Galloway [Tue, 26 Apr 2016 20:49:28 +0000 (16:49 -0400)]
testnode: Remove manual configuration of getty
Manual configuration of the getty service is no longer required for
Trusty or later. The system-specific "console=ttySX" parameter in
/proc/cmdline is enough to configure the SOL tty during Ubuntu
installation.
See http://tracker.ceph.com/issues/15269 for further explanation.
Signed-off-by: David Galloway <dgallowa@redhat.com>