]> git.apps.os.sepia.ceph.com Git - ceph-ansible.git/log
ceph-ansible.git
7 years agoupgrade: skip luminous tasks for jewel minor update v3.0.20
Guillaume Abrioux [Thu, 25 Jan 2018 15:57:45 +0000 (16:57 +0100)]
upgrade: skip luminous tasks for jewel minor update

These tasks are needed only when upgrading to luminous.
They are not needed in Jewel minor upgrade and by the way, they fail because
`ceph versions` command doesn't exist.

Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1535382
(cherry picked from commit c7ec12d49ca3c3f936f4c7a34ef15c042ab0f699)
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
7 years agodefaults: avoid getting stuck (ceph --connect-timeout)
Guillaume Abrioux [Wed, 24 Jan 2018 17:49:41 +0000 (18:49 +0100)]
defaults: avoid getting stuck (ceph --connect-timeout)

Sometime the playbook gets stuck because even with `--connect-timeout=`
option, the connexion to the existing ceph cluster never timeout.

As a workaround, using `timeout` command provided by coreutils will
actually timeout if we can't connect to the cluster.

Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1537003
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
(cherry picked from commit ec16cbdb1af9069de09d4a2e2e88739c2c303350)

7 years agoansible: set ssh retry option to 5
Guillaume Abrioux [Tue, 23 Jan 2018 13:38:35 +0000 (14:38 +0100)]
ansible: set ssh retry option to 5

We noticed that sometime, ceph-ansible can fail with error :

`Failed to connect to the host via ssh:`

It can occurs after the task `restart firewalld` has been played.

Setting `retries` to 5 should prevent from unexcepted ssh failure.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
(cherry picked from commit 5bf564255626890973b7cc4e9622763471e561ea)

7 years agoosds: change default value for `dedicated_devices` v3.0.19
Guillaume Abrioux [Mon, 22 Jan 2018 13:28:15 +0000 (14:28 +0100)]
osds: change default value for `dedicated_devices`

This is to keep backward compatibility with stable-2.2 and satisfy the
check "verify dedicated devices have been provided" in
`check_mandatory_vars.yml`. This check is looking for
`dedicated_devices` so we need to default it's value to
`raw_journal_devices` when `raw_multi_journal` is set to `True`.

Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1536098
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
(cherry picked from commit 9306a1789c95e5abb77260dde4d9cc3df900959f)

7 years agoosd: fix a typo in roles/ceph-osd/defaults/main.yml
Guillaume Abrioux [Tue, 7 Nov 2017 08:48:29 +0000 (09:48 +0100)]
osd: fix a typo in roles/ceph-osd/defaults/main.yml

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
(cherry picked from commit 39b584e540570ef79af98c3c23fdce90f02a701c)

7 years agopurge-container: use lsblk to resolv parent device v3.0.18
Guillaume Abrioux [Wed, 17 Jan 2018 08:08:16 +0000 (09:08 +0100)]
purge-container: use lsblk to resolv parent device

Using `lsblk` to resolv the parent device is better than just removing the last
char when passing it to the zap container.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
(cherry picked from commit 55298fa80cf542c3d9c0275f085b89fb0e6d61f2)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agopurge-container: remove awk usage in favor of blkid
Guillaume Abrioux [Wed, 17 Jan 2018 08:06:43 +0000 (09:06 +0100)]
purge-container: remove awk usage in favor of blkid

Avoid using `awk` to get the different devices from the partlabel.
Using `blkid` is more readable.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
(cherry picked from commit 58eb045d2fac02337ed47ead1cab9b4cc484a092)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoosd: skip devices marked as '/dev/dead'
Sébastien Han [Tue, 19 Dec 2017 17:54:19 +0000 (18:54 +0100)]
osd: skip devices marked as '/dev/dead'

On a non-collocated scenario, if a drive is faulty we can't really
remove it from the list of 'devices' without messing up or having to
re-arrange the order of the 'dedicated_devices'. We want to keep this
device list ordered. This will prevent the activation failing on a
device that we know is failing but we can't remove it yet to not mess up
the dedicated_devices mapping with devices.

Signed-off-by: Sébastien Han <seb@redhat.com>
(cherry picked from commit 6db4aea453b6371345b2a1db96ab449b34870235)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agorolling update: add mgr exception for jewel minor updates
Sébastien Han [Wed, 17 Jan 2018 14:18:11 +0000 (15:18 +0100)]
rolling update: add mgr exception for jewel minor updates

When update from a minor Jewel version to another, the playbook will
fail on the task "fail if no mgr host is present in the inventory".
This now can be worked around by running Ansible with_items

-e jewel_minor_update=true

Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1535382
Signed-off-by: Sébastien Han <seb@redhat.com>
(cherry picked from commit 8af745947695ff7dc543754db802ec57c3238adf)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agorgw: disable legacy unit
Sébastien Han [Thu, 18 Jan 2018 09:06:34 +0000 (10:06 +0100)]
rgw: disable legacy unit

Some systems that were deployed with old tools can leave units named
"ceph-radosgw@radosgw.gateway.service". As a consequence, they will
prevent the new unit to start.

Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1509584
Signed-off-by: Sébastien Han <seb@redhat.com>
(cherry picked from commit f88795e8433f92ddc049d3e0d87e7757448e5005)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agontp: followup cleanup
Sébastien Han [Tue, 16 Jan 2018 17:24:32 +0000 (18:24 +0100)]
ntp: followup cleanup

Checking if ntp is present is not needed anymore, these tasks are not
used anymore. So let's remove them

Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agocommon/docker-common: always start ntp
Sébastien Han [Tue, 16 Jan 2018 16:43:54 +0000 (17:43 +0100)]
common/docker-common: always start ntp

There is no need to only start ntp only if the package was present. If
the package is not present, we install it AND eventually activate + run
the service.

The original fix is part of this commit:
https://github.com/ceph/ceph-ansible/commit/849786967ac4c6235e624243019f0b54bf3340a4
However, this is a feature addition so it cannot be backported. Hence
this commit.

Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoci: test on ansible 2.4.2 v3.0.17
Sébastien Han [Thu, 21 Dec 2017 18:57:01 +0000 (19:57 +0100)]
ci: test on ansible 2.4.2

Signed-off-by: Sébastien Han <seb@redhat.com>
(cherry picked from commit 7ba25b20dcb199f81666b34cae6c1b95c30b1033)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoRevert "tests: set CEPH_STABLE_RELEASE in ceph-build"
Guillaume Abrioux [Wed, 6 Dec 2017 14:18:42 +0000 (15:18 +0100)]
Revert "tests: set CEPH_STABLE_RELEASE in ceph-build"

This reverts commit 7a1d7d92ff4d6f38be9f11f4c26909b361b58f99.

(cherry picked from commit 73a20e9b50f9212f4e610ae021b23c8e010e9991)

7 years agohandlers: avoid duplicate handler
Guillaume Abrioux [Mon, 8 Jan 2018 09:00:25 +0000 (10:00 +0100)]
handlers: avoid duplicate handler

Having handlers in both ceph-defaults and ceph-docker-common roles can make the
playbook restarting two times services. Handlers can be triggered first
time because of a change in ceph.conf and a second time because a new
image has been pulled.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
(cherry picked from commit b29a42cba6a4059b2c0035572d570c0812f48d16)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agocontainer: trigger handlers on systemd file change
Guillaume Abrioux [Mon, 8 Jan 2018 14:00:32 +0000 (15:00 +0100)]
container: trigger handlers on systemd file change

When a systemd unit file is changed we should trigger handlers to
restart the services.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
(cherry picked from commit 70401f955b4ff9d6d922c113b833dbd8b8ce27a8)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agocontainer: restart container when there is a new image
Sébastien Han [Fri, 15 Dec 2017 18:43:23 +0000 (19:43 +0100)]
container: restart container when there is a new image

This wasn't any good choice to implement this.
We had several options and none of them were ideal since handlers can
not be triggered cross-roles.
We could have achieved that by doing:

* option 1 was to add a dependancy in the meta of the ceph-docker-common
role. We had that long ago and we decided to stop so everything is
managed via site.yml

* option 2 was to import files from another role. This is messy and we
don't that anywhere in the current code base. We will continue to do so.

There is option 3 where we pull the image from the ceph-config role.
This is not suitable as well since the docker command won't be available
unless you run Atomic distro. This would also mean that you're trying to
pull twice. First time in ceph-config, second time in ceph-docker-common

The only option I came up with was to duplicate a bit of the ceph-config
handlers code.

Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1526513
Signed-off-by: Sébastien Han <seb@redhat.com>
(cherry picked from commit 8a19a83354cd8a4f9a729b3864850ec69be6d5da)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoDocker image pull retry
Joe Talerico [Tue, 17 Oct 2017 19:09:03 +0000 (15:09 -0400)]
Docker image pull retry

This change sets a default timeout of 300s for the image pull. If the
image pull times out (300s), we will retry 3 times by default.

fixes 1954

(cherry picked from commit ab587642885f1f518fe14ee7f1c7fc8cbbbf29f0)

7 years agodefaults: rename check_socket files for containers
Guillaume Abrioux [Wed, 10 Jan 2018 08:08:01 +0000 (09:08 +0100)]
defaults: rename check_socket files for containers

When containerized deployment, we are not looking for a socket but for a
running container.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
(cherry picked from commit acfbebe67e06d64a72a855b0c4d5fd2ee8bce03a)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agocontainers: fix bug when looking for existing cluster
Guillaume Abrioux [Wed, 10 Jan 2018 09:18:27 +0000 (10:18 +0100)]
containers: fix bug when looking for existing cluster

When containerized deployment, `docker_exec_cmd` is not set before the
task which try to retrieve the current fsid is played, it means it
considers there is no existing fsid and try to generate a new one.

Typical error:

```
ok: [mon0 -> mon0] => {
    "changed": false,
    "cmd": [
        "ceph",
        "--connect-timeout",
        "3",
        "--cluster",
        "test",
        "fsid"
    ],
    "delta": "0:00:00.179909",
    "end": "2018-01-09 10:36:58.759846",
    "failed": false,
    "failed_when_result": false,
    "rc": 1,
    "start": "2018-01-09 10:36:58.579937"
}

STDERR:

Error initializing cluster client: Error('error calling conf_read_file: errno EINVAL',)
```

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
(cherry picked from commit 900f447c82c722539c6eed74c98bf1988a001b3d)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agocontainer: change the way we force no logs inside the container
Sébastien Han [Tue, 9 Jan 2018 13:34:09 +0000 (14:34 +0100)]
container: change the way we force no logs inside the container

Previously we were using ceph_conf_overrides however this doesn't play
nice for softwares like TripleO that uses ceph_conf_overrides inside its
own code. For now, and since this is the only occurence of this, we can
ensure no logs through the ceph conf template.

Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1532619
Signed-off-by: Sébastien Han <seb@redhat.com>
(cherry picked from commit c2e04623a54007674ec60647a9e5ddd2da4f991b)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agomon: use crush rules for non-container too
Sébastien Han [Tue, 9 Jan 2018 12:54:50 +0000 (13:54 +0100)]
mon: use crush rules for non-container too

There is no reasons why we can't use crush rules when deploying
containers. So moving the inlcude in the main.yml so it can be called.

Signed-off-by: Sébastien Han <seb@redhat.com>
(cherry picked from commit f0787e64da45fdbefb2ff1376a0705fadf6a502d)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agotest: set UPDATE_CEPH_DOCKER_IMAGE_TAG for jewel tests
Andrew Schoen [Fri, 5 Jan 2018 19:47:10 +0000 (13:47 -0600)]
test: set UPDATE_CEPH_DOCKER_IMAGE_TAG for jewel tests

We want to be explict here and update to luminous and not
the 'latest' tag.

Signed-off-by: Andrew Schoen <aschoen@redhat.com>
(cherry picked from commit a8509fbc9c0328670224f608abea17d8e64257ab)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoswitch-to-containers: do not fail when stopping the nfs-ganesha service
Andrew Schoen [Fri, 5 Jan 2018 18:42:16 +0000 (12:42 -0600)]
switch-to-containers: do not fail when stopping the nfs-ganesha service

If we're working with a jewel cluster then this service will not exist.

This is mainly a problem with CI testing because our tests are setup to
work with both jewel and luminous, meaning that eventhough we want to
test jewel we still have a nfs-ganesha host in the test causing these
tasks to run.

Signed-off-by: Andrew Schoen <aschoen@redhat.com>
(cherry picked from commit b613321c210155f390d4ddb7dcda8dc685a6e9ea)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoswitch-to-containers: do not fail when stopping the ceph-mgr daemon
Andrew Schoen [Fri, 5 Jan 2018 18:37:36 +0000 (12:37 -0600)]
switch-to-containers: do not fail when stopping the ceph-mgr daemon

If we are working with a jewel cluster ceph mgr does not exist
and this makes the playbook fail.

Signed-off-by: Andrew Schoen <aschoen@redhat.com>
(cherry picked from commit 0b4b60e3c9cabbbda2883feb40a6f80763c66b50)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agorolling_update: do not fail the playbook if nfs-ganesha is not present
Andrew Schoen [Fri, 5 Jan 2018 16:06:53 +0000 (10:06 -0600)]
rolling_update: do not fail the playbook if nfs-ganesha is not present

The rolling update playbook was attempting to stop the
nfs-ganesha service on nodes where jewel is still installed.
The nfs-ganesha service did not exist in jewel so the task fails.

Signed-off-by: Andrew Schoen <aschoen@redhat.com>
(cherry picked from commit 997edea271b713b29f896ebb87dc6df29a60488b)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agopurge-cluster: clean some code v3.0.16
Guillaume Abrioux [Wed, 13 Dec 2017 14:23:47 +0000 (15:23 +0100)]
purge-cluster: clean some code

Avoid using regexp to match device

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
(cherry picked from commit c5b7b37105e0933f2f2c69441854e889fe932399)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoosd: fix check gpt
Guillaume Abrioux [Tue, 19 Dec 2017 09:55:02 +0000 (10:55 +0100)]
osd: fix check gpt

the gpt label creation doesn't work even with parted module.
This commit fixes the gpt label creation by using parted command
instead.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
(cherry picked from commit 895949d6c463c227da3dd7250c2ae228ee269872)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agopurge-cluster: wipe disk using dd
Guillaume Abrioux [Wed, 13 Dec 2017 14:24:33 +0000 (15:24 +0100)]
purge-cluster: wipe disk using dd

`bluestore_purge_osd_non_container` scenario is failing because it
keeps old osd_uuid information on devices and cause the `ceph-disk activate`
to fail when trying to redeploy a new cluster after a purge.

typical error seen :

```
2017-12-13 14:29:48.021288 7f6620651d00 -1
bluestore(/var/lib/ceph/tmp/mnt.2_3gh6/block) _check_or_set_bdev_label
bdev /var/lib/ceph/tmp/mnt.2_3gh6/block fsid
770080e2-20db-450f-bc17-81b55f167982 does not match our fsid
f33efff0-2f07-4203-ad8d-8a0844d6bda0
```

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
(cherry picked from commit eeedefdf0207f04e67af490e03d895324ab609a1)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agomon: always run ceph-create-keys
Sébastien Han [Wed, 20 Dec 2017 14:29:02 +0000 (15:29 +0100)]
mon: always run ceph-create-keys

ceph-create-keys is idempotent so it's not an issue to run it each time
we play ansible. This also fix issues where the 'creates' arg skips the
task and no keys get generated on newer version, e.g during an upgrade.

Closes: https://github.com/ceph/ceph-ansible/issues/2228
Signed-off-by: Sébastien Han <seb@redhat.com>
(cherry picked from commit 0b55abe3d0fc6db6c93d963545781c05a31503bb)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agorgw: disable legacy rgw service unit
Sébastien Han [Thu, 21 Dec 2017 09:19:22 +0000 (10:19 +0100)]
rgw: disable legacy rgw service unit

When upgrading from OSP11 to OSP12 container, ceph-ansible attempts to
disable the RGW service provided by the overcloud image. The task
attempts to stop/disable ceph-rgw@{{ ansible-hostname }} and
ceph-radosgw@{{ ansible-hostname }}.service. The actual service name is
ceph-radosgw@radosgw.$name

Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1525209
Signed-off-by: Sébastien Han <seb@redhat.com>
(cherry picked from commit ad54e19262f3d523ad57ee39e64d6927b0c21dea)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agofix jewel scenarios on container
Sébastien Han [Wed, 20 Dec 2017 12:39:33 +0000 (13:39 +0100)]
fix jewel scenarios on container

When deploying Jewel from master we still need to enable this code since
the container image has such check. This check still exists because
ceph-disk is not able to create a GPT label on a drive that does not
have one.

Signed-off-by: Sébastien Han <seb@redhat.com>
(cherry picked from commit 39f2bfd5d58bae3fef2dd4fca0b2bab2e67ba21f)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agosite-docker: ability to disable fact sharing
Sébastien Han [Tue, 19 Dec 2017 14:10:05 +0000 (15:10 +0100)]
site-docker: ability to disable fact sharing

When deploying with Ansible at large scale, the delegate_facts method
consumes a lot of memory on the host that is running Ansible. This can
cause various issues like memory exhaustion on that machine.
You can now run Ansible with "-e delegate_facts_host=False" to disable
the fact sharing.

Signed-off-by: Sébastien Han <seb@redhat.com>
(cherry picked from commit c315f81dfe440945aaa90265cd3294fdea549942)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agorolling_update: do not require root to answer question
Sébastien Han [Fri, 15 Dec 2017 16:39:32 +0000 (17:39 +0100)]
rolling_update: do not require root to answer question

There is no need to ask for root on the local action. This will prompt
for a password the current user is not part of sudoers. That's
  unnecessary anyways.

Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1516947
Signed-off-by: Sébastien Han <seb@redhat.com>
(cherry picked from commit 200785832f3b56dd8c5766ec0b503c5d77b4a984)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoosd: best effort if no device is found during activation
Sébastien Han [Mon, 18 Dec 2017 15:43:37 +0000 (16:43 +0100)]
osd: best effort if no device is found during activation

We have a scenario when we switch from non-container to containers. This
means we don't know anything about the ceph partitions associated to an
OSD. Normally in a containerized context we have files containing the
preparation sequence. From these files we can get the capabilities of
each OSD. As a last resort we use a ceph-disk call inside a dummy bash
container to discover the ceph journal on the current osd.

Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1525612
Signed-off-by: Sébastien Han <seb@redhat.com>
(cherry picked from commit bbc79765f3e8b93b707b0f25f94e975c1bd85c66)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agonfs: fix package install for debian/suss systems
Sébastien Han [Tue, 19 Dec 2017 10:17:04 +0000 (11:17 +0100)]
nfs: fix package install for debian/suss systems

This resolves the following error:
E: There were unauthenticated packages and -y was used without
--allow-unauthenticated

Signed-off-by: Sébastien Han <seb@redhat.com>
(cherry picked from commit dfbef8361d3ac03788aa1f93b23907bc9595a730)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoRename fact docker_version to ceph_docker_version
Christian Berendt [Tue, 12 Dec 2017 10:06:15 +0000 (11:06 +0100)]
Rename fact docker_version to ceph_docker_version

The name docker_version is very generic and is also used by other
roles. As a result, there may be name conflicts. To avoid this a
ceph_ prefix should be used for this fact. Since it is an internal
fact renaming is not a problem.

(cherry picked from commit 50a848dc408a35c02b934bfe1511cd8aaee259be)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agodefaults: fix CI issue with ceph_uid fact
Guillaume Abrioux [Mon, 11 Dec 2017 17:48:13 +0000 (18:48 +0100)]
defaults: fix CI issue with ceph_uid fact

The CI complains because of `ceph_uid` fact which doesn't exist since
the docker image tag used in the CI doesn't match with this condition.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
(cherry picked from commit 6a9b5c9632a39d290ebf707a21e98f17b064f198)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agocommon: move restapi template to config
Sébastien Han [Fri, 20 Oct 2017 09:14:13 +0000 (11:14 +0200)]
common: move restapi template to config

Closes: github.com/ceph/ceph-ansible/issues/1981
Signed-off-by: Sébastien Han <seb@redhat.com>
(cherry picked from commit ba5c6e66f03314d1b7263225e75f0f56c438db3b)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoroles: ceph-mgr: Install the ceph-mgr package on SUSE
Markos Chandras [Thu, 14 Dec 2017 18:13:09 +0000 (18:13 +0000)]
roles: ceph-mgr: Install the ceph-mgr package on SUSE

The ceph-mgr package name is identical to RedHat so add the SUSE family
to the existing task.

(cherry picked from commit 162b7d2b23b72adabdae32275962409e19ba4e0b)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoclient: don't make `osd_pool_default_pg_num` mandatory
Guillaume Abrioux [Tue, 12 Dec 2017 10:28:36 +0000 (11:28 +0100)]
client: don't make `osd_pool_default_pg_num` mandatory

making `osd_pool_default_pg_num` mandatory is a bit agressive and is
unrelated when you just want to create users keyrings.

Closes: #2241
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
(cherry picked from commit a24fd1cfd9a2f5a5daa9bee1f533cd2da0cc8fe2)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoclient: don't try to generate keys
Guillaume Abrioux [Tue, 12 Dec 2017 10:25:26 +0000 (11:25 +0100)]
client: don't try to generate keys

the entrypoint to generate users keyring is `ceph-authtool`, therefore,
it can expand the `$(ceph-authtool --gen-print-key)` inside the
container. Users must generate a keyring themselves.
This commit also adds a check to ensure keyring are properly filled when
`user_config: true`.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
(cherry picked from commit ab1dd3027a4b9932e58f28b86ab46979eb1f1682)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agodocker: add missing condition for selinux tasks
Guillaume Abrioux [Tue, 12 Dec 2017 13:55:02 +0000 (14:55 +0100)]
docker: add missing condition for selinux tasks

on `client` and `mds` roles, it tries to set selinux even on non rhel
based distributions.`

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
(cherry picked from commit 26afe46e1333df8bec554feb3f57ab8c60390655)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agodefault: look for the right return code on socket stat in-use
Sébastien Han [Thu, 14 Dec 2017 10:31:28 +0000 (11:31 +0100)]
default: look for the right return code on socket stat in-use

As reported in https://github.com/ceph/ceph-ansible/issues/2254, the
check with fuser is not ideal. If fuser is not available the return code
is 127. Here we want to make sure that we looking for the correct return
code, so 1.

Closes: https://github.com/ceph/ceph-ansible/issues/2254
Signed-off-by: Sébastien Han <seb@redhat.com>
(cherry picked from commit 7eaf444328c8c381c673883913cf71b8ebe9d064)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoMerge pull request #2243 from ceph/2226-bkp
Guillaume Abrioux [Mon, 11 Dec 2017 09:45:52 +0000 (10:45 +0100)]
Merge pull request #2243 from ceph/2226-bkp

[skip ci] backport of 2226

7 years agoMerge pull request #2237 from ceph/2211-bkp
Guillaume Abrioux [Mon, 11 Dec 2017 09:45:33 +0000 (10:45 +0100)]
Merge pull request #2237 from ceph/2211-bkp

Set tighter permissions on keyrings when containerized

7 years agoMerge pull request #2231 from ceph/doc_update-bkp
Guillaume Abrioux [Mon, 11 Dec 2017 09:45:12 +0000 (10:45 +0100)]
Merge pull request #2231 from ceph/doc_update-bkp

fix the ansible version for the stable-3.0 branch

7 years agoMerge pull request #2222 from ceph/2221-bkp
Guillaume Abrioux [Mon, 11 Dec 2017 09:44:43 +0000 (10:44 +0100)]
Merge pull request #2222 from ceph/2221-bkp

[skip ci] backport of 2221

7 years agoMerge pull request #2220 from squidboylan/bkp-2215
Guillaume Abrioux [Mon, 11 Dec 2017 09:44:09 +0000 (10:44 +0100)]
Merge pull request #2220 from squidboylan/bkp-2215

Backport of 2215

7 years agoMerge pull request #2218 from ceph/2202-bkp
Guillaume Abrioux [Mon, 11 Dec 2017 09:43:44 +0000 (10:43 +0100)]
Merge pull request #2218 from ceph/2202-bkp

[skip ci] backport of 2202

7 years agoMerge pull request #2217 from ceph/2214-bkp
Guillaume Abrioux [Mon, 11 Dec 2017 09:43:23 +0000 (10:43 +0100)]
Merge pull request #2217 from ceph/2214-bkp

[skip ci] backport of 2214

7 years agoMerge pull request #2216 from ceph/2212-bkp
Guillaume Abrioux [Mon, 11 Dec 2017 09:42:56 +0000 (10:42 +0100)]
Merge pull request #2216 from ceph/2212-bkp

Openstack: replaced hardcoded pool names with variables for openstack…

7 years agoMerge pull request #2203 from ceph/fix_purge_cluster-bkp
Guillaume Abrioux [Mon, 11 Dec 2017 09:42:31 +0000 (10:42 +0100)]
Merge pull request #2203 from ceph/fix_purge_cluster-bkp

purge-cluster: remove usage of `with_fileglob`

7 years agoMerge pull request #2099 from andymcc/stable-3.0
Guillaume Abrioux [Mon, 11 Dec 2017 09:42:10 +0000 (10:42 +0100)]
Merge pull request #2099 from andymcc/stable-3.0

Typo fix for radosgw@ systemd file

7 years agoUse parted module instead of command 2243/head
Andy McCrae [Thu, 30 Nov 2017 17:46:55 +0000 (17:46 +0000)]
Use parted module instead of command

(cherry picked from commit 4f1e854c79d0aa3635f35441f84e126a185f21fa)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoMerge pull request #2242 from icamys/fix2235
Guillaume Abrioux [Mon, 11 Dec 2017 09:02:26 +0000 (10:02 +0100)]
Merge pull request #2242 from icamys/fix2235

Vagrantfile: Fixed repeated OSD controller creation

7 years agoVagrantfile: Fixed repeated OSD controller creation 2242/head
Prisacari Dmitrii [Fri, 8 Dec 2017 17:09:50 +0000 (19:09 +0200)]
Vagrantfile: Fixed repeated OSD controller creation

7 years agoSet tighter permissions on keyrings when containerized 2237/head
John Fulton [Wed, 22 Nov 2017 21:38:30 +0000 (16:38 -0500)]
Set tighter permissions on keyrings when containerized

During a containerized deployment, set the permissions
of ceph.client.admin.keyring and other keyrings to
chmod 600 and chown it to ceph.

(cherry picked from commit ffae294288c3f4d1570094bc7cec2abd35fc67c3)

7 years agofix the ansible version for the stable-3.0 branch 2231/head
Yixing Yan [Mon, 6 Nov 2017 08:15:47 +0000 (16:15 +0800)]
fix the ansible version for the stable-3.0 branch

(cherry picked from commit 7d239b69082146305349666d6fd8125549b64feb)

7 years agopurge: fix bug on 'wait_for' task 2222/head
Guillaume Abrioux [Wed, 29 Nov 2017 10:10:56 +0000 (11:10 +0100)]
purge: fix bug on 'wait_for' task

this task hangs because `{{ inventory_hostname }}` doesn't resolv to an
actual ip address.
Using `hostvars[inventory_hostname]['ansible_default_ipv4']['address']`
should fix this because it will reach the node with its actual IP
address.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
(cherry picked from commit aaaf980140832de694ef0ffe3282dabbf0b90081)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoBackport of 2215 2220/head
Caleb Boylan [Tue, 28 Nov 2017 19:49:09 +0000 (11:49 -0800)]
Backport of 2215

7 years agoosd: remove leftover and fix a typo 2218/head
Guillaume Abrioux [Tue, 21 Nov 2017 10:11:34 +0000 (11:11 +0100)]
osd: remove leftover and fix a typo

This task was originally needed to fix a docker installation issue
(see: #1030). This has been fixed, therefore it can be removed.

Fixes: #2199
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
(cherry picked from commit 1cba626484806f830e9ff03c48739292ce9fe6dc)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agohandlers: restart daemons only if docker is running 2217/head
Guillaume Abrioux [Mon, 27 Nov 2017 13:59:30 +0000 (14:59 +0100)]
handlers: restart daemons only if docker is running

In case where docker CLI is available but docker is not running, we
don't want to trigger the restart of the daemons.

Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1510555
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
(cherry picked from commit b26a840002c548dcbddf71a04229c16cff3699c6)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoOpenstack: replaced hardcoded pool names with variables for openstack (nova) user 2216/head
wintamute [Mon, 27 Nov 2017 10:21:05 +0000 (11:21 +0100)]
Openstack: replaced hardcoded pool names with variables for openstack (nova) user

(cherry picked from commit ebe0e60235f8dd8a9e4febec4de757b033f5d36a)
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
7 years agoMerge pull request #2208 from ceph/2207-bkp v3.0.15
Sébastien Han [Wed, 22 Nov 2017 16:46:20 +0000 (17:46 +0100)]
Merge pull request #2208 from ceph/2207-bkp

[skip ci] backport of 2207

7 years agocommon: install ceph-common on all the machines 2208/head
Sébastien Han [Wed, 22 Nov 2017 16:11:50 +0000 (17:11 +0100)]
common: install ceph-common on all the machines

Since some daemons now install their own packages the task checking the
ceph version fails on Debian systems. So the 'ceph-common' package must
be installed on all the machines.

Signed-off-by: Sébastien Han <seb@redhat.com>
(cherry picked from commit bb7b29a9fcc33e7316bbe7dad3dc3cd5395ef8ab)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agopurge-cluster: remove usage of `with_fileglob` 2203/head
Guillaume Abrioux [Thu, 16 Nov 2017 10:49:18 +0000 (11:49 +0100)]
purge-cluster: remove usage of `with_fileglob`

`with_fileglob` loops over files on the machine where ansible-playbook
is being run.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
(cherry picked from commit 947766e294cf8b18eb551a8093b0bbdca7610858)

7 years agoMerge pull request #2200 from ceph/2197-bkp v3.0.14
Guillaume Abrioux [Mon, 20 Nov 2017 13:03:05 +0000 (14:03 +0100)]
Merge pull request #2200 from ceph/2197-bkp

backport of 2197

7 years agoosd: ensure a gpt label is set on device 2200/head
Guillaume Abrioux [Fri, 17 Nov 2017 16:32:23 +0000 (17:32 +0100)]
osd: ensure a gpt label is set on device

ceph-disk prepare will fail on jewel if a GPT label is not present on
device.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
(cherry picked from commit efe06be10ff6215fcc8a42919014650287b99ab7)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoMerge pull request #2188 from ceph/2182-bkp v3.0.13
Guillaume Abrioux [Thu, 16 Nov 2017 20:37:14 +0000 (21:37 +0100)]
Merge pull request #2188 from ceph/2182-bkp

[skip ci] 2182-bkp

7 years agorbd: enable ceph-rbd-mirror.target for releases <= luminous 2188/head
Guillaume Abrioux [Thu, 16 Nov 2017 09:50:30 +0000 (10:50 +0100)]
rbd: enable ceph-rbd-mirror.target for releases <= luminous

when `ceph-rbd-mirror.target` is not enabled, the service won't start
after a reboot because there is a dependency between these two units.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
(cherry picked from commit ccad0ebf26c60e5856530aab1f6f7c2b1e6eae84)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoMerge pull request #2191 from ceph/empty-acl-bkp
Guillaume Abrioux [Thu, 16 Nov 2017 19:48:46 +0000 (20:48 +0100)]
Merge pull request #2191 from ceph/empty-acl-bkp

Make openstack_keys param support no acls list

7 years agoMerge pull request #2190 from ceph/fix_wildcard_remove-bkp
Guillaume Abrioux [Thu, 16 Nov 2017 18:57:55 +0000 (19:57 +0100)]
Merge pull request #2190 from ceph/fix_wildcard_remove-bkp

purge-docker: remove osd disk prepare logs

7 years agoMake openstack_keys param support no acls list 2191/head
John Fulton [Thu, 16 Nov 2017 16:29:59 +0000 (11:29 -0500)]
Make openstack_keys param support no acls list

A recent change [1] required that the openstack_keys
param always containe an acls list. However, it's
possible it might not contain that list. Thus, this
param sets a default for that list to be empty if it
is not in the structure as defined by the user.

[1] d65cbaa53952269ec9a2e76fca8203ce7ad22c2b

(cherry picked from commit d73f751b6372cbd60ddd6a24ae53232307b32fac)

7 years agoMerge pull request #2187 from ceph/fix_dmcrypt-bkp
Guillaume Abrioux [Thu, 16 Nov 2017 17:07:02 +0000 (18:07 +0100)]
Merge pull request #2187 from ceph/fix_dmcrypt-bkp

[backport][stable-3.0] Fix dmcrypt bkp

7 years agopurge-docker: remove osd disk prepare logs 2190/head
Guillaume Abrioux [Thu, 16 Nov 2017 10:36:17 +0000 (11:36 +0100)]
purge-docker: remove osd disk prepare logs

`with_fileglob` loops over files on the machine that runs the playbook.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
(cherry picked from commit d9c1b61092db581428a2d22affbb6d71fa379a39)

7 years agoosd: remove leftover from osd partition 2187/head
Sébastien Han [Thu, 16 Nov 2017 13:58:40 +0000 (14:58 +0100)]
osd: remove leftover from osd partition

We used to support osds that are a partition. This is long gone so
removing this task.

Signed-off-by: Sébastien Han <seb@redhat.com>
(cherry picked from commit 932345ab2acaeb4467572266b050551db41ce3b7)

7 years agoosd: remove failed_when on activation
Sébastien Han [Thu, 16 Nov 2017 13:57:49 +0000 (14:57 +0100)]
osd: remove failed_when on activation

There is no need to continue if the activation fails.

Signed-off-by: Sébastien Han <seb@redhat.com>
(cherry picked from commit b1c13223579a034db5a4ab47460a4eac0a38b1e2)

7 years agoosd: fix bad activation for dmcrypt
Sébastien Han [Thu, 16 Nov 2017 13:55:08 +0000 (14:55 +0100)]
osd: fix bad activation for dmcrypt

We were activating dmcrypt devices with the wrong command. Basically the
first task execute the wrong activate command. The task fails but
continues because of the 'failed_when: false'. Then the right activation
sequence is being done by the next task.

Signed-off-by: Sébastien Han <seb@redhat.com>
(cherry picked from commit 80d3a242d0134de5790a015b40c8af1e37a2322b)

7 years agoMerge pull request #2183 from ceph/fix_nfs_pkg-bkp
Guillaume Abrioux [Thu, 16 Nov 2017 12:33:03 +0000 (13:33 +0100)]
Merge pull request #2183 from ceph/fix_nfs_pkg-bkp

fix: remove the duplicated code

7 years agofix: remove the duplicated code 2183/head
Yixing Yan [Thu, 16 Nov 2017 08:45:03 +0000 (16:45 +0800)]
fix: remove the duplicated code

(cherry picked from commit 097249371f0baae2b97367c219ac5973c57d60d3)

7 years agoMerge pull request #2179 from ceph/chmod-facl-on-all-mon-bkp v3.0.12
Guillaume Abrioux [Wed, 15 Nov 2017 18:40:03 +0000 (19:40 +0100)]
Merge pull request #2179 from ceph/chmod-facl-on-all-mon-bkp

Set permissions and ACLs of OpenStack keys on all ceph-mons

7 years agoSet permissions and ACLs of OpenStack keys on all ceph-mons 2179/head
John Fulton [Mon, 6 Nov 2017 22:24:48 +0000 (17:24 -0500)]
Set permissions and ACLs of OpenStack keys on all ceph-mons

If ceph-ansible deploys a Ceph cluster with "openstack_config: true"
and sets the openstack_keys map to have certain ACLs or permissions,
the requested ACLs or permissions are only set on one of the monitor
nodes [2] when they should be set on all of them.

This patch solves [3] the above issue by having the chmod and setfacl
tasks iterate the list of mon nodes (including the mon node that the
task was delegated to) to apply the chmod of setfacl to the keys in
openstack_keys.

[1]
```
openstack_keys:
  - { name: client.openstack, key: "$(ceph-authtool --gen-print-key)", mon_cap: "allow r", osd_cap: "allow class-read object_prefix rbd_children, allow rwx pool=images, allow rwx pool=vms, allow rwx pool=volumes, allow rwx pool=backups", mode: "0600", acls: ["u:nova:r--", "u:cinder:r--", "u:glance:r--", "u:gnocchi:r--"] }
```
[2]
```
$ ansible mons -m shell -b -a "ls -l /etc/ceph/ceph.client.openstack.keyring ; getfacl /etc/ceph/ceph.client.openstack.keyring"
192.168.1.26 | SUCCESS | rc=0 >>
-rw-r-----+ 1 root root 253 Nov  3 20:30 /etc/ceph/ceph.client.openstack.keyring
user::rw-
user:glance:r--
user:nova:r--
user:cinder:r--
user:gnocchi:r--
group::---
mask::r--
other::---getfacl: Removing leading '/' from absolute path names

192.168.1.29 | SUCCESS | rc=0 >>
-rw-r--r--. 1 root root 253 Nov  3 20:30 /etc/ceph/ceph.client.openstack.keyring
user::rw-
group::r--
other::r--getfacl: Removing leading '/' from absolute path names

192.168.1.23 | SUCCESS | rc=0 >>
-rw-r--r--. 1 root root 253 Nov  3 20:30 /etc/ceph/ceph.client.openstack.keyring
user::rw-
group::r--
other::r--getfacl: Removing leading '/' from absolute path names

$
```
[3]
```
(undercloud) [stack@hci-director ceph-ansible]$ ansible mons -m shell -b -a "ls -l /etc/ceph/ceph.client.openstack.keyring ; getfacl /etc/ceph/ceph.client.openstack.keyring"
192.168.1.25 | SUCCESS | rc=0 >>
-rw-r-----+ 1 root root 253 Nov 14 01:12 /etc/ceph/ceph.client.openstack.keyring
user::rw-
user:glance:r--
user:nova:r--
user:cinder:r--
user:gnocchi:r--
group::---
mask::r--
other::---getfacl: Removing leading '/' from absolute path names

192.168.1.29 | SUCCESS | rc=0 >>
-rw-r-----+ 1 root root 253 Nov 14 01:12 /etc/ceph/ceph.client.openstack.keyring
user::rw-
user:glance:r--
user:nova:r--
user:cinder:r--
user:gnocchi:r--
group::---
mask::r--
other::---getfacl: Removing leading '/' from absolute path names

192.168.1.27 | SUCCESS | rc=0 >>
-rw-r-----+ 1 root root 253 Nov 14 01:12 /etc/ceph/ceph.client.openstack.keyring
user::rw-
user:glance:r--
user:nova:r--
user:cinder:r--
user:gnocchi:r--
group::---
mask::r--
other::---getfacl: Removing leading '/' from absolute path names

(undercloud) [stack@hci-director ceph-ansible]$
```

(cherry picked from commit d65cbaa53952269ec9a2e76fca8203ce7ad22c2b)

7 years agoMerge pull request #2173 from ceph/remove_osd_zap-bkp
Guillaume Abrioux [Wed, 15 Nov 2017 06:36:07 +0000 (07:36 +0100)]
Merge pull request #2173 from ceph/remove_osd_zap-bkp

[backport][stable-3.0] tests: remove OSD_FORCE_ZAP variable from tests

7 years agotests: remove OSD_FORCE_ZAP variable from tests 2173/head
Guillaume Abrioux [Tue, 14 Nov 2017 10:58:38 +0000 (11:58 +0100)]
tests: remove OSD_FORCE_ZAP variable from tests

according to ceph/ceph-container#840, this variable is no longer needed.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
(cherry picked from commit aa0b1ed11872ea6f69f69b4a376b13ae5d6e12e0)

7 years agoMerge pull request #2171 from ceph/2165-bkp
Sébastien Han [Tue, 14 Nov 2017 13:47:28 +0000 (14:47 +0100)]
Merge pull request #2171 from ceph/2165-bkp

[skip ci] backport of 2165

7 years agotests: set CEPH_STABLE_RELEASE in ceph-build 2171/head
Guillaume Abrioux [Fri, 10 Nov 2017 17:25:25 +0000 (18:25 +0100)]
tests: set CEPH_STABLE_RELEASE in ceph-build

`CEPH_STABLE_RELEASE` needs to bet set in ceph-build according
to ceph/ceph-ansible#2165

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
(cherry picked from commit 7a1d7d92ff4d6f38be9f11f4c26909b361b58f99)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoMerge pull request #2169 from ceph/2167-bkp v3.0.11
Sébastien Han [Mon, 13 Nov 2017 16:40:12 +0000 (03:40 +1100)]
Merge pull request #2169 from ceph/2167-bkp

[skip ci] backport of 2167

7 years agodefaults: fix rgw restart script in handlers 2169/head
Guillaume Abrioux [Mon, 13 Nov 2017 15:27:51 +0000 (16:27 +0100)]
defaults: fix rgw restart script in handlers

Like 80d32dec, the path to the fact is not correct.
In any case, we will retrieve the IP address in hostvars, the variable
is the way we get the interface name according where it has been set
(eg.: inventory host file vs. group_vars/)

Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1510906
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
(cherry picked from commit 44df3f9102773c10011c82b5c1a20e7ae46e0001)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoMerge pull request #2168 from ceph/fix_lvm-bkp
Sébastien Han [Mon, 13 Nov 2017 15:42:29 +0000 (02:42 +1100)]
Merge pull request #2168 from ceph/fix_lvm-bkp

[skip ci] osd: skip some set_fact when osd_scenario=lvm

7 years agoosd: skip some set_fact when osd_scenario=lvm 2168/head
Guillaume Abrioux [Tue, 7 Nov 2017 14:30:08 +0000 (15:30 +0100)]
osd: skip some set_fact when osd_scenario=lvm

these tasks are not needed when using `osd_scenario: lvm`

Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1509230
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
(cherry picked from commit 238754a8446f5ae3d2eefea9420e6d61f4426481)

7 years agoMerge pull request #2164 from ceph/ansible-2.4.1.0-bkp
Guillaume Abrioux [Fri, 10 Nov 2017 17:58:55 +0000 (18:58 +0100)]
Merge pull request #2164 from ceph/ansible-2.4.1.0-bkp

[skip ci] backport of ansible-2.4.1.0

7 years agorpm: require ansible 2.4.1.0 2164/head
Ken Dreyer [Fri, 10 Nov 2017 17:11:31 +0000 (10:11 -0700)]
rpm: require ansible 2.4.1.0

2.4.0.0 has some bugs, and we're going to ship with v2.4.1.0.

(cherry picked from commit 3c999b3191678ff38d90bf1b1e718871ac3357d1)
Signed-off-by: Ken Dreyer <kdreyer@redhat.com>
7 years agoMerge pull request #2160 from ceph/bz-1510470-bkp v3.0.10
Guillaume Abrioux [Fri, 10 Nov 2017 14:19:43 +0000 (15:19 +0100)]
Merge pull request #2160 from ceph/bz-1510470-bkp

purge-docker-cluster: ensure old logs are removed

7 years agoMerge pull request #2161 from ceph/update_test-bkp
Guillaume Abrioux [Fri, 10 Nov 2017 13:12:29 +0000 (14:12 +0100)]
Merge pull request #2161 from ceph/update_test-bkp

[Backport][stable-3.0] Update test bkp

7 years agopurge: containers are not stopped 2161/head
Guillaume Abrioux [Wed, 25 Oct 2017 05:48:42 +0000 (07:48 +0200)]
purge: containers are not stopped

During purge osd, the containers are not stopped because of a typo, as a
result, all the devices can't be unmounted later.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
(cherry picked from commit f90f2f3a044ce18bf2b7b6b094857d886d7d41f2)

7 years agotests: sync with ceph/ceph-build for ceph-ansible-nightlies
Guillaume Abrioux [Fri, 27 Oct 2017 11:22:28 +0000 (13:22 +0200)]
tests: sync with ceph/ceph-build for ceph-ansible-nightlies

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
(cherry picked from commit de0900eae174f9b74761016c6a6927b54a4d43b7)

7 years agotests: split purge_cluster, followup on a53aa9e
Guillaume Abrioux [Fri, 20 Oct 2017 15:18:39 +0000 (17:18 +0200)]
tests: split purge_cluster, followup on a53aa9e

- split purge_cluster because we need to test filestore and bluestore
scenarios.
- clean some leftover.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
(cherry picked from commit 280de30484c3bd02d824074e67a6eeedf1e720e6)

7 years agotests: followup on testing against ansible2.4
Guillaume Abrioux [Mon, 30 Oct 2017 13:45:59 +0000 (14:45 +0100)]
tests: followup on testing against ansible2.4

ceph-ansible is now being testing against ansible2.2 and ansible2.4. We
need to update tox.ini so we use the right version of testinfra
regarding which ansible version we are using.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
(cherry picked from commit 97b1cb02584e3947633aed91f3029fe8317792bc)

7 years agopurge-docker-cluster: ensure old logs are removed 2160/head
Guillaume Abrioux [Thu, 9 Nov 2017 09:42:14 +0000 (10:42 +0100)]
purge-docker-cluster: ensure old logs are removed

purge-docker-cluster must remove all osd_disk_prepare logs in
`{{ ceph_osd_docker_run_script_path }}`, otherwise if you purge your
cluster and try to redeploy it, osds will fail to start since because it
will try to retrieve find a partition uuid which doesn't exist.

Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1510470
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
(cherry picked from commit fa675f2ead6216daec60048e7087122ef287bb0f)