git.apps.os.sepia.ceph.com Git - ceph-ansible.git/log

]> git.apps.os.sepia.ceph.com Git - ceph-ansible.git/log

Ken Dreyer [Tue, 19 Feb 2019 20:32:27 +0000 (13:32 -0700)]

makefile: change distro to el8

Change the default dist value to el8 for consistency, since many of the
upcoming ceph-ansible builds will be based on RHEL 8.

Signed-off-by: Ken Dreyer <kdreyer@redhat.com>

commit | commitdiff | tree

Ken Dreyer [Tue, 19 Feb 2019 20:31:15 +0000 (13:31 -0700)]

rpm: add RHEL 8 + Fedora support

Fedora and RHEL 8 have Python 3, so we must require Python 2 on RHEL 7
and Python 3 on RHEL 8.

Signed-off-by: Ken Dreyer <kdreyer@redhat.com>

commit | commitdiff | tree

David Waiting [Mon, 10 Dec 2018 14:54:18 +0000 (09:54 -0500)]

ensure at least one osd is up

The existing task checks that the number of OSDs is equal to the number of up OSDs before continuing.

The problem is that if none of the OSDs have been discovered yet, the task will exit immediately and subsequent pool creation will fail (num_osds = 0, num_up_osds = 0).

This is related to Bugzilla 1578086.

In this change, we also check that at least one OSD is present. In our testing, this results in the task correctly waiting for all OSDs to come up before continuing.

Signed-off-by: David Waiting <david_waiting@comcast.com>

commit | commitdiff | tree

Guillaume Abrioux [Mon, 18 Feb 2019 14:45:36 +0000 (15:45 +0100)]

main: add a retry/until for python installation

Add a retry/until in raw_install_python.yml to avoid unexpected
repository failures.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Mon, 18 Feb 2019 14:45:00 +0000 (15:45 +0100)]

tests: add more verbosity when running testinfra

Could be useful when troubleshooting testinfra/pytest issue.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Mon, 18 Feb 2019 13:59:18 +0000 (14:59 +0100)]

tests: share fixture instance across the whole module.

there's no need to rerun this part of the code for each function test.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Thu, 14 Feb 2019 14:36:27 +0000 (15:36 +0100)]

facts: ensure ceph_uid is set when running rhel

when hosts is running on RHEL, let's enforce ceph_uid to 167.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Thu, 14 Feb 2019 14:25:41 +0000 (15:25 +0100)]

container: fix tmpfiles.d ceph files

- fix uid/gid in ceph tmpfiles
- move file to `/etc/tmpfiles.d`

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Dimitri Savineau [Mon, 18 Feb 2019 16:38:35 +0000 (11:38 -0500)]

ansible.cfg: Add library path to configuration

Ceph module path needs to be configured if we want to avoid issues
like:

no action detected in task. This often indicates a misspelled module
name, or incorrect module path

Currently the ansible-lint command in Travis CI complains about that.

Signed-off-by: Dimitri Savineau <dsavinea@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Thu, 14 Feb 2019 09:23:19 +0000 (10:23 +0100)]

Revert "osd: container remove --pid=host"

This reverts commit bb2bbeb94115692ea072a1f6e2aed2c6c5336cec.

Looks like when not passing `--pid=host` we are facing some issues when
deploying more than 2 OSDs in containerized environment.

At the moment, we are still troubleshooting this issue but we prefer to
revert this commit so it doesn't block any PR in the CI.

As soon as we have a fix; we will push a new PR to remove `--pid=host`
(a revert of revert...)

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Wed, 13 Feb 2019 16:12:39 +0000 (17:12 +0100)]

tests: avoid 'Cannot allocate memory' error in testinfra

```
------------------------------ Captured log setup ------------------------------
display.py 174 INFO Wednesday 13 February 2019 15:54:15 +0000 (0:00:07.787) 0:02:11.607 ****
ansible.py 61 INFO RUN Ansible('setup', None, {'check': True, 'become': False}): {'_ansible_no_log': False,
'_ansible_parsed': False,
'_ansible_verbose_override': True,
'changed': False,
'module_stderr': u'Connection to 192.168.121.87 closed.\r\n',
'module_stdout': u'bash: /bin/sh: Cannot allocate memory\r\n',
'msg': u'MODULE FAILURE\nSee stdout/stderr for the exact error',
'rc': 126}
```

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Wed, 13 Feb 2019 15:35:57 +0000 (16:35 +0100)]

tests: use memory backend for cache fact

force ansible to generate facts for each run.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Wed, 13 Feb 2019 13:08:50 +0000 (14:08 +0100)]

validate: fix ntp_daemon_type check in validate

is_atomic is defined in ceph-facts or very early in main playbook.

In non containerized deployment, is_atomic is only set in ceph-facts
which is played after ceph-validate.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Wed, 13 Feb 2019 11:52:47 +0000 (12:52 +0100)]

site.yml: run ceph-validate before facts/defaults roles

ceph-validate must be run before any other role.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Wed, 13 Feb 2019 09:53:15 +0000 (10:53 +0100)]

tests: fix ubuntu-container-all_daemons

the public_network subnet used for this scenario was wrong.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Wed, 13 Feb 2019 09:07:15 +0000 (10:07 +0100)]

container: create ceph-common.conf tmpfiles.d if it doesn't exist

Otherwise the task will fail.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Mon, 11 Feb 2019 11:00:25 +0000 (12:00 +0100)]

facts: move two set_fact into ceph-facts

those two set_fact tasks should be moved in ceph-facts.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Tue, 12 Feb 2019 13:03:39 +0000 (14:03 +0100)]

switch_to_containers: support multiple rgw instances per host

add multiple rgw instances per host in switch_to_containers playbook.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Tue, 12 Feb 2019 09:56:28 +0000 (10:56 +0100)]

switch_to_containers: remove non-containerized systemd unit files

remove old systemd unit files (non-containerized) during the
switch_to_containers transition.

We have seen sometimes the unit started is the old one instead of the
new systemd unit generated.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Mon, 11 Feb 2019 10:20:30 +0000 (11:20 +0100)]

switch_to_containers: use ceph binary from container

use the ceph binary from the container instead of the host.
If the ceph CLI version isn't compatible between host and container
image, it can cause the CLI to hang.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Sun, 10 Feb 2019 07:21:43 +0000 (08:21 +0100)]

container: use tmpfiles.d to creates /run/ceph

instead of using `RuntimeDirectory` parameter in systemd unit files,
let's use a systemd `tmpfiles.d` to ensure `/run/ceph`.

Explanation:

`podman` doesn't create the `/var/run/ceph` if it doesn't exist the time
where the container is run while `docker` used to create it.
In case of `switch_to_containers` scenario, `/run/ceph` gets created by
a tmpfiles.d systemd file; when switching to containers, the systemd
unit file complains because `/run/ceph` already exists

The better fix would be to ensure `/usr/lib/tmpfiles.d/ceph-common.conf`
is removed and only rely on `RuntimeDirectory` from systemd unit file parameter
but we come from a non-containerized environment which is already running,
it means `/run/ceph` is already created and when starting the unit to
start the container, systemd will still complain and we can't simply
remove the directory if daemons are collocated.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Fri, 8 Feb 2019 07:52:03 +0000 (08:52 +0100)]

switch_to_containers: do not try to redeploy monitors

`ceph-mon` tries to redeploy monitors because it assumes it was not yet
deployed since `mon_socket_stat` and `ceph_mon_container_stat` are
undefined (indeed, we stop the daemon before calling `ceph-mon` in the
switch_to_containers playbook).

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Wed, 6 Feb 2019 13:12:14 +0000 (14:12 +0100)]

tests: rename switch_to_containers

rename switch_to_containers scenario

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Rishabh Dave [Tue, 12 Feb 2019 06:55:13 +0000 (12:25 +0530)]

fix mistake in task that aborts when ntpd is chosen on Atomic

Since it's already confusing whether ntp_daemon_type should be "ntp" or
"ntpd", fix the mistake in the title of the task that aborts if
ntp_daemon_type is set to "ntpd" and OS being used is Atomic.

Signed-off-by: Rishabh Dave <ridave@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Mon, 11 Feb 2019 13:01:26 +0000 (14:01 +0100)]

doc: resync group_vars sample files

resync group_vars sample files with their corresponding original files.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Rishabh Dave [Tue, 5 Feb 2019 09:26:03 +0000 (14:56 +0530)]

don't install NTPd on Atomic

Since Atomic doesn't allow any installations and NTPd is not present
on Atomic image we are using, abort when ntp_daemon_type is set to ntpd.

https://github.com/ceph/ceph-ansible/issues/3572
Signed-off-by: Rishabh Dave <ridave@redhat.com>

commit | commitdiff | tree

Sébastien Han [Fri, 8 Feb 2019 15:05:20 +0000 (16:05 +0100)]

mon: do not hardcode ceph uid

167 is the ceph uid for Red Hat based system, thus trying to deploy a
monitor on Debian fail since the ceph user id on that system is 64045.
This commit uses the ceph_uid variable which contains the right uid
based on system/container detection.

Closes: https://github.com/ceph/ceph-ansible/issues/3589
Signed-off-by: Sébastien Han <seb@redhat.com>

commit | commitdiff | tree

Leah Neukirchen [Thu, 7 Feb 2019 17:09:21 +0000 (18:09 +0100)]

Fix uses of default(omit) with string concatenation

When {{omit}} is concatenated with another string, it expands to something
like __omit_place_holder__63eea0d96dd6ed867b95405e11d87dddf61f448d.
However, in these use-cases we need an empty string.

Regression introduced in d53f55e807e.

Signed-off-by: Leah Neukirchen <leah.neukirchen@mayflower.de>

commit | commitdiff | tree

Patrick C. F. Ernzer [Thu, 7 Feb 2019 15:36:20 +0000 (16:36 +0100)]

setup_ntp: call handler to disable ntpd if chronyd used

The task setup chronyd called the handler disable chronyd, which of
course defeats the purpose.

Changing the task to disable ntpd instead fixes the issue of chronyd
being disabled after it got enabled.

Fixes: #3582
Signed-off-by: Patrick C. F. Ernzer pcfe@redhat.com

commit | commitdiff | tree

Guillaume Abrioux [Thu, 7 Feb 2019 22:52:51 +0000 (23:52 +0100)]

iscsi-gws: remove a leftover

remove leftover introduced by 9d590f4

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Thu, 7 Feb 2019 13:16:13 +0000 (14:16 +0100)]

iscsi: fix permission denied error

Typical error:
```
fatal: [iscsi-gw0]: FAILED! =>
msg: 'an error occurred while trying to read the file ''/home/guits/ceph-ansible/tests/functional/all_daemons/fetch/e5f4ab94-c099-4781-b592-dbd440a9d6f3/iscsi-gateway.key'': [Errno 13] Permission denied: b''/home/guits/ceph-ansible/tests/functional/all_daemons/fetch/e5f4ab94-c099-4781-b592-dbd440a9d6f3/iscsi-gateway.key'''
```

`become: True` is not needed on the following task:

`copy crt file(s) to gateway nodes`.

Since it's already set in the main playbook (site.yml/site-container.yml)

The thing is that the files get generated in the 'fetch_directory' with
root user because there is a 'delegate_to' + we run the playbook with
`become: True` (from main playbook).

The idea here is to create files under ansible user so we can open them
later to copy them on the remote machine.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Sébastien Han [Wed, 6 Feb 2019 22:09:26 +0000 (23:09 +0100)]

osd: container remove --pid=host

Let's try again with the Nautilus release.

Closes: https://github.com/ceph/ceph-ansible/issues/1297
Signed-off-by: Sébastien Han <seb@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Tue, 5 Feb 2019 10:44:38 +0000 (11:44 +0100)]

repo: update gitignore file

- do not ignore raw_install_python.yml

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Wed, 6 Feb 2019 07:32:04 +0000 (08:32 +0100)]

ansible: increase fact cache timeout

10m seems a bit low, indeed, a complete run can take more than 1h.
Let's increase it to 2h

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

John Fulton [Tue, 5 Feb 2019 20:28:37 +0000 (15:28 -0500)]

Fix CNI error when net=host is not used on OSD calls

Follow up fix that 410abd7 missed.

Related: ceph#3561

Signed-off-by: John Fulton <fulton@redhat.com>

commit | commitdiff | tree

John Fulton [Mon, 4 Feb 2019 23:21:42 +0000 (23:21 +0000)]

Create Ceph Initial Dirs earlier

Include tasks from create_ceph_initial_dirs earlier during
ceph config role.

Fixes: #3568
Signed-off-by: John Fulton <fulton@redhat.com>

commit | commitdiff | tree

John Fulton [Thu, 31 Jan 2019 21:17:20 +0000 (16:17 -0500)]

Fix CNI error when net=host is not used in some podman calls

With 'podman version 1.0.0' on RHEL8 beta the 'get ceph version' and
'ceph monitor mkfs' commands fail [1] with "error configuring network
namespace for container Missing CNI default network".

When net=host is added these errors are resolved. net=host is used in
many other calls (grep -R net=host | wc -l --> 38).

Fixes: #3561
Signed-off-by: John Fulton <fulton@redhat.com>
(cherry picked from commit 410abd77455a92a85f0674577e22af0af894964f)

commit | commitdiff | tree

Guillaume Abrioux [Tue, 5 Feb 2019 15:17:08 +0000 (16:17 +0100)]

site-container: import role ceph-facts

when ceph-container-common notifies handlers because a new container
image has been pulled, ceph-handler will throw an error because of
undefined variables since they are set in ceph-facts role.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Tue, 5 Feb 2019 13:19:20 +0000 (14:19 +0100)]

set RuntimeDirectory in all systemd unit templates

/var/run/ceph resides in a non persistent filesystem (tmpfs)
After a reboot, all daemons won't start because this directory will be
missing.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Tue, 5 Feb 2019 08:49:50 +0000 (09:49 +0100)]

tests: increase amount of memory for all vms

double the amount of memory from 512m to 1024m.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Tue, 5 Feb 2019 08:25:20 +0000 (09:25 +0100)]

osd: bind mount /var/run/udev/

without this, the command `ceph-volume lvm list --format json` hangs and
takes a very long time to complete.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Mon, 4 Feb 2019 15:58:05 +0000 (16:58 +0100)]

tests: remove useless test

`test_mon_host_line_has_correct_value()` will cover this test in
anycase. It doesn't worth to have a dedicated test for this.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Mon, 4 Feb 2019 15:14:37 +0000 (16:14 +0100)]

tests: update test_mon_host_line_has_correct_value()

since msgr2 introduction, this test must be updated.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Wed, 30 Jan 2019 13:16:19 +0000 (14:16 +0100)]

tests: add a rhel8 scenario testing

test upstream with rhel8 vagrant image

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Fri, 1 Feb 2019 14:08:53 +0000 (15:08 +0100)]

facts: set timeout_command fact in ceph-defaults

- also add `--foreground` which seems to fix some issue we are facing when
using timeout with `podman`.
- use this fact in the `is ceph running already?` task.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Wed, 16 Jan 2019 09:27:23 +0000 (10:27 +0100)]

podman: support podman installation on rhel8

Add required changes to support podman on rhel8

Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1667101
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Patrick C. F. Ernzer [Wed, 16 Jan 2019 22:40:27 +0000 (23:40 +0100)]

ansible.cfg: change log_path to directory used by fact_caching_connection

Since fact_caching_connection uses a directory in $HOME already, write the
ansible.log to the same directory.

Fixes: #3509
Signed-off-by: Patrick C. F. Ernzer <pcfe@redhat.com>

commit | commitdiff | tree

John Fulton [Fri, 1 Feb 2019 13:32:14 +0000 (08:32 -0500)]

Make python print statements python3 compatible

The restart_osd_daemon.sh generated from the j2 template
contains a python call which uses 'print x' instead of
'print(x)'. Add the missing parentheses to make this call
compatible with both 2 and 3.

Also add parentheses to other python print calls found
in roles/ceph-client/defaults/main.yml and
infrastructure-playbooks/cluster-os-migration.yml.

Fixes: https://bugzilla.redhat.com/show_bug.cgi?id=1671721
Signed-off-by: John Fulton <fulton@redhat.com>

commit | commitdiff | tree

Andrew Schoen [Thu, 31 Jan 2019 20:33:31 +0000 (14:33 -0600)]

tests: do not run lvm_setup.yml on lvm_auto_discovery tests

Signed-off-by: Andrew Schoen <aschoen@redhat.com>

commit | commitdiff | tree

Andrew Schoen [Thu, 31 Jan 2019 19:38:53 +0000 (13:38 -0600)]

ceph-config: do not always assume containers when calculating num_osds

CEPH_CONTAINER_IMAGE should be None if containerized_deployment
is False.

Signed-off-by: Andrew Schoen <aschoen@redhat.com>

commit | commitdiff | tree

Andrew Schoen [Thu, 31 Jan 2019 16:38:20 +0000 (10:38 -0600)]

tests: fix Vagrantfile symlink for lvm-auto-discovery tests

Signed-off-by: Andrew Schoen <aschoen@redhat.com>

commit | commitdiff | tree

Andrew Schoen [Thu, 10 Jan 2019 19:25:03 +0000 (13:25 -0600)]

docs: using osd_auto_discovery and osd_scenario lvm

Signed-off-by: Andrew Schoen <aschoen@redhat.com>

commit | commitdiff | tree

Andrew Schoen [Thu, 10 Jan 2019 19:20:32 +0000 (13:20 -0600)]

tests: adds the lvm_auto_discovery container testing scenario

This tests osd_auto_discovery: True, containerized_deployment: True
and the lvm osd scenario

Signed-off-by: Andrew Schoen <aschoen@redhat.com>

commit | commitdiff | tree

Andrew Schoen [Thu, 10 Jan 2019 19:06:17 +0000 (13:06 -0600)]

tests: create as many drives for virtualbox as libvirt

This just ensures that virtualbox and libvirt are making
the same amount of devices for tests.

Signed-off-by: Andrew Schoen <aschoen@redhat.com>

commit | commitdiff | tree

Andrew Schoen [Thu, 10 Jan 2019 19:05:56 +0000 (13:05 -0600)]

tests: adds the lvm_auto_discovery scenario

Signed-off-by: Andrew Schoen <aschoen@redhat.com>

commit | commitdiff | tree

Andrew Schoen [Thu, 10 Jan 2019 17:26:19 +0000 (11:26 -0600)]

ceph-facts: generate devices when osd_auto_discovery is true

This task used to live in ceph-osd, but we need it defined here to that
ceph-config can use it when trying to determine the number of osds.

Signed-off-by: Andrew Schoen <aschoen@redhat.com>

commit | commitdiff | tree

Andrew Schoen [Thu, 10 Jan 2019 17:22:53 +0000 (11:22 -0600)]

validate: do not validate lvm config if osd_auto_discovery is true

If osd_auto_discovery is set with the lvm scenario it's expected for
lvm_volumes and devices to be empty.

Signed-off-by: Andrew Schoen <aschoen@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Fri, 1 Feb 2019 08:10:56 +0000 (09:10 +0100)]

tests: ensure iptables rule is inserted for rgw_multisite job

wip

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Thu, 31 Jan 2019 15:58:01 +0000 (16:58 +0100)]

tests: run dev_setup and lvm_setup on secondary cluster for rgw_multisite

Otherwise, the deployment of the second cluster fails.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

John Fulton [Wed, 30 Jan 2019 20:06:55 +0000 (15:06 -0500)]

Do not timeout podman/docker pull if timeout value is '0'

If user sets "docker_pull_timeout: '0'" then do not use the
timeout command when running podman/docker pull. Also, use
"timeout -s KILL"; without KILL, podman on RHEL8 beta does
not timeout and deployment can hang.

Related: https://bugzilla.redhat.com/show_bug.cgi?id=1670625
Signed-off-by: John Fulton <fulton@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Wed, 30 Jan 2019 23:07:30 +0000 (00:07 +0100)]

config: support num_osds fact setting in containerized deployment

This part of the code must be supported in containerized deployment

Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1664112
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Wed, 30 Jan 2019 22:01:20 +0000 (23:01 +0100)]

tests: add missing hosts file for ubuntu testing

we need it for dev-ubuntu-container-all_daemons job

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Wed, 30 Jan 2019 21:58:01 +0000 (22:58 +0100)]

tests: do not deploy iscsigw node on ubuntu

iSCSI gateways can only be deployed on Red Hat OS family.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Wed, 30 Jan 2019 21:42:42 +0000 (22:42 +0100)]

tests: run lvm_setup.yml only when osd_scenario is lvm

especially for ooo_collocation scenario which is still using ceph-disk
testing.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Ramana Raja [Fri, 25 Jan 2019 10:15:21 +0000 (15:45 +0530)]

Install nfs-ganesha stable v2.7

nfs-ganesha v2.5 and 2.6 have hit EOL. Install nfs-ganesha v2.7
stable that is currently being maintained.

Signed-off-by: Ramana Raja <rraja@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Thu, 10 Jan 2019 13:08:42 +0000 (14:08 +0100)]

common: clean monitor initial keyring code

let's not be blocked by the fact we don't have the initial keyring in
`{{ fetch_directory }}`

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Tue, 8 Jan 2019 09:57:51 +0000 (10:57 +0100)]

facts: clean fsid generation code

clean some leftover and duplicate code.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Patrick Donnelly [Sun, 27 Jan 2019 05:11:41 +0000 (21:11 -0800)]

do not set ceph_release to dummy

When we're deploying a dev branch.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Tue, 29 Jan 2019 11:16:28 +0000 (12:16 +0100)]

tests: do not play dev_setup on containerized deployment

using `!` mark in tox.ini doesn't work on comma separated list.
The idea here is to skip all containerized scenario in dev_setup.yml and
use the `!` for the update scenario.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Mon, 28 Jan 2019 12:16:26 +0000 (13:16 +0100)]

tests: update envlist

- switch_to_containers, ooo_collocation, podman should be able to specify
which os they are running on.
- those scenarios are only container scenario.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Patrick Donnelly [Sat, 26 Jan 2019 02:48:28 +0000 (18:48 -0800)]

use shortname in keyring path

socket.gethostname may return a FQDN. Problem found in Linode.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Fri, 25 Jan 2019 10:05:55 +0000 (11:05 +0100)]

tests: fix rgw testinfra failure

fix the wrong path used in various rgw testinfra tests.
set `1` as default value for `radosgw_num_instances`: if
`ansible_vars.get(radosgw_num_instances)` returns `None`, we can assume
there's only 1 instance since it's the default value in ceph-defaults.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Zack Cerza [Thu, 24 Jan 2019 19:49:09 +0000 (12:49 -0700)]

Fix ceph.conf generation

877979c78 in #3486 broke ceph.conf generation entirely. Remove the stray
curly brace.

Signed-off-by: Zack Cerza <zack@redhat.com>

commit | commitdiff | tree

Noah Watkins [Thu, 17 Jan 2019 23:08:19 +0000 (15:08 -0800)]

shrink_osd: use cv zap by fsid to remove parts/lvs

Fixes:
https://bugzilla.redhat.com/show_bug.cgi?id=1569413
https://bugzilla.redhat.com/show_bug.cgi?id=1572933

Signed-off-by: Noah Watkins <noahwatkins@gmail.com>

commit | commitdiff | tree

Noah Watkins [Wed, 16 Jan 2019 23:50:23 +0000 (15:50 -0800)]

test: add missing test dependency

[nwatkins@smash ceph-ansible]$ virtualenv env
[nwatkins@smash ceph-ansible]$ env/bin/pip install -r tests/requirements.txt
[nwatkins@smash ceph-ansible]$ env/bin/python -c "import mock"
Traceback (most recent call last):
File "<string>", line 1, in <module>
ModuleNotFoundError: No module named 'mock'

Signed-off-by: Noah Watkins <noahwatkins@gmail.com>

commit | commitdiff | tree

Noah Watkins [Wed, 16 Jan 2019 23:50:08 +0000 (15:50 -0800)]

cv: support zap by osd fsid

Signed-off-by: Noah Watkins <noahwatkins@gmail.com>

commit | commitdiff | tree

Guillaume Abrioux [Thu, 24 Jan 2019 09:54:32 +0000 (10:54 +0100)]

tests: fix symlink to Vagrantfile

fix the symplink for Vagrantfile in rgw-multisite/container/secondary

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Thu, 24 Jan 2019 08:01:05 +0000 (09:01 +0100)]

tests: set CEPH_STABLE_RELEASE to mimic for dev-* jobs

CEPH_STABLE_RELEASE must be set to mimic to perform a mimic to nautilus
upgrade on dev-* jobs.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Mon, 21 Jan 2019 10:29:26 +0000 (11:29 +0100)]

tests: remove an outdated comment

This comment is outdated, let's remove it.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Mon, 7 Jan 2019 14:38:15 +0000 (15:38 +0100)]

tests: update dev_setup for nfs_ganesha_stable value

since we now set this variable in inventory host, the regexp needs to be
updated, the assignment operator is `=`.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Thu, 20 Dec 2018 12:52:22 +0000 (13:52 +0100)]

tests: rename all node

let's name all node the same way to avoid confusion.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Wed, 19 Dec 2018 13:23:48 +0000 (14:23 +0100)]

tests: set `ANSIBLE_KEEP_REMOTE_FILES`

this is useful when trying to debug an ansible module issue.
This will allow us to replay a failing module on a node when needed.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Wed, 19 Dec 2018 12:43:39 +0000 (13:43 +0100)]

tests: change ceph_osd_docker_run_script_path for add_osds

set `ceph_osd_docker_run_script_path` to /var/tmp for `add_osds`
scenario

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Wed, 19 Dec 2018 09:17:49 +0000 (10:17 +0100)]

tests: play lvm_setup.yml on all scenarios

We should play lvm_setup.yml on all scenario except `lvm_batch`.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Fri, 14 Dec 2018 11:01:16 +0000 (12:01 +0100)]

tests: reorganize directories layout

This commit reorganizes the testing directory layout.
The idea is to have more consistency with the names of scenario and
their corresponding path, eg: non-container vs. container: each scenario
has a subdirectory for container deployment.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Fri, 14 Dec 2018 09:45:04 +0000 (10:45 +0100)]

tox: add new factor for OS

This way we can test every scenario on all distribution.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Wed, 5 Dec 2018 22:19:48 +0000 (23:19 +0100)]

tox: refact environment naming

this commit refacts the way the environment are named by adding a factor
`{non_container,container}`. This will avoid a lot of duplicate
definition in tox.ini and bring kind of consistency.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Tue, 27 Nov 2018 08:02:09 +0000 (09:02 +0100)]

tests: update default value for CEPH_STABLE_RELEASE

- update value for `CEPH_STABLE_RELEASE`: next release will ship with
`nautilus`. This variable is used for stable branch only, this way, it
will be ready when next stable version will be released.

- test upgrade from mimic to ceph@master: don't run dev_setup.yml on update
scenario, and run it in [update] section so we update from mimic to
ceph@master.

- run lvm_setup.yml for all scenarios except `lvm_batch`

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Mon, 26 Nov 2018 22:22:35 +0000 (23:22 +0100)]

tests: cleanup tox definition

remove useless environment variable definition since we know we only
test ceph@master on ceph-ansible master branch.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Wed, 5 Dec 2018 08:36:22 +0000 (09:36 +0100)]

tests: require six==1.10.0

sometimes it can fail because the version of `six` package is prior to
1.10.0.
This commit ensures the version is enforced.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Rishabh Dave [Wed, 23 Jan 2019 10:49:31 +0000 (16:19 +0530)]

tests: fix identation issue in test_rgw.py

The error was introduced by the commit
1ac94c048ff1d1385de2892d0ecef7879ec563e9.

Fixes: https://github.com/ceph/ceph-ansible/issues/3528
Signed-off-by: Rishabh Dave <ridave@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Tue, 22 Jan 2019 13:25:45 +0000 (14:25 +0100)]

override ceph_release with ceph_stable_release

when `ceph_origin` is set to `'repository'` and `ceph_repository` to
`'community'` we need to ensure `ceph_release` reflect
`ceph_stable_release`.

4a3f180f9d29d5a31468ebb3d1c5f31a53a93960 simply removed the override
while it should just have to be run only when the condition mentioned
above is satisfied.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Tue, 22 Jan 2019 10:12:37 +0000 (11:12 +0100)]

rolling_update: support multiple rgw instance

1ac94c048ff1d1385de2892d0ecef7879ec563e9 introduced the support of
multiple rgw instances on a single host but somehow has missed to
implement this feature in rolling_update.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Mon, 21 Jan 2019 15:08:00 +0000 (16:08 +0100)]

config: make sure ceph_release is set for all client node

`ceph_release` is set in `ceph-container-common` but this role is
played only on first node for clients, this means ceph-config will fail
on all client nodes except the first one.

This commit ensure ceph_release is set for all client nodes.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Sébastien Han [Mon, 21 Jan 2019 11:08:56 +0000 (12:08 +0100)]

mon: enable msgr2

Enabling msgr2 style declaration for Nautilus and above. Prior releases
will keep the right syntax.
When upgrading from Mimic to Nautilus we must maintain something in the
form of:

mon_host = [v1:127.0.0.1:6789/0,v2:127.0.0.1:3300/0]

Signed-off-by: Sébastien Han <seb@redhat.com>

commit | commitdiff | tree

Sébastien Han [Wed, 9 Jan 2019 12:23:07 +0000 (13:23 +0100)]

mon: ability to change mon listening port on container

You can now use 'ceph_mon_container_listen_port' to change the port the
monitor will listen on.
Setting the default to 3300 (assigned by IANA) since Nautilus has released the messenger2
transport protocol.

Signed-off-by: Sébastien Han <seb@redhat.com>

commit | commitdiff | tree

Sébastien Han [Mon, 21 Jan 2019 12:53:53 +0000 (13:53 +0100)]

Revert "mon: force peer addition"

This reverts commit ee08d1f89a588e878324141dd0f80c65058a377d which was
mostly to workaround a bug in ceph@master. Now, ceph@master is fixed so
reverting this. Thanks to https://github.com/ceph/ceph/pull/25900

Signed-off-by: Sébastien Han <seb@redhat.com>

commit | commitdiff | tree

Sébastien Han [Wed, 9 Jan 2019 12:22:22 +0000 (13:22 +0100)]

ceph-fact: resync group_vars file

Run ./generate_group_vars_sample.sh to populate the groups_vars file.

Signed-off-by: Sébastien Han <seb@redhat.com>

commit | commitdiff | tree

Giulio Fidente [Fri, 18 Jan 2019 08:03:40 +0000 (09:03 +0100)]

Preserve rolling_update backward compatibility with ansible < 2.5

Signed-off-by: Giulio Fidente <gfidente@redhat.com>

commit | commitdiff | tree

guihecheng [Fri, 9 Nov 2018 00:56:57 +0000 (08:56 +0800)]

rgw: add support for multiple rgw instances on a single host

With this, we could have multiple rgw instances on a single host
with a single run, don't have to use rgw-standalone.yml which does not
seems able to bind ports separately.
If you want to have multiple rgw instances, just change 'radosgw_instances'
to the number you want, which defaults to 1.
Not compatible with Multi-Site yet.

Signed-off-by: guihecheng <guihecheng@cmiot.chinamobile.com>

Unnamed repository; edit this file 'description' to name the repository.

RSS Atom