git.apps.os.sepia.ceph.com Git - ceph-ansible.git/log
ceph-ansible.git
7 years agoMerge pull request #2130 from ceph/2128-bkp v3.0.9
Sébastien Han [Fri, 3 Nov 2017 12:19:15 +0000 (13:19 +0100)]
Merge pull request #2130 from ceph/2128-bkp

[skip ci] 2128-bkp

7 years agoosd: enhance backward compatibility 2130/head
Sébastien Han [Thu, 2 Nov 2017 15:17:38 +0000 (16:17 +0100)]
osd: enhance backward compatibility

During the initial implementation of this 'old' thing we were falling
into this issue without noticing
https://github.com/moby/moby/issues/30341 and were blindly using --rm.
Now that this is fixed, the prepare container disappears and thus
activation fails.
I'm fixing this for old jewel images.

Also this fixes the machine reboot case where the docker logs are
purged. In the old scenario, we now store the log locally in the same
directory as the ceph-osd-run.sh script.

Signed-off-by: Sébastien Han <seb@redhat.com>
(cherry picked from commit d4ed9a2064e503ac4a4fe978cb9e196ca9150272)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoconfig: fix monitor_interface when not passed in the inventory file
Sébastien Han [Thu, 2 Nov 2017 13:53:46 +0000 (14:53 +0100)]
config: fix monitor_interface when not passed in the inventory file

Setting monitor_interface in group_vars/all.yml leaves
hostvars[host]['monitor_interface'] undefined.

Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1507922
(cherry picked from commit ab7eb79212f90edbfe29faf40dac5d209c7a70a9)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoMerge pull request #2125 from ceph/2000-bkp v3.0.8
Sébastien Han [Tue, 31 Oct 2017 13:44:54 +0000 (14:44 +0100)]
Merge pull request #2125 from ceph/2000-bkp

[skip ci] backport of 2000

7 years agoci: new osd scenarios 2125/head
Sébastien Han [Thu, 5 Oct 2017 14:22:04 +0000 (16:22 +0200)]
ci: new osd scenarios

This commit adds new OSD scenarios. It aims to simplify the CI setup and
bring better coverage of the OSD scenarios.
We decided to differentiate between filestore and bluestore, thinking
ahead when filestore won't be supported anymore.
So we now have two classes of tests:

* Filestore
* Bluestore

In each of those classes we have container and non-container.
Then for each we test the following:

* collocated
* collocated dmcrypt
* non-collocated
* non-collocated dmcrypt
* auto discovery collocated
* auto discovery collocated dmcrypt

This gives us good coverage and also reduces the footprint on the CI.
We are now up to 4 scenarios, each containing 6 OSD VMs.
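
The arithmetic behind "4 scenarios, 6 OSD VMs each" can be sketched as a small matrix. The names come from the lists above; the assumption that each test case maps to exactly one OSD VM is an illustration, not something the commit states.

```python
from itertools import product

# Names taken from the commit message; the data layout is illustrative only.
OBJECT_STORES = ["filestore", "bluestore"]
DEPLOYMENTS = ["container", "non-container"]
OSD_CASES = [
    "collocated",
    "collocated dmcrypt",
    "non-collocated",
    "non-collocated dmcrypt",
    "auto discovery collocated",
    "auto discovery collocated dmcrypt",
]

# Each (store, deployment) pair is one CI scenario.
# Assumption: each OSD case is covered by one OSD VM in that scenario.
scenarios = list(product(OBJECT_STORES, DEPLOYMENTS))

for store, deployment in scenarios:
    print(f"{store}/{deployment}: {len(OSD_CASES)} OSD VMs")
```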

Signed-off-by: Sébastien Han <seb@redhat.com>
(cherry picked from commit a53aa9e8b41606e2ff996f036a7a86679126cd92)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoMerge pull request #2120 from ceph/2090-bkp
Sébastien Han [Fri, 27 Oct 2017 15:44:11 +0000 (17:44 +0200)]
Merge pull request #2120 from ceph/2090-bkp

[skip ci] 2090-bkp

7 years agoTest ansible 2.4.1 2120/head
Sébastien Han [Mon, 23 Oct 2017 10:03:01 +0000 (12:03 +0200)]
Test ansible 2.4.1

We now test with Ansible 2.4. We had to change testinfra's version since
only recent versions work with 2.4. See:
https://github.com/philpep/testinfra/issues/249

Closes: https://github.com/ceph/ceph-ansible/issues/2087
Signed-off-by: Sébastien Han <seb@redhat.com>
(cherry picked from commit c4ad2477188c2d226a4ea2e0fa6693967d5b103c)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoMerge pull request #2119 from ceph/2117-bkp v3.0.7
Sébastien Han [Fri, 27 Oct 2017 11:51:07 +0000 (13:51 +0200)]
Merge pull request #2119 from ceph/2117-bkp

[skip ci] 2117-bkp

7 years agodefault: remove dup variable 2119/head
Sébastien Han [Fri, 27 Oct 2017 09:46:15 +0000 (11:46 +0200)]
default: remove dup variable

ceph_repository_type was declared multiple times. This commit fixes
this.

Signed-off-by: Sébastien Han <seb@redhat.com>
(cherry picked from commit d2575c7f5e5520f6ee65c5007853b3248d2c7a10)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoMerge pull request #2116 from ceph/2110-bkp
Sébastien Han [Fri, 27 Oct 2017 07:56:55 +0000 (09:56 +0200)]
Merge pull request #2116 from ceph/2110-bkp

[skip ci] 2110-bkp

7 years agoMake acls and mode parameters of opentack_keys optional 2116/head
John Fulton [Wed, 25 Oct 2017 23:46:02 +0000 (23:46 +0000)]
Make acls and mode parameters of opentack_keys optional

Only chmod or setfacl the requested keyring(s) in the
opentack_keys data structure when the mode or acls keys
of that data structure exist.

The user may specify four permission combinations for the
keyring file(s): 1. only set ACL, 2. only set mode,
3. set neither mode nor ACL, 4. set mode and then ACL.
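
The four combinations can be sketched in Python as a decision on which keys are present. `permission_steps` and the dict shape are hypothetical stand-ins for one entry of the data structure; the real tasks are Ansible, not Python.

```python
def permission_steps(key):
    """Return the ordered permission operations for one keyring entry.

    `key` is a hypothetical dict standing in for one item of the data
    structure; only the optional "mode" and "acls" keys matter here.
    """
    steps = []
    if "mode" in key:
        steps.append(("chmod", key["mode"]))
    if "acls" in key:
        # ACLs are applied after the mode, matching combination 4.
        steps.extend(("setfacl", acl) for acl in key["acls"])
    return steps


# Combination 3: neither key is present, so nothing is changed.
print(permission_steps({"name": "client.example"}))  # []
```

Because each operation is guarded by the presence of its own key, leaving a key out skips that step instead of failing.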

Fixes: #2092
(cherry picked from commit ae156e9f34ed9083382bedf53d8f451360ce978a)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoMerge pull request #2113 from ceph/2112-bkp v3.0.6
Sébastien Han [Thu, 26 Oct 2017 12:23:54 +0000 (14:23 +0200)]
Merge pull request #2113 from ceph/2112-bkp

[skip ci] 2112-bkp

7 years agopurge: do not reboot by default 2113/head
Sébastien Han [Thu, 26 Oct 2017 12:18:38 +0000 (14:18 +0200)]
purge: do not reboot by default

Rebooting servers is really intrusive and perhaps this is not what the
operator wants. So we disable the reboot by default now. Note that the
reboot might not happen all the time.
It can be enabled by running the purge playbook with -e
reboot_osd_node=True

Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1505011
Signed-off-by: Sébastien Han <seb@redhat.com>
(cherry picked from commit 2837d0a22e258cee583f14e402a99d89c9a16cd6)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoMerge pull request #2108 from ceph/2103-bkp v3.0.5
Sébastien Han [Wed, 25 Oct 2017 15:36:39 +0000 (17:36 +0200)]
Merge pull request #2108 from ceph/2103-bkp

[skip ci] 2103-bkp

7 years agoOption to set TCMALLOC_MAX_TOTAL_THREAD_CACHE_BYTES 2108/head
Andy McCrae [Mon, 23 Oct 2017 13:57:24 +0000 (14:57 +0100)]
Option to set TCMALLOC_MAX_TOTAL_THREAD_CACHE_BYTES

Use "ceph_tcmalloc_max_total_thread_cache" to set the
TCMALLOC_MAX_TOTAL_THREAD_CACHE_BYTES value inside /etc/default/ceph for
Debian installs, or /etc/sysconfig/ceph for Red Hat/CentOS installs.

By default this is set to 0, so the default package value is used.
If specified, the value is changed to match the variable and the ceph
osd services are restarted.
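
A minimal sketch of the behaviour described above, assuming a hypothetical helper `tcmalloc_setting` and hypothetical `os_family` labels. The file paths and the meaning of 0 come from the commit message; everything else is illustrative.

```python
# File locations named in the commit message, keyed by a hypothetical
# OS family label.
ENV_FILES = {"Debian": "/etc/default/ceph", "RedHat": "/etc/sysconfig/ceph"}


def tcmalloc_setting(os_family, cache_bytes=0):
    """Return (path, line) to write, or None to keep the packaged default."""
    if cache_bytes == 0:
        # 0 means "do not touch the file": the package default stays in place.
        return None
    line = f"TCMALLOC_MAX_TOTAL_THREAD_CACHE_BYTES={cache_bytes}"
    return ENV_FILES[os_family], line


print(tcmalloc_setting("Debian"))  # None
```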

(cherry picked from commit 7f6c39102d4fc1c9ec987fe3dd06693a94ad56fb)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoMerge pull request #2107 from ceph/2104-bkp
Sébastien Han [Wed, 25 Oct 2017 15:35:43 +0000 (17:35 +0200)]
Merge pull request #2107 from ceph/2104-bkp

[skip ci] 2104-bkp

7 years agorgw/nfs: fix section duplication 2107/head
Sébastien Han [Wed, 25 Oct 2017 13:45:37 +0000 (15:45 +0200)]
rgw/nfs: fix section duplication

Once and for all, hopefully...

Signed-off-by: Sébastien Han <seb@redhat.com>
(cherry picked from commit 8670b45ef2cfcf35bac5d7f83b93099bfa1d9f9e)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoMerge pull request #2106 from ceph/2084-bkp
Sébastien Han [Wed, 25 Oct 2017 15:34:39 +0000 (17:34 +0200)]
Merge pull request #2106 from ceph/2084-bkp

[skip ci] 2084-bkp

7 years agoosd: bring backward compatibility with old Jewel images 2106/head
Sébastien Han [Fri, 20 Oct 2017 13:15:38 +0000 (15:15 +0200)]
osd: bring backward compatibility with old Jewel images

There was a huge resync from luminous to jewel in ceph-docker:
https://github.com/ceph/ceph-docker/pull/797

This change brought a new handy function to discover partitions tied to
an OSD. This function doesn't exist in the old image so the
ceph-osd-run.sh script breaks when trying to deploy Jewel OSD with that
old Jewel image version.

Signed-off-by: Sébastien Han <seb@redhat.com>
(cherry picked from commit 968ef04324e9064fcecfe88bc5464ad9c2673a13)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoMerge pull request #2083 from ceph/2.4-3.0-backward-bkp v3.0.4
Sébastien Han [Fri, 20 Oct 2017 12:01:45 +0000 (14:01 +0200)]
Merge pull request #2083 from ceph/2.4-3.0-backward-bkp

[skip ci] 2.4-3.0-backward-bkp

7 years agoall: backward compatibility between stable-2.2 and 3.0 2083/head
Sébastien Han [Wed, 18 Oct 2017 16:03:30 +0000 (18:03 +0200)]
all: backward compatibility between stable-2.2 and 3.0

stable-3.0 brought numerous changes in ceph-ansible variables. This PR
aims to maintain backward compatibility for someone running stable-2.2
who upgrades to stable-3.0 but keeps their group_vars untouched.
We will then determine the right options to make sure the upgrade works,
but we expect the new variables to be used.

We will drop this in the near future, maybe 3.1 or 3.2.

Signed-off-by: Sébastien Han <seb@redhat.com>
(cherry picked from commit 4413511b6619e22007b7988ab9929d618e0dcd01)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoupgrade: fix upgrade jewel to luminous for nfs nodes
Guillaume Abrioux [Thu, 19 Oct 2017 16:30:08 +0000 (18:30 +0200)]
upgrade: fix upgrade jewel to luminous for nfs nodes

nfs nodes can't be upgraded from jewel to luminous because the ceph-nfs
role is skipped by the condition `when:
"ceph_release_num[ceph_release] >= ceph_release_num.luminous"`. The
package is upgraded inside the `ceph-nfs` role, so `ceph_release` is
still set to the old version when the condition is evaluated, and the
`when` can never be satisfied.
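
The deadlock can be illustrated in Python. The mapping below uses the major version numbers of the named Ceph releases; `role_runs` is a hypothetical helper that only mirrors the quoted condition.

```python
# Major version numbers of the Ceph releases involved.
ceph_release_num = {"jewel": 10, "kraken": 11, "luminous": 12}


def role_runs(ceph_release):
    """Evaluate the quoted `when:` condition for a given release."""
    return ceph_release_num[ceph_release] >= ceph_release_num["luminous"]


# Before the upgrade the node still reports jewel, so the role is skipped
# and the package that would move it to luminous is never installed.
print(role_runs("jewel"))     # False
print(role_runs("luminous"))  # True
```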

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
(cherry picked from commit 982326373b9474231015639eac8fc52a3b0878a3)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoupgrade: fix upgrade jewel to luminous for mgr nodes
Guillaume Abrioux [Thu, 19 Oct 2017 13:07:58 +0000 (15:07 +0200)]
upgrade: fix upgrade jewel to luminous for mgr nodes

mgr nodes can't be upgraded from jewel to luminous because the ceph-mgr
role is skipped by the condition `when:
"ceph_release_num[ceph_release] >= ceph_release_num.luminous"`. The
ceph-mgr package is upgraded inside the `ceph-mgr` role, so
`ceph_release` is still set to the old version when the condition is
evaluated, and the `when` can never be satisfied.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
(cherry picked from commit 302e563601cd6820b1ae44fabdfb1506688c7c9b)
(cherry picked from commit 70034451e98e630605bcc618dfebd8629a97af9a)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoMerge pull request #2074 from ceph/fix_rbd_handler-bkp v3.0.3
Sébastien Han [Wed, 18 Oct 2017 09:12:58 +0000 (11:12 +0200)]
Merge pull request #2074 from ceph/fix_rbd_handler-bkp

[skip ci] fix_rbd_handler-bkp

7 years agorbd: fix restart script for jewel 2074/head
Guillaume Abrioux [Tue, 17 Oct 2017 16:28:06 +0000 (18:28 +0200)]
rbd: fix restart script for jewel

In Jewel, we don't use the bootstrap-rbd keyring for rbd-mirror nodes, so
the socket path/name differs depending on which ceph release you are
deploying.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
(cherry picked from commit c2850b11be8a69780eaceeb5bd5f3616979dd29a)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoMerge pull request #2072 from ceph/fix-collocation2-bkp
Sébastien Han [Wed, 18 Oct 2017 07:17:02 +0000 (09:17 +0200)]
Merge pull request #2072 from ceph/fix-collocation2-bkp

[skip ci] fix-collocation2-bkp

7 years agodefaults: fix handlers for collocation 2072/head
Sébastien Han [Tue, 17 Oct 2017 13:54:17 +0000 (15:54 +0200)]
defaults: fix handlers for collocation

When doing collocation the condition "inventory_hostname in play_hosts"
is breaking the restart workflow.

Signed-off-by: Sébastien Han <seb@redhat.com>
(cherry picked from commit 90b75185d5fc473b377fafced95d7b35a80896aa)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoMerge pull request #2070 from ceph/sort_dict-bkp
Sébastien Han [Tue, 17 Oct 2017 15:10:57 +0000 (17:10 +0200)]
Merge pull request #2070 from ceph/sort_dict-bkp

[skip ci] sort_dict-bkp

7 years agoceph-defaults: fix handlers that are always triggered 2070/head
Guillaume Abrioux [Fri, 13 Oct 2017 13:16:18 +0000 (15:16 +0200)]
ceph-defaults: fix handlers that are always triggered

Handlers are always triggered in ceph-ansible because the ceph.conf file
is generated with a random order for the key/value pairs in each
section.

In Python, a dict is not sorted. In our case this means that each time we
generate the ceph.conf file it is rendered in a random order, since the
mechanism behind it renders a file from a Python dict of keys/values.
Therefore, as a quick workaround, forcing this dict to be sorted before
rendering the configuration file ensures that it is always rendered the
same way.
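
A minimal sketch of the workaround, assuming a hypothetical `render_section` helper rather than the actual rendering code: sorting the keys before writing makes the output independent of the dict's own ordering, so the file content only changes when a value changes.

```python
def render_section(name, options):
    """Render one ini-style section with its keys in sorted order."""
    lines = [f"[{name}]"]
    lines += [f"{key} = {options[key]}" for key in sorted(options)]
    return "\n".join(lines) + "\n"


# The same options, inserted in two different orders.
first = {"public network": "192.168.15.0/24", "osd_pool_default_size": 1}
second = {"osd_pool_default_size": 1, "public network": "192.168.15.0/24"}

# Both render to identical text, so a change detector sees no difference.
print(render_section("global", first) == render_section("global", second))  # True
```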

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
(cherry picked from commit ec042219e64a321fa67fce0384af76eeb238c645)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoMerge pull request #2069 from ceph/rhcs-rm-bkp
Sébastien Han [Tue, 17 Oct 2017 15:08:39 +0000 (17:08 +0200)]
Merge pull request #2069 from ceph/rhcs-rm-bkp

[skip ci] rhcs-rm-bkp

7 years agorpm: remove ability to install ceph community version 2069/head
Sébastien Han [Tue, 17 Oct 2017 09:49:41 +0000 (11:49 +0200)]
rpm: remove ability to install ceph community version

Downstream version of ceph-ansible could still trigger install from
upstream repo and import keys.

Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1503019
Signed-off-by: Sébastien Han <seb@redhat.com>
(cherry picked from commit c72ddee2d9e93e72722004b109733a68ffd6b8d1)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoMerge pull request #2066 from ceph/update-mirror-nfs-bkp
Sébastien Han [Tue, 17 Oct 2017 13:00:02 +0000 (15:00 +0200)]
Merge pull request #2066 from ceph/update-mirror-nfs-bkp

upgrade: support for rbd mirror and nfs

7 years agoupgrade: support for rbd mirror and nfs 2066/head
Sébastien Han [Mon, 16 Oct 2017 12:15:43 +0000 (14:15 +0200)]
upgrade: support for rbd mirror and nfs

- Add upgrade support for rbd mirror and nfs daemons.
- Only works with systemd (remove sysvinit and upstart occurrences)
- A bit of cleanup

Signed-off-by: Sébastien Han <seb@redhat.com>
(cherry picked from commit d920d4839d029cc2eed4cb0556782a20f867ddcc)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoMerge pull request #2032 from ceph/2029-bkp v3.0.2
Sébastien Han [Wed, 11 Oct 2017 20:24:07 +0000 (22:24 +0200)]
Merge pull request #2032 from ceph/2029-bkp

[skip ci] 2029-bkp

7 years agoconfig: proper render ceph.conf when doing collocation 2032/head
Sébastien Han [Wed, 11 Oct 2017 16:29:34 +0000 (18:29 +0200)]
config: proper render ceph.conf when doing collocation

Signed-off-by: Sébastien Han <seb@redhat.com>
(cherry picked from commit aa70b07ae20407b20ec3b71320d2148788d2742e)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoMerge pull request #2024 from ceph/2022-bkp v3.0.1
Sébastien Han [Wed, 11 Oct 2017 12:24:38 +0000 (14:24 +0200)]
Merge pull request #2024 from ceph/2022-bkp

2022 bkp

7 years agoosd: rollback bindmount of /run/udev 2024/head
Sébastien Han [Wed, 11 Oct 2017 11:21:37 +0000 (13:21 +0200)]
osd: rollback bindmount of /run/udev

This is causing unknown issues when trying to start a dmcrypt container.
Basically the container is stuck at mount opening the LUKS device. It is
still unknown why this is causing trouble but we need to move
forward. Also, this doesn't seem to help in any way to fix the race
condition we've seen.

Here is the log for dmcrypt:

cryptsetup 1.7.4 processing "cryptsetup --debug --verbose --key-file
key luksClose fbf8887d-8694-46ca-b9ff-be79a668e2a9"
Running command close.
Locking memory.
Installing SIGINT/SIGTERM handler.
Unblocking interruption on signal.
Allocating crypt device context by device
fbf8887d-8694-46ca-b9ff-be79a668e2a9.
Initialising device-mapper backend library.
dm version   [ opencount flush ]   [16384] (*1)
dm versions   [ opencount flush ]   [16384] (*1)
Detected dm-crypt version 1.14.1, dm-ioctl version 4.35.0.
Device-mapper backend running with UDEV support enabled.
dm status fbf8887d-8694-46ca-b9ff-be79a668e2a9  [ opencount flush ]
[16384] (*1)
Releasing device-mapper backend.
Trying to open and read device /dev/sdc1 with direct-io.
Allocating crypt device /dev/sdc1 context.
Trying to open and read device /dev/sdc1 with direct-io.
Initialising device-mapper backend library.
dm table fbf8887d-8694-46ca-b9ff-be79a668e2a9  [ opencount flush
securedata ]   [16384] (*1)
Trying to open and read device /dev/sdc1 with direct-io.
Crypto backend (gcrypt 1.5.3) initialized in cryptsetup library
version 1.7.4.
Detected kernel Linux 3.10.0-693.el7.x86_64 x86_64.
Reading LUKS header of size 1024 from device /dev/sdc1
Key length 32, device size 1943016847 sectors, header size 2050
sectors.
Deactivating volume fbf8887d-8694-46ca-b9ff-be79a668e2a9.
dm status fbf8887d-8694-46ca-b9ff-be79a668e2a9  [ opencount flush ]
[16384] (*1)
Udev cookie 0xd4d14e4 (semid 32769) created
Udev cookie 0xd4d14e4 (semid 32769) incremented to 1
Udev cookie 0xd4d14e4 (semid 32769) incremented to 2
Udev cookie 0xd4d14e4 (semid 32769) assigned to REMOVE task(2) with
flags         (0x0)
dm remove fbf8887d-8694-46ca-b9ff-be79a668e2a9  [ opencount flush
retryremove ]   [16384] (*1)
fbf8887d-8694-46ca-b9ff-be79a668e2a9: Stacking NODE_DEL [verify_udev]
Udev cookie 0xd4d14e4 (semid 32769) decremented to 1
Udev cookie 0xd4d14e4 (semid 32769) waiting for zero

Signed-off-by: Sébastien Han <seb@redhat.com>
(cherry picked from commit d0a9e57bfcf68e41e25a1b3868ded447d09f8199)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agopurge-iscsi: fix group name
Sébastien Han [Wed, 11 Oct 2017 10:52:12 +0000 (12:52 +0200)]
purge-iscsi: fix group name

Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1500281
Signed-off-by: Sébastien Han <seb@redhat.com>
(cherry picked from commit 85e13a864c1317849d7bf34441fa1f7b33939556)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoMerge pull request #2023 from ceph/bpk-2021-bkp
Sébastien Han [Wed, 11 Oct 2017 12:20:02 +0000 (14:20 +0200)]
Merge pull request #2023 from ceph/bpk-2021-bkp

[skip ci] bpk-2021 - Remove duplicate 'package' key

7 years agoRemove duplicate 'package' key 2023/head
Major Hayden [Tue, 10 Oct 2017 20:39:16 +0000 (15:39 -0500)]
Remove duplicate 'package' key

This patch fixes a typo where "package:" was used twice in the same
task.

(cherry picked from commit 10e1d464e5664e4f440378289b96c70f1aa1e1b3)
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoMerge pull request #2015 from ceph/fix_nfs-ganesha-repos v3.0.0
Sébastien Han [Tue, 10 Oct 2017 15:15:33 +0000 (17:15 +0200)]
Merge pull request #2015 from ceph/fix_nfs-ganesha-repos

nfs: move repository configuration in ceph-nfs role

7 years agonfs: add missing condition for debian_rhcs 2015/head
Guillaume Abrioux [Tue, 10 Oct 2017 14:26:01 +0000 (16:26 +0200)]
nfs: add missing condition for debian_rhcs

In addition to c4dcdaa20, this commit adds the missing condition on
install tasks for debian_rhcs deployment. Without it, these tasks are
played on any kind of deployment.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
7 years agoMerge pull request #2020 from jprovaznik/dexports
Sébastien Han [Tue, 10 Oct 2017 13:17:28 +0000 (15:17 +0200)]
Merge pull request #2020 from jprovaznik/dexports

Ceph-nfs dynamic exports fixes

7 years agoCeph-nfs dynamic exports fixes 2020/head
Jan Provaznik [Tue, 10 Oct 2017 10:43:23 +0000 (12:43 +0200)]
Ceph-nfs dynamic exports fixes

* DBus on host should include ganesha service file
* to allow ganesha container to respond on DBus it needs to run
  in --privileged mode (ganesha folks contacted to look at this)
* ceph_nfs_include_exports_dir variable replaced with more general
  ceph_nfs_dynamic_exports

7 years agoiscsi-gw: move repository configuration to ceph-iscsi-gw
Guillaume Abrioux [Tue, 10 Oct 2017 08:39:02 +0000 (10:39 +0200)]
iscsi-gw: move repository configuration to ceph-iscsi-gw

This does not belong in `ceph-common`; it is too specific to the
`ceph-iscsi-gw` role.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
7 years agoDocs: fix error `Unknown target name`
Guillaume Abrioux [Mon, 9 Oct 2017 19:38:31 +0000 (21:38 +0200)]
Docs: fix error `Unknown target name`

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
7 years agonfs: move repository configuration in ceph-nfs role
Guillaume Abrioux [Mon, 9 Oct 2017 19:34:17 +0000 (21:34 +0200)]
nfs: move repository configuration in ceph-nfs role

This does not belong in `ceph-common`; it is too specific to the
`ceph-nfs` role.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
7 years agoMerge pull request #2018 from ceph/purge-journal
Sébastien Han [Tue, 10 Oct 2017 08:01:18 +0000 (10:01 +0200)]
Merge pull request #2018 from ceph/purge-journal

[skip ci] purge: fix journal purge

7 years agopurge: fix journal purge 2018/head
Sébastien Han [Tue, 10 Oct 2017 07:57:39 +0000 (09:57 +0200)]
purge: fix journal purge

Using a condition when osd_scenario == 'non-collocated' was wrong since
these partitions can be collocated on a single device also. Removing the
check ensures these partitions are purged.

Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1499871
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoMerge pull request #2014 from ceph/fixes-2 v3.0.0rc19
Guillaume Abrioux [Mon, 9 Oct 2017 18:14:06 +0000 (20:14 +0200)]
Merge pull request #2014 from ceph/fixes-2

infra: use the pg check in the right place

7 years agonfs: move packages installation to own role 2014/head
Guillaume Abrioux [Mon, 9 Oct 2017 15:10:39 +0000 (17:10 +0200)]
nfs: move packages installation to own role

Make the `ceph-nfs` role handle the installation of nfs packages
itself.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
7 years agomds: move installation packages in role itself
Guillaume Abrioux [Mon, 9 Oct 2017 13:37:51 +0000 (15:37 +0200)]
mds: move installation packages in role itself

Make the `ceph-mds` role handle the installation of the `ceph-mds`
package itself.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
7 years agoiscsi: move package install to ceph-iscsi-role
Sébastien Han [Mon, 9 Oct 2017 13:30:54 +0000 (15:30 +0200)]
iscsi: move package install to ceph-iscsi-role

Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agomgr: move installation packages in role itself
Guillaume Abrioux [Mon, 9 Oct 2017 13:26:15 +0000 (15:26 +0200)]
mgr: move installation packages in role itself

Make the `ceph-mgr` role handle the installation of the `ceph-mgr`
package itself, because managing it elsewhere is complicated depending
on whether we are installing `jewel` or `luminous`.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
7 years agotests: adapt current testing for collocation scenario
Guillaume Abrioux [Mon, 9 Oct 2017 11:48:03 +0000 (13:48 +0200)]
tests: adapt current testing for collocation scenario

Since we introduced the collocation testing scenario, we need to adapt
the current tests to this new scenario.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
7 years agoci: disable vnc console
Sébastien Han [Mon, 9 Oct 2017 09:56:46 +0000 (11:56 +0200)]
ci: disable vnc console

Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoosd: bindmount /run/udev
Sébastien Han [Mon, 9 Oct 2017 08:41:14 +0000 (10:41 +0200)]
osd: bindmount /run/udev

Ensures that "udevadm" is able to check the status of udev's event queue.

Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoci: re-add osd_pool_default_size to 1 with the override
Sébastien Han [Sun, 8 Oct 2017 15:29:32 +0000 (17:29 +0200)]
ci: re-add osd_pool_default_size to 1 with the override

If we don't do this the client will create pools with a replica count of
3 since osd_pool_default_size was gone from ceph-override.json. This was
making switch_to_containers fail.

Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agotox: decrease sleep to 2min instead of 5.
Sébastien Han [Sun, 8 Oct 2017 13:56:39 +0000 (15:56 +0200)]
tox: decrease sleep to 2min instead of 5.

Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoinfra: add independant purge-iscsi-gateways.yml
Sébastien Han [Sun, 8 Oct 2017 13:54:36 +0000 (15:54 +0200)]
infra: add independant purge-iscsi-gateways.yml

The current inclusion of purge-iscsi-gateways.yml in purge-cluster.yml
is not working well and is blocking the CI too. So remove it from
purge-cluster.yml and re-add the original purge-iscsi-gateways.yml.

Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoconfig: do not duplicate sections when doing collocation
Sébastien Han [Sun, 8 Oct 2017 13:16:40 +0000 (15:16 +0200)]
config: do not duplicate sections when doing collocation

Prior to this commit, when collocating a RGW and NFS on the same box the
ceph.conf layout was the following:

[client.rgw.rgw0]
host = mds0
host = rgw0
rgw frontends = civetweb port=192.168.15.50:8080
num_threads=100[client.rgw.mds0]
rgw frontends = civetweb port=192.168.15.70:8080 num_threads=100
rgw frontends = civetweb port=192.168.15.50:8080 num_threads=100
keyring = /var/lib/ceph/radosgw/test-rgw.mds0/keyring
keyring = /var/lib/ceph/radosgw/test-rgw.rgw0/keyring
rgw data = /var/lib/ceph/radosgw/test-rgw.rgw0
log file = /var/log/ceph/test-rgw-mds0.log
log file = /var/log/ceph/test-rgw-rgw0.log

[mds.mds0]
host = mds0

[global]
rgw override bucket index max shards = 16
fsid = 70e1d368-57b3-4978-b746-cbffce6e56b5
rgw bucket default quota max objects = 1638400
osd_pool_default_size = 1
public network = 192.168.15.0/24
mon host = 192.168.15.10,192.168.15.11,192.168.15.12
osd_pool_default_pg_num = 8
cluster network = 192.168.16.0/24

[mds.rgw0]
host = rgw0

[client.rgw.mds0]
host = mds0
rgw data = /var/lib/ceph/radosgw/test-rgw.mds0
keyring = /var/lib/ceph/radosgw/test-rgw.mds0/keyring
rgw frontends = civetweb port=192.168.15.70:8080 num_threads=100
log file = /var/log/ceph/test-rgw-mds0.log

Basically appending all the sections. This commit solves that.
Now the sections appear like this:

-bash-4.2# cat /etc/ceph/test.conf
[client.rgw.rgw0]
log file = /var/log/ceph/test-rgw-rgw0.log
host = rgw0
keyring = /var/lib/ceph/radosgw/test-rgw.rgw0/keyring
rgw frontends = civetweb port=192.168.15.50:8080 num_threads=100

[client.rgw.mds0]
log file = /var/log/ceph/test-rgw-mds0.log
host = mds0
keyring = /var/lib/ceph/radosgw/test-rgw.mds0/keyring
rgw frontends = civetweb port=192.168.15.70:8080 num_threads=100

[global]
cluster network = 192.168.16.0/24
mon host = 192.168.15.10,192.168.15.11,192.168.15.12
osd_pool_default_size = 1
public network = 192.168.15.0/24
rgw bucket default quota max objects = 1638400
osd_pool_default_pg_num = 8
rgw override bucket index max shards = 16
fsid = 77a21980-3033-4174-9264-1abc7185bcb3

[mds.rgw0]
host = rgw0

[mds.mds0]
host = mds0
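
One way to detect the duplicated layout shown earlier, sketched with Python's standard `configparser` in strict mode. `find_duplicate` is a hypothetical helper and is not part of ceph-ansible; the sample text is a shortened excerpt of the two layouts.

```python
import configparser


def find_duplicate(conf_text):
    """Return the name of the duplicate error raised while parsing, or None."""
    # strict=True rejects a repeated section or a repeated key in a section.
    parser = configparser.ConfigParser(strict=True, interpolation=None)
    try:
        parser.read_string(conf_text)
    except (configparser.DuplicateSectionError,
            configparser.DuplicateOptionError) as exc:
        return type(exc).__name__
    return None


# Shortened excerpt of the layout before the fix: "host" appears twice.
BROKEN = "[client.rgw.rgw0]\nhost = mds0\nhost = rgw0\n"

# Shortened excerpt of the layout after the fix: one section per daemon.
FIXED = "[client.rgw.rgw0]\nhost = rgw0\n\n[client.rgw.mds0]\nhost = mds0\n"

print(find_duplicate(BROKEN))  # DuplicateOptionError
print(find_duplicate(FIXED))   # None
```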

Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoinfra: use the pg check in the right place
Sébastien Han [Sun, 8 Oct 2017 12:45:48 +0000 (14:45 +0200)]
infra: use the pg check in the right place

Use the pg check before doing the pg check, not on the quorum check.
Also never quote an int when doing a comparison.
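
Why quoting matters can be shown in plain Python, on the assumption that the comparison ends up operating on Python values, as Jinja2 expressions generally do. The numbers are arbitrary.

```python
# String comparison is lexicographic, so a quoted number can compare wrongly.
quoted = "10" < "9"   # compares "1" with "9" character by character
unquoted = 10 < 9     # numeric comparison

print(quoted, unquoted)  # True False
```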

Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoMerge pull request #2013 from ceph/wip-purge-cluster
Sébastien Han [Mon, 9 Oct 2017 15:18:30 +0000 (17:18 +0200)]
Merge pull request #2013 from ceph/wip-purge-cluster

A couple of purge cluster fixes

7 years agoMerge pull request #2011 from ceph/switch-pg v3.0.0rc18
Sébastien Han [Sat, 7 Oct 2017 01:42:44 +0000 (03:42 +0200)]
Merge pull request #2011 from ceph/switch-pg

[skip ci] switch: check pgs only when num_pgs > 0

7 years agoswitch: check pgs only when num_pgs > 0 2011/head
Sébastien Han [Sat, 7 Oct 2017 01:42:09 +0000 (03:42 +0200)]
switch: check pgs only when num_pgs > 0

Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoMerge pull request #2009 from ceph/fix-clean-pg
Sébastien Han [Sat, 7 Oct 2017 01:39:26 +0000 (03:39 +0200)]
Merge pull request #2009 from ceph/fix-clean-pg

[skip ci] handler: do not test if pgs_num = 0

7 years agohandler: add serial restart back 2009/head
Sébastien Han [Fri, 6 Oct 2017 23:23:30 +0000 (01:23 +0200)]
handler: add serial restart back

We now restart daemons on each machine in a serialized fashion.

Closes: https://github.com/ceph/ceph-ansible/issues/1989
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoci: use by-id instead of by-path
Sébastien Han [Fri, 6 Oct 2017 22:58:54 +0000 (00:58 +0200)]
ci: use by-id instead of by-path

by-id relies on the disk WWID which is more reliable than by-path
(pointing to the PCI info)

Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agorolling_update: perform pg check when pgs_num > 0
Sébastien Han [Fri, 6 Oct 2017 22:46:46 +0000 (00:46 +0200)]
rolling_update: perform pg check when pgs_num > 0

If num_pgs = 0 the check will never return 0.
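
A sketch of the guard, assuming a dict shaped like the `pgmap` part of the cluster status JSON (`num_pgs`, plus `pgs_by_state` entries with `state_name` and `count`). The log does not show the playbook's actual check, so `pgs_are_clean` is a hypothetical stand-in.

```python
def pgs_are_clean(pgmap):
    """Return True when all PGs are active+clean, or when there are no PGs."""
    num_pgs = pgmap.get("num_pgs", 0)
    if num_pgs == 0:
        # Nothing to wait for: skip the check entirely.
        return True
    clean = sum(
        state["count"]
        for state in pgmap.get("pgs_by_state", [])
        if state["state_name"] == "active+clean"
    )
    return clean == num_pgs


print(pgs_are_clean({"num_pgs": 0, "pgs_by_state": []}))  # True
```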

Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agocommon: fix debian rhcs installation
Sébastien Han [Fri, 6 Oct 2017 20:17:50 +0000 (22:17 +0200)]
common: fix debian rhcs installation

Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoswitch: rework and fix clean pg wait
Sébastien Han [Fri, 6 Oct 2017 17:52:05 +0000 (19:52 +0200)]
switch: rework and fix clean pg wait

Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agopurge-docker: rework device detection
Sébastien Han [Fri, 6 Oct 2017 17:50:15 +0000 (19:50 +0200)]
purge-docker: rework device detection

We don't need "devices" and other device variables anymore; the playbook
detects them for us.

Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoshrink-osd: admin key not needed for container shrink
Sébastien Han [Fri, 6 Oct 2017 17:15:26 +0000 (19:15 +0200)]
shrink-osd: admin key not needed for container shrink

Also do some cleanup

Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agopurge-cluster: Do not use shell for rm 2013/head
Boris Ranto [Fri, 6 Oct 2017 20:54:34 +0000 (22:54 +0200)]
purge-cluster: Do not use shell for rm

The shell wildcard expansion of non-existing paths fails on zsh making
the whole script fail. We can use file module with with_fileglob to
alleviate the problem instead.
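
The same idea in Python: expand the pattern first, then remove whatever matched, so a pattern with no match is a harmless no-op rather than a shell error. `remove_matching` and the file names are hypothetical; the playbook itself uses the file module with with_fileglob.

```python
import glob
import os
import tempfile


def remove_matching(pattern):
    """Remove every file matching `pattern`; no match means nothing to do."""
    removed = []
    for path in glob.glob(pattern):
        os.remove(path)
        removed.append(path)
    return removed


with tempfile.TemporaryDirectory() as tmp:
    open(os.path.join(tmp, "a.keyring"), "w").close()
    print(len(remove_matching(os.path.join(tmp, "*.keyring"))))  # 1
    print(remove_matching(os.path.join(tmp, "*.missing")))       # []
```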

Signed-off-by: Boris Ranto <branto@redhat.com>
7 years agopurge-cluster: Do not fail on systemd commands
Boris Ranto [Fri, 6 Oct 2017 20:52:53 +0000 (22:52 +0200)]
purge-cluster: Do not fail on systemd commands

systemd can't stop services if the unit files were removed before
the cluster was purged. We should just ignore these.

Signed-off-by: Boris Ranto <branto@redhat.com>
7 years agoMerge pull request #1983 from jprovaznik/suffix
Sébastien Han [Fri, 6 Oct 2017 20:40:57 +0000 (22:40 +0200)]
Merge pull request #1983 from jprovaznik/suffix

Allow to override systemd service instance id

7 years agoMerge pull request #1984 from jprovaznik/exportdir
Sébastien Han [Fri, 6 Oct 2017 20:38:13 +0000 (22:38 +0200)]
Merge pull request #1984 from jprovaznik/exportdir

Include exports dir in ceph-nfs config file

7 years agoMerge pull request #2010 from ceph/wip-nfs-set-rgw-key-permissions
Sébastien Han [Fri, 6 Oct 2017 19:47:49 +0000 (21:47 +0200)]
Merge pull request #2010 from ceph/wip-nfs-set-rgw-key-permissions

nfs: missing conditional for setting rgw key permissions

7 years agonfs: missing conditional for setting rgw key permissions 2010/head
Ali Maredia [Fri, 6 Oct 2017 19:21:35 +0000 (15:21 -0400)]
nfs: missing conditional for setting rgw key permissions

Signed-off-by: Ali Maredia <amaredia@redhat.com>
7 years agoMerge pull request #2005 from ceph/wip-nfs-export-id
Sébastien Han [Fri, 6 Oct 2017 15:05:21 +0000 (17:05 +0200)]
Merge pull request #2005 from ceph/wip-nfs-export-id

nfs: config var changes

7 years agouse get to check stdout_lines
Sébastien Han [Fri, 6 Oct 2017 14:49:46 +0000 (16:49 +0200)]
use get to check stdout_lines

During the initial play, the docker command does not exist, so the
command has no stdout_lines. Using get allows us to fix this by
declaring an array if the command fails.
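
The pattern is the usual dictionary lookup with a default. The two result dicts below are hypothetical stand-ins for a registered task result; only the `stdout_lines` key is taken from the commit message.

```python
# Hypothetical registered results: one from a failed command, one from a success.
failed = {"rc": 2, "msg": "No such file or directory"}
succeeded = {"rc": 0, "stdout_lines": ["ceph-osd-sda"]}


def output_lines(result):
    """Return the command's output lines, or an empty list if there are none."""
    return result.get("stdout_lines", [])


print(output_lines(failed))     # []
print(output_lines(succeeded))  # ['ceph-osd-sda']
```

With the default in place, later code can loop over the result without first checking whether the command ran.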

Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agohandler: do not test if pgs_num = 0
Sébastien Han [Fri, 6 Oct 2017 12:59:23 +0000 (14:59 +0200)]
handler: do not test if pgs_num = 0

We don't need to wait if there are no PGs.

Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoMerge pull request #2006 from ceph/fix_jewel_nightly
Guillaume Abrioux [Fri, 6 Oct 2017 14:06:23 +0000 (16:06 +0200)]
Merge pull request #2006 from ceph/fix_jewel_nightly

Fix jewel nightly

7 years agoosd: fix `set_fact build dedicated_devices` 2006/head
Guillaume Abrioux [Fri, 6 Oct 2017 12:35:42 +0000 (14:35 +0200)]
osd: fix `set_fact build dedicated_devices`

Use an intermediate variable to build the final `dedicated_devices` list
to avoid duplicate entries in that array. (We need a 1:1 relation between
`dedicated_devices` and `devices` since we are using a `with_together`
later.)

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
7 years agotests: skip tests for nfs nodes when release is jewel
Guillaume Abrioux [Fri, 6 Oct 2017 10:49:39 +0000 (12:49 +0200)]
tests: skip tests for nfs nodes when release is jewel

nfs nodes are not deployed on jewel so we should skip the tests on them.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
7 years agorbd: fix bug when trying to fetch key
Guillaume Abrioux [Thu, 5 Oct 2017 20:25:34 +0000 (22:25 +0200)]
rbd: fix bug when trying to fetch key

With jewel, `bootstrap_rbd_keyring` is not set because of this condition:

```
when:
  - ceph_release_num.{{ ceph_release }} >= ceph_release_num.luminous
```

Therefore, the task `try to fetch ceph config and keys` will fail.
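
A rough Python sketch of the release gate involved, assuming the fetch list can be guarded by the same comparison. The release numbers are the Ceph major versions; the key names and the `keys_to_fetch` helper are hypothetical and do not reproduce the actual change.

```python
# Ceph major version per release name.
CEPH_RELEASE_NUM = {"jewel": 10, "kraken": 11, "luminous": 12}


def keys_to_fetch(ceph_release):
    """Build the list of keys to fetch for a given release (illustrative)."""
    keys = ["client.admin", "bootstrap-osd"]  # hypothetical base list
    # Only request the rbd bootstrap key on releases where it is created.
    if CEPH_RELEASE_NUM[ceph_release] >= CEPH_RELEASE_NUM["luminous"]:
        keys.append("bootstrap-rbd")
    return keys


print(keys_to_fetch("jewel"))
print(keys_to_fetch("luminous"))
```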

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
7 years agoAllow to override systemd service instance id 1983/head
Jan Provaznik [Wed, 4 Oct 2017 11:09:43 +0000 (13:09 +0200)]
Allow to override systemd service instance id

It's useful to have a constant service instance id when ceph-nfs
is managed by pacemaker.

7 years agonfs: config var changes 2005/head
Ali Maredia [Thu, 5 Oct 2017 20:51:23 +0000 (16:51 -0400)]
nfs: config var changes

- remove unused ganesha config vars,

- set different default Export_ids for each FSAL

Signed-off-by: Ali Maredia <amaredia@redhat.com>
7 years agoMerge pull request #2003 from ceph/debian-iso v3.0.0rc17
Sébastien Han [Thu, 5 Oct 2017 16:57:47 +0000 (18:57 +0200)]
Merge pull request #2003 from ceph/debian-iso

[skip ci] common: iso install on Debian is supported by rhcs

7 years agocommon: iso install on Debian is supported by rhcs 2003/head
Sébastien Han [Thu, 5 Oct 2017 16:33:17 +0000 (18:33 +0200)]
common: iso install on Debian is supported by rhcs

Also adds support for RHCS installation on Debian.

Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoMerge pull request #2004 from ceph/debian-url
Sébastien Han [Thu, 5 Oct 2017 16:46:13 +0000 (18:46 +0200)]
Merge pull request #2004 from ceph/debian-url

common: fix ga version for debian rhcs

7 years agocommon: fix ga version for debian rhcs 2004/head
Sébastien Han [Thu, 5 Oct 2017 16:45:30 +0000 (18:45 +0200)]
common: fix ga version for debian rhcs

Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoMerge pull request #1985 from ceph/debian-rhcs
Sébastien Han [Thu, 5 Oct 2017 16:42:46 +0000 (18:42 +0200)]
Merge pull request #1985 from ceph/debian-rhcs

[skip ci] common: fix rhcs installation on debian

7 years agoMerge pull request #2001 from ceph/iscsi
Sébastien Han [Thu, 5 Oct 2017 16:29:06 +0000 (18:29 +0200)]
Merge pull request #2001 from ceph/iscsi

iscsi: fix wrong group name for iscsi

7 years agoMerge pull request #2002 from ceph/non-col
Sébastien Han [Thu, 5 Oct 2017 16:25:16 +0000 (18:25 +0200)]
Merge pull request #2002 from ceph/non-col

[skip ci] osd: do not do unique on dedicated_devices

7 years agoosd: do not do unique on dedicated_devices 2002/head
Sébastien Han [Thu, 5 Oct 2017 16:20:18 +0000 (18:20 +0200)]
osd: do not do unique on dedicated_devices

The duplicates are needed later: if we apply unique, only the first OSD
will get a journal.
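
A small Python sketch of why deduplicating breaks the pairing, assuming the two lists are later walked in parallel and a shorter list is padded with `None` (modelled here with `zip_longest`). The device paths and helper names are made up for illustration.

```python
from itertools import zip_longest

# Hypothetical layout: three OSD data disks sharing one journal device.
devices = ["/dev/sdb", "/dev/sdc", "/dev/sdd"]
dedicated_devices = ["/dev/sdf", "/dev/sdf", "/dev/sdf"]


def unique(items):
    """Drop duplicates while keeping the original order."""
    return list(dict.fromkeys(items))


def pair(data_disks, journals):
    """Pair each data disk with a journal, padding missing journals with None."""
    return list(zip_longest(data_disks, journals, fillvalue=None))


# With the 1:1 list, every OSD gets its journal device.
print(pair(devices, dedicated_devices))

# After unique, the journal list has one entry, so only the first OSD is paired.
print(pair(devices, unique(dedicated_devices)))
```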

Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agocommon: fix rhcs installation on debian 1985/head
Sébastien Han [Wed, 4 Oct 2017 11:36:01 +0000 (13:36 +0200)]
common: fix rhcs installation on debian

* Change version from 2 to 3.
* Use ceph_rhcs_cdn_debian_repo_version to use other repositories along
  with ceph_rhcs_cdn_debian_repo

Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoiscsi: fix wrong group name for iscsi 2001/head
Sébastien Han [Thu, 5 Oct 2017 14:40:07 +0000 (16:40 +0200)]
iscsi: fix wrong group name for iscsi

Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1498490
Signed-off-by: Sébastien Han <seb@redhat.com>
7 years agoMerge pull request #1957 from ceph/multi_env_test
Sébastien Han [Thu, 5 Oct 2017 14:00:47 +0000 (16:00 +0200)]
Merge pull request #1957 from ceph/multi_env_test

tests: make all subnet uniq per scenario

7 years agoMerge pull request #1997 from rrmichel/osd_fragment
Sébastien Han [Thu, 5 Oct 2017 13:58:49 +0000 (15:58 +0200)]
Merge pull request #1997 from rrmichel/osd_fragment

Fixing path to osd_fragment.yml