]> git.apps.os.sepia.ceph.com Git - ceph.git/log
ceph.git
7 years agoosd: debug_bluestore on shutdown
Sage Weil [Thu, 7 Sep 2017 21:12:42 +0000 (17:12 -0400)]
osd: debug_bluestore on shutdown

...just like we do with filestore etc.

Signed-off-by: Sage Weil <sage@redhat.com>
(cherry picked from commit 354dccda322b7ac8844b321f603e049128bf4779)

7 years agoos/bluestore: dump stray cache content on shutdown
Sage Weil [Thu, 7 Sep 2017 21:11:30 +0000 (17:11 -0400)]
os/bluestore: dump stray cache content on shutdown

Tracking down http://tracker.ceph.com/issues/21259.

Signed-off-by: Sage Weil <sage@redhat.com>
(cherry picked from commit 28d9b6b0e92cf51996a12a43c81f7ac2abcaecaa)

7 years agoMerge pull request #17867 from smithfarm/wip-21443-luminous
Sage Weil [Sun, 1 Oct 2017 20:56:10 +0000 (15:56 -0500)]
Merge pull request #17867 from smithfarm/wip-21443-luminous

luminous: mgr: Prometheus crash when update

Reviewed-by: Amit Kumar <amitkuma@redhat.com>
7 years agoMerge pull request #17868 from smithfarm/wip-21452-luminous
Sage Weil [Sun, 1 Oct 2017 20:55:59 +0000 (15:55 -0500)]
Merge pull request #17868 from smithfarm/wip-21452-luminous

luminous: mgr: prometheus module generates invalid output when counter names contain non-alphanum characters

Reviewed-by: Amit Kumar <amitkuma@redhat.com>
7 years agoMerge pull request #17896 from liewegas/wip-bluestore-tool-luminous
Sage Weil [Sun, 1 Oct 2017 20:55:30 +0000 (15:55 -0500)]
Merge pull request #17896 from liewegas/wip-bluestore-tool-luminous

ceph-objectstore-tool and ceph-bluestore-tool: backports from master

Reviewed-by: David Zafman <dzafman@redhat.com>
7 years agoMerge pull request #17930 from jcsp/wip-luminous-health-warn
Sage Weil [Sun, 1 Oct 2017 20:55:11 +0000 (15:55 -0500)]
Merge pull request #17930 from jcsp/wip-luminous-health-warn

mon: show legacy health warning in `status` output

Reviewed-by: Sage Weil <sage@redhat.com>
7 years agoMerge pull request #18031 from xiexingguo/wip-pr-17987
Sage Weil [Sun, 1 Oct 2017 20:54:47 +0000 (15:54 -0500)]
Merge pull request #18031 from xiexingguo/wip-pr-17987

luminous: osd/PrimaryLogPG: kick off recovery on backoffing a degraded object

Reviewed-by: Sage Weil <sage@redhat.com>
7 years agoMerge pull request #18033 from xiexingguo/wip-pr-17698
Sage Weil [Sun, 1 Oct 2017 20:54:17 +0000 (15:54 -0500)]
Merge pull request #18033 from xiexingguo/wip-pr-17698

luminous: os/bluestore: allocate entire write in one go

Reviewed-by: Sage Weil <sage@redhat.com>
7 years agoMerge pull request #18038 from jecluis/wip-21300-luminous
Sage Weil [Sun, 1 Oct 2017 20:53:56 +0000 (15:53 -0500)]
Merge pull request #18038 from jecluis/wip-21300-luminous

luminous: mon/MgrMonitor: handle cmd descs to/from disk in the absence of active mgr

8 years agoMerge pull request #17862 from smithfarm/wip-21307-luminous
Sage Weil [Fri, 29 Sep 2017 12:11:31 +0000 (07:11 -0500)]
Merge pull request #17862 from smithfarm/wip-21307-luminous

luminous: mon: Client client.admin marked osd.2 out, after it was down for 1504627577 seconds

Reviewed-by: Sage Weil <sage@redhat.com>
8 years agoMerge pull request #17865 from smithfarm/wip-21465-luminous
Sage Weil [Fri, 29 Sep 2017 12:10:55 +0000 (07:10 -0500)]
Merge pull request #17865 from smithfarm/wip-21465-luminous

luminous: OSD metadata 'backend_filestore_dev_node' is unknown even for simple deployment

Reviewed-by: Sage Weil <sage@redhat.com>
8 years agoMerge pull request #17936 from liewegas/wip-ruleset-errors-luminous
Sage Weil [Fri, 29 Sep 2017 12:10:26 +0000 (07:10 -0500)]
Merge pull request #17936 from liewegas/wip-ruleset-errors-luminous

mon/OSDMonitor: error out if setting ruleset-* ec profile property

Reviewed-by: Sage Weil <sage@redhat.com>
8 years agoMerge pull request #17946 from liewegas/wip-mgr-localpool-luminous
Sage Weil [Fri, 29 Sep 2017 12:10:06 +0000 (07:10 -0500)]
Merge pull request #17946 from liewegas/wip-mgr-localpool-luminous

luminous: mgr/localpool: module to automagically create localized pools

Reviewed-by: John Spray <john.spray@redhat.com>
8 years agoMerge pull request #17998 from liewegas/wip-pr-17978-luminous
Sage Weil [Fri, 29 Sep 2017 12:09:27 +0000 (07:09 -0500)]
Merge pull request #17998 from liewegas/wip-pr-17978-luminous

rbdmap: fix umount when multiple mounts use the same RBD

Reviewed-by: Sage Weil <sage@redhat.com>
8 years agomon/MgrMonitor: read cmd descs if empty on update_from_paxos() 18038/head
Joao Eduardo Luis [Wed, 20 Sep 2017 16:53:40 +0000 (17:53 +0100)]
mon/MgrMonitor: read cmd descs if empty on update_from_paxos()

If the MgrMonitor's `command_descs` is empty, the monitor will not send
the mgr commands to clients on `get_descriptions`. This, in turn, has
the clients sending the commands to the monitors, which will have no
idea how to handle them.

Therefore, make sure to read the `command_descs` from disk if the vector
is empty.

Fixes: http://tracker.ceph.com/issues/21300
Signed-off-by: Joao Eduardo Luis <joao@suse.de>
(cherry picked from commit 3d06079bae0fbc096d6c3639807d9be3597e841a)

8 years agomon/MgrMonitor: populate on-disk cmd descs if empty on upgrade
Joao Eduardo Luis [Wed, 27 Sep 2017 16:55:17 +0000 (17:55 +0100)]
mon/MgrMonitor: populate on-disk cmd descs if empty on upgrade

During kraken, when we first introduced the mgrs, we wouldn't populate
the on-disk command descriptions on create_initial(). Therefore, if we
are upgrading from a cluster that never had a mgr, we may end up
crashing because we have no cmd descs to load from disk.

Fixes: http://tracker.ceph.com/issues/21300
Signed-off-by: Joao Eduardo Luis <joao@suse.de>
8 years agoos/bluestore: allocate entire write in one go 18033/head
Sage Weil [Thu, 10 Aug 2017 20:44:59 +0000 (16:44 -0400)]
os/bluestore: allocate entire write in one go

On the first pass through the writes, compress data and calculate a final
amount of space we need to allocate.  On the second pass, assign the
extents to blobs and queue the writes.

This allows us to do a single allocation for all blobs, which will lead
to less fragmentation and a much better write pattern.

Signed-off-by: Sage Weil <sage@redhat.com>
(cherry picked from commit e200f358499af8e3acb6ac4f675cc167433b53ec)

8 years agoosd/PrimaryLogPG: kick off recovery on backoffing a degraded object 18031/head
xie xingguo [Wed, 27 Sep 2017 08:05:56 +0000 (16:05 +0800)]
osd/PrimaryLogPG: kick off recovery on backoffing a degraded object

As we are now blocking frontend ops from accessing that very object!

Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>
(cherry picked from commit db20328b456d70d6728fd27f17da6f2f3546e84b)

8 years agorbdmap: fix umount when multiple mounts use the same RBD 17998/head
Alexandre Marangone [Tue, 26 Sep 2017 18:35:04 +0000 (11:35 -0700)]
rbdmap: fix umount when multiple mounts use the same RBD

When a Kubernetes Pod consumes a RBD it is mounted two
times on the same host. When the host shutdown umount will
fail leading to a hung system

Signed-off-by: Alexandre Marangone <amarango@redhat.com>
(cherry picked from commit 40825daecedb2a3481021e4d36a367c339eb9b62)

8 years agoMerge pull request #17953 from ceph/wip-ceph-ansible-bport
vasukulkarni [Wed, 27 Sep 2017 16:50:44 +0000 (09:50 -0700)]
Merge pull request #17953 from ceph/wip-ceph-ansible-bport

tests - ceph-ansible backports

8 years agouse ovh instead of vps 17953/head
Vasu Kulkarni [Thu, 21 Sep 2017 22:34:31 +0000 (15:34 -0700)]
use ovh instead of vps

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
(cherry picked from commit 56d1511bdfaee5a2a0d85bd606348632bfcbf09e)
Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
8 years ago12.2.1 v12.2.1
Jenkins Build Slave User [Tue, 26 Sep 2017 16:27:07 +0000 (16:27 +0000)]
12.2.1

8 years agoMerge pull request #17970 from idryomov/wip-pr-17346-luminous 33103/head
Jason Dillaman [Tue, 26 Sep 2017 15:53:56 +0000 (11:53 -0400)]
Merge pull request #17970 from idryomov/wip-pr-17346-luminous

luminous: qa/suites/krbd: rbd_xfstests job overhaul

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
8 years agoqa/suites/krbd: new rbd_xfstests job definition 17970/head
Ilya Dryomov [Fri, 18 Aug 2017 15:49:03 +0000 (17:49 +0200)]
qa/suites/krbd: new rbd_xfstests job definition

ext4 seems to be a better choice for our purposes -- less test churn,
rather small and reliable exclude list.

All excluded tests but generic/050 fail with no krbd in the mix, most
have popped up on linux-ext4 list at least once.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit 6647c65ebe0688283cc8b9da9f638a4d6ffb75de)

8 years agoqa/tasks/rbd.xfstests: take exclude list from yaml
Ilya Dryomov [Thu, 17 Aug 2017 13:35:42 +0000 (15:35 +0200)]
qa/tasks/rbd.xfstests: take exclude list from yaml

Different filesystems (and further, different configurations of the
same filesystem) need different exclude lists.  Hard coding the list in
a wrapper script is inflexible.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit 01914be5a3b35479037054293b85d96596cb870b)

Conflicts:
qa/run_xfstests_krbd.sh [ trivial, caused by commit
  3aae5ca6fda8 ("scripts: fix bash path in shebangs") ]

8 years agoqa/run_xfstests.sh: quit building xfstests on test nodes
Ilya Dryomov [Wed, 16 Aug 2017 09:47:19 +0000 (11:47 +0200)]
qa/run_xfstests.sh: quit building xfstests on test nodes

xfstests is a pain to build on trusty, xenial and centos7 with a single
script.  It is also very sensitive to dependencies, which again need to
be managed on all those distros -- different sets of supported commands
and switches, some versions have known bugs, etc.

Download a pre-built, statically linked tarball and use it instead.
The tarball was generated using xfstests-bld by Ted Ts'o, with a number
of tweaks by myself (mostly concerning the build environment).

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit 1a982fea53c30fb246c2ab28822af6a69165ffea)

8 years agoqa/run_xfstests.sh: drop *_MKFS_OPTIONS variables
Ilya Dryomov [Wed, 16 Aug 2017 09:47:19 +0000 (11:47 +0200)]
qa/run_xfstests.sh: drop *_MKFS_OPTIONS variables

AFAICT ./check doesn't query EXT4_MKFS_OPTIONS or BTRFS_MKFS_OPTIONS,
We don't need anything special for xfs, so remove all of them to avoid
confusion.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit 7a0e45046d556c6624e0225ff006c31d8dbae2a8)

8 years agoqa/run_xfstests.sh: kill FS_TYPE variable
Ilya Dryomov [Wed, 16 Aug 2017 09:47:19 +0000 (11:47 +0200)]
qa/run_xfstests.sh: kill FS_TYPE variable

./check expects FSTYP, so use that throughout.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit ca99f1e8406c14e5899d393ab0bb503061c47b9b)

8 years agodoc/mgr: add "local pool" plugin to toc 17946/head
Kefu Chai [Tue, 26 Sep 2017 04:10:28 +0000 (12:10 +0800)]
doc/mgr: add "local pool" plugin to toc

Signed-off-by: Kefu Chai <kchai@redhat.com>
(cherry picked from commit 0b831e7886daad5836ca3cb58c01c123f99a1cde)

8 years agoqa/tests: skip test that use public bucket landsat-pds from AWS
Vasu Kulkarni [Wed, 20 Sep 2017 00:09:08 +0000 (17:09 -0700)]
qa/tests: skip test that use public bucket landsat-pds from AWS

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
(cherry picked from commit e11a1f773d8795dd9bd9819aaf229c6598e22ca6)

8 years agoqa/tests: update required ceph-ansbile vars
Vasu Kulkarni [Wed, 20 Sep 2017 00:06:03 +0000 (17:06 -0700)]
qa/tests: update required ceph-ansbile vars

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
(cherry picked from commit 184d5e6bad46451b3a8c5787f56148566bb42f20)

8 years agoqa/tests: Fix ceph-ansible upstream vars
Vasu Kulkarni [Wed, 20 Sep 2017 00:03:46 +0000 (17:03 -0700)]
qa/tests: Fix ceph-ansible upstream vars

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
(cherry picked from commit 077904a0b5dffe9d64feade94cf30ffc92f1c0e5)

8 years agoFix get_system_type failure due to invalid remote name
Vasu Kulkarni [Sun, 10 Sep 2017 19:42:16 +0000 (12:42 -0700)]
Fix get_system_type failure due to invalid remote name

recent changes caused the remote name to be invalid, fix the
arg passed to get_system_type

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
(cherry picked from commit 9fe58d5ed6bd2338271a9e003f51d80c2e440e23)

8 years agoStop the mgr on node which is not client, this will ensure
Vasu Kulkarni [Fri, 8 Sep 2017 17:00:49 +0000 (10:00 -0700)]
Stop the mgr on node which is not client, this will ensure
the client.0's mgr is active mgr for workunit to work.

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
(cherry picked from commit b1fd615e9f507d4243744b34887366086c1890ac)

8 years agoTest ceph-mgr RESTful api
Vasu Kulkarni [Fri, 8 Sep 2017 00:10:04 +0000 (17:10 -0700)]
Test ceph-mgr RESTful api

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
(cherry picked from commit e5b5a1b8c8d0e750d29fe41a16252dcb0adf449f)

8 years agoAdd additional variables as required by ceph-ansible for upstream
Vasu Kulkarni [Thu, 7 Sep 2017 23:54:51 +0000 (16:54 -0700)]
Add additional variables as required by ceph-ansible for upstream
installation.

possible revert if those are made default
issue: https://github.com/ceph/ceph-ansible/issues/1834

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
(cherry picked from commit e963bfd6ca0ea1525e37dc35bf3309548c7a3831)

8 years agoRename folders to fix task order
Vasu Kulkarni [Fri, 18 Aug 2017 18:35:54 +0000 (11:35 -0700)]
Rename folders to fix task order

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
(cherry picked from commit 9cc00c5c1a6e99323a4655aad5c093b8d87609f6)

8 years agouse bluestore with dmcrypt option
Vasu Kulkarni [Fri, 18 Aug 2017 18:09:50 +0000 (11:09 -0700)]
use bluestore with dmcrypt option

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
(cherry picked from commit 1041c803f1f8f7be0edd0e78df352152fa3c4e6f)

8 years agoAdd dmcrypt option
Vasu Kulkarni [Fri, 18 Aug 2017 18:08:00 +0000 (11:08 -0700)]
Add dmcrypt option

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
(cherry picked from commit f6de5d9f9e2a3666116311c146a8f2a45f3b8dbb)

8 years agoSeparate the main task from options
Vasu Kulkarni [Fri, 18 Aug 2017 18:05:01 +0000 (11:05 -0700)]
Separate the main task from options

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
(cherry picked from commit 60d00e0eadda3a7a7cfbac2acdc81a4d55a27570)

8 years agoCatchup with recent changes with ceph-ansible
Vasu Kulkarni [Fri, 18 Aug 2017 17:47:22 +0000 (10:47 -0700)]
Catchup with recent changes with ceph-ansible

Adds osd_scenario and ceph_stable_release variables

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
(cherry picked from commit 0395b844884a63a2cb7480dcc40762df6915b403)

8 years agoAdd workaround for http://tracker.ceph.com/issues/20950
Vasu Kulkarni [Thu, 24 Aug 2017 18:02:20 +0000 (11:02 -0700)]
Add workaround for http://tracker.ceph.com/issues/20950

mgr bootstrap key differs on disk, rewrite the new key
using auth get.

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
(cherry picked from commit 5c43c5972ddb04724c8659a1b8d71cda1e0acdfb)

8 years agoAdd kraken to luminous upgrade case
Vasu Kulkarni [Thu, 24 Aug 2017 16:24:52 +0000 (09:24 -0700)]
Add kraken to luminous upgrade case

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
(cherry picked from commit 71ea4987e0d55cc49e90dfbaf9722769cc892f3f)

8 years agoAdd kernel tar/untar and systemd task
Vasu Kulkarni [Mon, 7 Aug 2017 19:19:41 +0000 (12:19 -0700)]
Add kernel tar/untar and systemd task

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
(cherry picked from commit 526d4aef1cb73ffa6f73e1353dc06eed6cccb9f9)

8 years agoRestart services after upgrade
Vasu Kulkarni [Tue, 1 Aug 2017 23:32:42 +0000 (16:32 -0700)]
Restart services after upgrade

This seems to be an issue and should be fixed in our systemd scripts,
need to discuss more on this.

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
(cherry picked from commit 2116f76a1f1fbb32ca5844c1d051e22b1b657e0f)

8 years agosince the roles are mapped inside ceph-deploy, store the roles that
Vasu Kulkarni [Sat, 29 Jul 2017 06:11:41 +0000 (23:11 -0700)]
since the roles are mapped inside ceph-deploy, store the roles that
are mapped and use the new mapped role for upgrades during later
stage.

eg: mon.a is mapped to mon.mira002 during install, store this mapping
and durig upgrade map it back to appropriate name to find the hostname
with that role

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
(cherry picked from commit 8a2b9a08095dc97e0e440cc900c650d5efb55a4f)

8 years agoAdd doc string for missing ceph-deploy-branch
Vasu Kulkarni [Sat, 29 Jul 2017 00:48:09 +0000 (17:48 -0700)]
Add doc string for missing ceph-deploy-branch

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
(cherry picked from commit e7a0a4f87f2fbea0c9e27341c952245f1ad34ab8)

8 years agouse mon.a as ceph-admin node, the previous get_first_mon returns
Vasu Kulkarni [Thu, 27 Jul 2017 22:24:25 +0000 (15:24 -0700)]
use mon.a as ceph-admin node, the previous get_first_mon returns
a sorted(mons)[0] as first mon.

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
(cherry picked from commit 407ce2613d37252e48c2a648b94b065ab070ec46)

8 years agouse elif instead of else to handle cases for jewel install
Vasu Kulkarni [Thu, 27 Jul 2017 18:17:31 +0000 (11:17 -0700)]
use elif instead of else to handle cases for jewel install

jewel needs neither filestore or bluestore as an option, so provide none
when running with jewel branch.

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
(cherry picked from commit f3ec7cabf66c76d712f77a4dfefe96f374507982)

8 years agoJewel to luminous upgrade scenario using ceph-deploy
Vasu Kulkarni [Thu, 27 Jul 2017 02:18:11 +0000 (19:18 -0700)]
Jewel to luminous upgrade scenario using ceph-deploy

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
(cherry picked from commit 9c85c76a90dcf46ec2bcc6840f4fc57ed86583b5)

8 years agoAdd upgrade functionality using ceph-deploy
Vasu Kulkarni [Thu, 27 Jul 2017 01:56:08 +0000 (18:56 -0700)]
Add upgrade functionality using ceph-deploy

This is to test for customer like upgrade scenarios and to find
any issues that may be related to systemd, packaging etc

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
(cherry picked from commit 9a73127c16d3395934d003024964ed1373d81083)

8 years agoFix old flake8 issues
Vasu Kulkarni [Wed, 26 Jul 2017 22:01:15 +0000 (15:01 -0700)]
Fix old flake8 issues

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
(cherry picked from commit b409c3d163194b0b65e26b0fbc93def8e17a10b4)

8 years agoAdd option to skip mgr install for old release
Vasu Kulkarni [Wed, 26 Jul 2017 21:58:50 +0000 (14:58 -0700)]
Add option to skip mgr install for old release

Signed-off-by: Vasu Kulkarni <vasu@redhat.com>
(cherry picked from commit a9be0c387ed2ae09227dad833c8647af110bd2de)

8 years agoMerge pull request #17945 from batrick/i21540
Nathan Cutler [Mon, 25 Sep 2017 20:40:05 +0000 (22:40 +0200)]
Merge pull request #17945 from batrick/i21540

luminous: qa whitelist fixes

Reviewed-by: Nathan Cutler <ncutler@suse.com>
8 years agoMerge pull request #17821 from smithfarm/wip-21472-luminous
Nathan Cutler [Mon, 25 Sep 2017 20:32:17 +0000 (22:32 +0200)]
Merge pull request #17821 from smithfarm/wip-21472-luminous

luminous: tests: kcephfs: ignorable MDS cache too large warning

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Yan, Zheng <zyan@redhat.com>
8 years agoqa/workunits/mgr/test_localpool: simple test for localpool mode
Sage Weil [Sun, 24 Sep 2017 18:05:14 +0000 (14:05 -0400)]
qa/workunits/mgr/test_localpool: simple test for localpool mode

Signed-off-by: Sage Weil <sage@redhat.com>
(cherry picked from commit 6383fa5b3004a606c15efd2be1b7ce5d57f05086)

8 years agopybind/mgr/localpool: module to automagically create localized pools
Sage Weil [Wed, 6 Sep 2017 19:34:50 +0000 (15:34 -0400)]
pybind/mgr/localpool: module to automagically create localized pools

By default, this will create a pool per rack, 3x replication, with a host
failure domain.  Those parameters can be customized via mgr config-key
options.

Signed-off-by: Sage Weil <sage@redhat.com>
(cherry picked from commit 1a0f42b70a4c9fa68dc47f2f521d0f1e8f5bb220)

8 years agoqa: whitelist expected MDS_CACHE_OVERSIZED 17821/head
Patrick Donnelly [Mon, 28 Aug 2017 18:47:29 +0000 (11:47 -0700)]
qa: whitelist expected MDS_CACHE_OVERSIZED

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit 71f0066f6ec32be18abc0c220a49e203ac3b9786)

Conflicts:
   qa/suites/kcephfs/recovery/tasks/client-limits.yaml - fix
      already applied by some other commit

8 years agoqa: whitelist expected rstat warning 17945/head
Patrick Donnelly [Mon, 28 Aug 2017 18:52:36 +0000 (11:52 -0700)]
qa: whitelist expected rstat warning

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit c53d091e5110011866aa85495ff4e5abe0442d71)

8 years agoMerge pull request #17922 from batrick/kcephfs-backports
Nathan Cutler [Mon, 25 Sep 2017 13:43:06 +0000 (15:43 +0200)]
Merge pull request #17922 from batrick/kcephfs-backports

luminous: qa: kcephfs whitelist fixes

Reviewed-by: Yan, Zheng <zyan@redhat.com>
8 years agomon/OSDMonitor: error out if setting ruleset-* ec profile property 17936/head
Sage Weil [Wed, 20 Sep 2017 20:42:01 +0000 (16:42 -0400)]
mon/OSDMonitor: error out if setting ruleset-* ec profile property

We change ruleset -> crush back in dc7a2aaf7a34b1e6af0c7b79dc44a69974c1da23.
If someone tries to use the old property, error out early, instead of
silently not doing the thing they thought they told us to do.

Signed-off-by: Sage Weil <sage@redhat.com>
(cherry picked from commit 6376d75eda648789b5e316a1ac2883708db7128e)

8 years agomon: show legacy health warning in `status` output 17930/head
John Spray [Sat, 23 Sep 2017 12:48:36 +0000 (13:48 +0100)]
mon: show legacy health warning in `status` output

Previously you only got the text of this if you were
either looking at "health detail" or if you had
already set the preluminous_compat setting (in which
case you presumably were already aware so the message
isn't doing much).

Signed-off-by: John Spray <john.spray@redhat.com>
8 years agoMerge pull request #17855 from smithfarm/wip-21490-luminous
Yuri Weinstein [Fri, 22 Sep 2017 20:25:59 +0000 (13:25 -0700)]
Merge pull request #17855 from smithfarm/wip-21490-luminous

luminous: tests: test_rebuild_simple_altpool triggers MDS assertion

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
8 years agoqa: get config only on running MDS 17855/head
Patrick Donnelly [Wed, 20 Sep 2017 22:44:40 +0000 (15:44 -0700)]
qa: get config only on running MDS

Fixes: http://tracker.ceph.com/issues/21466
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit 8a535d9c72965c79692dccc6ff418b46fb86c3eb)

8 years agoqa: whitelist cache too large in client-limits 17922/head
Patrick Donnelly [Fri, 22 Sep 2017 16:44:43 +0000 (09:44 -0700)]
qa: whitelist cache too large in client-limits

Fixes: http://tracker.ceph.com/issues/21510
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit 87039eb30a25158a192ab7099fa81fbb9ce02953)

8 years agoqa: whitelist failover log warnings
Patrick Donnelly [Fri, 22 Sep 2017 16:42:06 +0000 (09:42 -0700)]
qa: whitelist failover log warnings

Fixes: http://tracker.ceph.com/issues/21509
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit 37a7737712f36422b5f035a0e02bab25ae2da890)

8 years agoqa: ignore evicted client in client-recovery
Patrick Donnelly [Fri, 22 Sep 2017 16:38:59 +0000 (09:38 -0700)]
qa: ignore evicted client in client-recovery

Fixes: http://tracker.ceph.com/issues/21508
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit 8ecef57bb10a83587f02ce087b657d303ae7c589)

8 years agoMerge PR #17878 into luminous
Patrick Donnelly [Fri, 22 Sep 2017 16:53:43 +0000 (09:53 -0700)]
Merge PR #17878 into luminous

* refs/remotes/upstream/pull/17878/head:
qa/cephfs: Fix error in test_filtered_df

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Amit Kumar <amitkuma@redhat.com>
8 years agoMerge PR #17877 into luminous
Patrick Donnelly [Fri, 22 Sep 2017 16:53:41 +0000 (09:53 -0700)]
Merge PR #17877 into luminous

* refs/remotes/upstream/pull/17877/head:
client: Delete onsafe
client: add mountedness check inside client_lock

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
8 years agoMerge PR #17870 into luminous
Patrick Donnelly [Fri, 22 Sep 2017 16:53:39 +0000 (09:53 -0700)]
Merge PR #17870 into luminous

* refs/remotes/upstream/pull/17870/head:
mds: reset Formatter on error

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Amit Kumar <amitkuma@redhat.com>
8 years agoMerge PR #17854 into luminous
Patrick Donnelly [Fri, 22 Sep 2017 16:53:37 +0000 (09:53 -0700)]
Merge PR #17854 into luminous

* refs/remotes/upstream/pull/17854/head:
mds: void sending cap import message when inode is frozen
client: fix message order check in handle_cap_export()

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
8 years agoMerge PR #17852 into luminous
Patrick Donnelly [Fri, 22 Sep 2017 16:53:35 +0000 (09:53 -0700)]
Merge PR #17852 into luminous

* refs/remotes/upstream/pull/17852/head:
mds: check ongoing catter-gather process before capping log

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
8 years agoMerge PR #17822 into luminous
Patrick Donnelly [Fri, 22 Sep 2017 16:53:33 +0000 (09:53 -0700)]
Merge PR #17822 into luminous

* refs/remotes/upstream/pull/17822/head:
qa/cephfs: kill mount if it gets evicted by mds
qa/cephfs: fix test_evict_client

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Zheng Yan <zyan@redhat.com>
8 years agoMerge pull request #17817 from smithfarm/wip-21464-luminous
Yuri Weinstein [Fri, 22 Sep 2017 16:44:54 +0000 (09:44 -0700)]
Merge pull request #17817 from smithfarm/wip-21464-luminous

luminous: qa: ignorable MDS_READ_ONLY warning

Reviewed-by: Yan, Zheng <ukernel@gmail.com>
Reviewed-by: Yuri Weinstein <yweins@redhat.com>
8 years agoMerge pull request #17879 from smithfarm/wip-21449-luminous
Yuri Weinstein [Fri, 22 Sep 2017 16:41:18 +0000 (09:41 -0700)]
Merge pull request #17879 from smithfarm/wip-21449-luminous

luminous: tests: test_misc creates metadata pool with dummy object resulting in WRN: POOL_APP_NOT_ENABLED

Reviewed-by: Amit Kumar amitkuma@redhat.com
Reviewed-by: Yan, Zheng <ukernel@gmail.com>
8 years agoMerge pull request #17886 from smithfarm/wip-21486-luminous
Yuri Weinstein [Fri, 22 Sep 2017 16:40:46 +0000 (09:40 -0700)]
Merge pull request #17886 from smithfarm/wip-21486-luminous

luminous: tests: [kcephfs] test_client_pin times out waiting for dentry release from kernel

Reviewed-by: Amit Kumar amitkuma@redhat.com
Reviewed-by: Yan, Zheng <ukernel@gmail.com>
8 years agoMerge pull request #17887 from smithfarm/wip-21487-luminous
Yuri Weinstein [Fri, 22 Sep 2017 16:39:26 +0000 (09:39 -0700)]
Merge pull request #17887 from smithfarm/wip-21487-luminous

luminous: cephfs: MDS rank add/remove log messages say wrong number of ranks

Reviewed-by: Amit Kumar amitkuma@redhat.com
Reviewed-by: Yan, Zheng <ukernel@gmail.com>
8 years agoMerge pull request #17888 from smithfarm/wip-21488-luminous
Yuri Weinstein [Fri, 22 Sep 2017 16:37:42 +0000 (09:37 -0700)]
Merge pull request #17888 from smithfarm/wip-21488-luminous

luminous: tests: failures from pjd fstest

Reviewed-by: Amit Kumar amitkuma@redhat.com
Reviewed-by: Yan, Zheng <ukernel@gmail.com>
8 years agoceph-bluestore-tool: better default logging; --log-file and --log-level options 17896/head
Sage Weil [Wed, 13 Sep 2017 22:33:21 +0000 (18:33 -0400)]
ceph-bluestore-tool: better default logging; --log-file and --log-level options

- keep derr going to stderr (so we see 'fsck error' messages)
- hide the rest
- add friendly --log-file option

Signed-off-by: Sage Weil <sage@redhat.com>
(cherry picked from commit 9bd98b42fea1791935280f5ad19e5420d5303e03)

8 years agoceph-bluestore-tool: add 'bluefs-bdev-expand' to expand wal or db usage
Sage Weil [Thu, 7 Sep 2017 22:27:20 +0000 (18:27 -0400)]
ceph-bluestore-tool: add 'bluefs-bdev-expand' to expand wal or db usage

If you are using the wal or db devices, this will expand bluefs's usage to
include the entire block device.

Signed-off-by: Sage Weil <sage@redhat.com>
(cherry picked from commit aa4baad529835c1999ff9cc1a2f509c52a0cc699)

8 years agoceph-bluestore-tool: add 'bluefs-bdev-sizes' command
Sage Weil [Thu, 7 Sep 2017 22:20:27 +0000 (18:20 -0400)]
ceph-bluestore-tool: add 'bluefs-bdev-sizes' command

Show bdev sizes vs owned extents.

Signed-off-by: Sage Weil <sage@redhat.com>
(cherry picked from commit 9e492f2a2217ee94670e9e0d6cf28e4b7bff5ad0)

8 years agoceph-bluestore-tool: factor out bluefs mount
Sage Weil [Thu, 7 Sep 2017 22:12:21 +0000 (18:12 -0400)]
ceph-bluestore-tool: factor out bluefs mount

Signed-off-by: Sage Weil <sage@redhat.com>
(cherry picked from commit 86db2d7b09de4362d90fba834d61978860d73309)

8 years agoos/ObjectStore: add repair interface
Sage Weil [Thu, 7 Sep 2017 16:27:35 +0000 (12:27 -0400)]
os/ObjectStore: add repair interface

Signed-off-by: Sage Weil <sage@redhat.com>
(cherry picked from commit c7b7a1f04f78fa62890c567d0ca53874c8d75eb7)

8 years agoceph-objectstore-tool: Make pg removal require --force
David Zafman [Sat, 9 Sep 2017 00:53:07 +0000 (17:53 -0700)]
ceph-objectstore-tool: Make pg removal require --force

Add new export-remove to combine the 2 operations

Fixes: http://tracker.ceph.com/issues/21272
Signed-off-by: David Zafman <dzafman@redhat.com>
(cherry picked from commit 3bb20f6d750915d176c7a34367dac966a20faa76)

8 years agoceph-objectstore-tool: Better messages for bad --journal-path
David Zafman [Sat, 9 Sep 2017 00:09:48 +0000 (17:09 -0700)]
ceph-objectstore-tool: Better messages for bad --journal-path

Signed-off-by: David Zafman <dzafman@redhat.com>
(cherry picked from commit 49ca1fff7fc4360d2f3a9cac60c0ba651cbc4750)

8 years agotest: Fix ceph-objectstore-tool test for standalone and latest code
David Zafman [Thu, 7 Sep 2017 03:41:50 +0000 (20:41 -0700)]
test: Fix ceph-objectstore-tool test for standalone and latest code

vstart.sh now defaults to bluestore, so specify filestore
Set environment for run-standalone.sh and cmake build
Create td/cot_dir as test directory
Crush output format change
Change dir into test directory
Give a little time after pool creation
Check for core files as ceph-helpers.sh does

Signed-off-by: David Zafman <dzafman@redhat.com>
(cherry picked from commit 3ac219df2d635a08be52df3ad44cf0683de46af2)

8 years agotest: Move ceph-objectstore-tool test to standalone
David Zafman [Thu, 7 Sep 2017 03:40:37 +0000 (20:40 -0700)]
test: Move ceph-objectstore-tool test to standalone

Signed-off-by: David Zafman <dzafman@redhat.com>
(cherry picked from commit 495c32fd31b311d22f0d4509e896916bd2671615)

8 years agoMerge pull request #17812 from liewegas/wip-21410-luminous
Sage Weil [Thu, 21 Sep 2017 15:56:16 +0000 (10:56 -0500)]
Merge pull request #17812 from liewegas/wip-21410-luminous

Revert "osd/OSDMap: allow bidirectional swap of pg-upmap-items"

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: xie xingguo <xie.xingguo@zte.com.cn>
8 years agoceph: do link/rename semantic checks after srcdn is readable 17888/head
Yan, Zheng [Sun, 17 Sep 2017 08:51:04 +0000 (16:51 +0800)]
ceph: do link/rename semantic checks after srcdn is readable

For hard link, source inode must not be directory. For rename,
types of source/destination inodes must match. If srcdn is replica
and we do these checks while it's not readble, it's possible that
wrong source inode is used in these checks.

Fixes: http://tracker.ceph.com/issues/21383
Signed-off-by: "Yan, Zheng" <zyan@redhat.com>
(cherry picked from commit 46962b253563a867707e7c5d7887abf2060cc4d7)

8 years agomon/MDSMonitor: fix clog msgs on MDS rank add/rm 17887/head
John Spray [Mon, 18 Sep 2017 17:16:40 +0000 (13:16 -0400)]
mon/MDSMonitor: fix clog msgs on MDS rank add/rm

Because these were generated before the actual
change, the get_num_in values did not make sense.

Fixes: http://tracker.ceph.com/issues/21421
Signed-off-by: John Spray <john.spray@redhat.com>
(cherry picked from commit 511439b515414ae93d791f54a187e44d5fdffa7b)

8 years agoclient: set client_try_dentry_invalidate to false by default 17886/head
Yan, Zheng [Tue, 19 Sep 2017 06:38:13 +0000 (14:38 +0800)]
client: set client_try_dentry_invalidate to false by default

By default, ceph-fuse uses side effect of 'dentry invalidation' to
trim kernel dcache if it runs on kernel < 3.18. The implemention of
kernel function d_invalidate() changed in 3.18 kernel, the method no
longer works for upstream kernel >= 3.18.

RHEL 3.10 kernel includes backport of patches that change implemention
of d_invalidate(). So checking kernel version to decide if 'dentry
invalidation' method works is unreliable.

Fixes: http://tracker.ceph.com/issues/21423
Signed-off-by: "Yan, Zheng" <zyan@redhat.com>
(cherry picked from commit 69adaabface27880dd6c8dbfdeeb06cf3f3d346a)

8 years agoqa/tasks/cephfs: Whitelist POOL_APP_NOT_ENABLED for test_misc 17879/head
Douglas Fuller [Tue, 12 Sep 2017 17:22:09 +0000 (13:22 -0400)]
qa/tasks/cephfs: Whitelist POOL_APP_NOT_ENABLED for test_misc

test_misc verifies that ceph fs new will not create a filesystem
on a pool that already contains objects. As part of the test, it
inserts a dummy object into a pool and then attempts to use it for
CephFS. This triggers POOL_APP_NOT_ENABLED. Setting the application
metadata for the pool (and having ceph fs new fail because of the
existing metadata) would then exercise a different failure case.

Signed-off-by: Douglas Fuller <dfuller@redhat.com>
(cherry picked from commit eba01c48ca8b3113b1faf353990533a5ae39232c)

8 years agoqa/cephfs: Fix error in test_filtered_df 17878/head
Douglas Fuller [Wed, 13 Sep 2017 18:00:20 +0000 (14:00 -0400)]
qa/cephfs: Fix error in test_filtered_df

ceph df accounts for pool size, so there is no need to do it in the test.

Fixes: http://tracker.ceph.com/issues/21381
Signed-off-by: Douglas Fuller <dfuller@redhat.com>
(cherry picked from commit b059cb6290d81df07bd8ec4e1ec9ef0be24892a2)

8 years agoclient: Delete onsafe 17877/head
Jos Collin [Thu, 7 Sep 2017 07:11:40 +0000 (12:41 +0530)]
client: Delete onsafe

Variable "onsafe" going out of scope leaks the storage it points to. This fixes the Coverity
Scan CID 1417473.

Signed-off-by: Jos Collin <jcollin@redhat.com>
(cherry picked from commit e24a91e9189587c800ef5709c78f1bc0e0ac8419)

8 years agoclient: add mountedness check inside client_lock
Jeff Layton [Fri, 25 Aug 2017 12:31:47 +0000 (08:31 -0400)]
client: add mountedness check inside client_lock

Currently we check for mountedness in the high level wrappers, but those
checks are lockless. It's possible to have a call that races with
ceph_unmount(). It could pass one of the is_mounted() checks in the
wrapper, and then block on the client_lock while the unmount is actually
running. Eventually it picks up and runs after the unmount returns, with
questionable results -- possibly even a crash in some cases.

For now, we can explain this away with a simple admonition that
applications should ensure that no calls are running when ceph_unmount
is called. In the future though, we may need to forcibly shut down the
mount when certain events occur (not returning a lease or delegation in
time, for instance).

Sprinkle in a bunch of "unmounting" checks after taking the client_lock,
and simply have the functions return errors (or sensible values in some
cases) when the Client is being downed. With that, we ensure that this
sort of race can't occur, even when the unmount is not being driven by
userland. Note too that in some places I've replaced assertions in the
code with error returns, as that's nicer behavior for libraries.

Note that this can't replace the ->is_mounted() checks in the lockless
wrappers as those are needed to determine whether the client pointer in
the ceph_mount_info is still valid. The admonition not to allow
ceph_unmount to race with other calls is therefore still necessary.

Signed-off-by: Jeff Layton <jlayton@redhat.com>
(cherry picked from commit efca340596ef4da2b254ff1c64ec4c462d7b95a5)

8 years agomds: reset Formatter on error 17870/head
Patrick Donnelly [Tue, 5 Sep 2017 21:44:49 +0000 (14:44 -0700)]
mds: reset Formatter on error

This prevents partial results from concatenating with the error message.

Fixes: http://tracker.ceph.com/issues/21252
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit 2b2b3d2035a700a65c3c469eedae69ad31023cf6)

8 years agomgr/prometheus: only turn - into _minus at end 17868/head
John Spray [Mon, 28 Aug 2017 18:55:06 +0000 (19:55 +0100)]
mgr/prometheus: only turn - into _minus at end

...of perf counter name.

So that mds_mem_dir- becomes mds_mem_dir_minus, but
throttle-filestore_bytes becomes throttle_filestore_bytes.

At some point once this is all settled we should
probably just change Ceph's internal perf counter
naming to satisfy the major TSDB naming rules.

Signed-off-by: John Spray <john.spray@redhat.com>
(cherry picked from commit 29ac9270d43a225bb2336adaaf813c12e12a715a)

8 years agomgr/prometheus: Fix for MDS metrics
jermudgeon [Mon, 28 Aug 2017 05:26:28 +0000 (21:26 -0800)]
mgr/prometheus: Fix for MDS metrics

MDS metrics come in these forms:

mds_mem_dir #Directories
mds_mem_dir+ #Directories opened
mds_mem_dir- #Directories closed

In this case, continuing the trend of replacing all illegal characters with '_' results in…

mds_mem_dir #Directories
mds_mem_dir_ #Directories opened
mds_mem_dir_ #Directories closed

which is palpably a bad idea.

Suggested replacement for '+' = '_plus' seems fine, and a perusal of all metrics indicate that only MDS metrics end in '-' or '+' at this time.

Replacing '-' with '_minus' is probably less good for the general case, if anyone has a better idea…

I suppose another alternative would be to change MDS metrics so they don't use 'illegal' characters, but this also seems cumbersome and would break more third parties.

Fixes: http://tracker.ceph.com/issues/20899
Signed-off-by: Jeremy H Austin <jhaustin@gmail.com>
(cherry picked from commit d719cd04b294e90ab9d440ba7d033826c069a2de)

8 years agomgr: fix py calls for dne service perf counters 17867/head
John Spray [Fri, 8 Sep 2017 15:33:02 +0000 (11:33 -0400)]
mgr: fix py calls for dne service perf counters

Fixes: http://tracker.ceph.com/issues/21253
Signed-off-by: John Spray <john.spray@redhat.com>
(cherry picked from commit f9a4ca07acecd15986cbce61a6e118a6cb05af29)

8 years agoos/filestore: fix device/partition metadata detection 17865/head
Sage Weil [Tue, 8 Aug 2017 17:59:41 +0000 (13:59 -0400)]
os/filestore: fix device/partition metadata detection

The UUID thing (a) relies on partition labels to work, which isn't
always true (and won't be true for ceph-volume going forward), and
(b) reportedly doesn't work anyway.  The fd-based helper works
just fine (even for vstart).

Fixes: http://tracker.ceph.com/issues/20944
Signed-off-by: Sage Weil <sage@redhat.com>
(cherry picked from commit e62862be0b68a17763e19ed8453436c457294505)