git.apps.os.sepia.ceph.com Git - ceph.git/log
ceph.git
11 years agoFix lots of linter errors
Zack Cerza [Tue, 25 Mar 2014 20:27:54 +0000 (15:27 -0500)]
Fix lots of linter errors

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoAdd --preserve-queue to teuthology-kill
Zack Cerza [Tue, 25 Mar 2014 15:15:41 +0000 (10:15 -0500)]
Add --preserve-queue to teuthology-kill

For any cases where we just want to kill the running jobs and not delete
any scheduled jobs. This only applies to killing entire runs and is not
the default.

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoPostpone creation of logger object
Zack Cerza [Tue, 25 Mar 2014 15:07:52 +0000 (10:07 -0500)]
Postpone creation of logger object

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agorgw: add option to use erasure coding instead of replication
Josh Durgin [Tue, 25 Mar 2014 01:19:41 +0000 (18:19 -0700)]
rgw: add option to use erasure coding instead of replication

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
11 years agoradosgw-agent: default to 1000 max entries
Josh Durgin [Mon, 24 Mar 2014 21:54:23 +0000 (14:54 -0700)]
radosgw-agent: default to 1000 max entries

Using a small number of max entries is only needed for one test, so
make it configurable. Setting it too low makes s3tests fail.

Fixes http://tracker.ceph.com/issues/7742

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
11 years agointernal: cleaner fix for binary gibberish in logs
Josh Durgin [Mon, 24 Mar 2014 21:26:36 +0000 (14:26 -0700)]
internal: cleaner fix for binary gibberish in logs

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
11 years agointernal: ignore binary junk in kernel logs
Josh Durgin [Mon, 24 Mar 2014 21:13:07 +0000 (14:13 -0700)]
internal: ignore binary junk in kernel logs

NUL bytes can show up here after powercycle testing

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
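
One way to tolerate such bytes while scanning a log can be sketched as follows. This is a minimal illustration, not the code this commit changed: the helper name `find_kernel_errors` and the pattern list are hypothetical.

```python
import re

# Hypothetical patterns for this sketch; the real list lives in the source tree.
ERROR_PATTERN = re.compile(r"BUG|INFO: possible|WARNING")


def find_kernel_errors(raw):
    """Return log lines that look like kernel errors, ignoring NUL bytes."""
    # Drop NUL bytes, then decode leniently so stray binary cannot raise.
    text = raw.replace(b"\x00", b"").decode("utf-8", errors="replace")
    return [line for line in text.splitlines() if ERROR_PATTERN.search(line)]
```

Reading the log as bytes and decoding with `errors="replace"` keeps the scan from failing on whatever else a power cycle leaves behind.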
11 years agoceph: ignore daemon types that aren't configured
Josh Durgin [Sun, 23 Mar 2014 19:03:25 +0000 (12:03 -0700)]
ceph: ignore daemon types that aren't configured

not every cluster needs an mds

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
11 years agoradosgw-agent: use our mirror instead of github
Sage Weil [Fri, 21 Mar 2014 20:50:10 +0000 (13:50 -0700)]
radosgw-agent: use our mirror instead of github

Signed-off-by: Sage Weil <sage@inktank.com>
11 years agoMerge remote-tracking branch 'gh/firefly'
Sage Weil [Fri, 21 Mar 2014 16:12:40 +0000 (09:12 -0700)]
Merge remote-tracking branch 'gh/firefly'

11 years agoMerge pull request #231 from ceph/wip-fix-rgw
Alfredo Deza [Fri, 21 Mar 2014 15:38:26 +0000 (11:38 -0400)]
Merge pull request #231 from ceph/wip-fix-rgw

fix rgw task

11 years agorgw: handle empty conf case
Sage Weil [Fri, 21 Mar 2014 15:36:24 +0000 (08:36 -0700)]
rgw: handle empty conf case

2014-03-20T23:30:28.964 ERROR:teuthology.contextutil:Saw exception from nested tasks
Traceback (most recent call last):
  File "/home/teuthworker/teuthology-firefly/teuthology/contextutil.py", line 27, in nested
    vars.append(enter())
  File "/usr/lib/python2.7/contextlib.py", line 17, in __enter__
    return self.gen.next()
  File "/home/teuthworker/teuthology-firefly/teuthology/task/rgw.py", line 83, in ship_config
    idle_timeout = conf.get('idle_timeout', 30)
AttributeError: 'NoneType' object has no attribute 'get'

This happens with job yamls like:

- rgw:
  - client.0

(where it is a list and not a dict of clients)

Signed-off-by: Sage Weil <sage@inktank.com>
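
The failure above comes from calling `.get()` on a client whose options are `None`. A minimal sketch of one defensive approach, assuming a hypothetical helper `normalize_clients` (not necessarily what this commit does):

```python
def normalize_clients(config):
    """Map each client name to a dict of options, never None."""
    if config is None:
        return {}
    if isinstance(config, list):
        # List form ("- client.0"): clients are named but carry no options.
        return {client: {} for client in config}
    # Dict form: a client may still map to None when it has no options.
    return {client: (conf or {}) for client, conf in config.items()}


clients = normalize_clients(["client.0"])
# Safe now: every per-client conf is a dict, so the default of 30 applies.
idle_timeout = clients["client.0"].get("idle_timeout", 30)
```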
11 years agoMerge remote-tracking branch 'gh/firefly'
Sage Weil [Thu, 20 Mar 2014 23:50:15 +0000 (16:50 -0700)]
Merge remote-tracking branch 'gh/firefly'

11 years agoMerge pull request #230 from ceph/wip-7702
Sage Weil [Thu, 20 Mar 2014 23:48:44 +0000 (16:48 -0700)]
Merge pull request #230 from ceph/wip-7702

rgw: allow fastcgi idle timeout to be adjusted

Reviewed-by: Yehuda Sadeh <yehuda@inktank.com>
11 years agorgw: allow fastcgi idle timeout to be adjusted
Sage Weil [Thu, 20 Mar 2014 23:09:42 +0000 (16:09 -0700)]
rgw: allow fastcgi idle timeout to be adjusted

Signed-off-by: Sage Weil <sage@inktank.com>
11 years agoMerge branch 'firefly'
Sage Weil [Thu, 20 Mar 2014 17:35:30 +0000 (10:35 -0700)]
Merge branch 'firefly'

11 years agointernal: ignore ext4 recovery msg
Sage Weil [Tue, 18 Mar 2014 22:15:54 +0000 (15:15 -0700)]
internal: ignore ext4 recovery msg

  plana43 kernel: [   21.179205] EXT4-fs (sda1): INFO: recovery required on readonly filesystem

is harmless.

Signed-off-by: Sage Weil <sage@inktank.com>
11 years agoMerge branch 'firefly'
Sage Weil [Tue, 18 Mar 2014 22:01:04 +0000 (15:01 -0700)]
Merge branch 'firefly'

11 years agoceph.conf: tolerate 500ms of clock drift (up from 250)
Sage Weil [Tue, 18 Mar 2014 22:00:30 +0000 (15:00 -0700)]
ceph.conf: tolerate 500ms of clock drift (up from 250)

Signed-off-by: Sage Weil <sage@inktank.com>
11 years agoMerge remote-tracking branch 'gh/firefly'
Sage Weil [Tue, 18 Mar 2014 03:56:51 +0000 (20:56 -0700)]
Merge remote-tracking branch 'gh/firefly'

11 years agoMerge pull request #229 from dachary/wip-ec-pool
Sage Weil [Tue, 18 Mar 2014 01:08:15 +0000 (18:08 -0700)]
Merge pull request #229 from dachary/wip-ec-pool

ceph_manager: update ec_pool creation parameters

11 years agoceph_manager: update ec_pool creation parameters
Loic Dachary [Sun, 16 Mar 2014 17:05:42 +0000 (18:05 +0100)]
ceph_manager: update ec_pool creation parameters

As of https://github.com/ceph/ceph/pull/1477 the erasure code parameters
are controlled via the osd erasure-code-profile set command instead of
being inlined and duplicated in the crush ruleset creation and the pool
creation. There is no need to create the crush ruleset, it is done
implicitly.

Signed-off-by: Loic Dachary <loic@dachary.org>
11 years agoceph_manager: default to 16 initial pgs per pool
Samuel Just [Mon, 17 Mar 2014 21:04:41 +0000 (14:04 -0700)]
ceph_manager: default to 16 initial pgs per pool

1 is really too few for anything to happen at a
reasonable rate.

Signed-off-by: Samuel Just <sam.just@inktank.com>
11 years agoceph_manager::wait_for_clean: reset timeout if we make progress
Samuel Just [Mon, 17 Mar 2014 21:03:30 +0000 (14:03 -0700)]
ceph_manager::wait_for_clean: reset timeout if we make progress

Fixes: #7748
Signed-off-by: Samuel Just <sam.just@inktank.com>
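
The idea of restarting the timeout whenever progress is seen can be sketched like this. The function signature, the default values and the `get_num_clean` callback are assumptions made for illustration; the clock and sleep functions are injectable only so the sketch can be exercised without waiting.

```python
import time


def wait_for_clean(get_num_clean, total_pgs, timeout=300, poll=3,
                   clock=time.monotonic, sleep=time.sleep):
    """Wait until all PGs are clean; the timeout restarts on progress."""
    start = clock()
    num_clean = get_num_clean()
    while num_clean < total_pgs:
        if clock() - start >= timeout:
            raise RuntimeError("timed out waiting for clean")
        sleep(poll)
        current = get_num_clean()
        if current > num_clean:
            # More PGs are clean than last time, so restart the window.
            start = clock()
        num_clean = current
    return num_clean
```

With this shape a slow but steadily recovering cluster is not failed just because total recovery time exceeds the timeout.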
11 years agoMerge pull request #227 from ceph/wip-7749
Josh Durgin [Mon, 17 Mar 2014 19:10:27 +0000 (12:10 -0700)]
Merge pull request #227 from ceph/wip-7749

Pass '--object-sync-timeout 30' to radosgw-agent

Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
11 years agoPass '--object-sync-timeout 30' to radosgw-agent
Zack Cerza [Mon, 17 Mar 2014 18:54:52 +0000 (13:54 -0500)]
Pass '--object-sync-timeout 30' to radosgw-agent

http://tracker.ceph.com/issues/7749

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoMerge pull request #226 from ceph/wip-7714
Zack Cerza [Mon, 17 Mar 2014 18:42:27 +0000 (13:42 -0500)]
Merge pull request #226 from ceph/wip-7714

prevent undefined values in finally block

11 years agoFix max_job_time timeout
Zack Cerza [Mon, 17 Mar 2014 16:19:02 +0000 (11:19 -0500)]
Fix max_job_time timeout

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoprevent undefined values in finally block
Alfredo Deza [Fri, 14 Mar 2014 18:44:22 +0000 (14:44 -0400)]
prevent undefined values in finally block

Signed-off-by: Alfredo Deza <alfredo.deza@inktank.com>
11 years agoschedule_suite.sh: turn up journal log too
Sage Weil [Fri, 14 Mar 2014 16:35:54 +0000 (09:35 -0700)]
schedule_suite.sh: turn up journal log too

Signed-off-by: Sage Weil <sage@inktank.com>
11 years agoMake distro kernels work on Trusty.
Sandon Van Ness [Thu, 13 Mar 2014 23:02:04 +0000 (16:02 -0700)]
Make distro kernels work on Trusty.

Some grub changes and package names caused them to fail on trusty.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
11 years agoMerge remote-tracking branch 'gh/firefly'
Sage Weil [Thu, 13 Mar 2014 20:15:04 +0000 (13:15 -0700)]
Merge remote-tracking branch 'gh/firefly'

11 years agolock: allow -a with --brief
Sage Weil [Wed, 12 Mar 2014 15:57:23 +0000 (08:57 -0700)]
lock: allow -a with --brief

Signed-off-by: Sage Weil <sage@inktank.com>
11 years agoHandle newer btrfstools.
Sandon Van Ness [Wed, 12 Mar 2014 01:15:12 +0000 (18:15 -0700)]
Handle newer btrfstools.

Newer btrfs userland tools need a -f like xfs instead of a prompt
for yes. Trusty needs this change.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
11 years agoMerge pull request #222 from ceph/wip-trusty
wusui [Wed, 12 Mar 2014 01:48:13 +0000 (18:48 -0700)]
Merge pull request #222 from ceph/wip-trusty

Handle newer btrfstools.

11 years agoHandle newer btrfstools.
Sandon Van Ness [Wed, 12 Mar 2014 01:15:12 +0000 (18:15 -0700)]
Handle newer btrfstools.

Newer btrfs userland tools need a -f like xfs instead of a prompt
for yes. Trusty needs this change.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
11 years agoradosbench: cleanup will probably take longer than writing the objects
Samuel Just [Tue, 11 Mar 2014 20:55:30 +0000 (13:55 -0700)]
radosbench: cleanup will probably take longer than writing the objects

With thrashing, cleanup will require recovering the degraded objects
first, while the initial write out doesn't since they are new
objects.

Signed-off-by: Samuel Just <sam.just@inktank.com>
11 years agoMerge pull request #220 from ceph/wip-print
wusui [Tue, 11 Mar 2014 16:41:39 +0000 (09:41 -0700)]
Merge pull request #220 from ceph/wip-print

Add print task.

11 years agoMerge pull request #221 from ceph/wip-brief
wusui [Tue, 11 Mar 2014 03:00:38 +0000 (20:00 -0700)]
Merge pull request #221 from ceph/wip-brief

lock.py: allow --brief to stand on its own (without --list)

11 years agolock.py: allow --brief to stand on its own (without --list)
Dan Mick [Tue, 11 Mar 2014 02:38:46 +0000 (19:38 -0700)]
lock.py: allow --brief to stand on its own (without --list)

I use --brief all the time, and it's annoying to have to type
the useless --list at the same time.

Signed-off-by: Dan Mick <dan.mick@inktank.com>
11 years agoAdd print task.
Yuri Weinstein [Mon, 10 Mar 2014 23:45:03 +0000 (16:45 -0700)]
Add print task.

Made a task that simply prints the output that is given to it as
an argument. Can be used like any other task (under sequential,
etc...)

For example:

tasks:
- print: "String"
- chef: null
- print: "Another String"

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
Signed-off-by: Yuri Weinstein <yuri.weinstein@inktank.com>
11 years agoradosgw-admin: test data sync with more than max-entries objects
Josh Durgin [Mon, 10 Mar 2014 07:22:11 +0000 (00:22 -0700)]
radosgw-admin: test data sync with more than max-entries objects

For now hard code the max-entries for the radosgw-agent, since
this is the only test that is affected by it.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
11 years agoradosgw-admin: check that data deletions work as well
Josh Durgin [Mon, 10 Mar 2014 07:20:28 +0000 (00:20 -0700)]
radosgw-admin: check that data deletions work as well

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
11 years agoradosgw-admin: fix typo in data sync test
Josh Durgin [Mon, 10 Mar 2014 07:19:55 +0000 (00:19 -0700)]
radosgw-admin: fix typo in data sync test

re-indent and set k.key instead of k.Key

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
11 years agoradosgw-admin: ignore 301 test against the same src and dest regions
Josh Durgin [Mon, 10 Mar 2014 07:16:28 +0000 (00:16 -0700)]
radosgw-admin: ignore 301 test against the same src and dest regions

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
11 years agorgw: create clients in all zones when regions are used
Josh Durgin [Mon, 10 Mar 2014 07:25:32 +0000 (00:25 -0700)]
rgw: create clients in all zones when regions are used

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
11 years agorgw utils: add function to get data log window configuration
Josh Durgin [Mon, 10 Mar 2014 07:22:39 +0000 (00:22 -0700)]
rgw utils: add function to get data log window configuration

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
11 years agoradosgw-agent: add logging to radosgw-admin task tests
Josh Durgin [Mon, 10 Mar 2014 07:12:00 +0000 (00:12 -0700)]
radosgw-agent: add logging to radosgw-admin task tests

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
11 years agorgw: set placement targets for zones
Josh Durgin [Tue, 26 Nov 2013 02:34:38 +0000 (18:34 -0800)]
rgw: set placement targets for zones

This lets zones use different pools for actual object data.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
11 years agoradosgw-admin: don't compare pools for different zones
Josh Durgin [Tue, 26 Nov 2013 02:33:09 +0000 (18:33 -0800)]
radosgw-admin: don't compare pools for different zones

The actual data locations will be different if they are in the same
cluster.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
11 years agoAdds radosgw-agent small file sync test
Christophe Courtaut [Thu, 10 Oct 2013 10:28:46 +0000 (12:28 +0200)]
Adds radosgw-agent small file sync test

Signed-off-by: Christophe Courtaut <christophe.courtaut@gmail.com>
11 years agoworkunit: change timeout 1h -> 3h
Sage Weil [Sun, 9 Mar 2014 17:29:37 +0000 (10:29 -0700)]
workunit: change timeout 1h -> 3h

Signed-off-by: Sage Weil <sage@inktank.com>
11 years agoAdd missing space in error message
Zack Cerza [Sat, 8 Mar 2014 23:26:14 +0000 (17:26 -0600)]
Add missing space in error message

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoMake try_push_job_info() retry using safe_while
Zack Cerza [Sat, 8 Mar 2014 21:21:45 +0000 (15:21 -0600)]
Make try_push_job_info() retry using safe_while

I've noticed sometimes try_push_job_info() fails because of server load
issues. It should try more than once (and now does).

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoUpdate safe_while's suggested usage pattern
Zack Cerza [Sat, 8 Mar 2014 21:19:31 +0000 (15:19 -0600)]
Update safe_while's suggested usage pattern

I didn't love the way safe_while was encouraged to be used and it didn't
fit right with the new no-raising behavior. Now it's encouraged to be
used like this:

with safe_while() as proceed:
    while proceed():
        do_things()

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
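
For readers unfamiliar with the pattern, here is a minimal sketch of a helper that supports it. It is not the teuthology implementation: the defaults (6s sleep, 10 tries, no increment) and the `action` and `_raise` parameters follow the neighbouring commit messages, while the exception name, the `_sleeper` hook and the message wording are assumptions.

```python
import logging
import time

log = logging.getLogger(__name__)


class MaxWhileTries(Exception):
    """Raised when the loop runs out of tries (name chosen for this sketch)."""


class safe_while:
    def __init__(self, sleep=6, tries=10, increment=0, action=None,
                 _raise=True, _sleeper=time.sleep):
        self.sleep = sleep
        self.tries = tries
        self.increment = increment
        self.action = action or "unknown action"
        self._raise = _raise
        self._sleeper = _sleeper  # injectable so callers can avoid real sleeps
        self.counter = 0

    def __enter__(self):
        return self

    def __exit__(self, exc_type, exc_value, traceback):
        return False  # never swallow exceptions from the loop body

    def __call__(self):
        self.counter += 1
        if self.counter == 1:
            return True  # the first attempt runs immediately
        if self.counter > self.tries:
            msg = "reached maximum tries (%d) while waiting for %s" % (
                self.tries, self.action)
            if self._raise:
                raise MaxWhileTries(msg)
            log.warning(msg)
            return False
        self._sleeper(self.sleep)
        self.sleep += self.increment
        return True
```

Because `proceed()` is the loop condition, a caller that passes `_raise=False` simply falls out of the `while` when the tries are exhausted, which is why this usage pattern fits the no-raising behavior.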
11 years agoAdd optional _raise parameter
Zack Cerza [Sat, 8 Mar 2014 20:58:39 +0000 (14:58 -0600)]
Add optional _raise parameter

Defaults to True. If set to False, log a warning when giving up
instead of raising an exception.

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoPass timeout to _spawn_on_all_clients()
Zack Cerza [Sat, 8 Mar 2014 20:16:07 +0000 (14:16 -0600)]
Pass timeout to _spawn_on_all_clients()

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoLog correct action in CephManager.remove_pool()
Zack Cerza [Sat, 8 Mar 2014 14:12:16 +0000 (08:12 -0600)]
Log correct action in CephManager.remove_pool()

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoLog timeout value
Zack Cerza [Sat, 8 Mar 2014 14:01:00 +0000 (08:01 -0600)]
Log timeout value

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoIterate more sensibly over processes
Zack Cerza [Sat, 8 Mar 2014 13:58:13 +0000 (07:58 -0600)]
Iterate more sensibly over processes

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoChange default workunit timeout to 1h
Zack Cerza [Fri, 7 Mar 2014 21:14:50 +0000 (15:14 -0600)]
Change default workunit timeout to 1h

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoUse safe_while's action arg
Zack Cerza [Fri, 7 Mar 2014 20:04:49 +0000 (14:04 -0600)]
Use safe_while's action arg

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoAdd optional 'action' parameter to safe_while
Zack Cerza [Fri, 7 Mar 2014 20:02:33 +0000 (14:02 -0600)]
Add optional 'action' parameter to safe_while

This makes it easier to see what actually timed out when scanning
error logs.

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoUpdate safe_while users to reflect new defaults
Zack Cerza [Fri, 7 Mar 2014 19:36:35 +0000 (13:36 -0600)]
Update safe_while users to reflect new defaults

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoChange safe_while defaults to 6s 10x no increment
Zack Cerza [Fri, 7 Mar 2014 19:33:27 +0000 (13:33 -0600)]
Change safe_while defaults to 6s 10x no increment

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoLook for ready() in the right place
Zack Cerza [Fri, 7 Mar 2014 19:03:58 +0000 (13:03 -0600)]
Look for ready() in the right place

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoMerge pull request #218 from ceph/wip-radosbench-timeout
Sage Weil [Fri, 7 Mar 2014 17:39:04 +0000 (09:39 -0800)]
Merge pull request #218 from ceph/wip-radosbench-timeout

Introduce a timeout to radosbench's join phase

11 years agoUse a timeout of config.get('time') * 2
Zack Cerza [Fri, 7 Mar 2014 17:36:45 +0000 (11:36 -0600)]
Use a timeout of config.get('time') * 2

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoIntroduce a timeout to radosbench's join phase
Zack Cerza [Fri, 7 Mar 2014 17:21:31 +0000 (11:21 -0600)]
Introduce a timeout to radosbench's join phase

Set to 15min right now.

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoMark this 'while True' loop with 'finite' comment
Zack Cerza [Fri, 7 Mar 2014 05:09:44 +0000 (23:09 -0600)]
Mark this 'while True' loop with 'finite' comment

If we're going to embark on a mission to rid ourselves of
infinitely-looping while loops, it seems smart to start marking the ones
we've fixed in order to make grepping for unfixed loops easier.

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoGive up on wait_until_healthy() after 15min
Zack Cerza [Fri, 7 Mar 2014 04:31:48 +0000 (22:31 -0600)]
Give up on wait_until_healthy() after 15min

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoMerge pull request #217 from ceph/wip-fix-plus
Zack Cerza [Fri, 7 Mar 2014 02:00:09 +0000 (20:00 -0600)]
Merge pull request #217 from ceph/wip-fix-plus

suite: fix + handling

11 years agosuite: fix build_matrix for + case
Sage Weil [Fri, 7 Mar 2014 01:55:00 +0000 (17:55 -0800)]
suite: fix build_matrix for + case

The + means we should concatenate everything in the directory.  Do that.

This was totally broken before (and unused until now).

Signed-off-by: Sage Weil <sage@inktank.com>
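
The concatenation rule can be illustrated with a small sketch. It assumes an in-memory tree (a dict maps names to subtrees, `None` marks a yaml fragment, and a `+` key stands for the marker file) instead of walking real directories, and it handles only the `+` case described above.

```python
def build_matrix(name, node):
    """Return a list of jobs; each job is a list of fragment paths."""
    if node is None:
        return [[name]]  # a single yaml fragment is a job on its own
    children = sorted(key for key in node if key != "+")
    if "+" in node:
        # '+' marker: concatenate every fragment below into one job.
        combined = []
        for child in children:
            for job in build_matrix(f"{name}/{child}", node[child]):
                combined.extend(job)
        return [combined]
    # No marker: each child contributes its own alternative jobs.
    jobs = []
    for child in children:
        jobs.extend(build_matrix(f"{name}/{child}", node[child]))
    return jobs
```

For a directory holding `+`, `a.yaml` and `b.yaml`, this yields one job made of both fragments rather than two separate jobs.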
11 years agorados: use backwards compatible args
Josh Durgin [Fri, 7 Mar 2014 01:31:07 +0000 (17:31 -0800)]
rados: use backwards compatible args

For ops that default to 0, only add arguments for them if they are
specified in the task config. This lets us use the same task across
ceph versions, even if the older version does not support new op
types, like append on dumpling.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
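
A rough sketch of that argument-building rule follows. The `--op NAME WEIGHT` layout, the op names other than `append`, and the default weights are illustrative assumptions for a hypothetical test binary, not the task's real values.

```python
# Ops that always get an argument, with illustrative default weights.
ALWAYS = {"read": 100, "write": 100, "delete": 10}
# Ops whose default weight is 0; older binaries may not recognise them.
OPTIONAL = ("append",)


def build_op_args(config_ops):
    """Build op arguments, omitting zero-default ops that were not requested."""
    args = []
    for op, default in ALWAYS.items():
        args += ["--op", op, str(config_ops.get(op, default))]
    for op in OPTIONAL:
        if op in config_ops:
            # Only mention the op when the task config asks for it.
            args += ["--op", op, str(config_ops[op])]
    return args
```

An older release then never sees an op type it does not understand unless the job explicitly configures one.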
11 years agoMerge remote-tracking branch 'gh/firefly'
Sage Weil [Fri, 7 Mar 2014 00:57:45 +0000 (16:57 -0800)]
Merge remote-tracking branch 'gh/firefly'

11 years agoRevert "Do not spawn a parallel task if dictionary entry does not exist."
Sage Weil [Fri, 7 Mar 2014 00:56:14 +0000 (16:56 -0800)]
Revert "Do not spawn a parallel task if dictionary entry does not exist."

This reverts commit dadc9f7d0b6d9c13ccbc4edeb9f7875fc0ae0cac.

11 years agoceph.conf.template: add in sensible erasure coding defaults
Samuel Just [Wed, 5 Mar 2014 20:39:10 +0000 (12:39 -0800)]
ceph.conf.template: add in sensible erasure coding defaults

Signed-off-by: Samuel Just <sam.just@inktank.com>
11 years agoceph_manager: fix erasure coding m, k values
Samuel Just [Wed, 5 Mar 2014 20:38:51 +0000 (12:38 -0800)]
ceph_manager: fix erasure coding m, k values

Signed-off-by: Samuel Just <sam.just@inktank.com>
11 years agoLog job PID
Zack Cerza [Wed, 5 Mar 2014 20:13:04 +0000 (14:13 -0600)]
Log job PID

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoMerge pull request #216 from ceph/wip-workunit-timeout
Sage Weil [Wed, 5 Mar 2014 18:14:50 +0000 (10:14 -0800)]
Merge pull request #216 from ceph/wip-workunit-timeout

Add a 6h timeout to workunits

11 years agoAdd a 6h timeout to workunits
Zack Cerza [Wed, 5 Mar 2014 17:17:13 +0000 (11:17 -0600)]
Add a 6h timeout to workunits

The timeout is configurable, but defaults to six hours. It's implemented
by using the 'timeout' command on the remote host.

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
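
Wrapping a remote command this way might look like the sketch below. The helper name and the `timeout` config key are assumptions; the `timeout` command itself is the coreutils utility, which accepts durations such as `6h`.

```python
def wrap_with_timeout(command, config=None):
    """Prefix a remote command with 'timeout DURATION'."""
    config = config or {}
    # Hypothetical config key; falls back to the six hour default.
    duration = config.get("timeout", "6h")
    return ["timeout", duration] + list(command)
```

GNU `timeout` exits with status 124 when the limit is hit, so a caller can tell a timed-out workunit from an ordinary failure.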
11 years agoMerge pull request #215 from dachary/wip-ec-pool
Zack Cerza [Mon, 3 Mar 2014 20:47:52 +0000 (14:47 -0600)]
Merge pull request #215 from dachary/wip-ec-pool

ceph-manager: fix ec_pool parameters

11 years agoMerge pull request #207 from ceph/wip-7356
Zack Cerza [Mon, 3 Mar 2014 16:24:27 +0000 (10:24 -0600)]
Merge pull request #207 from ceph/wip-7356

helper for bombing out of infinite loops

11 years agoceph-manager: fix ec_pool parameters
Loic Dachary [Sat, 1 Mar 2014 23:41:21 +0000 (00:41 +0100)]
ceph-manager: fix ec_pool parameters

* the crush ruleset and the pool create parameters must be identical:
  k=2 m=1
* the --property argument is invalid
* the failure domain is ignored on pool create

Signed-off-by: Loic Dachary <loic@dachary.org>
11 years agoMerge pull request #213 from ceph/wip-kdb-except
Alfredo Deza [Fri, 28 Feb 2014 23:32:25 +0000 (18:32 -0500)]
Merge pull request #213 from ceph/wip-kdb-except

Allow setting kdb to fail.

11 years agoAllow setting kdb to fail.
Sandon Van Ness [Fri, 28 Feb 2014 22:16:32 +0000 (14:16 -0800)]
Allow setting kdb to fail.

Some kernels (primarily Debian distro kernels) do not support
setting kdb. Rather than having the entire test fail, allow
setting kdb to fail.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
11 years agoMerge pull request #212 from ceph/wip-limit
wusui [Fri, 28 Feb 2014 19:13:46 +0000 (11:13 -0800)]
Merge pull request #212 from ceph/wip-limit

Added --limit option to teuthology-suite.

11 years agoMerge pull request #211 from ceph/wip-7554
Zack Cerza [Fri, 28 Feb 2014 17:04:23 +0000 (11:04 -0600)]
Merge pull request #211 from ceph/wip-7554

mds_thrash #7554

11 years agoAdded --limit option to teuthology-suite.
Yuri Weinstein [Fri, 28 Feb 2014 00:34:58 +0000 (00:34 +0000)]
Added --limit option to teuthology-suite.

Use --limit to limit the number of jobs being scheduled during
teuthology-suite. Also can be used with schedule_suite.sh via the
10th argument.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
Signed-off-by: Yuri Weinstein <yuri.weinstein@inktank.com>
11 years agomds_thrash: Fix a potential getitem on None
John Spray [Thu, 27 Feb 2014 14:22:02 +0000 (14:22 +0000)]
mds_thrash: Fix a potential getitem on None

get_mds_status returns None for things it can't see,
so have to check for Noneness on all its outputs.

Signed-off-by: John Spray <john.spray@inktank.com>
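
The kind of guard this implies can be sketched as follows. The helper name and the shape of the status dict (a `state` key) are assumptions for illustration only.

```python
def get_state(status):
    """Return the MDS state, or None when the status lookup saw nothing."""
    if status is None:
        # The lookup could not see this MDS; indexing None would raise TypeError.
        return None
    return status["state"]
```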
11 years agomds_thrash: Refactor gevent usage + get traceback
John Spray [Thu, 27 Feb 2014 14:17:54 +0000 (14:17 +0000)]
mds_thrash: Refactor gevent usage + get traceback

This simplifies the code to make MdsTrash be a greenlet
(as it logically is) rather than encapsulating one that
gets started in __init__ (spawning threads in constructors
is evil).

With this done, do_thrash is called from _run inside an
exception handler that will give us full tracebacks
if something bad happens.

Signed-off-by: John Spray <john.spray@inktank.com>
11 years agomds_thrash: PEP8-ize whitespace
John Spray [Thu, 27 Feb 2014 14:09:52 +0000 (14:09 +0000)]
mds_thrash: PEP8-ize whitespace

...so that I can edit the code in a python IDE without
it lighting up like a christmas tree!

Signed-off-by: John Spray <john.spray@inktank.com>
11 years agoWorker logging tweaks
Zack Cerza [Wed, 26 Feb 2014 23:15:37 +0000 (17:15 -0600)]
Worker logging tweaks

Change some statements' log levels; don't show bootstrap output if there
is no error.

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years ago--dead implies --refresh
Zack Cerza [Wed, 26 Feb 2014 22:41:43 +0000 (16:41 -0600)]
--dead implies --refresh

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoSymlink worker log after child starts
Zack Cerza [Wed, 26 Feb 2014 22:22:32 +0000 (16:22 -0600)]
Symlink worker log after child starts

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoIn find_job_info(), also look for orig.config.yaml
Zack Cerza [Wed, 26 Feb 2014 19:13:41 +0000 (13:13 -0600)]
In find_job_info(), also look for orig.config.yaml

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoPush complete info when reporting jobs as dead
Zack Cerza [Wed, 26 Feb 2014 16:50:39 +0000 (10:50 -0600)]
Push complete info when reporting jobs as dead

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoMerge pull request #210 from ceph/wip-queue
Zack Cerza [Wed, 26 Feb 2014 17:47:02 +0000 (11:47 -0600)]
Merge pull request #210 from ceph/wip-queue

Add teuthology-queue command for beanstalk management.

11 years agoMerge pull request #208 from ceph/wip-7485
Gregory Farnum [Wed, 26 Feb 2014 17:46:43 +0000 (09:46 -0800)]
Merge pull request #208 from ceph/wip-7485

task: Add mds_creation_failure

Reviewed-by: Greg Farnum <greg@inktank.com>
11 years agofix docstring typo
Alfredo Deza [Wed, 26 Feb 2014 13:17:48 +0000 (08:17 -0500)]
fix docstring typo

Signed-off-by: Alfredo Deza <alfredo.deza@inktank.com>