git.apps.os.sepia.ceph.com Git - ceph.git/log
12 years agolocker: try to make up for apache timeouts
Josh Durgin [Fri, 29 Mar 2013 23:33:49 +0000 (16:33 -0700)]
locker: try to make up for apache timeouts

If the lock request succeeds in updating the db, but the client gets a
timeout from apache, they can now try again and get back the machines
they just locked.

Only automatic runs have a description set when locking several
machines, so this does not affect users of teuthology-lock
--lock-many, where no description can be set in the same request.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
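
The retry behaviour described in the locker commit above can be sketched roughly as follows. This is only an illustration under stated assumptions: the in-memory `machines` table stands in for the lock server's database, and `lock_many` with its parameters is a hypothetical name, not the locker's actual API.

```python
def lock_many(machines, user, count, description=None):
    """Lock `count` free machines for `user` (hypothetical in-memory model).

    If a description is given and `user` already holds exactly `count`
    machines locked with that description, hand those back instead. This is
    the retry case: an earlier request updated the table, but the client
    never saw the reply.
    """
    if description is not None:
        already = sorted(
            name for name, m in machines.items()
            if m["locked_by"] == user and m["description"] == description
        )
        if len(already) == count:
            return already
    free = sorted(n for n, m in machines.items() if m["locked_by"] is None)
    if len(free) < count:
        raise RuntimeError("not enough free machines")
    chosen = free[:count]
    for name in chosen:
        machines[name] = {"locked_by": user, "description": description}
    return chosen
```

A request without a description never matches the retry branch, which mirrors the note that teuthology-lock --lock-many users are unaffected.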
12 years agodo not archive on pass if 'archive-on-error: True'
Sage Weil [Fri, 29 Mar 2013 19:19:46 +0000 (12:19 -0700)]
do not archive on pass if 'archive-on-error: True'

Optional flag makes us skip sucking down the archive (mostly the logs, which
might be huge for some debugging tests) unless the test has failed.

Signed-off-by: Sage Weil <sage@inktank.com>
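
The decision made by the 'archive-on-error' flag in the commit above might look like this. It is a minimal sketch: the key name comes from the commit title, while `should_archive` and the plain-dict config are assumptions made for illustration.

```python
def should_archive(config, passed):
    """Return True if the run's archive (mostly logs) should be pulled back."""
    # With 'archive-on-error: True', a passing run skips the transfer.
    if config.get("archive-on-error", False) and passed:
        return False
    return True
```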
12 years agolocker: log desc too
Sage Weil [Fri, 29 Mar 2013 21:27:04 +0000 (14:27 -0700)]
locker: log desc too

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agorun: clean up machine_type thing
Sage Weil [Fri, 29 Mar 2013 19:16:39 +0000 (12:16 -0700)]
run: clean up machine_type thing

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoceph_manager: retry set_pool_property on EAGAIN
Sage Weil [Thu, 28 Mar 2013 22:24:33 +0000 (15:24 -0700)]
ceph_manager: retry set_pool_property on EAGAIN

Retry indefinitely, for now.

Signed-off-by: Sage Weil <sage@inktank.com>
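
A retry loop of the kind the ceph_manager commit above describes could be sketched as below. This is an assumption-laden illustration: the failure is modelled as an `OSError` carrying `errno.EAGAIN`, and `retry_on_eagain` is a hypothetical helper, not the code in ceph_manager.

```python
import errno
import time


def retry_on_eagain(operation, delay=1.0, sleep=time.sleep):
    """Call operation() until it stops failing with EAGAIN.

    Retries indefinitely, as the commit message says; any other error
    is re-raised to the caller.
    """
    while True:
        try:
            return operation()
        except OSError as e:
            if e.errno != errno.EAGAIN:
                raise
            sleep(delay)
```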
12 years agorun: machine-type: foo, not machine_type: foo
Sage Weil [Thu, 28 Mar 2013 17:50:40 +0000 (10:50 -0700)]
run: machine-type: foo, not machine_type: foo

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoMerge pull request #6 from ceph/wip-mds-thrasher-logging
Sage Weil [Wed, 27 Mar 2013 15:56:04 +0000 (08:56 -0700)]
Merge pull request #6 from ceph/wip-mds-thrasher-logging

task/mds_thrash: Log mds dump after long delay

12 years agotask/mds_thrash: Log mds dump after long delay
Sam Lang [Wed, 27 Mar 2013 13:48:45 +0000 (08:48 -0500)]
task/mds_thrash: Log mds dump after long delay

In cases where the mds thrasher continuously loops
waiting for an mds to be removed from the map, or
for a new mds to become active, we want to start logging
the mds state for debugging.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
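
The idea in the mds_thrash commit above, logging state once a wait loop has gone on for a long time, can be sketched like this. The names `wait_for`, `dump_state` and `dump_after` are hypothetical and the thresholds are arbitrary; the real task's loop is not shown in the log.

```python
import itertools
import logging
import time

log = logging.getLogger(__name__)


def wait_for(condition, dump_state, poll=1.0, dump_after=30, sleep=time.sleep):
    """Poll condition() until it is true and return the number of polls.

    Once `dump_after` polls have been reached without success, log the
    output of dump_state() on every further failed poll to aid debugging.
    """
    for attempt in itertools.count(1):
        if condition():
            return attempt
        if attempt >= dump_after:
            log.info("still waiting after %d polls: %s", attempt, dump_state())
        sleep(poll)
```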
12 years agolocker: make desc optional
Sage Weil [Tue, 26 Mar 2013 20:27:53 +0000 (13:27 -0700)]
locker: make desc optional

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoceph.conf: osd debug op order = true
Sage Weil [Thu, 7 Mar 2013 05:35:41 +0000 (21:35 -0800)]
ceph.conf: osd debug op order = true

Debug the osd op ordering by default.  Most of the runs have a small number
of clients, which makes the STL maps cheap.

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agolocker/api: fix DELETE
Sage Weil [Tue, 26 Mar 2013 18:40:13 +0000 (11:40 -0700)]
locker/api: fix DELETE

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoMerge branch 'wip-lock'
Sage Weil [Tue, 26 Mar 2013 18:34:33 +0000 (11:34 -0700)]
Merge branch 'wip-lock'

Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
12 years agolock: pass desc to lock operation; leave on unlock
Sage Weil [Mon, 25 Mar 2013 23:46:48 +0000 (16:46 -0700)]
lock: pass desc to lock operation; leave on unlock

Pass the desc to the lock operation.

The unlock operation now clears desc for us; no need to do it ourselves.

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agolocker: set desc on lock
Sage Weil [Mon, 25 Mar 2013 23:42:59 +0000 (16:42 -0700)]
locker: set desc on lock

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agolocker: clear desc on unlock
Sage Weil [Mon, 25 Mar 2013 23:41:15 +0000 (16:41 -0700)]
locker: clear desc on unlock

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agothrashosds: add test_backfill_full
Samuel Just [Thu, 21 Mar 2013 21:37:38 +0000 (14:37 -0700)]
thrashosds: add test_backfill_full

Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Dan Mick <dan.mick@inktank.com>
12 years agothrashosds.py: fix line length
Samuel Just [Thu, 21 Mar 2013 21:10:13 +0000 (14:10 -0700)]
thrashosds.py: fix line length

Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Dan Mick <dan.mick@inktank.com>
12 years agolocker: log updates
Josh Durgin [Mon, 25 Mar 2013 22:01:26 +0000 (15:01 -0700)]
locker: log updates

Note whenever locks are acquired/released, or a machine's description is updated.
Under apache, these will go to error.log.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
12 years agoceph-deploy: purge /var/lib/ceph data on finish
Sage Weil [Sun, 24 Mar 2013 22:12:59 +0000 (15:12 -0700)]
ceph-deploy: purge /var/lib/ceph data on finish

The install task does this now that the package doesn't; we
need to too.

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoverify /var/lib/ceph not present on start
Sage Weil [Sun, 24 Mar 2013 03:58:46 +0000 (20:58 -0700)]
verify /var/lib/ceph not present on start

Verify there is no /var/lib/ceph, just like we do with the cephtest
directory.  We will need to change this (or make it optional) when we
allow runs against an existing cluster, but a whole bunch of other
things will need to change then as well.

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoinstall: need sudo when purging /var/lib/ceph
Sage Weil [Sun, 24 Mar 2013 03:53:51 +0000 (20:53 -0700)]
install: need sudo when purging /var/lib/ceph

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoinstall, nuke: explicitly purge /var/lib/ceph
Sage Weil [Thu, 21 Mar 2013 05:51:24 +0000 (22:51 -0700)]
install, nuke: explicitly purge /var/lib/ceph

The packages won't do this anymore.

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoteuthology: cleanup client dirs for workunit task
Joe Buck [Fri, 22 Mar 2013 18:56:50 +0000 (11:56 -0700)]
teuthology: cleanup client dirs for workunit task

This patch corrects an issue where a workunit task is
not cleaning up generated directories
if the 'all' key is used to specify clients.

Signed-off-by: Joe Buck <jbbuck@gmail.com>
Reviewed-by: Sam Lang <sam.lang@inktank.com>
12 years agostop ignoring osd leaks
Sage Weil [Fri, 22 Mar 2013 03:40:48 +0000 (20:40 -0700)]
stop ignoring osd leaks

Note that the mds is the only one left that we are ignoring.

12 years agomoving client.keyring creation out of ceph task
tamil [Thu, 21 Mar 2013 23:14:54 +0000 (16:14 -0700)]
moving client.keyring creation out of ceph task

Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
12 years agolock: make do_summary() respect --machine-type
Dan Mick [Thu, 21 Mar 2013 01:30:29 +0000 (18:30 -0700)]
lock: make do_summary() respect --machine-type

Signed-off-by: Dan Mick <dan.mick@inktank.com>
12 years agostop and restart daemons as restart only starts.
tamil [Thu, 21 Mar 2013 00:40:46 +0000 (17:40 -0700)]
stop and restart daemons as restart only starts.

Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
12 years agotask/ceph: Revert extra check for running status
Sam Lang [Tue, 19 Mar 2013 20:42:51 +0000 (15:42 -0500)]
task/ceph:  Revert extra check for running status

Don't use exit status info to track daemon state.  We need to find
a better way to do this for the restart task.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
12 years agosuite: shorten subject a bit
Sage Weil [Tue, 19 Mar 2013 19:02:14 +0000 (12:02 -0700)]
suite: shorten subject a bit

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoMerge branch 'wip-4197'
David Zafman [Tue, 19 Mar 2013 18:30:41 +0000 (11:30 -0700)]
Merge branch 'wip-4197'

12 years agoosd: data loss: low space handling
David Zafman [Fri, 15 Mar 2013 04:53:44 +0000 (21:53 -0700)]
osd: data loss: low space handling

Automated test cases for feature #4197

Signed-off-by: David Zafman <david.zafman@inktank.com>
Reported-by: Sam Just <sam.just@inktank.com>
12 years agoFixed so that installation works on a brand new CentOS system.
Warren Usui [Mon, 18 Mar 2013 22:25:59 +0000 (15:25 -0700)]
Fixed so that installation works on a brand new CentOS system.

Do yum install rather than yum reinstall for CentOS.
When exiting CentOS, yum erase the ceph-release rpm.

Signed-off-by: Warren Usui <warren.usui@inktank.com>
12 years agotask/restart: Handle error from script correctly
Sam Lang [Tue, 19 Mar 2013 13:08:05 +0000 (08:08 -0500)]
task/restart:  Handle error from script correctly

The exitstatus on the process is a gevent.AsyncResult
(not an int).  Use the try/except pattern for handling
errors instead.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
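
The try/except pattern mentioned in the task/restart commit above can be illustrated with the standard library. This is only an analogue: it uses `subprocess` rather than the gevent-based remote runner the task works with, and `run_script` is a hypothetical name.

```python
import subprocess
import sys


def run_script(argv):
    """Run argv and return (ok, returncode).

    Failure is detected by catching the exception raised for a non-zero
    exit instead of comparing a status attribute against an integer.
    """
    try:
        subprocess.run(argv, check=True)
    except subprocess.CalledProcessError as e:
        return False, e.returncode
    return True, 0
```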
12 years agos/dist-upgrade/upgrade
tamil [Mon, 18 Mar 2013 23:29:18 +0000 (16:29 -0700)]
s/dist-upgrade/upgrade

Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
12 years agoFixed ceph-fuse mount point cleanup bug
Warren Usui [Sat, 16 Mar 2013 01:18:56 +0000 (18:18 -0700)]
Fixed ceph-fuse mount point cleanup bug

Tested for the existence of /sys/fs/fuse/connections/*/abort
before clobbering it.  This problem was generated when all
the machines were virtual CentOS machines.

Signed-off-by: Warren Usui <warren.usui@inktank.com>
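
The existence check from the ceph-fuse commit above might be sketched as follows. The path pattern is taken from the commit message; the helper name, the `root` parameter and the value written are assumptions for illustration.

```python
import glob
import os


def abort_fuse_connections(root="/sys/fs/fuse/connections"):
    """Write to every existing <root>/*/abort file; return the paths written.

    glob() yields nothing when no such file exists, so a machine without
    any fuse connections is left untouched.
    """
    written = []
    for path in sorted(glob.glob(os.path.join(root, "*", "abort"))):
        with open(path, "w") as f:
            f.write("1")
        written.append(path)
    return written
```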
12 years agotask/restart: Cleanup in finally
Sam Lang [Mon, 18 Mar 2013 16:28:51 +0000 (11:28 -0500)]
task/restart:  Cleanup in finally

Need to cleanup the files created for this test from
the testdir.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
12 years agotask/restart: Fix check for done
Sam Lang [Mon, 18 Mar 2013 16:27:11 +0000 (11:27 -0500)]
task/restart: Fix check for done

The last command a restart script outputs is 'done'
indicating the script does not require being restarted
further.  Handle this case properly.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
12 years agotask/restart: Restart task for testing daemon kill
Sam Lang [Mon, 11 Mar 2013 18:22:10 +0000 (13:22 -0500)]
task/restart: Restart task for testing daemon kill

The ceph daemons support being killed at a specific code point
with a config option.  In some cases, we want to test a kill point
only once for a given daemon run (such as replay that only occurs
during daemon startup).  This task allows running a script or executable
and (when the script sends a command to the task) restarting it with
a temporary config that has the appropriate kill point set.  Once
the daemon asserts and gets restarted, the original config is used.

Adds a specific restart_with_args() method to the DaemonState in the
ceph task.

Right now this task follows the workunit task closely, but uses stdout/stdin
to specify when to restart a daemon.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
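
The stdout/stdin exchange described in the task/restart commit above could look roughly like this. The command words (`restart`, `done`) and the acknowledgement string are a hypothetical protocol invented for the sketch; only the `restart_with_args` name comes from the commit message, and here it is passed in as a plain callback.

```python
def drive_restart_script(lines, restart_with_args, reply):
    """Process commands read from a restart script's stdout.

    'restart <daemon> <arg>...' restarts that daemon with temporary
    arguments and acknowledges on the script's stdin via reply().
    'done' means the script needs no further restarts.
    Returns True if 'done' was seen, False if the output simply ended.
    """
    for line in lines:
        words = line.split()
        if not words:
            continue
        if words[0] == "done":
            return True
        if words[0] == "restart":
            restart_with_args(words[1], words[2:])
            reply("restarted\n")
    return False
```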
12 years agoadded ceph_health check and a few log messages
tamil [Fri, 15 Mar 2013 22:50:52 +0000 (15:50 -0700)]
added ceph_health check and a few log messages

Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
12 years agoFixed 'clock:' on Centos
Warren Usui [Fri, 15 Mar 2013 01:06:17 +0000 (18:06 -0700)]
Fixed 'clock:' on Centos

ntpdc commands were formerly returning -127 on CentOS

Signed-off-by: Warren Usui <warren.usui@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
12 years agoadded install.upgrade task
tamil [Fri, 15 Mar 2013 01:26:03 +0000 (18:26 -0700)]
added install.upgrade task

Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
12 years agoadded task for ceph.restart
tamil [Fri, 15 Mar 2013 01:18:39 +0000 (18:18 -0700)]
added task for ceph.restart

Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
12 years agoinstall: do debs for 'Debian'
Sage Weil [Thu, 14 Mar 2013 23:27:01 +0000 (16:27 -0700)]
install: do debs for 'Debian'

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoinstall: el6 -> rpm
Sage Weil [Thu, 14 Mar 2013 23:26:25 +0000 (16:26 -0700)]
install: el6 -> rpm

Slightly more general.

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoAdded el6 install functionality for CentOS systems.
Warren Usui [Thu, 14 Mar 2013 21:24:50 +0000 (14:24 -0700)]
Added el6 install functionality for CentOS systems.

install_packages, remove_packages and remove_sources are now the
installation and removal functions used by teuthology.  Debian
references have been removed outside of tasks/install.py.  CentOS
functionality parallel to Debian have been added to tasks/install.py,
and el6 references have been added to nuke.py, task/ceph-fuse.y and
task/install.py.

Some files created by CentOS are removed with rm -fr.  This should
be changed once the installation/removal rpm procedure is implemented.

Signed-off-by: Warren Usui <warren.usui@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
12 years agoUse service instead of initctl to restart rsyslog.
Warren Usui [Wed, 13 Mar 2013 23:42:09 +0000 (16:42 -0700)]
Use service instead of initctl to restart rsyslog.

This change is needed to make sure teuthology works on CentOS when the
-a option is specified.

Signed-off-by: Warren Usui <warren.usui@inktank.com>
12 years agoMerge remote-tracking branch 'gh/wip-powercycle-fixes'
Sage Weil [Thu, 14 Mar 2013 01:09:06 +0000 (18:09 -0700)]
Merge remote-tracking branch 'gh/wip-powercycle-fixes'

12 years agoceph.conf: auth debug = true
Sage Weil [Wed, 13 Mar 2013 20:39:54 +0000 (13:39 -0700)]
ceph.conf: auth debug = true

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoceph-deploy: uninstall even when no archive
Sage Weil [Tue, 12 Mar 2013 00:06:57 +0000 (17:06 -0700)]
ceph-deploy: uninstall even when no archive

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agotask/ceph_manager: Only reconnect if powercycled
Sam Lang [Wed, 13 Mar 2013 16:05:45 +0000 (11:05 -0500)]
task/ceph_manager: Only reconnect if powercycled

We don't need to reconnect for remotes
that haven't been powercycled.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
12 years agoschedule_suite.sh: crank up mon logs
Sage Weil [Wed, 13 Mar 2013 15:23:59 +0000 (08:23 -0700)]
schedule_suite.sh: crank up mon logs

Until we find #4279.

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agotask/thrashosds: Ipmi checking/setup in thrashosds
Sam Lang [Wed, 13 Mar 2013 15:11:06 +0000 (10:11 -0500)]
task/thrashosds: Ipmi checking/setup in thrashosds

We don't need to setup the ipmi console on runs that
don't use powercycling, so delay setup of the RemoteConsole
with ipmi to the thrashosd task and only then if the powercycle
config is set.  This avoids spurious test failures from flaky
ipmi.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
12 years agotask/ceph_manager: Check that ipmi is enabled
Sam Lang [Wed, 13 Mar 2013 03:21:12 +0000 (22:21 -0500)]
task/ceph_manager:  Check that ipmi is enabled

If powercycling was requested for the osd thrasher
we should ensure that we are able to reach the
ipmi console.  This helps us avoid weird errors.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
12 years agotask/ceph: Recreate /var/run/ceph on powercycle
Sam Lang [Wed, 13 Mar 2013 03:17:16 +0000 (22:17 -0500)]
task/ceph: Recreate /var/run/ceph on powercycle

If a node is powercycled during testing, we need to
recreate /var/run/ceph once the node restarts.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
12 years agolock: tolerate description of None
Sage Weil [Tue, 12 Mar 2013 16:49:42 +0000 (09:49 -0700)]
lock: tolerate description of None

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoteuthology: update hadoop task for new code layout
Joe Buck [Tue, 12 Mar 2013 07:51:05 +0000 (00:51 -0700)]
teuthology: update hadoop task for new code layout

Update the teuthology Hadoop task to use
the new code layout.

Signed-off-by: Joe Buck <jbbuck@gmail.com>
Reviewed-by: Sam Lang <sam.lang@inktank.com>
12 years agoincreasing the default memory of vm to 4 MB
tamil [Tue, 12 Mar 2013 19:12:58 +0000 (12:12 -0700)]
increasing the default memory of vm to 4 MB

Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
12 years agoceph-deploy: no need to call chef explicitly
Sage Weil [Mon, 11 Mar 2013 23:22:02 +0000 (16:22 -0700)]
ceph-deploy: no need to call chef explicitly

The chef task does this.

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoceph-deploy: use ceph.com git mirror
Sage Weil [Mon, 11 Mar 2013 23:21:47 +0000 (16:21 -0700)]
ceph-deploy: use ceph.com git mirror

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoceph: sudo grep /var/log/ceph/ceph.log
Sage Weil [Mon, 11 Mar 2013 00:08:55 +0000 (17:08 -0700)]
ceph: sudo grep /var/log/ceph/ceph.log

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agomds_thrash: requery mds status inside the wait loop
Sage Weil [Sun, 10 Mar 2013 06:17:15 +0000 (22:17 -0800)]
mds_thrash: requery mds status inside the wait loop

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoschedule_suite.sh: check, but do not sync, clock for each run
Sage Weil [Sun, 10 Mar 2013 05:34:24 +0000 (21:34 -0800)]
schedule_suite.sh: check, but do not sync, clock for each run

Trust NTP to actually sync things.  But check the clock skew so we can
verify it if something seems off.

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agompi: substitute $TESTDIR
Sage Weil [Sun, 10 Mar 2013 05:20:43 +0000 (21:20 -0800)]
mpi: substitute $TESTDIR

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoexec/pexec: substitute $TESTDIR into command
Sage Weil [Sun, 10 Mar 2013 02:07:06 +0000 (18:07 -0800)]
exec/pexec: substitute $TESTDIR into command

It might be better to make this unescaped, but that's trickier.

Signed-off-by: Sage Weil <sage@inktank.com>
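
The substitution in the exec/pexec commit above amounts to a literal text replacement before the command is run. A minimal sketch, with `substitute_testdir` as a hypothetical helper name:

```python
def substitute_testdir(command, testdir):
    """Replace every literal $TESTDIR in command with the test directory."""
    return command.replace("$TESTDIR", testdir)
```

Because the replacement is purely textual, the shell never sees the variable itself, which is the trade-off the commit message hints at.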
12 years agoschedule_suite.sh: take machine type
Sage Weil [Fri, 8 Mar 2013 23:15:07 +0000 (15:15 -0800)]
schedule_suite.sh: take machine type

This script is getting out of hand.. pretty soon we need to move to python
and argparse.

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoRevert "ceph.conf: osd debug op order = true"
Sage Weil [Fri, 8 Mar 2013 17:33:51 +0000 (09:33 -0800)]
Revert "ceph.conf: osd debug op order = true"

This reverts commit 4f68e3ee122d0d2bb18d7f75b711962b272d3edd.

Until we fix 4392... the kernel is sending ops out of order.

12 years agorados.py: add option to do many short runs
Samuel Just [Fri, 8 Mar 2013 01:47:25 +0000 (17:47 -0800)]
rados.py: add option to do many short runs

Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
12 years agoceph-deploy task
tamil [Fri, 8 Mar 2013 01:56:29 +0000 (17:56 -0800)]
ceph-deploy task

Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
12 years agoadded 'extras' to install extra packages excluding ceph
tamil [Fri, 8 Mar 2013 01:43:40 +0000 (17:43 -0800)]
added 'extras' to install extra packages excluding ceph

Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
12 years agolock: remove description when unlocking
Josh Durgin [Thu, 7 Mar 2013 17:40:25 +0000 (09:40 -0800)]
lock: remove description when unlocking

This will hopefully make it easier to track down what's leaving nodes unlocked.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
12 years agoceph.conf: osd debug op order = true
Sage Weil [Thu, 7 Mar 2013 05:35:41 +0000 (21:35 -0800)]
ceph.conf: osd debug op order = true

Debug the osd op ordering by default.  Most of the runs have a small number
of clients, which makes the STL maps cheap.

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agotesting: fix logic in determining Hadoop master
Joe Buck [Wed, 6 Mar 2013 01:55:32 +0000 (17:55 -0800)]
testing: fix logic in determining Hadoop master

Signed-off-by: Joe Buck <jbbuck@gmail.com>
12 years agotesting: specify ceph.conf path in core-site.xml
Joe Buck [Wed, 6 Mar 2013 01:22:24 +0000 (17:22 -0800)]
testing: specify ceph.conf path in core-site.xml

Signed-off-by: Joe Buck <jbbuck@gmail.com>
12 years agoinstall: install libcephfs-java
Sage Weil [Tue, 5 Mar 2013 23:12:02 +0000 (15:12 -0800)]
install: install libcephfs-java

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoceph-fuse: install via install.py
Sage Weil [Tue, 5 Mar 2013 22:14:12 +0000 (14:14 -0800)]
ceph-fuse: install via install.py

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoImplement email task.
Warren Usui [Wed, 27 Feb 2013 19:32:37 +0000 (11:32 -0800)]
Implement email task.

Email.py was added so that the emailto attribute could be passed,
and to prevent 'module object has no attribute: email' errors from
happening.  Run.py actually performs the email operation and calls
suite.email_results to do the actual send mail operation.  The
information passed right now is the summary and config information.

Signed-off-by: Warren Usui <warren.usui@inktank.com>
12 years agoFix pass/fail display on exit.
Warren Usui [Wed, 27 Feb 2013 19:35:55 +0000 (11:35 -0800)]
Fix pass/fail display on exit.

Signed-off-by: Warren Usui <warren.usui@inktank.com>
12 years agoceph-fuse: fix install_debs call with new syntax
Sage Weil [Wed, 27 Feb 2013 18:40:50 +0000 (10:40 -0800)]
ceph-fuse: fix install_debs call with new syntax

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoinstall: poll for packages if wait-for-package: true
Sage Weil [Wed, 27 Feb 2013 05:02:14 +0000 (21:02 -0800)]
install: poll for packages if wait-for-package: true

If the yaml has

 wait-for-package: true

then block and poll for the packages to appear if they are not already
there.  This is only useful for new branches or explicit sha1's, obviously.

Signed-off-by: Sage Weil <sage@inktank.com>
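
The block-and-poll behaviour of 'wait-for-package' in the commit above can be sketched as below. The `package_exists` callback, the interval and the error raised when not waiting are assumptions; the log does not show how the install task checks for packages.

```python
import time


def wait_for_package(package_exists, wait=True, interval=15, sleep=time.sleep):
    """Return once package_exists() is true.

    With wait=False a missing package is an immediate error; with
    wait=True the check is repeated every `interval` seconds.
    """
    while not package_exists():
        if not wait:
            raise RuntimeError("packages not found")
        sleep(interval)
```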
12 years agoinstall: fix branch/tag/sha1 selection
Sage Weil [Wed, 27 Feb 2013 05:01:11 +0000 (21:01 -0800)]
install: fix branch/tag/sha1 selection

We were only allowing selection of packages by branch!

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoMerge branch 'master' of github.com:ceph/teuthology
Sage Weil [Wed, 27 Feb 2013 01:53:56 +0000 (17:53 -0800)]
Merge branch 'master' of github.com:ceph/teuthology

12 years agoceph_manager: use an exception type
Sage Weil [Wed, 27 Feb 2013 00:47:33 +0000 (16:47 -0800)]
ceph_manager: use an exception type

Otherwise this obscures the error

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoteuthology: remove CEPH_JAVA_PATH
Joe Buck [Wed, 20 Feb 2013 23:20:45 +0000 (15:20 -0800)]
teuthology: remove CEPH_JAVA_PATH

Since teuthology now uses debian packages,
we do not need to set this in workunit.
The one test that uses this now tests for
it locally.

Signed-off-by: Joe Buck <jbbuck@gmail.com>
12 years agoteuthology: add an extra_packages flag to install
Joe Buck [Wed, 20 Feb 2013 19:58:45 +0000 (11:58 -0800)]
teuthology: add an extra_packages flag to install

Some tests require additional packages
(e.g., java bindings, hadoop bindings).
Extend the install task to allow for those
packages to be specified in the yaml files.

Signed-off-by: Joe Buck <jbbuck@gmail.com>
Reviewed-by: Sam Lang <sam.lang@inktank.com>
12 years agonuke: blow away /home/ubuntu/cephtest too
Sage Weil [Tue, 26 Feb 2013 01:54:49 +0000 (17:54 -0800)]
nuke: blow away /home/ubuntu/cephtest too

(along with /tmp/cephtest)

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoAdd timer.py and display summary info in run.py.
Warren Usui [Thu, 21 Feb 2013 22:51:54 +0000 (14:51 -0800)]
Add timer.py and display summary info in run.py.

Signed-off-by: Warren Usui <warren.usui@inktank.com>
12 years agoradosgw-admin: Bug fixes for issue 4251.
caleb miles [Mon, 25 Feb 2013 22:37:23 +0000 (17:37 -0500)]
radosgw-admin: Bug fixes for issue 4251.

Fixes errors introduced by d44fb14762c3d7e2853a8e4be012cd8e4557124e.

Signed-off-by: caleb.miles <caleb.miles@inktank.com>

12 years agonuke: blow away /home/ubuntu/cephtest too
Sage Weil [Tue, 26 Feb 2013 01:54:49 +0000 (17:54 -0800)]
nuke: blow away /home/ubuntu/cephtest too

(along with /tmp/cephtest)

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoAdd timer.py and display summary info in run.py.
Warren Usui [Thu, 21 Feb 2013 22:51:54 +0000 (14:51 -0800)]
Add timer.py and display summary info in run.py.

Signed-off-by: Warren Usui <warren.usui@inktank.com>
12 years agorbd: drop udev
Sage Weil [Mon, 25 Feb 2013 20:08:34 +0000 (12:08 -0800)]
rbd: drop udev

The packages have these rules now.

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoexec, pexec: set TESTDIR
Sage Weil [Mon, 25 Feb 2013 17:59:21 +0000 (09:59 -0800)]
exec, pexec: set TESTDIR

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoceph.conf: debug ms = 1 for monitors
Sage Weil [Mon, 25 Feb 2013 17:09:38 +0000 (09:09 -0800)]
ceph.conf: debug ms = 1 for monitors

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agorbd: set TESTDIR when running xfstests
Sage Weil [Mon, 25 Feb 2013 07:00:26 +0000 (23:00 -0800)]
rbd: set TESTDIR when running xfstests

[elder@inktank.com: fixed typo]

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agonuke: sudo for killall
Sage Weil [Fri, 22 Feb 2013 18:51:51 +0000 (10:51 -0800)]
nuke: sudo for killall

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agotask: ceph: create monitor data directories prior to --mkfs
Joao Eduardo Luis [Fri, 22 Feb 2013 01:04:06 +0000 (01:04 +0000)]
task: ceph: create monitor data directories prior to --mkfs

The new monitor store does not create the data directory on --mkfs. We
must create it instead, much like what happens with the osds.

Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
12 years agotask: mon_thrash: Thrash multiple monitors and 'maintain-quorum' option
Joao Eduardo Luis [Tue, 12 Feb 2013 15:31:28 +0000 (15:31 +0000)]
task: mon_thrash: Thrash multiple monitors and 'maintain-quorum' option

We now add a new option 'thrash-many' that by being set to true will break
the default behaviour of killing only one monitor at a time.  Instead,
this option will select up to the maximum number of killable monitors to
kill in each round.

We also add a new 'maintain-quorum' option that will limit the number of
monitors that can be killed in each thrashing round.  If set to true, this
option will limit the number of killable monitors to at most (n/2-1).  This
means that if we are running a configuration that only has up to two
configured monitors, if 'maintain-quorum' is set to true, this task won't
run as there are no killable monitors -- in such a scenario, this option
should be set to false.

Furthermore, if 'store-thrash' is set to true, then 'maintain-quorum' must
also be set to true, as we cannot let the task thrash all the monitor
stores, or we wouldn't be able to sync from other monitors, nor can we
let quorum be dropped, or we won't be able to resync our way into quorum.

Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
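
The 'maintain-quorum' limit from the mon_thrash commit above can be worked through in a few lines. The commit writes the bound as (n/2-1); the sketch reads that as ceil(n/2) - 1 clamped at zero, which is an assumption about rounding chosen so that one or two monitors give no killable monitors, as the message states. `max_killable` is a hypothetical name.

```python
import math


def max_killable(num_mons, maintain_quorum=True):
    """Upper bound on monitors that may be killed in one thrashing round."""
    if not maintain_quorum:
        return num_mons
    # Keep a strict majority alive: with n monitors, at most
    # ceil(n/2) - 1 may go down.
    return max(math.ceil(num_mons / 2) - 1, 0)
```

For example, three monitors allow one kill per round and five allow two, while two monitors allow none.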
12 years agotask: mon_thrash: Add 'seed' and 'store-thrash' options
Joao Eduardo Luis [Mon, 11 Feb 2013 15:11:44 +0000 (15:11 +0000)]
task: mon_thrash: Add 'seed' and 'store-thrash' options

This patch introduces an option to thrash a monitor store when we thrash
the monitors, as well as a 'store-thrash-probability' option (defaulting
to 50%).

We also took this opportunity to introduce a new 'seed' option, that ought
to allow a given run of this task to be reproducible.  This might come in
handy when attempting to reproduce a given behavior that would otherwise
be randomly triggered.

You should note that while the 'seed' option will indeed mimic past
behaviors, this only applies to a past behavior of this task: other tasks
are not affected by this value, nor are any workunits or even ceph daemons.

Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
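
How a 'seed' makes the thrasher's choices repeatable, as described in the commit above, can be sketched with a private random generator. `plan_rounds` and its return shape are hypothetical; the 0.5 default mirrors the stated 50% 'store-thrash-probability'.

```python
import random


def plan_rounds(mons, rounds, seed, store_thrash_probability=0.5):
    """Return a list of (monitor, thrash_store) decisions, one per round.

    A dedicated Random instance seeded with `seed` makes the sequence
    repeatable without touching the global random state used elsewhere.
    """
    rng = random.Random(seed)
    plan = []
    for _ in range(rounds):
        victim = rng.choice(mons)
        thrash_store = rng.random() < store_thrash_probability
        plan.append((victim, thrash_store))
    return plan
```

Only this task's decisions are reproduced; anything driven by other sources of randomness still varies between runs, as the commit message notes.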
12 years agoceph.conf: log file, not log dir
Sage Weil [Fri, 22 Feb 2013 00:21:33 +0000 (16:21 -0800)]
ceph.conf: log file, not log dir

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoinstall: ignore apt-get update failures on package removal
Sage Weil [Thu, 21 Feb 2013 22:34:06 +0000 (14:34 -0800)]
install: ignore apt-get update failures on package removal

At that point we don't really care.

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoceph: fix log, /var/run/ceph stupid
Sage Weil [Thu, 21 Feb 2013 21:13:51 +0000 (13:13 -0800)]
ceph: fix log, /var/run/ceph stupid

Signed-off-by: Sage Weil <sage@inktank.com>