]> git.apps.os.sepia.ceph.com Git - ceph.git/log
ceph.git
12 years agofixing the tense of my README change
Joe Buck [Mon, 22 Oct 2012 18:33:44 +0000 (11:33 -0700)]
fixing the tense of my README change

12 years agoEdit to README.bst to note that the ssh keys needed in the targets entries are the...
buck [Mon, 22 Oct 2012 18:21:07 +0000 (11:21 -0700)]
Edit to README.bst to note that the ssh keys needed in the targets entries are the hosts public ssh key and not the users defined in the same entry

12 years agoNew nfs task that performs NFS client mount of export (see knfsd)
David Zafman [Tue, 9 Oct 2012 22:50:25 +0000 (15:50 -0700)]
New nfs task that performs NFS client mount of export (see knfsd)

Fixes: 3245
Signed-off-by: David Zafman <david.zafman@inktank.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
12 years agoNew knfsd task that does an nfs server export
David Zafman [Sat, 6 Oct 2012 02:26:24 +0000 (19:26 -0700)]
New knfsd task that does an nfs server export
    Ability to specify options
    By default only export to current hosts

Fixes: 3245
Signed-off-by: David Zafman <david.zafman@inktank.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
12 years agoNew kcon_most task that enables most ceph kernel logging
David Zafman [Tue, 9 Oct 2012 00:59:47 +0000 (17:59 -0700)]
New kcon_most task that enables most ceph kernel logging

Signed-off-by: David Zafman <david.zafman@inktank.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
12 years agoFix ceph-fuse example
David Zafman [Tue, 9 Oct 2012 00:08:41 +0000 (17:08 -0700)]
Fix ceph-fuse example

Signed-off-by: David Zafman <david.zafman@inktank.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
12 years agoFix typo in README
David Zafman [Sat, 6 Oct 2012 02:25:48 +0000 (19:25 -0700)]
Fix typo in README

Signed-off-by: David Zafman <david.zafman@inktank.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
13 years agoPrinting the number of tests passed when 'all' tests are successful
tamil [Sat, 6 Oct 2012 00:33:57 +0000 (17:33 -0700)]
Printing the number of tests passed when 'all' tests are successful

Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
13 years agoradosgw-admin: usage should time out after 20 minutes
Yehuda Sadeh [Tue, 2 Oct 2012 00:17:14 +0000 (17:17 -0700)]
radosgw-admin: usage should time out after 20 minutes

Not 45 seconds.

Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
13 years agoconsole: add console task
Sage Weil [Mon, 1 Oct 2012 04:08:41 +0000 (21:08 -0700)]
console: add console task

Log the sol console of every target to a file in the archive dir.

13 years agointernal: stop warning about lockdep circular dependency
Sage Weil [Mon, 1 Oct 2012 04:07:58 +0000 (21:07 -0700)]
internal: stop warning about lockdep circular dependency

This is coming from xfs, currently.  Bah.

13 years agoinclude newpool in osd cap for client.0
Sage Weil [Sat, 29 Sep 2012 15:54:08 +0000 (08:54 -0700)]
include newpool in osd cap for client.0

This is needed by the kclient_workunit_kclient task.

13 years agomisc: use new syntax for osd caps
Josh Durgin [Fri, 28 Sep 2012 17:07:45 +0000 (10:07 -0700)]
misc: use new syntax for osd caps

pool=pool1,pool2 is not valid for the new grammar

13 years agofix lock held when returning to user space typo
Sage Weil [Sun, 23 Sep 2012 15:03:17 +0000 (08:03 -0700)]
fix lock held when returning to user space typo

13 years agoAllow scheduled jobs to use different teuthology branches
Josh Durgin [Fri, 21 Sep 2012 21:54:19 +0000 (14:54 -0700)]
Allow scheduled jobs to use different teuthology branches

teuthology-[schedule|suite] get a parameter to specify the branch,
to put the job in a branch-specific queue. Workers running that
branch of teuthology can pull jobs from that queue.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
13 years agoFix errors found by pyflakes
Josh Durgin [Fri, 21 Sep 2012 23:44:35 +0000 (16:44 -0700)]
Fix errors found by pyflakes

A bunch of unused imports and variables.

13 years agoignore 'lock held when returning to user space' from btrfs sb_internal crap
Sage Weil [Wed, 19 Sep 2012 21:19:53 +0000 (14:19 -0700)]
ignore 'lock held when returning to user space' from btrfs sb_internal crap

13 years agoreplace tab with spaces
Sam Lang [Tue, 18 Sep 2012 23:31:39 +0000 (16:31 -0700)]
replace tab with spaces

Signed-off-by: Sam Lang <sam.lang@inktank.com>
13 years agofix error on teardown failing to unmount /mnt
Sam Lang [Tue, 18 Sep 2012 22:56:08 +0000 (15:56 -0700)]
fix error on teardown failing to unmount /mnt

13 years agoupdate doc to be ceph-fuse task (instead of cfuse)
Sam Lang [Tue, 18 Sep 2012 20:50:36 +0000 (13:50 -0700)]
update doc to be ceph-fuse task (instead of cfuse)

13 years agoimported subprocess module in nuke script
tamil [Fri, 14 Sep 2012 22:04:40 +0000 (15:04 -0700)]
imported subprocess module in nuke script

Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
13 years agonuke: add missing import
Josh Durgin [Thu, 13 Sep 2012 21:31:46 +0000 (14:31 -0700)]
nuke: add missing import

13 years agorbd: fix typo and cast to int before comparing format
Josh Durgin [Thu, 13 Sep 2012 21:29:43 +0000 (14:29 -0700)]
rbd: fix typo and cast to int before comparing format

13 years agorbd: only specify --format if not using the default
Josh Durgin [Wed, 12 Sep 2012 18:30:40 +0000 (11:30 -0700)]
rbd: only specify --format if not using the default

This lets older versions that do not support --format still work with
format 1 images.

13 years agoDon't lose tracebacks of exceptions raised in a greenlet.
Tommi Virtanen [Tue, 11 Sep 2012 18:11:39 +0000 (11:11 -0700)]
Don't lose tracebacks of exceptions raised in a greenlet.

Exception objects don't contain the traceback of where they were
raised from (to avoid cyclic data structures wrecking gc and causing
mem leaks), so the singular "raise obj" form creates a new traceback
from the current execution location, thus losing the original location
of the error.

Gevent explicitly wants to throw away the traceback, to release any
objects the greenlet may still be referring to, closing files,
releasing locks etc. In this case, we think it's safe, so stash the
exception info away in a holder object, and resurrect it on the other
side of the results queue.

http://stackoverflow.com/questions/9268916/how-to-capture-a-traceback-in-gevent

This can be reproduced easily with

from teuthology.parallel import parallel
def f():
    raise RuntimeError("bork")
with parallel() as p:
    p.spawn(f)

and looking at the resulting traceback with and without this change.

13 years agorbd: allow xfstests task to specify rbd image formats
Alex Elder [Mon, 10 Sep 2012 14:23:08 +0000 (09:23 -0500)]
rbd: allow xfstests task to specify rbd image formats

This adds the ability to specify the rbd image format to use for the
scratch and test devices for the rbd.xfstests task.

Signed-off-by: Alex Elder <elder@inktank.com>
Reviewed-by: Dan Mick <dan.mick@inktank.com>
13 years agorbd: allow image format to be specified
Alex Elder [Mon, 10 Sep 2012 14:23:08 +0000 (09:23 -0500)]
rbd: allow image format to be specified

This adds the ability to specify an rbd image format (either 1 or 2)
for an rbd image.

Signed-off-by: Alex Elder <elder@inktank.com>
Reviewed-by: Dan Mick <dan.mick@inktank.com>
13 years agoSuppress valgrind error "Invalid write 8"
tamil [Mon, 10 Sep 2012 22:02:47 +0000 (15:02 -0700)]
Suppress valgrind error "Invalid write 8"

Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
13 years agoradosgw-admin: update task for new usage reporting
Yehuda Sadeh [Mon, 10 Sep 2012 18:14:12 +0000 (11:14 -0700)]
radosgw-admin: update task for new usage reporting

Usage reporting output has been modified, also use the new
--categories input param.

Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
13 years agoschedule_suite.sh: try to use same branch for s3-tests.git
Sage Weil [Mon, 10 Sep 2012 18:09:25 +0000 (11:09 -0700)]
schedule_suite.sh: try to use same branch for s3-tests.git

If it isn't there, warn and fall back to master.

13 years agos3tests: run against arbitrary branch/sha1 of s3-tests.git
Sage Weil [Mon, 10 Sep 2012 18:08:57 +0000 (11:08 -0700)]
s3tests: run against arbitrary branch/sha1 of s3-tests.git

13 years agodebian ntp servers
Sage Weil [Sun, 9 Sep 2012 21:23:12 +0000 (14:23 -0700)]
debian ntp servers

13 years agotask: die on ceph error or coredump
Mike Ryan [Tue, 4 Sep 2012 16:52:38 +0000 (09:52 -0700)]
task: die on ceph error or coredump

This task allows ceph to signal to teuth that it should die immediately
by touching a file under /tmp/cephtest

Signed-off-by: Mike Ryan <mike.ryan@inktank.com>
13 years agodisable lockdep recursive warnings until #3040 is fixed
Sage Weil [Sat, 25 Aug 2012 02:23:34 +0000 (19:23 -0700)]
disable lockdep recursive warnings until #3040 is fixed

13 years agointernal: fix escaping of \b in syslog grep
Sage Weil [Thu, 23 Aug 2012 18:00:34 +0000 (11:00 -0700)]
internal: fix escaping of \b in syslog grep

13 years agosuppress this valgrind error
Sage Weil [Wed, 22 Aug 2012 22:29:44 +0000 (15:29 -0700)]
suppress this valgrind error

<error>
  <unique>0x4</unique>
  <tid>1</tid>
  <kind>InvalidWrite</kind>
  <what>Invalid write of size 8</what>
  <stack>
    <frame>
      <ip>0x400A299</ip>
      <obj>/lib/x86_64-linux-gnu/ld-2.15.so</obj>
      <fn>do_lookup_x</fn>
      <dir>/build/buildd/eglibc-2.15/elf</dir>
      <file>dl-lookup.c</file>
      <line>250</line>
    </frame>
    <frame>
      <ip>0x403122F</ip>
    </frame>
    <frame>
      <ip>0x400A522</ip>
      <obj>/lib/x86_64-linux-gnu/ld-2.15.so</obj>
      <fn>_dl_lookup_symbol_x</fn>
      <dir>/build/buildd/eglibc-2.15/elf</dir>
      <file>dl-lookup.c</file>
      <line>739</line>
    </frame>
  </stack>
  <auxwhat>Address 0x7feffeec8 is on thread 1's stack</auxwhat>
</error>

pops up recently

13 years agoceph: fix cpu_profile default
Sage Weil [Mon, 20 Aug 2012 03:16:43 +0000 (20:16 -0700)]
ceph: fix cpu_profile default

13 years agorbd.xfstests: default to 1gb (not 250mb) image
Sage Weil [Sun, 19 Aug 2012 03:10:54 +0000 (20:10 -0700)]
rbd.xfstests: default to 1gb (not 250mb) image

13 years agotask: run osd/mds/mon with Google CPU profiler via cpu_profile option
Mike Ryan [Fri, 17 Aug 2012 20:47:13 +0000 (13:47 -0700)]
task: run osd/mds/mon with Google CPU profiler via cpu_profile option

Signed-off-by: Mike Ryan <mike.ryan@inktank.com>
13 years agoceph: support tmpfs_journal option to put journal on tmpfs
Mike Ryan [Thu, 16 Aug 2012 22:50:10 +0000 (15:50 -0700)]
ceph: support tmpfs_journal option to put journal on tmpfs

Signed-off-by: Mike Ryan <mike.ryan@inktank.com>
13 years agoqueue: fix logging of child return code
Sage Weil [Tue, 14 Aug 2012 22:08:21 +0000 (15:08 -0700)]
queue: fix logging of child return code

13 years agoDisable asynchronous DNS lookups.
Tommi Virtanen [Mon, 13 Aug 2012 23:10:05 +0000 (16:10 -0700)]
Disable asynchronous DNS lookups.

Especially on older hosts, we keep triggering errors::

  ServerNotFoundError: Unable to find the server at
  teuthology.front.sepia.ceph.com: [Errno 3] name does not exist

That comes from libevent's evdns via gevent.dns and httplib2. The rate
of these errors is low enough that they seem to be perhaps timeouts,
or more arbitrary. Busy looping on DNS resolution calls has never
triggered them, so far.

With ``monkey.patch_all(dns=False)``, the teuthology process will
block as a whole whenever doing DNS resolution. This will hopefully be
rare enough that it won't matter.

The only real "fix" seems to be upgrading libraries and hoping for the
best; this commit can be reverted after that is done.

13 years agoFlush data to temp file before reading it in another process.
Tommi Virtanen [Thu, 9 Aug 2012 16:42:35 +0000 (09:42 -0700)]
Flush data to temp file before reading it in another process.

13 years agoOops tempfile now gives us file objects not fds.
Tommi Virtanen [Thu, 9 Aug 2012 16:42:13 +0000 (09:42 -0700)]
Oops tempfile now gives us file objects not fds.

13 years agoIn teuthology-worker, shuffle the child stdout/stderr into our log.
Tommi Virtanen [Wed, 8 Aug 2012 21:48:21 +0000 (14:48 -0700)]
In teuthology-worker, shuffle the child stdout/stderr into our log.

Otherwise, child can suffer a failure that does not get logged by
it's own exception handling machinery, and we have no idea why.

13 years agoMinimize scope of try-except.
Tommi Virtanen [Wed, 8 Aug 2012 21:45:49 +0000 (14:45 -0700)]
Minimize scope of try-except.

os.write and list.append won't raise CalledProcessError, and now
we don't need to try to contain them for temp file clean up reasons.

13 years agoUse tempfile.NamedTemporaryFile instead of mkstemp.
Tommi Virtanen [Wed, 8 Aug 2012 21:44:47 +0000 (14:44 -0700)]
Use tempfile.NamedTemporaryFile instead of mkstemp.

Simpler code, no manual cleanup needed. We see a littering of
zero-length temp files from teuthology-worker, and this seems
like a likely source.

13 years agotask: verify scrub detects files whose contents changed
Mike Ryan [Thu, 2 Aug 2012 17:58:08 +0000 (10:58 -0700)]
task: verify scrub detects files whose contents changed

Signed-off-by: Mike Ryan <mike.ryan@inktank.com>
13 years agotask: scrub OSDs periodically
Mike Ryan [Wed, 1 Aug 2012 22:35:38 +0000 (15:35 -0700)]
task: scrub OSDs periodically

Signed-off-by: Mike Ryan <mike.ryan@inktank.com>
13 years agokernel: push a local .deb instead of using gitbuilder
Sage Weil [Sun, 29 Jul 2012 19:40:06 +0000 (12:40 -0700)]
kernel: push a local .deb instead of using gitbuilder

This lets you specify a path to an existing kernel deb
to be pushed and installed on the remote node.

Limitations:
 - We don't build the deb for you.  figuring out what
   filename 'make deb-pkg' is building is annoying.
 - We need to be able to figure out the sha1 from the provided
   path.  It shouldn't be a problem, given the way make deb-pkg
   names the debs.

13 years agosyslog check: fix false-positive BUG matches in random strings
Sage Weil [Sun, 29 Jul 2012 19:15:06 +0000 (12:15 -0700)]
syslog check: fix false-positive BUG matches in random strings

13 years agoosd_recovery: also test unfound discovery
Sage Weil [Sat, 28 Jul 2012 17:53:09 +0000 (10:53 -0700)]
osd_recovery: also test unfound discovery

This tests for bug #2866.

13 years agoosd_recovery: test incomplete pg recovery
Sage Weil [Sat, 28 Jul 2012 17:23:02 +0000 (10:23 -0700)]
osd_recovery: test incomplete pg recovery

4-osd test to reproduce #2860 and confirm the fix.

13 years agoceph_manager: wait_for_active
Sage Weil [Sat, 28 Jul 2012 17:22:13 +0000 (10:22 -0700)]
ceph_manager: wait_for_active

13 years agoceph_manager: count 'incomplete' as 'down'
Sage Weil [Sat, 28 Jul 2012 17:22:02 +0000 (10:22 -0700)]
ceph_manager: count 'incomplete' as 'down'

13 years agoFixed the code to pass 'yes' during mkfs
tamil [Thu, 26 Jul 2012 20:48:11 +0000 (13:48 -0700)]
Fixed the code to pass 'yes' during mkfs

Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
13 years agoAdded '-y' option for mkfs.ext4
tamil [Wed, 25 Jul 2012 23:38:25 +0000 (16:38 -0700)]
Added '-y' option for mkfs.ext4

Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
13 years agoadmin_socket: make test optional
Sage Weil [Tue, 24 Jul 2012 22:25:56 +0000 (15:25 -0700)]
admin_socket: make test optional

If it's not there, we just verify the output is valid json.

13 years agoschedule_suite.sh: put coverage option in ceph section
Sage Weil [Mon, 23 Jul 2012 05:03:04 +0000 (22:03 -0700)]
schedule_suite.sh: put coverage option in ceph section

13 years agoceph: fix mkfs/mount option defaults
Sage Weil [Sun, 22 Jul 2012 03:18:24 +0000 (20:18 -0700)]
ceph: fix mkfs/mount option defaults

Later code expects a list, not None.

13 years agotasks: add multibench task for testing pool creation
Samuel Just [Thu, 19 Jul 2012 22:49:29 +0000 (15:49 -0700)]
tasks: add multibench task for testing pool creation

Also adds support for specifying a pool for radosbench
to create and then cleanup instead of "data".

Signed-off-by: Samuel Just <sam.just@inktank.com>
13 years agoclock: print skew with ntp servers to log to help debug time issues
Sage Weil [Wed, 18 Jul 2012 20:44:59 +0000 (13:44 -0700)]
clock: print skew with ntp servers to log to help debug time issues

13 years agonuke: log what pid we are killing when we kill it
Sage Weil [Wed, 18 Jul 2012 18:04:30 +0000 (11:04 -0700)]
nuke: log what pid we are killing when we kill it

13 years agoceph: archive mon data to a .tgz
Sage Weil [Tue, 17 Jul 2012 17:00:59 +0000 (10:00 -0700)]
ceph: archive mon data to a .tgz

Saves bandwidth, time, and space.

13 years agoset machine description to ctx.archive when auto-locking machines for a run
Sage Weil [Mon, 16 Jul 2012 17:53:25 +0000 (10:53 -0700)]
set machine description to ctx.archive when auto-locking machines for a run

13 years agoschedule/suite: schedule job, suite N times
Sage Weil [Sat, 14 Jul 2012 20:02:04 +0000 (13:02 -0700)]
schedule/suite: schedule job, suite N times

13 years agokernel: fix kernel installation when kdb: is specified
Sage Weil [Fri, 13 Jul 2012 20:57:22 +0000 (13:57 -0700)]
kernel: fix kernel installation when kdb: is specified

This normalize check would only trigger if a *single* key was specified.
Change it so that it triggers as long as all keys are in the list of valid
keys. This lets us specify both kdb: true and a sha1/branch/tag.

Phew!

13 years agoschedule_suite.sh: use workunits from ceph commit
Sage Weil [Fri, 13 Jul 2012 19:36:56 +0000 (12:36 -0700)]
schedule_suite.sh: use workunits from ceph commit

Use the workunits from the same ceph branch we are testing.

13 years agoceph: add default btrfs mkfs options
Sage Weil [Fri, 13 Jul 2012 18:30:21 +0000 (11:30 -0700)]
ceph: add default btrfs mkfs options

13 years agoceph: cleanup/simplify mount/mkfs options
Sage Weil [Fri, 13 Jul 2012 18:30:07 +0000 (11:30 -0700)]
ceph: cleanup/simplify mount/mkfs options

13 years agoworkunit: allow overrides
Sage Weil [Fri, 13 Jul 2012 18:13:31 +0000 (11:13 -0700)]
workunit: allow overrides

Pull top-level overrides into our config.  This lets you do:

overrides:
  workunit:
    branch: foo
tasks:
...
- workunit:
  clients:
    all:
    - foo
...

13 years agoworkunit: allow branch/sha1/tag to be specified
Sage Weil [Fri, 13 Jul 2012 18:12:31 +0000 (11:12 -0700)]
workunit: allow branch/sha1/tag to be specified

Pull the workunit(s) from the branch/tag/sha1 specified in the config.

13 years agoworkunit: pass branch/sha1 to test
Josh Durgin [Fri, 13 Jul 2012 17:00:50 +0000 (10:00 -0700)]
workunit: pass branch/sha1 to test

Some tests download things from the ceph repo. Let them know which
version to use through the CEPH_REF environment variable.

13 years agoAdded functionality to get mkfs and mount options for file systems
tamil [Fri, 13 Jul 2012 01:02:29 +0000 (18:02 -0700)]
Added functionality to get mkfs and mount options for file systems
from the config file,if present. Otherwise, default options are used.

The default value for inode size is changed to 2k when creating xfs.

Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
13 years agofixed typo
tamil [Thu, 12 Jul 2012 23:36:40 +0000 (16:36 -0700)]
fixed typo

Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
13 years agoradosgw-admin: use --bucket instead of old --bucket-id
Sage Weil [Thu, 12 Jul 2012 15:33:29 +0000 (08:33 -0700)]
radosgw-admin: use --bucket instead of old --bucket-id

The --bucket-id support was removed.

13 years agonuke: honor 'check-locks: ...' field in targets file
Sage Weil [Wed, 11 Jul 2012 21:23:51 +0000 (14:23 -0700)]
nuke: honor 'check-locks: ...' field in targets file

If you are nuking a yaml file with check-locks: false, don't check locks.

13 years agointernal: archive mon data dirs
Sage Weil [Wed, 11 Jul 2012 21:14:46 +0000 (14:14 -0700)]
internal: archive mon data dirs

These can be useful for debugging, and are usually pretty small.

Fixes: #2714
13 years agointernal: move pulling archive w/ tar to helper
Sage Weil [Wed, 11 Jul 2012 16:22:50 +0000 (09:22 -0700)]
internal: move pulling archive w/ tar to helper

13 years agouse sudo to kill teuthology proc
Sage Weil [Sat, 7 Jul 2012 03:15:55 +0000 (20:15 -0700)]
use sudo to kill teuthology proc

13 years agorun: make -a short for --archive
Sage Weil [Thu, 5 Jul 2012 20:43:19 +0000 (13:43 -0700)]
run: make -a short for --archive

13 years agowatch-suite: stupid script to watch teuth run progress
Sage Weil [Wed, 4 Jul 2012 21:47:05 +0000 (14:47 -0700)]
watch-suite: stupid script to watch teuth run progress

13 years agonuke: be more careful about kill; simplify
Sage Weil [Tue, 3 Jul 2012 23:22:38 +0000 (16:22 -0700)]
nuke: be more careful about kill; simplify

If the archive dir is specified, make sure we are killing the right
process.

Also drop the kill_process helper; it's simple enough to open-code.

13 years agonuke: nuke based on archive path
Sage Weil [Tue, 3 Jul 2012 19:53:08 +0000 (12:53 -0700)]
nuke: nuke based on archive path

Use path/config.yaml for targets, path/pid for pid to kill, and
path/owner for job owner.

13 years agovalgrind: add strptime suppressions
Sage Weil [Wed, 4 Jul 2012 21:29:55 +0000 (14:29 -0700)]
valgrind: add strptime suppressions

Precise's strptime triggers valgrind false positives.

Use ship_utilities to push the valgrind.supp file over, which is a bit
slippy.

13 years agoAdded a debug message
tamil [Tue, 3 Jul 2012 23:04:12 +0000 (16:04 -0700)]
Added a debug message

The debug message is to print the string that should be JSON.
This is to track a nightly run failure.

Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
13 years agoschedule_suite: use the sha1, not branch name
Sage Weil [Tue, 3 Jul 2012 19:49:39 +0000 (12:49 -0700)]
schedule_suite: use the sha1, not branch name

Keep the entire suite run on the same commit. We were resolving the sha1,
but not using it.

13 years agonuke - optionally kill the process hung
tamil [Tue, 3 Jul 2012 19:22:26 +0000 (12:22 -0700)]
nuke - optionally kill the process hung

Added a function kill_process to kill the process hung in the nightly runs.
It takes in pid as an optional argument.

Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
13 years agoceph: fix valgrind error check
Sage Weil [Mon, 2 Jul 2012 15:44:38 +0000 (08:44 -0700)]
ceph: fix valgrind error check

grep all the logs, not the dir... doh!

13 years agoMerge branch 'master' of github.com:ceph/teuthology
Mark Nelson [Fri, 29 Jun 2012 19:36:58 +0000 (14:36 -0500)]
Merge branch 'master' of github.com:ceph/teuthology

13 years agoNow using daemon-helper
Mark Nelson [Fri, 29 Jun 2012 19:36:30 +0000 (14:36 -0500)]
Now using daemon-helper

Signed-off-by: Mark Nelson <nhm@clusterfaq.org>
13 years agoadd cleanup-user.sh script
Sage Weil [Thu, 28 Jun 2012 18:14:15 +0000 (11:14 -0700)]
add cleanup-user.sh script

big hammer, use with care

13 years agoschedule_suite.sh: drop -x
Sage Weil [Tue, 26 Jun 2012 16:26:03 +0000 (09:26 -0700)]
schedule_suite.sh: drop -x

13 years agocleaned up commented code
Mark Nelson [Thu, 28 Jun 2012 16:47:16 +0000 (11:47 -0500)]
cleaned up commented code

Signed-off-by: Mark Nelson <nhm@clusterfaq.org>
13 years agoAdded blktrace task
Mark Nelson [Thu, 28 Jun 2012 00:38:12 +0000 (19:38 -0500)]
Added blktrace task

Signed-off-by: Mark Nelson <nhm@clusterfaq.org>
13 years agoignore DEADLOCK line inside lockdep splat
Sage Weil [Mon, 25 Jun 2012 22:20:19 +0000 (15:20 -0700)]
ignore DEADLOCK line inside lockdep splat

13 years agoAdd script to create a vm image with extra packages
Josh Durgin [Fri, 22 Jun 2012 02:23:42 +0000 (19:23 -0700)]
Add script to create a vm image with extra packages

13 years agoAdd a task to run a test against rbd inside of qemu.
Josh Durgin [Tue, 19 Jun 2012 21:13:39 +0000 (14:13 -0700)]
Add a task to run a test against rbd inside of qemu.

For now this task does not setup networking for the vm,
and simply runs an executable downloaded from a specified url.

It does support adding multiple rbd devices, but making use
of that with e.g. xfstests requires a bit more work.

13 years agoCheck for machine args based on local, not ctx.machines
Dan Mick [Thu, 21 Jun 2012 21:32:51 +0000 (14:32 -0700)]
Check for machine args based on local, not ctx.machines
Signed-off-by: Dan Mick <dan.mick@inktank.com>
13 years agowhitelist current lockdep warnings in syslog
Sage Weil [Thu, 21 Jun 2012 20:20:18 +0000 (13:20 -0700)]
whitelist current lockdep warnings in syslog

These are causing too much noise in the qa runs to leave, and #2617 is
sufficiently non-trivial to do this in the interim.  Putting a better
mechanism in place will include removing these coarse whitelist items and
replacing with something that specifically matches the failures we want
to ignore.