git.apps.os.sepia.ceph.com Git - ceph.git/log
Joe Buck [Mon, 22 Oct 2012 18:33:44 +0000 (11:33 -0700)]
fixing the tense of my README change
buck [Mon, 22 Oct 2012 18:21:07 +0000 (11:21 -0700)]
Edit README.rst to note that the ssh key needed in each targets entry is the host's public ssh key, not that of the user named in the same entry
David Zafman [Tue, 9 Oct 2012 22:50:25 +0000 (15:50 -0700)]
New nfs task that performs NFS client mount of export (see knfsd)
Fixes: 3245
Signed-off-by: David Zafman <david.zafman@inktank.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
David Zafman [Sat, 6 Oct 2012 02:26:24 +0000 (19:26 -0700)]
New knfsd task that does an nfs server export
Ability to specify options
By default only export to current hosts
Fixes: 3245
Signed-off-by: David Zafman <david.zafman@inktank.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
David Zafman [Tue, 9 Oct 2012 00:59:47 +0000 (17:59 -0700)]
New kcon_most task that enables most ceph kernel logging
Signed-off-by: David Zafman <david.zafman@inktank.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
David Zafman [Tue, 9 Oct 2012 00:08:41 +0000 (17:08 -0700)]
Fix ceph-fuse example
Signed-off-by: David Zafman <david.zafman@inktank.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
David Zafman [Sat, 6 Oct 2012 02:25:48 +0000 (19:25 -0700)]
Fix typo in README
Signed-off-by: David Zafman <david.zafman@inktank.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
tamil [Sat, 6 Oct 2012 00:33:57 +0000 (17:33 -0700)]
Printing the number of tests passed when 'all' tests are successful
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
Yehuda Sadeh [Tue, 2 Oct 2012 00:17:14 +0000 (17:17 -0700)]
radosgw-admin: usage should time out after 20 minutes
Not 45 seconds.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Sage Weil [Mon, 1 Oct 2012 04:08:41 +0000 (21:08 -0700)]
console: add console task
Log the sol console of every target to a file in the archive dir.
Sage Weil [Mon, 1 Oct 2012 04:07:58 +0000 (21:07 -0700)]
internal: stop warning about lockdep circular dependency
This is coming from xfs, currently. Bah.
Sage Weil [Sat, 29 Sep 2012 15:54:08 +0000 (08:54 -0700)]
include newpool in osd cap for client.0
This is needed by the kclient_workunit_kclient task.
Josh Durgin [Fri, 28 Sep 2012 17:07:45 +0000 (10:07 -0700)]
misc: use new syntax for osd caps
pool=pool1,pool2 is not valid for the new grammar
Sage Weil [Sun, 23 Sep 2012 15:03:17 +0000 (08:03 -0700)]
fix lock held when returning to user space typo
Josh Durgin [Fri, 21 Sep 2012 21:54:19 +0000 (14:54 -0700)]
Allow scheduled jobs to use different teuthology branches
teuthology-[schedule|suite] get a parameter to specify the branch,
to put the job in a branch-specific queue. Workers running that
branch of teuthology can pull jobs from that queue.
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin [Fri, 21 Sep 2012 23:44:35 +0000 (16:44 -0700)]
Fix errors found by pyflakes
A bunch of unused imports and variables.
Sage Weil [Wed, 19 Sep 2012 21:19:53 +0000 (14:19 -0700)]
ignore 'lock held when returning to user space' from btrfs sb_internal crap
Sam Lang [Tue, 18 Sep 2012 23:31:39 +0000 (16:31 -0700)]
replace tab with spaces
Signed-off-by: Sam Lang <sam.lang@inktank.com>
Sam Lang [Tue, 18 Sep 2012 22:56:08 +0000 (15:56 -0700)]
fix error on teardown failing to unmount /mnt
Sam Lang [Tue, 18 Sep 2012 20:50:36 +0000 (13:50 -0700)]
update doc to be ceph-fuse task (instead of cfuse)
tamil [Fri, 14 Sep 2012 22:04:40 +0000 (15:04 -0700)]
imported subprocess module in nuke script
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
Josh Durgin [Thu, 13 Sep 2012 21:31:46 +0000 (14:31 -0700)]
nuke: add missing import
Josh Durgin [Thu, 13 Sep 2012 21:29:43 +0000 (14:29 -0700)]
rbd: fix typo and cast to int before comparing format
Josh Durgin [Wed, 12 Sep 2012 18:30:40 +0000 (11:30 -0700)]
rbd: only specify --format if not using the default
This lets older versions that do not support --format still work with
format 1 images.
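A minimal sketch of the idea in this commit and the follow-up "cast to int" fix, assuming format 1 is the default. The helper name `rbd_create_args` and its parameters are hypothetical and are not taken from the teuthology source.

```python
def rbd_create_args(name, size_mb, image_format=None):
    """Build an argument list for `rbd create` (hypothetical helper)."""
    args = ["rbd", "create", "--size", str(size_mb)]
    # Cast before comparing: a value read from YAML may arrive as "2" or 2.
    # Assumption: format 1 is the default, so it never needs the flag.
    if image_format is not None and int(image_format) != 1:
        args += ["--format", str(int(image_format))]
    args.append(name)
    return args
```

Leaving the flag out for the default keeps the command line valid for rbd binaries that predate `--format`.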
Tommi Virtanen [Tue, 11 Sep 2012 18:11:39 +0000 (11:11 -0700)]
Don't lose tracebacks of exceptions raised in a greenlet.
Exception objects don't contain the traceback of where they were
raised from (to avoid cyclic data structures wrecking gc and causing
mem leaks), so the singular "raise obj" form creates a new traceback
from the current execution location, thus losing the original location
of the error.
Gevent explicitly wants to throw away the traceback, to release any
objects the greenlet may still be referring to, closing files,
releasing locks etc. In this case, we think it's safe, so stash the
exception info away in a holder object, and resurrect it on the other
side of the results queue.
http://stackoverflow.com/questions/9268916/how-to-capture-a-traceback-in-gevent
This can be reproduced easily with
    from teuthology.parallel import parallel
    def f():
        raise RuntimeError("bork")
    with parallel() as p:
        p.spawn(f)
and looking at the resulting traceback with and without this change.
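The stash-and-resurrect approach described in this commit can be sketched roughly as follows. This is an illustration only: it uses Python 3 syntax and a plain thread with a queue standing in for gevent greenlets, and the names `ExceptionHolder`, `resurrect`, and `worker` are hypothetical.

```python
import queue
import sys
import threading
import traceback


class ExceptionHolder:
    """Carries sys.exc_info() across the results queue (hypothetical helper)."""

    def __init__(self, exc_info):
        self.exc_info = exc_info


def resurrect(holder):
    """Re-raise the stashed exception with its original traceback."""
    _, exc, tb = holder.exc_info
    raise exc.with_traceback(tb)


def f():
    raise RuntimeError("bork")


def worker(func, results):
    try:
        results.put(func())
    except Exception:
        # Stash the full exc_info instead of the bare exception object.
        results.put(ExceptionHolder(sys.exc_info()))


results = queue.Queue()
t = threading.Thread(target=worker, args=(f, results))
t.start()
t.join()

item = results.get()
try:
    if isinstance(item, ExceptionHolder):
        resurrect(item)
except RuntimeError:
    # The formatted traceback still includes the frame for f().
    formatted = traceback.format_exc()
```

Python 3 exception objects carry a `__traceback__` attribute, so the explicit holder matters less there; the sketch only shows the mechanism the commit message describes.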
Alex Elder [Mon, 10 Sep 2012 14:23:08 +0000 (09:23 -0500)]
rbd: allow xfstests task to specify rbd image formats
This adds the ability to specify the rbd image format to use for the
scratch and test devices for the rbd.xfstests task.
Signed-off-by: Alex Elder <elder@inktank.com>
Reviewed-by: Dan Mick <dan.mick@inktank.com>
Alex Elder [Mon, 10 Sep 2012 14:23:08 +0000 (09:23 -0500)]
rbd: allow image format to be specified
This adds the ability to specify an rbd image format (either 1 or 2)
for an rbd image.
Signed-off-by: Alex Elder <elder@inktank.com>
Reviewed-by: Dan Mick <dan.mick@inktank.com>
tamil [Mon, 10 Sep 2012 22:02:47 +0000 (15:02 -0700)]
Suppress valgrind error "Invalid write 8"
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
Yehuda Sadeh [Mon, 10 Sep 2012 18:14:12 +0000 (11:14 -0700)]
radosgw-admin: update task for new usage reporting
Usage reporting output has been modified, also use the new
--categories input param.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Sage Weil [Mon, 10 Sep 2012 18:09:25 +0000 (11:09 -0700)]
schedule_suite.sh: try to use same branch for s3-tests.git
If it isn't there, warn and fall back to master.
Sage Weil [Mon, 10 Sep 2012 18:08:57 +0000 (11:08 -0700)]
s3tests: run against arbitrary branch/sha1 of s3-tests.git
Sage Weil [Sun, 9 Sep 2012 21:23:12 +0000 (14:23 -0700)]
debian ntp servers
Mike Ryan [Tue, 4 Sep 2012 16:52:38 +0000 (09:52 -0700)]
task: die on ceph error or coredump
This task allows ceph to signal to teuth that it should die immediately
by touching a file under /tmp/cephtest
Signed-off-by: Mike Ryan <mike.ryan@inktank.com>
Sage Weil [Sat, 25 Aug 2012 02:23:34 +0000 (19:23 -0700)]
disable lockdep recursive warnings until #3040 is fixed
Sage Weil [Thu, 23 Aug 2012 18:00:34 +0000 (11:00 -0700)]
internal: fix escaping of \b in syslog grep
Sage Weil [Wed, 22 Aug 2012 22:29:44 +0000 (15:29 -0700)]
suppress this valgrind error
<error>
<unique>0x4</unique>
<tid>1</tid>
<kind>InvalidWrite</kind>
<what>Invalid write of size 8</what>
<stack>
<frame>
<ip>0x400A299</ip>
<obj>/lib/x86_64-linux-gnu/ld-2.15.so</obj>
<fn>do_lookup_x</fn>
<dir>/build/buildd/eglibc-2.15/elf</dir>
<file>dl-lookup.c</file>
<line>250</line>
</frame>
<frame>
<ip>0x403122F</ip>
</frame>
<frame>
<ip>0x400A522</ip>
<obj>/lib/x86_64-linux-gnu/ld-2.15.so</obj>
<fn>_dl_lookup_symbol_x</fn>
<dir>/build/buildd/eglibc-2.15/elf</dir>
<file>dl-lookup.c</file>
<line>739</line>
</frame>
</stack>
<auxwhat>Address 0x7feffeec8 is on thread 1's stack</auxwhat>
</error>
This started popping up recently.
Sage Weil [Mon, 20 Aug 2012 03:16:43 +0000 (20:16 -0700)]
ceph: fix cpu_profile default
Sage Weil [Sun, 19 Aug 2012 03:10:54 +0000 (20:10 -0700)]
rbd.xfstests: default to 1gb (not 250mb) image
Mike Ryan [Fri, 17 Aug 2012 20:47:13 +0000 (13:47 -0700)]
task: run osd/mds/mon with Google CPU profiler via cpu_profile option
Signed-off-by: Mike Ryan <mike.ryan@inktank.com>
Mike Ryan [Thu, 16 Aug 2012 22:50:10 +0000 (15:50 -0700)]
ceph: support tmpfs_journal option to put journal on tmpfs
Signed-off-by: Mike Ryan <mike.ryan@inktank.com>
Sage Weil [Tue, 14 Aug 2012 22:08:21 +0000 (15:08 -0700)]
queue: fix logging of child return code
Tommi Virtanen [Mon, 13 Aug 2012 23:10:05 +0000 (16:10 -0700)]
Disable asynchronous DNS lookups.
Especially on older hosts, we keep triggering errors::
    ServerNotFoundError: Unable to find the server at
    teuthology.front.sepia.ceph.com: [Errno 3] name does not exist
That comes from libevent's evdns via gevent.dns and httplib2. The rate
of these errors is low enough that they seem to be perhaps timeouts,
or more arbitrary. Busy looping on DNS resolution calls has never
triggered them, so far.
With ``monkey.patch_all(dns=False)``, the teuthology process will
block as a whole whenever doing DNS resolution. This will hopefully be
rare enough that it won't matter.
The only real "fix" seems to be upgrading libraries and hoping for the
best; this commit can be reverted after that is done.
Tommi Virtanen [Thu, 9 Aug 2012 16:42:35 +0000 (09:42 -0700)]
Flush data to temp file before reading it in another process.
Tommi Virtanen [Thu, 9 Aug 2012 16:42:13 +0000 (09:42 -0700)]
Oops tempfile now gives us file objects not fds.
Tommi Virtanen [Wed, 8 Aug 2012 21:48:21 +0000 (14:48 -0700)]
In teuthology-worker, shuffle the child stdout/stderr into our log.
Otherwise, child can suffer a failure that does not get logged by
its own exception handling machinery, and we have no idea why.
Tommi Virtanen [Wed, 8 Aug 2012 21:45:49 +0000 (14:45 -0700)]
Minimize scope of try-except.
os.write and list.append won't raise CalledProcessError, and now
we don't need to try to contain them for temp file clean up reasons.
Tommi Virtanen [Wed, 8 Aug 2012 21:44:47 +0000 (14:44 -0700)]
Use tempfile.NamedTemporaryFile instead of mkstemp.
Simpler code, no manual cleanup needed. We see a littering of
zero-length temp files from teuthology-worker, and this seems
like a likely source.
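A minimal sketch of the pattern, combined with the flush that a follow-up commit added. It assumes a POSIX host, where an open temporary file can be reopened by name. The file contents and the child command are made up for illustration.

```python
import subprocess
import sys
import tempfile

with tempfile.NamedTemporaryFile(mode="w", suffix=".yaml") as tmp:
    tmp.write("description: example job\n")
    # Without this flush the child may see an empty or partial file.
    tmp.flush()
    child_output = subprocess.check_output(
        [sys.executable, "-c", "import sys; print(open(sys.argv[1]).read())", tmp.name],
        text=True,
    )
# The file is removed automatically when the with block exits.
```

Because the context manager owns the file, no manual cleanup is needed and no stray temp file is left behind if the child fails.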
Mike Ryan [Thu, 2 Aug 2012 17:58:08 +0000 (10:58 -0700)]
task: verify scrub detects files whose contents changed
Signed-off-by: Mike Ryan <mike.ryan@inktank.com>
Mike Ryan [Wed, 1 Aug 2012 22:35:38 +0000 (15:35 -0700)]
task: scrub OSDs periodically
Signed-off-by: Mike Ryan <mike.ryan@inktank.com>
Sage Weil [Sun, 29 Jul 2012 19:40:06 +0000 (12:40 -0700)]
kernel: push a local .deb instead of using gitbuilder
This lets you specify a path to an existing kernel deb
to be pushed and installed on the remote node.
Limitations:
 - We don't build the deb for you. Figuring out what
   filename 'make deb-pkg' is building is annoying.
 - We need to be able to figure out the sha1 from the provided
   path. It shouldn't be a problem, given the way make deb-pkg
   names the debs.
Sage Weil [Sun, 29 Jul 2012 19:15:06 +0000 (12:15 -0700)]
syslog check: fix false-positive BUG matches in random strings
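One way to avoid this kind of false positive is to anchor the match on word boundaries. Whether the commit does exactly this is an assumption, and the real check is a grep over syslog rather than Python. The sample log lines are invented.

```python
import re

# Raw string: in a plain Python string "\b" would be a backspace character.
BUG_RE = re.compile(r"\bBUG\b")

lines = [
    "kernel: BUG: unable to handle kernel NULL pointer dereference",
    "sshd: key fingerprint aBUGx9f3",
]
matches = [line for line in lines if BUG_RE.search(line)]
```

Only the first line matches, because the second contains BUG inside a longer run of word characters.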
Sage Weil [Sat, 28 Jul 2012 17:53:09 +0000 (10:53 -0700)]
osd_recovery: also test unfound discovery
This tests for bug #2866.
Sage Weil [Sat, 28 Jul 2012 17:23:02 +0000 (10:23 -0700)]
osd_recovery: test incomplete pg recovery
4-osd test to reproduce #2860 and confirm the fix.
Sage Weil [Sat, 28 Jul 2012 17:22:13 +0000 (10:22 -0700)]
ceph_manager: wait_for_active
Sage Weil [Sat, 28 Jul 2012 17:22:02 +0000 (10:22 -0700)]
ceph_manager: count 'incomplete' as 'down'
tamil [Thu, 26 Jul 2012 20:48:11 +0000 (13:48 -0700)]
Fixed the code to pass 'yes' during mkfs
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
tamil [Wed, 25 Jul 2012 23:38:25 +0000 (16:38 -0700)]
Added '-y' option for mkfs.ext4
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
Sage Weil [Tue, 24 Jul 2012 22:25:56 +0000 (15:25 -0700)]
admin_socket: make test optional
If it's not there, we just verify the output is valid json.
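A rough sketch of that behaviour, assuming the optional test is a callable applied to the parsed output. The function name `check_admin_socket_output` is hypothetical and does not come from the task's code.

```python
import json


def check_admin_socket_output(raw, test=None):
    """Parse raw as JSON; apply the optional test callable if given."""
    parsed = json.loads(raw)  # raises ValueError on invalid JSON
    if test is not None and not test(parsed):
        raise AssertionError("admin socket test failed: %r" % (parsed,))
    return parsed
```

With no test supplied, a successful return means only that the output parsed as JSON.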
Sage Weil [Mon, 23 Jul 2012 05:03:04 +0000 (22:03 -0700)]
schedule_suite.sh: put coverage option in ceph section
Sage Weil [Sun, 22 Jul 2012 03:18:24 +0000 (20:18 -0700)]
ceph: fix mkfs/mount option defaults
Later code expects a list, not None.
Samuel Just [Thu, 19 Jul 2012 22:49:29 +0000 (15:49 -0700)]
tasks: add multibench task for testing pool creation
Also adds support for specifying a pool for radosbench
to create and then clean up instead of "data".
Signed-off-by: Samuel Just <sam.just@inktank.com>
Sage Weil [Wed, 18 Jul 2012 20:44:59 +0000 (13:44 -0700)]
clock: print skew with ntp servers to log to help debug time issues
Sage Weil [Wed, 18 Jul 2012 18:04:30 +0000 (11:04 -0700)]
nuke: log what pid we are killing when we kill it
Sage Weil [Tue, 17 Jul 2012 17:00:59 +0000 (10:00 -0700)]
ceph: archive mon data to a .tgz
Saves bandwidth, time, and space.
Sage Weil [Mon, 16 Jul 2012 17:53:25 +0000 (10:53 -0700)]
set machine description to ctx.archive when auto-locking machines for a run
Sage Weil [Sat, 14 Jul 2012 20:02:04 +0000 (13:02 -0700)]
schedule/suite: schedule job, suite N times
Sage Weil [Fri, 13 Jul 2012 20:57:22 +0000 (13:57 -0700)]
kernel: fix kernel installation when kdb: is specified
This normalize check would only trigger if a *single* key was specified.
Change it so that it triggers as long as all keys are in the list of valid
keys. This lets us specify both kdb: true and a sha1/branch/tag.
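The change in the check can be illustrated as below. The contents of `VALID_KEYS` and both function names are assumptions made for this sketch; only the "single key" versus "all keys valid" distinction comes from the commit message.

```python
VALID_KEYS = ("branch", "tag", "sha1", "kdb")  # hypothetical list


def old_check(config):
    """Earlier behaviour as described: only a lone valid key matched."""
    return len(config) == 1 and next(iter(config)) in VALID_KEYS


def new_check(config):
    """New behaviour: match whenever every key is a valid option."""
    return bool(config) and all(key in VALID_KEYS for key in config)


both = {"kdb": True, "branch": "testing"}
```

Here `old_check(both)` is false because two keys are present, while `new_check(both)` is true.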
Phew!
Sage Weil [Fri, 13 Jul 2012 19:36:56 +0000 (12:36 -0700)]
schedule_suite.sh: use workunits from ceph commit
Use the workunits from the same ceph branch we are testing.
Sage Weil [Fri, 13 Jul 2012 18:30:21 +0000 (11:30 -0700)]
ceph: add default btrfs mkfs options
Sage Weil [Fri, 13 Jul 2012 18:30:07 +0000 (11:30 -0700)]
ceph: cleanup/simplify mount/mkfs options
Sage Weil [Fri, 13 Jul 2012 18:13:31 +0000 (11:13 -0700)]
workunit: allow overrides
Pull top-level overrides into our config. This lets you do:
    overrides:
      workunit:
        branch: foo
    tasks:
    ...
    - workunit:
        clients:
          all:
          - foo
    ...
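Pulling top-level overrides into a task's config amounts to a recursive dictionary merge. The sketch below assumes that override values win over task values; `deep_merge` is a hypothetical helper written for this example.

```python
import copy


def deep_merge(base, overrides):
    """Return base with overrides applied recursively (hypothetical helper)."""
    merged = copy.deepcopy(base)
    for key, value in overrides.items():
        if isinstance(value, dict) and isinstance(merged.get(key), dict):
            merged[key] = deep_merge(merged[key], value)
        else:
            merged[key] = copy.deepcopy(value)
    return merged


job = {
    "overrides": {"workunit": {"branch": "foo"}},
    "tasks": [{"workunit": {"clients": {"all": ["foo"]}}}],
}
task_config = job["tasks"][0]["workunit"]
effective = deep_merge(task_config, job["overrides"].get("workunit", {}))
```

The merged result carries both the task's own `clients` mapping and the overriding `branch`, and the original task config is left unmodified.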
Sage Weil [Fri, 13 Jul 2012 18:12:31 +0000 (11:12 -0700)]
workunit: allow branch/sha1/tag to be specified
Pull the workunit(s) from the branch/tag/sha1 specified in the config.
Josh Durgin [Fri, 13 Jul 2012 17:00:50 +0000 (10:00 -0700)]
workunit: pass branch/sha1 to test
Some tests download things from the ceph repo. Let them know which
version to use through the CEPH_REF environment variable.
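A small local illustration of passing the ref through the environment. In teuthology the test runs on a remote host, so the local subprocess here is a stand-in, and `run_with_ref` is a hypothetical name.

```python
import os
import subprocess
import sys


def run_with_ref(argv, ref):
    """Run a test command with CEPH_REF set in its environment."""
    env = dict(os.environ, CEPH_REF=ref)
    return subprocess.check_output(argv, env=env, text=True)


seen = run_with_ref(
    [sys.executable, "-c", "import os; print(os.environ['CEPH_REF'])"],
    "master",
)
```

A test script can then read `CEPH_REF` to decide which version of a file to download from the ceph repo.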
tamil [Fri, 13 Jul 2012 01:02:29 +0000 (18:02 -0700)]
Added functionality to get mkfs and mount options for file systems
from the config file, if present. Otherwise, default options are used.
The default value for inode size is changed to 2k when creating xfs.
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
tamil [Thu, 12 Jul 2012 23:36:40 +0000 (16:36 -0700)]
fixed typo
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
Sage Weil [Thu, 12 Jul 2012 15:33:29 +0000 (08:33 -0700)]
radosgw-admin: use --bucket instead of old --bucket-id
The --bucket-id support was removed.
Sage Weil [Wed, 11 Jul 2012 21:23:51 +0000 (14:23 -0700)]
nuke: honor 'check-locks: ...' field in targets file
If you are nuking a yaml file with check-locks: false, don't check locks.
Sage Weil [Wed, 11 Jul 2012 21:14:46 +0000 (14:14 -0700)]
internal: archive mon data dirs
These can be useful for debugging, and are usually pretty small.
Fixes: #2714
Sage Weil [Wed, 11 Jul 2012 16:22:50 +0000 (09:22 -0700)]
internal: move pulling archive w/ tar to helper
Sage Weil [Sat, 7 Jul 2012 03:15:55 +0000 (20:15 -0700)]
use sudo to kill teuthology proc
Sage Weil [Thu, 5 Jul 2012 20:43:19 +0000 (13:43 -0700)]
run: make -a short for --archive
Sage Weil [Wed, 4 Jul 2012 21:47:05 +0000 (14:47 -0700)]
watch-suite: stupid script to watch teuth run progress
Sage Weil [Tue, 3 Jul 2012 23:22:38 +0000 (16:22 -0700)]
nuke: be more careful about kill; simplify
If the archive dir is specified, make sure we are killing the right
process.
Also drop the kill_process helper; it's simple enough to open-code.
Sage Weil [Tue, 3 Jul 2012 19:53:08 +0000 (12:53 -0700)]
nuke: nuke based on archive path
Use path/config.yaml for targets, path/pid for pid to kill, and
path/owner for job owner.
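The layout described above could be read with something like the following. The helper name `nuke_inputs` and the returned dictionary shape are assumptions; only the three file names under the archive path come from the commit message.

```python
import os


def nuke_inputs(archive_path):
    """Collect what nuke needs from an archive dir (hypothetical helper)."""
    with open(os.path.join(archive_path, "pid")) as f:
        pid = int(f.read().strip())
    with open(os.path.join(archive_path, "owner")) as f:
        owner = f.read().strip()
    return {
        "targets_file": os.path.join(archive_path, "config.yaml"),
        "pid": pid,
        "owner": owner,
    }
```

Parsing `config.yaml` itself is left out to keep the sketch free of third-party dependencies.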
Sage Weil [Wed, 4 Jul 2012 21:29:55 +0000 (14:29 -0700)]
valgrind: add strptime suppressions
Precise's strptime triggers valgrind false positives.
Use ship_utilities to push the valgrind.supp file over, which is a bit
sloppy.
tamil [Tue, 3 Jul 2012 23:04:12 +0000 (16:04 -0700)]
Added a debug message
The debug message is to print the string that should be JSON.
This is to track a nightly run failure.
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
Sage Weil [Tue, 3 Jul 2012 19:49:39 +0000 (12:49 -0700)]
schedule_suite: use the sha1, not branch name
Keep the entire suite run on the same commit. We were resolving the sha1,
but not using it.
tamil [Tue, 3 Jul 2012 19:22:26 +0000 (12:22 -0700)]
nuke: optionally kill the hung process
Added a kill_process function to kill a process hung in the nightly runs.
It takes in pid as an optional argument.
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
Sage Weil [Mon, 2 Jul 2012 15:44:38 +0000 (08:44 -0700)]
ceph: fix valgrind error check
grep all the logs, not the dir... doh!
Mark Nelson [Fri, 29 Jun 2012 19:36:58 +0000 (14:36 -0500)]
Merge branch 'master' of github.com:ceph/teuthology
Mark Nelson [Fri, 29 Jun 2012 19:36:30 +0000 (14:36 -0500)]
Now using daemon-helper
Signed-off-by: Mark Nelson <nhm@clusterfaq.org>
Sage Weil [Thu, 28 Jun 2012 18:14:15 +0000 (11:14 -0700)]
add cleanup-user.sh script
big hammer, use with care
Sage Weil [Tue, 26 Jun 2012 16:26:03 +0000 (09:26 -0700)]
schedule_suite.sh: drop -x
Mark Nelson [Thu, 28 Jun 2012 16:47:16 +0000 (11:47 -0500)]
cleaned up commented code
Signed-off-by: Mark Nelson <nhm@clusterfaq.org>
Mark Nelson [Thu, 28 Jun 2012 00:38:12 +0000 (19:38 -0500)]
Added blktrace task
Signed-off-by: Mark Nelson <nhm@clusterfaq.org>
Sage Weil [Mon, 25 Jun 2012 22:20:19 +0000 (15:20 -0700)]
ignore DEADLOCK line inside lockdep splat
Josh Durgin [Fri, 22 Jun 2012 02:23:42 +0000 (19:23 -0700)]
Add script to create a vm image with extra packages
Josh Durgin [Tue, 19 Jun 2012 21:13:39 +0000 (14:13 -0700)]
Add a task to run a test against rbd inside of qemu.
For now this task does not set up networking for the vm,
and simply runs an executable downloaded from a specified url.
It does support adding multiple rbd devices, but making use
of that with e.g. xfstests requires a bit more work.
Dan Mick [Thu, 21 Jun 2012 21:32:51 +0000 (14:32 -0700)]
Check for machine args based on local, not ctx.machines
Signed-off-by: Dan Mick <dan.mick@inktank.com>
Sage Weil [Thu, 21 Jun 2012 20:20:18 +0000 (13:20 -0700)]
whitelist current lockdep warnings in syslog
These are causing too much noise in the qa runs to leave, and #2617 is
sufficiently non-trivial to do this in the interim. Putting a better
mechanism in place will include removing these coarse whitelist items and
replacing with something that specifically matches the failures we want
to ignore.