]>
git.apps.os.sepia.ceph.com Git - ceph.git/log
Sage Weil [Mon, 10 Sep 2012 18:08:57 +0000 (11:08 -0700)]
s3tests: run against arbitrary branch/sha1 of s3-tests.git
Sage Weil [Sun, 9 Sep 2012 21:23:12 +0000 (14:23 -0700)]
debian ntp servers
Mike Ryan [Tue, 4 Sep 2012 16:52:38 +0000 (09:52 -0700)]
task: die on ceph error or coredump
This task allows ceph to signal to teuth that it should die immediately
by touching a file under /tmp/cephtest
Signed-off-by: Mike Ryan <mike.ryan@inktank.com>
Sage Weil [Sat, 25 Aug 2012 02:23:34 +0000 (19:23 -0700)]
disable lockdep recursive warnings until #3040 is fixed
Sage Weil [Thu, 23 Aug 2012 18:00:34 +0000 (11:00 -0700)]
internal: fix escaping of \b in syslog grep
Sage Weil [Wed, 22 Aug 2012 22:29:44 +0000 (15:29 -0700)]
suppress this valgrind error
<error>
<unique>0x4</unique>
<tid>1</tid>
<kind>InvalidWrite</kind>
<what>Invalid write of size 8</what>
<stack>
<frame>
<ip>0x400A299</ip>
<obj>/lib/x86_64-linux-gnu/ld-2.15.so</obj>
<fn>do_lookup_x</fn>
<dir>/build/buildd/eglibc-2.15/elf</dir>
<file>dl-lookup.c</file>
<line>250</line>
</frame>
<frame>
<ip>0x403122F</ip>
</frame>
<frame>
<ip>0x400A522</ip>
<obj>/lib/x86_64-linux-gnu/ld-2.15.so</obj>
<fn>_dl_lookup_symbol_x</fn>
<dir>/build/buildd/eglibc-2.15/elf</dir>
<file>dl-lookup.c</file>
<line>739</line>
</frame>
</stack>
<auxwhat>Address 0x7feffeec8 is on thread 1's stack</auxwhat>
</error>
pops up recently
Sage Weil [Mon, 20 Aug 2012 03:16:43 +0000 (20:16 -0700)]
ceph: fix cpu_profile default
Sage Weil [Sun, 19 Aug 2012 03:10:54 +0000 (20:10 -0700)]
rbd.xfstests: default to 1gb (not 250mb) image
Mike Ryan [Fri, 17 Aug 2012 20:47:13 +0000 (13:47 -0700)]
task: run osd/mds/mon with Google CPU profiler via cpu_profile option
Signed-off-by: Mike Ryan <mike.ryan@inktank.com>
Mike Ryan [Thu, 16 Aug 2012 22:50:10 +0000 (15:50 -0700)]
ceph: support tmpfs_journal option to put journal on tmpfs
Signed-off-by: Mike Ryan <mike.ryan@inktank.com>
Sage Weil [Tue, 14 Aug 2012 22:08:21 +0000 (15:08 -0700)]
queue: fix logging of child return code
Tommi Virtanen [Mon, 13 Aug 2012 23:10:05 +0000 (16:10 -0700)]
Disable asynchronous DNS lookups.
Especially on older hosts, we keep triggering errors::
ServerNotFoundError: Unable to find the server at
teuthology.front.sepia.ceph.com: [Errno 3] name does not exist
That comes from libevent's evdns via gevent.dns and httplib2. The rate
of these errors is low enough that they seem to be perhaps timeouts,
or more arbitrary. Busy looping on DNS resolution calls has never
triggered them, so far.
With ``monkey.patch_all(dns=False)``, the teuthology process will
block as a whole whenever doing DNS resolution. This will hopefully be
rare enough that it won't matter.
The only real "fix" seems to be upgrading libraries and hoping for the
best; this commit can be reverted after that is done.
Tommi Virtanen [Thu, 9 Aug 2012 16:42:35 +0000 (09:42 -0700)]
Flush data to temp file before reading it in another process.
Tommi Virtanen [Thu, 9 Aug 2012 16:42:13 +0000 (09:42 -0700)]
Oops tempfile now gives us file objects not fds.
Tommi Virtanen [Wed, 8 Aug 2012 21:48:21 +0000 (14:48 -0700)]
In teuthology-worker, shuffle the child stdout/stderr into our log.
Otherwise, child can suffer a failure that does not get logged by
it's own exception handling machinery, and we have no idea why.
Tommi Virtanen [Wed, 8 Aug 2012 21:45:49 +0000 (14:45 -0700)]
Minimize scope of try-except.
os.write and list.append won't raise CalledProcessError, and now
we don't need to try to contain them for temp file clean up reasons.
Tommi Virtanen [Wed, 8 Aug 2012 21:44:47 +0000 (14:44 -0700)]
Use tempfile.NamedTemporaryFile instead of mkstemp.
Simpler code, no manual cleanup needed. We see a littering of
zero-length temp files from teuthology-worker, and this seems
like a likely source.
Mike Ryan [Thu, 2 Aug 2012 17:58:08 +0000 (10:58 -0700)]
task: verify scrub detects files whose contents changed
Signed-off-by: Mike Ryan <mike.ryan@inktank.com>
Mike Ryan [Wed, 1 Aug 2012 22:35:38 +0000 (15:35 -0700)]
task: scrub OSDs periodically
Signed-off-by: Mike Ryan <mike.ryan@inktank.com>
Sage Weil [Sun, 29 Jul 2012 19:40:06 +0000 (12:40 -0700)]
kernel: push a local .deb instead of using gitbuilder
This lets you specify a path to an existing kernel deb
to be pushed and installed on the remote node.
Limitations:
- We don't build the deb for you. figuring out what
filename 'make deb-pkg' is building is annoying.
- We need to be able to figure out the sha1 from the provided
path. It shouldn't be a problem, given the way make deb-pkg
names the debs.
Sage Weil [Sun, 29 Jul 2012 19:15:06 +0000 (12:15 -0700)]
syslog check: fix false-positive BUG matches in random strings
Sage Weil [Sat, 28 Jul 2012 17:53:09 +0000 (10:53 -0700)]
osd_recovery: also test unfound discovery
This tests for bug #2866.
Sage Weil [Sat, 28 Jul 2012 17:23:02 +0000 (10:23 -0700)]
osd_recovery: test incomplete pg recovery
4-osd test to reproduce #2860 and confirm the fix.
Sage Weil [Sat, 28 Jul 2012 17:22:13 +0000 (10:22 -0700)]
ceph_manager: wait_for_active
Sage Weil [Sat, 28 Jul 2012 17:22:02 +0000 (10:22 -0700)]
ceph_manager: count 'incomplete' as 'down'
tamil [Thu, 26 Jul 2012 20:48:11 +0000 (13:48 -0700)]
Fixed the code to pass 'yes' during mkfs
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
tamil [Wed, 25 Jul 2012 23:38:25 +0000 (16:38 -0700)]
Added '-y' option for mkfs.ext4
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
Sage Weil [Tue, 24 Jul 2012 22:25:56 +0000 (15:25 -0700)]
admin_socket: make test optional
If it's not there, we just verify the output is valid json.
Sage Weil [Mon, 23 Jul 2012 05:03:04 +0000 (22:03 -0700)]
schedule_suite.sh: put coverage option in ceph section
Sage Weil [Sun, 22 Jul 2012 03:18:24 +0000 (20:18 -0700)]
ceph: fix mkfs/mount option defaults
Later code expects a list, not None.
Samuel Just [Thu, 19 Jul 2012 22:49:29 +0000 (15:49 -0700)]
tasks: add multibench task for testing pool creation
Also adds support for specifying a pool for radosbench
to create and then cleanup instead of "data".
Signed-off-by: Samuel Just <sam.just@inktank.com>
Sage Weil [Wed, 18 Jul 2012 20:44:59 +0000 (13:44 -0700)]
clock: print skew with ntp servers to log to help debug time issues
Sage Weil [Wed, 18 Jul 2012 18:04:30 +0000 (11:04 -0700)]
nuke: log what pid we are killing when we kill it
Sage Weil [Tue, 17 Jul 2012 17:00:59 +0000 (10:00 -0700)]
ceph: archive mon data to a .tgz
Saves bandwidth, time, and space.
Sage Weil [Mon, 16 Jul 2012 17:53:25 +0000 (10:53 -0700)]
set machine description to ctx.archive when auto-locking machines for a run
Sage Weil [Sat, 14 Jul 2012 20:02:04 +0000 (13:02 -0700)]
schedule/suite: schedule job, suite N times
Sage Weil [Fri, 13 Jul 2012 20:57:22 +0000 (13:57 -0700)]
kernel: fix kernel installation when kdb: is specified
This normalize check would only trigger if a *single* key was specified.
Change it so that it triggers as long as all keys are in the list of valid
keys. This lets us specify both kdb: true and a sha1/branch/tag.
Phew!
Sage Weil [Fri, 13 Jul 2012 19:36:56 +0000 (12:36 -0700)]
schedule_suite.sh: use workunits from ceph commit
Use the workunits from the same ceph branch we are testing.
Sage Weil [Fri, 13 Jul 2012 18:30:21 +0000 (11:30 -0700)]
ceph: add default btrfs mkfs options
Sage Weil [Fri, 13 Jul 2012 18:30:07 +0000 (11:30 -0700)]
ceph: cleanup/simplify mount/mkfs options
Sage Weil [Fri, 13 Jul 2012 18:13:31 +0000 (11:13 -0700)]
workunit: allow overrides
Pull top-level overrides into our config. This lets you do:
overrides:
workunit:
branch: foo
tasks:
...
- workunit:
clients:
all:
- foo
...
Sage Weil [Fri, 13 Jul 2012 18:12:31 +0000 (11:12 -0700)]
workunit: allow branch/sha1/tag to be specified
Pull the workunit(s) from the branch/tag/sha1 specified in the config.
Josh Durgin [Fri, 13 Jul 2012 17:00:50 +0000 (10:00 -0700)]
workunit: pass branch/sha1 to test
Some tests download things from the ceph repo. Let them know which
version to use through the CEPH_REF environment variable.
tamil [Fri, 13 Jul 2012 01:02:29 +0000 (18:02 -0700)]
Added functionality to get mkfs and mount options for file systems
from the config file,if present. Otherwise, default options are used.
The default value for inode size is changed to 2k when creating xfs.
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
tamil [Thu, 12 Jul 2012 23:36:40 +0000 (16:36 -0700)]
fixed typo
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
Sage Weil [Thu, 12 Jul 2012 15:33:29 +0000 (08:33 -0700)]
radosgw-admin: use --bucket instead of old --bucket-id
The --bucket-id support was removed.
Sage Weil [Wed, 11 Jul 2012 21:23:51 +0000 (14:23 -0700)]
nuke: honor 'check-locks: ...' field in targets file
If you are nuking a yaml file with check-locks: false, don't check locks.
Sage Weil [Wed, 11 Jul 2012 21:14:46 +0000 (14:14 -0700)]
internal: archive mon data dirs
These can be useful for debugging, and are usually pretty small.
Fixes: #2714
Sage Weil [Wed, 11 Jul 2012 16:22:50 +0000 (09:22 -0700)]
internal: move pulling archive w/ tar to helper
Sage Weil [Sat, 7 Jul 2012 03:15:55 +0000 (20:15 -0700)]
use sudo to kill teuthology proc
Sage Weil [Thu, 5 Jul 2012 20:43:19 +0000 (13:43 -0700)]
run: make -a short for --archive
Sage Weil [Wed, 4 Jul 2012 21:47:05 +0000 (14:47 -0700)]
watch-suite: stupid script to watch teuth run progress
Sage Weil [Tue, 3 Jul 2012 23:22:38 +0000 (16:22 -0700)]
nuke: be more careful about kill; simplify
If the archive dir is specified, make sure we are killing the right
process.
Also drop the kill_process helper; it's simple enough to open-code.
Sage Weil [Tue, 3 Jul 2012 19:53:08 +0000 (12:53 -0700)]
nuke: nuke based on archive path
Use path/config.yaml for targets, path/pid for pid to kill, and
path/owner for job owner.
Sage Weil [Wed, 4 Jul 2012 21:29:55 +0000 (14:29 -0700)]
valgrind: add strptime suppressions
Precise's strptime triggers valgrind false positives.
Use ship_utilities to push the valgrind.supp file over, which is a bit
slippy.
tamil [Tue, 3 Jul 2012 23:04:12 +0000 (16:04 -0700)]
Added a debug message
The debug message is to print the string that should be JSON.
This is to track a nightly run failure.
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
Sage Weil [Tue, 3 Jul 2012 19:49:39 +0000 (12:49 -0700)]
schedule_suite: use the sha1, not branch name
Keep the entire suite run on the same commit. We were resolving the sha1,
but not using it.
tamil [Tue, 3 Jul 2012 19:22:26 +0000 (12:22 -0700)]
nuke - optionally kill the process hung
Added a function kill_process to kill the process hung in the nightly runs.
It takes in pid as an optional argument.
Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
Sage Weil [Mon, 2 Jul 2012 15:44:38 +0000 (08:44 -0700)]
ceph: fix valgrind error check
grep all the logs, not the dir... doh!
Mark Nelson [Fri, 29 Jun 2012 19:36:58 +0000 (14:36 -0500)]
Merge branch 'master' of github.com:ceph/teuthology
Mark Nelson [Fri, 29 Jun 2012 19:36:30 +0000 (14:36 -0500)]
Now using daemon-helper
Signed-off-by: Mark Nelson <nhm@clusterfaq.org>
Sage Weil [Thu, 28 Jun 2012 18:14:15 +0000 (11:14 -0700)]
add cleanup-user.sh script
big hammer, use with care
Sage Weil [Tue, 26 Jun 2012 16:26:03 +0000 (09:26 -0700)]
schedule_suite.sh: drop -x
Mark Nelson [Thu, 28 Jun 2012 16:47:16 +0000 (11:47 -0500)]
cleaned up commented code
Signed-off-by: Mark Nelson <nhm@clusterfaq.org>
Mark Nelson [Thu, 28 Jun 2012 00:38:12 +0000 (19:38 -0500)]
Added blktrace task
Signed-off-by: Mark Nelson <nhm@clusterfaq.org>
Sage Weil [Mon, 25 Jun 2012 22:20:19 +0000 (15:20 -0700)]
ignore DEADLOCK line inside lockdep splat
Josh Durgin [Fri, 22 Jun 2012 02:23:42 +0000 (19:23 -0700)]
Add script to create a vm image with extra packages
Josh Durgin [Tue, 19 Jun 2012 21:13:39 +0000 (14:13 -0700)]
Add a task to run a test against rbd inside of qemu.
For now this task does not setup networking for the vm,
and simply runs an executable downloaded from a specified url.
It does support adding multiple rbd devices, but making use
of that with e.g. xfstests requires a bit more work.
Dan Mick [Thu, 21 Jun 2012 21:32:51 +0000 (14:32 -0700)]
Check for machine args based on local, not ctx.machines
Signed-off-by: Dan Mick <dan.mick@inktank.com>
Sage Weil [Thu, 21 Jun 2012 20:20:18 +0000 (13:20 -0700)]
whitelist current lockdep warnings in syslog
These are causing too much noise in the qa runs to leave, and #2617 is
sufficiently non-trivial to do this in the interim. Putting a better
mechanism in place will include removing these coarse whitelist items and
replacing with something that specifically matches the failures we want
to ignore.
Sage Weil [Wed, 20 Jun 2012 18:35:43 +0000 (11:35 -0700)]
record owner at start of run
So that we can clean up easily even when we don't finish and there is no
summary.yaml.
Josh Durgin [Wed, 20 Jun 2012 17:13:48 +0000 (10:13 -0700)]
teuthology-ls: tolerate non-existent 'success' key in summary file
Sage Weil [Wed, 20 Jun 2012 00:29:32 +0000 (17:29 -0700)]
schedule_suite: enable kdb
Among other things, you can attach to the console after the fact and type
'dmesg' to see wtf happened.
Sage Weil [Wed, 20 Jun 2012 00:24:01 +0000 (17:24 -0700)]
kernel: enable/disable kdb
This hard-codes ttyS1, which is what we use on sepia.
Yehuda Sadeh [Tue, 19 Jun 2012 21:30:00 +0000 (14:30 -0700)]
add usage log tests to radosgw-admin tasks
tests 'usage show' and 'usage trim'
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Sage Weil [Sun, 17 Jun 2012 19:16:25 +0000 (12:16 -0700)]
sync clock at start of every run
Sage Weil [Sun, 17 Jun 2012 03:14:35 +0000 (20:14 -0700)]
tolerate 250ms clock drift
Sage Weil [Sat, 16 Jun 2012 22:05:46 +0000 (15:05 -0700)]
include suite in archive dir
Sage Weil [Sat, 16 Jun 2012 20:59:46 +0000 (13:59 -0700)]
whitelist 'slow request' in qa runs
Sage Weil [Thu, 14 Jun 2012 21:03:39 +0000 (14:03 -0700)]
radosgw-admin: fix for non-numeric bucket ids
Sage Weil [Thu, 14 Jun 2012 21:03:29 +0000 (14:03 -0700)]
radosgw-admin: test max buckets limit
Sage Weil [Thu, 14 Jun 2012 21:02:40 +0000 (14:02 -0700)]
radosgw-admin: remove buckets before user
Otherwise user delete will fail.
Sage Weil [Thu, 14 Jun 2012 21:00:57 +0000 (14:00 -0700)]
radosgw-admin: fix swift subuser/key tests
Need to do 'subuser (add|rm)', not 'key (add|rm)'.
Sage Weil [Thu, 14 Jun 2012 20:23:24 +0000 (13:23 -0700)]
schedule_suite.sh: add flavors, check/fix sha1s, optional templates
This should be everything we need to use this for the nightlies, with the
exception of updating the git trees, which can happen explicitly in the
crontab.
Josh Durgin [Mon, 11 Jun 2012 19:31:22 +0000 (12:31 -0700)]
workunit: grab 'all' config from the right variable
Josh Durgin [Mon, 11 Jun 2012 01:43:35 +0000 (18:43 -0700)]
workunit: allow setting environment variables
This is useful for e.g. running the same tests against rbd in new and
old formats.
Dan Mick [Thu, 7 Jun 2012 20:20:02 +0000 (13:20 -0700)]
--summary: add total counts, also note free machines
Signed-off-by: Dan Mick <dan.mick@inktank.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
Dan Mick [Thu, 7 Jun 2012 03:29:28 +0000 (20:29 -0700)]
new variable lock hid lock() function
Dan Mick [Wed, 6 Jun 2012 22:15:47 +0000 (15:15 -0700)]
teuthology-lock: add --summary and --brief options
Signed-off-by: Dan Mick <dan.mick@inktank.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
Sage Weil [Wed, 6 Jun 2012 23:00:55 +0000 (16:00 -0700)]
pull s3-tests.git using git, not http
Sage Weil [Wed, 6 Jun 2012 20:32:56 +0000 (13:32 -0700)]
ceph: simplify 'cluster' mon log handling
It's not a special file in the mon_data directory anymore, but intead
something in archive that will get slurped up normally. Make sure we
grep for badness from the proper location.
Dan Mick [Wed, 6 Jun 2012 01:41:45 +0000 (18:41 -0700)]
Pass up unmodified exceptions from connection.connect()
This allows useful errors to be reported from things like
mismatched hostkeys, etc.
Dan Mick [Wed, 6 Jun 2012 01:33:36 +0000 (18:33 -0700)]
More shortnames fixes:
- Allow shortnames in teuthology-updatekeys as well
- Use list comprehensions instead of map()
Eleanor Cawthon [Tue, 5 Jun 2012 22:30:51 +0000 (15:30 -0700)]
task/: Added object map benchmarking test
Signed-off-by: Eleanor Cawthon <eleanor.cawthon@inktank.com>
Dan Mick [Tue, 5 Jun 2012 00:41:57 +0000 (17:41 -0700)]
Allow short names to teuthology-lock (e.g. "plana14")
Signed-off-by: Dan Mick <dan.mick@inktank.com>
Reviewed-by: Sam Just <sam.just@inktank.com>
Sage Weil [Fri, 1 Jun 2012 04:39:33 +0000 (21:39 -0700)]
fix up dist var
This lets you override the default (now precise) in the ceph config yaml,
e.g.
- ceph:
dist: oneiric
branch: master
Dan Mick [Fri, 1 Jun 2012 00:09:20 +0000 (17:09 -0700)]
Change hardcoded oneiric to precise
Signed-off-by: Dan Mick <dan.mick@inktank.com>
Sam Zaydel [Thu, 24 May 2012 16:37:46 +0000 (09:37 -0700)]
Added python-dev to list of required packages.
Sage Weil [Mon, 21 May 2012 03:50:19 +0000 (20:50 -0700)]
rbd.xfstests: default to 250mb instead of 100mb
Sage Weil [Sun, 6 May 2012 04:22:40 +0000 (21:22 -0700)]
schedule_suite: fix 'slow request' whitelist