git.apps.os.sepia.ceph.com Git - ceph.git/log
Greg Farnum [Fri, 30 Sep 2011 16:26:42 +0000 (09:26 -0700)]
s3-tests: use radosgw-admin instead of radosgw_admin
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Josh Durgin [Thu, 29 Sep 2011 16:09:31 +0000 (09:09 -0700)]
ceph_manager: parse osd numbers with dots
This is necessary since wip-dot-names was merged.
Sage Weil [Fri, 23 Sep 2011 15:57:18 +0000 (08:57 -0700)]
rename c* -> ceph-*
Leave cfuse task name unchanged for now...
Josh Durgin [Fri, 23 Sep 2011 01:23:36 +0000 (18:23 -0700)]
queue: results_timeout needs to be converted to a string
Samuel Just [Thu, 22 Sep 2011 20:23:05 +0000 (13:23 -0700)]
task/watch_notify_stress.py: add simple watch_notify stress test
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
Josh Durgin [Wed, 21 Sep 2011 18:05:18 +0000 (11:05 -0700)]
schedule: put results timeout in the job
The default was always being used instead.
Greg Farnum [Tue, 20 Sep 2011 17:04:01 +0000 (10:04 -0700)]
lockfile: increase interval to prevent incorrect locking orders
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Greg Farnum [Thu, 15 Sep 2011 16:24:52 +0000 (09:24 -0700)]
lockfile: don't fail cleanup if no lock procs exist
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Tommi Virtanen [Fri, 16 Sep 2011 18:32:15 +0000 (11:32 -0700)]
workunit: Fetch source from github.
Needed an elaborate dance because Github won't let us download
an archive of a subdirectory.
Signed-off-by: Tommi Virtanen <tommi.virtanen@dreamhost.com>
Tommi Virtanen [Fri, 16 Sep 2011 18:09:45 +0000 (11:09 -0700)]
s3tests: Clone repository from github.
Signed-off-by: Tommi Virtanen <tommi.virtanen@dreamhost.com>
Tommi Virtanen [Fri, 16 Sep 2011 18:08:38 +0000 (11:08 -0700)]
coverage: Fetch source from github.
Signed-off-by: Tommi Virtanen <tommi.virtanen@dreamhost.com>
Samuel Just [Fri, 16 Sep 2011 00:26:03 +0000 (17:26 -0700)]
ceph.py: remove unused variables mds_daemons and mon_daemons
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
Samuel Just [Wed, 14 Sep 2011 23:31:58 +0000 (16:31 -0700)]
ceph.py/cephmanager.py: add ctx.daemons for restarting daemons
ctx.daemons will now be an instance of CephState.
Use ctx.daemons.get_daemon(role, id).stop() to stop a daemon, restart() to
restart it, etc.
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
Samuel Just [Wed, 14 Sep 2011 23:28:06 +0000 (16:28 -0700)]
testsnaps: LD_PRELOAD needed for librados
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
Tommi Virtanen [Tue, 13 Sep 2011 21:53:02 +0000 (14:53 -0700)]
Move orchestra to teuthology.orchestra so there's just one top-level package.
Tommi Virtanen [Tue, 13 Sep 2011 21:10:12 +0000 (14:10 -0700)]
Merge orchestra into teuthology.
There are too many things called Orchestra out there,
including Ubuntu's new multi-machine service orchestration
framework. The code might still be beneficial outside of
teuthology, but it can be spun off at that time.
Conflicts:
bootstrap
requirements.txt
setup.py
Tommi Virtanen [Fri, 9 Sep 2011 20:22:03 +0000 (13:22 -0700)]
Callers of task s3tests.create_users don't need to provide dummy "fixtures" dict.
Josh Durgin [Fri, 9 Sep 2011 17:31:08 +0000 (10:31 -0700)]
thrashosds: fix timeout when no options are specified
Josh Durgin [Fri, 9 Sep 2011 01:09:11 +0000 (18:09 -0700)]
thrashosds: fail if cluster doesn't finally become clean in 5 minutes
Josh Durgin [Thu, 8 Sep 2011 21:09:13 +0000 (14:09 -0700)]
thrasher: get coverage and cores from calling ceph commands
Josh Durgin [Thu, 8 Sep 2011 21:07:23 +0000 (14:07 -0700)]
thrashosds: wait for every pg to go active and clean before exiting
Josh Durgin [Thu, 8 Sep 2011 19:54:23 +0000 (12:54 -0700)]
thrasher: clean up a bit
Josh Durgin [Thu, 8 Sep 2011 00:50:12 +0000 (17:50 -0700)]
autotest: allow tests to be run on all clients
Josh Durgin [Wed, 7 Sep 2011 23:54:24 +0000 (16:54 -0700)]
rbd: allow specifying all clients
Greg Farnum [Tue, 6 Sep 2011 18:29:04 +0000 (11:29 -0700)]
locktest: don't fail cleanup if the dir doesn't exist
We're doing this the cheapest way possible: make the dir!
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Sage Weil [Sat, 3 Sep 2011 22:07:21 +0000 (15:07 -0700)]
teuthology: do a deep merge of input yaml fragments
Concatenate lists, and recursively combine dicts.
If you specify inputs like
foo:
- a
- b
and
foo:
- c
you should get
foo:
- a
- b
- c
Dicts should also be merged (last one wins), and the merging is deep. E.g.
foo:
  a:
    b:
      c: 1
and
foo:
  a:
    b:
      c: 2
is
foo:
  a:
    b:
      c: 2
Fixes: #1497
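The merge rules described in this commit message (concatenate lists, recursively combine dicts, last value wins otherwise) can be sketched roughly as follows. This is a minimal illustration, not the code from the commit: the function name `deep_merge` and its behavior on mismatched types are assumptions.

```python
def deep_merge(a, b):
    """Return the result of merging b into a (hypothetical helper).

    Lists are concatenated, dicts are combined key by key with recursion,
    and for anything else the later value (b) wins.
    """
    if isinstance(a, list) and isinstance(b, list):
        return a + b
    if isinstance(a, dict) and isinstance(b, dict):
        merged = dict(a)  # shallow copy so the input is not mutated
        for key, value in b.items():
            if key in merged:
                merged[key] = deep_merge(merged[key], value)
            else:
                merged[key] = value
        return merged
    # Scalars or mismatched types: assume the last fragment wins.
    return b


first = {"foo": ["a", "b"], "bar": {"a": {"b": {"c": 1}}}}
second = {"foo": ["c"], "bar": {"a": {"b": {"c": 2}}}}
result = deep_merge(first, second)
```

With the two fragments above, `result["foo"]` holds the concatenated list and `result["bar"]` keeps the nested structure with the later value for `c`.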
Josh Durgin [Sat, 3 Sep 2011 02:12:16 +0000 (19:12 -0700)]
lock: default to only listing machines you have locked
--all removes this restriction
Josh Durgin [Sat, 3 Sep 2011 00:58:19 +0000 (17:58 -0700)]
rgw: run as an external fastcgi server to match dho
Sage Weil [Fri, 2 Sep 2011 18:07:10 +0000 (11:07 -0700)]
don't eat exceptions for breakfast
fixes
0c2bee1514c1b1e65ca5d52459062e5a45da2d7b
Greg Farnum [Wed, 31 Aug 2011 21:40:55 +0000 (14:40 -0700)]
locktest: make it actually run the executable test
This was missing an argument (the file to run on!) and apparently
that didn't cause the command to output a failure return code.
Additionally, the ceph wrappers were blocking a crash and falsely
reporting success back to teuthology. (Yikes!)
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Josh Durgin [Thu, 1 Sep 2011 22:35:27 +0000 (15:35 -0700)]
nuke: synchronize clocks after reboot, and optionally synchronize all clocks
Sage Weil [Wed, 31 Aug 2011 20:56:42 +0000 (13:56 -0700)]
thrashosds: make it work when first mon isn't mon.0
Sage Weil [Wed, 31 Aug 2011 20:21:30 +0000 (13:21 -0700)]
thrashosds: no camelcaps, add some whitespace
Josh Durgin [Thu, 1 Sep 2011 17:44:46 +0000 (10:44 -0700)]
nuke: remove unused import
Josh Durgin [Thu, 1 Sep 2011 17:33:20 +0000 (10:33 -0700)]
nuke: localize imports again so they occur after gevent monkey-patching
This is necessary to make ssh work properly.
Josh Durgin [Thu, 1 Sep 2011 02:46:10 +0000 (19:46 -0700)]
nuke: reboot if rbd is mounted
Josh Durgin [Thu, 1 Sep 2011 00:43:14 +0000 (17:43 -0700)]
schedule: add a way to delete jobs from the queue
Josh Durgin [Thu, 1 Sep 2011 00:13:06 +0000 (17:13 -0700)]
parallel: don't hang if no tasks were spawned
This makes
6d919152178cfbd69dc5d50cdab40fc99db166a6 work.
Josh Durgin [Wed, 31 Aug 2011 23:48:58 +0000 (16:48 -0700)]
workunits: remove unused variable
Josh Durgin [Wed, 31 Aug 2011 21:36:32 +0000 (14:36 -0700)]
nuke: add option to reboot all nodes
Josh Durgin [Wed, 31 Aug 2011 21:36:01 +0000 (14:36 -0700)]
Fix pyflakes warnings.
Josh Durgin [Wed, 31 Aug 2011 00:21:36 +0000 (17:21 -0700)]
coverage: remove debugging
Josh Durgin [Wed, 31 Aug 2011 00:12:14 +0000 (17:12 -0700)]
workunit: save coverage and coredumps
Anything that runs a ceph utility should be using these commands.
Greg Farnum [Tue, 30 Aug 2011 22:48:58 +0000 (15:48 -0700)]
workunits: rework a little bit to allow "all" clients in a run
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Sage Weil [Wed, 24 Aug 2011 21:07:11 +0000 (14:07 -0700)]
cfuse: support running through valgrind
Also switch up the config code so we can take per-client options.
Greg Farnum [Mon, 29 Aug 2011 23:47:22 +0000 (16:47 -0700)]
valgrind: don't run valgrind_post if there's no valgrind
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Greg Farnum [Mon, 29 Aug 2011 20:58:09 +0000 (13:58 -0700)]
valgrind: scan logs for bad results
It's not sophisticated but it will warn you about a node
if at least one node has issues.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Greg Farnum [Mon, 29 Aug 2011 19:39:38 +0000 (12:39 -0700)]
valgrind: use xml output for tools that support it
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Josh Durgin [Mon, 29 Aug 2011 19:42:45 +0000 (12:42 -0700)]
suite: add option to send an email if the entire suite passed
Josh Durgin [Fri, 26 Aug 2011 00:11:33 +0000 (17:11 -0700)]
Generate coverage at the end of a suite run,
and optionally email failures and ongoing jobs.
Josh Durgin [Fri, 26 Aug 2011 00:09:03 +0000 (17:09 -0700)]
queue: delete every job when it finishes, so only running jobs are buried
Josh Durgin [Thu, 4 Aug 2011 01:08:14 +0000 (18:08 -0700)]
Add teuthology-coverage for analyzing test coverage for a suite run.
Josh Durgin [Tue, 14 Jun 2011 18:57:29 +0000 (11:57 -0700)]
Add scripts to analyze coverage for a single teuthology run.
Greg Farnum [Thu, 25 Aug 2011 22:27:30 +0000 (15:27 -0700)]
thrasher: improve documentation a little
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Greg Farnum [Thu, 25 Aug 2011 22:19:30 +0000 (15:19 -0700)]
thrasher: add option to mark OSDs down instead of out.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Greg Farnum [Thu, 25 Aug 2011 22:18:42 +0000 (15:18 -0700)]
thrasher: allow a config to set values
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Greg Farnum [Thu, 25 Aug 2011 21:38:34 +0000 (14:38 -0700)]
thrasher: remove redundant wait_till_clean()
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Greg Farnum [Wed, 24 Aug 2011 23:48:14 +0000 (16:48 -0700)]
coverage: create dir conditionally
We don't need to create the dir if we aren't using coverage.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Greg Farnum [Wed, 17 Aug 2011 21:44:39 +0000 (14:44 -0700)]
lockfile: add a lockfile task
This allows pretty highly configurable testing of
fcntl locking via a teuthology task.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Sage Weil [Wed, 24 Aug 2011 17:03:43 +0000 (10:03 -0700)]
lock: --list-targets: list locks and dump result in targets: yaml format.
Sage Weil [Wed, 24 Aug 2011 04:00:26 +0000 (21:00 -0700)]
check ceph cluster log for badness (ERR, WRN, SEC)
Sage Weil [Tue, 23 Aug 2011 05:04:57 +0000 (22:04 -0700)]
ceph: copy cluster log file to archive/ceph.log
Sage Weil [Mon, 22 Aug 2011 00:26:15 +0000 (17:26 -0700)]
workunits: set CEPH_CONF environment
This allows any ceph util we run (including the rados-api tests) to find
the config and keyrings they need.
Sage Weil [Sun, 21 Aug 2011 22:14:02 +0000 (15:14 -0700)]
rbd: make default image 10G instead of 1G
Sage Weil [Wed, 10 Aug 2011 20:34:38 +0000 (13:34 -0700)]
suite: support a suite consisting of multiple collections
suite = many collections, and maybe some shared files
collection = a collection of facets
facet = a config fragment
Greg Farnum [Wed, 17 Aug 2011 17:35:37 +0000 (10:35 -0700)]
valgrind: Document!
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Greg Farnum [Wed, 17 Aug 2011 17:32:57 +0000 (10:32 -0700)]
Merge branch 'wip-valgrind'
Greg Farnum [Wed, 17 Aug 2011 17:06:58 +0000 (10:06 -0700)]
include log in valgrind log file names
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Greg Farnum [Wed, 17 Aug 2011 17:05:13 +0000 (10:05 -0700)]
ceph task: split up arguments a little more
This allows selective daemon kill signal changes. With valgrind
daemons we want term instead of kill, for instance.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Greg Farnum [Wed, 17 Aug 2011 17:04:31 +0000 (10:04 -0700)]
valgrind: move valgrind logs to log dir
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Greg Farnum [Mon, 15 Aug 2011 22:35:42 +0000 (15:35 -0700)]
ceph: split up daemon-running arguments and insert valgrind ones
This setup should let us insert other kinds of things too, if we
need them.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Greg Farnum [Mon, 15 Aug 2011 22:32:23 +0000 (15:32 -0700)]
ceph: Set up valgrind as a flavor, and create a dir for logging.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Greg Farnum [Mon, 15 Aug 2011 22:31:18 +0000 (15:31 -0700)]
ceph task: pass the full config to the daemon startup subs
So far as I can tell there is no reason to reduce them to
the coverage config, and I want the full config for my
soon-to-exist valgrind options.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Tommi Virtanen [Mon, 15 Aug 2011 16:36:06 +0000 (09:36 -0700)]
Add assert to catch simple typos in roles list.
Input of "roles:\n- [mds,1]" used to make teuthology crash
in a non-obvious way.
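For illustration only, a check of this kind might look like the sketch below. In YAML, `[mds,1]` parses as a list holding the string `mds` and the integer `1` rather than a single role such as `mds.1`, so asserting that every role is a string catches the typo early. The function name `check_roles` and the error message are assumptions, not the code from this commit.

```python
def check_roles(roles):
    """Assert that every role in every host's role list is a string.

    Hypothetical helper: `roles` is the parsed value of the "roles:" key,
    i.e. a list of lists such as [["mon.0", "mds.0"], ["client.0"]].
    """
    for host_roles in roles:
        for role in host_roles:
            assert isinstance(role, str), (
                "role %r in %r is not a string; expected something like 'mds.1'"
                % (role, host_roles)
            )


# A well-formed roles list passes silently.
check_roles([["mon.0", "mds.0"], ["client.0"]])
```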
Greg Farnum [Wed, 10 Aug 2011 23:16:11 +0000 (16:16 -0700)]
Merge branch 'wip-nuke'
Conflicts:
teuthology/task/kernel.py
Greg Farnum [Tue, 9 Aug 2011 20:30:47 +0000 (13:30 -0700)]
manypools: remove commented-out code
This accidentally got left in from my development.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Greg Farnum [Wed, 10 Aug 2011 23:06:45 +0000 (16:06 -0700)]
teuthology-nuke: split the big main function
It was getting a bit big, but now all the functions fit on
one screen each.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Greg Farnum [Wed, 10 Aug 2011 22:38:57 +0000 (15:38 -0700)]
teuthology-nuke: move it into its own file.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Greg Farnum [Wed, 10 Aug 2011 21:19:23 +0000 (14:19 -0700)]
teuthology-nuke: identify and reboot machines with kernel mounts
This includes untested code for just force-unmounting them
when that works again, but for now it does a full reboot-and-
reconnect cycle.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Greg Farnum [Wed, 10 Aug 2011 17:55:02 +0000 (10:55 -0700)]
teuthology-nuke: use a more robust cfuse mount finder
This way it can remove cfuse mounts in any location on
the system.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Greg Farnum [Wed, 10 Aug 2011 17:47:50 +0000 (10:47 -0700)]
teuthology-nuke: split out different pieces into different loops
This will let us behave more intelligently on things like
nuking kernel mounts.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Greg Farnum [Wed, 10 Aug 2011 17:37:04 +0000 (10:37 -0700)]
Move reconnect function from kernel task to misc.py
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Tommi Virtanen [Wed, 10 Aug 2011 20:40:00 +0000 (13:40 -0700)]
Configure grub to default to the right kernel, not the greatest installed one.
This is sticky; that is, even if you install other kernels (manually/via fab/etc),
grub will keep booting up the one that was last enabled via teuthology config.
Use teuthology to switch kernels and it'll just work.
If the kernel the grub default points to is removed, grub will fall back to
booting the kernel with the greatest version number.
Closes: http://tracker.newdream.net/issues/1364
Tommi Virtanen [Wed, 10 Aug 2011 20:22:14 +0000 (13:22 -0700)]
Handle socket.timeout when waiting for a reconnect.
Now it gets ignored, just like the other harmless socket errors.
Tommi Virtanen [Wed, 10 Aug 2011 20:21:39 +0000 (13:21 -0700)]
Wait up to 300 seconds for a reboot.
At least sepia86 was reliably slower than the previous 180 second default.
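The behavior described in the two commits above (ignore socket.timeout like other harmless socket errors, and keep retrying for up to 300 seconds) could be sketched as below. This is an assumed shape, not the teuthology code: `wait_for_reconnect`, its parameters, and the choice of which exceptions count as harmless are all hypothetical.

```python
import socket
import time


def wait_for_reconnect(connect, timeout=300.0, interval=1.0, sleep=time.sleep):
    """Call connect() until it succeeds or `timeout` seconds have passed.

    socket.timeout and connection errors are treated as harmless while the
    machine reboots (an assumption); once the deadline has passed, the last
    error is re-raised.
    """
    deadline = time.monotonic() + timeout
    while True:
        try:
            return connect()
        except (socket.timeout, ConnectionError):
            if time.monotonic() >= deadline:
                raise
            sleep(interval)


# Usage with a stand-in connect function that fails twice before succeeding.
attempts = []

def fake_connect():
    attempts.append(1)
    if len(attempts) < 3:
        raise socket.timeout("still rebooting")
    return "connected"

outcome = wait_for_reconnect(fake_connect, sleep=lambda seconds: None)
```

Passing `sleep` as a parameter keeps the sketch easy to exercise without real delays.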
Sage Weil [Wed, 10 Aug 2011 19:47:20 +0000 (12:47 -0700)]
ceph: fix max_mds calculation
Signed-off-by: Sage Weil <sage@newdream.net>
Greg Farnum [Wed, 10 Aug 2011 00:17:08 +0000 (17:17 -0700)]
kernel: comment reconnect task, clean up reporting
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Greg Farnum [Tue, 9 Aug 2011 20:30:47 +0000 (13:30 -0700)]
manypools: remove commented-out code
This accidentally got left in from my development.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Tommi Virtanen [Tue, 9 Aug 2011 23:25:00 +0000 (16:25 -0700)]
Make rbd task use mnt.N not mnt.client.N as mountpoint.
Everything else expects this, so e.g. workunits wouldn't work with rbd.
Tommi Virtanen [Tue, 9 Aug 2011 23:11:32 +0000 (16:11 -0700)]
Make sure workunit task does not create mnt.N by itself.
This used to hide a bug in the rbd task, where rbd
created the mountpoint with the wrong name. The workunits
ended up running against the local filesystem.
Tommi Virtanen [Tue, 9 Aug 2011 22:42:17 +0000 (15:42 -0700)]
Add interactive-on-error, to pause and explore on error.
Closes: http://tracker.newdream.net/issues/1291
Stephon Striplin [Tue, 9 Aug 2011 20:43:46 +0000 (13:43 -0700)]
allow s3tests.create_users defaults to be overridden
Tommi Virtanen [Tue, 9 Aug 2011 20:40:56 +0000 (13:40 -0700)]
Add simple unit test for get_clients.
Sage Weil [Tue, 9 Aug 2011 20:23:58 +0000 (13:23 -0700)]
Revert "fix get_clients"
This reverts commit
83b6678e79904793bf31e82bbecad7bf16c1b2b5 . The bug I was
hitting was actually fixed by
06e3e69c293b20c0ce5df526fa923a979c1d8cfc .
Gregory Farnum [Mon, 1 Aug 2011 20:19:15 +0000 (13:19 -0700)]
teuthology: add task manypools
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Sage Weil [Fri, 5 Aug 2011 21:35:22 +0000 (14:35 -0700)]
new gitbuilder ref/branch naming
no origin_ prefix
Sage Weil [Thu, 4 Aug 2011 22:03:05 +0000 (15:03 -0700)]
cfuse, kclient: print remote host
Sage Weil [Thu, 4 Aug 2011 22:01:49 +0000 (15:01 -0700)]
fix get_clients
Only return the clients that are listed (not _all_ clients). There might
be a combination of cfuse and kclient (or other) clients here!
Sage Weil [Thu, 4 Aug 2011 17:41:50 +0000 (10:41 -0700)]
tasks/kclient: don't clobber remote
Sage Weil [Thu, 28 Jul 2011 17:28:57 +0000 (10:28 -0700)]
use coverage_dir