git.apps.os.sepia.ceph.com Git - ceph.git/log
Sage Weil [Fri, 21 Sep 2012 15:38:05 +0000 (08:38 -0700)]
smoke: add cls unit tests in validator

Sage Weil [Fri, 21 Sep 2012 15:36:58 +0000 (08:36 -0700)]
rados: run class unit tests through validators

Sage Weil [Fri, 21 Sep 2012 15:36:08 +0000 (08:36 -0700)]
move rgw tasks to separate suite

Sage Weil [Thu, 20 Sep 2012 22:40:50 +0000 (15:40 -0700)]
rados: test all rados classes

Sage Weil [Thu, 20 Sep 2012 22:35:51 +0000 (15:35 -0700)]
test_cls_rbd has moved

tamil [Mon, 10 Sep 2012 22:45:30 +0000 (15:45 -0700)]
changed the debug value for mds from 10 to 20

Signed-off-by: tamil <tamil.muthamizhan@inktank.com>

Sage Weil [Tue, 31 Jul 2012 15:06:31 +0000 (08:06 -0700)]
rbd: add msgr failure injection

Sage Weil [Tue, 31 Jul 2012 18:54:02 +0000 (11:54 -0700)]
avoid doing filestore idempotency tester 2x w/ and w/o msgr failures

Sage Weil [Tue, 31 Jul 2012 15:05:11 +0000 (08:05 -0700)]
rados: add msgr failure injection

Sage Weil [Tue, 21 Aug 2012 20:01:28 +0000 (13:01 -0700)]
move kclient + blogbench to marginal

Periodically fails #1945

Sage Weil [Tue, 21 Aug 2012 17:57:32 +0000 (10:57 -0700)]
include mds debugging on ffsb

hopefully we can track down #1947

Sage Weil [Mon, 20 Aug 2012 20:54:36 +0000 (13:54 -0700)]
marginal: remove verify collection (unused)

Sage Weil [Wed, 1 Aug 2012 03:37:00 +0000 (20:37 -0700)]
crank up pjd debugging

Sage Weil [Tue, 31 Jul 2012 16:55:45 +0000 (09:55 -0700)]
separate regression suite into topical categories rados, rbd, fs

Sage Weil [Sat, 28 Jul 2012 17:54:52 +0000 (10:54 -0700)]
add osd-recovery-incomplete

Sage Weil [Fri, 27 Jul 2012 20:42:51 +0000 (13:42 -0700)]
fix adminsocket test

Sage Weil [Wed, 25 Jul 2012 04:36:38 +0000 (21:36 -0700)]
admin-socket: test generic admin socket commands

Sage Weil [Mon, 23 Jul 2012 03:50:12 +0000 (20:50 -0700)]
marginal kclient+ffsb: enable mds logging to catch badness

See #1947

Sage Weil [Sun, 22 Jul 2012 04:08:59 +0000 (21:08 -0700)]
move misc, blogbench back into active kernel suite

these were removed from regression ages ago, and only recently put back in
marginal.  they seem fine.

Sage Weil [Mon, 23 Jul 2012 03:47:32 +0000 (20:47 -0700)]
move all kernel tests to kernel suite; symlink collections from regression

Make regression a union of other topical suites.

Sage Weil [Sun, 22 Jul 2012 03:59:04 +0000 (20:59 -0700)]
this fails reliably

Sage Weil [Sat, 21 Jul 2012 00:36:43 +0000 (17:36 -0700)]
regression: do some tests on ext4

Sage Weil [Fri, 20 Jul 2012 20:14:28 +0000 (13:14 -0700)]
move cfuse+dbench back to regression for verify, too

Sage Weil [Wed, 18 Jul 2012 03:05:30 +0000 (20:05 -0700)]
move cfuse + dbench from marginal to regression

Fixed #1737, yay!

Sage Weil [Mon, 16 Jul 2012 17:35:25 +0000 (10:35 -0700)]
move cfuse + ffsb from marginal to regression

This has had no failures.

Sage Weil [Mon, 16 Jul 2012 16:41:35 +0000 (09:41 -0700)]
move cfuse + fsx back into regression suite

No failures in marginal.  The objectcacher fixes that came out of the
rbd_fsx stuff probably fixed the original problem?

Sage Weil [Thu, 12 Jul 2012 23:05:12 +0000 (16:05 -0700)]
fix wrongly marked down whitelist

This used to have '...or wrong addr' but it doesn't any more.

Josh Durgin [Wed, 11 Jul 2012 17:59:08 +0000 (10:59 -0700)]
rbd: test with layering enabled

RBD_FEATURES=0 hits a bug that's fixed in wip-rbd-parent.

Once that's merged, we can add RBD_FEATURES=0 tests back in.

Sage Weil [Wed, 11 Jul 2012 15:27:30 +0000 (08:27 -0700)]
ffsb is marginal, remove from smoke suite

Sage Weil [Wed, 11 Jul 2012 03:26:25 +0000 (20:26 -0700)]
Revert "smoke: add msgr failures"

This reverts commit 9278e231e64f49c3205c2ded8b1f2d3b27265eac.

Sage Weil [Wed, 11 Jul 2012 02:57:56 +0000 (19:57 -0700)]
move cfuse fsx into marginal suite

This should probably pass, given the testing that ObjectCacher gets these
days with librbd_fsx.

Sage Weil [Wed, 11 Jul 2012 02:56:39 +0000 (19:56 -0700)]
remove suites/stress/basic

Sage Weil [Wed, 11 Jul 2012 02:56:01 +0000 (19:56 -0700)]
move some old flaky tasks into marginal suite

These were pulled out of regression a while ago.  Put them into the
marginal suite where they will be regularly run and we can evaluate the
severity of the problems they cause.

Sage Weil [Sat, 7 Jul 2012 00:04:02 +0000 (17:04 -0700)]
move qemu_iozone test to marginal suite

Samuel Just [Fri, 6 Jul 2012 17:02:29 +0000 (10:02 -0700)]
increase thrashosds timeout

Sage Weil [Wed, 4 Jul 2012 19:46:03 +0000 (12:46 -0700)]
move other ffsb workloads to marginal suite

Sage Weil [Wed, 4 Jul 2012 00:39:59 +0000 (17:39 -0700)]
move locktest to marginal suite

This fails 1 in 10 times or something like that.

Sage Weil [Sun, 1 Jul 2012 22:36:50 +0000 (15:36 -0700)]
smoke: add msgr failures

Sage Weil [Mon, 2 Jul 2012 19:26:10 +0000 (12:26 -0700)]
fewer hosts for mon tests

Sage Weil [Sun, 1 Jul 2012 21:27:38 +0000 (14:27 -0700)]
add rbd_xfstests to kernel suite

Josh Durgin [Fri, 29 Jun 2012 18:02:29 +0000 (11:02 -0700)]
qemu_iozone: use a larger image

The default is not large enough.

Sage Weil [Fri, 29 Jun 2012 16:12:51 +0000 (09:12 -0700)]
kernel suite

Sage Weil [Tue, 26 Jun 2012 04:21:33 +0000 (21:21 -0700)]
include ceph task in librbd collection

Sage Weil [Mon, 25 Jun 2012 22:30:27 +0000 (15:30 -0700)]
move kclient_workunit_suites_ffsb to marginal suite

until #1947 is fixed

Josh Durgin [Fri, 22 Jun 2012 01:18:03 +0000 (18:18 -0700)]
Add some tests inside qemu for the librbd suite

Josh Durgin [Fri, 22 Jun 2012 01:16:28 +0000 (18:16 -0700)]
Move librbd tests to rbd suite

This lets us generate jobs with different caching settings instead of
hardcoding them.

Sage Weil [Wed, 20 Jun 2012 18:23:20 +0000 (11:23 -0700)]
move cfuse + dbench task that triggers #1737 to marginal suite

Sage Weil [Sun, 17 Jun 2012 15:58:59 +0000 (08:58 -0700)]
don't dup ceph task for new fsx jobs

Josh Durgin [Fri, 15 Jun 2012 18:59:43 +0000 (11:59 -0700)]
Run fsx on rbd with thrashing

Josh Durgin [Fri, 15 Jun 2012 18:55:33 +0000 (11:55 -0700)]
Increase number of ops done by fsx against rbd.

Especially in the no-cache case, this should detect more races. The
fiemap problem is detectable on plana after ~5000 fsx ops.

Sage Weil [Thu, 14 Jun 2012 21:06:34 +0000 (14:06 -0700)]
add radosgw-admin test to regression suite

We wrote this test ages ago, but forgot to add it!  Fixed up a few things
that have changed since then.

Josh Durgin [Mon, 11 Jun 2012 05:37:12 +0000 (22:37 -0700)]
Add test for cls_rbd

Josh Durgin [Mon, 11 Jun 2012 04:44:55 +0000 (21:44 -0700)]
Test old and new rbd formats

Josh Durgin [Mon, 11 Jun 2012 04:26:50 +0000 (21:26 -0700)]
Update for new workunit task syntax

Sage Weil [Fri, 8 Jun 2012 21:35:56 +0000 (14:35 -0700)]
regression: fix new rados, rbd test yamls

Don't start cluster twice!

Sage Weil [Fri, 8 Jun 2012 18:55:30 +0000 (11:55 -0700)]
run rados, rbd api tests under thrashing

Sage Weil [Thu, 31 May 2012 23:44:24 +0000 (16:44 -0700)]
add rados_stress_watch to regression

Sage Weil [Tue, 8 May 2012 23:07:10 +0000 (16:07 -0700)]
rbd_fsx in write-through mode

Sage Weil [Tue, 1 May 2012 03:11:44 +0000 (20:11 -0700)]
use fewer nodes for the simple singleton tasks

Sage Weil [Thu, 19 Apr 2012 20:33:32 +0000 (13:33 -0700)]
add rbd_fsx_[no]cache jobs to regression suite

Sage Weil [Wed, 18 Apr 2012 22:19:49 +0000 (15:19 -0700)]
gather logs for cfuse dbench workload, hopefully catch #1737

Sage Weil [Mon, 16 Apr 2012 03:39:56 +0000 (20:39 -0700)]
dump_stuck: whitelist 'wrongly marked me down'

The test marks the osds down.. they may generate this error if they get
that faster than they get the signal via the daemon-wrapper.

Sage Weil [Sat, 14 Apr 2012 05:27:24 +0000 (22:27 -0700)]
add rbd_xfstests to regression suite

Sage Weil [Fri, 13 Apr 2012 05:56:09 +0000 (22:56 -0700)]
move tasks:cfuse_workunit_suites_dbench.yaml to stress pending #1737 fix

Sage Weil [Sun, 25 Mar 2012 04:47:15 +0000 (21:47 -0700)]
add smoke suite

This could probably be collapsed into a bunch of singleton tasks to make
it simpler to track how many actual jobs result, but it was simpler to
make it a subset of regression.  And probably that'll be easier to maintain
moving forward.

Tried to avoid any jobs that took more than 10 minutes (tho there are a few
in here).  Kept both valgrind and lockdep jobs, and dropped many of those
from the basic collection (esp api tests).

We'll see how long this takes on plana and adjust up/down from there,
depending on how long we want to wait for it.

Sage Weil [Sat, 24 Mar 2012 23:07:47 +0000 (16:07 -0700)]
add osd-recovery test

Sage Weil [Sat, 24 Mar 2012 23:07:38 +0000 (16:07 -0700)]
renamed backfill -> osd_backfill

Sage Weil [Wed, 14 Mar 2012 22:51:51 +0000 (15:51 -0700)]
disable rbd thrash workload, #2174

Sage Weil [Thu, 15 Mar 2012 17:32:39 +0000 (10:32 -0700)]
Revert "disable rbd thrash workload, #2174"

This reverts commit 1bec416c7c7ff8a6462d94baaba8e7da73e88ab4.

Fixed with #2174

Sage Weil [Wed, 14 Mar 2012 22:51:51 +0000 (15:51 -0700)]
disable rbd thrash workload, #2174

Sage Weil [Tue, 13 Mar 2012 17:49:33 +0000 (10:49 -0700)]
thrash: put client on separate machine from osds

This allows us to run kernel clients (kclient, rbd) against the thrashing
cluster.

Sage Weil [Mon, 12 Mar 2012 22:22:17 +0000 (15:22 -0700)]
remove dup ceph tasks from new thrash workloads

Sage Weil [Mon, 12 Mar 2012 04:50:03 +0000 (21:50 -0700)]
clusters/fixed-3.yaml: 2 -> 6 osds

plana nodes have 3 scratch disks... use them!

Sage Weil [Mon, 12 Mar 2012 04:32:45 +0000 (21:32 -0700)]
Revert "disable s3tests on valgrind/lockdep until #2103 is fixed"

This reverts commit 9f757ca9511374f6565d74263e242c74e39f8a3f.

Sage Weil [Mon, 12 Mar 2012 04:28:45 +0000 (21:28 -0700)]
add rbd, kclient workloads to regression thrash collection

This will get us some kernel osd_client osd restart coverage.

Sage Weil [Sun, 11 Mar 2012 20:03:41 +0000 (13:03 -0700)]
fix typo, ceph-fyuse -> ceph-fuse

Sage Weil [Sun, 11 Mar 2012 04:01:57 +0000 (20:01 -0800)]
use dbench workunit, not the autotest one

The autotest one uses an old tarball that doesn't build.  Workunit assumes
the dbench package is installed.

Sage Weil [Fri, 2 Mar 2012 06:04:15 +0000 (22:04 -0800)]
disable s3tests on valgrind/lockdep until #2103 is fixed

Josh Durgin [Wed, 29 Feb 2012 23:45:25 +0000 (15:45 -0800)]
dump-stuck: set pg stuck threshold to match test

Sage Weil [Mon, 27 Feb 2012 22:52:35 +0000 (14:52 -0800)]
no peer as part of lost_unfound

Sage Weil [Mon, 27 Feb 2012 01:09:41 +0000 (17:09 -0800)]
move peer to separate test for now

Sage Weil [Sun, 26 Feb 2012 05:35:31 +0000 (21:35 -0800)]
lost_unfound: do peer after, until wait_for_clean propagates last_epoch_started

The peer task does wait_for_clean, and then lost_unfound immediately marks
something down.  But the PGs become clean before the replica last_epoch_started
is moved forward in time, which means they block waiting for the now down
OSD.  Needlessly.

Until we fix this, just do the peer test after.

Sage Weil [Sat, 25 Feb 2012 05:39:55 +0000 (21:39 -0800)]
fix lockdep.yaml conf syntax

Sage Weil [Fri, 24 Feb 2012 23:20:00 +0000 (15:20 -0800)]
run radosgw through valgrind for s3tests

Sage Weil [Fri, 24 Feb 2012 23:04:27 +0000 (15:04 -0800)]
do peer test along with lost_unfound

Sage Weil [Fri, 24 Feb 2012 20:49:26 +0000 (12:49 -0800)]
rename valgrind -> verify, add in runs under lockdep

Josh Durgin [Wed, 22 Feb 2012 00:21:05 +0000 (16:21 -0800)]
Add test for 'ceph pg dump_stuck'

Sage Weil [Tue, 21 Feb 2012 18:02:44 +0000 (10:02 -0800)]
add valgrind collection to regression suite

Run a smaller set of tests with valgrind on the mon, osd, and mds.

Valgrind is currently ignoring leaks, but this will pick up use-after-free
and similar badness.

Sage Weil [Mon, 20 Feb 2012 20:49:35 +0000 (12:49 -0800)]
cfuse -> ceph-fuse

Sage Weil [Sat, 18 Feb 2012 21:56:47 +0000 (13:56 -0800)]
thrashing: whitelist 'objects unfound and apparently lost' message

This can happen when we mark OSDs down... if the objects are found when
the osds come back up then we're fine.  if not, it won't go clean, and the
test will fail for that reason.

Sage Weil [Wed, 15 Feb 2012 05:49:26 +0000 (21:49 -0800)]
add regression/multifs collection; run rgw tests under both xfs and btrfs

Sage Weil [Tue, 14 Feb 2012 16:58:30 +0000 (08:58 -0800)]
rename fs files

Sage Weil [Tue, 14 Feb 2012 00:45:04 +0000 (16:45 -0800)]
regression/thrash on xfs and btrfs both

Sage Weil [Mon, 13 Feb 2012 23:29:52 +0000 (15:29 -0800)]
btrfs: 1 -> fs: btrfs

Sage Weil [Sat, 11 Feb 2012 21:40:44 +0000 (13:40 -0800)]
add snap thrashing covering a small number of objects

The snaps-many-objects has a relatively low density of ops-per-object. This
hammers on a small number of them and does a better job of validating the
correctness wrt snaps.

Sage Weil [Sat, 11 Feb 2012 21:39:46 +0000 (13:39 -0800)]
move snap thrashing back into regression suite

Sage Weil [Sat, 11 Feb 2012 00:40:03 +0000 (16:40 -0800)]
move kclient_workunit_suites_blogbench.yaml to stress suite

This is consistently failing due to an mds/kclient interaction.

Sage Weil [Wed, 1 Feb 2012 00:37:57 +0000 (16:37 -0800)]
add backfill test

Sage Weil [Sun, 29 Jan 2012 05:11:32 +0000 (21:11 -0800)]
make 6-osd-2-machine simpler... single monitor

Josh Durgin [Sat, 28 Jan 2012 02:08:31 +0000 (18:08 -0800)]
regression: add admin socket test for objecter requests.