]> git.apps.os.sepia.ceph.com Git - ceph.git/log
ceph.git
12 years agoceph_manager: add test_map_discontinuity to thrasher
Samuel Just [Fri, 26 Jul 2013 02:43:08 +0000 (19:43 -0700)]
ceph_manager: add test_map_discontinuity to thrasher

Signed-off-by: Samuel Just <sam.just@inktank.com>
12 years agosamba: run lsof and fuser after shutdown
Greg Farnum [Thu, 25 Jul 2013 18:04:50 +0000 (11:04 -0700)]
samba: run lsof and fuser after shutdown

The clients are pretty regularly reporting busy on unmount when
samba runs above them. This will hopefully give us some info about why.

Signed-off-by: Greg Farnum <greg@inktank.com>
12 years agoMerge branch 'master' of github.com:ceph/teuthology
Sandon Van Ness [Fri, 26 Jul 2013 02:50:39 +0000 (19:50 -0700)]
Merge branch 'master' of github.com:ceph/teuthology

12 years agoMerge remote-tracking branch 'origin/wip-sandon-vm'
Sandon Van Ness [Fri, 26 Jul 2013 02:50:02 +0000 (19:50 -0700)]
Merge remote-tracking branch 'origin/wip-sandon-vm'

Conflicts:
teuthology/lock.py
teuthology/misc.py
teuthology/task/install.py

12 years agoadded test_mon_destroy option to ceph-deploy task
tamil [Fri, 26 Jul 2013 00:12:52 +0000 (17:12 -0700)]
added test_mon_destroy option to ceph-deploy task

Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
12 years agoAdded --os-type argument to run.py
Sandon Van Ness [Thu, 25 Jul 2013 22:33:11 +0000 (15:33 -0700)]
Added --os-type argument to run.py

Because for some reason it was never there before?

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
12 years agoAdded get_distro() to misc.py
Sandon Van Ness [Thu, 25 Jul 2013 21:45:02 +0000 (14:45 -0700)]
Added get_distro() to misc.py

Since getting the ostype is used multiple places I made a
function for it and modified the existing code to use
said function. I also added tests for the function.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
12 years agoFix priority so it only is added once.
Sandon Van Ness [Thu, 25 Jul 2013 21:24:43 +0000 (14:24 -0700)]
Fix priority so it only is added once.

Needed for baremetal so the sed doesn't keep adding
the priority line in the repos file.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
12 years agoceph_manager: try both new and old tell mon.* syntax
Sage Weil [Thu, 25 Jul 2013 20:54:20 +0000 (13:54 -0700)]
ceph_manager: try both new and old tell mon.* syntax

So we can test old code with new teuthology (e.g., upgrade testing)

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoschedule_suite: less ceph-deploy client debugging
Sage Weil [Thu, 25 Jul 2013 19:46:04 +0000 (12:46 -0700)]
schedule_suite: less ceph-deploy client debugging

12 years agoschedule_suite.sh: select ceph-dpeloy branch
Sage Weil [Thu, 25 Jul 2013 19:45:52 +0000 (12:45 -0700)]
schedule_suite.sh: select ceph-dpeloy branch

12 years agoschedule_suite.sh: don't need full ms debug on mons
Sage Weil [Thu, 25 Jul 2013 16:10:47 +0000 (09:10 -0700)]
schedule_suite.sh: don't need full ms debug on mons

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoceph: make restart take both a list of daemons and other arguments
Sage Weil [Wed, 24 Jul 2013 20:14:02 +0000 (13:14 -0700)]
ceph: make restart take both a list of daemons and other arguments

Signed-off-by: Sage Weil <sage@inktank.com>
Reviewed-by: Tamil Muthamizhan <tamil@inktank.com>
12 years agoteuth: rgw task example config update
Joe Buck [Wed, 24 Jul 2013 18:44:53 +0000 (11:44 -0700)]
teuth: rgw task example config update

Due to bug #5716, pools need to start with a '.' at present.
Updating the examples to follow this convention.

Signed-off-by: Joe Buck <jbbuck@gmail.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
12 years agoteuth: fix issue in cleanup code
Joe Buck [Wed, 24 Jul 2013 19:24:33 +0000 (12:24 -0700)]
teuth: fix issue in cleanup code

The post-yield code in create_dirs needed to
be tweaked to correctly delete the {tdir}/apache
directory (if it exists) on each client.

Signed-off-by: Joe Buck <jbbuck@gmail.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
12 years agoceph: wait-for-osds-up option for restart
Sage Weil [Tue, 23 Jul 2013 23:55:43 +0000 (16:55 -0700)]
ceph: wait-for-osds-up option for restart

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoceph: wait_for_osds_up
Sage Weil [Tue, 23 Jul 2013 23:55:33 +0000 (16:55 -0700)]
ceph: wait_for_osds_up

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agorgw: add multi-region and zone support
Josh Durgin [Mon, 22 Jul 2013 17:38:28 +0000 (10:38 -0700)]
rgw: add multi-region and zone support

Take client<->zone/region and the associated pools from ceph.conf, so
we don't have to invent a new format to specify it.

General region info is added to a new configuration section in the rgw
task. Each client is assumed to be a different zone, and a system user
is created with the key specified in the yaml, so it can be passed to
later task configuration as well. This isn't strictly necessary, but
avoids having to lookup this info in later tasks through something
like radosgw-admin.

Ports are allocated automatically because there's no obvious mapping
from host to client in the task configuration. Later tests can get the
endpoints desired by reading the region map.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
12 years agotask_util: move rados command here
Josh Durgin [Mon, 22 Jul 2013 21:21:51 +0000 (14:21 -0700)]
task_util: move rados command here

Six copies are replaced with one, with an added option to check status
automatically. This should probably be used in a few places where the
return code is ignored.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
12 years agorgw: move common rgwadmin function to a new utility file
Josh Durgin [Fri, 19 Jul 2013 22:16:16 +0000 (15:16 -0700)]
rgw: move common rgwadmin function to a new utility file

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
12 years agoteuth: reworked rgw to support regions / zones
Joe Buck [Sat, 13 Jul 2013 18:28:57 +0000 (11:28 -0700)]
teuth: reworked rgw to support regions / zones

Signed-off-by: Joe Buck <jbbuck@gmail.com>
12 years agoteuthology: updating RGW task to support regions
Joe Buck [Thu, 11 Jul 2013 00:45:35 +0000 (17:45 -0700)]
teuthology: updating RGW task to support regions

Extending the rgw.py task to allow for regions and zones
to be specified in the YAML file.

Signed-off-by: Joe Buck <jbbuck@gmail.com>
12 years agoRe-create guest if it already exists
Sandon Van Ness [Tue, 23 Jul 2013 23:10:24 +0000 (16:10 -0700)]
Re-create guest if it already exists

In some cases tests fail or nuke fails and the guest is
not properly destroyed. This will look to see if it gets
an error due to the guest already existing or its disks
existing and will re-create the guest.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
12 years agoAdd distro type to schedule_suite.sh
Sandon Van Ness [Tue, 23 Jul 2013 23:09:07 +0000 (16:09 -0700)]
Add distro type to schedule_suite.sh

Also added machinetype to the name.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
12 years agoWait a little longer before recreating VMs
Sandon Van Ness [Tue, 23 Jul 2013 22:52:32 +0000 (15:52 -0700)]
Wait a little longer before recreating VMs

Just to allow for the create to still work incase the os
volume is fairly large (takes a while to resize) and in
case the host machine is bogged down due to disk I/O.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
12 years agoUse os_type instead of vm_type. Add os_version
Sandon Van Ness [Tue, 23 Jul 2013 22:50:50 +0000 (15:50 -0700)]
Use os_type instead of vm_type. Add os_version

Use os_type instead of vm_type for more generic naming
for when we start re-imaging bare metal. Also added a
os_version dictionary for default versions of distros
that we want over-riding what downburst defaults are.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
12 years agoceph: add wait_for_mon_quorum command
Sage Weil [Tue, 23 Jul 2013 21:43:56 +0000 (14:43 -0700)]
ceph: add wait_for_mon_quorum command

tasks:
...
- ceph.wait_for_mon_quorum: [a, b]
...

will block until the mon quorum consists of exactly [a, b].  This is
compared directly to the relevant field from 'ceph quorum_status'
which has the alphanumeric names only.

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agosequential, parallel: allow entries to be references to top-level config
Sage Weil [Mon, 22 Jul 2013 20:03:24 +0000 (13:03 -0700)]
sequential, parallel: allow entries to be references to top-level config

Often we want to build a test collection that substitutes different
sequences of tasks into a parallel/sequential construction.  However, the
yaml combination that happens when generating jobs is not smart enough to
substitute some fragment into a deeply-nested piece of yaml.

Instead, make these sequences top-level entries in the config dict, and
reference them.  For example:

tasks:
- install:
- ceph:
- parallel:
  - workload
  - upgrade-sequence
workload:
  workunit:
    - something
upgrade-sequence:
  install.restart: [osd.0, osd.1]

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agothrashosds: add delay option after recovery
Samuel Just [Mon, 22 Jul 2013 23:24:41 +0000 (16:24 -0700)]
thrashosds: add delay option after recovery

Signed-off-by: Samuel Just <sam.just@inktank.com>
12 years agotask: mon_clock_skew_check: grab max-skew value from ceph-mon's config
Joao Eduardo Luis [Mon, 22 Jul 2013 16:58:41 +0000 (17:58 +0100)]
task: mon_clock_skew_check: grab max-skew value from ceph-mon's config

Instead of relying on hardcoded values, obtain the max-skew default from
'ceph-mon --show-config-value mon_clock_drift_allowed' to match the mon's
expectation.

Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
12 years agoadmin_socket: loop until the socket command succeeds
Sage Weil [Fri, 19 Jul 2013 22:10:43 +0000 (15:10 -0700)]
admin_socket: loop until the socket command succeeds

Sometimes the thing we're talking to is slow to start, or to register the
command we are running.  Loop in that case, at least for a while.

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoapache.conf: load modules conditionally
Yehuda Sadeh [Sat, 20 Jul 2013 18:30:24 +0000 (11:30 -0700)]
apache.conf: load modules conditionally

Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
12 years agoceph-deploy: do not test mon destroy by default
Sage Weil [Sat, 20 Jul 2013 05:19:09 +0000 (22:19 -0700)]
ceph-deploy: do not test mon destroy by default

12 years agotask: mon_clock_skew_check: by default, use max skew from global config
Joao Eduardo Luis [Thu, 18 Jul 2013 16:31:27 +0000 (17:31 +0100)]
task: mon_clock_skew_check: by default, use max skew from global config

If not defined, defaults to 0.05;  if 'max-skew' however is defined, it
must override whatever is on the config.

Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
12 years agotask: mon_clock_skew_check: missing 'str'.format() key crashed the test
Joao Eduardo Luis [Thu, 18 Jul 2013 14:58:23 +0000 (15:58 +0100)]
task: mon_clock_skew_check: missing 'str'.format() key crashed the test

Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
12 years agomon_thrash: tolerate scrub command failure
Sage Weil [Sat, 20 Jul 2013 04:53:33 +0000 (21:53 -0700)]
mon_thrash: tolerate scrub command failure

We can get EBUSY if there is an election going on.

12 years agoMerge branch 'wip-teuthREAD-wusui'
Warren Usui [Sat, 20 Jul 2013 02:18:28 +0000 (19:18 -0700)]
Merge branch 'wip-teuthREAD-wusui'

12 years agoUpdate to describe tasks and parameters to tasks, including the install
Warren Usui [Fri, 19 Jul 2013 03:00:38 +0000 (20:00 -0700)]
Update to describe tasks and parameters to tasks, including the install
parameters requested in 4470.  Added more information to the vm section,
and included a section documenting the test suites.

Signed-off-by: Warren Usui <warren.usui@inktank.com>
Fixes: 4470
Reviewed By: Dan Mick and Alfredo Deza

Include Alfredo's suggestions

12 years agoMerge branch 'wip-centos-rgw'
Josh Durgin [Fri, 19 Jul 2013 21:44:51 +0000 (14:44 -0700)]
Merge branch 'wip-centos-rgw'

12 years agorgw: turn off continue print on centos
Josh Durgin [Fri, 19 Jul 2013 21:42:38 +0000 (14:42 -0700)]
rgw: turn off continue print on centos

Also note that a ceph.conf setting is needed for that too

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
12 years agoinstall: remove ceph-release rpm file after it's used
Josh Durgin [Fri, 19 Jul 2013 21:36:18 +0000 (14:36 -0700)]
install: remove ceph-release rpm file after it's used

No need for it anymore, it's just taking up space

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
12 years agoMerge remote-tracking branch 'gh/next'
Sage Weil [Fri, 19 Jul 2013 18:27:19 +0000 (11:27 -0700)]
Merge remote-tracking branch 'gh/next'

12 years agosleep: new task
Sage Weil [Fri, 19 Jul 2013 18:26:58 +0000 (11:26 -0700)]
sleep: new task

12 years agoMerge pull request #18 from alfredodeza/master
Sage Weil [Fri, 19 Jul 2013 17:10:15 +0000 (10:10 -0700)]
Merge pull request #18 from alfredodeza/master

Safer dict.get in queues.py

12 years agofailed dict.get evaluates to None
Alfredo Deza [Fri, 19 Jul 2013 13:01:28 +0000 (09:01 -0400)]
failed dict.get evaluates to None

12 years agosafer get by falling back to a dictionary
Alfredo Deza [Fri, 19 Jul 2013 12:59:53 +0000 (08:59 -0400)]
safer get by falling back to a dictionary

12 years agoMerge branch 'wip-machine-based-workers' into next
Sandon Van Ness [Fri, 19 Jul 2013 02:52:25 +0000 (19:52 -0700)]
Merge branch 'wip-machine-based-workers' into next

12 years agoMerge branch 'wip-machine-based-workers'
Sandon Van Ness [Fri, 19 Jul 2013 02:37:25 +0000 (19:37 -0700)]
Merge branch 'wip-machine-based-workers'

12 years agoceph: do not ignore osd leaks
Sage Weil [Wed, 17 Jul 2013 18:20:01 +0000 (11:20 -0700)]
ceph: do not ignore osd leaks

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agonuke: killall ceph-disk, too
Sage Weil [Thu, 18 Jul 2013 19:31:11 +0000 (12:31 -0700)]
nuke: killall ceph-disk, too

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoWorker processes by machine type instead of teuthology branch.
Sandon Van Ness [Thu, 18 Jul 2013 19:04:08 +0000 (12:04 -0700)]
Worker processes by machine type instead of teuthology branch.

teuthology-suite and schedulewill now take --worker instead of
--branch. The branch is set by setting teuthology_branch in the
yaml used to schedule the job.

The teuthology branches are assumed to be in ~/teuthology-$branch
of whatever user is running the workers.

12 years agoschedule_suite.sh: escape ceph-deploy overrides
Sage Weil [Thu, 18 Jul 2013 18:38:00 +0000 (11:38 -0700)]
schedule_suite.sh: escape ceph-deploy overrides

12 years agoceph-deploy: support overrides
Sage Weil [Thu, 18 Jul 2013 18:21:07 +0000 (11:21 -0700)]
ceph-deploy: support overrides

Something like

overrides:
  ceph-deploy:
    foo: bar

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoMerge remote-tracking branch 'gh/next'
Sage Weil [Thu, 18 Jul 2013 04:33:50 +0000 (21:33 -0700)]
Merge remote-tracking branch 'gh/next'

12 years agoMerge branch 'wip-machine-type'
Sage Weil [Thu, 18 Jul 2013 03:59:54 +0000 (20:59 -0700)]
Merge branch 'wip-machine-type'

Reviewed-by: Sandon Van Ness <sandon@inktank.com>
12 years agolock: filter machine type for --list, --list-targets
Sage Weil [Sat, 13 Jul 2013 20:11:40 +0000 (13:11 -0700)]
lock: filter machine type for --list, --list-targets

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agolock: make --summary list all machines by default
Sage Weil [Sat, 13 Jul 2013 20:09:15 +0000 (13:09 -0700)]
lock: make --summary list all machines by default

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agolock: drop machine-type default, but require for lock-many
Sage Weil [Sat, 13 Jul 2013 20:09:07 +0000 (13:09 -0700)]
lock: drop machine-type default, but require for lock-many

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoceph.conf.template: enable osd debug verify stray on activate
Samuel Just [Thu, 18 Jul 2013 01:14:58 +0000 (18:14 -0700)]
ceph.conf.template: enable osd debug verify stray on activate

Signed-off-by: Samuel Just <sam.just@inktank.com>
12 years agoradosgw-admin: adapt task to recent changes
Yehuda Sadeh [Wed, 17 Jul 2013 21:05:26 +0000 (14:05 -0700)]
radosgw-admin: adapt task to recent changes

Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
12 years agoMerge branch 'master' of github.com:ceph/teuthology
tamil [Wed, 17 Jul 2013 00:41:57 +0000 (17:41 -0700)]
Merge branch 'master' of github.com:ceph/teuthology

12 years agoadded overrides for ceph-deploy
tamil [Wed, 17 Jul 2013 00:41:32 +0000 (17:41 -0700)]
added overrides for ceph-deploy

Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
12 years agoworkunit: set CEPH_CLI_TEST_DUP_COMMAND
Sage Weil [Wed, 17 Jul 2013 00:15:55 +0000 (17:15 -0700)]
workunit: set CEPH_CLI_TEST_DUP_COMMAND

This will make the CLI do every mon command twice and make sure they both
succeed.  This catches problems with mon command idempotency faster than
waiting for random failures trigger.

12 years agoadded conf section to ceph-deploy task
tamil [Wed, 17 Jul 2013 00:14:33 +0000 (17:14 -0700)]
added conf section to ceph-deploy task

Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
12 years agoCreated tasktest to test sequential and parallel tasks.
Warren Usui [Fri, 12 Jul 2013 03:24:09 +0000 (20:24 -0700)]
Created tasktest to test sequential and parallel tasks.
Added sequential task and parallel task.
Changed _run_one_task to run_one_task (now called by new tasks too).

Fix #4969
Signed-off-by: Warren Usui <warren.usui@inktank.com>
12 years agocalling mon destroy command after mds create
tamil [Tue, 16 Jul 2013 00:04:21 +0000 (17:04 -0700)]
calling mon destroy command after mds create

Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
12 years agoceph_manager: drop -t arg prefix for pg dump_stuck
Sage Weil [Sat, 13 Jul 2013 21:07:28 +0000 (14:07 -0700)]
ceph_manager: drop -t arg prefix for pg dump_stuck

This is no longer needed, and ugly to support.

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoRemove btrfs-tools and xfsprogs install step.
Sandon Van Ness [Sat, 13 Jul 2013 03:56:07 +0000 (20:56 -0700)]
Remove btrfs-tools and xfsprogs install step.

We already install btrfs-tools and xfsprogs with ceph-qa-chef
Doing it here was just causing problems on non-ubuntu
distros and I really see no point for it to have it now.

12 years agoAllow Overrides for ceph-deploy task
Sandon Van Ness [Sat, 13 Jul 2013 03:54:23 +0000 (20:54 -0700)]
Allow Overrides for ceph-deploy task

This is needed so we can set the ceph branch for ceph-deploy
to use via the main yaml which is created via the suite
scheduler.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
12 years agoWorkaround repopriority of yum local repo.
Sandon Van Ness [Sat, 13 Jul 2013 03:24:19 +0000 (20:24 -0700)]
Workaround repopriority of yum local repo.

Because of issues with package name differences vps are
setup to use repopriority and our local repo (which has
some ceph/librados stuff in it) gets high priority so
the ceph.repo that is created on the machine from
ceph-release basically gets ignored. This change makes
it so ceph.repo is the same priority level as our local
repo.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
12 years agoDon't install kernels on non-ubuntu VPS
Sandon Van Ness [Sat, 13 Jul 2013 03:20:45 +0000 (20:20 -0700)]
Don't install kernels on non-ubuntu VPS

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
12 years agoRe-create guest if it doesn't come up right.
Sandon Van Ness [Sat, 13 Jul 2013 03:19:51 +0000 (20:19 -0700)]
Re-create guest if it doesn't come up right.

In some rare cases (mainly centos/rhel after creating the
guest with downburst it does not come up right. It
gets a kernel panic at boot. Usually just turning it off
and then back on again is enough but to be on the safe
side I figured it should be re-created instead. This
insures you don't get hung jobs from a guest that didn't
come up correctly.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
12 years agoAdd description option to lock.lock()
Sandon Van Ness [Sat, 13 Jul 2013 03:16:28 +0000 (20:16 -0700)]
Add description option to lock.lock()

For some reason lock_many() has a description but lock()
does not. This was useful in my testing of unlocking and
re-locking VPS machines to destroy.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
12 years agoUse ceph.com mirror instea of github for ceph-qa-chef on VPS.
Sandon Van Ness [Sat, 13 Jul 2013 03:13:47 +0000 (20:13 -0700)]
Use ceph.com mirror instea of github for ceph-qa-chef on VPS.

Github.com is unreliable so use the ceph mirror instead.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
12 years agoList IP address in orchestra.run output.
Sandon Van Ness [Sat, 13 Jul 2013 03:10:36 +0000 (20:10 -0700)]
List IP address in orchestra.run output.

Figuring out which machines output is coming from when things
are being executed on multiple machines can be a huge pain.
This prints the IP in the logs so you can easily see where one
machine stops and another begins.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
12 years agoFix Missed parenthesis.
Sandon Van Ness [Sat, 13 Jul 2013 03:08:06 +0000 (20:08 -0700)]
Fix Missed parenthesis.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
12 years agoVM: Use mac addresses from DB instead of randomizing.
Sandon Van Ness [Mon, 8 Jul 2013 23:54:22 +0000 (16:54 -0700)]
VM: Use mac addresses from DB instead of randomizing.

In order to make IP addresses less likely to change and to allow
a smaller DHCP pool to be used I generated static MAC addresses
for all the vpm entries in the DB. I also put the correct entries
for all the other types of machines as well for their primary
(eth0) mac address as well in order to keep things standardized
and so there is another location where we have this information.

Without this fix going through a few tests would exhaust the DHCP
pool which at the time was around 460 IP addresses for virtual
machines and has since been upped to ~690 IP addresses.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
Reviewed-by: Warren Usui <warren.usui@inktank.com>
12 years agoceph.conf: enable old message assert
Sage Weil [Fri, 12 Jul 2013 22:18:50 +0000 (15:18 -0700)]
ceph.conf: enable old message assert

If this triggers, the RECONNECT_SEQ feature is broken (and
maybe we've caught #5517).

12 years agorgw: use different daemon name and apache module paths
Josh Durgin [Wed, 10 Jul 2013 01:52:26 +0000 (18:52 -0700)]
rgw: use different daemon name and apache module paths

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
12 years agomisc: move system type checking to a generic location
Josh Durgin [Wed, 10 Jul 2013 01:52:00 +0000 (18:52 -0700)]
misc: move system type checking to a generic location

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
12 years agos3tests: fix client configurations that aren't dictionaries
Josh Durgin [Wed, 10 Jul 2013 01:50:52 +0000 (18:50 -0700)]
s3tests: fix client configurations that aren't dictionaries

They're always used as dictionaries later on.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
12 years agoAdd mon create and destroy with an optional argument mon_initial_members
tamil [Tue, 9 Jul 2013 18:12:29 +0000 (11:12 -0700)]
Add mon create and destroy with an optional argument mon_initial_members

Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
12 years agolock: fix typo
Sage Weil [Tue, 9 Jul 2013 05:22:22 +0000 (22:22 -0700)]
lock: fix typo

12 years agoVM: Use mac addresses from DB instead of randomizing.
Sandon Van Ness [Mon, 8 Jul 2013 23:54:22 +0000 (16:54 -0700)]
VM: Use mac addresses from DB instead of randomizing.

In order to make IP addresses less likely to change and to allow
a smaller DHCP pool to be used I generated static MAC addresses
for all the vpm entries in the DB. I also put the correct entries
for all the other types of machines as well for their primary
(eth0) mac address as well in order to keep things standardized
and so there is another location where we have this information.

Without this fix going through a few tests would exhaust the DHCP
pool which at the time was around 460 IP addresses for virtual
machines and has since been upped to ~690 IP addresses.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
Reviewed-by: Warren Usui <warren.usui@inktank.com>
12 years agoMerge pull request #17 from ceph/wip-mon-thrash
Sage Weil [Mon, 8 Jul 2013 17:40:27 +0000 (10:40 -0700)]
Merge pull request #17 from ceph/wip-mon-thrash

mon thrash improvements

Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
12 years agomon_thrasher: add pause/unpause of mons to thrashing
Sage Weil [Sat, 6 Jul 2013 01:04:40 +0000 (18:04 -0700)]
mon_thrasher: add pause/unpause of mons to thrashing

This adds an additional element of laggyness to the cluster which should
cause mons to call new elections.

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agodaemon-helper: send arbitrary signals via stdin
Sage Weil [Sat, 6 Jul 2013 01:01:57 +0000 (18:01 -0700)]
daemon-helper: send arbitrary signals via stdin

Each byte written to stdin will be interpreted as a signal.

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agomon_thrash: optionally scrub after each iteration (default true)
Sage Weil [Fri, 5 Jul 2013 21:23:56 +0000 (14:23 -0700)]
mon_thrash: optionally scrub after each iteration (default true)

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agomon_thrash: fix more naming
Sage Weil [Fri, 5 Jul 2013 21:23:37 +0000 (14:23 -0700)]
mon_thrash: fix more naming

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agomon_thrash: use _ instead of - consistently
Sage Weil [Fri, 5 Jul 2013 17:30:25 +0000 (10:30 -0700)]
mon_thrash: use _ instead of - consistently

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoFix VM issues.
Sandon Van Ness [Thu, 4 Jul 2013 02:07:35 +0000 (19:07 -0700)]
Fix VM issues.

Fix of #5494 although bad description. Instead of adding a wait
the code used to detect if the guest was back up is fixed. The
previous code appeared to assume only one machine and broke
when it was waiting for multiple machines if the guests did not
come up within 10 seconds of each other

Make nuke not do the normal stuff if the machine is a VPS as we
just destroy them when they get unlocked.

Instead of getting downburst options from ~/.teuthology.yaml get
it from the yaml given to teuthology for the test/task instead.

Fixed an error that would make all the default downburst values
not take effect if any of them were set via a yaml.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
Reviewed-by: Warren Usui <warren.usui@inktank.com>
12 years agoceph: don't check leaks on client.* (i.e., radosgw)
Sage Weil [Wed, 3 Jul 2013 16:59:21 +0000 (09:59 -0700)]
ceph: don't check leaks on client.* (i.e., radosgw)

...until we fix them.  This way we can see other valgrind issues.

12 years agoradosgw-admin: add missing quote
Sage Weil [Mon, 1 Jul 2013 21:21:55 +0000 (14:21 -0700)]
radosgw-admin: add missing quote

12 years agoradosgw-admin: test 'bucket list' command (all buckets)
Sage Weil [Mon, 1 Jul 2013 21:21:48 +0000 (14:21 -0700)]
radosgw-admin: test 'bucket list' command (all buckets)

Verifies fix for #5455
Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoUpdate keys if they have changed before locking
Sandon Van Ness [Thu, 27 Jun 2013 21:08:09 +0000 (14:08 -0700)]
Update keys if they have changed before locking

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
12 years agoceph: disable logrotate
Sage Weil [Thu, 27 Jun 2013 00:48:03 +0000 (17:48 -0700)]
ceph: disable logrotate

This screwed up the log archival step at the end, and generally makes a
mess of automated runs.

Fixe: #5451

12 years agodump_stuck: fix test
Sage Weil [Tue, 25 Jun 2013 19:45:22 +0000 (12:45 -0700)]
dump_stuck: fix test

The mon-osd-report-timeout setting shouldn't be there!  We will set the
other item explicitly, and remove both from the suite yaml.

Fixes: #5440
12 years agoMerge pull request #15 from ceph/wip-ulimits
Sage Weil [Mon, 24 Jun 2013 23:18:36 +0000 (16:18 -0700)]
Merge pull request #15 from ceph/wip-ulimits

Reviewed-by: Warren Usui <warren.usui@inktank.com>
12 years agoMerge pull request #16 from ceph/wip-5431
Sage Weil [Mon, 24 Jun 2013 18:01:48 +0000 (11:01 -0700)]
Merge pull request #16 from ceph/wip-5431

Reviewed-by: Warren Usui <warren.usui@inktank.com>
12 years agorados: fix multiclient tests
Sage Weil [Mon, 24 Jun 2013 03:44:38 +0000 (20:44 -0700)]
rados: fix multiclient tests

Each client (not run) gets its own pool!

Signed-off-by: Sage Weil <sage@inktank.com>