git.apps.os.sepia.ceph.com Git - ceph.git/log
ceph.git
11 years agomake sure we are using a dict
Alfredo Deza [Mon, 27 Jan 2014 21:06:10 +0000 (16:06 -0500)]
make sure we are using a dict

Signed-off-by: Alfredo Deza <alfredo.deza@inktank.com>
(cherry picked from commit fe4c97dc4f5f893df526f8694d4b9fafb064f9d3)

11 years agoAttempt to fix #7241
Zack Cerza [Tue, 28 Jan 2014 16:05:03 +0000 (10:05 -0600)]
Attempt to fix #7241

This involves moving everything in build_ceph_cluster() inside the try:
block, so that if an exception is raised, the cleanup in the finally:
block will actually be executed.
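
The pattern described above can be sketched as follows. This is a minimal illustration, not the actual build_ceph_cluster() code; `build_cluster`, `steps` and `cleanup` are hypothetical names introduced only for this example.

```python
def build_cluster(steps, cleanup):
    """Run each setup step and always run cleanup, even if a step raises.

    `steps` (a list of callables) and `cleanup` (a callable) are
    hypothetical stand-ins, not teuthology functions.
    """
    try:
        # All setup work lives inside the try block, so an exception raised
        # by any step still reaches the finally clause below.
        for step in steps:
            step()
    finally:
        # Runs whether the steps succeeded or raised.
        cleanup()
```

Work done before the try: statement is not covered by the finally: clause, which is why moving it inside matters.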

11 years agoFix PEP-8 linter complaints
Zack Cerza [Mon, 27 Jan 2014 19:02:30 +0000 (13:02 -0600)]
Fix PEP-8 linter complaints

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoMerge pull request #175 from ceph/wip-tcmalloc
Sage Weil [Tue, 28 Jan 2014 05:14:17 +0000 (21:14 -0800)]
Merge pull request #175 from ceph/wip-tcmalloc

valgrind: ignore tcmalloc uninitialized memory

11 years agoBe more verbose about log file locations
Zack Cerza [Mon, 27 Jan 2014 18:28:53 +0000 (12:28 -0600)]
Be more verbose about log file locations

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoSymlink worker logs into job archive dir
Zack Cerza [Fri, 24 Jan 2014 16:19:43 +0000 (10:19 -0600)]
Symlink worker logs into job archive dir

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoMerge pull request #177 from ceph/wip-7213
Alfredo Deza [Fri, 24 Jan 2014 13:54:37 +0000 (05:54 -0800)]
Merge pull request #177 from ceph/wip-7213

use create-initial for deploying monitors

11 years agouse create-initial for deploying monitors
Alfredo Deza [Thu, 23 Jan 2014 21:41:49 +0000 (16:41 -0500)]
use create-initial for deploying monitors

Signed-off-by: Alfredo Deza <alfredo.deza@inktank.com>
11 years agocall wait() on the teuthology-results Popen object
Zack Cerza [Fri, 17 Jan 2014 16:05:21 +0000 (10:05 -0600)]
call wait() on the teuthology-results Popen object

This ought to fix the issue where zombie teuthology-results processes
stick around.

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
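
A minimal sketch of the idea, assuming nothing about how teuthology actually launches teuthology-results; `run_and_reap` is a hypothetical helper name.

```python
import subprocess
import sys


def run_and_reap(args):
    """Start a child process and wait for it so it cannot linger as a zombie."""
    proc = subprocess.Popen(args)
    # wait() collects the child's exit status. Without it, a finished child
    # can stay in the process table as a zombie until the parent reaps it
    # or exits.
    return proc.wait()


if __name__ == "__main__":
    # Harmless stand-in for the real child command.
    print(run_and_reap([sys.executable, "-c", "pass"]))
```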
11 years agoMerge pull request #176 from ceph/wip-7164
Alfredo Deza [Thu, 16 Jan 2014 21:00:37 +0000 (13:00 -0800)]
Merge pull request #176 from ceph/wip-7164

lock.py: request only rsa keys from ssh-keyscan

11 years agolock.py: request only rsa keys from ssh-keyscan
Dan Mick [Thu, 16 Jan 2014 20:51:39 +0000 (12:51 -0800)]
lock.py: request only rsa keys from ssh-keyscan

New versions of ssh-keyscan return two keytypes if possible; this
breaks the comparison of "number of lines of output from keyscan"
to "number of hosts we request keys from".  Fix by asking for only
one type of key (as older ssh-keyscans did).

Fixes: #7164
Signed-off-by: Dan Mick <dan.mick@inktank.com>
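
The comparison described above might look roughly like this. It is a sketch under assumptions: `keyscan_command` and `all_hosts_answered` are hypothetical helpers, not the functions in lock.py; only the `-t rsa` option of ssh-keyscan is taken as given.

```python
def keyscan_command(hosts):
    """Build an ssh-keyscan command line that requests only RSA keys."""
    # -t rsa limits the scan to one key type, so each reachable host
    # contributes at most one line of output.
    return ["ssh-keyscan", "-t", "rsa"] + list(hosts)


def all_hosts_answered(hosts, keyscan_output):
    """Return True if the output holds exactly one key line per host."""
    lines = [
        line
        for line in keyscan_output.splitlines()
        if line.strip() and not line.startswith("#")
    ]
    return len(lines) == len(hosts)
```

If two key types came back per host, the line count would be twice the host count and the check would fail even though every host answered.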
11 years agoceph_manager: in test_map_discontinuity, delay killing osd for 20s
Samuel Just [Thu, 16 Jan 2014 17:57:42 +0000 (09:57 -0800)]
ceph_manager: in test_map_discontinuity, delay killing osd for 20s

This should give enough time for the replica to get the message
marking them clean.

Signed-off-by: Samuel Just <sam.just@inktank.com>
11 years agoLog a warning when killing long-running jobs.
Zack Cerza [Thu, 16 Jan 2014 16:53:53 +0000 (10:53 -0600)]
Log a warning when killing long-running jobs.

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoKill jobs that run for over 3 days (configurable)
Zack Cerza [Thu, 16 Jan 2014 16:38:39 +0000 (10:38 -0600)]
Kill jobs that run for over 3 days (configurable)

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoMerge pull request #174 from ceph/wip-7171
Sage Weil [Fri, 10 Jan 2014 19:08:31 +0000 (11:08 -0800)]
Merge pull request #174 from ceph/wip-7171

thrashosds: change min_in from 2 -> 3

11 years agovalgrind: ignore tcmalloc uninitialized memory
Sage Weil [Fri, 10 Jan 2014 19:02:06 +0000 (11:02 -0800)]
valgrind: ignore tcmalloc uninitialized memory

This is the main source of noise when running valgrind +
tcmalloc.  Apparently there are other issues, so I think we
still need the notcmalloc gitbuilder, but this gets us part of
the way.

Signed-off-by: Sage Weil <sage@inktank.com>
11 years agothrashosds: change min_in from 2 -> 3
Sage Weil [Fri, 10 Jan 2014 19:00:55 +0000 (11:00 -0800)]
thrashosds: change min_in from 2 -> 3

See #7171. In rare cases CRUSH can't handle it when only 2/6 of
the OSDs are marked in.  Avoid those situations for now.

Signed-off-by: Sage Weil <sage@inktank.com>
11 years agoMerge pull request #169 from ceph/wip-utsrelease
Zack Cerza [Fri, 10 Jan 2014 17:22:40 +0000 (09:22 -0800)]
Merge pull request #169 from ceph/wip-utsrelease

kernel: use utsrelease string for need_to_install() purposes

11 years agokernel: use utsrelease string for need_to_install() purposes
Ilya Dryomov [Fri, 10 Jan 2014 10:26:09 +0000 (12:26 +0200)]
kernel: use utsrelease string for need_to_install() purposes

Currently, to see if a node has rebooted into the right kernel,
need_to_install() compares a given 40-char commit hash with a 7-char
commit hash abbreviation it pulls from the output of 'uname -r'.

gitbuilders can now export UTS_RELEASE kernel version string through
.../$SHA1/version file.  Use this string instead of the 40-char commit
hash and compare it with the output of 'uname -r' directly.  This saves
us the parsing exercise and, more importantly, makes it possible to
install clean tagged kernels using 'tag:' element, which wasn't
possible before because version string of such kernels doesn't have
a commit hash in it.

If the version file is unavailable, fall back to the existing way of
doing things.

Signed-off-by: Ilya Dryomov <ilya.dryomov@inktank.com>
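
The decision logic described above can be sketched as follows. This is an illustration only: the signature and parameter names are assumptions, and the fallback is simplified to a substring check for the 7-char abbreviation rather than the real parsing of 'uname -r'.

```python
def need_to_install(running_release, wanted_sha1, wanted_utsrelease=None):
    """Decide whether a node still needs the requested kernel.

    running_release   -- output of 'uname -r' on the node
    wanted_sha1       -- 40-char commit hash of the requested kernel
    wanted_utsrelease -- contents of the gitbuilder version file, or None
                         if that file is unavailable (hypothetical parameter)
    """
    if wanted_utsrelease is not None:
        # Preferred path: compare the full version strings directly. This
        # also works for tagged kernels whose version has no commit hash.
        return running_release.strip() != wanted_utsrelease.strip()
    # Fallback: look for the 7-char hash abbreviation in 'uname -r'.
    return wanted_sha1[:7] not in running_release
```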
11 years agoMerge pull request #173 from ceph/wip-ceph-deploy
Alfredo Deza [Thu, 9 Jan 2014 13:43:04 +0000 (05:43 -0800)]
Merge pull request #173 from ceph/wip-ceph-deploy

fix ceph-deploy tests to use 2x replication

11 years agoschedule_suite: 2x replication for ceph-deploy
Sage Weil [Thu, 9 Jan 2014 01:35:10 +0000 (17:35 -0800)]
schedule_suite: 2x replication for ceph-deploy

This avoids having to update all of the tests.

Signed-off-by: Sage Weil <sage@inktank.com>
11 years agoMerge pull request #172 from ceph/wip-teuthvmcon-wusui
Zack Cerza [Tue, 7 Jan 2014 23:09:39 +0000 (15:09 -0800)]
Merge pull request #172 from ceph/wip-teuthvmcon-wusui

Fix a bug where ctx.config['targets'] was looped through again

11 years agoFix a bug where ctx.config['targets'] was looped through again
Warren Usui [Tue, 7 Jan 2014 22:22:57 +0000 (14:22 -0800)]
Fix a bug where ctx.config['targets'] was looped through again
in connect().  The bug caused vm behavior to happen for a
target if any of the machines in the cluster was a vm.  The code
was also changed to set the key to none only if rsa or dsa keys
were used on a vm.

Fixes: 7113
Signed-off-by: Warren Usui <warren.usui@inktank.com>
11 years agoerror after 15 minutes of waiting for gatherkeys
Alfredo Deza [Mon, 6 Jan 2014 18:50:35 +0000 (13:50 -0500)]
error after 15 minutes of waiting for gatherkeys

Signed-off-by: Alfredo Deza <alfredo.deza@inktank.com>
(cherry picked from commit ad18717505c407f2c8bb995af8fd516b61957661)

11 years agoEnable reporting of entire runs as dead
Zack Cerza [Mon, 6 Jan 2014 16:56:26 +0000 (10:56 -0600)]
Enable reporting of entire runs as dead

11 years agoRe-raise exceptions caught in the watchdog
Zack Cerza [Fri, 3 Jan 2014 21:45:18 +0000 (15:45 -0600)]
Re-raise exceptions caught in the watchdog

11 years agoUse response.text if response.json is None
Zack Cerza [Fri, 3 Jan 2014 21:08:45 +0000 (15:08 -0600)]
Use response.text if response.json is None

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoStrip stdout lines
Zack Cerza [Fri, 3 Jan 2014 21:01:31 +0000 (15:01 -0600)]
Strip stdout lines

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoCatch and log unhandled exceptions in the watchdog
Zack Cerza [Fri, 3 Jan 2014 20:56:46 +0000 (14:56 -0600)]
Catch and log unhandled exceptions in the watchdog

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoAdd 'emperor' to list of branches with reporting
Zack Cerza [Fri, 3 Jan 2014 20:45:25 +0000 (14:45 -0600)]
Add 'emperor' to list of branches with reporting

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoWork around a change in pip 1.5 regarding wheels
Zack Cerza [Fri, 3 Jan 2014 18:41:11 +0000 (12:41 -0600)]
Work around a change in pip 1.5 regarding wheels

The error message was "pip's wheel support requires setuptools >= 0.8
for dist-info support."

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoBe safer when calling ./bootstrap
Zack Cerza [Fri, 3 Jan 2014 17:55:13 +0000 (11:55 -0600)]
Be safer when calling ./bootstrap

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoUse CentOS Gitbuilder sha1 instead of Fedora for non-ubuntu.
Sandon Van Ness [Fri, 3 Jan 2014 02:30:08 +0000 (18:30 -0800)]
Use CentOS Gitbuilder sha1 instead of Fedora for non-ubuntu.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
11 years agobreak out of the while loop after 15 minutes
Alfredo Deza [Fri, 13 Dec 2013 19:46:29 +0000 (14:46 -0500)]
break out of the while loop after 15 minutes

Signed-off-by: Alfredo Deza <alfredo.deza@inktank.com>
(cherry picked from commit bef6eb74dcaa37b70b1eab4d28bfa10abb0049d0)
Signed-off-by: Zack Cerza <zack@cerza.org>
11 years agoSleep once outside of the watchdog loop
Zack Cerza [Tue, 31 Dec 2013 20:25:05 +0000 (14:25 -0600)]
Sleep once outside of the watchdog loop

Hopefully this will prevent the double-posting of jobs.

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoMerge pull request #168 from ktdreyer/readme-formatting
Alfredo Deza [Tue, 31 Dec 2013 13:53:51 +0000 (05:53 -0800)]
Merge pull request #168 from ktdreyer/readme-formatting

format bullets in README

11 years agoformat bullets in README
Ken Dreyer [Tue, 31 Dec 2013 02:42:39 +0000 (19:42 -0700)]
format bullets in README

11 years agoSet the content-type in report_job()
Zack Cerza [Mon, 30 Dec 2013 22:20:52 +0000 (16:20 -0600)]
Set the content-type in report_job()

11 years agoSplit out ResultsSerializer.job_info()
Zack Cerza [Mon, 30 Dec 2013 16:05:16 +0000 (10:05 -0600)]
Split out ResultsSerializer.job_info()

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoPort from httplib2 to requests module
Zack Cerza [Mon, 16 Dec 2013 17:39:49 +0000 (11:39 -0600)]
Port from httplib2 to requests module

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agovalgrind.supp: ignore libnss3 leaks
Sage Weil [Sun, 22 Dec 2013 06:21:49 +0000 (22:21 -0800)]
valgrind.supp: ignore libnss3 leaks

These just started popping up when I updated the notcmalloc gitbuilder, probably
because of an updated libnss version.   Whitelist it!

Signed-off-by: Sage Weil <sage@inktank.com>
11 years agorbd: bump the default scratch size for xfstests to 10G
Ilya Dryomov [Mon, 23 Dec 2013 17:54:11 +0000 (19:54 +0200)]
rbd: bump the default scratch size for xfstests to 10G

autobuild-ceph.git commit 53db7a34aba5 had silently changed the default
elevator from cfq to deadline, which made xfstests 167 very unhappy.
It looks like with deadline and noop elevators it requires a ~6G
scratch partition.  Bump the default scratch image size to 10G.

Signed-off-by: Ilya Dryomov <ilya.dryomov@inktank.com>
11 years agoRevert "valgrind.supp: ignore libnss3 leaks"
Sage Weil [Sun, 22 Dec 2013 17:50:12 +0000 (09:50 -0800)]
Revert "valgrind.supp: ignore libnss3 leaks"

This reverts commit 572dc88a7cc295cb06354e6f004f7ad665b101f4.

This didn't occur on next; I think there may be a real leak on the ceph
side.

11 years agovalgrind.supp: ignore libnss3 leaks
Sage Weil [Sun, 22 Dec 2013 06:21:49 +0000 (22:21 -0800)]
valgrind.supp: ignore libnss3 leaks

These just started popping up.  Probably because I gave the
gitbuilders a kick?

Signed-off-by: Sage Weil <sage@inktank.com>
11 years agoMerge pull request #166 from ceph/wip-lockspell-wusui
SandonV [Fri, 20 Dec 2013 20:48:42 +0000 (12:48 -0800)]
Merge pull request #166 from ceph/wip-lockspell-wusui

Fix spelling error in comment.

11 years agoFix spelling error in teuthology/task/locktest.py comment
Warren Usui [Fri, 20 Dec 2013 20:31:24 +0000 (12:31 -0800)]
Fix spelling error in teuthology/task/locktest.py comment

11 years agoAdd ability to mark jobs as 'dead'
Zack Cerza [Fri, 20 Dec 2013 15:52:12 +0000 (09:52 -0600)]
Add ability to mark jobs as 'dead'

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoAllow passing multiple job_ids
Zack Cerza [Thu, 19 Dec 2013 22:43:11 +0000 (16:43 -0600)]
Allow passing multiple job_ids

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoImplement single-job killing
Zack Cerza [Thu, 19 Dec 2013 22:12:56 +0000 (16:12 -0600)]
Implement single-job killing

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoFor teuthology-kill, s/suite/run/
Zack Cerza [Thu, 19 Dec 2013 21:39:15 +0000 (15:39 -0600)]
For teuthology-kill, s/suite/run/

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoMerge pull request #165 from ceph/wip-7042-fix-wusui
SandonV [Thu, 19 Dec 2013 22:27:16 +0000 (14:27 -0800)]
Merge pull request #165 from ceph/wip-7042-fix-wusui

Do not run local handling fix if local parameter is not found.

11 years agoDo not run local handling fix if local parameter is not found.
Warren Usui [Thu, 19 Dec 2013 22:20:12 +0000 (14:20 -0800)]
Do not run local handling fix if local parameter is not found.
Fixes: 7042
Signed-off-by: Warren Usui <warren.usui@inktank.com>
11 years agoMerge pull request #156 from ceph/teuthology-doc-hadoop-wusui
Zack Cerza [Thu, 19 Dec 2013 17:27:14 +0000 (09:27 -0800)]
Merge pull request #156 from ceph/teuthology-doc-hadoop-wusui

Added docstrings.  Cleaned up code (broke up long lines, removed unused

11 years agoMerge pull request #164 from ceph/wip-rados
Zack Cerza [Thu, 19 Dec 2013 17:24:21 +0000 (09:24 -0800)]
Merge pull request #164 from ceph/wip-rados

rados: add in more (optional) op types

11 years agoMerge pull request #160 from ceph/wip-fix-5149-wusui
Zack Cerza [Thu, 19 Dec 2013 17:23:36 +0000 (09:23 -0800)]
Merge pull request #160 from ceph/wip-fix-5149-wusui

Added handling of a 'local' option inside install.py which specifies

11 years agoLog calls to teuthology-report more verbosely
Zack Cerza [Thu, 19 Dec 2013 16:25:51 +0000 (10:25 -0600)]
Log calls to teuthology-report more verbosely

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoCatch every exception here, for now.
Zack Cerza [Tue, 17 Dec 2013 17:02:30 +0000 (11:02 -0600)]
Catch every exception here, for now.

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoUse saucy gitbuilder for arm package checking.
Sandon Van Ness [Wed, 18 Dec 2013 20:38:50 +0000 (12:38 -0800)]
Use saucy gitbuilder for arm package checking.

Somehow missed that it checks both the sha1 and the package version
file; the package version was still using the quantal gitbuilder, which
won't work while the hardware is down.

This was causing scheduling failures.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
11 years agorados: add in more (optional) op types
Sage Weil [Wed, 18 Dec 2013 19:41:58 +0000 (11:41 -0800)]
rados: add in more (optional) op types

Signed-off-by: Sage Weil <sage@inktank.com>
11 years agoUse shell=True to call teuthology-report
Zack Cerza [Mon, 16 Dec 2013 20:22:22 +0000 (14:22 -0600)]
Use shell=True to call teuthology-report

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoCatch OSError if script isn't in $PATH
Zack Cerza [Mon, 16 Dec 2013 19:34:37 +0000 (13:34 -0600)]
Catch OSError if script isn't in $PATH

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoRevert "Use path when calling teuthology-report. …"
Zack Cerza [Mon, 16 Dec 2013 17:43:06 +0000 (11:43 -0600)]
Revert "Use path when calling teuthology-report. …"

This reverts commit e4b5ab811e954a5b134d413aeb338805b5e3441d.

11 years agoUse path when calling teuthology-report. …
Sandon Van Ness [Sat, 14 Dec 2013 15:14:51 +0000 (07:14 -0800)]
Use path when calling teuthology-report. …

The 'teuthology-report' command is probably not going to exist
in $PATH, so get the location of the running command and assume it is
in the same path.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
11 years agoMerge pull request #162 from jcsp/fsid-conf
Zack Cerza [Fri, 13 Dec 2013 17:25:30 +0000 (09:25 -0800)]
Merge pull request #162 from jcsp/fsid-conf

Fix FSID not being set in ceph.conf

11 years agoMerge pull request #161 from jcsp/ssh-config
Zack Cerza [Fri, 13 Dec 2013 17:24:23 +0000 (09:24 -0800)]
Merge pull request #161 from jcsp/ssh-config

Respect .ssh/config when opening SSH connections

11 years agoSkip the 'dead' report on old branches
Zack Cerza [Fri, 13 Dec 2013 15:56:23 +0000 (09:56 -0600)]
Skip the 'dead' report on old branches

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoUse saucy gitbuilder when grabbing sha1 for arm.
Sandon Van Ness [Fri, 13 Dec 2013 00:04:38 +0000 (16:04 -0800)]
Use saucy gitbuilder when grabbing sha1 for arm.

Old quantal gitbuilders are gone until hardware comes back. Use
the new saucy gitbuilders instead.

11 years agoMake sure to report all results.
Zack Cerza [Thu, 12 Dec 2013 23:33:53 +0000 (17:33 -0600)]
Make sure to report all results.

If a just-finished job was using a teuthology branch not known to
contain the reporting feature, then report the job via the
teuthology-report script. Note that in some cases this will result in
double reporting but the extra load should be negligible.

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoEnable reporting of single jobs
Zack Cerza [Thu, 12 Dec 2013 22:54:56 +0000 (16:54 -0600)]
Enable reporting of single jobs

(also switch to docopt)

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoRemove the child's stderr completely
Zack Cerza [Thu, 12 Dec 2013 21:45:58 +0000 (15:45 -0600)]
Remove the child's stderr completely

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoFix FSID not being set in ceph.conf
John Spray [Thu, 12 Dec 2013 21:33:19 +0000 (13:33 -0800)]
Fix FSID not being set in ceph.conf

Symptom was that 'ceph --admin-daemon... config get fsid'
returned zeros, while correct fsid was present in cluster maps.
Fix it by populating FSID in ceph.conf, after extracting it from
monmap.

11 years agoWhen starting a job, tell paddles it's running
Zack Cerza [Thu, 12 Dec 2013 17:47:45 +0000 (11:47 -0600)]
When starting a job, tell paddles it's running

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoLonger timeout after sync/reboot.
Sandon Van Ness [Thu, 12 Dec 2013 02:07:43 +0000 (18:07 -0800)]
Longer timeout after sync/reboot.

With only a 5-second sleep via ssh and python, a race condition
sometimes occurred in which the machine was thought to be back up
before the reboot command had completed.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
11 years agoRespect .ssh/config when opening SSH connections
John Spray [Wed, 11 Dec 2013 21:08:51 +0000 (13:08 -0800)]
Respect .ssh/config when opening SSH connections

This handles the case where your private key is
in a non-default location that you're pointing
to in ~/.ssh/config.

11 years agoAdded handling of a 'local' option inside install.py which specifies
Warren Usui [Wed, 11 Dec 2013 07:45:38 +0000 (23:45 -0800)]
Added handling of a 'local' option inside install.py which specifies
a local directory containing deb or rpm files to be installed.

Fixes: 5149
Signed-off-by: Warren Usui <warren.usui@inktank.com>
11 years agoUse continue, not break
Zack Cerza [Tue, 10 Dec 2013 22:47:35 +0000 (16:47 -0600)]
Use continue, not break

Fixes a bug where not all pids were being collected

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
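
The difference between the two statements can be illustrated with a small sketch. `collect_pids` and its `(pid, name)` input are hypothetical; the actual pid lookup code is not shown in this log.

```python
def collect_pids(processes, wanted_name):
    """Return the pids of every process whose name matches wanted_name.

    `processes` is an iterable of (pid, name) pairs, a made-up input
    format used only for this illustration.
    """
    pids = []
    for pid, name in processes:
        if name != wanted_name:
            # continue skips only this entry. A break here would end the
            # loop and miss matching processes later in the list.
            continue
        pids.append(pid)
    return pids
```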
11 years agoTweak logic for pid lookup
Zack Cerza [Tue, 10 Dec 2013 22:35:05 +0000 (16:35 -0600)]
Tweak logic for pid lookup

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoFix indentation
Zack Cerza [Tue, 10 Dec 2013 22:25:28 +0000 (16:25 -0600)]
Fix indentation

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoDon't show child's stderr, but show archive path
Zack Cerza [Tue, 10 Dec 2013 19:19:56 +0000 (13:19 -0600)]
Don't show child's stderr, but show archive path

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoAdd debug statements
Zack Cerza [Tue, 10 Dec 2013 16:06:16 +0000 (10:06 -0600)]
Add debug statements

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoMerge pull request #159 from ceph/wip-cache
Zack Cerza [Tue, 10 Dec 2013 16:02:51 +0000 (08:02 -0800)]
Merge pull request #159 from ceph/wip-cache

rados: allow existing pool(s) to be used

11 years agorados: allow existing pool(s) to be used
Sage Weil [Tue, 10 Dec 2013 00:02:13 +0000 (16:02 -0800)]
rados: allow existing pool(s) to be used

Signed-off-by: Sage Weil <sage@inktank.com>
11 years agoceph.conf: put 2x command in [global]
Sage Weil [Mon, 9 Dec 2013 23:37:58 +0000 (15:37 -0800)]
ceph.conf: put 2x command in [global]

so that osdmaptool sees it.

Signed-off-by: Sage Weil <sage@inktank.com>
11 years agoCreate a DateTime object from the timestamp
Zack Cerza [Mon, 9 Dec 2013 22:57:11 +0000 (16:57 -0600)]
Create a DateTime object from the timestamp

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoMake -a optional
Zack Cerza [Mon, 9 Dec 2013 22:40:27 +0000 (16:40 -0600)]
Make -a optional

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
11 years agoAdd missing req: psutil
Zack Cerza [Mon, 9 Dec 2013 22:32:45 +0000 (16:32 -0600)]
Add missing req: psutil

11 years agoMerge pull request #151 from ceph/wip-distro-kernel
Zack Cerza [Mon, 9 Dec 2013 21:16:33 +0000 (13:16 -0800)]
Merge pull request #151 from ceph/wip-distro-kernel

Wip distro kernel

11 years agoAuto-restart
Zack Cerza [Mon, 9 Dec 2013 20:56:49 +0000 (14:56 -0600)]
Auto-restart

If /tmp/teuthology-restart-workers is newer than the running process,
restart.

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
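
A sketch of the check described above, assuming the comparison is between the sentinel file's modification time and the process start time. `restart_requested` and its parameters are hypothetical names.

```python
import os


def restart_requested(sentinel_path, process_start_time):
    """Return True if the sentinel file is newer than the running process.

    process_start_time is a Unix timestamp (seconds since the epoch).
    """
    try:
        return os.path.getmtime(sentinel_path) > process_start_time
    except OSError:
        # A missing or unreadable sentinel means no restart was requested.
        return False
```

With this kind of check, touching /tmp/teuthology-restart-workers is enough to ask running workers to restart.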
11 years agoMerge pull request #158 from ceph/wip-nuke
Zack Cerza [Mon, 9 Dec 2013 21:01:03 +0000 (13:01 -0800)]
Merge pull request #158 from ceph/wip-nuke

make nuke behave

11 years agonuke: ignore exceptions while issuing reboot command
Sage Weil [Mon, 9 Dec 2013 19:42:12 +0000 (11:42 -0800)]
nuke: ignore exceptions while issuing reboot command

I'm seeing failed tasks (and nuke) leak machines.  It looks like we are
getting an exception on the '... reboot -f -n' command when we should be
ignoring it and waiting for the machine to restart.

For example:
   http://qa-proxy.ceph.com/teuthology/sage-2013-12-08_19:25:06-rados:thrash-wip-tier-foo-basic-plana/136321/teuthology.log

Signed-off-by: Sage Weil <sage@inktank.com>
11 years agoRemove unused variable.
Sandon Van Ness [Mon, 9 Dec 2013 19:42:06 +0000 (11:42 -0800)]
Remove unused variable.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
11 years agoAdded additional comments.
Sandon Van Ness [Mon, 9 Dec 2013 19:35:23 +0000 (11:35 -0800)]
Added additional comments.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
11 years agoceph.conf: default to 2x
Sage Weil [Sat, 7 Dec 2013 21:20:58 +0000 (13:20 -0800)]
ceph.conf: default to 2x

A bunch of our tests rely on this; they need to be fixed
before we can run at 3x.

Signed-off-by: Sage Weil <sage@inktank.com>
11 years agonuke: fix sync before reboot timeout
Sage Weil [Sat, 7 Dec 2013 01:42:23 +0000 (17:42 -0800)]
nuke: fix sync before reboot timeout

If you do 'timeout 5 sync' and sync hangs, timeout will block trying to
kill it.

Instead, just background sync, wait a few seconds, and reboot.  This means
we wait a few seconds even if sync returns immediately, but who cares!

Signed-off-by: Sage Weil <sage@inktank.com>
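
The approach described above amounts to a shell command of roughly the following shape. This is a sketch: `reboot_command` is a hypothetical helper, the 5-second default is an assumption, and the exact command line used by nuke is not shown in this log (only the 'reboot -f -n' part appears in a nearby commit message).

```python
def reboot_command(grace_seconds=5):
    """Build a shell command that backgrounds sync, waits, then reboots."""
    # 'sync &' runs sync in the background, so a hung sync cannot block the
    # rest of the command. The sleep gives a healthy sync time to finish.
    return "sync & sleep {0}; reboot -f -n".format(grace_seconds)
```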
11 years agoMerge pull request #157 from ceph/wip-watchdog
Alfredo Deza [Fri, 6 Dec 2013 14:18:14 +0000 (06:18 -0800)]
Merge pull request #157 from ceph/wip-watchdog

Implement a watchdog for queued jobs

11 years agoImplement a watchdog for queued jobs
Zack Cerza [Thu, 5 Dec 2013 23:37:25 +0000 (17:37 -0600)]
Implement a watchdog for queued jobs

This continually posts the run's status to the results server, if
configured, at an interval defaulting to 600 seconds.

Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
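
The behaviour described above can be sketched as a simple polling loop. The names `watchdog`, `is_running` and `post_status` are hypothetical, and the injectable `sleep` argument exists only to make the sketch testable; only the 600-second default comes from the commit message.

```python
import time


def watchdog(is_running, post_status, interval=600, sleep=time.sleep):
    """Post the run's status repeatedly while the run is still active.

    is_running  -- callable returning True while the run has jobs left
    post_status -- callable that sends the status to the results server
    interval    -- seconds between posts (600 by default)
    """
    while is_running():
        post_status()
        sleep(interval)
```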
11 years agoA create_if_vm call was made more than once when a lock-many style lock
Warren Usui [Thu, 5 Dec 2013 01:49:21 +0000 (17:49 -0800)]
A create_if_vm call was made more than once when a lock-many style lock
was performed.  This caused downburst to run twice, and the second
downburst fails as a result of the first downburst running.

Fixes: 6933
11 years agoMerge branch 'teuthology-fix-downburst-yaml-wusui'
Warren Usui [Thu, 5 Dec 2013 01:36:14 +0000 (17:36 -0800)]
Merge branch 'teuthology-fix-downburst-yaml-wusui'

11 years agoImplement --downburst-conf parameter for teuthology-lock.
Warren Usui [Mon, 2 Dec 2013 22:37:12 +0000 (14:37 -0800)]
Implement --downburst-conf parameter for teuthology-lock.
Load the appropriate yaml information when found (this formerly
did not work).  Make sure teuthology --lock works with a downburst
entry in the yaml files.  Document how this works in README.rst.

Fixes: #6921
Reviewed-by: Dan Mick
11 years agoAdded docstrings. Cleaned up code (broke up long lines, removed unused
Warren Usui [Wed, 4 Dec 2013 02:16:04 +0000 (18:16 -0800)]
Added docstrings.  Cleaned up code (broke up long lines, removed unused
variable references, pep8 formatted most of the code (one set of long lines
remains), and changed some variable and method names to conform to pylint
standards).

Fixes: 6530