]>
git.apps.os.sepia.ceph.com Git - teuthology.git/log
Loic Dachary [Tue, 21 Oct 2014 04:16:25 +0000 (21:16 -0700)]
display informative error message if schedule fails to deep_merge
Display the two yaml structures that fail to merge in case of an
exception. That's a useful hint to find out what's wrong.
Signed-off-by: Loic Dachary <loic-201408@dachary.org>
John Spray [Mon, 20 Oct 2014 18:45:14 +0000 (19:45 +0100)]
Merge pull request #342 from ceph/wip-5876
task/nfs: stop nfs server after test finishes
Reviewed-by: John Spray <john.spray@redhat.com>
Yan, Zheng [Wed, 15 Oct 2014 07:47:57 +0000 (15:47 +0800)]
task/nfs: stop nfs server after test finishes more thoroughly
In addition to "exportfs -au", we take a sledgehammer to it
by shutting down all the worker threads and flushing out
the client export table.
Signed-off-by: Yan, Zheng <zyan@redhat.com>
Signed-off-by: Greg Farnum <Greg@inktank.com>
Zack Cerza [Mon, 20 Oct 2014 15:58:25 +0000 (09:58 -0600)]
Better error handling in list_locks()
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Mon, 20 Oct 2014 14:33:42 +0000 (08:33 -0600)]
Fix linter errors
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Mon, 20 Oct 2014 14:31:24 +0000 (08:31 -0600)]
Merge pull request #343 from ceph/wip-dmick
Fixes for #9811 and #9812
Dan Mick [Sat, 18 Oct 2014 05:11:05 +0000 (22:11 -0700)]
task.ssh_keys(): manage authorized_keys, not keys2
See the issue for a description, but authorized_keys2 is dead
on RHEL7 and dying elsewhere.
Fixes: #9812
Signed-off-by: Dan Mick <dan.mick@inktank.com>
Dan Mick [Sat, 18 Oct 2014 05:06:51 +0000 (22:06 -0700)]
lock.lock_many(): use passed os_{type,version}, not ctx
Affects teuthology --lock with os_{type,version} in the yaml.
Fixes: #9811
Signed-off-by: Dan Mick <dan.mick@inktank.com>
Zack Cerza [Thu, 16 Oct 2014 19:57:29 +0000 (13:57 -0600)]
Instead of setting status='pass', set success=True
Keeping this old behavior will mean that any tasks that are not updated
to use set_status() and still simply set success=False will still be
able to properly report their failures.
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Thu, 16 Oct 2014 18:54:07 +0000 (12:54 -0600)]
Use new functions provided by job_status
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Thu, 16 Oct 2014 18:46:21 +0000 (12:46 -0600)]
Add new job_status module
Provides methods for getting and setting job status. This is necessary
because we used to just use a boolean called 'success'. In order to stop
reporting infrastructure errors as failures, we need more than just a
boolean.
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Wed, 15 Oct 2014 18:57:04 +0000 (12:57 -0600)]
Remove setting of unused 's3_branch' value
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Tue, 14 Oct 2014 15:38:41 +0000 (09:38 -0600)]
Stop obsuring stderr when installing with yum
Over time, the code that actually used the buffer we stored stderr in
was removed, leaving us holding error messages and never displaying
them.
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Mon, 13 Oct 2014 22:17:43 +0000 (16:17 -0600)]
Properly quote args that are strings
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Mon, 13 Oct 2014 20:37:18 +0000 (14:37 -0600)]
On SSH connection loss, mark jobs as 'dead'
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Tue, 7 Oct 2014 16:52:34 +0000 (10:52 -0600)]
Remove stray exit statement
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Tue, 7 Oct 2014 16:49:40 +0000 (10:49 -0600)]
Don't assume Ubuntu; also support MacOS X
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Tue, 7 Oct 2014 16:04:41 +0000 (10:04 -0600)]
Drop libmysqlclient-dev
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Mon, 6 Oct 2014 20:43:09 +0000 (14:43 -0600)]
Use os.path.expanduser() instead of /home
... for locating downburst executable
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Fri, 3 Oct 2014 21:12:29 +0000 (15:12 -0600)]
Minor fix for vps-only locks
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Fri, 3 Oct 2014 18:15:16 +0000 (12:15 -0600)]
Merge pull request #339 from ceph/wip-9292-wusui
Cleaned up documentation.
Warren Usui [Fri, 3 Oct 2014 02:17:59 +0000 (19:17 -0700)]
Cleaned up documentation.
Removed some Inktank references, mentioned the split of tasks (between
teuthology/task and chef-qa-suite/tasks), fixed some wording, removed
chef reference that no longer applies.
Did not update the teuthology-suite section at the end of this file
because further documentation changes are coming.
Fixes: #9292
Signed-off-by: Warren Usui <warren.usui@inktank.com>
Zack Cerza [Wed, 1 Oct 2014 23:15:10 +0000 (17:15 -0600)]
Use arch in lock_machines()
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Wed, 1 Oct 2014 23:14:59 +0000 (17:14 -0600)]
Query arch in lock_many()
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Wed, 1 Oct 2014 22:55:55 +0000 (16:55 -0600)]
Use uname -m instead of uname -p
Debian hosts were reporting as 'unknown'
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Wed, 1 Oct 2014 20:58:07 +0000 (14:58 -0600)]
All BaseExceptions should hit failure codepath
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Tue, 30 Sep 2014 19:36:36 +0000 (13:36 -0600)]
Move VersionNotFoundError to exceptions module
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Tue, 30 Sep 2014 19:35:13 +0000 (13:35 -0600)]
Move ScheduleFailError to exceptions module
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Tue, 30 Sep 2014 15:13:46 +0000 (09:13 -0600)]
Use os_type and os_version in lock_machines()
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Mon, 29 Sep 2014 21:00:22 +0000 (15:00 -0600)]
Query os_type and os_version in lock_many()
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Mon, 29 Sep 2014 21:00:06 +0000 (15:00 -0600)]
Remove default for --os-type
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Mon, 29 Sep 2014 13:54:34 +0000 (07:54 -0600)]
Add OS.__repr__() and OS.to_dict()
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Mon, 29 Sep 2014 13:39:49 +0000 (07:39 -0600)]
Move OS class to its own files
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Greg Farnum [Mon, 29 Sep 2014 19:38:04 +0000 (12:38 -0700)]
tasks/nfs: sync before unmounting or unexporting nfs clients/servers
We have some hope that this will resolve #8576 in the Ceph tracker.
Signed-off-by: Greg Farnum <greg@inktank.com>
Reviewed-by: Sage Weil <sage@redhat.com>
Zack Cerza [Fri, 26 Sep 2014 20:46:00 +0000 (14:46 -0600)]
Bump requests version to >= 2.3.0
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Thu, 25 Sep 2014 16:12:10 +0000 (10:12 -0600)]
Allow mixing of bare-metal machine_types
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Fri, 26 Sep 2014 17:12:59 +0000 (11:12 -0600)]
Merge pull request #335 from ceph/wip-kernel-downgrade
Fix RPM kernel downgrade.
Greg Farnum [Thu, 25 Sep 2014 19:43:39 +0000 (12:43 -0700)]
nfs: more lsof output, on clients too
Signed-off-by: Greg Farnum <greg@inktank.com>
Zack Cerza [Thu, 25 Sep 2014 21:22:06 +0000 (15:22 -0600)]
Revert "Move default os_type to misc.get_distro()"
This reverts commit
3675a2e25c5d9a370922b714871492f31ed1783d .
Zack Cerza [Thu, 25 Sep 2014 21:22:00 +0000 (15:22 -0600)]
Revert "Add os_type and os_version args to lock_many()"
This reverts commit
f39b6958c4dd94f7e682833f4cf3122c4537dcde .
Zack Cerza [Thu, 25 Sep 2014 21:02:10 +0000 (15:02 -0600)]
Merge pull request #337 from ceph/wip-9597
lock.py: Log failing os-version if it doesn't validate
Dan Mick [Thu, 25 Sep 2014 20:56:45 +0000 (13:56 -0700)]
lock.py: Log failing os-version if it doesn't validate
Fixes: #9597
Signed-off-by: Dan Mick <dan.mick@inktank.com>
Zack Cerza [Thu, 25 Sep 2014 15:38:52 +0000 (09:38 -0600)]
Add 'lsb_release -a' parsing back to remote.OS
It can now parse either that or /etc/os-release, and remote.Remote will
attempt the latter first.
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Tue, 23 Sep 2014 15:55:22 +0000 (09:55 -0600)]
Add os_type and os_version args to lock_many()
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Tue, 23 Sep 2014 15:50:40 +0000 (09:50 -0600)]
Move default os_type to misc.get_distro()
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Wed, 24 Sep 2014 21:24:33 +0000 (15:24 -0600)]
Merge pull request #336 from ceph/wip-nfs
enable lsof when unmounting nfs clients
Greg Farnum [Wed, 24 Sep 2014 05:44:37 +0000 (22:44 -0700)]
enable lsof when unmounting nfs clients
Because the lsof interface is strange, use the verbose flag so we get
some output even if there is no use, and ignore errors because it will
return 1 if there's no filesystem use.
Signed-off-by: Greg Farnum <greg@inktank.com>
Sandon Van Ness [Fri, 19 Sep 2014 21:58:25 +0000 (14:58 -0700)]
Fix RPM kernel downgrade.
Force even if an older version or existing.
Signed-off-by: Sandon Van Ness <sandon@inktank.com>
Zack Cerza [Fri, 19 Sep 2014 17:19:27 +0000 (11:19 -0600)]
Merge pull request #334 from ceph/wip-9519-wusui
Handle list-json downburst failure.
Zack Cerza [Fri, 19 Sep 2014 16:12:59 +0000 (10:12 -0600)]
Refuse to schedule jobs with 'multi' machine_type
That's not a real machine_type - it's a beanstalkd tube that is used
when a machine_type like 'plana,mira,burnupi' is used.
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Fri, 19 Sep 2014 15:42:36 +0000 (09:42 -0600)]
Raise ConnectionLostError when the socket dies
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Warren Usui [Thu, 18 Sep 2014 20:29:18 +0000 (13:29 -0700)]
Handle list-json downburst failure.
Use check_output instead of Popen and communicate()
Fixes: #9519
Signed-off-by: Warren Usui <warren.usui@inktank.com>
Zack Cerza [Thu, 18 Sep 2014 19:43:22 +0000 (13:43 -0600)]
Improve os-release parsing
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Thu, 18 Sep 2014 18:24:18 +0000 (12:24 -0600)]
Rewrite teuthology-updatekeys
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Wed, 17 Sep 2014 21:29:27 +0000 (15:29 -0600)]
Skip hostname test
Due to the workaround put in place recently
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Wed, 17 Sep 2014 21:26:39 +0000 (15:26 -0600)]
Re-implement Distribution as OS
It now uses /etc/os-release instead of lsb_release, like the cool kids
are doing.
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Wed, 17 Sep 2014 16:46:23 +0000 (10:46 -0600)]
Merge pull request #328 from ceph/wip-8700-wusui
Use docopts to parse daemon-helper
Zack Cerza [Wed, 17 Sep 2014 16:41:56 +0000 (10:41 -0600)]
Merge pull request #333 from ceph/wip-9495-wusui
Run chef before operations that need it.
Zack Cerza [Wed, 17 Sep 2014 16:31:04 +0000 (10:31 -0600)]
Merge pull request #331 from ceph/wip-transport-none
orchestra: fix ConnectionLostError path in wait()
Zack Cerza [Wed, 17 Sep 2014 16:23:08 +0000 (10:23 -0600)]
Add a workaround for Remote.hostname
Since 'hostname --fqdn' isn't working reliably
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Warren Usui [Wed, 17 Sep 2014 03:23:42 +0000 (20:23 -0700)]
Run chef before operations that need it.
Chef, if it is needed, should be run before kernel tasks
and before all other user defined tasks. It should not be
run by default.
Fixes: 9495
Signed-off-by: Warren Usui <warren.usui@inktank.com>
Warren Usui [Mon, 15 Sep 2014 18:05:14 +0000 (11:05 -0700)]
Fix daemon-helper argument parsing.
Fixes: 8700
Signed-off-by: Warren Usui <warren.usui@inktank.com>
Zack Cerza [Tue, 16 Sep 2014 17:58:51 +0000 (11:58 -0600)]
Merge pull request #332 from ceph/wip-yaml-path
teuthology-suite: fix handling of extra YAML files
John Spray [Tue, 16 Sep 2014 13:46:32 +0000 (14:46 +0100)]
teuthology-suite: fix handling of extra YAML files
Previously these had to be absolute paths, or relative
paths had to be present in the pwd and in ceph-qa-suite.
With this change you can now pass a relative path
like 'debug/mds_client.yaml' to reference a debug
fragment from ceph-qa-suite.
Signed-off-by: John Spray <john.spray@redhat.com>
John Spray [Tue, 16 Sep 2014 13:08:41 +0000 (14:08 +0100)]
orchestra: fix ConnectionLostError path in wait()
Sometimes it would generate AttributeError instead
when it tried to call is_active() on None.
Signed-off-by: John Spray <john.spray@redhat.com>
Zack Cerza [Mon, 15 Sep 2014 16:57:59 +0000 (10:57 -0600)]
Merge pull request #326 from ceph/wip-9318-wusui
Avoid running chef automatically.
Zack Cerza [Mon, 15 Sep 2014 16:57:25 +0000 (10:57 -0600)]
Merge pull request #297 from ceph/wip-8712-wusui
Check os-type and os-version for vps
Zack Cerza [Fri, 12 Sep 2014 16:01:05 +0000 (10:01 -0600)]
Make push_inventory() a regular function
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Warren Usui [Fri, 12 Sep 2014 00:34:54 +0000 (17:34 -0700)]
Avoid running chef automatically
teuthology --lock formerly automatically ran chef on startup of
virtual machines. This covered up a bug where ceph-deploy would
not install properly. These changes remove the automatic running
of chef, and fixed some related issues when setting up keys on
virtual machines.
Fixes 9318:
Signed-off-by: Warren Usui <warren.usui@inktank.com>
Zack Cerza [Mon, 8 Sep 2014 19:13:54 +0000 (13:13 -0600)]
Use teuthology.task.internal.push_inventory
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Mon, 8 Sep 2014 19:13:31 +0000 (13:13 -0600)]
Add push_inventory() subtask
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Mon, 8 Sep 2014 19:12:52 +0000 (13:12 -0600)]
Add lock.update_inventory()
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Mon, 8 Sep 2014 15:57:42 +0000 (09:57 -0600)]
Add Remote.inventory_info
A property that returns a dict of attributes in the format that the
paddles lock server likes.
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Mon, 8 Sep 2014 15:55:22 +0000 (09:55 -0600)]
Add Remote.host_key
Turns out we can easily grab the ssh key from paramiko once a connection
is established.
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Mon, 8 Sep 2014 15:36:00 +0000 (09:36 -0600)]
Set Remote.hostname by asking the host itself
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Fri, 5 Sep 2014 15:32:33 +0000 (09:32 -0600)]
Add Remote.arch
Returns the result of 'uname -p'
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Thu, 5 Jun 2014 21:50:24 +0000 (16:50 -0500)]
Add caching lsb_release parser
In far too many places do we remotely exec lsb_release in some form,
then parse the output. In shifting to a more stable interface between
teuthology and its tasks, this seemed important to me.
Remote objects now have a 'distro' property that, when first accessed,
calls 'lsb_release -a' remotely and parses the results.
I've included tests and documentation.
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Warren Usui [Tue, 9 Sep 2014 03:12:24 +0000 (20:12 -0700)]
Check os-type and os-version for vps
Fixes: 8712
Signed-off-by: Warren Usui warren.usui@inktank.com
Alfredo Deza [Wed, 10 Sep 2014 12:44:58 +0000 (08:44 -0400)]
Merge pull request #327 from dachary/wip-exec-documentation
documentation: updates to the exec tasks
Loic Dachary [Wed, 10 Sep 2014 10:19:50 +0000 (12:19 +0200)]
documentation: updates to the exec tasks
* Quoting is important
* Behavior on error
Signed-off-by: Loic Dachary <loic-201408@dachary.org>
Zack Cerza [Tue, 9 Sep 2014 23:55:08 +0000 (17:55 -0600)]
In Remote.run(), attempt to reconnect if necessary
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Tue, 9 Sep 2014 22:30:37 +0000 (16:30 -0600)]
Move more exceptions
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Sage Weil [Tue, 9 Sep 2014 23:28:12 +0000 (16:28 -0700)]
valgrind: suppress tcmalloc crap for centos 6.5
Fixes: #9336
Signed-off-by: Sage Weil <sage@redhat.com>
Zack Cerza [Mon, 8 Sep 2014 20:53:07 +0000 (14:53 -0600)]
Remove unnecessary filter in do_update_keys()
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Mon, 8 Sep 2014 19:15:04 +0000 (13:15 -0600)]
Don't explode if get_status fails
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Tue, 2 Sep 2014 14:45:01 +0000 (08:45 -0600)]
Merge pull request #325 from ceph/wip-valgrind
valgrind: use --time-stamp=yes
Sage Weil [Sat, 30 Aug 2014 16:22:42 +0000 (09:22 -0700)]
valgrind: adjust suppressions for wheezy
Signed-off-by: Sage Weil <sage@redhat.com>
Sage Weil [Sat, 30 Aug 2014 01:58:09 +0000 (18:58 -0700)]
valgrind: use --time-stamp=yes
This doesn't help on the precise version, but maybe it will on
future versions... because if so, it would be really helpful
it matching valgrind errors to daemon logs!
In any case, though, I verified it is at worse harmless.
Signed-off-by: Sage Weil <sage@redhat.com>
Sandon Van Ness [Fri, 29 Aug 2014 17:37:30 +0000 (10:37 -0700)]
Update downburst docs.
Signed-off-by: Sandon Van Ness <sandon@inktank.com>
Zack Cerza [Fri, 29 Aug 2014 17:19:09 +0000 (11:19 -0600)]
Merge pull request #324 from ceph/wip-hadoop-nuke
nuke: kill hadoop processes
John Spray [Fri, 29 Aug 2014 16:40:48 +0000 (17:40 +0100)]
nuke: kill hadoop processes
Signed-off-by: John Spray <john.spray@redhat.com>
Zack Cerza [Fri, 29 Aug 2014 01:19:07 +0000 (19:19 -0600)]
Drop --allow-hosts None
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Fri, 29 Aug 2014 01:13:37 +0000 (19:13 -0600)]
Use setuptools, not distribute
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Sage Weil [Thu, 28 Aug 2014 22:53:19 +0000 (15:53 -0700)]
Merge pull request #323 from ceph/pypi
Prepare for PyPI
Reviewed-by: Sage Weil <sage@redhat.com>
Zack Cerza [Thu, 28 Aug 2014 22:46:28 +0000 (16:46 -0600)]
Add install_requires and tests_require
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Thu, 28 Aug 2014 22:44:33 +0000 (16:44 -0600)]
Add MANIFEST.in
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Thu, 28 Aug 2014 22:44:21 +0000 (16:44 -0600)]
Add LICENSE
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza [Thu, 28 Aug 2014 14:40:59 +0000 (08:40 -0600)]
Merge pull request #308 from ceph/wip-6146-wusui
Add ability to update with ceph-deploy
Zack Cerza [Wed, 27 Aug 2014 21:43:10 +0000 (15:43 -0600)]
Merge pull request #314 from ceph/wip-8923
Speed up teuthology-kill.
Sandon Van Ness [Wed, 27 Aug 2014 21:41:01 +0000 (14:41 -0700)]
Add more info in comments.
Signed-off-by: Sandon Van Ness <sandon@inktank.com>