]>
git.apps.os.sepia.ceph.com Git - teuthology.git/log
Zack Cerza [Thu, 23 Apr 2015 16:51:54 +0000 (10:51 -0600)]
Split internal.connect() into two subtasks
One creates the ctx.cluster object; the other initiates connections
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Thu, 23 Apr 2015 16:50:58 +0000 (10:50 -0600)]
Pass timeout value down to connection.connect()
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Thu, 23 Apr 2015 16:21:46 +0000 (10:21 -0600)]
Add timeout arg to connect()
Default behavior is unchanged
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Wed, 15 Apr 2015 18:31:08 +0000 (12:31 -0600)]
Don't call Remote.connect() in Remote.__init__()
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Wed, 15 Apr 2015 18:15:44 +0000 (12:15 -0600)]
Make Remote.reconnect() retry optionally
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Wed, 15 Apr 2015 16:58:04 +0000 (10:58 -0600)]
Properly close connection on object deletion
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Tue, 14 Apr 2015 17:38:15 +0000 (11:38 -0600)]
Merge pull request #460 from zhouyuan/mkdir_p
Make parent directories as needed
Andrew Schoen [Tue, 14 Apr 2015 17:27:07 +0000 (12:27 -0500)]
Merge pull request #463 from ceph/wip-suite-tests
Basic teuthology.suite.build_matrix() unit tests
Zack Cerza [Tue, 14 Apr 2015 15:30:14 +0000 (09:30 -0600)]
Make test_gitbuilder_url() use the local config
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Mon, 13 Apr 2015 22:18:13 +0000 (16:18 -0600)]
Add unit tests for suite.build_matrix()
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Mon, 13 Apr 2015 22:17:40 +0000 (16:17 -0600)]
Parametrize build_matrix() for unit tests
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Fri, 10 Apr 2015 16:30:53 +0000 (10:30 -0600)]
Always call unlock_one() with the user arg
I'd seen a few cases where vps creation would fail, and then unlocking
failed because we were incorrectly guessing the owner of the lock.
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Thu, 9 Apr 2015 21:42:13 +0000 (15:42 -0600)]
Skip passing os_type/version when destroying
Signed-off-by: Zack Cerza <zack@redhat.com>
Andrew Schoen [Thu, 9 Apr 2015 20:50:10 +0000 (15:50 -0500)]
Merge pull request #459 from ceph/wip-downburst
Refactor downburst provisioning
Zack Cerza [Thu, 9 Apr 2015 16:33:26 +0000 (10:33 -0600)]
Add basic unit tests for Downburst
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Thu, 9 Apr 2015 16:30:45 +0000 (10:30 -0600)]
Make *_if_vm() more unit-testable
Allow passing in a custom Downburst instance. Also clean up several
linter issues.
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Fri, 27 Feb 2015 20:40:07 +0000 (13:40 -0700)]
Use provision.downburst_executable()
Signed-off-by: Zack Cerza <zack@redhat.com>
Andrew Schoen [Tue, 7 Apr 2015 22:54:07 +0000 (17:54 -0500)]
Merge pull request #461 from ceph/wip-update-inventory
Add teuthology-update-inventory command
Zack Cerza [Tue, 7 Apr 2015 16:54:28 +0000 (10:54 -0600)]
Add teuthology-update-inventory command
This ought to also replace teuthology-updatekeys, but might not be as
fast yet.
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Fri, 13 Feb 2015 17:41:36 +0000 (10:41 -0700)]
Move downburst code into new Downburst class
Signed-off-by: Zack Cerza <zack@redhat.com>
Yuan Zhou [Tue, 7 Apr 2015 04:35:38 +0000 (12:35 +0800)]
Make parent directories as needed
with '-p' when mkdir
Signed-off-by: Yuan Zhou <yuan.zhou@intel.com>
Zack Cerza [Fri, 13 Feb 2015 03:20:45 +0000 (20:20 -0700)]
Clarify error logging in destroy_if_vm()
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Thu, 12 Feb 2015 21:57:57 +0000 (14:57 -0700)]
Be smarter about logging in create_if_vm()
Also return False on failure.
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Thu, 12 Feb 2015 21:54:32 +0000 (14:54 -0700)]
Strip ssh-keyscan output before logging it
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Fri, 3 Apr 2015 03:41:43 +0000 (21:41 -0600)]
Be more tolerant of git connection errors
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Wed, 1 Apr 2015 23:30:48 +0000 (17:30 -0600)]
Be slightly smarter when looking for stale locks
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Fri, 27 Mar 2015 18:58:53 +0000 (12:58 -0600)]
Merge pull request #458 from ceph/wip-11248
Disable version check when ceph-deploy is used to install ceph.
Andrew Schoen [Fri, 27 Mar 2015 17:43:04 +0000 (10:43 -0700)]
Disable version check when ceph-deploy is used to install ceph.
See: http://tracker.ceph.com/issues/11248
Signed-off-by: Andrew Schoen <aschoen@redhat.com>
Zack Cerza [Fri, 27 Mar 2015 17:23:16 +0000 (11:23 -0600)]
On BranchNotFoundError, mark job as dead
We had been simply deleting jobs from paddles if the teuthology branch
was not found. Instead, mark them as dead with a useful failure_reason.
Signed-off-by: Zack Cerza <zack@redhat.com>
Andrew Schoen [Thu, 26 Mar 2015 22:41:31 +0000 (15:41 -0700)]
Merge pull request #457 from ceph/wip-lttng-valgrind
valgrind: ignore lttng warning on un-cleaned-up state
Greg Farnum [Thu, 26 Mar 2015 21:13:45 +0000 (14:13 -0700)]
valgrind: ignore lttng warning on un-cleaned-up state
This appears to have come from
dca722ec7b2a7fc9214844ec92310074b5cb2faa ,
which merged in support for use of lttng ust functions on fork. Valgrind
started warning on a Leak_StillReachable with lttng_ust_init in
the function call trace.
Hopefully this is specific enough that we don't mask out any other
lttng errors.
Fixes: #11247
Signed-off-by: Greg Farnum <gfarnum@redhat.com>
Zack Cerza [Thu, 26 Mar 2015 20:16:22 +0000 (13:16 -0700)]
Merge pull request #456 from ceph/wip-11243
Verify version for other projects besides ceph in the install task
Andrew Schoen [Thu, 26 Mar 2015 16:37:03 +0000 (09:37 -0700)]
Verify version for other projects besides ceph in the install task
Fixes: http://tracker.ceph.com/issues/11243
Signed-off-by: Andrew Schoen <aschoen@redhat.com>
Zack Cerza [Tue, 24 Mar 2015 23:13:35 +0000 (16:13 -0700)]
Avoid double-unlocking with unlock_on_failure
If both unlock_on_failure and nuke-on-error are set, don't unlock now
because we're just going to nuke (and unlock) later.
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Mon, 23 Mar 2015 19:05:41 +0000 (13:05 -0600)]
Merge pull request #451 from ceph/verify-version
Verify that the correct ceph version is installed on updates and install
Zack Cerza [Mon, 23 Mar 2015 15:43:30 +0000 (09:43 -0600)]
Merge pull request #454 from ktdreyer/wip-comment-libvirt-bootstrap
bootstrap: explain the system python-libvirt pkg
Ken Dreyer [Mon, 23 Mar 2015 15:30:26 +0000 (09:30 -0600)]
bootstrap: explain the system python-libvirt pkg
In #sepia today, Zack explained the reasoning for the python-libvirt
package on the Debian family of distros.
Andrew Schoen [Thu, 19 Mar 2015 21:19:55 +0000 (16:19 -0500)]
Merge pull request #453 from ceph/wip-11170
Allow running teuthology-lock --update on broken vps nodes
Zack Cerza [Thu, 19 Mar 2015 20:58:26 +0000 (14:58 -0600)]
Tweak ssh-keyscan timing
Make the call itself timeout after 1s instead of 5s. Make update_lock()
try 15 times, sleeping for 1s between attempts. A full timeout of
update_lock() should take ~30s.
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Thu, 19 Mar 2015 20:32:58 +0000 (14:32 -0600)]
Use safe_while instead of an unsafe loop
If ssh-keyscan fails, we would hang forever. Stop that nonsense, and
give up eventually, print a warning, and move on.
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Thu, 19 Mar 2015 20:32:06 +0000 (14:32 -0600)]
Fix error formatting when 'action' is passed
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Thu, 19 Mar 2015 19:17:35 +0000 (13:17 -0600)]
Merge pull request #452 from dmick/master
lock.py: allow machines list for --brief
Dan Mick [Thu, 19 Mar 2015 18:47:29 +0000 (11:47 -0700)]
lock.py: allow machines list for --brief
There was never any reason not to.
Fixes: #11167
Signed-off-by: Dan Mick <dan.mick@redhat.com>
Andrew Schoen [Wed, 18 Mar 2015 16:11:32 +0000 (11:11 -0500)]
Merge pull request #424 from ceph/wip-5629
Download ELFs (regular + debug) from remotes when coredump occurs.
Zack Cerza [Tue, 17 Mar 2015 16:44:21 +0000 (10:44 -0600)]
Use Remote.get_file()
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Tue, 17 Mar 2015 16:35:47 +0000 (10:35 -0600)]
Fix some linter issues
Signed-off-by: Zack Cerza <zack@redhat.com>
Sandon Van Ness [Tue, 17 Feb 2015 21:21:24 +0000 (13:21 -0800)]
Download ELFs (regular + debug) from remotes when coredump occurs.
Per issue #5629
Since we need the binaries (regular stripped + non-stripped with
debug symbols) to do debugging this will make it more convinient
for developers so they don't have to track down the files from
the specific build in question and instead they will just be
sitting in the archive folder with the coredump files.
Signed-off-by: Sandon Van Ness <sandon@redhat.com>
Andrew Schoen [Thu, 26 Feb 2015 21:12:29 +0000 (15:12 -0600)]
Verify that the correct ceph version is installed on updates and install
This adds a function to teuthology.packaging that can be reused to
fetch the installed version of a package for the given remote.
Signed-off-by: Andrew Schoen <aschoen@redhat.com>
Andrew Schoen [Thu, 5 Mar 2015 17:38:29 +0000 (11:38 -0600)]
Merge pull request #450 from ceph/wip-10989
Add optional exclude_packages arg to upgrade subtask
Zack Cerza [Thu, 5 Mar 2015 16:59:25 +0000 (09:59 -0700)]
Add optional exclude_packages arg to upgrade
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Wed, 4 Mar 2015 17:11:42 +0000 (10:11 -0700)]
Smarter option handling in upgrade_common()
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Wed, 4 Mar 2015 15:48:37 +0000 (08:48 -0700)]
Fix a couple bugs and mark down hosts as down
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Tue, 3 Mar 2015 22:12:40 +0000 (15:12 -0700)]
Don't use yaml in FakeNamespace's str()
Signed-off-by: Zack Cerza <zack@redhat.com>
Andrew Schoen [Tue, 3 Mar 2015 21:56:32 +0000 (15:56 -0600)]
Merge pull request #449 from ceph/wip-repr
Don't use yaml in FakeNamespace's repr()
Zack Cerza [Tue, 3 Mar 2015 21:55:34 +0000 (14:55 -0700)]
Merge pull request #441 from ceph/wip-10803
Bug #10803: Try apt-get -f install to workaround nuke failure.
Sandon Van Ness [Fri, 13 Feb 2015 18:46:05 +0000 (10:46 -0800)]
Bug #10803: Try apt-get -f install to workaround nuke failure.
Sometimes packages or dpkg gets in a weird state where a nuke will
fail to run and simply running an apt-get -f install on the
machine will fix things up so it can be nuked again. Currently
we only ran that being dependent on dpkg --configure completing
where as I changed it so it will still try apt-get -f install
to potentially require less manual intervention when cleaning up
machines. I kept the original behavior but adding some arguments
to insure it will not prompt you for a yes/no (like apt-get -f
install usually would do).
Signed-off-by: Sandon Van Ness <sandon@redhat.com>
Zack Cerza [Tue, 3 Mar 2015 21:19:42 +0000 (14:19 -0700)]
Don't use yaml in FakeNamespace's repr()
Signed-off-by: Zack Cerza <zack@redhat.com>
Josh Durgin [Tue, 3 Mar 2015 00:35:25 +0000 (16:35 -0800)]
install/nuke: add rbd-fuse package
This is now used by rbd's merge_diff test. rbd-fuse has been around
since before dumpling, so this is compatible with all current ceph
testing.
Signed-off-by: Josh Durgin <jdurgin@redhat.com>
Zack Cerza [Wed, 25 Feb 2015 19:16:23 +0000 (12:16 -0700)]
Merge pull request #447 from ceph/upgrade-issue
Do not specify a version when upgrading ceph on rpm-based systems.
Andrew Schoen [Tue, 24 Feb 2015 16:56:57 +0000 (10:56 -0600)]
Do not specify a version when upgrading ceph on rpm-based systems.
Because of the split of ceph-devel, upgrading ceph-devel with an
explicit version does not allow check_obsoletes to work.
For, http://tracker.ceph.com/issues/10936
Signed-off-by: Andrew Schoen <aschoen@redhat.com>
Andrew Schoen [Wed, 25 Feb 2015 15:48:34 +0000 (09:48 -0600)]
Merge pull request #448 from ceph/hadoop-workunits
hadoop: support rhel and ubuntu
Noah Watkins [Wed, 25 Feb 2015 02:36:49 +0000 (18:36 -0800)]
hadoop: support rhel and ubuntu
Signed-off-by: Noah Watkins <noahwatkins@gmail.com>
Zack Cerza [Tue, 24 Feb 2015 20:05:34 +0000 (13:05 -0700)]
Merge pull request #427 from ceph/travisci
add Travis CI integration
Sage Weil [Mon, 23 Feb 2015 22:14:47 +0000 (14:14 -0800)]
Merge pull request #446 from ceph/wip-10926-andrew
Do not install packages with a specified version, fixes #10926
Andrew Schoen [Mon, 23 Feb 2015 20:47:40 +0000 (14:47 -0600)]
Do not install packages with a specified version, fixes #10926
Now that we've got yum setup to check for obsoletes we don't need to
specify a version. Specifying the version actually keeps yum from
checking for obsoletes.
Signed-off-by: Andrew Schoen <aschoen@redhat.com>
Andrew Schoen [Mon, 23 Feb 2015 18:17:49 +0000 (12:17 -0600)]
Merge pull request #445 from ceph/hadoop
hadoop: clean-up and sigkill delay fix
Noah Watkins [Mon, 23 Feb 2015 00:53:49 +0000 (16:53 -0800)]
hadoop: add yarn sigkill delay hack
This instructs yarn to wait more time (10sec) than the default (250ms)
before killing containers to give ceph clients time to properly
shutdown.
Signed-off-by: Noah Watkins <noahwatkins@gmail.com>
Noah Watkins [Mon, 23 Feb 2015 00:53:09 +0000 (16:53 -0800)]
hadoop: use dict-to-conf converter
Signed-off-by: Noah Watkins <noahwatkins@gmail.com>
Andrew Schoen [Wed, 18 Feb 2015 18:55:35 +0000 (12:55 -0600)]
Merge pull request #443 from ceph/wip-10910
Allow Remote.get_file() to use original filename
Zack Cerza [Wed, 18 Feb 2015 17:05:18 +0000 (10:05 -0700)]
Allow Remote.get_file() to use original filename
If dest_dir != '/tmp', attempt to use the original filename.
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Wed, 18 Feb 2015 18:48:26 +0000 (11:48 -0700)]
Update out-of-date integration test
Signed-off-by: Zack Cerza <zack@redhat.com>
Andrew Schoen [Tue, 17 Feb 2015 20:57:07 +0000 (14:57 -0600)]
Merge pull request #442 from ceph/wip-7364
Don't fail just because apt-get update does
Zack Cerza [Tue, 17 Feb 2015 20:26:34 +0000 (13:26 -0700)]
Don't fail just because apt-get update does
If we fail later anyway, fine. If not, yay!
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Tue, 17 Feb 2015 17:37:40 +0000 (10:37 -0700)]
Don't fail if task runs twice
This was happening because _yum_unset_check_obsoletes() was failing
Signed-off-by: Zack Cerza <zack@redhat.com>
Andrew Schoen [Mon, 16 Feb 2015 20:15:54 +0000 (14:15 -0600)]
Merge pull request #440 from ceph/wip-10893
Set check_obsoletes = 1 in yum's priorities.conf
Zack Cerza [Mon, 16 Feb 2015 20:08:57 +0000 (13:08 -0700)]
Set check_obsoletes = 1 in yum's priorities.conf
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Mon, 16 Feb 2015 16:03:57 +0000 (09:03 -0700)]
Merge pull request #439 from ceph/pytest-better-errors
put test failures in failure_reason; skip test_correct_os_version for debian
Andrew Schoen [Fri, 13 Feb 2015 21:05:18 +0000 (15:05 -0600)]
Make an exception for debian in tests.test_correct_os_version
This is because of a known issue where downburst gives us 7.1 when we
ask for 7.0. We're ok with this behavior for now. See: issue #10878
Signed-off-by: Andrew Schoen <aschoen@redhat.com>
Andrew Schoen [Fri, 13 Feb 2015 17:32:59 +0000 (11:32 -0600)]
Include test failures in ctx.summary['failure_reason']
I also added a new line to make reading the log nicer
Signed-off-by: Andrew Schoen <aschoen@redhat.com>
Andrew Schoen [Fri, 13 Feb 2015 20:58:42 +0000 (14:58 -0600)]
Merge pull request #438 from ceph/wip-10879
Avoid race condition in find_stale_locks()
Zack Cerza [Fri, 13 Feb 2015 19:25:19 +0000 (12:25 -0700)]
Avoid race condition in find_stale_locks()
Because of the way we were checking nodes against running jobs, it was
possible to falsely report nodes as stale if they were part of a job
that was transitioning from the waiting to running state.
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Fri, 13 Feb 2015 15:36:32 +0000 (08:36 -0700)]
Merge pull request #437 from ceph/pytest-noterminalreporter
Fixes the issue of pytest failing with scheduled jobs
Andrew Schoen [Thu, 12 Feb 2015 21:19:25 +0000 (15:19 -0600)]
Remove the pytest default TerminalReporter in the tests task
This fixes the issue of pytest IO Error failure in a scheduled job
Signed-off-by: Andrew Schoen <aschoen@redhat.com>
Andrew Schoen [Thu, 12 Feb 2015 21:14:30 +0000 (15:14 -0600)]
If there is an error running pytest, mark job as dead
Signed-off-by: Andrew Schoen <aschoen@redhat.com>
Andrew Schoen [Thu, 12 Feb 2015 18:42:50 +0000 (12:42 -0600)]
Merge pull request #435 from ceph/wip-10856
Safer vps unlocking
Zack Cerza [Thu, 12 Feb 2015 17:04:23 +0000 (10:04 -0700)]
Pass a description to unlock_one()
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Thu, 12 Feb 2015 17:03:44 +0000 (10:03 -0700)]
Add optional description arg to unlock_one()
It gets passed to the lock server.
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Thu, 12 Feb 2015 17:00:38 +0000 (10:00 -0700)]
Add optional description arg to destroy_if_vm()
If it is passed and doesn't match the one received from the lock server,
don't destroy the VM.
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Thu, 12 Feb 2015 18:14:55 +0000 (11:14 -0700)]
Merge pull request #436 from ceph/wip-hadoop-linter
hadoop: remove parallel from import; fixes linter warning
Greg Farnum [Thu, 12 Feb 2015 18:06:57 +0000 (10:06 -0800)]
hadoop: remove parallel from import; fixes linter warning
Signed-off-by: Greg Farnum <gfarnum@redhat.com>
Gregory Farnum [Thu, 12 Feb 2015 05:03:56 +0000 (21:03 -0800)]
Merge pull request #434 from ceph/hadoop
Hadoop 2.0 Task
This will need more updating, but pulling it in-tree gets better in-situ testing, and the hadoop task is isolated.
Zack Cerza [Wed, 11 Feb 2015 22:35:14 +0000 (15:35 -0700)]
Merge pull request #432 from ceph/pytest-stdout
Make pytest capture stdout when running the tests task
Andrew Schoen [Wed, 11 Feb 2015 22:23:10 +0000 (16:23 -0600)]
Make pytest capture stdout when running the tests task
This makes the logs easier to read.
Signed-off-by: Andrew Schoen <aschoen@redhat.com>
Noah Watkins [Tue, 20 Jan 2015 18:55:28 +0000 (10:55 -0800)]
hadoop: separate ceph/hdfs config actions
Signed-off-by: Noah Watkins <noahwatkins@gmail.com>
Noah Watkins [Mon, 19 Jan 2015 23:55:41 +0000 (15:55 -0800)]
hadoop: support cephfs
Signed-off-by: Noah Watkins <noahwatkins@gmail.com>
Noah Watkins [Mon, 19 Jan 2015 22:52:48 +0000 (14:52 -0800)]
hadoop: easier config creation
Signed-off-by: Noah Watkins <noahwatkins@gmail.com>
Noah Watkins [Mon, 19 Jan 2015 22:25:37 +0000 (14:25 -0800)]
misc: create prepend_lines helper
Signed-off-by: Noah Watkins <noahwatkins@gmail.com>
Noah Watkins [Mon, 19 Jan 2015 22:25:21 +0000 (14:25 -0800)]
hadoop: 2x
Signed-off-by: Noah Watkins <noahwatkins@gmail.com>
Andrew Schoen [Wed, 11 Feb 2015 20:31:32 +0000 (14:31 -0600)]
Merge pull request #431 from ceph/wip-10825
Smarter sources.list/.repo removal
Zack Cerza [Wed, 11 Feb 2015 19:03:37 +0000 (12:03 -0700)]
Tweak _remove_sources_list_{deb,rpm}()
Make their implementations more concise, their logging more verbose, and
avoid running apt-get update if no sources list was removed.
Signed-off-by: Zack Cerza <zack@redhat.com>