git.apps.os.sepia.ceph.com Git - ceph.git/log
ceph.git
12 years ago teuthology-lock --summary: allow --machine-type=all
Dan Mick [Wed, 5 Jun 2013 00:46:05 +0000 (17:46 -0700)]
teuthology-lock --summary:  allow --machine-type=all

Signed-off-by: Dan Mick <dan.mick@inktank.com>
12 years ago ceph_manager: don't say you have no arguments and then list them
Dan Mick [Tue, 4 Jun 2013 23:11:19 +0000 (16:11 -0700)]
ceph_manager: don't say you have no arguments and then list them

Calling ceph pg dump --format=json works better without -- before pg
(how did this work before?...)

Signed-off-by: Dan Mick <dan.mick@inktank.com>
12 years ago ceph: fix valgrind grep output parsing
Sage Weil [Tue, 4 Jun 2013 16:07:53 +0000 (09:07 -0700)]
ceph: fix valgrind grep output parsing

When you pass a single file to zgrep you don't get the filename prefix,
which confuses the split line a few lines down.
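
Note: with several input files, grep-style tools prefix each match with `filename:`, while a single file yields the bare match, so unpacking `line.split(':')` into two names raises `ValueError`. The following is a minimal sketch of a tolerant parser, not the actual fix in this commit. `parse_grep_line` and `default_file` are hypothetical names, and the sketch assumes the matched text itself contains no colon.

```python
def parse_grep_line(line, default_file):
    """Split one grep/zgrep output line into (filename, match).

    'filename:match' is produced when several files are searched;
    a bare 'match' is produced for a single file. partition() copes
    with both forms without raising.
    """
    head, sep, tail = line.partition(':')
    if sep:
        # A filename prefix was present.
        return head, tail
    # No prefix: fall back to the file name the caller already knows.
    return default_file, head
```

Passing the known file name in explicitly keeps the caller's downstream logic unchanged whichever form the tool prints.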

12 years ago ceph: debug valgrind error
Sage Weil [Mon, 3 Jun 2013 16:57:17 +0000 (09:57 -0700)]
ceph: debug valgrind error

  File "/var/lib/teuthworker/teuthology-master/teuthology/task/ceph.py", line 215, in valgrind_post
    (file, kind) = line.split(':')
ValueError: need more than 1 value to unpack

12 years ago valgrind: add another leveldb suppression
Sage Weil [Fri, 31 May 2013 05:07:30 +0000 (22:07 -0700)]
valgrind: add another leveldb suppression

12 years ago valgrind: update suppressions for leveldb, libc leaks from mon
Sage Weil [Thu, 30 May 2013 18:25:32 +0000 (11:25 -0700)]
valgrind: update suppressions for leveldb, libc leaks from mon

These result in clean valgrind leak checks on the mon (at least with my
limited vstart testing).

12 years ago ceph: fix valgrind log check
Sage Weil [Wed, 22 May 2013 20:22:21 +0000 (13:22 -0700)]
ceph: fix valgrind log check

- logs are gzipped; use zgrep
- wait for the proc to exit before looking at stdout

Signed-off-by: Sage Weil <sage@inktank.com>
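
Note: a gzipped log can also be searched from Python with the standard `gzip` module. This is a swapped-in technique for illustration only; the commit itself switches to `zgrep`. `grep_gzipped` is a hypothetical helper name.

```python
import gzip


def grep_gzipped(path, needle):
    """Return the lines of a gzip-compressed text log that contain needle."""
    # 'rt' opens the compressed file in text mode, decompressing on the fly.
    with gzip.open(path, 'rt') as f:
        return [line.rstrip('\n') for line in f if needle in line]
```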
12 years ago schedule_suite.sh: resolve ceph sha1 using deb gitbuilder, not tarball
Sage Weil [Wed, 22 May 2013 16:25:40 +0000 (09:25 -0700)]
schedule_suite.sh: resolve ceph sha1 using deb gitbuilder, not tarball

The tarball one is old and largely obsolete.

12 years ago thrashosds: sync before doing powercycle testing
Sage Weil [Mon, 20 May 2013 19:26:49 +0000 (12:26 -0700)]
thrashosds: sync before doing powercycle testing

Hopefully fixes #5112

12 years ago schedule_suite.sh: 8hr -> 10hr suite timeout
Sage Weil [Mon, 20 May 2013 18:23:50 +0000 (11:23 -0700)]
schedule_suite.sh: 8hr -> 10hr suite timeout

Still missing some slow rbd tests.

12 years ago install: make overrides grouped by project
Sage Weil [Sat, 18 May 2013 01:53:02 +0000 (18:53 -0700)]
install: make overrides grouped by project

This lets us set different overrides for e.g. ceph vs samba, and makes it
so the schedule_teuthology.sh overrides don't specify a ceph sha1 for
samba installs.

Signed-off-by: Sage Weil <sage@inktank.com>
12 years ago client config will be done only after the cluster is operational.
tamil [Fri, 17 May 2013 19:08:45 +0000 (12:08 -0700)]
client config will be done only after the cluster is operational.

Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
12 years ago set permission for config file
tamil [Thu, 16 May 2013 20:14:06 +0000 (13:14 -0700)]
set permission for config file

Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
12 years ago schedule_suite.sh: put sha1 in install: overrides, not ceph:
Sage Weil [Thu, 16 May 2013 18:29:42 +0000 (11:29 -0700)]
schedule_suite.sh: put sha1 in install: overrides, not ceph:

Signed-off-by: Sage Weil <sage@inktank.com>
12 years ago added UserKnownHostsfile to ssh config
tamil [Thu, 16 May 2013 16:49:40 +0000 (09:49 -0700)]
added UserKnownHostsfile to ssh config

Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
12 years ago Fix scrub_test.py permission error
David Zafman [Tue, 14 May 2013 23:17:10 +0000 (16:17 -0700)]
Fix scrub_test.py permission error

Add description of yaml file including log-whitelist
Add sudo to dd that corrupts data

Signed-off-by: David Zafman <david.zafman@inktank.com>
Reviewed-by: Warren Usui <warren.usui@inktank.com>
12 years ago qemu: load the kvm module before trying to use it
Josh Durgin [Mon, 13 May 2013 21:19:59 +0000 (14:19 -0700)]
qemu: load the kvm module before trying to use it

It should be loaded before this, but in some cases it is not for some reason.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
12 years ago schedule_suite.sh: bump suite timeout from 6->8 hours
Sage Weil [Sun, 12 May 2013 00:07:14 +0000 (17:07 -0700)]
schedule_suite.sh: bump suite timeout from 6->8 hours

This captures the current slow rbd tasks.

Signed-off-by: Sage Weil <sage@inktank.com>
12 years ago localdir: create/cleanup mnt.foo dir on local fs
Sage Weil [Sat, 11 May 2013 06:01:04 +0000 (23:01 -0700)]
localdir: create/cleanup mnt.foo dir on local fs

This creates and cleans up a local mnt dir that can be consumed
by other tasks (like workunit, samba, etc), just like any
other client (ceph-fuse, kclient), except it is just a dir on
the local fs.

Signed-off-by: Sage Weil <sage@inktank.com>
12 years ago Do not scan for vm locks when listing all machines.
Warren Usui [Fri, 10 May 2013 17:18:44 +0000 (10:18 -0700)]
Do not scan for vm locks when listing all machines.

Fixes: #4830
Signed-off-by: Warren Usui <warren.usui@inktank.com>
12 years ago Revert "radosgw-admin: Test bucket list for bucket starting with underscore."
Yehuda Sadeh [Fri, 10 May 2013 17:18:42 +0000 (10:18 -0700)]
Revert "radosgw-admin: Test bucket list for bucket starting with underscore."

This reverts commit fa70eb8f67371568f47ae237606be63024164214.

Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
12 years ago nuke: don't require noipmi in ctx
Josh Durgin [Fri, 10 May 2013 01:20:59 +0000 (18:20 -0700)]
nuke: don't require noipmi in ctx

This is called from run.py too, which won't have ctx.noipmi.
The default of using ipmi is fine for now for run.py.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
12 years ago default project to ceph and extra_pkgs to none
tamil [Wed, 8 May 2013 23:05:34 +0000 (16:05 -0700)]
default project to ceph and extra_pkgs to none

Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
12 years ago Merge branch 'wip-teuth4768a-wusui'
Warren Usui [Wed, 8 May 2013 02:27:51 +0000 (19:27 -0700)]
Merge branch 'wip-teuth4768a-wusui'

Conflicts:
teuthology/task/install.py

12 years ago install: default to ceph project throughout
Sage Weil [Tue, 7 May 2013 23:00:15 +0000 (16:00 -0700)]
install: default to ceph project throughout

Signed-off-by: Sage Weil <sage@inktank.com>
12 years ago ceph_manager: add timeout option to revive, increase for power_cycle
Samuel Just [Mon, 6 May 2013 21:10:11 +0000 (14:10 -0700)]
ceph_manager: add timeout option to revive, increase for power_cycle

Signed-off-by: Samuel Just <sam.just@inktank.com>
12 years ago Merge branch 'next'
Sage Weil [Tue, 7 May 2013 04:31:36 +0000 (21:31 -0700)]
Merge branch 'next'

12 years ago install: only remove ceph data if project is ceph
Sage Weil [Tue, 7 May 2013 01:03:41 +0000 (18:03 -0700)]
install: only remove ceph data if project is ceph

Signed-off-by: Sage Weil <sage@inktank.com>
12 years ago task/cifs-mount.py: Task for mounting cifs
Sam Lang [Wed, 1 May 2013 15:56:04 +0000 (10:56 -0500)]
task/cifs-mount.py: Task for mounting cifs

The cifs-mount task mounts an SMB endpoint from the
first available smbd server (the samba.0 role).  This
task is similar to the ceph-fuse task: file system
tests can be run on the resulting mount point.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
12 years ago task/samba.py: Samba task to setup/start smbd
Sam Lang [Wed, 1 May 2013 15:55:53 +0000 (10:55 -0500)]
task/samba.py: Samba task to setup/start smbd

The samba task sets up samba on all 'samba' roles
with ceph as the backend storage module.  The task
creates a smb.conf file that points to ceph, and
starts smbd.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
12 years ago task/daemon-helper: Add nostdin option
Sam Lang [Thu, 2 May 2013 15:49:50 +0000 (10:49 -0500)]
task/daemon-helper: Add nostdin option

Some daemons (smbd) will try to read from stdin and check if it's a
socket, using that for sending/receiving messages.  If /dev/null is
used for stdin, the daemon aborts.  This patch adds a 'nostdin' option
to the daemon-helper so that the daemon can be started without /dev/null
as stdin.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
12 years ago task/install.py: Allow installation of non-ceph
Sam Lang [Wed, 1 May 2013 15:55:38 +0000 (10:55 -0500)]
task/install.py: Allow installation of non-ceph

Generalizes the install task to specify a "project" which defaults to
'ceph', but can be configured to install different project packages,
for example:

install:
  project: samba
  extra_packages: samba

The default install task uses 'ceph' as the project, and relies on an
existing set of defined packages to install.  For other projects, the
packages to be installed must be specified with the extra_packages
field.  Multiple install tasks can be specified:

install:
install:
  project: samba
  extra_packages: samba

Which installs ceph packages and then samba packages.

Also, cleanup in nuke.py so that nuke and install use the same list of
packages when doing the remove steps.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
12 years ago Fix teuthology installations on physical Centos machines.
Warren Usui [Wed, 1 May 2013 03:40:22 +0000 (20:40 -0700)]
Fix teuthology installations on physical Centos machines.
Yum installs of packages specify a package number.  Initial
install of yum source changed to not fail if already done.
Added yum cleans where necessary.

Fixes: #4768
Signed-off-by: Warren Usui <warren.usui@inktank.com>
12 years ago s3tests: add force-branch with higher precedence than 'branch'
Sage Weil [Thu, 2 May 2013 20:47:46 +0000 (13:47 -0700)]
s3tests: add force-branch with higher precedence than 'branch'

This way we can force a branch despite something in overrides.

Signed-off-by: Sage Weil <sage@inktank.com>
12 years ago nuke.py: Allow ipmi power cycling to be skipped
Sam Lang [Tue, 30 Apr 2013 20:10:35 +0000 (15:10 -0500)]
nuke.py: Allow ipmi power cycling to be skipped

Some nodes don't have ipmi setup.  Allow nuke to
skip the ipmi checking if -i (--noipmi) is specified.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
12 years ago For vms, fix some bad default configuration settings.
Warren Usui [Wed, 1 May 2013 20:28:47 +0000 (13:28 -0700)]
For vms, fix some bad default configuration settings.

Fixes: #4881
Signed-off-by: Warren Usui <warren.usui@inktank.com>
12 years ago ceph_manager: mount_osd_data expects osd as a str
Samuel Just [Wed, 1 May 2013 20:14:53 +0000 (13:14 -0700)]
ceph_manager: mount_osd_data expects osd as a str

Signed-off-by: Samuel Just <sam.just@inktank.com>
12 years ago ceph.py: set up ctx.disk_config outside of the loop
Samuel Just [Wed, 1 May 2013 20:14:35 +0000 (13:14 -0700)]
ceph.py: set up ctx.disk_config outside of the loop

Signed-off-by: Samuel Just <sam.just@inktank.com>
12 years ago ceph.py: the journal component does not currently work with restart
Samuel Just [Wed, 1 May 2013 20:13:52 +0000 (13:13 -0700)]
ceph.py: the journal component does not currently work with restart

Removing for the time being.

Signed-off-by: Samuel Just <sam.just@inktank.com>
12 years ago Merge remote branch 'origin/next'
Josh Durgin [Wed, 1 May 2013 16:52:02 +0000 (09:52 -0700)]
Merge remote branch 'origin/next'

12 years ago fix some errors found by pyflakes
Josh Durgin [Wed, 1 May 2013 00:07:53 +0000 (17:07 -0700)]
fix some errors found by pyflakes

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
12 years ago s3tests: revert useless portion of 1c50db6a4630d07e72144dafd985c397f8a42dc5
Josh Durgin [Tue, 30 Apr 2013 20:23:22 +0000 (13:23 -0700)]
s3tests: revert useless portion of 1c50db6a4630d07e72144dafd985c397f8a42dc5

Perhaps it was attempting to debug something, but it shouldn't have been committed.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
12 years ago rgw tests: remove users after each test
Josh Durgin [Tue, 30 Apr 2013 23:49:04 +0000 (16:49 -0700)]
rgw tests: remove users after each test

These should all be cleaned up at some point. They're
almost all the same code.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
12 years ago rgw tests: clean up immediately after the test
Josh Durgin [Tue, 30 Apr 2013 23:47:34 +0000 (16:47 -0700)]
rgw tests: clean up immediately after the test

There's no need for an explicit cleanup function, so move it back
to where it came from (except in s3roundtrip, which did not have it).

Instead, since these use a nested contextmanager, pass through
and yield to the top-level run_tasks after the nested
contextmanager has finished (and thus run all the cleanup steps
in the subtasks for this test).

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
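
Note: the pattern described above (let the nested context manager finish, so that every subtask's cleanup has run, and only then yield to the caller) can be sketched as follows. This is an illustration under stated assumptions, not the teuthology code: it uses `contextlib.ExitStack` from Python 3 in place of the nested helper the tasks used at the time, and `subtask`, `task`, and the step names are hypothetical.

```python
import contextlib


@contextlib.contextmanager
def subtask(name, log):
    """A hypothetical subtask that records its setup and cleanup."""
    log.append('setup ' + name)
    try:
        yield
    finally:
        log.append('cleanup ' + name)


@contextlib.contextmanager
def task(log):
    """Run all subtasks to completion, then yield to the caller."""
    with contextlib.ExitStack() as stack:
        for name in ('download', 'configure', 'run'):
            stack.enter_context(subtask(name, log))
        # Nothing else to do inside the nested block: pass through.
    # Leaving the block above has already run every subtask's cleanup
    # (in reverse order), so no explicit cleanup function is needed.
    log.append('subtasks done')
    yield
```

Because the cleanup runs when the inner block exits, anything the caller does inside `with task(log):` happens after the test directory has been tidied.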
12 years ago ceph: allow restarting radosgw
Josh Durgin [Tue, 30 Apr 2013 23:39:48 +0000 (16:39 -0700)]
ceph: allow restarting radosgw

Only split once, since radosgw will have client.X after it.
Monitors and MDSs may have names with more .s as well.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
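
Note: splitting only once keeps everything after the first dot together as the daemon id. A minimal sketch follows; `split_role` and the role strings are hypothetical examples, not names taken from the teuthology source.

```python
def split_role(role):
    """Split 'type.id' on the first dot only, so ids may contain dots."""
    daemon_type, daemon_id = role.split('.', 1)
    return daemon_type, daemon_id
```

With an unbounded `split('.')`, a role such as `rgw.client.0` would produce three pieces and break a two-name unpacking.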
12 years ago rgw: add to ctx.daemons so it can be stopped/started dynamically
Josh Durgin [Tue, 30 Apr 2013 23:37:48 +0000 (16:37 -0700)]
rgw: add to ctx.daemons so it can be stopped/started dynamically

Name the daemon after the client it runs on, since only
one per host is supported anyway.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
12 years ago misc: move daemon stopping function to a generic place
Josh Durgin [Tue, 30 Apr 2013 23:35:11 +0000 (16:35 -0700)]
misc: move daemon stopping function to a generic place

This will be useful for other daemons, like radosgw, in the future.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
12 years ago Verbose output on ceph-qa-chef.
Sandon Van Ness [Tue, 30 Apr 2013 20:04:28 +0000 (13:04 -0700)]
Verbose output on ceph-qa-chef.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
12 years ago misc: default base_test_dir to /home/ubuntu/cephtest
Sage Weil [Tue, 30 Apr 2013 16:15:37 +0000 (09:15 -0700)]
misc: default base_test_dir to /home/ubuntu/cephtest

This matches what the teuthworker is currently doing.

12 years ago swift, s3readwrite: add missing yield
Yehuda Sadeh [Tue, 30 Apr 2013 14:06:03 +0000 (07:06 -0700)]
swift, s3readwrite: add missing yield

Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
12 years ago Disable quiet mode wget output on wget for ceph-qa-chef
Sandon Van Ness [Tue, 30 Apr 2013 00:11:27 +0000 (17:11 -0700)]
Disable quiet mode wget output on wget for ceph-qa-chef

So maybe I can get a better idea of what is causing it to fail.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
12 years ago s3tests, s3readwrite, swift: cleanup explicitly
Yehuda Sadeh [Mon, 29 Apr 2013 18:24:04 +0000 (11:24 -0700)]
s3tests, s3readwrite, swift: cleanup explicitly

Cleaning up test dir explicitly after run, so that
consecutive runs don't fail.

Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
12 years ago workunit: use passed refspec rather than checking sha1 again
Samuel Just [Sun, 28 Apr 2013 19:15:42 +0000 (12:15 -0700)]
workunit: use passed refspec rather than checking sha1 again

Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
12 years ago install.upgrade: apt-get install instead of upgrade
Sage Weil [Sun, 28 Apr 2013 17:28:52 +0000 (10:28 -0700)]
install.upgrade: apt-get install instead of upgrade

Upgrade does not actually upgrade in some cases; use install!

Signed-off-by: Sage Weil <sage@inktank.com>
12 years ago install: prefer 'branch' over 'sha1'
Sage Weil [Sun, 28 Apr 2013 16:35:45 +0000 (09:35 -0700)]
install: prefer 'branch' over 'sha1'

The upgrade tasks specify 'branch' in the job file, but the
schedule_suite.sh script sets a sha1 in the overrides.  Make
the upgrade tests actually test an upgrade by preferring branch
over sha1 when both are specified.

This is fragile, but ought to do the trick for now!

Signed-off-by: Sage Weil <sage@inktank.com>
12 years ago Fix improperly spaced line.
Warren Usui [Fri, 26 Apr 2013 19:42:41 +0000 (12:42 -0700)]
Fix improperly spaced line.

12 years ago Use get('field', default) to assign downburst values for vps.
Warren Usui [Thu, 25 Apr 2013 19:37:17 +0000 (12:37 -0700)]
Use get('field', default) to assign downburst values for vps.

Fixes: #4592
Signed-off-by: Warren Usui <warren.usui@inktank.com>
Reviewed-by: Dan Mick <dan.mick@inktank.com>
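
Note: the `get('field', default)` idiom named in the title can be sketched as below. The field names and default values are hypothetical placeholders; the log does not state which downburst settings or defaults the commit uses.

```python
# Hypothetical defaults, for illustration only.
DEFAULTS = {'ram': '4G', 'cpus': 1, 'disk-size': '30G'}


def downburst_values(machine_config):
    """Take each setting from machine_config, falling back to a default."""
    return {
        key: machine_config.get(key, default)
        for key, default in DEFAULTS.items()
    }
```

Unlike `machine_config[key]`, `dict.get` does not raise `KeyError` when a machine entry omits a field.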

12 years ago repair_test.py: Additional test cases
David Zafman [Wed, 24 Apr 2013 22:50:28 +0000 (15:50 -0700)]
repair_test.py: Additional test cases

Test repair with more than 1 damaged object and with different types of damage
Regression test for bug #4778

Signed-off-by: David Zafman <david.zafman@inktank.com>
Reviewed-by: Samuel Just <sam.just@inktank.com>
12 years ago Add changes to make teuthology suites work on vms.
Warren Usui [Wed, 24 Apr 2013 00:28:27 +0000 (17:28 -0700)]
Add changes to make teuthology suites work on vms.

Fixes: #4719
Signed-off-by: Warren Usui <warren.usui@inktank.com>
Reviewed-by: Dan Mick <dan.mick@inktank.com>

12 years ago Merge branch 'wip-teuthologyfix4693-wusui'
Warren Usui [Tue, 23 Apr 2013 23:30:56 +0000 (16:30 -0700)]
Merge branch 'wip-teuthologyfix4693-wusui'

12 years ago Check downburst paths. Display an appropriate error message if an
Warren Usui [Tue, 23 Apr 2013 04:08:22 +0000 (21:08 -0700)]
Check downburst paths. Display an appropriate error message if an
executable downburst cannot be found.

Fixes: #4693
Signed-off-by: Warren Usui <warren.usui@inktank.com>
Reviewed-by: Dan Mick <dan.mick@inktank.com>

12 years ago Merge branch 'next' of github.com:ceph/teuthology into next
Sandon Van Ness [Tue, 23 Apr 2013 18:23:36 +0000 (11:23 -0700)]
Merge branch 'next' of github.com:ceph/teuthology into next

12 years ago Increase IPMI attempts to try to get around Flakey IPMI.
Sandon Van Ness [Tue, 23 Apr 2013 17:35:37 +0000 (10:35 -0700)]
Increase IPMI attempts to try to get around Flakey IPMI.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
Reviewed-by: Sam Lang <sam.lang@inktank.com>
12 years ago ceph.conf: enable full debugging on the mon
Sage Weil [Tue, 23 Apr 2013 18:02:27 +0000 (11:02 -0700)]
ceph.conf: enable full debugging on the mon

12 years ago Merge branch 'next'
Sage Weil [Mon, 22 Apr 2013 20:01:11 +0000 (13:01 -0700)]
Merge branch 'next'

12 years ago ceph-deploy: fix stop command
Sage Weil [Mon, 22 Apr 2013 20:01:02 +0000 (13:01 -0700)]
ceph-deploy: fix stop command

Signed-off-by: Sage Weil <sage@inktank.com>
12 years ago added ceph.client.admin.keyring on the client to run rbd and rados tests
tamil [Sat, 20 Apr 2013 01:23:54 +0000 (18:23 -0700)]
added ceph.client.admin.keyring on the client to run rbd and rados tests

Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
12 years ago added extra packages required by ceph-deploy for rbd and rados tests
tamil [Sat, 20 Apr 2013 01:13:01 +0000 (18:13 -0700)]
added extra packages required by ceph-deploy for rbd and rados tests

Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
12 years ago Merge branch 'next'
Sage Weil [Thu, 18 Apr 2013 18:11:38 +0000 (11:11 -0700)]
Merge branch 'next'

12 years ago ceph-deploy: stop daemons, archive, then purge[data]
Sage Weil [Thu, 18 Apr 2013 15:06:52 +0000 (08:06 -0700)]
ceph-deploy: stop daemons, archive, then purge[data]

Purge removes logs, and we want to archive those, so explicitly shut down
all daemons before doing the archiving step.

Signed-off-by: Sage Weil <sage@inktank.com>
12 years ago ceph.conf: lower mon disk avail warning threshold
Sage Weil [Wed, 3 Apr 2013 15:38:52 +0000 (08:38 -0700)]
ceph.conf: lower mon disk avail warning threshold

Only warn when we hit 90% instead of the default 70%.

Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit cf4bf09b2c4dae034332fb893cf96ed31adb7a4b)

12 years ago Merge branch 'next'
Sam Lang [Wed, 17 Apr 2013 23:09:39 +0000 (18:09 -0500)]
Merge branch 'next'

Conflicts:
teuthology/lock.py
teuthology/lockstatus.py
teuthology/misc.py
teuthology/task/install.py

12 years ago misc: Fix for case status['description'] == None
Sam Lang [Wed, 17 Apr 2013 22:38:36 +0000 (17:38 -0500)]
misc: Fix for case status['description'] == None

Skip the machine that has a description, but the
value is None.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
Reviewed-by: Warren Usui <warren.usui@inktank.com>
12 years ago radosgw-admin-rest: Add task for RESTful admin api.
caleb miles [Wed, 17 Apr 2013 15:45:59 +0000 (08:45 -0700)]
radosgw-admin-rest: Add task for RESTful admin api.

Signed-off-by: caleb miles <caleb.miles@inktank.com>

12 years ago misc: Check for 'None' string from yaml
Sam Lang [Wed, 17 Apr 2013 00:08:45 +0000 (19:08 -0500)]
misc: Check for 'None' string from yaml

The description attribute from the machines yaml returned by the
locker might be the string 'None'.  Need to explicitly check for
that to avoid using a test dir of /tmp/cephtest/None.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
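
Note: in YAML a bare `None` is an ordinary string (null is written `null` or `~`), which is how the string `'None'` can arrive in the description field. A minimal sketch of the explicit check follows; `usable_description` is a hypothetical helper, and the shape of the `status` dict is assumed rather than taken from the locker's actual output.

```python
def usable_description(status):
    """Return the description, or None if it is missing, None, or 'None'."""
    desc = status.get('description')
    if desc is None or desc == 'None':
        # Covers a missing key, a real null, and the literal string 'None'.
        return None
    return desc
```

A caller can then skip the machine, or fall back to a default test dir, whenever the helper returns `None`.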
12 years ago lock: Fix import cycle breakage
Sam Lang [Fri, 12 Apr 2013 17:55:54 +0000 (12:55 -0500)]
lock: Fix import cycle breakage

fa2049f caused an import cycle between lock.py and misc.py.  Move the
needed functions from lock.py to lockstatus.py so that we can avoid the
import cycle.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
Conflicts:
teuthology/lock.py

12 years ago Revert "Revert "Install.py: Prevent prompts from breaking apt""
Dan Mick [Mon, 15 Apr 2013 18:24:31 +0000 (11:24 -0700)]
Revert "Revert "Install.py: Prevent prompts from breaking apt""

This reverts commit 67a616a97927efdc4fbcc5edb0d0cf4a724d90e2.

Sigh.  As it turns out, /etc/default/grub being hacked also
causes the same problem.  I think there's a way to fix that cleanly
as well, but until then, replacing the "accept installed version"
hack here so jobs can run.

12 years ago Revert "Install.py: Prevent prompts from breaking apt"
Dan Mick [Fri, 12 Apr 2013 17:56:14 +0000 (10:56 -0700)]
Revert "Install.py: Prevent prompts from breaking apt"

This reverts commit 5995ae7e78dd19f4036f891db9db9fec97d6eab5.

With the changes to ceph-qa-chef and the teuthology kernel task,
we're no longer touching packaged file /etc/grub.d/10_linux, which
was the reason for this apt forcing.  Remove so that we find other
package problems that might be masked by this; we can always
put it back if there are such problems until we can fix those as well.

Signed-off-by: Dan Mick <dan.mick@inktank.com>
(cherry picked from commit c2b0828b19a426c6d73cb2f614333200ae66bac3)

12 years ago Install.py: Prevent prompts from breaking apt
Sandon Van Ness [Fri, 5 Apr 2013 02:15:14 +0000 (19:15 -0700)]
Install.py: Prevent prompts from breaking apt

Change apt commands to prevent prompts from coming up (forcing
non-interactive mode) so that prompts from grub or other packages don't
break teuthology runs.

Signed-off-by: Sandon Van Ness <sandon@inktank.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
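
Note: one common way to force non-interactive apt is to set `DEBIAN_FRONTEND=noninteractive` and pass dpkg's `--force-confdef` and `--force-confold` options, so that changed config files keep the installed version instead of prompting. The sketch below only builds the environment and argument list and does not run anything. Whether the commit used exactly these options is an assumption, and `noninteractive_apt_install` is a hypothetical name.

```python
import os


def noninteractive_apt_install(packages):
    """Build (env, argv) for an apt-get install that never prompts."""
    # Copy the current environment and select the non-interactive frontend.
    env = dict(os.environ, DEBIAN_FRONTEND='noninteractive')
    argv = [
        'apt-get', '-y',
        # Take the default action for changed config files, and where
        # there is no default, keep the currently installed version.
        '-o', 'Dpkg::Options::=--force-confdef',
        '-o', 'Dpkg::Options::=--force-confold',
        'install',
    ] + list(packages)
    return env, argv
```

The pair can be handed to `subprocess.check_call(argv, env=env)` on a host where running apt is appropriate.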
12 years ago misc: Check for 'None' string from yaml
Sam Lang [Wed, 17 Apr 2013 00:08:45 +0000 (19:08 -0500)]
misc: Check for 'None' string from yaml

The description attribute from the machines yaml returned by the
locker might be the string 'None'.  Need to explicitly check for
that to avoid using a test dir of /tmp/cephtest/None.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
12 years ago misc: Use pythonic 'is not None' for jobid case
Sam Lang [Sat, 13 Apr 2013 15:12:45 +0000 (10:12 -0500)]
misc: Use pythonic 'is not None' for jobid case

The conditional 'if global_jobid:' evaluates to true
in some cases even when global_jobid is None.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
12 years ago misc: Fix name parsing
Sam Lang [Fri, 12 Apr 2013 22:02:07 +0000 (17:02 -0500)]
misc: Fix name parsing

Use last two digits of year.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
12 years ago lock: Fix import cycle breakage
Sam Lang [Fri, 12 Apr 2013 17:55:54 +0000 (12:55 -0500)]
lock: Fix import cycle breakage

fa2049f caused an import cycle between lock.py and misc.py.  Move the
needed functions from lock.py to lockstatus.py so that we can avoid the
import cycle.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
Conflicts:
teuthology/lock.py

12 years ago misc: Use job id and make short path for testdir
Sam Lang [Thu, 11 Apr 2013 14:23:10 +0000 (09:23 -0500)]
misc: Use job id and make short path for testdir

Nightlies run on teuthology currently use a testdir of
/home/ubuntu/cephtest, but this causes stale job errors occasionally
from the previous tests not getting properly cleaned up, which prevents
the nightlies from running successfully.

The misc.py get_testdir() function can specify a testdir that is
specific to the job, but previously the path was too long and would
cause separate job failures.

This patch does two things to resolve that.  First, it uses the job id
from the teuthology run if one exists.  This should be a relatively
short number that will identify the job run effectively.  Second,
if the job id isn't available, it creates a shortened form of the
job's name, for example the job name:

teuthology-2013-04-09_23:51:49-rgw-next-testing-basic

becomes:

te1304092351rntb

Signed-off-by: Sam Lang <sam.lang@inktank.com>
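
Note: the shortening shown in the example above can be reproduced with the sketch below. The rule is reconstructed from that single example (first two letters of the leading component, two-digit year, month, day, hour, minute, then the first letter of each remaining component) and is an assumption, not the actual `misc.py` code; `shorten_job_name` is a hypothetical name.

```python
import re

# name-YYYY-MM-DD_HH:MM:SS-rest, as in the example job name.
_JOB_NAME = re.compile(
    r'^([^-]+)-(\d{4})-(\d{2})-(\d{2})_(\d{2}):(\d{2}):\d{2}-(.+)$')


def shorten_job_name(name):
    """Collapse a long job name into a short, path-friendly token."""
    m = _JOB_NAME.match(name)
    if m is None:
        raise ValueError('unrecognized job name: %r' % name)
    user, year, month, day, hour, minute, rest = m.groups()
    # First letter of each dash-separated component after the timestamp.
    initials = ''.join(part[0] for part in rest.split('-') if part)
    # year[2:] keeps only the last two digits of the year.
    return user[:2] + year[2:] + month + day + hour + minute + initials
```

For the job name in the commit message this yields `te1304092351rntb`.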
12 years ago ceph-deploy: purge before archiving
Sage Weil [Wed, 17 Apr 2013 03:50:50 +0000 (20:50 -0700)]
ceph-deploy: purge before archiving

Purge will uninstall and (in so doing) stop the daemons. This avoids trying
to tar up the mon data or logs while they are being written to, which
avoids errors like

2013-04-16T20:21:47.103 INFO:teuthology.task.ceph-deploy:Archiving mon data...
2013-04-16T20:21:47.545 INFO:teuthology.orchestra.run.err:tar: ./ceph-mira089/store.db/000009.log: file changed as we read it

Also drop the unnecessary uninstall (it is implied by purge).

Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit 4befae4fbea2413c9a8848ba195f336293619938)

12 years ago ceph-deploy: purge before archiving
Sage Weil [Wed, 17 Apr 2013 03:50:50 +0000 (20:50 -0700)]
ceph-deploy: purge before archiving

Purge will uninstall and (in so doing) stop the daemons. This avoids trying
to tar up the mon data or logs while they are being written to, which
avoids errors like

2013-04-16T20:21:47.103 INFO:teuthology.task.ceph-deploy:Archiving mon data...
2013-04-16T20:21:47.545 INFO:teuthology.orchestra.run.err:tar: ./ceph-mira089/store.db/000009.log: file changed as we read it

Also drop the unnecessary uninstall (it is implied by purge).

Signed-off-by: Sage Weil <sage@inktank.com>
12 years ago scheduled_suite.sh: check clock skew at start and end of run
Sage Weil [Wed, 3 Apr 2013 21:00:25 +0000 (14:00 -0700)]
scheduled_suite.sh: check clock skew at start and end of run

Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit 5c80201ec4a4a97367e4d7243cd046a3a8c808fa)

12 years ago misc: Fix close() call to pass in fd
Sam Lang [Mon, 15 Apr 2013 21:26:22 +0000 (16:26 -0500)]
misc: Fix close() call to pass in fd

fd is an int, we need to use os.close().

Signed-off-by: Sam Lang <sam.lang@inktank.com>
12 years ago misc: Fix bug in calling function remote_mktemp()
Sam Lang [Mon, 15 Apr 2013 21:16:34 +0000 (16:16 -0500)]
misc: Fix bug in calling function remote_mktemp()

Changed the name of the function to get a remote temporary filename,
need to update all the locations where it gets called.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
12 years ago Merge pull request #11 from ceph/wip-4717
Sam Lang [Mon, 15 Apr 2013 18:44:59 +0000 (11:44 -0700)]
Merge pull request #11 from ceph/wip-4717

misc: Use tempfile.mkstemp() instead of tempnam

12 years ago misc: Use tempfile.mkstemp() instead of tempnam
Sam Lang [Fri, 12 Apr 2013 20:52:47 +0000 (15:52 -0500)]
misc: Use tempfile.mkstemp() instead of tempnam

tempnam() is considered a security risk because the filename
generated is easy to guess and can be symlinked in advance.  Use
mkstemp() instead.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
Reviewed-by: Joe Buck <jbbuck@gmail.com>
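
Note: `tempfile.mkstemp()` creates the file atomically and returns a pair `(fd, path)`, where `fd` is a plain integer file descriptor rather than a file object, so it has to be released with `os.close(fd)`. A minimal sketch follows; `make_temp_path` is a hypothetical helper, not the function in `misc.py`.

```python
import os
import tempfile


def make_temp_path(suffix=''):
    """Create a temporary file safely and return its path."""
    # mkstemp() creates and opens the file in one step, so the name
    # cannot be guessed and symlinked in advance.
    fd, path = tempfile.mkstemp(suffix=suffix)
    # fd is an int, so there is no fd.close(); use os.close().
    os.close(fd)
    return path
```

The caller owns the file afterwards and is responsible for removing it.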
12 years ago Revert "Revert "Install.py: Prevent prompts from breaking apt""
Dan Mick [Mon, 15 Apr 2013 18:24:31 +0000 (11:24 -0700)]
Revert "Revert "Install.py: Prevent prompts from breaking apt""

This reverts commit 67a616a97927efdc4fbcc5edb0d0cf4a724d90e2.

Sigh.  As it turns out, /etc/default/grub being hacked also
causes the same problem.  I think there's a way to fix that cleanly
as well, but until then, replacing the "accept installed version"
hack here so jobs can run.

12 years ago misc: Use pythonic 'is not None' for jobid case
Sam Lang [Sat, 13 Apr 2013 15:12:45 +0000 (10:12 -0500)]
misc: Use pythonic 'is not None' for jobid case

The conditional 'if global_jobid:' evaluates to true
in some cases even when global_jobid is None.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
12 years ago misc: Fix name parsing
Sam Lang [Fri, 12 Apr 2013 22:02:07 +0000 (17:02 -0500)]
misc: Fix name parsing

Use last two digits of year.

Signed-off-by: Sam Lang <sam.lang@inktank.com>
12 years ago Revert "Install.py: Prevent prompts from breaking apt"
Dan Mick [Fri, 12 Apr 2013 17:56:14 +0000 (10:56 -0700)]
Revert "Install.py: Prevent prompts from breaking apt"

This reverts commit 5995ae7e78dd19f4036f891db9db9fec97d6eab5.

With the changes to ceph-qa-chef and the teuthology kernel task,
we're no longer touching packaged file /etc/grub.d/10_linux, which
was the reason for this apt forcing.  Remove so that we find other
package problems that might be masked by this; we can always
put it back if there are such problems until we can fix those as well.

Signed-off-by: Dan Mick <dan.mick@inktank.com>
(cherry picked from commit c2b0828b19a426c6d73cb2f614333200ae66bac3)

12 years ago kernel.py: put submenu name in 01_ceph_kernel if necessary
Dan Mick [Tue, 9 Apr 2013 22:53:49 +0000 (15:53 -0700)]
kernel.py: put submenu name in 01_ceph_kernel if necessary

We had been writing 01_ceph_kernel with the kernel title, and
relying on the fact that grub.cfg would never have submenus in it
(implemented by a hack to /etc/grub.d/10_linux which neutered its
submenu creation).  However, that hack was modifying a package file,
and got in the way of later apt commands.  Rather than doing it
that way, this divines the title of the submenu and sets the
default variable to "submenu>kernel", which works to select the
desired kernel.

It depends on there being only one level of submenu, and on the
format of the menuentry and submenu commands, dictated by grub2.
None of this is likely to work at all outside Ubuntu.

Fixes: #4496
Signed-off-by: Dan Mick <dan.mick@inktank.com>
Reviewed-by: Dan Mick <dan.mick@inktank.com>
(cherry picked from commit 52aec32a7da07ca6e9a22ecedde78dafb4b74dfc)

12 years ago peer.py: we can't assume pg query state will match mon pg state
Samuel Just [Fri, 12 Apr 2013 22:01:04 +0000 (15:01 -0700)]
peer.py: we can't assume pg query state will match mon pg state

The pg state could easily have changed in the meantime,
for example from recovery_wait to recovering.

Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Greg Farnum <greg@inktank.com>
12 years ago Merge pull request #10 from ceph/wip-fix-importcycle
Sam Lang [Fri, 12 Apr 2013 20:06:55 +0000 (13:06 -0700)]
Merge pull request #10 from ceph/wip-fix-importcycle

lock: Fix import cycle breakage

Reviewed-by: Warren Usui <warren.usui@inktank.com>