git.apps.os.sepia.ceph.com Git - teuthology.git/log
teuthology.git
3 years agokill: Check if sudo works before using it 1703/head
Zack Cerza [Wed, 12 Jan 2022 19:21:27 +0000 (12:21 -0700)]
kill: Check if sudo works before using it

Also, use sudo's -n flag to avoid the password prompt and resulting
delay.

Fixes: https://tracker.ceph.com/issues/53853
Signed-off-by: Zack Cerza <zack@redhat.com>
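A minimal sketch of the idea described in this commit, not the code from teuthology itself. The helper names `sudo_works` and `with_sudo` are hypothetical. It relies only on sudo's `-n` (non-interactive) flag, which makes sudo fail immediately instead of prompting for a password.

```python
import subprocess


def sudo_works(timeout=10):
    """Return True if sudo can run a command without prompting.

    Hypothetical helper: `sudo -n true` exits non-zero right away when a
    password would be required, so there is no prompt and no delay.
    """
    try:
        proc = subprocess.run(
            ["sudo", "-n", "true"],
            stdout=subprocess.DEVNULL,
            stderr=subprocess.DEVNULL,
            timeout=timeout,
        )
    except (OSError, subprocess.TimeoutExpired):
        # sudo is missing, not executable, or hung: treat as unusable.
        return False
    return proc.returncode == 0


def with_sudo(args, works):
    """Prefix args with `sudo -n` only when sudo is known to work."""
    return ["sudo", "-n"] + list(args) if works else list(args)
```

A caller would probe once with `sudo_works()` and pass the result to `with_sudo()` for each command it builds.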
3 years agoMerge pull request #1723 from ceph/wip-standalone
Zack Cerza [Fri, 11 Mar 2022 23:21:25 +0000 (16:21 -0700)]
Merge pull request #1723 from ceph/wip-standalone

fixes for running non-scheduled jobs

3 years agotask/ansible: check for archive before using it 1723/head
Josh Durgin [Mon, 7 Mar 2022 18:47:33 +0000 (18:47 +0000)]
task/ansible: check for archive before using it

When running a non-scheduled job, you may not have archiving configured.

Signed-off-by: Josh Durgin <jdurgin@redhat.com>
3 years agorun: make fetching tasks work for non-scheduled jobs
Josh Durgin [Mon, 7 Mar 2022 18:34:36 +0000 (18:34 +0000)]
run: make fetching tasks work for non-scheduled jobs

The qa path should be the default (just like it's set by
teuthology-suite), and we actually need to do the import to access the
tasks.

Signed-off-by: Josh Durgin <jdurgin@redhat.com>
3 years agoMerge pull request #1721 from ceph/more-tox-gha
Kamoltat Sirivadhna [Wed, 9 Mar 2022 17:36:40 +0000 (12:36 -0500)]
Merge pull request #1721 from ceph/more-tox-gha

github actions: Run docs build during CI
Reviewed-by: Zack Cerza <zack@redhat.com>
3 years agogithub actions: Run docs build during CI 1721/head
Zack Cerza [Fri, 4 Mar 2022 22:29:49 +0000 (15:29 -0700)]
github actions: Run docs build during CI

If we do this, we can drop the Jenkins job. The openstack tests are
currently all skipped, so we can just leave them out for now.

Signed-off-by: Zack Cerza <zack@redhat.com>
3 years agoMerge pull request #1715 from ceph/pynacl
Josh Durgin [Wed, 2 Mar 2022 18:20:55 +0000 (10:20 -0800)]
Merge pull request #1715 from ceph/pynacl

Update pynacl

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
3 years agoMerge pull request #1714 from ceph/ui-connect
Josh Durgin [Wed, 2 Mar 2022 18:19:56 +0000 (10:19 -0800)]
Merge pull request #1714 from ceph/ui-connect

teuthology-update-inventory: Call Remote.connect()

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
3 years agoMerge pull request #1712 from ceph/remote-is-container
Josh Durgin [Wed, 2 Mar 2022 18:19:07 +0000 (10:19 -0800)]
Merge pull request #1712 from ceph/remote-is-container

Remote: Add is_container property

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
3 years agoMerge pull request #1718 from kamoltat/wip-ksirivad-dispatcher-notes-doc
Kamoltat Sirivadhna [Mon, 28 Feb 2022 16:07:36 +0000 (11:07 -0500)]
Merge pull request #1718 from kamoltat/wip-ksirivad-dispatcher-notes-doc

docs: add dispatcher trouble-shooting notes
Reviewed by: Aishwarya Mathuria <amathuri@redhat.com>

3 years agodocs: add dispatcher trouble-shooting notes 1718/head
Kamoltat Sirivadhna [Mon, 28 Feb 2022 15:15:00 +0000 (10:15 -0500)]
docs: add dispatcher trouble-shooting notes

documenting https://tracker.ceph.com/issues/54366

Signed-off-by: Kamoltat Sirivadhna <ksirivad@redhat.com>
3 years agoMerge pull request #1717 from ceph/1716b
David Galloway [Thu, 24 Feb 2022 18:24:01 +0000 (13:24 -0500)]
Merge pull request #1717 from ceph/1716b

selinux.py: Ignore both contexts for NetworkManager_dispatcher_t

3 years agoselinux.py: Ignore both contexts for NetworkManager_dispatcher_t 1717/head
David Galloway [Thu, 24 Feb 2022 16:50:14 +0000 (11:50 -0500)]
selinux.py: Ignore both contexts for NetworkManager_dispatcher_t

Signed-off-by: David Galloway <dgallowa@redhat.com>
3 years agoMerge pull request #1716 from ceph/54337
Yuri Weinstein [Wed, 23 Feb 2022 16:37:23 +0000 (08:37 -0800)]
Merge pull request #1716 from ceph/54337

selinux.py: Ignore new NetworkManager denials

Reviewed-by: Brad Hubbard <bhubbard@redhat.com>
Reviewed-by: Venky Shankar <vshankar@redhat.com>
3 years agoselinux.py: Ignore new NetworkManager denials 1716/head
David Galloway [Tue, 22 Feb 2022 15:04:56 +0000 (10:04 -0500)]
selinux.py: Ignore new NetworkManager denials

Fixes: https://tracker.ceph.com/issues/54337
Signed-off-by: David Galloway <dgallowa@redhat.com>
3 years agoMerge pull request #1713 from ceph/ansible-log
Zack Cerza [Thu, 17 Feb 2022 16:57:40 +0000 (09:57 -0700)]
Merge pull request #1713 from ceph/ansible-log

task.ansible: Don't direct task's logging to file

3 years agoMerge pull request #1711 from ceph/cluster-filter
Zack Cerza [Wed, 16 Feb 2022 19:10:47 +0000 (12:10 -0700)]
Merge pull request #1711 from ceph/cluster-filter

Cluster: Add filter() function

3 years agorequirements: Update pynacl 1715/head
Zack Cerza [Thu, 20 Jan 2022 21:58:56 +0000 (14:58 -0700)]
requirements: Update pynacl

This avoids having to compile it from source on
every image build.

Signed-off-by: Zack Cerza <zack@redhat.com>
3 years agoupdate-requirements.sh: Write intermediate file
Zack Cerza [Thu, 20 Jan 2022 21:58:48 +0000 (14:58 -0700)]
update-requirements.sh: Write intermediate file

The previous method was only writing the .txt file; when debugging
dependency issues you'll often want to see the intermediate. I don't
see a reason to not always write it.

Signed-off-by: Zack Cerza <zack@redhat.com>
3 years agoteuthology-update-inventory: Call Remote.connect() 1714/head
Zack Cerza [Fri, 14 Jan 2022 19:10:14 +0000 (12:10 -0700)]
teuthology-update-inventory: Call Remote.connect()

Signed-off-by: Zack Cerza <zack@redhat.com>
3 years agoRemote: Add is_container property 1712/head
Zack Cerza [Mon, 17 Jan 2022 19:19:53 +0000 (12:19 -0700)]
Remote: Add is_container property

Signed-off-by: Zack Cerza <zack@redhat.com>
3 years agotask.ansible: Don't direct task's logging to file 1713/head
Zack Cerza [Tue, 1 Feb 2022 21:21:39 +0000 (14:21 -0700)]
task.ansible: Don't direct task's logging to file

We only want output generated by ansible itself in ansible.log

Signed-off-by: Zack Cerza <zack@redhat.com>
3 years agoMerge pull request #1695 from kshtsk/wip-cluster-run-sync
kyr [Tue, 15 Feb 2022 18:47:57 +0000 (19:47 +0100)]
Merge pull request #1695 from kshtsk/wip-cluster-run-sync

orchestra/cluster: make run parallel and wait

3 years agoCluster: Add filter() function 1711/head
Zack Cerza [Mon, 17 Jan 2022 20:38:10 +0000 (13:38 -0700)]
Cluster: Add filter() function

Signed-off-by: Zack Cerza <zack@redhat.com>
3 years agoMerge pull request #1707 from kamoltat/wip-ksirivad-add-ansible-inventory-docker...
Kamoltat Sirivadhna [Mon, 7 Feb 2022 15:44:15 +0000 (10:44 -0500)]
Merge pull request #1707 from kamoltat/wip-ksirivad-add-ansible-inventory-docker-compose

docs/docker-compose: add ansible inventory to README
Reviewed by: Josh Durgin <jdurgin@redhat.com>

3 years agoMerge PR #1710 into master
Patrick Donnelly [Thu, 3 Feb 2022 14:57:11 +0000 (09:57 -0500)]
Merge PR #1710 into master

* refs/pull/1710/head:
teuthology/task/selinux: ignore sss_cache

Reviewed-by: Yuri Weinstein <yweins@redhat.com>
3 years agoteuthology/task/selinux: ignore sss_cache 1710/head
Patrick Donnelly [Thu, 3 Feb 2022 14:21:30 +0000 (09:21 -0500)]
teuthology/task/selinux: ignore sss_cache

Fixes: https://tracker.ceph.com/issues/54125
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
3 years agoMerge PR #1708 into master
Patrick Donnelly [Thu, 3 Feb 2022 14:18:14 +0000 (09:18 -0500)]
Merge PR #1708 into master

* refs/pull/1708/head:
opsys.py: Change default CentOS version to 8 Stream

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Casey Bodley <cbodley@redhat.com>
3 years agoMerge pull request #1709 from kshtsk/wip-auto-get-mt-for-kill
kyr [Tue, 1 Feb 2022 17:46:32 +0000 (18:46 +0100)]
Merge pull request #1709 from kshtsk/wip-auto-get-mt-for-kill

kill: get machine type from paddles for the run

3 years agokill: get machine type from paddles for the run 1709/head
Kyr Shatskyy [Tue, 1 Feb 2022 16:08:27 +0000 (17:08 +0100)]
kill: get machine type from paddles for the run

When kill is called for a run that is still in the beanstalkd
queue, no job directories have been created in the archive,
so there is no way to find out which machine type to use.
teuthology-kill then reports that you must manually provide
the machine type for the run, which is often boring and quite
inconvenient.

Instead of making the user recall the tube for the run name,
we just ask paddles if it has anything logged and use the
machine type from what it returns.

Signed-off-by: Kyr Shatskyy <kyrylo.shatskyy@suse.com>
3 years agoopsys.py: Change default CentOS version to 8 Stream 1708/head
David Galloway [Mon, 31 Jan 2022 19:26:27 +0000 (14:26 -0500)]
opsys.py: Change default CentOS version to 8 Stream

8.X is no more

Signed-off-by: David Galloway <dgallowa@redhat.com>
3 years agodocs/docker-compose: add ansible inventory to README 1707/head
Kamoltat [Tue, 25 Jan 2022 01:02:56 +0000 (01:02 +0000)]
docs/docker-compose: add ansible inventory to README

Added instructions that help users
set up ansible inventory files after they
have built their local teuthology container. This
part is needed when ansible tasks are executed
while jobs are running.

Signed-off-by: Kamoltat <ksirivad@redhat.com>
3 years agoMerge pull request #1706 from batrick/log-warning
Josh Durgin [Mon, 24 Jan 2022 16:01:48 +0000 (08:01 -0800)]
Merge pull request #1706 from batrick/log-warning

teuthology: convert deprecated method name

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Kyr Shatskyy <kyrylo.shatskyy@suse.com>
3 years agoteuthology: convert deprecated method name 1706/head
Patrick Donnelly [Thu, 20 Jan 2022 15:11:48 +0000 (10:11 -0500)]
teuthology: convert deprecated method name

Avoiding this warning:

    /home/runner/work/teuthology/teuthology/teuthology/task/install/__init__.py:285: DeprecationWarning: The 'warn' method is deprecated, use 'warning' instead

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
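For illustration, the non-deprecated spelling looks like this. The logger name and the helper `report_problem` are made up for the example and are not taken from the teuthology source.

```python
import logging

# Hypothetical logger name, used only for this example.
log = logging.getLogger("deprecation_example")


def report_problem(message):
    """Log at WARNING level using the supported method name."""
    # Logger.warning() is the documented method; the older `warn`
    # spelling is what triggers the DeprecationWarning quoted above.
    log.warning(message)
```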
3 years agoMerge pull request #1678 from ceph/wip-ksirivad-move-ansible-log
Zack Cerza [Thu, 20 Jan 2022 00:27:40 +0000 (17:27 -0700)]
Merge pull request #1678 from ceph/wip-ksirivad-move-ansible-log

task/ansible: move ansible logs to ansible.log

3 years agotask/ansible: move ansible logs to ansible.log 1678/head
Kamoltat Sirivadhna [Wed, 22 Sep 2021 19:26:01 +0000 (15:26 -0400)]
task/ansible: move ansible logs to ansible.log

Currently ansible tasks log to
teuthology.log. We want to separate this into
its own log file for better readability.

Signed-off-by: Kamoltat Sirivadhna <ksirivad@redhat.com>
3 years agoMerge pull request #1665 from ceph/wip-ksirivad-docker-compose-script
Zack Cerza [Fri, 14 Jan 2022 22:21:20 +0000 (15:21 -0700)]
Merge pull request #1665 from ceph/wip-ksirivad-docker-compose-script

Added dockerfile + docker-compose script dev setup

3 years agoRemoved beanstalkd in teuthology image 1665/head
Kamoltat Sirivadhna [Fri, 14 Jan 2022 22:02:47 +0000 (17:02 -0500)]
Removed beanstalkd in teuthology image

Signed-off-by: Kamoltat Sirivadhna <ksirivad@redhat.com>
3 years agoMoved Dockerfile
Kamoltat Sirivadhna [Fri, 14 Jan 2022 21:11:28 +0000 (16:11 -0500)]
Moved Dockerfile

Moved docker file to:

docs/docker-compose/Dockerfile

Signed-off-by: Kamoltat Sirivadhna <ksirivad@redhat.com>
3 years agoUpdated README file as we still use beanstalk
Kamoltat Sirivadhna [Fri, 14 Jan 2022 18:40:47 +0000 (13:40 -0500)]
Updated README file as we still use beanstalk

Update the README file, since we changed back to beanstalk
as the queue.

Signed-off-by: Kamoltat Sirivadhna <ksirivad@redhat.com>
3 years agoCompose teuthology master branch and use beanstalk
Kamoltat Sirivadhna [Fri, 14 Jan 2022 18:34:23 +0000 (13:34 -0500)]
Compose teuthology master branch and use beanstalk

Compose teuthology version that uses beanstalk.

Signed-off-by: Kamoltat Sirivadhna <ksirivad@redhat.com>
3 years agoMerge PR #1702 into master
Patrick Donnelly [Wed, 12 Jan 2022 17:02:02 +0000 (12:02 -0500)]
Merge PR #1702 into master

* refs/pull/1702/head:
teuthology/tasks/kernel: switch firmware branch name

Reviewed-by: Venky Shankar <vshankar@redhat.com>
3 years agoteuthology/tasks/kernel: switch firmware branch name 1702/head
Patrick Donnelly [Wed, 12 Jan 2022 14:52:55 +0000 (09:52 -0500)]
teuthology/tasks/kernel: switch firmware branch name

It is now "main".

Fixes: https://tracker.ceph.com/issues/53852
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
3 years agoMerge pull request #1701 from rakeshgm/vscode-to-gitignore
Vasu Kulkarni [Sat, 8 Jan 2022 17:28:46 +0000 (09:28 -0800)]
Merge pull request #1701 from rakeshgm/vscode-to-gitignore

gitignore: ignore .vscode dir

3 years agogitignore: ignore .vscode dir 1701/head
rakeshgm [Fri, 24 Dec 2021 14:04:42 +0000 (19:34 +0530)]
gitignore: ignore .vscode dir

Signed-off-by: rakeshgm <rakeshgm@redhat.com>
3 years agoMerge pull request #1687 from kamoltat/wip-mark-machine-down-reimage-error
Kamoltat Sirivadhna [Wed, 8 Dec 2021 17:39:37 +0000 (12:39 -0500)]
Merge pull request #1687 from kamoltat/wip-mark-machine-down-reimage-error

dispatcher/supervisor: Mark machine down if reimage error >= 10x
Reviewed-by: Kamoltat <ksirivad@redhat.com>
3 years agoMerge pull request #1699 from kshtsk/wip-strip-left-not
Josh Durgin [Tue, 30 Nov 2021 02:31:37 +0000 (18:31 -0800)]
Merge pull request #1699 from kshtsk/wip-strip-left-not

misc/sh: do not strip output from left

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
3 years agoMerge pull request #1697 from kshtsk/wip-beanstalk-yaml-load
Josh Durgin [Tue, 30 Nov 2021 02:27:38 +0000 (18:27 -0800)]
Merge pull request #1697 from kshtsk/wip-beanstalk-yaml-load

teuthology/beanstalk: workaround yaml load deprecation

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
3 years agoMerge pull request #1696 from sebastian-philipp/matrix-print-item
Josh Durgin [Tue, 30 Nov 2021 01:26:17 +0000 (17:26 -0800)]
Merge pull request #1696 from sebastian-philipp/matrix-print-item

teuthology/suite/matrix.py: Add item to assert msg

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
3 years agodispatcher/supervisor: Mark machine down if reimage error >= 10x 1687/head
Kamoltat [Fri, 15 Oct 2021 14:36:14 +0000 (14:36 +0000)]
dispatcher/supervisor: Mark machine down if reimage error >= 10x

Introduce a new feature where we mark the machine down
if we find more than 10 consecutive reimaging failures
when performing teuthology tests.

Signed-off-by: Kamoltat <ksirivad@redhat.com>
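The rule can be sketched roughly as follows. This is an assumption-laden illustration, not the dispatcher/supervisor code: the function name, the `history` representation, and the use of `>=` (taken from the commit subject rather than the body) are all hypothetical.

```python
# Hypothetical threshold; the commit subject says ">= 10x".
REIMAGE_FAILURE_LIMIT = 10


def should_mark_down(history, limit=REIMAGE_FAILURE_LIMIT):
    """Decide whether a machine should be marked down.

    history: reimage outcomes, oldest first; True means success.
    Only the most recent unbroken run of failures is counted.
    """
    consecutive_failures = 0
    for succeeded in reversed(history):
        if succeeded:
            break
        consecutive_failures += 1
    return consecutive_failures >= limit
```

Counting only the trailing run means a single successful reimage resets the tally.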
3 years agoAdded more stuff to documentation
Kamoltat Sirivadhna [Thu, 16 Sep 2021 14:44:19 +0000 (10:44 -0400)]
Added more stuff to documentation

Signed-off-by: Kamoltat Sirivadhna <ksirivad@redhat.com>
3 years agoInitialize README files + changes reflecting Zack's review
Kamoltat Sirivadhna [Wed, 15 Sep 2021 19:33:05 +0000 (15:33 -0400)]
Initialize README files + changes reflecting Zack's review

Signed-off-by: Kamoltat Sirivadhna <ksirivad@redhat.com>
3 years agoAdded dockerfile + docker-compose script dev setup
Kamoltat Sirivadhna [Wed, 4 Aug 2021 19:00:09 +0000 (15:00 -0400)]
Added dockerfile + docker-compose script dev setup

Developers can now use start.sh to build the images
and set up postgresql, paddles, pulpito and teuthology
for development.

This PR is also pending for:
https://github.com/ceph/teuthology/pull/1650
https://github.com/ceph/paddles/pull/94

to be merged, as currently we use these branches
as images for paddles and pulpito.

Signed-off-by: Kamoltat Sirivadhna <ksirivad@redhat.com>
3 years agoMerge pull request #1700 from dmick/wip-log-record-asctime
Zack Cerza [Tue, 23 Nov 2021 18:09:42 +0000 (11:09 -0700)]
Merge pull request #1700 from dmick/wip-log-record-asctime

Stop using LogRecord.asctime (not stable); move to pytest 6.2.5

3 years agorequirements.txt: update pytest 1700/head
Dan Mick [Tue, 23 Nov 2021 01:23:42 +0000 (17:23 -0800)]
requirements.txt: update pytest

This requires the previous commit dropping use of LogRecord.asctime

Signed-off-by: Dan Mick <dmick@redhat.com>
3 years agotest_misc.py: fix bad assumption about LogRecord fields
Dan Mick [Tue, 16 Nov 2021 01:37:23 +0000 (17:37 -0800)]
test_misc.py: fix bad assumption about LogRecord fields

The test was using LogRecord's asctime attribute to calculate a time
difference between two log entries.  Although the attribute is documented
with no caveat, others have run into the problem that it does not exist
on logging.LogRecord unless a formatter with a format string referencing
{asctime} has been used.  Since there's a 'created' time that's more
appropriate for this test anyway, use that instead.

This commit enables updating pytest, because pytest's logging init
code has changed: https://github.com/pytest-dev/pytest/discussions/9324

Signed-off-by: Dan Mick <dmick@redhat.com>
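A small standalone demonstration of the behavior described in this commit, using only the standard library. The record contents are placeholders; this is not the test from test_misc.py.

```python
import logging

# Two records built by hand; name, path and messages are placeholders.
record_a = logging.LogRecord(
    "example", logging.INFO, "example.py", 1, "first", None, None)
record_b = logging.LogRecord(
    "example", logging.INFO, "example.py", 2, "second", None, None)

# 'asctime' is absent until a formatter that references it has run.
has_asctime_before = hasattr(record_a, "asctime")

logging.Formatter("%(asctime)s %(message)s").format(record_a)
has_asctime_after = hasattr(record_a, "asctime")

# 'created' is always set at construction time (seconds since the
# epoch), so it can be used for time differences directly.
elapsed = record_b.created - record_a.created
```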
3 years agoteuthology/beanstalk: workaround yaml load deprecation 1697/head
Kyr Shatskyy [Wed, 17 Nov 2021 21:49:30 +0000 (22:49 +0100)]
teuthology/beanstalk: workaround yaml load deprecation

There is a deprecation warning when the beanstalk parser is called:

  YAMLLoadWarning: calling yaml.load() without Loader=... is deprecated, as the default Loader is unsafe

Since beanstalkc3 does not want to apply a fix for that,
we can override the yaml parser to suppress the message
in the log.

Fixes: https://tracker.ceph.com/issues/44225
Signed-off-by: Kyr Shatskyy <kyrylo.shatskyy@suse.com>
3 years agomisc/sh: do not strip output from left 1699/head
Kyr Shatskyy [Thu, 14 Oct 2021 14:40:15 +0000 (16:40 +0200)]
misc/sh: do not strip output from left

We want to see the output of commands with its formatting intact,
for example when the output is yaml data

Signed-off-by: Kyr Shatskyy <kyrylo.shatskyy@suse.com>
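To illustrate why stripping from the left hurts, here is a sketch with a hypothetical helper `clean_output`. It is not the actual `sh` implementation in teuthology/misc.

```python
def clean_output(raw):
    """Trim trailing whitespace only, keeping leading indentation."""
    return raw.rstrip()


# Indented yaml-like output: the first line's indentation matters.
raw = "  roles:\n    - mon.a\n"

kept = clean_output(raw)   # "  roles:\n    - mon.a"
mangled = raw.strip()      # "roles:\n    - mon.a" (first line shifted)
```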
3 years agoteuthology/suite/matrix.py: Add item to assert msg 1696/head
Sebastian Wagner [Tue, 16 Nov 2021 13:08:56 +0000 (14:08 +0100)]
teuthology/suite/matrix.py: Add item to assert msg

Give folks at least some hint about where things might
be broken:

```
AssertionError: Sum requires non-empty _submats: 0-distro
```

Signed-off-by: Sebastian Wagner <sewagner@redhat.com>
3 years agoorchestra/cluster: drop teuthology.misc 1695/head
Kyr Shatskyy [Sat, 13 Nov 2021 12:07:20 +0000 (13:07 +0100)]
orchestra/cluster: drop teuthology.misc

Use sudo_write_file and write_file directly from remote.

Signed-off-by: Kyr Shatskyy <kyrylo.shatskyy@suse.com>
3 years agoorchestra/cluster: run parallel and wait
Kyr Shatskyy [Sat, 13 Nov 2021 11:47:28 +0000 (12:47 +0100)]
orchestra/cluster: run parallel and wait

Make cluster.run able to execute command in parallel on all
nodes and wait for completion.

In order to run a command in parallel on all nodes of the
cluster, instead of combination:

  run.wait(cluster.run(wait=False, args=...))

this patch makes it possible to do the same just by:

  cluster.run(parallel=True, args=...)

The wait argument defaults to True in this case.

The behavior of the method is unchanged
when wait is set to False (as in the first example),
so it should be backward compatible.

Signed-off-by: Kyr Shatskyy <kyrylo.shatskyy@suse.com>
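The general pattern of "start the command on every node, then wait for all of them" can be sketched with the standard library. `FakeRemote` and `run_on_all` are hypothetical stand-ins; this is not how `cluster.run` is implemented.

```python
from concurrent.futures import ThreadPoolExecutor


class FakeRemote:
    """Hypothetical stand-in for a remote node."""

    def __init__(self, name):
        self.name = name

    def run(self, args):
        # A real remote would execute the command over SSH.
        return f"{self.name}: {' '.join(args)}"


def run_on_all(remotes, args, parallel=False):
    """Run args on every remote and return results in node order."""
    if not parallel:
        return [remote.run(args) for remote in remotes]
    with ThreadPoolExecutor(max_workers=max(1, len(remotes))) as pool:
        futures = [pool.submit(remote.run, args) for remote in remotes]
        # result() blocks, so this waits for every node to finish.
        return [future.result() for future in futures]
```

Collecting results from the futures in submission order keeps the output aligned with the node list even though the commands finish in any order.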
3 years agoMerge pull request #1692 from ceph/parallel-kernel
Sage Weil [Mon, 15 Nov 2021 18:18:46 +0000 (13:18 -0500)]
Merge pull request #1692 from ceph/parallel-kernel

task/{kernel,clock}: parallelize per-role work

3 years agotask/kernel: parallelize per-role work 1692/head
Sage Weil [Mon, 8 Nov 2021 19:39:00 +0000 (13:39 -0600)]
task/kernel: parallelize per-role work

Signed-off-by: Sage Weil <sage@newdream.net>
3 years agotask/clock: parallelize
Sage Weil [Wed, 10 Nov 2021 22:22:44 +0000 (16:22 -0600)]
task/clock: parallelize

Signed-off-by: Sage Weil <sage@newdream.net>
3 years agotasks/internal/syslog: gather journalctl -b0 gather-journalctl-b0 1694/head
Sage Weil [Thu, 11 Nov 2021 17:28:22 +0000 (11:28 -0600)]
tasks/internal/syslog: gather journalctl -b0

Signed-off-by: Sage Weil <sage@newdream.net>
3 years agoMerge pull request #1693 from dmick/wip-build-complete
Dan Mick [Thu, 11 Nov 2021 19:50:12 +0000 (11:50 -0800)]
Merge pull request #1693 from dmick/wip-build-complete

teuthology/packaging.py: fix build_complete: search for requested arch

3 years agoMerge pull request #1690 from ceph/debug-coredump
Sage Weil [Wed, 10 Nov 2021 17:28:51 +0000 (12:28 -0500)]
Merge pull request #1690 from ceph/debug-coredump

task/internal/__init__: print core file output before splitting

3 years agoteuthology/packaging.py: fix build_complete: search for requested arch 1693/head
Dan Mick [Tue, 9 Nov 2021 19:49:09 +0000 (11:49 -0800)]
teuthology/packaging.py: fix build_complete: search for requested arch

The workaround from https://github.com/ceph/teuthology/pull/1649 was
necessary because my original algorithm was faulty: when searching
through all the builds for a ref/sha1, one must match the arch
requested by the call to build_complete (in the Builder object);
that arch's presence in the shaman api/search result is not enough
of a match, as it can contain multiple arches in multiple states
of build success.  Only a failure *on the requested arch* should be
considered a "requested build not complete".

(note: this will still currently fail a request for a build whose
repo is complete but container build failed, as "build complete"
currently conflates those two statuses.  Teuthology does not
contain the information whether a build is being requested for
packages, containers, or both.)

Also add testing for build_complete().

Fixes: https://tracker.ceph.com/issues/53205
Signed-off-by: Dan Mick <dmick@redhat.com>
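The matching rule described above might look like the following sketch. The result layout (a list of dicts with `arch` and `status` keys) and the status value `"ready"` are assumptions made for the example; they are not confirmed by this log and may differ from the real shaman API and from `build_complete()` in teuthology/packaging.py.

```python
def requested_build_complete(search_results, requested_arch):
    """Return True only if the requested arch has a finished build.

    search_results: hypothetical list of dicts, one per arch build,
    e.g. {"arch": "x86_64", "status": "ready"}.
    """
    for build in search_results:
        if build["arch"] != requested_arch:
            # Other arches may be in any state; they do not matter.
            continue
        if build["status"] == "ready":
            return True
    return False
```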
3 years agoMerge pull request #1691 from ljflores/wip-installation
Neha Ojha [Thu, 4 Nov 2021 18:29:50 +0000 (11:29 -0700)]
Merge pull request #1691 from ljflores/wip-installation

docs: improve instructions for installing teuthology

Reviewed-by: Kamoltat <ksirivad@redhat.com>
Reviewed-by: Aishwarya Mathuria <amathuri@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
3 years agodocs: improve instructions for installing teuthology 1691/head
Laura Flores [Tue, 2 Nov 2021 15:40:26 +0000 (15:40 +0000)]
docs: improve instructions for installing teuthology

This only applies to the "Ubuntu, Fedora, & Suse/Opensuse" section.

Signed-off-by: Laura Flores <lflores@redhat.com>
3 years agotask/internal/__init__: print core file output before splitting 1690/head
Sage Weil [Thu, 28 Oct 2021 13:58:39 +0000 (08:58 -0500)]
task/internal/__init__: print core file output before splitting

Debugging this failure:

Traceback (most recent call last):
  File "/home/teuthworker/src/git.ceph.com_git_teuthology_c56135d151713269e811ede3163c9743c2e269de/teuthology/run_tasks.py", line 176, in run_tasks
    suppress = manager.__exit__(*exc_info)
  File "/usr/lib/python3.6/contextlib.py", line 88, in __exit__
    next(self.gen)
  File "/home/teuthworker/src/git.ceph.com_git_teuthology_c56135d151713269e811ede3163c9743c2e269de/teuthology/task/internal/__init__.py", line 398, in archive
    fetch_binaries_for_coredumps(path, rem)
  File "/home/teuthworker/src/git.ceph.com_git_teuthology_c56135d151713269e811ede3163c9743c2e269de/teuthology/task/internal/__init__.py", line 320, in fetch_binaries_for_coredumps
    dump_program = dump_out.split("from '")[1].split(' ')[0]
IndexError: list index out of range

...on output that should look like this:

./remote/smithi084/coredump/1635398181.133353.core: ELF 64-bit LSB core file x86-64, version 1 (SYSV), SVR4-style, from '/bin/podman stop ceph-462d7c58-37ab-11ec-8c28-001a4aab830c-node-exporter-smithi', real uid: 0, effective uid: 0, real gid: 0, effective gid: 0, execfn: '/bin/podman', platform: 'x86_64'

Signed-off-by: Sage Weil <sage@newdream.net>
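As a sketch of why the split can raise IndexError and how a parser could fail softly instead, consider the helper below. `program_from_file_output` is hypothetical and is not the fix applied in this commit, which only prints the output before splitting.

```python
import re


def program_from_file_output(dump_out):
    """Extract the program path from `file` output for a core dump.

    Returns None when the "from '...'" marker is missing, instead of
    raising IndexError like split("from '")[1] does.
    """
    match = re.search(r"from '([^' ]+)", dump_out)
    return match.group(1) if match else None


# Shortened form of the expected output quoted in the commit message.
sample = (
    "./remote/smithi084/coredump/1635398181.133353.core: ELF 64-bit LSB "
    "core file x86-64, version 1 (SYSV), SVR4-style, from '/bin/podman "
    "stop ceph-node-exporter', real uid: 0"
)
program = program_from_file_output(sample)
```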
3 years agoMerge pull request #1689 from ceph/supervisor-unlock-safety-fix
David Galloway [Wed, 20 Oct 2021 20:36:50 +0000 (16:36 -0400)]
Merge pull request #1689 from ceph/supervisor-unlock-safety-fix

supervisor: Add missing continue statement

3 years agosupervisor: Add missing continue statement 1689/head
Zack Cerza [Wed, 20 Oct 2021 20:34:39 +0000 (14:34 -0600)]
supervisor: Add missing continue statement

I missed this in #1688 😭

Signed-off-by: Zack Cerza <zack@redhat.com>
3 years agoMerge pull request #1688 from ceph/supervisor-unlock-safety
Zack Cerza [Wed, 20 Oct 2021 20:29:41 +0000 (14:29 -0600)]
Merge pull request #1688 from ceph/supervisor-unlock-safety

supervisor: Don't unlock nodes w/ bad description

3 years agosupervisor: Don't unlock nodes w/ bad description 1688/head
Zack Cerza [Wed, 20 Oct 2021 18:51:30 +0000 (12:51 -0600)]
supervisor: Don't unlock nodes w/ bad description

Very rarely, we enter a situation where nodes get used by two jobs
simultaneously. We can break this cycle if jobs refuse to unlock a
node that is locked by a different job.

This will not entirely prevent the problem, but it will keep it from
perpetuating itself.

Signed-off-by: Zack Cerza <zack@redhat.com>
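The safety check can be sketched as below. The function name and the mapping of node name to lock description are hypothetical; the supervisor's real data structures are not shown in this log.

```python
def nodes_safe_to_unlock(locked_nodes, job_description):
    """Return the nodes this job may unlock.

    locked_nodes: hypothetical mapping of node name to the description
    recorded when the node was locked.
    """
    safe = []
    for name, description in locked_nodes.items():
        if description != job_description:
            # Locked by a different job: leave it alone and move on.
            continue
        safe.append(name)
    return safe
```

The `continue` is what keeps a mismatched node out of the unlock list; without it the check would have no effect.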
3 years agoMerge PR #1685 into master
Patrick Donnelly [Fri, 15 Oct 2021 13:51:28 +0000 (09:51 -0400)]
Merge PR #1685 into master

* refs/pull/1685/head:
qa: remove extra debug message

Reviewed-by: Sage Weil <sage@redhat.com>
3 years agoMerge PR #1686 into master
Patrick Donnelly [Fri, 15 Oct 2021 13:50:41 +0000 (09:50 -0400)]
Merge PR #1686 into master

* refs/pull/1686/head:
tasks/kernel: fix missing arg to install_kernel

Reviewed-by: Sage Weil <sage@redhat.com>
3 years agotasks/kernel: fix missing arg to install_kernel i52944 1686/head
Patrick Donnelly [Fri, 15 Oct 2021 02:39:11 +0000 (22:39 -0400)]
tasks/kernel: fix missing arg to install_kernel

Fixes: https://tracker.ceph.com/issues/52944
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
3 years agoqa: remove extra debug message 1685/head
Patrick Donnelly [Fri, 15 Oct 2021 02:29:46 +0000 (22:29 -0400)]
qa: remove extra debug message

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
3 years agoMerge PR #1682 into master
Patrick Donnelly [Thu, 14 Oct 2021 13:26:52 +0000 (09:26 -0400)]
Merge PR #1682 into master

* refs/pull/1682/head:
nuke: ignore internal directory listing when scanning for stale mounts in debugfs directory

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
3 years agoMerge pull request #1668 from ideepika/wip-flavor
Ilya Dryomov [Wed, 13 Oct 2021 15:52:50 +0000 (17:52 +0200)]
Merge pull request #1668 from ideepika/wip-flavor

suite: kernel_flavor >> flavor

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
Reviewed-by: Kyr Shatskyy <kyrylo.shatskyy@suse.com>
3 years agoremove gitbuilder tests 1668/head
Deepika Upadhyay [Thu, 16 Sep 2021 12:49:43 +0000 (18:19 +0530)]
remove gitbuilder tests

gitbuilder has been replaced by the shaman project and is no longer used.
We already do the same functional testing with TestShamanProject,
so there is no need to maintain tests for obsolete code.

Signed-off-by: Deepika Upadhyay <dupadhya@redhat.com>
3 years agos/basic/default we no longer need mapping of basic to default
Deepika Upadhyay [Mon, 13 Sep 2021 20:59:07 +0000 (02:29 +0530)]
s/basic/default we no longer need mapping of basic to default

Finally, we are substituting basic with default when trying to do the URI search,
so we can remove this mangling and use the default flavor directly.
We no longer use the flavor 'basic' for determining the kernel flavor.

Signed-off-by: Deepika Upadhyay <dupadhya@redhat.com>
3 years agosuite: kernel_flavor >> flavor
Deepika Upadhyay [Fri, 23 Jul 2021 14:21:49 +0000 (14:21 +0000)]
suite: kernel_flavor >> flavor

since addressing comment:
FIXME: ceph flavor and kernel flavor are separate things
remove basic -> default(flavor) mapping & update s/basic/default in docs

Signed-off-by: Deepika Upadhyay <dupadhya@redhat.com>
3 years agosuite: add flavor in base build_config
Deepika Upadhyay [Fri, 23 Jul 2021 14:13:05 +0000 (14:13 +0000)]
suite: add flavor in base build_config

build flavor remains common for all jobs and is something that is
good to look into while our jobs are being scheduled

This will be useful when changing from the default flavor
to one of the available ones (crimson/jaeger) by using the --flavor arg

teuthology/suite/test: update unit tests to test build flavor

Signed-off-by: Deepika Upadhyay <dupadhya@redhat.com>
3 years agosuite: update available flavor for teuthology-suite
Deepika Upadhyay [Fri, 23 Jul 2021 14:09:36 +0000 (14:09 +0000)]
suite: update available flavor for teuthology-suite

we now have new build flavors crimson and jaeger, update them in docs

Signed-off-by: Deepika Upadhyay <dupadhya@redhat.com>
3 years agoMerge pull request #1683 from ceph/kernel-hwe
Sage Weil [Fri, 8 Oct 2021 15:51:20 +0000 (10:51 -0500)]
Merge pull request #1683 from ceph/kernel-hwe

tasks/kernel: add 'hwe' config flag for ubuntu distro hwe kernel

3 years agotasks/kernel: add 'hwe' config flag for ubuntu distro hwe kernel 1683/head
Sage Weil [Thu, 7 Oct 2021 15:03:22 +0000 (10:03 -0500)]
tasks/kernel: add 'hwe' config flag for ubuntu distro hwe kernel

The hwe kernel supports nvme_loop, but the non-hwe kernel does not.

I don't want to futz with the 'distro' moniker (although that is another
valid approach) because only some tests need hwe, and I can imagine a
situation where we want to run tests on both kernels.  Note that this
flag doesn't rule out adding support for something like
'-k distro-hwe' later.

Signed-off-by: Sage Weil <sage@newdream.net>
3 years agonuke: ignore internal directory listing when scanning for stale mounts in debugfs... 1682/head
Venky Shankar [Wed, 6 Oct 2021 11:12:59 +0000 (07:12 -0400)]
nuke: ignore internal directory listing when scanning for stale mounts in debugfs directory

kclient patchset:

    https://patchwork.kernel.org/project/ceph-devel/list/?series=556049

introduces a meta directory to add debugging entries. This directory needs to be
filtered out when scanning the ceph debugfs directory.

Signed-off-by: Venky Shankar <vshankar@redhat.com>
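A rough sketch of the filtering step follows. The directory name `"meta"` is inferred from the wording above, and the function name and sample entries are made up; the actual nuke code may differ.

```python
# Assumed name of the internal directory added by the kclient patchset.
IGNORED_ENTRIES = ("meta",)


def stale_mount_candidates(entries, ignored=IGNORED_ENTRIES):
    """Drop internal debugfs entries before checking for stale mounts.

    entries: names listed in the ceph debugfs directory.
    """
    return [entry for entry in entries if entry not in ignored]
```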
4 years agoMerge pull request #1673 from kamoltat/wip-ksirivad-fix-dic-update
Zack Cerza [Fri, 24 Sep 2021 20:23:24 +0000 (14:23 -0600)]
Merge pull request #1673 from kamoltat/wip-ksirivad-fix-dic-update

Fix dictionary update in try_push_job_info()

4 years agoMerge pull request #1675 from amathuria/wip-amathuri-fix-unlock
Zack Cerza [Mon, 20 Sep 2021 17:20:16 +0000 (11:20 -0600)]
Merge pull request #1675 from amathuria/wip-amathuri-fix-unlock

lock/ops.py: Fix retry mechanism for unlock_one

4 years agolock/ops.py: Fix retry mechanism for unlock_one 1675/head
Aishwarya Mathuria [Wed, 15 Sep 2021 18:10:57 +0000 (23:40 +0530)]
lock/ops.py: Fix retry mechanism for unlock_one

In unlock_one, we currently have a retry mechanism that is only triggered on a particular exception. With this change, we retry the request to unlock no matter what the cause of failure.

Fixes: https://tracker.ceph.com/issues/50921
Signed-off-by: Aishwarya Mathuria <amathuri@redhat.com>
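The difference between retrying on one exception type and retrying on any failure can be sketched generically. `retry_any_failure` and `flaky_unlock` are hypothetical; this is not the code in lock/ops.py.

```python
import time


def retry_any_failure(operation, tries=3, delay=0.0):
    """Call operation until it succeeds or tries are exhausted."""
    last_error = None
    for _ in range(tries):
        try:
            return operation()
        except Exception as exc:  # any cause of failure triggers a retry
            last_error = exc
            time.sleep(delay)
    raise last_error


attempts = []


def flaky_unlock():
    """Fail twice with different errors, then succeed."""
    attempts.append(1)
    if len(attempts) == 1:
        raise ConnectionError("network hiccup")
    if len(attempts) == 2:
        raise ValueError("unexpected response")
    return True


result = retry_any_failure(flaky_unlock, tries=5)
```

A retry keyed to `ConnectionError` alone would have given up on the second attempt here.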
4 years agoFix dictionary update in try_push_job_info() 1673/head
Kamoltat [Mon, 13 Sep 2021 21:14:41 +0000 (21:14 +0000)]
Fix dictionary update in try_push_job_info()

the function `try_push_job_info()` is not
updating `job_info` dictionary properly since
we want to update `job_info` with `extra_info`,
however, in lines 498 and 499 we are assigning
`job_info` to a copy of `extra_info` and updating
`job_info` with `job_config` which is incorrect.
Instead, we should assign `job_info` with
a copy of `job_config` and update `job_info` with
`extra_info`
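The corrected order of operations can be shown in isolation. The wrapper function `merge_job_info` and the sample values are hypothetical; only the copy-then-update order comes from the description above.

```python
def merge_job_info(job_config, extra_info):
    """Start from job_config, then overlay extra_info on top."""
    job_info = job_config.copy()   # leave the caller's dict untouched
    job_info.update(extra_info)    # extra_info wins on shared keys
    return job_info


job_config = {"name": "example-run", "status": "queued"}
extra_info = {"status": "running"}
merged = merge_job_info(job_config, extra_info)
```

With the two dictionaries swapped, the stale `status` from `job_config` would overwrite the newer value in `extra_info`.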

4 years agoMerge pull request #1677 from ceph/no-fudge
Zack Cerza [Thu, 16 Sep 2021 22:38:21 +0000 (16:38 -0600)]
Merge pull request #1677 from ceph/no-fudge

tests: Drop fudge

4 years agotests: Drop fudge 1677/head
Zack Cerza [Thu, 16 Sep 2021 19:08:05 +0000 (13:08 -0600)]
tests: Drop fudge

Use mock instead.

Signed-off-by: Zack Cerza <zack@redhat.com>
4 years agoMerge pull request #1674 from ceph/wip-51944
Josh Durgin [Thu, 16 Sep 2021 00:37:24 +0000 (17:37 -0700)]
Merge pull request #1674 from ceph/wip-51944

supervisor: To preserve logs, delay nuking

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: David Galloway <dgallowa@redhat.com>
4 years agoMerge pull request #1676 from ceph/gh-actions
Zack Cerza [Wed, 15 Sep 2021 23:22:34 +0000 (17:22 -0600)]
Merge pull request #1676 from ceph/gh-actions

Add GitHub Actions definition for CI

4 years agotest_worker: Don't access the network! 1676/head
Zack Cerza [Wed, 15 Sep 2021 21:23:34 +0000 (15:23 -0600)]
test_worker: Don't access the network!

Signed-off-by: Zack Cerza <zack@redhat.com>
4 years agoAdd GitHub Actions definition for CI
Zack Cerza [Wed, 15 Sep 2021 20:21:31 +0000 (14:21 -0600)]
Add GitHub Actions definition for CI

Signed-off-by: Zack Cerza <zack@redhat.com>
4 years agosupervisor: To preserve logs, delay nuking 1674/head
Zack Cerza [Wed, 15 Sep 2021 17:46:33 +0000 (11:46 -0600)]
supervisor: To preserve logs, delay nuking

The previous behavior was causing machines to get nuked before any
attempt to fetch logs. If a machine took longer than 60s to become
available, collecting logs would fail. Since we also nuke after this
step, don't bother here.

Fixes: https://tracker.ceph.com/issues/51944
Signed-off-by: Zack Cerza <zack@redhat.com>