git.apps.os.sepia.ceph.com Git - teuthology.git/log
Dan Mick [Fri, 16 Jun 2023 22:48:04 +0000 (15:48 -0700)]
Merge pull request #1856 from ceph/fog-debug
fog: Add more debug logging
Zack Cerza [Fri, 16 Jun 2023 16:24:29 +0000 (10:24 -0600)]
FOG._wait_for_ready(): Use instance logger
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Fri, 16 Jun 2023 16:23:42 +0000 (10:23 -0600)]
dispatcher/supervisor: Set root logger level
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Wed, 14 Jun 2023 20:53:36 +0000 (14:53 -0600)]
fog: Add more debug logging
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Wed, 14 Jun 2023 15:33:28 +0000 (09:33 -0600)]
Merge pull request #1854 from ceph/bootstrap-c9s
Zack Cerza [Tue, 13 Jun 2023 19:08:40 +0000 (13:08 -0600)]
Merge pull request #1853 from ceph/reimage-no-ctx
Zack Cerza [Mon, 12 Jun 2023 21:48:34 +0000 (15:48 -0600)]
bootstrap: Tolerate a missing lsb_release
This fixes the lack of support for CentOS 9.Stream
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Mon, 12 Jun 2023 21:37:56 +0000 (15:37 -0600)]
provision: Avoid a possible AttributeError
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Wed, 7 Jun 2023 23:26:37 +0000 (17:26 -0600)]
Merge pull request #1846 from ceph/stdin-killer
Patrick Donnelly [Thu, 18 May 2023 13:24:57 +0000 (09:24 -0400)]
teuthology/task/install: add stdin-killer helper
This helper tool runs commands which may or may not take data on stdin.
Like "daemon-helper", if stdin signals EOF, stdin-killer will kill the
command but only as a last resort. It forwards EOF to the command by
closing the command's stdin (pipe) and then waiting a configurable
amount of time for the command to gracefully exit.
Additionally, if stdout or stderr are hung up -- i.e. the ssh parent
process has terminated -- then stdin-killer also detects this and
initiates the graceful shutdown of the command. This is something
daemon-helper does not do.
In general, this tool is a superior replacement for the daemon-helper
tool because you can write to the command's stdin normally.
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
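The shutdown sequence described in this message (forward EOF by closing the command's stdin, wait a configurable time, kill only as a last resort) can be sketched roughly as below. This is an illustrative Python sketch, not the stdin-killer source; the function name `forward_eof_then_wait` and its `grace_seconds` parameter are hypothetical, and the hang-up detection on stdout/stderr is not covered.

```python
import subprocess
import sys


def forward_eof_then_wait(cmd, grace_seconds=5.0):
    """Run cmd, signal EOF on its stdin, and kill it only if it lingers.

    Hypothetical helper for illustration. Returns (returncode, was_killed).
    """
    proc = subprocess.Popen(cmd, stdin=subprocess.PIPE)
    # Closing our end of the pipe is how EOF reaches the command.
    proc.stdin.close()
    try:
        # Give the command a chance to exit gracefully.
        proc.wait(timeout=grace_seconds)
        return proc.returncode, False
    except subprocess.TimeoutExpired:
        # Last resort: the command ignored EOF for the whole grace period.
        proc.kill()
        proc.wait()
        return proc.returncode, True


if __name__ == "__main__":
    reader = [sys.executable, "-c", "import sys; sys.stdin.read()"]
    print(forward_eof_then_wait(reader))
```

A command that reads stdin until EOF exits on its own here, while one that never looks at stdin is killed once the grace period runs out.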
Patrick Donnelly [Thu, 18 May 2023 13:20:57 +0000 (09:20 -0400)]
setup.cfg: install binary helpers
These are used by vstart_runner.py for local dev operations. Install
them so they are available in the virtualenv bin directory.
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Patrick Donnelly [Wed, 17 May 2023 18:32:19 +0000 (14:32 -0400)]
teuthology/task/install: reorganize binary helpers
We intend to install these so move them into an appropriately named
directory.
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Zack Cerza [Wed, 31 May 2023 22:12:50 +0000 (16:12 -0600)]
Merge pull request #1803 from jdurgin/wip-configurable-timeouts
Zack Cerza [Wed, 31 May 2023 20:00:35 +0000 (14:00 -0600)]
Merge pull request #1851 from ceph/reimage-failures
Zack Cerza [Thu, 18 May 2023 00:12:13 +0000 (18:12 -0600)]
fog: Verify reimaged machine OS
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Fri, 26 May 2023 18:23:34 +0000 (12:23 -0600)]
Merge pull request #1850 from ceph/unmask-unlock-response
Dan Mick [Wed, 24 May 2023 21:48:00 +0000 (14:48 -0700)]
Merge pull request #1849 from ceph/prom-reimage-results
exporter: Instrument node reimaging success/fail
Zack Cerza [Wed, 24 May 2023 17:53:19 +0000 (11:53 -0600)]
lock.ops.unlock_one: Fail sooner on 403, with msg
In the case of e.g. owners values not matching on an unlock attempt, we
were exhausting all retries and failing to display the exact reason for
the unlock failure. We can simply break on 403 errors and let the rest
of the function do its thing.
Signed-off-by: Zack Cerza <zack@redhat.com>
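A rough sketch of the idea in this message: stop retrying as soon as the server answers 403, so the reason is reported instead of being buried under exhausted retries. The names `unlock_with_retries` and `send_request` are hypothetical and are not taken from lock.ops; the real function talks to the lock server rather than to a callable.

```python
def unlock_with_retries(send_request, tries=5):
    """Call send_request() until it succeeds, returns 403, or tries run out.

    send_request is a hypothetical callable returning (status_code, body).
    Returns (success, attempts_made, last_body).
    """
    attempt = 0
    body = ""
    for attempt in range(1, tries + 1):
        status, body = send_request()
        if status == 200:
            return True, attempt, body
        if status == 403:
            # A permission error will not change on retry; stop here so the
            # caller can show the server's message.
            break
    return False, attempt, body


if __name__ == "__main__":
    print(unlock_with_retries(lambda: (403, "owner mismatch")))
```

With this shape, a 403 costs one request, while other failures still use the full retry budget.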
Josh Durgin [Tue, 20 Dec 2022 18:53:20 +0000 (18:53 +0000)]
fog: make timeouts configurable
This will help with the sepia lab, being able to increase these
temporarily to handle a new fog server that is sometimes exceeding the
hardcoded timeouts.
Signed-off-by: Josh Durgin <jdurgin@redhat.com>
Josh Durgin [Tue, 20 Dec 2022 18:50:42 +0000 (18:50 +0000)]
contextutil: allow safe_while to use an explicit timeout
Signed-off-by: Josh Durgin <jdurgin@redhat.com>
Zack Cerza [Tue, 23 May 2023 23:43:58 +0000 (17:43 -0600)]
nuke: Fix an import issue
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Tue, 28 Mar 2023 22:22:33 +0000 (16:22 -0600)]
lock.ops.reimage_machines: Drop incorrect log msg
This message was being logged when the reimage started, not finished.
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Tue, 23 May 2023 19:53:23 +0000 (13:53 -0600)]
exporter: Instrument node reimaging success/fail
Signed-off-by: Zack Cerza <zack@redhat.com>
Dan Mick [Tue, 23 May 2023 00:39:23 +0000 (17:39 -0700)]
Merge pull request #1847 from ceph/fog-timeout
fog: Increase timeout in wait_for_deploy_task()
Dan Mick [Tue, 23 May 2023 00:15:25 +0000 (17:15 -0700)]
test_fog: Fix up test for preceding commit "Increase timeout"
Signed-off-by: Dan Mick <dmick@redhat.com>
Zack Cerza [Mon, 22 May 2023 23:59:36 +0000 (17:59 -0600)]
fog: Increase timeout in wait_for_deploy_task()
When too many reimaging ops are running concurrently, we're seeing
timeouts. This isn't a true fix, but should help things until we've got
one.
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Mon, 22 May 2023 20:45:58 +0000 (14:45 -0600)]
Merge pull request #1843 from dmick/wip-kernel-sort
Casey Bodley [Fri, 19 May 2023 16:21:52 +0000 (12:21 -0400)]
Merge pull request #1844 from cbodley/wip-install-copr
task: install supports enable_coprs array
Reviewed-by: Ken Dreyer <kdreyer@redhat.com>
Reviewed-by: Zack Cerza <zcerza@redhat.com>
Casey Bodley [Wed, 17 May 2023 20:21:37 +0000 (16:21 -0400)]
task: install supports enable_coprs array
enable the installation of packages in fedora copr repositories
Signed-off-by: Casey Bodley <cbodley@redhat.com>
Dan Mick [Wed, 17 May 2023 00:05:15 +0000 (17:05 -0700)]
task/kernel.py: sort installed kernels by version
rpm -q --last sorts by timestamp-of-install, which does not
necessarily correlate with "latest version". sort -rV does.
Signed-off-by: Dan Mick <dmick@redhat.com>
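To illustrate why a version-aware sort matters, here is a small Python sketch that compares plain string ordering with an ordering that treats digit runs as numbers. It only approximates what `sort -V` does, the package names are made up for the example, and `version_key` is a hypothetical helper rather than anything in task/kernel.py.

```python
import re


def version_key(name):
    """Split a name into alternating text and integer chunks.

    re.split with a capturing group always yields text at even positions
    and digit runs at odd positions, so keys compare position by position
    without mixing str and int.
    """
    return [int(chunk) if chunk.isdigit() else chunk
            for chunk in re.split(r"(\d+)", name)]


# Made-up kernel package names, for illustration only.
kernels = [
    "kernel-5.14.0-9.el9",
    "kernel-5.14.0-162.el9",
    "kernel-5.14.0-70.el9",
]

newest_by_string = sorted(kernels, reverse=True)[0]
newest_by_version = sorted(kernels, key=version_key, reverse=True)[0]

if __name__ == "__main__":
    print(newest_by_string)
    print(newest_by_version)
```

Plain string ordering picks the `-9` build because "9" sorts after "1" and "7", whereas the numeric key picks the `-162` build.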
kyr [Wed, 26 Apr 2023 18:49:18 +0000 (20:49 +0200)]
Merge pull request #1804 from ceph/dependabot/pip/wheel-0.38.1
build(deps): bump wheel from 0.36.2 to 0.38.1
dependabot[bot] [Wed, 26 Apr 2023 18:00:51 +0000 (18:00 +0000)]
build(deps): bump wheel from 0.36.2 to 0.38.1
Bumps [wheel](https://github.com/pypa/wheel) from 0.36.2 to 0.38.1.
- [Release notes](https://github.com/pypa/wheel/releases)
- [Changelog](https://github.com/pypa/wheel/blob/main/docs/news.rst)
- [Commits](https://github.com/pypa/wheel/compare/0.36.2...0.38.1)
---
updated-dependencies:
- dependency-name: wheel
  dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Kamoltat Sirivadhna [Wed, 5 Apr 2023 14:05:41 +0000 (10:05 -0400)]
Merge pull request #1831 from kamoltat/wip-ksirivad-rerun-readme
teuthology-suite: --seed & --subset now also stored in teuthology.log, config.yaml and orig.config.yaml
Reviewed-by: Zack Cerza <zcerza@redhat.com>
Dan Mick [Tue, 4 Apr 2023 01:57:20 +0000 (18:57 -0700)]
Merge pull request #1834 from ceph/disp_ls_remote
worker.prep_job: Skip job if ls_remote fails
Zack Cerza [Mon, 3 Apr 2023 22:07:37 +0000 (16:07 -0600)]
worker.prep_job: Skip job if ls_remote fails
This is preferable to letting the dispatcher die.
Signed-off-by: Zack Cerza <zack@redhat.com>
Kamoltat [Mon, 3 Apr 2023 18:35:01 +0000 (18:35 +0000)]
suite/run.py: Added seed and subset to base_config
In addition to being stored in results.log
`--seed` and `--subset` are now also stored in:
`teuthology.log`, `config.yaml` and `orig.config.yaml`.
Fixes: https://tracker.ceph.com/issues/59300
Signed-off-by: Kamoltat <ksirivad@redhat.com>
Kamoltat [Thu, 30 Mar 2023 17:12:17 +0000 (17:12 +0000)]
teuthology-suite: Log errors & warnings if results.log is missing during --rerun
Notify the user if `results.log` is missing when
they issue a rerun.
Also edited the teuthology-suite doc
to inform the user that `--rerun`, by
default, parses `--seed`, `--subset` and
`--no-nested-subset` from `results.log`.
Fixes: https://tracker.ceph.com/issues/59300
Signed-off-by: Kamoltat <ksirivad@redhat.com>
Zack Cerza [Fri, 31 Mar 2023 22:08:58 +0000 (16:08 -0600)]
Merge pull request #1832 from ceph/deps
Zack Cerza [Fri, 31 Mar 2023 19:54:32 +0000 (13:54 -0600)]
Merge pull request #1827 from ceph/tox-no-osp
Zack Cerza [Thu, 23 Mar 2023 19:13:31 +0000 (13:13 -0600)]
Drop argparse as a requirement
It's part of the standard library.
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Thu, 23 Mar 2023 19:12:43 +0000 (13:12 -0600)]
requirements.txt: Add prometheus_client
This should have been added a couple PRs back.
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Fri, 31 Mar 2023 17:03:51 +0000 (11:03 -0600)]
Merge pull request #1833 from ceph/gha-ubuntu-version
Zack Cerza [Thu, 30 Mar 2023 22:26:45 +0000 (16:26 -0600)]
bootstrap: apt-get update before installing
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Thu, 30 Mar 2023 22:09:18 +0000 (16:09 -0600)]
.github: Explicitly define test matrix
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Thu, 30 Mar 2023 16:43:05 +0000 (10:43 -0600)]
Merge pull request #1824 from batrick/parallel-gzip
Patrick Donnelly [Wed, 22 Mar 2023 14:48:43 +0000 (10:48 -0400)]
teuthology: do not compress tarballs when pulling dir
Where we use this, it's for pulling log files that are already
compressed. Do not waste time double compressing!
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Patrick Donnelly [Tue, 21 Mar 2023 14:37:25 +0000 (10:37 -0400)]
teuthology/misc: give verbose gzip output
For future analysis.
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Patrick Donnelly [Tue, 21 Mar 2023 14:36:36 +0000 (10:36 -0400)]
teuthology/misc: use medium compression
To speed things up.
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Patrick Donnelly [Tue, 21 Mar 2023 14:36:09 +0000 (10:36 -0400)]
teuthology/misc: parallelize gzip
Our machines have lots of cores, use them!
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
Zack Cerza [Thu, 23 Mar 2023 18:43:50 +0000 (12:43 -0600)]
Merge pull request #1826 from ceph/job-time
Zack Cerza [Thu, 23 Mar 2023 18:42:12 +0000 (12:42 -0600)]
tox: Don't run openstack by default
It's quite time-consuming, and we're not sure if it's in use at all.
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Thu, 23 Mar 2023 18:22:15 +0000 (12:22 -0600)]
supervisor: Do not instrument certain job times
This should only really include first/last-in-suite jobs.
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Wed, 22 Mar 2023 22:15:24 +0000 (16:15 -0600)]
Merge pull request #1819 from ceph/timing
Zack Cerza [Wed, 22 Mar 2023 15:59:20 +0000 (09:59 -0600)]
task.internal: Fix import of get_builder_project
This is another problematic circular import.
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Fri, 17 Mar 2023 01:20:22 +0000 (19:20 -0600)]
task.kernel: Make task() a context manager
This will cause it to be instrumented for timing via the bits in
run_tasks.
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Thu, 16 Mar 2023 17:28:12 +0000 (11:28 -0600)]
exporter: Make JobResults a singleton
This should avoid "Duplicated timeseries in CollectorRegistry" errors.
Signed-off-by: Zack Cerza <zack@redhat.com>
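A minimal sketch of why a singleton helps here: if creating the object registers a metric, creating it twice registers the same name twice. The example uses a tiny stand-in registry instead of the real Prometheus client so that it stays self-contained; `TinyRegistry`, `REGISTRY` and `JobResultsSketch` are hypothetical names and do not reflect the exporter's actual code.

```python
class TinyRegistry:
    """Stand-in for a metrics registry that rejects duplicate names."""

    def __init__(self):
        self.names = set()

    def register(self, name):
        if name in self.names:
            raise ValueError(f"Duplicated timeseries: {name}")
        self.names.add(name)


REGISTRY = TinyRegistry()


class JobResultsSketch:
    """Hypothetical singleton: the metric is registered only once."""

    _instance = None

    def __new__(cls):
        if cls._instance is None:
            instance = super().__new__(cls)
            # Registration happens only when the first instance is built.
            REGISTRY.register("job_results")
            cls._instance = instance
        return cls._instance


if __name__ == "__main__":
    first = JobResultsSketch()
    second = JobResultsSketch()
    print(first is second)
```

Every caller gets the same object back, so the registration step cannot run a second time.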
Zack Cerza [Thu, 9 Mar 2023 21:43:57 +0000 (14:43 -0700)]
docs: Add new document for teuthology-exporter
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Thu, 9 Mar 2023 19:40:15 +0000 (12:40 -0700)]
supervisor: Add instrumentation for reimaging time
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Thu, 9 Mar 2023 19:22:53 +0000 (12:22 -0700)]
repo_utils: Add instrumentation for bootstrap time
And fix a couple of linter errors.
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Thu, 9 Mar 2023 18:42:06 +0000 (11:42 -0700)]
run_tasks: Add instrumentation for task time
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Thu, 9 Mar 2023 18:38:04 +0000 (11:38 -0700)]
supervisor: Add instrumentation for job time
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Thu, 9 Mar 2023 18:28:02 +0000 (11:28 -0700)]
dispatcher: Add instrumentation for locking time
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Tue, 21 Mar 2023 22:21:40 +0000 (16:21 -0600)]
Merge pull request #1825 from ceph/cl-import
Zack Cerza [Tue, 21 Mar 2023 17:45:16 +0000 (11:45 -0600)]
nuke: Fix import of task.internal.check_lock
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Tue, 21 Mar 2023 16:52:34 +0000 (10:52 -0600)]
Merge pull request #1823 from ceph/fix-kill
Zack Cerza [Mon, 20 Mar 2023 19:30:50 +0000 (13:30 -0600)]
task.cephmetrics: Fix broken import
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Wed, 15 Mar 2023 21:54:28 +0000 (15:54 -0600)]
Fix more circular imports
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Mon, 20 Mar 2023 19:28:48 +0000 (13:28 -0600)]
Test all module imports
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Wed, 15 Mar 2023 21:28:32 +0000 (15:28 -0600)]
Restore some import statements
Some of the circular import fixes were overkill.
Signed-off-by: Zack Cerza <zack@redhat.com>
Dan Mick [Thu, 16 Mar 2023 21:17:16 +0000 (14:17 -0700)]
Merge pull request #1822 from ceph/fix-kill
kill.kill_job: Tweak job status reporting
Zack Cerza [Thu, 16 Mar 2023 16:44:09 +0000 (10:44 -0600)]
kill.kill_job: Tweak job status reporting
Signed-off-by: Zack Cerza <zack@redhat.com>
(cherry picked from commit 45117a8781ce6438f22140c00d05435c7781648b)
Zack Cerza [Wed, 8 Mar 2023 23:05:55 +0000 (16:05 -0700)]
Merge pull request #1817 from ceph/exporter
Zack Cerza [Tue, 7 Mar 2023 20:54:01 +0000 (13:54 -0700)]
Fix a circular import
This was exposed during development of the exporter.
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Tue, 7 Mar 2023 22:09:46 +0000 (15:09 -0700)]
setup.cfg: Set flake8 max line length to 100
It's what black uses.
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Tue, 7 Mar 2023 18:48:22 +0000 (11:48 -0700)]
teuthology-exporter: Report job status
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Tue, 21 Feb 2023 20:34:06 +0000 (13:34 -0700)]
Add teuthology-exporter, a Prometheus exporter
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Tue, 21 Feb 2023 23:33:02 +0000 (16:33 -0700)]
Add active_machine_types config item
I'm adding this so that the exporter can know which machine types to
care about.
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Fri, 3 Mar 2023 18:00:40 +0000 (11:00 -0700)]
Merge pull request #1818 from ceph/fix-nop
Zack Cerza [Fri, 3 Mar 2023 17:39:17 +0000 (10:39 -0700)]
task.ansible: Fix an IndexError w/ 0 remotes
Signed-off-by: Zack Cerza <zack@redhat.com>
kyr [Tue, 28 Feb 2023 21:03:21 +0000 (22:03 +0100)]
Merge pull request #1815 from kshtsk/wip-sle15sp3
orchestra/opsys: add sle 15 sp5
Zack Cerza [Tue, 28 Feb 2023 17:10:33 +0000 (10:10 -0700)]
Merge pull request #1816 from ceph/paddles-fault-tolerance
Zack Cerza [Tue, 28 Feb 2023 00:47:33 +0000 (17:47 -0700)]
supervisor: Ignore some reporting failures
Specifically, the pings coming from the watchdog loop
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Tue, 28 Feb 2023 00:39:25 +0000 (17:39 -0700)]
dispatcher: Do not time out when locking machines
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Tue, 28 Feb 2023 00:31:07 +0000 (17:31 -0700)]
safe_while: Retry forever when tries == -1
Signed-off-by: Zack Cerza <zack@redhat.com>
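The "retry forever when tries == -1" convention can be sketched with an iterator of attempt numbers that is unbounded for -1 and bounded otherwise. This is an illustration of the convention only; `attempt_numbers` is a hypothetical helper and not how safe_while is written.

```python
import itertools


def attempt_numbers(tries):
    """Return an iterator over attempt numbers 1, 2, 3, ...

    tries == -1 means no upper bound; any other value stops after
    that many attempts.
    """
    if tries == -1:
        return itertools.count(1)
    return iter(range(1, tries + 1))


if __name__ == "__main__":
    print(list(attempt_numbers(3)))
    # Take only the first five attempts of the unbounded case.
    print(list(itertools.islice(attempt_numbers(-1), 5)))
```

A caller looping over the unbounded iterator has to break out on success, since the loop will never end by itself.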
Zack Cerza [Thu, 23 Feb 2023 21:12:54 +0000 (14:12 -0700)]
Merge pull request #1812 from ceph/orch-ssh-key
Kyr Shatskyy [Thu, 16 Feb 2023 23:47:37 +0000 (00:47 +0100)]
orchestra/opsys: add sle 15 sp5
Signed-off-by: Kyr Shatskyy <kyrylo.shatskyy@suse.com>
NitzanMordhai [Tue, 14 Feb 2023 05:51:08 +0000 (07:51 +0200)]
Merge pull request #1814 from NitzanMordhai/wip-nitzan-scrapy-deadjobs-nonetype-match
scrapy: comparing dead job skip typeNone
Zack Cerza [Fri, 10 Feb 2023 21:24:42 +0000 (14:24 -0700)]
Merge pull request #1793 from cybozu/fix-docker-compose-postgres
Specify PostgreSQL v14 to avoid permission errors.
NitzanMordhai [Thu, 9 Feb 2023 08:42:08 +0000 (08:42 +0000)]
scrapy: comparing dead job skip typeNone
SequenceMatcher can't get NoneType when comparing 2 dead jobs,
adding check and compare for those jobs.
Fixes: https://tracker.ceph.com/issues/58264
Signed-off-by: Nitzan Mordechai <nmordec@redhat.com>
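A small sketch of the kind of guard this message describes: `difflib.SequenceMatcher` cannot compare `None`, so missing values are handled before it is called. The function name `similarity` and the choice to score two missing values as identical are assumptions made for the example, not the behavior of the scrape code.

```python
from difflib import SequenceMatcher


def similarity(reason_a, reason_b):
    """Return a 0.0-1.0 similarity score, tolerating None on either side.

    Hypothetical helper: two missing reasons count as identical, and one
    missing reason counts as no match at all.
    """
    if reason_a is None or reason_b is None:
        return 1.0 if reason_a is reason_b else 0.0
    return SequenceMatcher(None, reason_a, reason_b).ratio()


if __name__ == "__main__":
    print(similarity(None, None))
    print(similarity(None, "timed out"))
    print(similarity("timed out", "timed out"))
```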
Zack Cerza [Wed, 1 Feb 2023 19:43:43 +0000 (12:43 -0700)]
orchestra.connection: Look for 'ssh_key' in config
On systems where SSH is configured using ~/.ssh/config.d/, we don't have
a great way of locating and processing the full configuration, so we
might not find the correct key to use. Allow the user to specify one key
to use for all test nodes if they wish.
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Thu, 2 Feb 2023 19:36:59 +0000 (12:36 -0700)]
Merge pull request #1811 from ceph/wip-58491
misc.get_system_type: Use Remote.os
Zack Cerza [Wed, 1 Feb 2023 17:49:48 +0000 (10:49 -0700)]
misc.get_system_type: Use Remote.os
... instead of lsb_release, as newer distros don't ship it.
Fixes: #58491
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Mon, 30 Jan 2023 22:48:56 +0000 (15:48 -0700)]
Merge pull request #1810 from ceph/selinux-denial-fix
task/selinux: Fix regressed grepping of audit logs
Zack Cerza [Mon, 30 Jan 2023 19:46:59 +0000 (12:46 -0700)]
task/selinux: Fix regressed grepping of audit logs
Fixes: https://tracker.ceph.com/issues/58610
Introduced: a106217
Signed-off-by: Zack Cerza <zack@redhat.com>
Dan Mick [Fri, 27 Jan 2023 01:40:55 +0000 (17:40 -0800)]
Merge pull request #1805 from ceph/el-py39
bootstrap: Correct python3 package names for EL8/9
kyr [Wed, 25 Jan 2023 20:12:16 +0000 (21:12 +0100)]
Merge pull request #1807 from kshtsk/wip-update-leap
orchestra/opsys: add newer sle and leap versions
Kyr Shatskyy [Tue, 24 Jan 2023 12:31:37 +0000 (13:31 +0100)]
orchestra/opsys: add newer sle and leap versions
Signed-off-by: Kyr Shatskyy <kyrylo.shatskyy@suse.com>
Zack Cerza [Tue, 17 Jan 2023 23:39:27 +0000 (16:39 -0700)]
bootstrap: Correct python3 package names for EL8/9
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Wed, 25 Jan 2023 19:09:02 +0000 (12:09 -0700)]
Merge pull request #1806 from ceph/fix-pytest
tests: Fix warnings, deprecations, and broken tests
Dan Mick [Tue, 24 Jan 2023 01:22:30 +0000 (17:22 -0800)]
Merge pull request #1802 from ceph/faster-newest
suite: Speed up 'newest' feature