]> git.apps.os.sepia.ceph.com Git - teuthology.git/log
teuthology.git
2 months agoAdd jq and curl in docker-compose Dockerfile 2050/head
deepssin [Thu, 26 Jun 2025 11:12:41 +0000 (07:12 -0400)]
Add jq and curl in docker-compose Dockerfile

Signed-off-by: deepssin <deepssin@redhat.com>
3 months agoRevive teuthology-openstack
deepssin [Mon, 16 Jun 2025 13:55:21 +0000 (09:55 -0400)]
Revive teuthology-openstack

Signed-off-by: deepssin <deepssin@redhat.com>
5 months agoMerge pull request #2039 from ceph/dependabot/pip/jinja2-3.1.6
kyr [Thu, 3 Apr 2025 10:50:04 +0000 (12:50 +0200)]
Merge pull request #2039 from ceph/dependabot/pip/jinja2-3.1.6

build(deps): bump jinja2 from 3.1.5 to 3.1.6

5 months agoMerge pull request #2035 from VallariAg/ci-integration-logs-archive
Zack Cerza [Wed, 2 Apr 2025 19:55:55 +0000 (13:55 -0600)]
Merge pull request #2035 from VallariAg/ci-integration-logs-archive

ci: save integration logs artifacts

5 months agoci: save intergation logs artifacts 2035/head
Vallari Agrawal [Fri, 7 Mar 2025 06:39:32 +0000 (12:09 +0530)]
ci: save intergation logs artifacts

Mount volume for archive_dir and then
upload it's logfiles as GA artifacts,
so we can use download-artifacts action
to download logs.

Signed-off-by: Vallari Agrawal <val.agl002@gmail.com>
6 months agobuild(deps): bump jinja2 from 3.1.5 to 3.1.6 2039/head
dependabot[bot] [Fri, 21 Mar 2025 18:20:29 +0000 (18:20 +0000)]
build(deps): bump jinja2 from 3.1.5 to 3.1.6

Bumps [jinja2](https://github.com/pallets/jinja) from 3.1.5 to 3.1.6.
- [Release notes](https://github.com/pallets/jinja/releases)
- [Changelog](https://github.com/pallets/jinja/blob/main/CHANGES.rst)
- [Commits](https://github.com/pallets/jinja/compare/3.1.5...3.1.6)

---
updated-dependencies:
- dependency-name: jinja2
  dependency-type: direct:production
...

Signed-off-by: dependabot[bot] <support@github.com>
6 months agoMerge pull request #2038 from ceph/actions-pinning
David Galloway [Fri, 21 Mar 2025 18:19:25 +0000 (14:19 -0400)]
Merge pull request #2038 from ceph/actions-pinning

workflows/dev_container: Pin docker actions

6 months agoworkflows/dev_container: Pin docker actions 2038/head
Zack Cerza [Fri, 21 Mar 2025 18:02:06 +0000 (12:02 -0600)]
workflows/dev_container: Pin docker actions

These are the most recent releases for these actions.

Signed-off-by: Zack Cerza <zack@cerza.org>
6 months agoMerge pull request #2026 from badone/main
kyr [Fri, 21 Mar 2025 10:52:03 +0000 (11:52 +0100)]
Merge pull request #2026 from badone/main

task/internal/__init__.py:  Revisit core program extraction regex

6 months agotask/internal/__init__.py: Revisit core program extraction regex 2026/head
Brad Hubbard [Tue, 18 Feb 2025 03:56:18 +0000 (13:56 +1000)]
task/internal/__init__.py:  Revisit core program extraction regex

Fixes: https://tracker.ceph.com/issues/61586
Signed-off-by: Brad Hubbard <bhubbard@redhat.com>
6 months agoMerge pull request #1773 from batrick/1772-revert
Ilya Dryomov [Fri, 7 Mar 2025 17:12:37 +0000 (18:12 +0100)]
Merge pull request #1773 from batrick/1772-revert

Revert "Merge PR #1772 into master"

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
Reviewed-by: Kyr Shatskyy <kyrylo.shatskyy@gmail.com>
6 months agoMerge pull request #2034 from VallariAg/fix-locked-grace-period
Zack Cerza [Thu, 6 Mar 2025 16:25:52 +0000 (09:25 -0700)]
Merge pull request #2034 from VallariAg/fix-locked-grace-period

lock/query.py: add grace_time param to node_active_job()

6 months agolock/query.py: add grace_time param to node_active_job() 2034/head
Vallari Agrawal [Thu, 6 Mar 2025 13:38:29 +0000 (19:08 +0530)]
lock/query.py: add grace_time param to node_active_job()

This is because we only want this grace period
when looking for stale jobs. Not when jobs are
being unlocked by the supervisor by unlock_one_safe().

Fixes: https://github.com/ceph/teuthology/pull/2033#issuecomment-2703733380
Signed-off-by: Vallari Agrawal <val.agl002@gmail.com>
6 months agoMerge pull request #2033 from ceph/node-cleanup-grace
Zack Cerza [Wed, 5 Mar 2025 16:50:39 +0000 (09:50 -0700)]
Merge pull request #2033 from ceph/node-cleanup-grace

node-cleanup: Grace period for inactive jobs

6 months agonode-cleanup: Grace period for inactive jobs 2033/head
Zack Cerza [Tue, 4 Mar 2025 23:37:53 +0000 (16:37 -0700)]
node-cleanup: Grace period for inactive jobs

Once a job is marked finished, the supervisor may still be waiting to unlock its
nodes. Give jobs five minutes to clean up nodes before we consider them "stale".

Signed-off-by: Zack Cerza <zack@cerza.org>
6 months agonode-cleanup: Use plain log format
Zack Cerza [Tue, 4 Mar 2025 23:09:30 +0000 (16:09 -0700)]
node-cleanup: Use plain log format

Signed-off-by: Zack Cerza <zack@cerza.org>
6 months agonode-cleanup: Provide link to job
Zack Cerza [Tue, 4 Mar 2025 21:43:59 +0000 (14:43 -0700)]
node-cleanup: Provide link to job

Instead of just the raw description.

Signed-off-by: Zack Cerza <zack@cerza.org>
6 months agoRevert "Merge PR #1772 into master" 1773/head
Patrick Donnelly [Thu, 23 Jun 2022 14:18:47 +0000 (10:18 -0400)]
Revert "Merge PR #1772 into master"

This reverts commit 082ae7ef4302fa54665ed0a2535e8e254118dcfd, reversing
changes made to 31a356e0c737f1781079045bb8dc366982be8713.

The original ea170935d4b1c78dc6fd6beae6b3fda65b296f57 removed a method that had
been moved to ceph.git but that broke upgrade tests from releases up to Octopus
where get_valgrind_args had not been backported. So a revert was done until all
upgrade paths have the method.

Now that Quincy is EOL, we can revert the revert.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
6 months agoMerge pull request #2030 from jmundack/add_codeowners
Zack Cerza [Thu, 27 Feb 2025 21:09:29 +0000 (14:09 -0700)]
Merge pull request #2030 from jmundack/add_codeowners

Add CODEOWNERS file

6 months agoAdd CODEOWNERS file 2030/head
Joseph Mundackal [Thu, 27 Feb 2025 00:15:10 +0000 (19:15 -0500)]
Add CODEOWNERS file

6 months agoMerge pull request #2020 from ceph/dependabot/pip/virtualenv-20.26.6
kyr [Thu, 27 Feb 2025 00:13:31 +0000 (01:13 +0100)]
Merge pull request #2020 from ceph/dependabot/pip/virtualenv-20.26.6

build(deps): bump virtualenv from 20.25.0 to 20.26.6

6 months agoMerge pull request #2028 from kshtsk/wip-docs-laptop-dispatcher
Zack Cerza [Wed, 26 Feb 2025 21:12:56 +0000 (14:12 -0700)]
Merge pull request #2028 from kshtsk/wip-docs-laptop-dispatcher

docs/laptop: replace workers with dispatcher

6 months agoMerge pull request #2027 from kshtsk/wip-almarocky
Zack Cerza [Wed, 26 Feb 2025 21:09:59 +0000 (14:09 -0700)]
Merge pull request #2027 from kshtsk/wip-almarocky

opsys: add AlmaLinux and RockyLinux and Tumbleweed to distro codename map

6 months agoorchestra/opsys: add tumbleweed version to the distro version map 2027/head
Kyrylo Shatskyy [Wed, 26 Feb 2025 18:42:05 +0000 (19:42 +0100)]
orchestra/opsys: add tumbleweed version to the distro version map

openSUSE Tumbleweed is a rolling release, so it has fixed version number
and changable version id is represented by a date of snapshot,
example of vm image:

openSUSE-Tumbleweed-Minimal-VM.x86_64-1.0.0-Cloud-Snapshot20250224.qcow2

In comparison, openSUSE Leap has regular release, incremental version, for example:
15.0, 15.1, ..., 15.6, etc., and recently introduced 16.0 alpha.

Because there is no sense to stick to the date, it is changing almost daily,
it is suggested to fix Tumbleweed version to 1.0 to distinguish from Leap.
As a side effect, once Teuthology locks a node with imaged Tumbleweed
it updates os version correctly to date of release in paddles, and
pulpito display it interface correspondingly as the date.

Signed-off-by: Kyrylo Shatskyy <kyrylo.shatskyy@clyso.com>
6 months agoorchestra/opsys: add 'alma' and 'rocky' to default os map
Kyr Shatskyy [Wed, 19 Feb 2025 22:50:39 +0000 (23:50 +0100)]
orchestra/opsys: add 'alma' and 'rocky' to default os map

Signed-off-by: Kyr Shatskyy <kyrylo.shatskyy@clyso.com>
6 months agoorchestra/opsys: add almalinux versions
Kyr Shatskyy [Tue, 11 Feb 2025 14:47:56 +0000 (15:47 +0100)]
orchestra/opsys: add almalinux versions

Signed-off-by: Kyr Shatskyy <kyrylo.shatskyy@clyso.com>
6 months agoorchestra/opsys: add 'rocky' to DISTRO_CODENAME_MAP
Kyr Shatskyy [Wed, 5 Feb 2025 17:28:32 +0000 (18:28 +0100)]
orchestra/opsys: add 'rocky' to DISTRO_CODENAME_MAP

Signed-off-by: Kyr Shatskyy <kyrylo.shatskyy@clyso.com>
6 months agodocs/laptop: replace workers with dispatcher 2028/head
Kyr Shatskyy [Tue, 25 Feb 2025 15:59:16 +0000 (16:59 +0100)]
docs/laptop: replace workers with dispatcher

Signed-off-by: Kyr Shatskyy <kyrylo.shatskyy@gmail.com>
7 months agoMerge pull request #2025 from ceph/nuke-unlock
Zack Cerza [Thu, 13 Feb 2025 21:57:30 +0000 (14:57 -0700)]
Merge pull request #2025 from ceph/nuke-unlock

supervisor: Unlock nodes after reimage failure

7 months agonode_active_job: Improve http error handling 2025/head
Zack Cerza [Mon, 3 Feb 2025 21:17:32 +0000 (14:17 -0700)]
node_active_job: Improve http error handling

Signed-off-by: Zack Cerza <zack1@ibm.com>
7 months agonode_cleanup: Log owner when query fails
Zack Cerza [Mon, 3 Feb 2025 21:12:10 +0000 (14:12 -0700)]
node_cleanup: Log owner when query fails

Signed-off-by: Zack Cerza <zack1@ibm.com>
7 months agosupervisor: Unlock nodes after reimage failure
Zack Cerza [Mon, 3 Feb 2025 21:06:57 +0000 (14:06 -0700)]
supervisor: Unlock nodes after reimage failure

Signed-off-by: Zack Cerza <zack1@ibm.com>
7 months agonode_cleanup: Log a summary of what was done
Zack Cerza [Mon, 3 Feb 2025 21:04:11 +0000 (14:04 -0700)]
node_cleanup: Log a summary of what was done

Signed-off-by: Zack Cerza <zack1@ibm.com>
7 months agoMerge pull request #2024 from kshtsk/wip-64452
Zack Cerza [Thu, 30 Jan 2025 22:28:35 +0000 (15:28 -0700)]
Merge pull request #2024 from kshtsk/wip-64452

7 months agoMerge pull request #1984 from kshtsk/wip-laptop-dispatcher
Zack Cerza [Thu, 30 Jan 2025 22:27:13 +0000 (15:27 -0700)]
Merge pull request #1984 from kshtsk/wip-laptop-dispatcher

7 months agoorchestra/run: better handle weird args 2024/head
Kyr Shatskyy [Thu, 30 Jan 2025 11:18:24 +0000 (12:18 +0100)]
orchestra/run: better handle weird args

If one of the command args in remote.run is None then there is exception
occurs while trying to qoute argument list:

   TypeError: expected string or bytes-like object

This fix addresses the issue and gives user better idea what is
happening, and how to fix the error.

Fixes: https://tracker.ceph.com/issues/64452
Signed-off-by: Kyr Shatskyy <kyrylo.shatskyy@clyso.com>
7 months agoMerge pull request #2022 from kshtsk/wip-stuck-on-vm-create-fail
kyr [Thu, 30 Jan 2025 12:42:01 +0000 (13:42 +0100)]
Merge pull request #2022 from kshtsk/wip-stuck-on-vm-create-fail

teuthology lock is stuck if vm create failed

7 months agodocs/laptop: update localhost setup instructions 1984/head
Kyr Shatskyy [Mon, 29 Jul 2024 11:47:32 +0000 (13:47 +0200)]
docs/laptop: update localhost setup instructions

Signed-off-by: Kyr Shatskyy <kyrylo.shatskyy@clyso.com>
7 months agoprovision/downburst: log multiline stderr on error 2022/head
Kyr Shatskyy [Fri, 10 Jan 2025 14:15:21 +0000 (15:15 +0100)]
provision/downburst: log multiline stderr on error

For better logs readability just log downburst's stderr
line-by-line.

Signed-off-by: Kyr Shatskyy <kyrylo.shatskyy@clyso.com>
7 months agolock/ops: don't misguide user while waiting for vms up
Kyr Shatskyy [Thu, 16 Jan 2025 17:35:52 +0000 (18:35 +0100)]
lock/ops: don't misguide user while waiting for vms up

The message "virtual machine is still unavailable" is mistakenly
shown to user, when vms are actually available.

Signed-off-by: Kyr Shatskyy <kyrylo.shatskyy@clyso.com>
7 months agolock/cli: don't update inventory if failed to create a vm
Kyr Shatskyy [Fri, 10 Jan 2025 14:09:24 +0000 (15:09 +0100)]
lock/cli: don't update inventory if failed to create a vm

If we were not able to create a vm we don't need to update machine's
inventory, because there is going to be an ssh connection established
to the host and teuthology gets stuck infinitely trying to connect to
the machine which does not exist.

Signed-off-by: Kyr Shatskyy <kyrylo.shatskyy@clyso.com>
7 months agoMerge pull request #2014 from ceph/wip-nitzan-add-timout-param-stop-type
Zack Cerza [Mon, 27 Jan 2025 18:52:35 +0000 (11:52 -0700)]
Merge pull request #2014 from ceph/wip-nitzan-add-timout-param-stop-type

7 months agoMerge pull request #2021 from kshtsk/wip-job-archive-in-supervisor
Zack Cerza [Thu, 23 Jan 2025 19:38:46 +0000 (12:38 -0700)]
Merge pull request #2021 from kshtsk/wip-job-archive-in-supervisor

7 months agodispatcher/supervisor: use job config only to unlock targets 2021/head
Kyr Shatskyy [Sat, 18 Jan 2025 00:24:51 +0000 (01:24 +0100)]
dispatcher/supervisor: use job config only to unlock targets

No need to use serializer, because everything needed to unlock
machines already present in job config.

Signed-off-by: Kyr Shatskyy <kyrylo.shatskyy@clyso.com>
8 months agobuild(deps): bump virtualenv from 20.25.0 to 20.26.6 2020/head
dependabot[bot] [Mon, 13 Jan 2025 18:19:41 +0000 (18:19 +0000)]
build(deps): bump virtualenv from 20.25.0 to 20.26.6

Bumps [virtualenv](https://github.com/pypa/virtualenv) from 20.25.0 to 20.26.6.
- [Release notes](https://github.com/pypa/virtualenv/releases)
- [Changelog](https://github.com/pypa/virtualenv/blob/main/docs/changelog.rst)
- [Commits](https://github.com/pypa/virtualenv/compare/20.25.0...20.26.6)

---
updated-dependencies:
- dependency-name: virtualenv
  dependency-type: direct:production
...

Signed-off-by: dependabot[bot] <support@github.com>
8 months agoMerge pull request #2019 from ceph/dependabot/pip/jinja2-3.1.5
kyr [Fri, 10 Jan 2025 23:13:24 +0000 (00:13 +0100)]
Merge pull request #2019 from ceph/dependabot/pip/jinja2-3.1.5

build(deps): bump jinja2 from 3.1.4 to 3.1.5

8 months agoMerge pull request #2017 from kshtsk/wip-drop-future-print
kyr [Fri, 10 Jan 2025 09:30:38 +0000 (10:30 +0100)]
Merge pull request #2017 from kshtsk/wip-drop-future-print

teuthology: drop __future__ import for print

8 months agobuild(deps): bump jinja2 from 3.1.4 to 3.1.5 2019/head
dependabot[bot] [Wed, 8 Jan 2025 20:50:52 +0000 (20:50 +0000)]
build(deps): bump jinja2 from 3.1.4 to 3.1.5

Bumps [jinja2](https://github.com/pallets/jinja) from 3.1.4 to 3.1.5.
- [Release notes](https://github.com/pallets/jinja/releases)
- [Changelog](https://github.com/pallets/jinja/blob/main/CHANGES.rst)
- [Commits](https://github.com/pallets/jinja/compare/3.1.4...3.1.5)

---
updated-dependencies:
- dependency-name: jinja2
  dependency-type: direct:production
...

Signed-off-by: dependabot[bot] <support@github.com>
8 months agoMerge pull request #2010 from chrisphoffman/wip-dd-syncseek
kyr [Tue, 7 Jan 2025 13:22:21 +0000 (14:22 +0100)]
Merge pull request #2010 from chrisphoffman/wip-dd-syncseek

teuthology: Add support for seek and sync in write_file

8 months agoMerge pull request #1998 from phlogistonjohn/jjm-config-alt-path
kyr [Tue, 7 Jan 2025 13:22:00 +0000 (14:22 +0100)]
Merge pull request #1998 from phlogistonjohn/jjm-config-alt-path

config: allow reading teuthology config from env var location

8 months agoMerge pull request #2018 from ceph/devcontainer-no-pat
Zack Cerza [Mon, 6 Jan 2025 23:11:43 +0000 (16:11 -0700)]
Merge pull request #2018 from ceph/devcontainer-no-pat

8 months agocontainers/teuthology-dev: Remove access token 2018/head
Zack Cerza [Mon, 6 Jan 2025 22:33:08 +0000 (15:33 -0700)]
containers/teuthology-dev: Remove access token

This container is built and pushed via GitHub Actions. GHA likes to provision a
personal access token for each job that gives tightly-scoped access to the git
repository to the job. When we build our container, we end up including
`.git/config`, which contains the token. Later, in ceph-dev-stack's CI, an
`ls-remote` is run against ceph.git, which ends up causing git to prompt for
credentials even though the repo is public. Removing the token should allow
reading all the relevant repos from the built container image.

Signed-off-by: Zack Cerza <zack@redhat.com>
8 months agoteuthology: Add tests for seek and sync in write_file 2010/head
Christopher Hoffman [Mon, 6 Jan 2025 14:44:49 +0000 (14:44 +0000)]
teuthology: Add tests for seek and sync in write_file

Signed-off-by: Christopher Hoffman <choffman@redhat.com>
8 months agoteuthology: Add support for seek and sync in write_file
Christopher Hoffman [Fri, 25 Oct 2024 15:50:59 +0000 (15:50 +0000)]
teuthology: Add support for seek and sync in write_file

Signed-off-by: Christopher Hoffman <choffman@redhat.com>
8 months agoconfig: allow reading teuthology config from env var location 1998/head
John Mulligan [Fri, 9 Aug 2024 14:15:15 +0000 (10:15 -0400)]
config: allow reading teuthology config from env var location

Allow changing the default "user" location of the teuthology
configuration yaml using the (optional) TEUTHOLOGY_CONFIG environment
variable. This change aids my effort to run a customized local
teuthology environment.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
8 months agoMerge pull request #2015 from thuvh/lab_setup_python3
kyr [Wed, 25 Dec 2024 20:11:21 +0000 (21:11 +0100)]
Merge pull request #2015 from thuvh/lab_setup_python3

doc: use python3 cause of deprecating of python on ubuntu

8 months agodoc: use python3 cause of deprecating of python on ubuntu 2015/head
Hoai-Thu Vuong [Fri, 13 Dec 2024 04:06:06 +0000 (11:06 +0700)]
doc: use python3 cause of deprecating of python on ubuntu

Signed-off-by: Hoai-Thu Vuong <thuvh87@gmail.com>
9 months agoMerge pull request #2016 from ceph/dependabot/pip/configobj-5.0.9
kyr [Fri, 20 Dec 2024 22:57:37 +0000 (23:57 +0100)]
Merge pull request #2016 from ceph/dependabot/pip/configobj-5.0.9

build(deps): bump configobj from 5.0.8 to 5.0.9

9 months agoteuthology: drop __future__ import for print 2017/head
Kyrylo Shatskyy [Fri, 20 Dec 2024 00:00:07 +0000 (01:00 +0100)]
teuthology: drop __future__ import for print

We don't need to support backward compatibility with python 2.

Signed-off-by: Kyrylo Shatskyy <kyrylo.shatskyy@gmail.com>
9 months agobuild(deps): bump configobj from 5.0.8 to 5.0.9 2016/head
dependabot[bot] [Mon, 16 Dec 2024 22:51:42 +0000 (22:51 +0000)]
build(deps): bump configobj from 5.0.8 to 5.0.9

Bumps [configobj](https://github.com/DiffSK/configobj) from 5.0.8 to 5.0.9.
- [Release notes](https://github.com/DiffSK/configobj/releases)
- [Changelog](https://github.com/DiffSK/configobj/blob/release/CHANGES.rst)
- [Commits](https://github.com/DiffSK/configobj/compare/v5.0.8...v5.0.9)

---
updated-dependencies:
- dependency-name: configobj
  dependency-type: direct:production
...

Signed-off-by: dependabot[bot] <support@github.com>
9 months agoteuthology/misc: Add timeout parameter to stop_daemons_of_type for better flexibility 2014/head
Nitzan Mordechai [Tue, 10 Dec 2024 06:12:00 +0000 (06:12 +0000)]
teuthology/misc: Add timeout parameter to stop_daemons_of_type for better flexibility

Updated stop_daemons_of_type to accept a timeout parameter,
allowing dynamic control over the timeout value passed to the
stop function of each daemon.

Signed-off-by: Nitzan Mordechai <nmordech@redhat.com>
9 months agoMerge pull request #2013 from ceph/dependabot/pip/ansible-core-2.17.7
kyr [Wed, 4 Dec 2024 11:05:23 +0000 (12:05 +0100)]
Merge pull request #2013 from ceph/dependabot/pip/ansible-core-2.17.7

build(deps): bump ansible-core from 2.16.6 to 2.17.7

9 months agobuild(deps): bump ansible-core from 2.16.6 to 2.17.7 2013/head
dependabot[bot] [Tue, 3 Dec 2024 22:03:51 +0000 (22:03 +0000)]
build(deps): bump ansible-core from 2.16.6 to 2.17.7

Bumps [ansible-core](https://github.com/ansible/ansible) from 2.16.6 to 2.17.7.
- [Release notes](https://github.com/ansible/ansible/releases)
- [Commits](https://github.com/ansible/ansible/compare/v2.16.6...v2.17.7)

---
updated-dependencies:
- dependency-name: ansible-core
  dependency-type: direct:production
...

Signed-off-by: dependabot[bot] <support@github.com>
10 months agoMerge pull request #2009 from ceph/fog-image
Zack Cerza [Tue, 19 Nov 2024 17:10:53 +0000 (10:10 -0700)]
Merge pull request #2009 from ceph/fog-image

10 months agoMerge pull request #2008 from batrick/fragment-merge-base-config
Zack Cerza [Fri, 25 Oct 2024 16:20:21 +0000 (10:20 -0600)]
Merge pull request #2008 from batrick/fragment-merge-base-config

10 months agoprovision.fog: Be more informative re: API errors 2009/head
Zack Cerza [Thu, 24 Oct 2024 17:54:29 +0000 (11:54 -0600)]
provision.fog: Be more informative re: API errors

Signed-off-by: Zack Cerza <zack@redhat.com>
10 months agoteuthology/tasks/kernel: default branch should be "distro" fragment-merge-base-config 2008/head
Patrick Donnelly [Sat, 19 Oct 2024 17:39:38 +0000 (13:39 -0400)]
teuthology/tasks/kernel: default branch should be "distro"

Generally, we don't want to install any kernel by default. There also is no
"main" branch for the ceph-client.git project.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
10 months agoteuthology/suite: merge base_config with other fragments
Patrick Donnelly [Fri, 18 Oct 2024 12:44:13 +0000 (08:44 -0400)]
teuthology/suite: merge base_config with other fragments

Presently the code tries to merge the base_config when the worker starts
running. There's no need to construct it this way and it prevents sharing the
"defaults" with the fragment merging infrastructure. It also prevents
overriding defaults like:

    kernel
        branch: wip-pdonnell-i66704
        client:
            branch: wip-pdonnell-i66704
            flavor: default
            kdb: 1
            sha1: 745cacd8f31e50d7f3b6039bbd8c9a8dfc07bf03
        flavor: default
        kdb: 1
        sha1: 745cacd8f31e50d7f3b6039bbd8c9a8dfc07bf03

A YAML fragment can set kernel.client but it cannot delete the defaults for
kernel.(branch|flavor|kdb|sha1) because there's no way to remove YAML elements
via a deep merge.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
10 months agoMerge pull request #2002 from ceph/node-cleanup-false-pos
Zack Cerza [Mon, 21 Oct 2024 23:05:25 +0000 (17:05 -0600)]
Merge pull request #2002 from ceph/node-cleanup-false-pos

11 months agoteuthology/suite: set the kernel branch in the kernel config
Patrick Donnelly [Thu, 17 Oct 2024 19:59:07 +0000 (15:59 -0400)]
teuthology/suite: set the kernel branch in the kernel config

For inspection by scripts.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
11 months agoteuthology/suite: print base config before config merge
Patrick Donnelly [Thu, 17 Oct 2024 19:58:30 +0000 (15:58 -0400)]
teuthology/suite: print base config before config merge

So it's easier to see its value before any configs are generated.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
11 months agoMerge pull request #2004 from ceph/suite-user
Zack Cerza [Mon, 7 Oct 2024 18:23:00 +0000 (12:23 -0600)]
Merge pull request #2004 from ceph/suite-user

suite: use owner value for user name

11 months agoMerge pull request #2007 from ceph/wip-68285
Zack Cerza [Mon, 7 Oct 2024 18:22:31 +0000 (12:22 -0600)]
Merge pull request #2007 from ceph/wip-68285

run: Fix reference to ctx for interactive jobs

11 months agoMerge pull request #2006 from joscollin/wip-B68037-drop-packages_2-1
Zack Cerza [Mon, 7 Oct 2024 17:21:35 +0000 (11:21 -0600)]
Merge pull request #2006 from joscollin/wip-B68037-drop-packages_2-1

teuthology/task: drop packages that are not built

11 months agorun: Fix reference to ctx for interactive jobs 2007/head
Zack Cerza [Wed, 2 Oct 2024 16:45:34 +0000 (10:45 -0600)]
run: Fix reference to ctx for interactive jobs

Fixes: https://tracker.ceph.com/issues/68285
Signed-off-by: Zack Cerza <zack@redhat.com>
11 months agotest_run: Add test_main_interactive
Zack Cerza [Wed, 2 Oct 2024 16:44:59 +0000 (10:44 -0600)]
test_run: Add test_main_interactive

Signed-off-by: Zack Cerza <zack@redhat.com>
11 months agoteuthology/task: drop packages that are not built 2006/head
Jos Collin [Tue, 1 Oct 2024 03:24:07 +0000 (08:54 +0530)]
teuthology/task: drop packages that are not built

Fixes the error `Unable to find a match: python3-ceph`.

Fixes: https://tracker.ceph.com/issues/68037
Signed-off-by: Jos Collin <jcollin@redhat.com>
11 months agoMerge pull request #2005 from ceph/wip-B68037-drop-packages
Zack Cerza [Wed, 25 Sep 2024 15:55:47 +0000 (09:55 -0600)]
Merge pull request #2005 from ceph/wip-B68037-drop-packages

teuthology/task: drop packages that are not built

11 months agoMerge pull request #1936 from amathuria/wip-amat-fix-suite-sha1
Zack Cerza [Tue, 24 Sep 2024 17:44:01 +0000 (11:44 -0600)]
Merge pull request #1936 from amathuria/wip-amat-fix-suite-sha1

suite/run: Add a option to specify suite sha1

11 months agoteuthology/task: drop packages that are not built 2005/head
Jos Collin [Tue, 24 Sep 2024 11:02:23 +0000 (16:32 +0530)]
teuthology/task: drop packages that are not built

FIxes: https://tracker.ceph.com/issues/68037
Signed-off-by: Jos Collin <jcollin@redhat.com>
11 months agosuite: use owner value for user name 2004/head
Zack Cerza [Tue, 13 Aug 2024 20:29:45 +0000 (14:29 -0600)]
suite: use owner value for user name

Signed-off-by: Zack Cerza <zack@redhat.com>
12 months agolock: Avoid querying paddles for non-jobs 2002/head
Zack Cerza [Thu, 29 Aug 2024 23:03:20 +0000 (17:03 -0600)]
lock: Avoid querying paddles for non-jobs

When we encounter a node that's locked with a description that doesn't look
like it points to a job, avoid the inevitable 404 we'd get from paddles. Without
this, the cleanup process gets short-circuited.

Signed-off-by: Zack Cerza <zack@redhat.com>
12 months agoMerge pull request #2001 from ceph/bootstrap
Dan Mick [Wed, 28 Aug 2024 22:31:44 +0000 (15:31 -0700)]
Merge pull request #2001 from ceph/bootstrap

bootstrap: Simplify dependency checking/install

12 months agobootstrap: Simplify dependency checking/install 2001/head
Zack Cerza [Wed, 28 Aug 2024 21:42:29 +0000 (15:42 -0600)]
bootstrap: Simplify dependency checking/install

Signed-off-by: Zack Cerza <zack@redhat.com>
12 months agoMerge pull request #1997 from kshtsk/wip-exclude-tests-from-wheel 1.2.1
Zack Cerza [Mon, 26 Aug 2024 22:55:58 +0000 (16:55 -0600)]
Merge pull request #1997 from kshtsk/wip-exclude-tests-from-wheel

setup.cfg: exclude tests from wheel package

13 months agoMerge pull request #2000 from ceph/user
Dan Mick [Tue, 20 Aug 2024 23:51:36 +0000 (16:51 -0700)]
Merge pull request #2000 from ceph/user

orchestra: Respect username if set in ssh config

13 months agoorchestra: Respect username if set in ssh config 2000/head
Zack Cerza [Tue, 20 Aug 2024 22:24:33 +0000 (16:24 -0600)]
orchestra: Respect username if set in ssh config

Signed-off-by: Zack Cerza <zack@redhat.com>
13 months agosetup.cfg: exclude tests from packaging 1997/head
Kyr Shatskyy [Sun, 4 Aug 2024 13:12:42 +0000 (15:12 +0200)]
setup.cfg: exclude tests from packaging

Signed-off-by: Kyr Shatskyy <kyrylo.shatskyy@clyso.com>
13 months agoMerge pull request #1995 from kshtsk/wip-fix-1816 1.2.0
kyr [Wed, 14 Aug 2024 22:53:30 +0000 (00:53 +0200)]
Merge pull request #1995 from kshtsk/wip-fix-1816

contextutil.safe_while does not sleep between tries if needed to loop infinitely

13 months agoMerge pull request #1999 from ljflores/wip-revert-tracker-66742
Laura Flores [Wed, 14 Aug 2024 20:07:56 +0000 (15:07 -0500)]
Merge pull request #1999 from ljflores/wip-revert-tracker-66742

Revert "orchestra/daemon/cephadmunit.py: fix method kill_cmd()"

13 months agoRevert "orchestra/daemon/cephadmunit.py: fix method kill_cmd()" 1999/head
Laura Flores [Wed, 14 Aug 2024 19:37:06 +0000 (14:37 -0500)]
Revert "orchestra/daemon/cephadmunit.py: fix method kill_cmd()"

This reverts commit 2c6d64a222b4d38d4782ed407e045a86df0a6524.

Related tracker: https://tracker.ceph.com/issues/66883

13 months agoMerge pull request #1983 from ceph/expiry
Zack Cerza [Thu, 8 Aug 2024 18:59:24 +0000 (12:59 -0600)]
Merge pull request #1983 from ceph/expiry

Add job expiration dates

13 months agoMerge pull request #1990 from kshtsk/wip-downburst-fixes
Zack Cerza [Thu, 8 Aug 2024 16:22:10 +0000 (10:22 -0600)]
Merge pull request #1990 from kshtsk/wip-downburst-fixes

bunch of downburst fixes

13 months agosupervisor: Check for job expiration 1983/head
Zack Cerza [Thu, 1 Aug 2024 18:16:04 +0000 (12:16 -0600)]
supervisor: Check for job expiration

This commit isn't strictly necessary for the feature's implementation, but will
allow testing the feature on the production teuthology cluster before merging.

Signed-off-by: Zack Cerza <zack@redhat.com>
13 months agoAdd job expiration dates
Zack Cerza [Mon, 1 Jul 2024 23:00:46 +0000 (17:00 -0600)]
Add job expiration dates

This feature has two parts:
* Specifying expiration dates when scheduling test runs
* A global maximum age

Expiration dates are provided by passing `--expire` to `teuthology-suite` with
a relative value like `1d` (one day), `1w` (one week), or an absolute value like
`1999-12-31_23:59:59`.

A new configuration item, `max_job_age`, is specified in seconds. This defaults
to two weeks.

When the dispatcher checks the queue for the next job to run, it will first
compare the job's `timestamp` value - which reflects the time the job was
scheduled. If more than `max_job_age` seconds have passed, the job is skipped
and marked dead. It next checks for an `expire` value; if that value is in the
past, the job is skipped and marked dead. Otherwise, it will be run as usual.

Signed-off-by: Zack Cerza <zack@redhat.com>
13 months agosuite: Ensure teuthology config is consistent between tests
Zack Cerza [Thu, 1 Aug 2024 19:50:51 +0000 (13:50 -0600)]
suite: Ensure teuthology config is consistent between tests

test_init.py was making modifications to the config object that persisted
between tests. When I fixed that, initially some tests in test_run_.py started
failing because of settings in my local ~/.teuthology.yaml. This change causes
all of the tests in suite.test to use default config values.

Signed-off-by: Zack Cerza <zack@redhat.com>
13 months agotest_misc: Speed up and fix two test bugs
Zack Cerza [Thu, 1 Aug 2024 00:45:57 +0000 (18:45 -0600)]
test_misc: Speed up and fix two test bugs

One test had a missing assert; another had a comparison that would never fire
because of an expected exception being raised during the call.

Signed-off-by: Zack Cerza <zack@redhat.com>
13 months agoteuthology-dev: Allow passing more suite args
Zack Cerza [Tue, 9 Jul 2024 19:53:37 +0000 (13:53 -0600)]
teuthology-dev: Allow passing more suite args

Signed-off-by: Zack Cerza <zack@redhat.com>
13 months agoutil.time: Add parse_timestamp
Zack Cerza [Wed, 31 Jul 2024 20:11:52 +0000 (14:11 -0600)]
util.time: Add parse_timestamp

And move the format string to the time module.

Signed-off-by: Zack Cerza <zack@redhat.com>
13 months agoutil.time: Add new submodule with parse_offset()
Zack Cerza [Mon, 8 Jul 2024 22:24:10 +0000 (16:24 -0600)]
util.time: Add new submodule with parse_offset()

Signed-off-by: Zack Cerza <zack@redhat.com>