git.apps.os.sepia.ceph.com Git - teuthology.git/log
Brad Hubbard [Tue, 27 Jul 2021 04:04:04 +0000 (14:04 +1000)]
ceph_ansible: Remove --system-site-packages
Upgrading ansible is problematic as it conflicts with the installed
package on Ubuntu 20.04 so don't try to use system packages.
Add LANG environment variable to 'pip install ansible' command to work
around pip failing due to file names in ansible package with exotic
characters.
Fixes: https://tracker.ceph.com/issues/51856
Signed-off-by: Brad Hubbard <bhubbard@redhat.com>
Josh Durgin [Fri, 16 Jul 2021 17:49:50 +0000 (10:49 -0700)]
Merge pull request #1659 from kshtsk/wip-exclude-image
openstack: add exclude_image regex parameter
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Josh Durgin [Thu, 15 Jul 2021 20:17:27 +0000 (13:17 -0700)]
Merge pull request #1592 from kshtsk/wip-podman
add docker-compose scripts for development setups
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
kyr [Thu, 15 Jul 2021 19:56:38 +0000 (21:56 +0200)]
Merge pull request #1660 from kshtsk/wip-job-threshold
suite: rename disable-num-jobs-check to job-threshold
Kyr Shatskyy [Thu, 15 Jul 2021 10:32:21 +0000 (12:32 +0200)]
suite: rename disable-num-jobs-check to job-threshold
Rename --disable-num-jobs-check to --job-threshold:
- for a shorter, easier-to-recall name;
- to allow changing the threshold value via a parameter;
- to allow defining a default threshold value in the teuthology config.
Use `--job-threshold 0` to disable job threshold check.
Signed-off-by: Kyr Shatskyy <kyrylo.shatskyy@suse.com>
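The threshold semantics described above can be sketched as follows. This is a minimal illustration, not teuthology's code: the function name, the exception type, and the message are hypothetical, and the default of 500 is taken from the check_num_jobs() commit in this log.

```python
# Hypothetical sketch of a job threshold check; names are illustrative only.
DEFAULT_JOB_THRESHOLD = 500  # assumed default, per the check_num_jobs() commit


class TooManyJobsError(Exception):
    """Raised when a run would schedule more jobs than the threshold allows."""


def check_job_threshold(num_jobs, threshold=DEFAULT_JOB_THRESHOLD):
    """Raise if num_jobs exceeds threshold; a threshold of 0 disables the check."""
    if threshold and num_jobs > threshold:
        raise TooManyJobsError(
            "refusing to schedule %d jobs (threshold is %d)" % (num_jobs, threshold)
        )
    return num_jobs
```

Passing `threshold=0` mirrors `--job-threshold 0`: the comparison is skipped entirely, so any number of jobs is accepted.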
Kyr Shatskyy [Thu, 15 Jul 2021 11:10:05 +0000 (13:10 +0200)]
docker-compose: add some usage notes to the file
Signed-off-by: Kyr Shatskyy <kyrylo.shatskyy@suse.com>
Kyr Shatskyy [Wed, 14 Jul 2021 14:48:45 +0000 (16:48 +0200)]
openstack: add exclude_image regex parameter
Signed-off-by: Kyr Shatskyy <kyrylo.shatskyy@suse.com>
Neha Ojha [Mon, 12 Jul 2021 21:13:53 +0000 (14:13 -0700)]
Merge pull request #1657 from ceph/wip-max-jobs
teuthology/suite/run.py, scripts/suite.py: disallow scheduling too many jobs
Reviewed-by: Kyr Shatskyy <kyrylo.shatskyy@suse.com>
Neha Ojha [Thu, 1 Jul 2021 18:28:05 +0000 (18:28 +0000)]
teuthology/suite/run.py, scripts/suite.py: disallow scheduling too many jobs
Add check_num_jobs() to prevent users from accidentally scheduling too many jobs, like
in rfriedma-2021-06-26_19:32:15-rados-wip-ronenf-scrubs-config-distro-basic-smithi.
JOBS_TO_SCHEDULE_THRESHOLD, set to 500 (most runs have fewer jobs than this),
will disallow users from scheduling more than 500 jobs. Users can schedule
more than 500 jobs by disabling this check using the --disable-num-jobs-check flag.
Signed-off-by: Neha Ojha <nojha@redhat.com>
David Galloway [Wed, 7 Jul 2021 15:42:13 +0000 (11:42 -0400)]
Merge pull request #1654 from ceph/wip-focal
Update distro maps
David Galloway [Wed, 7 Jul 2021 14:22:55 +0000 (10:22 -0400)]
tests: Update latest distro versions
Signed-off-by: David Galloway <dgallowa@redhat.com>
Josh Durgin [Mon, 21 Jun 2021 21:46:12 +0000 (14:46 -0700)]
Merge pull request #1653 from ceph/nuke-9
nuke: kill -9 the teuthology process
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
David Galloway [Thu, 10 Jun 2021 15:02:59 +0000 (11:02 -0400)]
orchestra/opsys: Add CentOS 9 Stream
Signed-off-by: David Galloway <dgallowa@redhat.com>
David Galloway [Thu, 10 Jun 2021 15:02:26 +0000 (11:02 -0400)]
orchestra/opsys: Update latest distro versions
Signed-off-by: David Galloway <dgallowa@redhat.com>
Sage Weil [Wed, 9 Jun 2021 19:51:37 +0000 (14:51 -0500)]
nuke: kill -9 the teuthology process
If the process has been kill -STOPped, then we'll unlock the machines, but
the process will stick around and we'll try to nuke it again later,
zapping the machines after they're being used by some other job, leading
to failures. (Usually this manifests as an error when the other job stops
where it has trouble gzipping the logs.)
Use -9 to make sure even STOPped processes are killed.
Signed-off-by: Sage Weil <sage@newdream.net>
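The effect of -9 on a STOPped process can be reproduced locally. This is a sketch under assumptions: it uses a throwaway child process on a POSIX system and says nothing about how nuke itself locates or kills the teuthology process.

```python
import os
import signal
import subprocess
import sys


def kill_stopped_child():
    """Start a child, SIGSTOP it, then SIGKILL it and return its exit status."""
    child = subprocess.Popen([sys.executable, "-c", "import time; time.sleep(60)"])
    # Freeze the child, as 'kill -STOP' would.
    os.kill(child.pid, signal.SIGSTOP)
    # SIGKILL cannot be caught or blocked, and it also ends stopped processes.
    os.kill(child.pid, signal.SIGKILL)
    return child.wait(timeout=10)


if __name__ == "__main__":
    # Popen reports death by signal N as a return code of -N.
    print(kill_stopped_child())
```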
kyr [Thu, 27 May 2021 11:01:42 +0000 (13:01 +0200)]
Merge pull request #1625 from ideepika/wip-machine-type
teuthology-suite: add default machine type(smithi)
Deepika Upadhyay [Fri, 5 Mar 2021 07:04:33 +0000 (07:04 +0000)]
teuthology-suite: pick _machine_type /etc/teuthology.yml if not specified explicitly
Right now, users always have to pass --machine-type when scheduling a
run; when it is not specified, the command fails with a "no machine type
specified" error.
Instead of failing, we can fall back to `default_machine_type`, which in our case
should pick smithi, specified in /etc/teuthology.yml
Signed-off-by: Deepika Upadhyay <dupadhya@redhat.com>
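The fallback order described above can be sketched as below. The function name and the plain dict standing in for the parsed /etc/teuthology.yml are assumptions for illustration; only the `default_machine_type` key comes from the commit message.

```python
def resolve_machine_type(cli_machine_type, config):
    """Prefer the --machine-type value, then fall back to the config default.

    `config` is a plain dict standing in for the parsed teuthology config.
    """
    if cli_machine_type:
        return cli_machine_type
    default = config.get("default_machine_type")
    if default:
        return default
    # Neither source provided a value, so keep the old failure behavior.
    raise ValueError("no machine type specified")
```

With `{"default_machine_type": "smithi"}` as the config, omitting the command-line value yields `smithi`, while an explicit value still wins.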
Josh Durgin [Tue, 25 May 2021 14:22:32 +0000 (07:22 -0700)]
Merge pull request #1645 from ceph/exec-all-hosts
tasks/exec: add all-roles, all-hosts keys
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Patrick Donnelly [Fri, 21 May 2021 23:44:06 +0000 (16:44 -0700)]
Merge PR #1571 into master
* refs/pull/1571/head:
rpm: retry installing packages if mirrors are temporarily unreachable
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Kyr Shatskyy <kyrylo.shatskyy@suse.com>
Brad Hubbard [Fri, 21 May 2021 22:05:22 +0000 (08:05 +1000)]
Merge pull request #1649 from badone/wip-tracker-50922-container-build-wrong-arch
Restrict build_complete check to x86_64
Reviewed-by: Yuri Weinstein <yweinste@redhat.com>
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Brad Hubbard [Fri, 21 May 2021 05:22:26 +0000 (15:22 +1000)]
Restrict build_complete check to x86_64
Without this restriction a failed arm64 build will result in the
container build reporting failure.
Fixes: https://tracker.ceph.com/issues/50922
Signed-off-by: Brad Hubbard <bhubbard@redhat.com>
Vasu Kulkarni [Wed, 19 May 2021 16:27:49 +0000 (09:27 -0700)]
Merge pull request #1648 from sunilkumarn417/add-install
Include install task to get all RPM pkgs
sunilkumarn417 [Wed, 19 May 2021 16:09:32 +0000 (21:39 +0530)]
Include install task to get all RPM pkgs
Signed-off-by: sunilkumarn417 <sunnagar@redhat.com>
Josh Durgin [Tue, 18 May 2021 15:59:44 +0000 (08:59 -0700)]
Merge pull request #1646 from sunilkumarn417/container-tool-setup
task/internal/redhat.py: added container tool login support
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: rakeshgm <rakeshgm@redhat.com>
sunilkumarn417 [Fri, 14 May 2021 09:08:31 +0000 (14:38 +0530)]
task/internal/redhat.py: added container tool login support,
this task is essential to access monitoring images from
Red Hat registry source.
Signed-off-by: sunilkumarn417 <sunnagar@redhat.com>
Sage Weil [Mon, 17 May 2021 16:19:07 +0000 (11:19 -0500)]
Merge pull request #1647 from ceph/no-journald-to-syslog
task/internal/syslog: avoid failing runs when ceph daemon logs go to syslog misc.log
Sage Weil [Fri, 14 May 2021 15:31:25 +0000 (10:31 -0500)]
task/internal/syslog: ignore misc.log
These regexes are all intended for kernel errors. Ceph daemon
logs may leak into misc.log (*shakes fist at systemd-journald*)
and cause false positives (i.e., test failures).
Signed-off-by: Sage Weil <sage@newdream.net>
kyr [Sat, 15 May 2021 12:36:36 +0000 (14:36 +0200)]
Merge pull request #1564 from ceph/wip-lemme-kill-runs
kill.py: Allow deleting runs where bootstrap is failing
Sage Weil [Mon, 10 May 2021 16:13:25 +0000 (11:13 -0500)]
tasks/exec: add all-roles, all-hosts keys
'all' is ambiguous!
Signed-off-by: Sage Weil <sage@newdream.net>
Kyr Shatskyy [Fri, 29 Jan 2021 19:49:57 +0000 (20:49 +0100)]
docker-compose: add healthcheck for paddles postgres
Signed-off-by: Kyr Shatskyy <kyrylo.shatskyy@suse.com>
Kyr Shatskyy [Wed, 23 Dec 2020 10:42:15 +0000 (11:42 +0100)]
add docker-compose for starting paddles and pulpito
Signed-off-by: Kyr Shatskyy <kyrylo.shatskyy@suse.com>
Kyr Shatskyy [Fri, 11 Dec 2020 17:56:47 +0000 (18:56 +0100)]
docs: fix openSUSE qcow image links
Signed-off-by: Kyr Shatskyy <kyrylo.shatskyy@suse.com>
Kefu Chai [Fri, 30 Apr 2021 09:55:20 +0000 (17:55 +0800)]
Merge pull request #1644 from tchaikov/wip-pip
bootstrap: do not pass '--use-feature=2020-resolver' to pip
Reviewed-by: Kyr Shatskyy <kyrylo.shatskyy@suse.com>
Patrick Donnelly [Thu, 29 Apr 2021 23:55:15 +0000 (16:55 -0700)]
Merge PR #1626 into master
* refs/pull/1626/head:
orchestra: move methods for shell commands from remote.Remote
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Kefu Chai [Thu, 29 Apr 2021 14:43:14 +0000 (22:43 +0800)]
bootstrap: do not pass '--use-feature=2020-resolver' to pip
this reverts f2607ee8ce149f2951c5fd62c259fc4fa3ddcb5a
to silence the warning from pip:
WARNING: --use-feature=2020-resolver no longer has any effect, since it
is now the default dependency resolver in pip. This will become an error
in pip 21.0.
Signed-off-by: Kefu Chai <kchai@redhat.com>
Josh Durgin [Thu, 29 Apr 2021 02:30:22 +0000 (19:30 -0700)]
Merge pull request #1642 from jdurgin/wip-retry-paddles-reads
lock/query: make robust against paddles errors
Reviewed-by: Sage Weil <sage@redhat.com>
Josh Durgin [Tue, 20 Apr 2021 05:49:43 +0000 (01:49 -0400)]
lock/query: make robust against paddles errors
Retry paddles requests, and for get_status() return an empty dict
rather than None so callers behave.
get_status() failing in particular has caused the dispatcher and jobs
to fail several times over the past few weeks. With this change, we
should be able to run multiple paddles workers again, since all the
common callers will retry on error.
Signed-off-by: Josh Durgin <jdurgin@redhat.com>
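A minimal sketch of the read-side pattern described above: retry the request, and hand callers an empty dict instead of None when every attempt fails. The helper name, the `fetch` callable, and the use of ConnectionError as the failure type are assumptions, not the code in lock/query.

```python
import time


def get_status_with_retry(fetch, tries=3, delay=0.0):
    """Call fetch() up to `tries` times and never return None.

    `fetch` is a hypothetical callable that returns a dict, returns None,
    or raises ConnectionError when the server misbehaves.
    """
    for _ in range(tries):
        try:
            status = fetch()
        except ConnectionError:
            status = None
        if status is not None:
            return status
        time.sleep(delay)
    # An empty dict lets callers use .get() without a None check.
    return {}
```

Returning `{}` keeps call sites such as `status.get("locked")` from raising AttributeError when the server is briefly unavailable.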
Kefu Chai [Mon, 12 Apr 2021 12:29:33 +0000 (20:29 +0800)]
Merge pull request #1641 from tchaikov/always-be-happy
task/internal: do not fail the script if systemd-sysusers core file not found
Reviewed-by: Sage Weil <sage@redhat.com>
Kefu Chai [Mon, 12 Apr 2021 05:16:10 +0000 (13:16 +0800)]
task/internal: do not fail the script if systemd-sysusers core file not found
in 79f373c1769ea4f9d744cf33c5b0a0e026922d0f, we started to filter out
the systemd-sysusers core files, but the script fails if no such file
is found, like:
2021-04-12T02:58:51.065 ERROR:teuthology.run_tasks:Manager failed: internal.coredump
Traceback (most recent call last):
File "/home/teuthworker/src/git.ceph.com_git_teuthology_85d61eae4759f46ce21e9a37cd816a7a1a66c9d5/teuthology/run_tasks.py", line 176, in run_tasks
suppress = manager.__exit__(*exc_info)
File "/usr/lib/python3.6/contextlib.py", line 88, in __exit__
next(self.gen)
File "/home/teuthworker/src/git.ceph.com_git_teuthology_85d61eae4759f46ce21e9a37cd816a7a1a66c9d5/teuthology/task/internal/__init__.py", line 479, in coredump
wait=False,
File "/home/teuthworker/src/git.ceph.com_git_teuthology_85d61eae4759f46ce21e9a37cd816a7a1a66c9d5/teuthology/orchestra/run.py", line 479, in wait
proc.wait()
File "/home/teuthworker/src/git.ceph.com_git_teuthology_85d61eae4759f46ce21e9a37cd816a7a1a66c9d5/teuthology/orchestra/run.py", line 161, in wait
self._raise_for_status()
File "/home/teuthworker/src/git.ceph.com_git_teuthology_85d61eae4759f46ce21e9a37cd816a7a1a66c9d5/teuthology/orchestra/run.py", line 183, in _raise_for_status
node=self.hostname, label=self.label
teuthology.exceptions.CommandFailedError: Command failed on smithi165 with status 1: "sudo sysctl -w kernel.core_pattern=core && sudo bash -c 'for f in `find /home/ubuntu/cephtest/archive/coredump -type f`; do file $f | grep -q systemd-sysusers && rm $f ; done' && rmdir --ignore-fail-on-non-empty -- /home/ubuntu/cephtest/archive/coredump"
in this change, we ensure that the script never fails by adding `|| true`.
Signed-off-by: Kefu Chai <kchai@redhat.com>
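The failure mode can be reproduced without any core files. In the sketch below, a simplified loop stands in for the real script: `echo` replaces `file` and `rm`, and the two loop items are made up. The point it illustrates is that a `for` loop exits with the status of its last command, so a final `grep -q` miss makes the whole loop fail unless `|| true` is appended.

```python
import subprocess


def run_loop(suffix):
    """Run a simplified version of the filter loop and return its exit status."""
    script = (
        "for f in one two; do "
        "echo $f | grep -q systemd-sysusers && echo would-remove $f"
        + suffix
        + "; done"
    )
    return subprocess.run(["sh", "-c", script]).returncode


if __name__ == "__main__":
    print(run_loop(""))          # grep matches nothing, so the loop fails
    print(run_loop(" || true"))  # the fallback forces a zero exit status
```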
Kefu Chai [Mon, 12 Apr 2021 05:15:24 +0000 (13:15 +0800)]
task/internal: split embedded shell into lines
for better readability
Signed-off-by: Kefu Chai <kchai@redhat.com>
Rishabh Dave [Thu, 4 Mar 2021 11:08:37 +0000 (16:38 +0530)]
orchestra: move methods for shell commands from remote.Remote
Move methods that issue commands via shell and that don't necessarily
need to depend on SSH from class Remote to a different class. This
enables applications like vstart_runner.py (in Ceph repo) to reuse these
methods for running tests locally without necessarily depending on SSH
and without duplicating them in vstart_runner.py.
Signed-off-by: Rishabh Dave <ridave@redhat.com>
Patrick Donnelly [Wed, 31 Mar 2021 18:06:08 +0000 (11:06 -0700)]
Merge PR #1634 into master
* refs/pull/1634/head:
orchestra/remote: extend mktemp() to accept data
Reviewed-by: Kyr Shatskyy <kyrylo.shatskyy@suse.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Rishabh Dave [Fri, 26 Mar 2021 09:26:11 +0000 (14:56 +0530)]
orchestra/remote: extend mktemp() to accept data
Extend remote.Remote.mktemp() to accept data as a parameter and write
the data to the temporary file after it is created.
Signed-off-by: Rishabh Dave <ridave@redhat.com>
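As a local analogue of the behavior described above, the sketch below creates a temporary file and optionally writes data into it. It uses the standard tempfile module on the local machine; the function name and signature are hypothetical and are not those of remote.Remote.mktemp().

```python
import os
import tempfile


def mktemp_with_data(data=None, suffix=""):
    """Create a temporary file, write `data` to it if given, return its path."""
    fd, path = tempfile.mkstemp(suffix=suffix)
    # Wrap the descriptor so it is closed even when there is nothing to write.
    with os.fdopen(fd, "w") as handle:
        if data is not None:
            handle.write(data)
    return path
```

The caller owns the returned path and is responsible for removing the file when done.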
Josh Durgin [Mon, 29 Mar 2021 21:58:24 +0000 (14:58 -0700)]
Merge pull request #1636 from ideepika/fix-interactive-error
check ctx.archive is present or not in yaml config
Reviewed-by: Kefu Chai <kchai@redhat.com>
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Deepika Upadhyay [Mon, 29 Mar 2021 14:46:51 +0000 (20:16 +0530)]
check ctx.archive is present or not in yaml config
This is specifically for interactive-on-error mode, where we usually do
not specify archive_path, which fails without this check.
Signed-off-by: Deepika Upadhyay <dupadhya@redhat.com>
Josh Durgin [Thu, 25 Mar 2021 17:05:33 +0000 (10:05 -0700)]
Merge pull request #1633 from jdurgin/wip-retry-paddles-writes
report, lock.ops: retry write requests to paddles
Reviewed-by: Neha Ojha <nojha@redhat.com>
Josh Durgin [Sun, 21 Mar 2021 22:28:52 +0000 (18:28 -0400)]
report, lock.ops: retry write requests to paddles
For more contended cases of updating job status and machine keys,
where we've seen 500 errors from DB conflicts, use random intervals
for the retries.
This is the teuthology half of fixing:
https://tracker.ceph.com/issues/49864
Signed-off-by: Josh Durgin <jdurgin@redhat.com>
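Retrying with random intervals keeps contending writers from colliding again on the same schedule. The sketch below shows the idea under assumptions: the helper name, the RuntimeError stand-in for a 500 response, and the delay bounds are all illustrative rather than taken from report or lock.ops.

```python
import random
import time


def retry_with_jitter(op, tries=5, min_delay=0.0, max_delay=0.01, sleep=time.sleep):
    """Call op() until it succeeds, sleeping a random interval between attempts.

    `op` is a hypothetical callable that raises RuntimeError on a conflict.
    The last failure is re-raised once `tries` attempts are used up.
    """
    for attempt in range(1, tries + 1):
        try:
            return op()
        except RuntimeError:
            if attempt == tries:
                raise
            # A random pause de-synchronizes writers that failed together.
            sleep(random.uniform(min_delay, max_delay))
```

The `sleep` parameter exists so a test can substitute a recorder and avoid real waiting.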
Sage Weil [Sun, 21 Mar 2021 18:16:35 +0000 (13:16 -0500)]
Merge pull request #1632 from ceph/revert-nuke
Revert "Merge pull request #1631 from jdurgin/wip-nuke-poweroff"
Sage Weil [Sun, 21 Mar 2021 16:39:13 +0000 (11:39 -0500)]
Revert "Merge pull request #1631 from jdurgin/wip-nuke-poweroff"
This reverts commit c48eb744081d22bc82d7d099d4edb67ae02551e0, reversing
changes made to b96569170f15eae4604f361990ea65737b28dff1.
This is causing log gzipping to fail because the logs already exist as .gz files.
My guess is that the logs are left over from previous, but I'm not sure how
that would happen.
In any case, the merge of this PR corresponds exactly to when we started seeing
the log gzip failures.
Signed-off-by: Sage Weil <sage@newdream.net>
Josh Durgin [Fri, 19 Mar 2021 22:50:18 +0000 (15:50 -0700)]
Merge pull request #1631 from jdurgin/wip-nuke-poweroff
nuke: don't power-off machines when not rebooting
Reviewed-by: Neha Ojha <nojha@redhat.com>
Josh Durgin [Fri, 19 Mar 2021 21:01:20 +0000 (21:01 +0000)]
nuke: don't power-off machines when not rebooting
This ensures jobs that time out can still have their logs gathered.
Signed-off-by: Josh Durgin <jdurgin@redhat.com>
Josh Durgin [Sat, 13 Mar 2021 03:30:42 +0000 (19:30 -0800)]
Merge pull request #1628 from ceph/ignore-systemd-sysusers-core
task/internal: ignore systemd-sysusers core file
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Sage Weil [Fri, 12 Mar 2021 17:58:47 +0000 (11:58 -0600)]
task/internal: ignore systemd-sysusers core file
This is related to dnsmasq. When installing the kubic podman 3.0.1
packages,
Running scriptlet: dnsmasq-2.79-13.el8_3.1.x86_64 14/16
/var/tmp/rpm-tmp.6MFp00: line 5: 9079 Segmentation fault (core dumped) systemd-sysusers - &> /dev/null <<SYSTEMD_INLINE_EOF
u dnsmasq - "Dnsmasq DHCP and DNS server" /var/lib/dnsmasq
SYSTEMD_INLINE_EOF
Installing : dnsmasq-2.79-13.el8_3.1.x86_64 14/16
warning: group dnsmasq does not exist - using root
warning: group dnsmasq does not exist - using root
warning: group dnsmasq does not exist - using root
Running scriptlet: dnsmasq-2.79-13.el8_3.1.x86_64 14/16
/var/tmp/rpm-tmp.pfCGxn: line 3: 9089 Segmentation fault (core dumped) systemd-sysusers &> /dev/null
Installing : podman-3.0.1-2.el8.3.2.x86_64 15/16
Installing : podman-plugins-3.0.1-2.el8.3.2.x86_64 16/16
Running scriptlet: container-selinux-2:2.145.0-1.el8.noarch 16/16
Running scriptlet: podman-plugins-3.0.1-2.el8.3.2.x86_64 16/16
/var/tmp/rpm-tmp.bFfmjl: line 6: 11098 Segmentation fault (core dumped) /usr/bin/systemd-sysusers
warning: %triggerin(systemd-239-18.el8.x86_64) scriptlet failed, exit status 139
Error in <unknown> scriptlet in rpm package podman-plugins
Verifying : dnsmasq-2.79-13.el8_3.1.x86_64 1/16
Nothing to do with us.
Signed-off-by: Sage Weil <sage@newdream.net>
kyr [Fri, 12 Mar 2021 09:20:23 +0000 (10:20 +0100)]
Merge pull request #1573 from smithfarm/wip-45570
orchestra/console: raise RuntimeError when fail to power on
Josh Durgin [Thu, 11 Mar 2021 16:48:34 +0000 (08:48 -0800)]
Merge pull request #1627 from ceph/wip-debug-levels
suite/placeholder.py: lower osd specific debug levels
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Neha Ojha [Wed, 10 Mar 2021 23:33:55 +0000 (23:33 +0000)]
suite/placeholder.py: lower osd specific debug levels
Signed-off-by: Neha Ojha <nojha@redhat.com>
Brad Hubbard [Tue, 9 Mar 2021 22:20:21 +0000 (08:20 +1000)]
Merge pull request #1620 from ceph/wip-badone-ceph-ansible-tracker-49485
ceph_ansible: Satisfy 'six' dependency
Reviewed-by: Yuri Weinstein <yweins@redhat.com>
Sage Weil [Sat, 27 Feb 2021 20:13:30 +0000 (14:13 -0600)]
selinux: fix typo
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil [Sat, 27 Feb 2021 17:39:02 +0000 (11:39 -0600)]
Merge pull request #1622 from ceph/ignore-selinux-sssd
selinux: ignore issues with sssd
Sage Weil [Sat, 27 Feb 2021 15:26:36 +0000 (09:26 -0600)]
selinux: ignore issues with sssd
['type=AVC msg=audit(1614438637.552:5615): avc: denied { read } for pid=876 comm="sssd" name="resolv.conf" dev="sda1" ino=265261 scontext=system_u:system_r:sssd_t:s0 tcontext=unconfined_u:object_r:admin_home_t:s0 tclass=file permissive=1']
(currently seen on rhel 8.3)
Signed-off-by: Sage Weil <sage@newdream.net>
kyr [Fri, 26 Feb 2021 22:49:30 +0000 (23:49 +0100)]
Merge pull request #1621 from kshtsk/wip-math-gcd
suite/matrix: use math.gcd instead of fractions.gcd
Kyr Shatskyy [Fri, 26 Feb 2021 14:13:59 +0000 (15:13 +0100)]
requirements: use ansible 2.9
Signed-off-by: Kyr Shatskyy <kyrylo.shatskyy@suse.com>
Kyr Shatskyy [Fri, 26 Feb 2021 10:20:28 +0000 (11:20 +0100)]
requirements: bump up cffi to 1.14.5
Needed to run on Big Sur with python3.9 from brew; addresses a
build error for the cffi wheel:
clang -Wno-unused-result -Wsign-compare -Wunreachable-code -fno-common -dynamic -DNDEBUG -g -fwrapv -O3 -Wall -isysroot /Library/Developer/CommandLineTools/SDKs/MacOSX.sdk -DUSE__THREAD -DHAVE_SYNC_SYNCHRONIZE -I/Library/Developer/CommandLineTools/SDKs/MacOSX.sdk/usr/include/ffi -I/Library/Developer/CommandLineTools/SDKs/MacOSX.sdk/usr/include/ffi -I/opt/homebrew/include -I/opt/homebrew/opt/openssl@1.1/include -I/opt/homebrew/opt/sqlite/include -I/opt/homebrew/opt/tcl-tk/include -I/Users/kyr/kshtsk/teuthology/virtualenv/include -I/opt/homebrew/Cellar/python@3.9/3.9.2_1/Frameworks/Python.framework/Versions/3.9/include/python3.9 -c c/_cffi_backend.c -o build/temp.macosx-11-arm64-3.9/c/_cffi_backend.o
c/_cffi_backend.c:6185:5: warning: 'PyEval_InitThreads' is deprecated [-Wdeprecated-declarations]
PyEval_InitThreads();
^
/opt/homebrew/Cellar/python@3.9/3.9.2_1/Frameworks/Python.framework/Versions/3.9/include/python3.9/ceval.h:130:1: note: 'PyEval_InitThreads' has been explicitly marked deprecated here
Py_DEPRECATED(3.9) PyAPI_FUNC(void) PyEval_InitThreads(void);
^
/opt/homebrew/Cellar/python@3.9/3.9.2_1/Frameworks/Python.framework/Versions/3.9/include/python3.9/pyport.h:508:54: note: expanded from macro 'Py_DEPRECATED'
#define Py_DEPRECATED(VERSION_UNUSED) __attribute__((__deprecated__))
^
c/_cffi_backend.c:6245:9: error: implicit declaration of function 'ffi_prep_closure' is invalid in C99 [-Werror,-Wimplicit-function-declaration]
if (ffi_prep_closure(closure, &cif_descr->cif,
^
1 warning and 1 error generated.
error: command '/usr/bin/clang' failed with exit code 1
Signed-off-by: Kyr Shatskyy <kyrylo.shatskyy@suse.com>
Kyr Shatskyy [Fri, 26 Feb 2021 13:13:31 +0000 (14:13 +0100)]
requirements.in: stick ansible version to 2.8 version
Since we are not ready for ansible 3 from ceph-cm-ansible point of view:
2021-02-26T12:45:17.668 INFO:teuthology.task.ansible.out:ERROR! couldn't resolve module/action 'firewalld'. This often indicates a misspelling, missing collection, or incorrect module path.
Signed-off-by: Kyr Shatskyy <kyrylo.shatskyy@suse.com>
Kyr Shatskyy [Fri, 26 Feb 2021 11:57:29 +0000 (12:57 +0100)]
requirements.in: stick pytest to 3.7.1 version
Until someone fixes the unit tests.
Signed-off-by: Kyr Shatskyy <kyrylo.shatskyy@suse.com>
Kyr Shatskyy [Fri, 26 Feb 2021 10:19:46 +0000 (11:19 +0100)]
suite/matrix: latest py3 deprecates fractions.gcd
Signed-off-by: Kyrylo Shatskyy <kyr@top.local>
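For context, fractions.gcd was deprecated in Python 3.5 and removed in Python 3.9, while math.gcd has been available since 3.5. The sketch below shows the replacement import; the `lcm` helper is a hypothetical example of a gcd consumer and is not claimed to be what suite/matrix does.

```python
from math import gcd  # replaces the removed fractions.gcd


def lcm(a, b):
    """Least common multiple built on math.gcd (illustrative helper)."""
    return a * b // gcd(a, b)


if __name__ == "__main__":
    print(gcd(12, 18))
    print(lcm(4, 6))
```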
Brad Hubbard [Thu, 25 Feb 2021 08:38:31 +0000 (18:38 +1000)]
ceph_ansible: Satisfy 'six' dependency
Fixes: https://tracker.ceph.com/issues/49485
Signed-off-by: Brad Hubbard <bhubbard@redhat.com>
Sage Weil [Thu, 18 Feb 2021 21:58:10 +0000 (15:58 -0600)]
Merge pull request #1618 from ceph/valgrind-soname
misc: make valgrind behave with tcmalloc
Sage Weil [Thu, 18 Feb 2021 16:23:14 +0000 (10:23 -0600)]
misc: make valgrind behave with tcmalloc
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil [Thu, 18 Feb 2021 14:43:46 +0000 (08:43 -0600)]
Merge pull request #1617 from ceph/no-fsid-for-state
orchestra/daemon/state: do not pass fsid property to run() later
Sage Weil [Wed, 17 Feb 2021 18:47:45 +0000 (13:47 -0500)]
orchestra/daemon/state: do not pass fsid property to run() later
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil [Wed, 17 Feb 2021 15:59:14 +0000 (09:59 -0600)]
Merge pull request #1616 from ceph/ignore-signal-exceptions
orchestra/daemon/cephadmunit: ignore exception when sending signal
Sage Weil [Wed, 17 Feb 2021 03:27:32 +0000 (21:27 -0600)]
orchestra/daemon/cephadmunit: ignore exception when sending signal
The osd thrashing is sending lots of signals (sighup) and can easily race with
a daemon shutting down entirely.
This makes us match the behavior of the original state.py signal() method.
Signed-off-by: Sage Weil <sage@newdream.net>
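The race described above, where a signal is sent to a daemon that has already exited, can be sketched locally. This is an analogue only: it uses os.kill on a local child process, and the helper name is hypothetical; cephadmunit's actual signal delivery is not shown in this log.

```python
import os
import signal
import subprocess
import sys


def send_signal_quietly(pid, sig):
    """Send `sig` to `pid`; return False instead of raising if it is gone."""
    try:
        os.kill(pid, sig)
        return True
    except ProcessLookupError:
        # The target exited between the decision to signal and the call.
        return False


def demo():
    """Signal a child that has already exited and been reaped."""
    child = subprocess.Popen([sys.executable, "-c", "pass"])
    child.wait()
    return send_signal_quietly(child.pid, signal.SIGHUP)
```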
Josh Durgin [Tue, 16 Feb 2021 01:38:24 +0000 (17:38 -0800)]
Merge pull request #1615 from jdurgin/wip-debug-ms
suite: lower debug_ms for osd back to 1
Reviewed-by: Neha Ojha <nojha@redhat.com>
Josh Durgin [Tue, 16 Feb 2021 00:15:45 +0000 (19:15 -0500)]
suite: lower debug_ms for osd back to 1
This was increased for some mgr issues in
044384be450a557f56a2b39bf7d0e71e69d45cd3, but isn't helping much now
and is filling up disks for long-running tests.
Signed-off-by: Josh Durgin <jdurgin@redhat.com>
Josh Durgin [Sat, 13 Feb 2021 00:29:38 +0000 (16:29 -0800)]
Merge pull request #1614 from jdurgin/wip-nuke-tests
nuke: fix no_reboot only being present in the cli and add unit tests
Reviewed-by: Neha Ojha <nojha@redhat.com>
Josh Durgin [Fri, 12 Feb 2021 22:54:17 +0000 (22:54 +0000)]
test_nuke: add unit tests for internal nuke options
Signed-off-by: Josh Durgin <jdurgin@redhat.com>
Josh Durgin [Fri, 12 Feb 2021 22:53:38 +0000 (22:53 +0000)]
nuke: only use no_reboot on the cli
Signed-off-by: Josh Durgin <jdurgin@redhat.com>
Josh Durgin [Fri, 12 Feb 2021 18:29:47 +0000 (10:29 -0800)]
Merge pull request #1613 from jdurgin/wip-nuke-keep-logs
nuke: only use keep_logs from the cli
Reviewed-by: Neha Ojha <nojha@redhat.com>
Josh Durgin [Thu, 11 Feb 2021 22:59:52 +0000 (22:59 +0000)]
nuke: only use keep_logs from the cli
nuke() is called outside of the cli with a ctx that does not include
all the cli args. Use a default parameter for the functions instead of ctx.
Signed-off-by: Josh Durgin <jdurgin@redhat.com>
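The change from reading a ctx attribute to using a default parameter can be sketched as below. Everything here is illustrative: `nuke_one`, the step strings, and the host name `smithi001` are hypothetical, and the SimpleNamespace stands in for a ctx built outside the cli.

```python
from types import SimpleNamespace


def nuke_one(ctx, name, keep_logs=False):
    """Return the cleanup steps for one machine (illustrative only).

    Reading ctx.keep_logs here would raise AttributeError for callers whose
    ctx was not built from cli args, so the option is a keyword argument.
    """
    steps = ["unlock " + name]
    if not keep_logs:
        steps.insert(0, "remove logs on " + name)
    return steps


# A ctx created by an internal caller, without any cli-only attributes.
internal_ctx = SimpleNamespace(owner="someone")
```

An internal caller can now invoke `nuke_one(internal_ctx, "smithi001")` and get the default behavior, while the cli passes `keep_logs` explicitly.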
Sage Weil [Thu, 11 Feb 2021 22:43:28 +0000 (16:43 -0600)]
Merge pull request #1612 from ceph/nicer-ls
ls: nicer ls output
Sage Weil [Wed, 10 Feb 2021 22:22:53 +0000 (22:22 +0000)]
ls: nicer ls output
- no error when teuthology.log is missing (provisioning)
- leave off pid
Signed-off-by: Sage Weil <sage@redhat.com>
kyr [Thu, 11 Feb 2021 14:15:57 +0000 (15:15 +0100)]
Merge pull request #1611 from ceph/dependabot/pip/cryptography-3.3.2
build(deps): bump cryptography from 3.2 to 3.3.2
dependabot[bot] [Thu, 11 Feb 2021 14:09:41 +0000 (14:09 +0000)]
build(deps): bump cryptography from 3.2 to 3.3.2
Bumps [cryptography](https://github.com/pyca/cryptography) from 3.2 to 3.3.2.
- [Release notes](https://github.com/pyca/cryptography/releases)
- [Changelog](https://github.com/pyca/cryptography/blob/master/CHANGELOG.rst)
- [Commits](https://github.com/pyca/cryptography/compare/3.2...3.3.2)
Signed-off-by: dependabot[bot] <support@github.com>
kyr [Thu, 11 Feb 2021 14:07:51 +0000 (15:07 +0100)]
Merge pull request #1609 from ceph/dependabot/pip/httplib2-0.19.0
build(deps): bump httplib2 from 0.18.0 to 0.19.0
Josh Durgin [Tue, 9 Feb 2021 22:10:00 +0000 (14:10 -0800)]
Merge pull request #1610 from jdurgin/wip-supervisor-timeouts
supervisor: improve error handling for dead jobs
Reviewed-by: Andrew Schoen <aschoen@redhat.com>
Josh Durgin [Tue, 9 Feb 2021 21:33:34 +0000 (21:33 +0000)]
supervisor: send paddles the reason a job is marked dead
Signed-off-by: Josh Durgin <jdurgin@redhat.com>
Josh Durgin [Tue, 9 Feb 2021 21:16:46 +0000 (21:16 +0000)]
supervisor: kill processes before gathering logs
When we hit the max job timeout, we need to stop the test programs
before collecting logs or else we run into errors like 'file size
changed while zipping' trying to compress them, and we can't save them
or stop the job.
Signed-off-by: Josh Durgin <jdurgin@redhat.com>
Josh Durgin [Tue, 9 Feb 2021 19:24:02 +0000 (19:24 +0000)]
nuke: allow not rebooting again
The default behavior was changed to always reboot in
1d47a121b385e2656e9314e9d63faf68a8e865e4 but the --reboot-all option
remained. Keep the original option around for compatibility with
existing scripts.
Signed-off-by: Josh Durgin <jdurgin@redhat.com>
Josh Durgin [Tue, 9 Feb 2021 18:54:28 +0000 (18:54 +0000)]
nuke: add option to preserve logs on remote machines
This will be helpful for killing jobs that hit the max_job_timeout
while still being able to collect logs from them.
Signed-off-by: Josh Durgin <jdurgin@redhat.com>
dependabot[bot] [Mon, 8 Feb 2021 20:52:33 +0000 (20:52 +0000)]
build(deps): bump httplib2 from 0.18.0 to 0.19.0
Bumps [httplib2](https://github.com/httplib2/httplib2) from 0.18.0 to 0.19.0.
- [Release notes](https://github.com/httplib2/httplib2/releases)
- [Changelog](https://github.com/httplib2/httplib2/blob/master/CHANGELOG)
- [Commits](https://github.com/httplib2/httplib2/compare/v0.18.0...v0.19.0)
Signed-off-by: dependabot[bot] <support@github.com>
David Galloway [Fri, 5 Feb 2021 17:37:38 +0000 (12:37 -0500)]
Merge pull request #1608 from kshtsk/fix-docs
readme: fix teuthology docs link at docs.ceph.com
kyr [Fri, 5 Feb 2021 17:19:24 +0000 (18:19 +0100)]
Merge pull request #1601 from sebastian-philipp/prio-add-job-count
teuthology-suite: Add job count to priority error msg.
Kyr Shatskyy [Fri, 5 Feb 2021 17:15:49 +0000 (18:15 +0100)]
readme: fix teuthology docs link at docs.ceph.com
Signed-off-by: Kyr Shatskyy <kyrylo.shatskyy@suse.com>
kyr [Fri, 5 Feb 2021 16:19:02 +0000 (17:19 +0100)]
Merge pull request #1607 from kshtsk/ver-1.1.0
version: increase version to 1.1.0 since we have dispatcher
kyr [Fri, 5 Feb 2021 16:11:44 +0000 (17:11 +0100)]
Merge pull request #1606 from kshtsk/supervisor-log
dispatcher: add .log extension for supervisor log
Kyr Shatskyy [Fri, 5 Feb 2021 16:09:40 +0000 (17:09 +0100)]
version: increase version to 1.1.0 since we have dispatcher
Signed-off-by: Kyr Shatskyy <kyrylo.shatskyy@suse.com>
Kyr Shatskyy [Fri, 5 Feb 2021 16:04:52 +0000 (17:04 +0100)]
dispatcher: add .log extension for supervisor log
It would be great to have an extension for easy log identification.
Signed-off-by: Kyr Shatskyy <kyrylo.shatskyy@suse.com>
Dan Mick [Thu, 4 Feb 2021 23:09:08 +0000 (15:09 -0800)]
Merge pull request #1605 from jdurgin/wip-supervisor-connect-error
dispatcher/supervisor: always unlock machines and save status
Josh Durgin [Thu, 4 Feb 2021 22:56:53 +0000 (17:56 -0500)]
dispatcher/supervisor: always unlock machines and save status
If we can't connect to the machines anymore, we still need to clean
up.
Signed-off-by: Josh Durgin <jdurgin@redhat.com>