git.apps.os.sepia.ceph.com Git - ceph-build.git/log
Dan Mick [Wed, 19 Jul 2023 20:31:31 +0000 (13:31 -0700)]
ansible: use 'inventory_hostname' to look up jenkins labels
inventory_hostname is as it appears in the inventory file, and keeping
that consistent is easier than dealing with short vs long names
Note: this requires a change to ceph-sepia-secrets to standardize on
the inventory form of the name
Signed-off-by: Dan Mick <dmick@redhat.com>
Guillaume Abrioux [Mon, 7 Aug 2023 15:21:55 +0000 (17:21 +0200)]
Merge pull request #2156 from ceph/fix-container-flake8
ceph-container-flake8: fix test
Guillaume Abrioux [Mon, 7 Aug 2023 15:10:57 +0000 (17:10 +0200)]
ceph-container-flake8: fix test
Looks like this job has never worked.
The file should be bind-mounted in /code, not in /
There's no need to pass arguments; the entrypoint automatically
scans /code and runs flake8 against any files present.
Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>
Dan Mick [Thu, 3 Aug 2023 07:13:28 +0000 (00:13 -0700)]
Merge pull request #2154 from Matan-B/wip-matanb-crimson-centos9
ceph-dev*: build crimson on centos9 (main and reef)
Ionut BALUTOIU [Wed, 2 Aug 2023 08:12:46 +0000 (11:12 +0300)]
Merge pull request #2155 from petrutlucian94/log_disk_space
Lucian Petrut [Wed, 2 Aug 2023 08:01:58 +0000 (08:01 +0000)]
windows: log available disk space
Some Windows test jobs have failed after the OSDs ran out of disk
space. Those jobs use Linux VMs with 128GB of storage and run
3 OSDs, each having 15GB.
In order to get a better picture, we'll log the disk space usage,
including the vstart directory contents.
Signed-off-by: Lucian Petrut <lpetrut@cloudbasesolutions.com>
Zack Cerza [Wed, 26 Jul 2023 22:46:46 +0000 (16:46 -0600)]
Merge pull request #2147 from dmick/wip-jenkins-builders
ansible: enhance library/jenkins_node to allow node update
Matan [Wed, 26 Jul 2023 12:45:01 +0000 (15:45 +0300)]
ceph-dev*: build crimson on centos9 (main and reef)
Signed-off-by: Matan Breizman <mbreizma@redhat.com>
Yuri Weinstein [Tue, 25 Jul 2023 02:44:44 +0000 (22:44 -0400)]
Merge pull request #2152 from dmick/wip-vault-pass
ansible: remove vault_password_file; jenkins-build doesn't have one
Reviewed-by: Yuri Weinstein <yweinste@redhat.com>
Dan Mick [Tue, 25 Jul 2023 02:35:26 +0000 (19:35 -0700)]
ansible: remove vault_password_file; jenkins-build doesn't have one
Jenkins also uses ansible for some builds, and this config item breaks
the run, because although the ansible secret isn't needed, if you configure
it, ansible requires it. <sigh>
Signed-off-by: Dan Mick <dmick@redhat.com>
Ilya Dryomov [Mon, 24 Jul 2023 08:55:19 +0000 (10:55 +0200)]
Merge pull request #2150 from lxbsz/centos9
kernel: enable centos9 build
Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
Xiubo Li [Mon, 24 Jul 2023 06:47:35 +0000 (14:47 +0800)]
kernel: enable the centos9 build
Commit 4b61a221e4e2f53dc7df3137fc34f418c3f3b879 just forgot to enable
it after adding centos9.
Signed-off-by: Xiubo Li <xiubli@redhat.com>
Zack Cerza [Fri, 21 Jul 2023 16:10:06 +0000 (10:10 -0600)]
Merge pull request #2149 from dmick/wip-preserve-temp-jobs
jenkins-job-builder: don't delete jobs named 'preserve-*'
Dan Mick [Fri, 21 Jul 2023 02:43:05 +0000 (19:43 -0700)]
jenkins-job-builder: don't delete jobs named 'preserve-*'
This is to allow job development without losing the jobs and past builds
as other ceph-build PRs are pushed. It's assumed that if you create a job
with 'preserve-' in its title that you will clean it up when you're done.
Signed-off-by: Dan Mick <dmick@redhat.com>
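The preservation rule described in the commit above can be sketched as a simple name filter. This is only an illustration, not the actual jenkins-job-builder change: the function name `deletable_jobs` and the sample job names are hypothetical, and the sketch assumes 'preserve-' is matched as a name prefix.

```shell
# Hypothetical helper: print only the job names that are eligible for deletion.
deletable_jobs() {
    for job in "$@"; do
        case "$job" in
            preserve-*) ;;          # preserved job: never offered for deletion
            *) echo "$job" ;;       # any other job may be cleaned up
        esac
    done
}

# Sample (made-up) job names; only the first and last should be printed.
result=$(deletable_jobs ceph-dev preserve-my-test kernel)
echo "$result"
```

A prefix match via `case` keeps the rule cheap to evaluate and easy to read; whoever creates a 'preserve-' job remains responsible for removing it.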
Ilya Dryomov [Thu, 20 Jul 2023 11:14:21 +0000 (13:14 +0200)]
Merge pull request #2141 from ceph/wip-tcmu-runner-repo-url
tcmu-runner: set $RELEASE before it's used
Reviewed-by: Adam Kraitman <akraitma@redhat.com>
Laura Flores [Wed, 19 Jul 2023 21:28:02 +0000 (16:28 -0500)]
Merge pull request #2148 from ljflores/wip-increase-timeout
ceph-dev-new-setup/config/definitions: increase timeout for cloning ci repo
Laura Flores [Mon, 17 Jul 2023 21:44:44 +0000 (16:44 -0500)]
ceph-dev-new-setup/config/definitions: increase timeout for cloning ci repo
Might help address failures like
```
fatal: fetch-pack: invalid index-pack output
```
and
```
error: rev-list died of signal 15
error: github.com:ceph/ceph-ci.git did not send all necessary objects
```
Signed-off-by: Laura Flores <lflores@redhat.com>
Dan Mick [Sat, 15 Jul 2023 01:09:08 +0000 (18:09 -0700)]
library/jenkins_node: improve diagnostics on failure.
Capture exception args for exceptions that have no msg or message,
and capture a traceback.
Signed-off-by: Dan Mick <dmick@redhat.com>
Dan Mick [Sat, 15 Jul 2023 01:07:32 +0000 (18:07 -0700)]
examples/builder.yml, library/jenkins_node: support node update
Original implementation would only permit nodes to be created if they
did not exist. Add support to update certain attributes of the node
with a run against extant nodes. Only permit a few attribute updates.
This is primarily motivated by the desire to support maintaining node
labels with ansible.
Signed-off-by: Dan Mick <dmick@redhat.com>
Dan Mick [Sat, 15 Jul 2023 01:02:20 +0000 (18:02 -0700)]
Merge pull request #2139 from dmick/wip-jenkins-builders
Fix up a number of problems with ansible/examples/builder.yml
Nizamudeen A [Fri, 14 Jul 2023 12:11:10 +0000 (17:41 +0530)]
Merge pull request #2146 from rhcs-dashboard/cleanup-script-cephadm-e2e-nightly
dashboard-cephadm-e2e-nightly: move the cleanup script to postbuild step
Dan Mick [Thu, 8 Jun 2023 09:07:09 +0000 (02:07 -0700)]
library/jenkins_node: use python3
Signed-off-by: Dan Mick <dmick@redhat.com>
Dan Mick [Thu, 8 Jun 2023 09:05:24 +0000 (02:05 -0700)]
files/ssh/hostkeys/github.com.pub: update the github host key
github reissued its host key in March 2023 (see
https://github.blog/2023-03-23-we-updated-our-rsa-ssh-host-key/).
Record here for use in setting up jenkins builders.
Signed-off-by: Dan Mick <dmick@redhat.com>
Dan Mick [Thu, 8 Jun 2023 09:03:39 +0000 (02:03 -0700)]
ansible/examples/builder.yml: kill any rogue agent/slave.jar procs
Some time ago slave.jar was renamed agent.jar, and there were
builders running the old version, sometimes as root (which caused
problems when the job would check out git workspaces as root
that could then not be removed by a job running as jenkins-build).
Clean up the crufty procs, if any.
Signed-off-by: Dan Mick <dmick@redhat.com>
Dan Mick [Thu, 8 Jun 2023 09:02:50 +0000 (02:02 -0700)]
ansible/examples/builder.yml: fix egregious spelling error
also add a tag for the register step; will be useful for updating
tags once I get updating working
Signed-off-by: Dan Mick <dmick@redhat.com>
Dan Mick [Tue, 11 Jul 2023 21:31:37 +0000 (14:31 -0700)]
Merge pull request #2144 from ljflores/wip-crimson-pacific-2
ceph-dev-cron, ceph-dev-new-trigger: don't build crimson for pacific
Dan Mick [Thu, 8 Jun 2023 09:02:07 +0000 (02:02 -0700)]
ansible/examples/builder.yml: use chown -R to change perms
chown -R is *much* faster than the ansible file module
Signed-off-by: Dan Mick <dmick@redhat.com>
Dan Mick [Thu, 8 Jun 2023 09:01:10 +0000 (02:01 -0700)]
ansible/examples/builder.yml: set builder labels from group_vars
Signed-off-by: Dan Mick <dmick@redhat.com>
Dan Mick [Thu, 8 Jun 2023 08:59:28 +0000 (01:59 -0700)]
ansible/examples/builder.yml: use secret for jenkins_api_token
Put it into variable 'token' for all the other tasks.
Signed-off-by: Dan Mick <dmick@redhat.com>
Nizamudeen A [Mon, 10 Jul 2023 06:06:29 +0000 (11:36 +0530)]
dashboard-cephadm-e2e-nightly: move the cleanup script to postbuild step
Signed-off-by: Nizamudeen A <nia@redhat.com>
Nizamudeen A [Mon, 10 Jul 2023 05:26:45 +0000 (10:56 +0530)]
Merge pull request #2145 from rhcs-dashboard/cleanup-script-cephadm-e2e
dashboard-cephadm-e2e: move the cleanup script to postbuild step
Nizamudeen A [Fri, 7 Jul 2023 10:27:49 +0000 (15:57 +0530)]
dashboard-cephadm-e2e: move the cleanup script to postbuild step
Signed-off-by: Nizamudeen A <nia@redhat.com>
Laura Flores [Thu, 6 Jul 2023 16:47:48 +0000 (11:47 -0500)]
ceph-dev-cron, ceph-dev-new-trigger: don't build crimson for pacific
Followup to PR https://github.com/ceph/ceph-build/pull/2142; removes
crimson pacific from additional build jobs.
Signed-off-by: Laura Flores <lflores@redhat.com>
Dan Mick [Thu, 29 Jun 2023 00:52:47 +0000 (17:52 -0700)]
Merge pull request #2143 from dmick/wip-sync-pull
scripts/sync-pull: update argument check to look for $3
Dan Mick [Wed, 28 Jun 2023 23:14:19 +0000 (16:14 -0700)]
scripts/sync-pull: update argument check to look for $3
Catch the error where three arguments are not supplied.
Signed-off-by: Dan Mick <dmick@redhat.com>
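A check of this kind can be sketched as follows. It is a minimal illustration and not the code in scripts/sync-pull: the function name `require_three_args` and the usage text are hypothetical.

```shell
# Hypothetical argument check: fail unless a third positional argument is present.
require_three_args() {
    if [ -z "${3:-}" ]; then
        echo "error: three arguments are required" >&2
        return 1
    fi
    return 0
}

# With two arguments the check fails; with three it passes.
two_args_status=0
require_three_args one two || two_args_status=$?
three_args_status=0
require_three_args one two three || three_args_status=$?
echo "two args: ${two_args_status}, three args: ${three_args_status}"
```

Testing `$3` rather than `$1` or `$2` catches every shorter argument list with a single comparison.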
Zack Cerza [Mon, 26 Jun 2023 16:56:37 +0000 (10:56 -0600)]
Merge pull request #2142 from ljflores/wip-crimson-pacific
Laura Flores [Mon, 26 Jun 2023 15:47:13 +0000 (10:47 -0500)]
ceph-dev-trigger/config/definitions: don't build crimson for pacific
Crimson changes will not be backported to pacific, so it doesn't make
sense to keep building it.
Signed-off-by: Laura Flores <lflores@redhat.com>
Ilya Dryomov [Sat, 24 Jun 2023 11:46:35 +0000 (13:46 +0200)]
tcmu-runner: fix CEPH_BRANCH parameter description
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
Ilya Dryomov [Sat, 24 Jun 2023 11:32:37 +0000 (13:32 +0200)]
tcmu-runner: set $RELEASE before it's used
get_rpm_dist() call is misplaced. The Ceph repo URL ends up missing
the distro version which causes the job to fail after some timeouts:
REPO_URL=https://shaman.ceph.com/api/repos/ceph/main/latest/centos//flavors/default/repo
...
curl --fail -L https://shaman.ceph.com/api/repos/ceph/main/latest/centos//flavors/default/repo
...
curl: (22) The requested URL returned error: 504 Gateway Timeout
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
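The ordering problem described above can be reproduced in a few lines. This is a sketch under stated assumptions, not the tcmu-runner build script: `set_release` is a hypothetical stand-in for get_rpm_dist(), and the value "8" is a placeholder distro version.

```shell
# Hypothetical stand-in for get_rpm_dist(): the real function detects the distro.
set_release() {
    RELEASE="8"   # placeholder value, chosen only for illustration
}

# Build the repo URL from whatever RELEASE holds at call time.
build_repo_url() {
    echo "https://shaman.ceph.com/api/repos/ceph/main/latest/centos/${RELEASE}/flavors/default/repo"
}

RELEASE=""
broken_url=$(build_repo_url)   # RELEASE not set yet: yields ".../centos//flavors/..."
set_release
fixed_url=$(build_repo_url)    # RELEASE set first: the version segment is filled in

echo "$broken_url"
echo "$fixed_url"
```

Writing the expansion as `${RELEASE:?}` would make an unset or empty value fail immediately instead of producing a URL that only fails after a timeout.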
Guillaume Abrioux [Mon, 12 Jun 2023 22:13:57 +0000 (00:13 +0200)]
Merge pull request #2140 from dmick/wip-ceph-release-rpm
ceph-release-rpm build script missing space before =
Dan Mick [Mon, 12 Jun 2023 22:08:32 +0000 (15:08 -0700)]
ceph-release-rpm build script missing space before =
This causes a syntax error. Also, the script didn't have -e set, so
the error went unnoticed.
Signed-off-by: Dan Mick <dmick@redhat.com>
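Why a missing -e hides such an error can be shown with a small sketch. Here `false` merely stands in for the failing line; the real ceph-release-rpm script is not reproduced.

```shell
# Without -e the shell carries on after a failing command.
without_e=$(sh -c 'false; echo continued')

# With -e the shell stops at the first failing command, so nothing is printed.
# The "|| true" only keeps this demonstration itself from aborting.
with_e=$(sh -ec 'false; echo continued' || true)

echo "without -e: '${without_e}'"
echo "with -e:    '${with_e}'"
```

With -e in place the job fails at the faulty line, which is what brings the error to someone's attention.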
Laura Flores [Mon, 5 Jun 2023 17:32:50 +0000 (12:32 -0500)]
Merge pull request #2137 from ljflores/wip-ceph-dev-new-build
ceph-dev-new-build/config/definitions: give proper permission to remove a folder
Laura Flores [Mon, 5 Jun 2023 17:02:28 +0000 (12:02 -0500)]
ceph-dev-new-build/config/definitions: give proper permission to remove a folder
Solution to fix this in ceph-dev-new-build:
```
[EnvInject] - Inject global passwords.
[EnvInject] - Mask passwords that will be passed as build parameters.
Failed to evaluate name macro:org.jenkinsci.plugins.tokenmacro.MacroEvaluationException: Error processing tokens
[gigantic] $ /bin/sh -xe /tmp/jenkins6948878898411108393.sh
+ echo 'Cleaning up top-level workarea (shared among workspaces)'
Cleaning up top-level workarea (shared among workspaces)
+ sudo rm -rf dist
+ rm -rf venv
+ rm -rf release
rm: cannot remove 'release/18.0.0-4267-g48627125/ceph-18.0.0-4267-g48627125': Permission denied
rm: cannot remove 'release/18.0.0-4267-g48627125/version': Permission denied
rm: cannot remove 'release/18.0.0-4267-g48627125/ceph_18.0.0-4267-g48627125.orig.tar.gz': Permission denied
rm: cannot remove 'release/18.0.0-4267-g48627125/branch': Permission denied
rm: cannot remove 'release/18.0.0-4267-g48627125/ceph_18.0.0-4267-g48627125-1.dsc': Permission denied
rm: cannot remove 'release/18.0.0-4267-g48627125/other_envvars': Permission denied
rm: cannot remove 'release/18.0.0-4267-g48627125/ceph.spec': Permission denied
...
rm: cannot remove 'release/18.0.0-4267-g48627125/rpm/el8/BUILD/ceph-18.0.0-4267-g48627125/qa/suites/rgw/dbstore/cluster.yaml': Permission denied
rm: cannot remove 'release/18.0.0-4267-g48627125/rpm/el8/BUILD/ceph-18.0.0-4267-g48627125/qa/suites/rgw/dbstore/overrides.yaml': Permission denied
rm: cannot remove 'release/18.0.0-4267-g48627125/rpm/el8/BUILD/ceph-18.0.0-4267-g48627125/qa/suites/rgw/dbstore/s3tests-branch.yaml': Permission denied
rm: cannot remove 'release/18.0.0-4267-g48627125/rpm/el8/BUILD/ceph-18.0.0-4267-g48627125/qa/suites/rgw/dbstore/supported-random-distro$': Permission denied
rm: cannot remove 'release/18.0.0-4267-g48627125/rpm/el8/BUILD/ceph-18.0.0-4267-g48627125/qa/suites/rgw/dbstore/tasks/rgw_s3tests.yaml': Permission denied
rm: cannot remove 'release/18.0.0-4267-g48627125/rpm/el8/BUILD/ceph-18.0.0-4267-g48627125/qa/suites/rgw/dbstore/tasks/.qa': Permission denied
rm: cannot remove 'release/18.0.0-4267-g48627125/rpm/el8/BUILD/ceph-18.0.0-4267-g48627125/qa/suites/rgw/dbstore/.qa': Permission denied
rm: cannot remove 'release/18.0.0-4267-g48627125/rpm/el8/BUILD/ceph-18.0.0-4267-g48627125/qa/suites/rgw/hadoop-s3a/%': Permission denied
rm: cannot remove 'release/18.0.0-4267-g48627125/rpm/el8/BUILD/ceph-18.0.0-4267-g48627125/qa/suites/rgw/hadoop-s3a/clusters/fixed-2.yaml': Permission denied
```
Also add a line to list ownership before removing folders.
Signed-off-by: Laura Flores <lflores@redhat.com>
Ilya Dryomov [Mon, 29 May 2023 09:17:32 +0000 (11:17 +0200)]
Merge pull request #2136 from petrutlucian94/win_bluestore
windows: use bluestore instead of memstore by default
Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
Lucian Petrut [Thu, 25 May 2023 07:55:39 +0000 (07:55 +0000)]
windows: use bluestore instead of memstore by default
We've been using memstore in order to put as much load as possible
on the Windows clients in a relatively short timeframe.
However, we've been hitting memstore segfaults:
https://tracker.ceph.com/issues/61416
For this reason, we'll make the OSD backend configurable,
defaulting to bluestore for now.
Signed-off-by: Lucian Petrut <lpetrut@cloudbasesolutions.com>
Zack Cerza [Thu, 25 May 2023 20:55:55 +0000 (14:55 -0600)]
Merge pull request #2135 from dmick/main
Dan Mick [Thu, 25 May 2023 02:34:53 +0000 (19:34 -0700)]
ceph-dev-new-trigger: keep 50 jobs to help diagnose dual builds
Signed-off-by: Dan Mick <dmick@redhat.com>
Zack Cerza [Mon, 22 May 2023 21:19:24 +0000 (15:19 -0600)]
Merge pull request #2127 from dmick/main
Zack Cerza [Mon, 22 May 2023 20:47:22 +0000 (14:47 -0600)]
Merge pull request #2134 from dmick/wip-build-utils
Dan Mick [Sat, 20 May 2023 00:22:28 +0000 (17:22 -0700)]
scripts/build_utils.sh: limit .gz files copied to chacra
Don't copy *every* .gz file from the release tree
(which includes the source tree); just ceph*.gz
Fixes: https://tracker.ceph.com/issues/61306
Signed-off-by: Dan Mick <dmick@redhat.com>
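The narrowing from "every .gz" to "just ceph*.gz" can be sketched with a throwaway directory tree. The file names below are made up for illustration and the `find` invocation is an assumption; the actual selection logic in scripts/build_utils.sh may differ.

```shell
# Build a small stand-in for a release tree that also contains source files.
tmp=$(mktemp -d)
mkdir -p "$tmp/release/src"
touch "$tmp/release/ceph-1.0.tar.gz" \
      "$tmp/release/notes.gz" \
      "$tmp/release/src/testdata.gz"

# Matching every .gz picks up unrelated files from the source tree.
all_gz=$(find "$tmp/release" -name '*.gz' | wc -l | tr -d ' ')

# Restricting the pattern to ceph*.gz keeps only the release tarball.
ceph_gz=$(find "$tmp/release" -name 'ceph*.gz' | wc -l | tr -d ' ')

echo "all .gz files: ${all_gz}, ceph*.gz files: ${ceph_gz}"
rm -rf "$tmp"
```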
Zack Cerza [Fri, 5 May 2023 22:39:01 +0000 (16:39 -0600)]
Merge pull request #2132 from ceph/jjb-wheel
Zack Cerza [Fri, 5 May 2023 22:16:45 +0000 (16:16 -0600)]
build_utils.sh: Install wheel
Signed-off-by: Zack Cerza <zack@redhat.com>
Zack Cerza [Fri, 5 May 2023 22:07:08 +0000 (16:07 -0600)]
jenkins-job-builder: Install dataclasses
Signed-off-by: Zack Cerza <zack@redhat.com>
Laura Flores [Fri, 5 May 2023 20:09:02 +0000 (15:09 -0500)]
Merge pull request #2130 from ljflores/wip-requests-toolbelt
scripts/build_utils.sh: pin urllib3 and requests
Laura Flores [Thu, 4 May 2023 22:48:17 +0000 (17:48 -0500)]
scripts/build_utils.sh: pin urllib3 and requests
The latest version of chacractl requires requests-toolbelt==0.9.1.
This is not compatible with the latest version of urllib3, which
dropped support for appengine.
We need to pin urllib3 to an older version that still supports
appengine, and requests to an older version that still supports
the older urllib3 until chacractl allows a higher version of
requests-toolbelt.
Fixes: https://tracker.ceph.com/issues/59652
Signed-off-by: Laura Flores <lflores@redhat.com>
Guillaume Abrioux [Thu, 27 Apr 2023 08:17:06 +0000 (10:17 +0200)]
Merge pull request #2129 from ceph/cephadm-ansible-reef
cephadm-ansible: add el9 testing
Guillaume Abrioux [Thu, 27 Apr 2023 07:43:18 +0000 (09:43 +0200)]
cephadm-ansible: add el9 testing
el9 RPMs weren't available so `el9` distribution was left commented out so
far. Given that we now have el9 packages, let's add el9 testing.
Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>
Nizamudeen A [Thu, 27 Apr 2023 05:54:42 +0000 (11:24 +0530)]
Merge pull request #2128 from dmick/wip-remove-ceph-qa-emails
ceph-api-nightly, ceph-dashboard-cephadm-e2e-nightly: stop emails
Dan Mick [Thu, 27 Apr 2023 00:30:04 +0000 (17:30 -0700)]
ceph-api-nightly, ceph-dashboard-cephadm-e2e-nightly: stop emails
Remove the email notifications from the configuration. They're
just cluttering the ceph-qa inbox and no one is acting on them.
Signed-off-by: Dan Mick <dan.mick@redhat.com>
Dan Mick [Wed, 19 Apr 2023 02:36:41 +0000 (19:36 -0700)]
ceph-dev-setup: save only 25 builds (rather than 2 weeks)
ceph-dev-setup is using 67G out of the 200G on /var/lib/jenkins,
far larger than any other job. There's no reason to save all these jobs.
Signed-off-by: Dan Mick <dmick@redhat.com>
Dan Mick [Fri, 7 Apr 2023 21:18:01 +0000 (14:18 -0700)]
Merge pull request #2122 from ceph/add_centos9_main
ceph-release-rpm: add CentOS 9 build
Ken Dreyer [Fri, 7 Apr 2023 15:24:34 +0000 (11:24 -0400)]
Merge pull request #2125 from dmick/wip-sign-rpms
Use createrepo_c, and change order of signing operations to avoid invalid repomd.xml sigs
Dan Mick [Fri, 7 Apr 2023 10:20:50 +0000 (03:20 -0700)]
scripts/sign-rpms: signing repomd.xml has to come after all updates
We had been signing, and then running createrepo, which changed repomd
and thus invalidated the signature.
Signed-off-by: Dan Mick <dmick@redhat.com>
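The reason the order matters can be illustrated without any signing tools. In this sketch a checksum stands in for the detached signature and a plain file rewrite stands in for createrepo regenerating repomd.xml; neither is what scripts/sign-rpms actually runs.

```shell
tmp=$(mktemp -d)

# Initial metadata, then "sign" it (a checksum stands in for the real signature).
echo "metadata v1" > "$tmp/repomd.xml"
signed_sum=$(cksum < "$tmp/repomd.xml")

# Regenerating the metadata afterwards changes the file that was signed.
echo "metadata v2" > "$tmp/repomd.xml"
current_sum=$(cksum < "$tmp/repomd.xml")

if [ "$signed_sum" != "$current_sum" ]; then
    echo "signature no longer matches repomd.xml; sign after the last update"
fi
rm -rf "$tmp"
```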
Dan Mick [Fri, 7 Apr 2023 10:18:50 +0000 (03:18 -0700)]
scripts/sign-rpms: use createrepo_c on newer Ubuntu releases (post-focal)
Signed-off-by: Dan Mick <dmick@redhat.com>
Zack Cerza [Thu, 6 Apr 2023 17:58:03 +0000 (11:58 -0600)]
Merge pull request #2124 from ceph/signing-fixes
scripts: signing fixes
Ken Dreyer [Thu, 6 Apr 2023 16:09:57 +0000 (12:09 -0400)]
sync-pull: use jq instead of python
We installed jq on the signer host so we do not need /usr/bin/python
Ken Dreyer [Thu, 6 Apr 2023 16:03:49 +0000 (12:03 -0400)]
scripts: make signing scripts executable
This makes it easier to run these interactively.
Nizamudeen A [Tue, 4 Apr 2023 06:02:22 +0000 (11:32 +0530)]
Merge pull request #2123 from rhcs-dashboard/grafana-update
ceph-grafana-trigger: update the grafana version
Nizamudeen A [Mon, 3 Apr 2023 16:25:45 +0000 (21:55 +0530)]
ceph-grafana-trigger: update the grafana version
Signed-off-by: Nizamudeen A <nia@redhat.com>
Ernesto Puerta [Wed, 22 Mar 2023 19:11:21 +0000 (20:11 +0100)]
ceph-release-rpm: add CentOS 9 build
Guillaume Abrioux [Wed, 8 Mar 2023 18:23:24 +0000 (19:23 +0100)]
Merge pull request #2121 from ceph/cv-test-suite-update
ceph-volume: do not test filestore against reef onward
Guillaume Abrioux [Wed, 8 Mar 2023 15:58:25 +0000 (16:58 +0100)]
ceph-volume: do not test filestore against reef onward
filestore support has been dropped as of Reef release.
This commit drops ceph-volume filestore testing for reef onward.
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
Ionut BALUTOIU [Mon, 6 Mar 2023 09:57:12 +0000 (11:57 +0200)]
Merge pull request #2120 from petrutlucian94/mgr_deps
windows: install python3-bcrypt dependency
Lucian Petrut [Sat, 4 Mar 2023 07:28:05 +0000 (09:28 +0200)]
windows: install python3-bcrypt dependency
The Windows job is failing because of a missing mgr dependency.
We'll update the setup script, ensuring that python3-bcrypt is
installed.
Signed-off-by: Lucian Petrut <lpetrut@cloudbasesolutions.com>
Josh Durgin [Thu, 2 Mar 2023 23:39:19 +0000 (15:39 -0800)]
Merge pull request #2119 from ceph/wip-reef
Update releases - add reef, remove octopus and older
Reviewed-by: Neha Ojha <nojha@redhat.com>
Josh Durgin [Thu, 2 Mar 2023 18:14:16 +0000 (13:14 -0500)]
Update releases - add reef, remove octopus and older
Signed-off-by: Josh Durgin <jdurgin@redhat.com>
Dan Mick [Thu, 2 Mar 2023 11:07:12 +0000 (03:07 -0800)]
Merge pull request #2118 from ceph/ceph-devstack
Add ceph-devstack job
Zack Cerza [Fri, 17 Feb 2023 21:26:45 +0000 (14:26 -0700)]
Add ceph-devstack job
Signed-off-by: Zack Cerza <zack@redhat.com>
Lucian Petrut [Fri, 17 Feb 2023 11:43:59 +0000 (13:43 +0200)]
Merge pull request #2117 from ionutbalutoiu/fix-ssh-exec
utils: fix ssh_exec function
Ionut Balutoiu [Fri, 17 Feb 2023 11:38:43 +0000 (13:38 +0200)]
utils: fix ssh_exec function
We need to use `return` inside the function to properly return the
exit code of the function. Using `exit` will abruptly exit the
entire script, which is not what we want.
Signed-off-by: Ionut Balutoiu <ibalutoiu@cloudbasesolutions.com>
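The difference between `return` and `exit` inside a function can be sketched as follows. `run_and_return` is a hypothetical stand-in for `ssh_exec`; it runs a local command instead of opening an SSH connection.

```shell
# Hypothetical stand-in for ssh_exec: run a command and report its exit code.
run_and_return() {
    rc=0
    "$@" || rc=$?
    # `return` hands the code back to the caller; `exit "$rc"` here would
    # terminate the entire script instead.
    return "$rc"
}

status=0
run_and_return false || status=$?
echo "command failed with status ${status}, but the script is still running"
```

Because the function returns rather than exits, the caller decides whether a failure is fatal, for example by collecting logs before giving up.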
Lucian Petrut [Fri, 17 Feb 2023 07:33:35 +0000 (09:33 +0200)]
Merge pull request #2116 from ionutbalutoiu/enable-win-event-logs
ceph-windows: Re-enable Windows event logs collection
Ionut Balutoiu [Thu, 16 Feb 2023 17:43:13 +0000 (19:43 +0200)]
ceph-windows: Enable Windows event log collection
The Windows event log collection was failing due to `ssh_exec` abruptly
closing the connection, because the default timeout of `30 secs` was exceeded.
This change re-enables the Windows event log collection by increasing the
SSH timeout to `30 minutes`.
Additionally, the `ssh_exec` function is updated to print an appropriate
error message when SSH timeout is exceeded.
Signed-off-by: Ionut Balutoiu <ibalutoiu@cloudbasesolutions.com>
Ionut Balutoiu [Thu, 16 Feb 2023 17:30:59 +0000 (19:30 +0200)]
ceph-windows: Cleanup collect-event-logs.ps1 script
* Rely on the default `$ErrorActionPreference` value (which is `Continue`).
* Add a new function `SanitizeName` that is used to sanitize the name of the log files.
The function code existed before, but it was duplicated in the script.
* General PowerShell code cleanup.
Signed-off-by: Ionut Balutoiu <ibalutoiu@cloudbasesolutions.com>
Ionut BALUTOIU [Thu, 16 Feb 2023 09:02:54 +0000 (11:02 +0200)]
Merge pull request #2115 from petrutlucian94/log_collection_errors
windows: install dokany
Lucian Petrut [Thu, 16 Feb 2023 08:41:06 +0000 (10:41 +0200)]
windows: install dokany
Dokany is required in order to map cephfs on Windows through
ceph-dokan.
For this reason, we're updating the Windows image build script
to include Dokany.
Signed-off-by: Lucian Petrut <lpetrut@cloudbasesolutions.com>
Ionut BALUTOIU [Thu, 16 Feb 2023 08:14:17 +0000 (10:14 +0200)]
Merge pull request #2114 from petrutlucian94/log_collection_errors
windows: temporarily disable event log collection
Lucian Petrut [Thu, 16 Feb 2023 08:02:43 +0000 (10:02 +0200)]
windows: temporarily disable event log collection
We're attempting to collect Windows event logs at the end of the
job.
The issue is that some logs cannot be collected and the job
terminates abruptly despite the error action being set to "ignore".
We'll temporarily skip this step in order to unblock the Windows
job.
Signed-off-by: Lucian Petrut <lpetrut@cloudbasesolutions.com>
Ionut BALUTOIU [Wed, 15 Feb 2023 17:15:27 +0000 (19:15 +0200)]
Merge pull request #2113 from petrutlucian94/log_collection_errors
windows: fix log collection
Lucian Petrut [Wed, 15 Feb 2023 14:41:26 +0000 (16:41 +0200)]
windows: fix log collection
The "Get-WinEvent" command used to retrieve Windows event log
messages can fail if the specified log has no entries.
We're using the "SilentlyContinue" action to avoid erroring out
in such cases.
However, the script still terminates abruptly while collecting
logs. For this reason, we'll use the "Ignore" error action instead.
We'd rather not have test failures just because we failed to
retrieve some Windows event log entries.
Unlike "SilentlyContinue", "Ignore" doesn't populate the global
$Error variable, which may be used when running the script
remotely.
While at it, we're adding some log messages at the end of the
"run_tests" and "collect-event-logs.ps1" scripts.
Signed-off-by: Lucian Petrut <lpetrut@cloudbasesolutions.com>
Lucian Petrut [Wed, 15 Feb 2023 08:48:38 +0000 (10:48 +0200)]
Merge pull request #2112 from stefan-chivu/fix-event-logs-v2
ceph-windows: Fixed txt event log dump
Stefan Chivu [Wed, 15 Feb 2023 08:42:08 +0000 (08:42 +0000)]
ceph-windows: Fixed txt event log dump
The event log collection script was throwing an error if no event
logs could be found for the filters applied. Therefore, the
Get-WinEvent call in DumpEventLogTxt has been modified to silently
continue if such a case arises.
Signed-off-by: Stefan Chivu <schivu@cloudbasesolutions.com>
Lucian Petrut [Tue, 14 Feb 2023 12:05:11 +0000 (14:05 +0200)]
Merge pull request #2111 from stefan-chivu/fix-event-logs
ceph-windows: Fix event log artifacts
Stefan Chivu [Tue, 14 Feb 2023 11:47:03 +0000 (11:47 +0000)]
ceph-windows: Fix event log artifacts
Previously, the windows event log artifacts were being dumped in
evtx format and then converted to txt by individually querying them
using wevtutil.
After doing this, the resulting txt logs contained xml entries that
omitted the actual event message.
Now, they will be directly dumped as txt using Get-WinEvent and the
evtx logs can be obtained using the -IncludeEvtxFiles flag.
Signed-off-by: Stefan Chivu <schivu@cloudbasesolutions.com>
Ionut BALUTOIU [Fri, 10 Feb 2023 13:22:08 +0000 (15:22 +0200)]
Merge pull request #2110 from petrutlucian94/win_logs
windows: collect additional information
Lucian Petrut [Fri, 10 Feb 2023 13:07:12 +0000 (15:07 +0200)]
windows: collect additional information
In some cases, the OSDs seem to crash while running Windows tests.
In order to be able to investigate the issue, we'll need to collect
additional information. This commit fetches the following:
* the entire journal since the last boot
* ceph logs
* available memory at the end of the job
Signed-off-by: Lucian Petrut <lpetrut@cloudbasesolutions.com>
Ilya Dryomov [Fri, 10 Feb 2023 11:57:31 +0000 (12:57 +0100)]
Merge pull request #2109 from petrutlucian94/win_jq_package
windows: install jq package
Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
Lucian Petrut [Fri, 10 Feb 2023 11:47:25 +0000 (13:47 +0200)]
windows: install jq package
Windows jobs run ceph clusters in Ubuntu VMs. This change ensures
that the jq binary used by vstart will be available.
Signed-off-by: Lucian Petrut <lpetrut@cloudbasesolutions.com>
Ionut BALUTOIU [Wed, 8 Feb 2023 13:42:01 +0000 (15:42 +0200)]
Merge pull request #2108 from petrutlucian94/increase_vm_mem
increase vm memory for windows test jobs
Lucian Petrut [Wed, 8 Feb 2023 11:19:12 +0000 (13:19 +0200)]
increase vm memory for windows test jobs
Windows test jobs use two vms: a windows vm that takes 8gb of ram
and a linux one that currently uses 32gb of ram.
We're using memstore with 5gb per osd. It seems that the linux vm
is running out of memory, which is why the OSDs can get
terminated while running the tests.
We'll go ahead and increase the linux vm memory to 64gb.
Signed-off-by: Lucian Petrut <lpetrut@cloudbasesolutions.com>
Ionut BALUTOIU [Tue, 7 Feb 2023 16:21:09 +0000 (18:21 +0200)]
Merge pull request #2107 from petrutlucian94/fix_win_ceph_call
windows: fix running "ceph status"