git.apps.os.sepia.ceph.com Git - ceph-build.git/log
ceph-build.git
2 years agoansible: use 'inventory_hostname' to look up jenkins labels
Dan Mick [Wed, 19 Jul 2023 20:31:31 +0000 (13:31 -0700)]
ansible: use 'inventory_hostname' to look up jenkins labels

inventory_hostname is the name as it appears in the inventory file, and
keeping that consistent is easier than dealing with short vs. long names.
Note: this requires a change to ceph-sepia-secrets to standardize on
the inventory form of the name.

Signed-off-by: Dan Mick <dmick@redhat.com>
2 years agoMerge pull request #2156 from ceph/fix-container-flake8
Guillaume Abrioux [Mon, 7 Aug 2023 15:21:55 +0000 (17:21 +0200)]
Merge pull request #2156 from ceph/fix-container-flake8

ceph-container-flake8: fix test

2 years agoceph-container-flake8: fix test 2156/head
Guillaume Abrioux [Mon, 7 Aug 2023 15:10:57 +0000 (17:10 +0200)]
ceph-container-flake8: fix test

Looks like this job has never worked.
The file should be bind-mounted in /code, not in /.
There's no need to pass arguments: the entrypoint automatically
scans /code and runs flake8 against any files present.

Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>
2 years agoMerge pull request #2154 from Matan-B/wip-matanb-crimson-centos9
Dan Mick [Thu, 3 Aug 2023 07:13:28 +0000 (00:13 -0700)]
Merge pull request #2154 from Matan-B/wip-matanb-crimson-centos9

ceph-dev*: build crimson on centos9 (main and reef)

2 years agoMerge pull request #2155 from petrutlucian94/log_disk_space
Ionut BALUTOIU [Wed, 2 Aug 2023 08:12:46 +0000 (11:12 +0300)]
Merge pull request #2155 from petrutlucian94/log_disk_space

2 years agowindows: log available disk space 2155/head
Lucian Petrut [Wed, 2 Aug 2023 08:01:58 +0000 (08:01 +0000)]
windows: log available disk space

Some Windows test jobs have failed after the OSDs ran out of disk
space. Those jobs use Linux VMs with 128GB of storage and run
3 OSDs, each having 15GB.

In order to get a better picture, we'll log the disk space usage,
including the vstart directory contents.

Signed-off-by: Lucian Petrut <lpetrut@cloudbasesolutions.com>
2 years agoMerge pull request #2147 from dmick/wip-jenkins-builders
Zack Cerza [Wed, 26 Jul 2023 22:46:46 +0000 (16:46 -0600)]
Merge pull request #2147 from dmick/wip-jenkins-builders

ansible: enhance library/jenkins_node to allow node update

2 years agoceph-dev*: build crimson on centos9 (main and reef) 2154/head
Matan [Wed, 26 Jul 2023 12:45:01 +0000 (15:45 +0300)]
ceph-dev*: build crimson on centos9 (main and reef)

Signed-off-by: Matan Breizman <mbreizma@redhat.com>
2 years agoMerge pull request #2152 from dmick/wip-vault-pass
Yuri Weinstein [Tue, 25 Jul 2023 02:44:44 +0000 (22:44 -0400)]
Merge pull request #2152 from dmick/wip-vault-pass

ansible: remove vault_password_file; jenkins-build doesn't have one

Reviewed-by: Yuri Weinstein <yweinste@redhat.com>
2 years agoansible: remove vault_password_file; jenkins-build doesn't have one 2152/head
Dan Mick [Tue, 25 Jul 2023 02:35:26 +0000 (19:35 -0700)]
ansible: remove vault_password_file; jenkins-build doesn't have one

Jenkins also uses ansible for some builds, and this config item breaks
the run, because although the ansible secret isn't needed, if you configure
it, ansible requires it.  <sigh>

Signed-off-by: Dan Mick <dmick@redhat.com>
2 years agoMerge pull request #2150 from lxbsz/centos9
Ilya Dryomov [Mon, 24 Jul 2023 08:55:19 +0000 (10:55 +0200)]
Merge pull request #2150 from lxbsz/centos9

kernel: enable centos9 build

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
2 years agokernel: enable the centos9 build 2150/head
Xiubo Li [Mon, 24 Jul 2023 06:47:35 +0000 (14:47 +0800)]
kernel: enable the centos9 build

Commit 4b61a221e4e2f53dc7df3137fc34f418c3f3b879 added centos9 but
did not enable it.

Signed-off-by: Xiubo Li <xiubli@redhat.com>
2 years agoMerge pull request #2149 from dmick/wip-preserve-temp-jobs
Zack Cerza [Fri, 21 Jul 2023 16:10:06 +0000 (10:10 -0600)]
Merge pull request #2149 from dmick/wip-preserve-temp-jobs

jenkins-job-builder: don't delete jobs named 'preserve-*'

2 years agojenkins-job-builder: don't delete jobs named 'preserve-*' 2149/head
Dan Mick [Fri, 21 Jul 2023 02:43:05 +0000 (19:43 -0700)]
jenkins-job-builder: don't delete jobs named 'preserve-*'

This is to allow job development without losing the jobs and past builds
as other ceph-build PRs are pushed.  It's assumed that if you create a job
with 'preserve-' in its title that you will clean it up when you're done.

Signed-off-by: Dan Mick <dmick@redhat.com>
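The 'preserve-*' rule described in the commit above can be sketched as follows. This is only an illustration under stated assumptions, not the code from the change itself: the function name `jobs_to_delete` and the job names in the example are hypothetical, and the rule is assumed to be a simple prefix match.

```python
def jobs_to_delete(existing_jobs, defined_jobs, keep_prefix="preserve-"):
    """Return jobs on the server that are neither defined nor preserved.

    Hypothetical helper: a job is kept if it is still defined in the repo
    or if its name starts with keep_prefix.
    """
    defined = set(defined_jobs)
    return [
        name
        for name in existing_jobs
        if name not in defined and not name.startswith(keep_prefix)
    ]


# Made-up job names, for illustration only.
stale = jobs_to_delete(
    existing_jobs=["ceph-dev", "preserve-my-experiment", "old-job"],
    defined_jobs=["ceph-dev"],
)
```

With this rule, a job under development survives pushes of unrelated ceph-build PRs until its owner removes it by hand.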
2 years agoMerge pull request #2141 from ceph/wip-tcmu-runner-repo-url
Ilya Dryomov [Thu, 20 Jul 2023 11:14:21 +0000 (13:14 +0200)]
Merge pull request #2141 from ceph/wip-tcmu-runner-repo-url

tcmu-runner: set $RELEASE before it's used

Reviewed-by: Adam Kraitman <akraitma@redhat.com>
2 years agoMerge pull request #2148 from ljflores/wip-increase-timeout
Laura Flores [Wed, 19 Jul 2023 21:28:02 +0000 (16:28 -0500)]
Merge pull request #2148 from ljflores/wip-increase-timeout

ceph-dev-new-setup/config/definitions: increase timeout for cloning ci repo

2 years agoceph-dev-new-setup/config/definitions: increase timeout for cloning ci repo 2148/head
Laura Flores [Mon, 17 Jul 2023 21:44:44 +0000 (16:44 -0500)]
ceph-dev-new-setup/config/definitions: increase timeout for cloning ci repo

Might help address failures like
```
fatal: fetch-pack: invalid index-pack output
```
and
```
error: rev-list died of signal 15
error: github.com:ceph/ceph-ci.git did not send all necessary objects
```

Signed-off-by: Laura Flores <lflores@redhat.com>
2 years agolibrary/jenkins_node: improve diagnostics on failure. 2147/head
Dan Mick [Sat, 15 Jul 2023 01:09:08 +0000 (18:09 -0700)]
library/jenkins_node: improve diagnostics on failure.

Capture exception args for exceptions that have no msg or message,
and capture a traceback.

Signed-off-by: Dan Mick <dmick@redhat.com>
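The diagnostic pattern described in the commit above might look roughly like this. It is a minimal sketch, not the actual library/jenkins_node code: the helper name `describe_exception` is hypothetical, and the example exception is made up.

```python
import traceback


def describe_exception(exc):
    """Return (text, traceback_text) for an exception (hypothetical helper)."""
    # Prefer an explicit msg/message attribute when the exception has one.
    text = getattr(exc, "msg", None) or getattr(exc, "message", None)
    if not text:
        # Otherwise fall back to the raw args tuple so nothing is lost.
        text = repr(exc.args)
    # Capture the full traceback as a string for the failure report.
    tb = "".join(traceback.format_exception(type(exc), exc, exc.__traceback__))
    return text, tb


try:
    raise ValueError("bad label", 42)
except ValueError as err:
    text, tb = describe_exception(err)
```

Falling back to `args` matters for exceptions that carry only positional arguments, which would otherwise produce an empty failure message.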
2 years agoexamples/builder.yml, library/jenkins_node: support node update
Dan Mick [Sat, 15 Jul 2023 01:07:32 +0000 (18:07 -0700)]
examples/builder.yml, library/jenkins_node: support node update

Original implementation would only permit nodes to be created if they
did not exist.  Add support to update certain attributes of the node
with a run against extant nodes.  Only permit a few attribute updates.
This is primarily motivated by the desire to support maintaining node
labels with ansible.

Signed-off-by: Dan Mick <dmick@redhat.com>
2 years agoMerge pull request #2139 from dmick/wip-jenkins-builders
Dan Mick [Sat, 15 Jul 2023 01:02:20 +0000 (18:02 -0700)]
Merge pull request #2139 from dmick/wip-jenkins-builders

Fix up a number of problems with ansible/examples/builder.yml

2 years agoMerge pull request #2146 from rhcs-dashboard/cleanup-script-cephadm-e2e-nightly
Nizamudeen A [Fri, 14 Jul 2023 12:11:10 +0000 (17:41 +0530)]
Merge pull request #2146 from rhcs-dashboard/cleanup-script-cephadm-e2e-nightly

dashboard-cephadm-e2e-nightly: move the cleanup script to postbuild step

2 years agolibrary/jenkins_node: use python3 2139/head
Dan Mick [Thu, 8 Jun 2023 09:07:09 +0000 (02:07 -0700)]
library/jenkins_node: use python3

Signed-off-by: Dan Mick <dmick@redhat.com>
2 years agofiles/ssh/hostkeys/github.com.pub: update the github host key
Dan Mick [Thu, 8 Jun 2023 09:05:24 +0000 (02:05 -0700)]
files/ssh/hostkeys/github.com.pub: update the github host key

github reissued its host key in March 2023 (see
https://github.blog/2023-03-23-we-updated-our-rsa-ssh-host-key/).
Record here for use in setting up jenkins builders.

Signed-off-by: Dan Mick <dmick@redhat.com>
2 years agoansible/examples/builder.yml: kill any rogue agent/slave.jar procs
Dan Mick [Thu, 8 Jun 2023 09:03:39 +0000 (02:03 -0700)]
ansible/examples/builder.yml: kill any rogue agent/slave.jar procs

Some time ago slave.jar was renamed agent.jar, and there were
builders running the old version, sometimes as root (which caused
problems when the job would check out git workspaces as root
that could then not be removed by a job running as jenkins-build).
Clean up the crufty procs, if any.

Signed-off-by: Dan Mick <dmick@redhat.com>
2 years agoansible/examples/builder.yml: fix egregious spelling error
Dan Mick [Thu, 8 Jun 2023 09:02:50 +0000 (02:02 -0700)]
ansible/examples/builder.yml: fix egregious spelling error

also add a tag for the register step; will be useful for updating
tags once I get updating working

Signed-off-by: Dan Mick <dmick@redhat.com>
2 years agoMerge pull request #2144 from ljflores/wip-crimson-pacific-2
Dan Mick [Tue, 11 Jul 2023 21:31:37 +0000 (14:31 -0700)]
Merge pull request #2144 from ljflores/wip-crimson-pacific-2

ceph-dev-cron, ceph-dev-new-trigger: don't build crimson for pacific

2 years agoansible/examples/builder.yml: use chown -R to change perms
Dan Mick [Thu, 8 Jun 2023 09:02:07 +0000 (02:02 -0700)]
ansible/examples/builder.yml: use chown -R to change perms

chown -R is *much* faster than the ansible file module

Signed-off-by: Dan Mick <dmick@redhat.com>
2 years agoansible/examples/builder.yml: set builder labels from group_vars
Dan Mick [Thu, 8 Jun 2023 09:01:10 +0000 (02:01 -0700)]
ansible/examples/builder.yml: set builder labels from group_vars

Signed-off-by: Dan Mick <dmick@redhat.com>
2 years agoansible/examples/builder.yml: use secret for jenkins_api_token
Dan Mick [Thu, 8 Jun 2023 08:59:28 +0000 (01:59 -0700)]
ansible/examples/builder.yml: use secret for jenkins_api_token

Put it into variable 'token' for all the other tasks.

Signed-off-by: Dan Mick <dmick@redhat.com>
2 years agodashboard-cephadm-e2e-nightly: move the cleanup script to postbuild step 2146/head
Nizamudeen A [Mon, 10 Jul 2023 06:06:29 +0000 (11:36 +0530)]
dashboard-cephadm-e2e-nightly: move the cleanup script to postbuild step

Signed-off-by: Nizamudeen A <nia@redhat.com>
2 years agoMerge pull request #2145 from rhcs-dashboard/cleanup-script-cephadm-e2e
Nizamudeen A [Mon, 10 Jul 2023 05:26:45 +0000 (10:56 +0530)]
Merge pull request #2145 from rhcs-dashboard/cleanup-script-cephadm-e2e

dashboard-cephadm-e2e: move the cleanup script to postbuild step

2 years agodashboard-cephadm-e2e: move the cleanup script to postbuild step 2145/head
Nizamudeen A [Fri, 7 Jul 2023 10:27:49 +0000 (15:57 +0530)]
dashboard-cephadm-e2e: move the cleanup script to postbuild step

Signed-off-by: Nizamudeen A <nia@redhat.com>
2 years agoceph-dev-cron, ceph-dev-new-trigger: don't build crimson for pacific 2144/head
Laura Flores [Thu, 6 Jul 2023 16:47:48 +0000 (11:47 -0500)]
ceph-dev-cron, ceph-dev-new-trigger: don't build crimson for pacific

Followup to PR https://github.com/ceph/ceph-build/pull/2142; removes
crimson pacific from additional build jobs.

Signed-off-by: Laura Flores <lflores@redhat.com>
2 years agoMerge pull request #2143 from dmick/wip-sync-pull
Dan Mick [Thu, 29 Jun 2023 00:52:47 +0000 (17:52 -0700)]
Merge pull request #2143 from dmick/wip-sync-pull

scripts/sync-pull: update argument check to look for $3

2 years agoscripts/sync-pull: update argument check to look for $3 2143/head
Dan Mick [Wed, 28 Jun 2023 23:14:19 +0000 (16:14 -0700)]
scripts/sync-pull: update argument check to look for $3

Catch the error where three arguments are not supplied.

Signed-off-by: Dan Mick <dmick@redhat.com>
2 years agoMerge pull request #2142 from ljflores/wip-crimson-pacific
Zack Cerza [Mon, 26 Jun 2023 16:56:37 +0000 (10:56 -0600)]
Merge pull request #2142 from ljflores/wip-crimson-pacific

2 years agoceph-dev-trigger/config/definitions: don't build crimson for pacific 2142/head
Laura Flores [Mon, 26 Jun 2023 15:47:13 +0000 (10:47 -0500)]
ceph-dev-trigger/config/definitions: don't build crimson for pacific

Crimson changes will not be backported to pacific, so it doesn't make
sense to keep building it.

Signed-off-by: Laura Flores <lflores@redhat.com>
2 years agotcmu-runner: fix CEPH_BRANCH parameter description 2141/head
Ilya Dryomov [Sat, 24 Jun 2023 11:46:35 +0000 (13:46 +0200)]
tcmu-runner: fix CEPH_BRANCH parameter description

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2 years agotcmu-runner: set $RELEASE before it's used
Ilya Dryomov [Sat, 24 Jun 2023 11:32:37 +0000 (13:32 +0200)]
tcmu-runner: set $RELEASE before it's used

get_rpm_dist() call is misplaced.  The Ceph repo URL ends up missing
the distro version which causes the job to fail after some timeouts:

  REPO_URL=https://shaman.ceph.com/api/repos/ceph/main/latest/centos//flavors/default/repo
  ...
  curl --fail -L https://shaman.ceph.com/api/repos/ceph/main/latest/centos//flavors/default/repo
  ...
  curl: (22) The requested URL returned error: 504 Gateway Timeout

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
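The failure mode in the commit above can be reproduced with a small sketch. The function `shaman_repo_url` is hypothetical and only mirrors the URL layout visible in the log excerpt. The release value "9" is an assumption chosen for illustration.

```python
def shaman_repo_url(branch, distro, release, flavor="default"):
    """Build a repo URL in the layout shown in the log (hypothetical helper)."""
    base = "https://shaman.ceph.com/api/repos/ceph"
    return f"{base}/{branch}/latest/{distro}/{release}/flavors/{flavor}/repo"


# RELEASE not yet set when the URL is built: the path gets an empty segment.
broken = shaman_repo_url("main", "centos", "")

# RELEASE set first (value assumed for illustration).
fixed = shaman_repo_url("main", "centos", "9")
```

The empty segment is what produces the `centos//flavors` path seen in the failing `curl` call.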
2 years agoMerge pull request #2140 from dmick/wip-ceph-release-rpm
Guillaume Abrioux [Mon, 12 Jun 2023 22:13:57 +0000 (00:13 +0200)]
Merge pull request #2140 from dmick/wip-ceph-release-rpm

ceph-release-rpm build script missing space before =

2 years agoceph-release-rpm build script missing space before = 2140/head
Dan Mick [Mon, 12 Jun 2023 22:08:32 +0000 (15:08 -0700)]
ceph-release-rpm build script missing space before =

This causes a syntax error.  Also, the script didn't have -e set, so
the error went unnoticed.

Signed-off-by: Dan Mick <dmick@redhat.com>
2 years agoMerge pull request #2137 from ljflores/wip-ceph-dev-new-build
Laura Flores [Mon, 5 Jun 2023 17:32:50 +0000 (12:32 -0500)]
Merge pull request #2137 from ljflores/wip-ceph-dev-new-build

ceph-dev-new-build/config/definitions: give proper permission to remove a folder

2 years agoceph-dev-new-build/config/definitions: give proper permission to remove a folder 2137/head
Laura Flores [Mon, 5 Jun 2023 17:02:28 +0000 (12:02 -0500)]
ceph-dev-new-build/config/definitions: give proper permission to remove a folder

Fixes the following failure in ceph-dev-new-build:
```
[EnvInject] - Inject global passwords.
[EnvInject] - Mask passwords that will be passed as build parameters.
Failed to evaluate name macro:org.jenkinsci.plugins.tokenmacro.MacroEvaluationException: Error processing tokens
[gigantic] $ /bin/sh -xe /tmp/jenkins6948878898411108393.sh
+ echo 'Cleaning up top-level workarea (shared among workspaces)'
Cleaning up top-level workarea (shared among workspaces)
+ sudo rm -rf dist
+ rm -rf venv
+ rm -rf release
rm: cannot remove 'release/18.0.0-4267-g48627125/ceph-18.0.0-4267-g48627125': Permission denied
rm: cannot remove 'release/18.0.0-4267-g48627125/version': Permission denied
rm: cannot remove 'release/18.0.0-4267-g48627125/ceph_18.0.0-4267-g48627125.orig.tar.gz': Permission denied
rm: cannot remove 'release/18.0.0-4267-g48627125/branch': Permission denied
rm: cannot remove 'release/18.0.0-4267-g48627125/ceph_18.0.0-4267-g48627125-1.dsc': Permission denied
rm: cannot remove 'release/18.0.0-4267-g48627125/other_envvars': Permission denied
rm: cannot remove 'release/18.0.0-4267-g48627125/ceph.spec': Permission denied
...
rm: cannot remove 'release/18.0.0-4267-g48627125/rpm/el8/BUILD/ceph-18.0.0-4267-g48627125/qa/suites/rgw/dbstore/cluster.yaml': Permission denied
rm: cannot remove 'release/18.0.0-4267-g48627125/rpm/el8/BUILD/ceph-18.0.0-4267-g48627125/qa/suites/rgw/dbstore/overrides.yaml': Permission denied
rm: cannot remove 'release/18.0.0-4267-g48627125/rpm/el8/BUILD/ceph-18.0.0-4267-g48627125/qa/suites/rgw/dbstore/s3tests-branch.yaml': Permission denied
rm: cannot remove 'release/18.0.0-4267-g48627125/rpm/el8/BUILD/ceph-18.0.0-4267-g48627125/qa/suites/rgw/dbstore/supported-random-distro$': Permission denied
rm: cannot remove 'release/18.0.0-4267-g48627125/rpm/el8/BUILD/ceph-18.0.0-4267-g48627125/qa/suites/rgw/dbstore/tasks/rgw_s3tests.yaml': Permission denied
rm: cannot remove 'release/18.0.0-4267-g48627125/rpm/el8/BUILD/ceph-18.0.0-4267-g48627125/qa/suites/rgw/dbstore/tasks/.qa': Permission denied
rm: cannot remove 'release/18.0.0-4267-g48627125/rpm/el8/BUILD/ceph-18.0.0-4267-g48627125/qa/suites/rgw/dbstore/.qa': Permission denied
rm: cannot remove 'release/18.0.0-4267-g48627125/rpm/el8/BUILD/ceph-18.0.0-4267-g48627125/qa/suites/rgw/hadoop-s3a/%': Permission denied
rm: cannot remove 'release/18.0.0-4267-g48627125/rpm/el8/BUILD/ceph-18.0.0-4267-g48627125/qa/suites/rgw/hadoop-s3a/clusters/fixed-2.yaml': Permission denied
```

Also add a line to list ownership before removing folders.

Signed-off-by: Laura Flores <lflores@redhat.com>
2 years agoMerge pull request #2136 from petrutlucian94/win_bluestore
Ilya Dryomov [Mon, 29 May 2023 09:17:32 +0000 (11:17 +0200)]
Merge pull request #2136 from petrutlucian94/win_bluestore

windows: use bluestore instead of memstore by default

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
2 years agowindows: use bluestore instead of memstore by default 2136/head
Lucian Petrut [Thu, 25 May 2023 07:55:39 +0000 (07:55 +0000)]
windows: use bluestore instead of memstore by default

We've been using memstore in order to put as much load as possible
on the Windows clients in a relatively short timeframe.

However, we've been hitting memstore segfaults:
https://tracker.ceph.com/issues/61416

For this reason, we'll make the OSD backend configurable,
defaulting to bluestore for now.

Signed-off-by: Lucian Petrut <lpetrut@cloudbasesolutions.com>
2 years agoMerge pull request #2135 from dmick/main
Zack Cerza [Thu, 25 May 2023 20:55:55 +0000 (14:55 -0600)]
Merge pull request #2135 from dmick/main

2 years agoceph-dev-new-trigger: keep 50 jobs to help diagnose dual builds 2135/head
Dan Mick [Thu, 25 May 2023 02:34:53 +0000 (19:34 -0700)]
ceph-dev-new-trigger: keep 50 jobs to help diagnose dual builds

Signed-off-by: Dan Mick <dmick@redhat.com>
2 years agoMerge pull request #2127 from dmick/main
Zack Cerza [Mon, 22 May 2023 21:19:24 +0000 (15:19 -0600)]
Merge pull request #2127 from dmick/main

2 years agoMerge pull request #2134 from dmick/wip-build-utils
Zack Cerza [Mon, 22 May 2023 20:47:22 +0000 (14:47 -0600)]
Merge pull request #2134 from dmick/wip-build-utils

2 years agoscripts/build_utils.sh: limit .gz files copied to chacra 2134/head
Dan Mick [Sat, 20 May 2023 00:22:28 +0000 (17:22 -0700)]
scripts/build_utils.sh: limit .gz files copied to chacra

Don't copy *every* .gz file from the release tree
(which includes the source tree); just ceph*.gz

Fixes: https://tracker.ceph.com/issues/61306
Signed-off-by: Dan Mick <dmick@redhat.com>
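The narrower selection described in the commit above can be illustrated as follows. This is a sketch only: the real change is in a shell script, the helper `select_release_tarballs` is hypothetical, the paths are made up, and matching on the file's basename is an assumption.

```python
from fnmatch import fnmatchcase
from posixpath import basename


def select_release_tarballs(paths, pattern="ceph*.gz"):
    """Keep only paths whose file name matches the pattern (hypothetical helper)."""
    return [p for p in paths if fnmatchcase(basename(p), pattern)]


# Made-up paths: two release tarballs and one .gz from the unpacked source tree.
candidates = [
    "release/18.0.0/ceph-18.0.0.tar.gz",
    "release/18.0.0/rpm/el8/BUILD/ceph-18.0.0/doc/changelog.gz",
    "release/18.0.0/ceph_18.0.0.orig.tar.gz",
]
selected = select_release_tarballs(candidates)
```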
2 years agoMerge pull request #2132 from ceph/jjb-wheel
Zack Cerza [Fri, 5 May 2023 22:39:01 +0000 (16:39 -0600)]
Merge pull request #2132 from ceph/jjb-wheel

2 years agobuild_utils.sh: Install wheel 2132/head
Zack Cerza [Fri, 5 May 2023 22:16:45 +0000 (16:16 -0600)]
build_utils.sh: Install wheel

Signed-off-by: Zack Cerza <zack@redhat.com>
2 years agojenkins-job-builder: Install dataclasses
Zack Cerza [Fri, 5 May 2023 22:07:08 +0000 (16:07 -0600)]
jenkins-job-builder: Install dataclasses

Signed-off-by: Zack Cerza <zack@redhat.com>
2 years agoMerge pull request #2130 from ljflores/wip-requests-toolbelt
Laura Flores [Fri, 5 May 2023 20:09:02 +0000 (15:09 -0500)]
Merge pull request #2130 from ljflores/wip-requests-toolbelt

scripts/build_utils.sh: pin urllib3 and requests

2 years agoscripts/build_utils.sh: pin urllib3 and requests 2130/head
Laura Flores [Thu, 4 May 2023 22:48:17 +0000 (17:48 -0500)]
scripts/build_utils.sh: pin urllib3 and requests

The latest version of chacractl requires requests-toolbelt==0.9.1.
This is not compatible with the latest version of urllib3, which
dropped support for appengine.

We need to pin urllib3 to an older version that still supports
appengine, and requests to an older version that still supports
the older urllib3 until chacractl allows a higher version of
requests-toolbelt.

Fixes: https://tracker.ceph.com/issues/59652
Signed-off-by: Laura Flores <lflores@redhat.com>
2 years agoMerge pull request #2129 from ceph/cephadm-ansible-reef
Guillaume Abrioux [Thu, 27 Apr 2023 08:17:06 +0000 (10:17 +0200)]
Merge pull request #2129 from ceph/cephadm-ansible-reef

cephadm-ansible: add el9 testing

2 years agocephadm-ansible: add el9 testing 2129/head
Guillaume Abrioux [Thu, 27 Apr 2023 07:43:18 +0000 (09:43 +0200)]
cephadm-ansible: add el9 testing

el9 RPMs weren't available so `el9` distribution was left commented out so
far. Given that we now have el9 packages, let's add el9 testing.

Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>
2 years agoMerge pull request #2128 from dmick/wip-remove-ceph-qa-emails
Nizamudeen A [Thu, 27 Apr 2023 05:54:42 +0000 (11:24 +0530)]
Merge pull request #2128 from dmick/wip-remove-ceph-qa-emails

ceph-api-nightly, ceph-dashboard-cephadm-e2e-nightly: stop emails

2 years agoceph-api-nightly, ceph-dashboard-cephadm-e2e-nightly: stop emails 2128/head
Dan Mick [Thu, 27 Apr 2023 00:30:04 +0000 (17:30 -0700)]
ceph-api-nightly, ceph-dashboard-cephadm-e2e-nightly: stop emails

Remove the email notifications from the configuration.  They're
just cluttering the ceph-qa inbox and no one is acting on them.

Signed-off-by: Dan Mick <dan.mick@redhat.com>
2 years agoceph-dev-setup: save only 25 builds (rather than 2 weeks) 2127/head
Dan Mick [Wed, 19 Apr 2023 02:36:41 +0000 (19:36 -0700)]
ceph-dev-setup: save only 25 builds (rather than 2 weeks)

ceph-dev-setup is using 67G out of the 200G on /var/lib/jenkins,
far larger than any other job.  There's no reason to save all these jobs.

Signed-off-by: Dan Mick <dmick@redhat.com>
2 years agoMerge pull request #2122 from ceph/add_centos9_main
Dan Mick [Fri, 7 Apr 2023 21:18:01 +0000 (14:18 -0700)]
Merge pull request #2122 from ceph/add_centos9_main

ceph-release-rpm: add CentOS 9 build

2 years agoMerge pull request #2125 from dmick/wip-sign-rpms
Ken Dreyer [Fri, 7 Apr 2023 15:24:34 +0000 (11:24 -0400)]
Merge pull request #2125 from dmick/wip-sign-rpms

Use createrepo_c, and change order of signing operations to avoid invalid repomd.xml sigs

2 years agoscripts/sign-rpms: signing repomd.xml has to come after all updates 2125/head
Dan Mick [Fri, 7 Apr 2023 10:20:50 +0000 (03:20 -0700)]
scripts/sign-rpms: signing repomd.xml has to come after all updates

We had been signing, and then running createrepo, which changed repomd
and thus invalidated the signature.

Signed-off-by: Dan Mick <dmick@redhat.com>
2 years agoscripts/sign-rpms: use createrepo_c on newer Ubuntu releases (post-focal)
Dan Mick [Fri, 7 Apr 2023 10:18:50 +0000 (03:18 -0700)]
scripts/sign-rpms: use createrepo_c on newer Ubuntu releases (post-focal)

Signed-off-by: Dan Mick <dmick@redhat.com>
2 years agoMerge pull request #2124 from ceph/signing-fixes
Zack Cerza [Thu, 6 Apr 2023 17:58:03 +0000 (11:58 -0600)]
Merge pull request #2124 from ceph/signing-fixes

scripts: signing fixes

2 years agosync-pull: use jq instead of python 2124/head
Ken Dreyer [Thu, 6 Apr 2023 16:09:57 +0000 (12:09 -0400)]
sync-pull: use jq instead of python

We installed jq on the signer host, so we do not need /usr/bin/python.

2 years agoscripts: make signing scripts executable
Ken Dreyer [Thu, 6 Apr 2023 16:03:49 +0000 (12:03 -0400)]
scripts: make signing scripts executable

This makes it easier to run these interactively.

2 years agoMerge pull request #2123 from rhcs-dashboard/grafana-update
Nizamudeen A [Tue, 4 Apr 2023 06:02:22 +0000 (11:32 +0530)]
Merge pull request #2123 from rhcs-dashboard/grafana-update

ceph-grafana-trigger: update the grafana version

2 years agoceph-grafana-trigger: update the grafana version 2123/head
Nizamudeen A [Mon, 3 Apr 2023 16:25:45 +0000 (21:55 +0530)]
ceph-grafana-trigger: update the grafana version

Signed-off-by: Nizamudeen A <nia@redhat.com>
2 years agoceph-release-rpm: add CentOS 9 build 2122/head
Ernesto Puerta [Wed, 22 Mar 2023 19:11:21 +0000 (20:11 +0100)]
ceph-release-rpm: add CentOS 9 build

2 years agoMerge pull request #2121 from ceph/cv-test-suite-update
Guillaume Abrioux [Wed, 8 Mar 2023 18:23:24 +0000 (19:23 +0100)]
Merge pull request #2121 from ceph/cv-test-suite-update

ceph-volume: do not test filestore against reef onward

2 years agoceph-volume: do not test filestore against reef onward 2121/head
Guillaume Abrioux [Wed, 8 Mar 2023 15:58:25 +0000 (16:58 +0100)]
ceph-volume: do not test filestore against reef onward

Filestore support has been dropped as of the Reef release.
This commit drops ceph-volume filestore testing for reef onward.

Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>
2 years agoMerge pull request #2120 from petrutlucian94/mgr_deps
Ionut BALUTOIU [Mon, 6 Mar 2023 09:57:12 +0000 (11:57 +0200)]
Merge pull request #2120 from petrutlucian94/mgr_deps

windows: install python3-bcrypt dependency

2 years agowindows: install python3-bcrypt dependency 2120/head
Lucian Petrut [Sat, 4 Mar 2023 07:28:05 +0000 (09:28 +0200)]
windows: install python3-bcrypt dependency

The Windows job is failing because of a missing mgr dependency.

We'll update the setup script, ensuring that python3-bcrypt is
installed.

Signed-off-by: Lucian Petrut <lpetrut@cloudbasesolutions.com>
2 years agoMerge pull request #2119 from ceph/wip-reef
Josh Durgin [Thu, 2 Mar 2023 23:39:19 +0000 (15:39 -0800)]
Merge pull request #2119 from ceph/wip-reef

Update releases - add reef, remove octopus and older

Reviewed-by: Neha Ojha <nojha@redhat.com>
2 years agoUpdate releases - add reef, remove octopus and older 2119/head
Josh Durgin [Thu, 2 Mar 2023 18:14:16 +0000 (13:14 -0500)]
Update releases - add reef, remove octopus and older

Signed-off-by: Josh Durgin <jdurgin@redhat.com>
2 years agoMerge pull request #2118 from ceph/ceph-devstack
Dan Mick [Thu, 2 Mar 2023 11:07:12 +0000 (03:07 -0800)]
Merge pull request #2118 from ceph/ceph-devstack

Add ceph-devstack job

2 years agoAdd ceph-devstack job 2118/head
Zack Cerza [Fri, 17 Feb 2023 21:26:45 +0000 (14:26 -0700)]
Add ceph-devstack job

Signed-off-by: Zack Cerza <zack@redhat.com>
2 years agoMerge pull request #2117 from ionutbalutoiu/fix-ssh-exec
Lucian Petrut [Fri, 17 Feb 2023 11:43:59 +0000 (13:43 +0200)]
Merge pull request #2117 from ionutbalutoiu/fix-ssh-exec

utils: fix ssh_exec function

2 years agoutils: fix ssh_exec function 2117/head
Ionut Balutoiu [Fri, 17 Feb 2023 11:38:43 +0000 (13:38 +0200)]
utils: fix ssh_exec function

We need to use `return` inside the function to properly return the
exit code of the function. Using `exit` will abruptly exit the
entire script, which is not what we want.

Signed-off-by: Ionut Balutoiu <ibalutoiu@cloudbasesolutions.com>
2 years agoMerge pull request #2116 from ionutbalutoiu/enable-win-event-logs
Lucian Petrut [Fri, 17 Feb 2023 07:33:35 +0000 (09:33 +0200)]
Merge pull request #2116 from ionutbalutoiu/enable-win-event-logs

ceph-windows: Re-enable Windows event logs collection

2 years agoceph-windows: Enable Windows event log collection 2116/head
Ionut Balutoiu [Thu, 16 Feb 2023 17:43:13 +0000 (19:43 +0200)]
ceph-windows: Enable Windows event log collection

The Windows event log collection was failing due to `ssh_exec` abruptly
closing the connection, because the default timeout of `30 secs` was exceeded.

This change re-enables the Windows event log collection by increasing the
SSH timeout to `30 minutes`.

Additionally, the `ssh_exec` function is updated to print an appropriate
error message when SSH timeout is exceeded.

Signed-off-by: Ionut Balutoiu <ibalutoiu@cloudbasesolutions.com>
2 years agoceph-windows: Cleanup collect-event-logs.ps1 script
Ionut Balutoiu [Thu, 16 Feb 2023 17:30:59 +0000 (19:30 +0200)]
ceph-windows: Cleanup collect-event-logs.ps1 script

* Rely on the default `$ErrorActionPreference` value (which is `Continue`).
* Add a new function, `SanitizeName`, that is used to sanitize the names of the log files.
  The function code existed before, but it was duplicated in the script.
* General PowerShell code cleanup.

Signed-off-by: Ionut Balutoiu <ibalutoiu@cloudbasesolutions.com>
2 years agoMerge pull request #2115 from petrutlucian94/log_collection_errors
Ionut BALUTOIU [Thu, 16 Feb 2023 09:02:54 +0000 (11:02 +0200)]
Merge pull request #2115 from petrutlucian94/log_collection_errors

windows: install dokany

2 years agowindows: install dokany 2115/head
Lucian Petrut [Thu, 16 Feb 2023 08:41:06 +0000 (10:41 +0200)]
windows: install dokany

Dokany is required in order to map cephfs on Windows through
ceph-dokan.

For this reason, we're updating the Windows image build script
to include Dokany.

Signed-off-by: Lucian Petrut <lpetrut@cloudbasesolutions.com>
2 years agoMerge pull request #2114 from petrutlucian94/log_collection_errors
Ionut BALUTOIU [Thu, 16 Feb 2023 08:14:17 +0000 (10:14 +0200)]
Merge pull request #2114 from petrutlucian94/log_collection_errors

windows: temporarily disable event log collection

2 years agowindows: temporarily disable event log collection 2114/head
Lucian Petrut [Thu, 16 Feb 2023 08:02:43 +0000 (10:02 +0200)]
windows: temporarily disable event log collection

We're attempting to collect Windows event logs at the end of the
job.

The issue is that some logs cannot be collected and the job
terminates abruptly despite the error action being set to "ignore".

We'll temporarily skip this step in order to unblock the Windows
job.

Signed-off-by: Lucian Petrut <lpetrut@cloudbasesolutions.com>
2 years agoMerge pull request #2113 from petrutlucian94/log_collection_errors
Ionut BALUTOIU [Wed, 15 Feb 2023 17:15:27 +0000 (19:15 +0200)]
Merge pull request #2113 from petrutlucian94/log_collection_errors

windows: fix log collection

2 years agowindows: fix log collection 2113/head
Lucian Petrut [Wed, 15 Feb 2023 14:41:26 +0000 (16:41 +0200)]
windows: fix log collection

The "Get-WinEvent" command used to retrieve Windows event log
messages can fail if the specified log has no entries.

We're using the "SilentlyContinue" action to avoid erroring out
in such cases.

However, the script still terminates abruptly while collecting
logs. For this reason, we'll use the "Ignore" error action instead.
We'd rather not have test failures just because we failed to
retrieve some Windows event log entries.

Unlike "SilentlyContinue", "Ignore" doesn't populate the global
$Error variable, which may be used when running the script
remotely.

While at it, we're adding some log messages at the end of the
"run_tests" and "collect-event-logs.ps1" scripts.

Signed-off-by: Lucian Petrut <lpetrut@cloudbasesolutions.com>
2 years agoMerge pull request #2112 from stefan-chivu/fix-event-logs-v2
Lucian Petrut [Wed, 15 Feb 2023 08:48:38 +0000 (10:48 +0200)]
Merge pull request #2112 from stefan-chivu/fix-event-logs-v2

ceph-windows: Fixed txt event log dump

2 years agoceph-windows: Fixed txt event log dump 2112/head
Stefan Chivu [Wed, 15 Feb 2023 08:42:08 +0000 (08:42 +0000)]
ceph-windows: Fixed txt event log dump

The event log collection script was throwing an error if no event
logs could be found for the filters applied. Therefore, the
Get-WinEvent call in DumpEventLogTxt has been modified to silently
continue if such a case arises.

Signed-off-by: Stefan Chivu <schivu@cloudbasesolutions.com>
2 years agoMerge pull request #2111 from stefan-chivu/fix-event-logs
Lucian Petrut [Tue, 14 Feb 2023 12:05:11 +0000 (14:05 +0200)]
Merge pull request #2111 from stefan-chivu/fix-event-logs

ceph-windows: Fix event log artifacts

2 years agoceph-windows: Fix event log artifacts 2111/head
Stefan Chivu [Tue, 14 Feb 2023 11:47:03 +0000 (11:47 +0000)]
ceph-windows: Fix event log artifacts

Previously, the Windows event log artifacts were being dumped in
evtx format and then converted to txt by individually querying them
using wevtutil.

After doing this, the resulting txt logs contained xml entries that
omitted the actual event message.

Now, they will be directly dumped as txt using Get-WinEvent and the
evtx logs can be obtained using the -IncludeEvtxFiles flag.

Signed-off-by: Stefan Chivu <schivu@cloudbasesolutions.com>
2 years agoMerge pull request #2110 from petrutlucian94/win_logs
Ionut BALUTOIU [Fri, 10 Feb 2023 13:22:08 +0000 (15:22 +0200)]
Merge pull request #2110 from petrutlucian94/win_logs

windows: collect additional information

2 years agowindows: collect additional information 2110/head
Lucian Petrut [Fri, 10 Feb 2023 13:07:12 +0000 (15:07 +0200)]
windows: collect additional information

In some cases, the OSDs seem to crash while running Windows tests.

In order to be able to investigate the issue, we'll need to collect
additional information. This commit fetches the following:

* the entire journal since the last boot
* ceph logs
* available memory at the end of the job

Signed-off-by: Lucian Petrut <lpetrut@cloudbasesolutions.com>
2 years agoMerge pull request #2109 from petrutlucian94/win_jq_package
Ilya Dryomov [Fri, 10 Feb 2023 11:57:31 +0000 (12:57 +0100)]
Merge pull request #2109 from petrutlucian94/win_jq_package

windows: install jq package

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
2 years agowindows: install jq package 2109/head
Lucian Petrut [Fri, 10 Feb 2023 11:47:25 +0000 (13:47 +0200)]
windows: install jq package

Windows jobs run ceph clusters in Ubuntu VMs. This change ensures
that the jq binary used by vstart will be available.

Signed-off-by: Lucian Petrut <lpetrut@cloudbasesolutions.com>
2 years agoMerge pull request #2108 from petrutlucian94/increase_vm_mem
Ionut BALUTOIU [Wed, 8 Feb 2023 13:42:01 +0000 (15:42 +0200)]
Merge pull request #2108 from petrutlucian94/increase_vm_mem

increase vm memory for windows test jobs

2 years agoincrease vm memory for windows test jobs 2108/head
Lucian Petrut [Wed, 8 Feb 2023 11:19:12 +0000 (13:19 +0200)]
increase vm memory for windows test jobs

Windows test jobs use two VMs: a Windows VM that takes 8GB of RAM
and a Linux one that currently uses 32GB of RAM.

We're using memstore with 5GB per OSD. It seems that the Linux VM
is running out of memory, which is why the OSDs can get
terminated while running the tests.

We'll go ahead and increase the Linux VM memory to 64GB.

Signed-off-by: Lucian Petrut <lpetrut@cloudbasesolutions.com>
2 years agoMerge pull request #2107 from petrutlucian94/fix_win_ceph_call
Ionut BALUTOIU [Tue, 7 Feb 2023 16:21:09 +0000 (18:21 +0200)]
Merge pull request #2107 from petrutlucian94/fix_win_ceph_call

windows: fix running "ceph status"