]> git.apps.os.sepia.ceph.com Git - ceph-ci.git/log
ceph-ci.git
2 years agotest/osd: add read balancer unit tests
Laura Flores [Thu, 12 Jan 2023 23:36:50 +0000 (23:36 +0000)]
test/osd: add read balancer unit tests

This commit adds unit test coverage to the read balancing
feature, including small vs. large osdmap scenarios,
random osdmap scenarios, and scenarios that involve
tweaking primary affinity on an OSD.

Signed-off-by: Laura Flores <lflores@redhat.com>
2 years agotools, test/cli: add read balancer to osdmaptool
Laura Flores [Wed, 1 Feb 2023 23:52:17 +0000 (23:52 +0000)]
tools, test/cli: add read balancer to osdmaptool

This commit adds the capability to balance reads on a
given osdmap with the osdmaptool. The user has the option
of performing a "dry run" of read balancing OR taking it a
step further and applying the results to a live cluster.

Performing a "dry run" would involve simply running an
osdmaptool command and inspecting the results.

The template for the command is:
`osdmaptool <osd map file> --read <file for command output> --read-pool <pool name>`

An example command a user might run is:
`osdmaptool om --read out.txt --read-pool default.rgw.control`

This commit also adds a `--vstart` flag that allows a user to print ceph
commands in the outfile formatted for a vstart cluster. An example command
a user might run is:
`./bin/osdmaptool om --vstart --read out.txt --read-pool default.rgw.control`
The out.txt file would contain ceph commands prefixed with `./bin/`.
The `--vstart` flag may also be applied to an `--upmap` osdmaptool command.

If the user wants to apply read balancing results from their dry run to a
live cluster, they may either manually apply the ceph commands from the out
file, or run `source <outfile>`.

Signed-off-by: Laura Flores <lflores@redhat.com>
2 years agoosd: implement read balancer
Laura Flores [Wed, 1 Feb 2023 23:50:53 +0000 (23:50 +0000)]
osd: implement read balancer

This commit implements two functions:

1. calc_desired_primary_distribution
   Based on the number of pgs in a pool and the pool's
   replica count, we calculate the ideal number of primary
   pgs that should be assigned to each OSD on that pool in
   order for reads to be balanced.

2. balance_primaries
   This is the overall algorithm used to balance reads (primary
   pgs) in a pool. Based on the first function, we re-distribute
   primaries on the OSDs in a pool so each OSD has the ideal
   number of primaries. This is done without data movement.

Signed-off-by: Laura Flores <lflores@redhat.com>
2 years agoMerge pull request #50207 from rhcs-dashboard/fix-58827-main
Ernesto Puerta [Wed, 22 Feb 2023 14:57:38 +0000 (15:57 +0100)]
Merge pull request #50207 from rhcs-dashboard/fix-58827-main

mgr/dashboard: fix constraints.txt

Reviewed-by: Pere Diaz Bou <pdiazbou@redhat.com>
2 years agoMerge PR #50175 into main
Venky Shankar [Wed, 22 Feb 2023 14:17:43 +0000 (19:47 +0530)]
Merge PR #50175 into main

* refs/pull/50175/head:
cephfs: upgrade cephfs-shell's path wherever necessary

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Dhairya Parmar <dparmar@redhat.com>
2 years agoMerge pull request #49979 from ceph/dependabot-npm_and_yarn-src-pybind-mgr-dashboard...
Ernesto Puerta [Wed, 22 Feb 2023 13:56:05 +0000 (14:56 +0100)]
Merge pull request #49979 from ceph/dependabot-npm_and_yarn-src-pybind-mgr-dashboard-frontend-http-cache-semantics-4.1.1

mgr/dashboard: bump http-cache-semantics from 4.1.0 to 4.1.1 in /src/pybind/mgr/dashboard/frontend

Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
2 years agoMerge pull request #49192 from ceph/dependabot-npm_and_yarn-src-pybind-mgr-dashboard...
Ernesto Puerta [Wed, 22 Feb 2023 11:09:25 +0000 (12:09 +0100)]
Merge pull request #49192 from ceph/dependabot-npm_and_yarn-src-pybind-mgr-dashboard-frontend-decode-uri-component-0.2.2

mgr/dashboard: bump decode-uri-component from 0.2.0 to 0.2.2 in /src/pybind/mgr/dashboard/frontend

Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
2 years agomgr/dashboard: fix constraints.txt
Ernesto Puerta [Wed, 22 Feb 2023 10:31:47 +0000 (11:31 +0100)]
mgr/dashboard: fix constraints.txt

Replaces strict version matching with compatible version syntax.

Reference: https://peps.python.org/pep-0440/#version-specifiers

Fixes: https://tracker.ceph.com/issues/58827
Signed-off-by: Ernesto Puerta <epuertat@redhat.com>
2 years agoMerge PR #45669 into main
Venky Shankar [Wed, 22 Feb 2023 09:11:09 +0000 (14:41 +0530)]
Merge PR #45669 into main

* refs/pull/45669/head:
client: switch to use 32 bits ext_num_fwd
client: switch to use 32 bits ext_num_retry
ceph_fs.h: add 32 bits extended num_retry and num_fwd support
ceph_fs.h: switch to use its own encode/decode helpers

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2 years agoMerge PR #49934 into main
Venky Shankar [Wed, 22 Feb 2023 06:11:44 +0000 (11:41 +0530)]
Merge PR #49934 into main

* refs/pull/49934/head:
qa: add test_fscrypt_dummy_encryption test case support
qa: add 'options' parameter support for write_local_config
qa: add ceph.exclude file to exclude individual tests
qa: add require_kernel_mount helper support
qa: rename test_fscrypt to test_fscrypt_encrypt

Reviewed-by: Dhairya Parmar <dparmar@redhat.com>
Reviewed-by: Venky Shankar <vshankar@redhat.com>
2 years agoMerge pull request #50203 from zdover23/wip-doc-2023-02-22-radosgw-multisite-refine...
zdover23 [Wed, 22 Feb 2023 04:23:01 +0000 (14:23 +1000)]
Merge pull request #50203 from zdover23/wip-doc-2023-02-22-radosgw-multisite-refine-top-matter

doc/rgw: clarify multisite.rst top matter

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
2 years agodoc/rgw: clarify multisite.rst top matter
Zac Dover [Wed, 22 Feb 2023 03:36:40 +0000 (13:36 +1000)]
doc/rgw: clarify multisite.rst top matter

Improve the pragmatics of the top matter of multisite.rst. Organize the
text into sections, where doing so makes the nature of multi-site
configurations clearer.

Co-authored-by: Cole Mitchell <cole.mitchell@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
2 years agoMerge pull request #49605 from myoungwon/wip-rbm-integration-step2
Yingxin [Wed, 22 Feb 2023 01:37:07 +0000 (09:37 +0800)]
Merge pull request #49605 from myoungwon/wip-rbm-integration-step2

crimson/os/seastore: enable metadata (*_INTERNAL and *_LEAF ) allocation using OOL path

Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
2 years agoMerge pull request #49722 from ceph/dependabot-npm_and_yarn-src-pybind-mgr-dashboard...
Ernesto Puerta [Wed, 22 Feb 2023 00:48:01 +0000 (01:48 +0100)]
Merge pull request #49722 from ceph/dependabot-npm_and_yarn-src-pybind-mgr-dashboard-frontend-loader-utils-1.4.2

mgr/dashboard: bump loader-utils from 1.4.0 to 1.4.2 in /src/pybind/mgr/dashboard/frontend

Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
2 years agoMerge pull request #50042 from ceph/dependabot-npm_and_yarn-src-pybind-mgr-dashboard...
Ernesto Puerta [Tue, 21 Feb 2023 23:26:23 +0000 (00:26 +0100)]
Merge pull request #50042 from ceph/dependabot-npm_and_yarn-src-pybind-mgr-dashboard-frontend-sideway-formula-3.0.1

mgr/dashboard: bump @sideway/formula from 3.0.0 to 3.0.1 in /src/pybind/mgr/dashboard/frontend

Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
2 years agocephfs: upgrade cephfs-shell's path wherever necessary
Rishabh Dave [Mon, 20 Feb 2023 04:47:10 +0000 (10:17 +0530)]
cephfs: upgrade cephfs-shell's path wherever necessary

Commit dc69033763cc116c6ccdf1f97149a74248691042 moves cephfs-shell from
"<CEPH-REPO-ROOT>/src/tools/cephfs/" to
"<CEPH-REPO-ROOT>/src/tools/cephfs/shell" but cephfs-shell's location in
src/vstart.sh and qa/tasks/cephfs/test_cephfs_shell.py is left
un-updated. This produces a broken vstart_environment.sh and broken
export command in test_cephfs_shell.py.

Introduced-by: dc69033763cc116c6ccdf1f97149a74248691042
Fixes: https://tracker.ceph.com/issues/58795
Signed-off-by: Rishabh Dave <ridave@redhat.com>
2 years agoqa: add test_fscrypt_dummy_encryption test case support
Xiubo Li [Tue, 31 Jan 2023 02:33:07 +0000 (10:33 +0800)]
qa: add test_fscrypt_dummy_encryption test case support

This will test around 130 test cases with the test_dummy_encryption
option enabled.

Signed-off-by: Xiubo Li <xiubli@redhat.com>
2 years agoMerge pull request #50165 from zdover23/wip-doc-2023-02-19-radosgw-multisite-multisit...
zdover23 [Tue, 21 Feb 2023 04:52:21 +0000 (14:52 +1000)]
Merge pull request #50165 from zdover23/wip-doc-2023-02-19-radosgw-multisite-multisite-top-matter-cleanup

doc/rgw: multisite ref. top matter cleanup

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
2 years agoqa: add 'options' parameter support for write_local_config
Xiubo Li [Tue, 31 Jan 2023 13:58:39 +0000 (21:58 +0800)]
qa: add 'options' parameter support for write_local_config

This will allow the fscrypt test case to add dedicated options.

Signed-off-by: Xiubo Li <xiubli@redhat.com>
2 years agoqa: add ceph.exclude file to exclude individual tests
Xiubo Li [Tue, 31 Jan 2023 03:09:16 +0000 (11:09 +0800)]
qa: add ceph.exclude file to exclude individual tests

Skip some tests, which will take to much time or will fail, for now.

Signed-off-by: Xiubo Li <xiubli@redhat.com>
2 years agoMerge pull request #50186 from zdover23/wip-doc-2023-02-21-radosgw-session-tags-rende...
zdover23 [Tue, 21 Feb 2023 04:09:30 +0000 (14:09 +1000)]
Merge pull request #50186 from zdover23/wip-doc-2023-02-21-radosgw-session-tags-render-fix

doc/rgw: session-tags.rst - fix link to keycloak

Reviewed-by: Cole Mitchell <cole.mitchell@gmail.com>
2 years agoMerge PR #49154 into main
Venky Shankar [Tue, 21 Feb 2023 04:03:18 +0000 (09:33 +0530)]
Merge PR #49154 into main

* refs/pull/49154/head:
qa: add tests for cephfs-top
doc: updates --dump and --dumpfs
cephfs-top: dump values to stdout

Reviewed-by: Neeraj Pratap Singh <neesingh@redhat.com>
Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Anthony D Atri <anthony.datri@gmail.com>
2 years agodoc/rgw: session-tags.rst - fix link to keycloak
Zac Dover [Tue, 21 Feb 2023 03:55:03 +0000 (13:55 +1000)]
doc/rgw: session-tags.rst - fix link to keycloak

Fix a link in the doc/radosgw/session-tags.rst file that was intended to
link to the doc/radosgw/keycloak.rst file.

Signed-off-by: Zac Dover <zac.dover@proton.me>
2 years agoMerge pull request #43145 from pritha-srivastava/wip-rgw-sts-doc-fixes
zdover23 [Tue, 21 Feb 2023 03:48:28 +0000 (13:48 +1000)]
Merge pull request #43145 from pritha-srivastava/wip-rgw-sts-doc-fixes

rgw/sts: correcting documentation errors.

Reviewed-by: Matt Benjamin <mbenjamin@redhat.com>
Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
2 years agoMerge pull request #49843 from shreyanshjain7174/compression_58410
Yuri Weinstein [Mon, 20 Feb 2023 22:29:38 +0000 (14:29 -0800)]
Merge pull request #49843 from shreyanshjain7174/compression_58410

rados: Set snappy as default value in ms_osd_compression_algorithm

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
2 years agoMerge pull request #49487 from trociny/wip-58269
Yuri Weinstein [Mon, 20 Feb 2023 22:28:49 +0000 (14:28 -0800)]
Merge pull request #49487 from trociny/wip-58269

mgr: don't dump global config holding gil

Reviewed-by: Tim Serong <tserong@suse.com>
2 years agoMerge pull request #49379 from zhsgao/osd_deep_scrub_stride
Yuri Weinstein [Mon, 20 Feb 2023 22:27:31 +0000 (14:27 -0800)]
Merge pull request #49379 from zhsgao/osd_deep_scrub_stride

osd/scrub: add a variable to store osd_deep_scrub_stride

Reviewed-by: Ronen Friedman <rfriedma@redhat.com>
2 years agoMerge pull request #49199 from ifed01/wip-ifed-fix-require-osd-release
Yuri Weinstein [Mon, 20 Feb 2023 22:26:39 +0000 (14:26 -0800)]
Merge pull request #49199 from ifed01/wip-ifed-fix-require-osd-release

mon: avoid exception when setting require-osd-release more than 2

Reviewed-by: Neha Ojha <nojha@redhat.com>
2 years agoMerge pull request #45414 from ifed01/wip-ifed-log-before-assert-in-avl
Yuri Weinstein [Mon, 20 Feb 2023 22:26:11 +0000 (14:26 -0800)]
Merge pull request #45414 from ifed01/wip-ifed-log-before-assert-in-avl

os/bluestore: log before assert in AvlAllocator

Reviewed-by: Adam Kupczyk <akupczyk@redhat.com>
2 years agoMerge pull request #49335 from yuvalif/wip-yuval-fix-58167
Yuval Lifshitz [Mon, 20 Feb 2023 20:29:51 +0000 (22:29 +0200)]
Merge pull request #49335 from yuvalif/wip-yuval-fix-58167

prevent anonymous topic operations

Reviewed-By: pritha-srivastava, cbodley
2 years agoMerge pull request #48709 from yanghonggang/yhg-jaeger-port
Yuval Lifshitz [Mon, 20 Feb 2023 20:27:06 +0000 (22:27 +0200)]
Merge pull request #48709 from yanghonggang/yhg-jaeger-port

common: add a config option for jaeger agent port

Reviewed-By: tchaikov, zenomri, yuvalif
2 years agoMerge pull request #49910 from myoungwon/fix-58587
Laura Flores [Mon, 20 Feb 2023 20:24:03 +0000 (14:24 -0600)]
Merge pull request #49910 from myoungwon/fix-58587

qa/workunits/rados/test_dedup_tool.sh: reset dedup tier during tests

2 years agoMerge pull request #49950 from cbodley/wip-qa-ragweed-tox
Casey Bodley [Mon, 20 Feb 2023 14:29:50 +0000 (09:29 -0500)]
Merge pull request #49950 from cbodley/wip-qa-ragweed-tox

qa/rgw: run ragweed tests with tox

Reviewed-by: Ali Maredia <amaredia@redhat.com>
2 years agoMerge pull request #49962 from robbat2/main-post_object_upload_size_rgw_chunk_size_bug
Casey Bodley [Mon, 20 Feb 2023 14:28:41 +0000 (09:28 -0500)]
Merge pull request #49962 from robbat2/main-post_object_upload_size_rgw_chunk_size_bug

rgw: Fix Browser POST content-length-range min value

Reviewed-by: Casey Bodley <cbodley@redhat.com>
2 years agoMerge pull request #50166 from zdover23/wip-doc-2023-02-19-radosgw-multisite-zonegrou...
zdover23 [Mon, 20 Feb 2023 08:21:16 +0000 (18:21 +1000)]
Merge pull request #50166 from zdover23/wip-doc-2023-02-19-radosgw-multisite-zonegroup-setup-cleanup

doc/rgw: refine "Setting a Zonegroup"

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
2 years agodoc/rgw: refine "Setting a Zonegroup"
Zac Dover [Sun, 19 Feb 2023 13:50:43 +0000 (23:50 +1000)]
doc/rgw: refine "Setting a Zonegroup"

Streamline the "Setting a Zonegroup" section by separating out the
necessary prerequisite reading from the procedure itself.

I also corrected a typo in the word "following" in an unrelated section.

Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
2 years agoMerge pull request #50164 from zdover23/wip-doc-2023-02-19-radosgw-multisite-execute...
zdover23 [Mon, 20 Feb 2023 00:55:32 +0000 (10:55 +1000)]
Merge pull request #50164 from zdover23/wip-doc-2023-02-19-radosgw-multisite-execute-removal

doc/radosgw: s/execute/run/ in multisite.rst

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
2 years agoMerge pull request #50160 from cbodley/wip-qa-rgw-notification-example
Casey Bodley [Sun, 19 Feb 2023 13:41:52 +0000 (08:41 -0500)]
Merge pull request #50160 from cbodley/wip-qa-rgw-notification-example

qa/rgw: update boto3 example path in notification_tests.py

Reviewed-by: Yuval Lifshitz <ylifshit@redhat.com>
2 years agodoc/rgw: multisite ref. top matter cleanup
Zac Dover [Sun, 19 Feb 2023 13:20:45 +0000 (23:20 +1000)]
doc/rgw: multisite ref. top matter cleanup

Edit the top matter in "Multi-site Configuration Reference" so that it
is semantically more natural.

Signed-off-by: Zac Dover <zac.dover@proton.me>
2 years agodoc/radosgw: s/execute/run/ in multisite.rst
Zac Dover [Sun, 19 Feb 2023 06:46:49 +0000 (16:46 +1000)]
doc/radosgw: s/execute/run/ in multisite.rst

Replace "execute" with "run" in fourteen out of fifteen instances in
multisite.rst. "Execute" is plain old too grand in most of these
instances.

Signed-off-by: Zac Dover <zac.dover@proton.me>
2 years agoMerge pull request #48683 from linuxbox2/wip-rgwlc-faststop
J. Eric Ivancich [Sat, 18 Feb 2023 18:58:02 +0000 (13:58 -0500)]
Merge pull request #48683 from linuxbox2/wip-rgwlc-faststop

rgwlc: prevent lc for one bucket from exceeding time budget

Reviewed-by: Adam C. Emerson <aemerson@redhat.com>
2 years agoMerge pull request #49785 from theanalyst/rgw-swift-key
J. Eric Ivancich [Sat, 18 Feb 2023 18:56:14 +0000 (13:56 -0500)]
Merge pull request #49785 from theanalyst/rgw-swift-key

rgw: swift : check for valid key in POST forms

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>
Reviewed-by: Adam C. Emerson <aemerson@redhat.com>
2 years agoMerge pull request #49951 from ivancich/wip-clean-up-2023-Jan-23-coverity
J. Eric Ivancich [Sat, 18 Feb 2023 18:54:26 +0000 (13:54 -0500)]
Merge pull request #49951 from ivancich/wip-clean-up-2023-Jan-23-coverity

rgw: clean up January 21, 2023 coverity scan issues

Reviewed-by: Soumya Koduri <skoduri@redhat.com>
Reviewed-by: Adam C. Emerson <aemerson@redhat.com>
2 years agoMerge pull request #50157 from guits/fix-simple-scan-cv
Guillaume Abrioux [Sat, 18 Feb 2023 18:26:25 +0000 (19:26 +0100)]
Merge pull request #50157 from guits/fix-simple-scan-cv

ceph-volume: legacy_encrypted() shouldn't call lsblk() when device is 'tmpfs'

2 years agoqa/rgw: update boto3 example path in notification_tests.py
Casey Bodley [Sat, 18 Feb 2023 14:30:36 +0000 (09:30 -0500)]
qa/rgw: update boto3 example path in notification_tests.py

fallout from https://github.com/ceph/ceph/pull/50102

Signed-off-by: Casey Bodley <cbodley@redhat.com>
2 years agoceph-volume: legacy_encrypted() shouldn't call lsblk() when device is 'tmpfs'
Guillaume Abrioux [Sat, 18 Feb 2023 02:05:51 +0000 (03:05 +0100)]
ceph-volume: legacy_encrypted() shouldn't call lsblk() when device is 'tmpfs'

When the device variable is rebound and become 'tmpfs', lsblk() shouldn't be called.

Fixes: https://tracker.ceph.com/issues/58784
Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>
2 years agoMerge pull request #50154 from zdover23/wip-doc-2023-02-18-glossary-period-rgw
Anthony D'Atri [Sat, 18 Feb 2023 00:01:31 +0000 (19:01 -0500)]
Merge pull request #50154 from zdover23/wip-doc-2023-02-18-glossary-period-rgw

doc/glossary: add "Period" to glossary

2 years agodoc/glossary: add "Period" to glossary
Zac Dover [Fri, 17 Feb 2023 22:03:06 +0000 (08:03 +1000)]
doc/glossary: add "Period" to glossary

Add definition for "period" to glossary.

Signed-off-by: Zac Dover <zac.dover@proton.me>
2 years agoMerge pull request #50020 from rkachach/fix_issue_58660
Adam King [Fri, 17 Feb 2023 18:10:28 +0000 (13:10 -0500)]
Merge pull request #50020 from rkachach/fix_issue_58660

mgr/rgw: fix module crash because of typing_extensions missing

Reviewed-by: John Mulligan <jmulligan@redhat.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>
2 years agoMerge pull request #49728 from rkachach/fix_issue_58241
Adam King [Fri, 17 Feb 2023 18:07:21 +0000 (13:07 -0500)]
Merge pull request #49728 from rkachach/fix_issue_58241

cephadm: Adding poststop actions and setting TimeoutStartSec to 200s

Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: Michael Fritch <mfritch@suse.com>
2 years agoMerge pull request #49754 from adk3798/ha-nfs-active-failover
Adam King [Fri, 17 Feb 2023 18:02:08 +0000 (13:02 -0500)]
Merge pull request #49754 from adk3798/ha-nfs-active-failover

mgr/cephadm: fix backends service in haproxy config with multiple nfs of same rank

Reviewed-by: John Mulligan <jmulligan@redhat.com>
Reviewed-by: Michael Fritch <mfritch@suse.com>
Reviewed-by: Redouane Kachach <rkachach@redhat.com>
2 years agoMerge pull request #48973 from trociny/wip-58051
Adam King [Fri, 17 Feb 2023 17:47:59 +0000 (12:47 -0500)]
Merge pull request #48973 from trociny/wip-58051

mgr/cephadm: try to avoid pull when getting container image info

Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: Michael Fritch <mfritch@suse.com>
2 years agoMerge pull request #50016 from max-licc/nobjects_begin_crash
J. Eric Ivancich [Fri, 17 Feb 2023 17:40:00 +0000 (12:40 -0500)]
Merge pull request #50016 from max-licc/nobjects_begin_crash

rgw: catches nobjects_begin() exceptions

Reviewed-by: Casey Bodley <cbodley@redhat.com>
2 years agoMerge pull request #50041 from cbodley/wip-cls-rgw-cancel-deleted
J. Eric Ivancich [Fri, 17 Feb 2023 17:26:47 +0000 (12:26 -0500)]
Merge pull request #50041 from cbodley/wip-cls-rgw-cancel-deleted

cls/rgw: remove index entry after cancelling last racing delete op

Reviewed-by: J. Eric Ivancich <ivancich@redhat.com>
Reviewed-by: Cory Snyder <csnyder@iland.com>
2 years agoMerge pull request #50059 from cbodley/wip-rgw-stream-errors
J. Eric Ivancich [Fri, 17 Feb 2023 17:25:41 +0000 (12:25 -0500)]
Merge pull request #50059 from cbodley/wip-rgw-stream-errors

rgw/beast: StreamIO remembers connection errors for graceful shutdown

Reviewed-by: Yixin Jin <yjin77@yahoo.ca>
2 years agoMerge pull request #50131 from yima77/rgw_fix_segfault_due_to_concurrent_socket_use_a...
J. Eric Ivancich [Fri, 17 Feb 2023 17:21:48 +0000 (12:21 -0500)]
Merge pull request #50131 from yima77/rgw_fix_segfault_due_to_concurrent_socket_use_at_timeout

rgw: Fix segfault due to concurrent socket use at timeout

Reviewed-by: Casey Bodley <cbodley@redhat.com>
2 years agoMerge pull request #50143 from ivancich/wip-flight-getobj-fix
J. Eric Ivancich [Fri, 17 Feb 2023 17:21:19 +0000 (12:21 -0500)]
Merge pull request #50143 from ivancich/wip-flight-getobj-fix

rgw/flight: don't access non-existant flight store during GetObj

Reviewed-by: Matt Benjamin <mbenjamin@redhat.com>
2 years agoMerge pull request #50096 from nbalacha/syncpercent-fix
Ilya Dryomov [Fri, 17 Feb 2023 17:11:03 +0000 (18:11 +0100)]
Merge pull request #50096 from nbalacha/syncpercent-fix

rbd-mirror: fix syncing_percent calculation logic in get_replay_status()

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
2 years agoMerge pull request #49614 from isodude/wip-librbd-misalign-discard
Ilya Dryomov [Fri, 17 Feb 2023 17:09:39 +0000 (18:09 +0100)]
Merge pull request #49614 from isodude/wip-librbd-misalign-discard

librbd: Fix local rbd mirror journals growing forever

Reviewed-by: Mykola Golub <mgolub@suse.com>
Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
2 years agomgr/cephadm: fix error_ok not being passed in unit tests
Adam King [Sun, 22 Jan 2023 16:52:10 +0000 (11:52 -0500)]
mgr/cephadm: fix error_ok not being passed in unit tests

Signed-off-by: Adam King <adking@redhat.com>
2 years agomgr/cephadm: try to avoid pull when getting container image info
Mykola Golub [Sun, 20 Nov 2022 13:11:11 +0000 (13:11 +0000)]
mgr/cephadm: try to avoid pull when getting container image info

Fixes: https://tracker.ceph.com/issues/58051
Signed-off-by: Mykola Golub <mykola.golub@clyso.com>
2 years agomgr/cephadm: allow _run_cephadm_json to be silent on error
Mykola Golub [Sat, 26 Nov 2022 08:39:54 +0000 (08:39 +0000)]
mgr/cephadm: allow _run_cephadm_json to be silent on error

Signed-off-by: Mykola Golub <mykola.golub@clyso.com>
2 years agoMerge pull request #49866 from rkachach/fix_issue_58466
Adam King [Fri, 17 Feb 2023 15:32:06 +0000 (10:32 -0500)]
Merge pull request #49866 from rkachach/fix_issue_58466

cephadm: using short hostname to create the initial mon and mgr

Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: John Mulligan <jmulligan@redhat.com>
2 years agoMerge pull request #49961 from stefan-chivu/test_rbd_wnbd_latency
Lucian Petrut [Fri, 17 Feb 2023 10:11:15 +0000 (12:11 +0200)]
Merge pull request #49961 from stefan-chivu/test_rbd_wnbd_latency

qa: Added latency to results table in test_rbd_wnbd.py

2 years agocrimson/os/seastore: fix traversal behavior in scan_mapped_space to avoid replay...
myoungwon oh [Tue, 14 Feb 2023 09:39:54 +0000 (18:39 +0900)]
crimson/os/seastore: fix traversal behavior in scan_mapped_space to avoid replay-allocation conflicts

1. Traverse the allocations from backref-tree leaf-node entries;
2. Traverse the pending alloc-deltas by sequence;
3. Traverse the allocations from backref-tree internal-node entries;

Signed-off-by: Myoungwon Oh <myoungwon.oh@samsung.com>
2 years agoqa/rgw: add tox task before ragweed
Casey Bodley [Tue, 31 Jan 2023 18:09:36 +0000 (13:09 -0500)]
qa/rgw: add tox task before ragweed

Signed-off-by: Casey Bodley <cbodley@redhat.com>
2 years agoqa/ragweed: run ragweed tests with tox
Casey Bodley [Tue, 31 Jan 2023 18:07:34 +0000 (13:07 -0500)]
qa/ragweed: run ragweed tests with tox

Signed-off-by: Casey Bodley <cbodley@redhat.com>
2 years agoMerge pull request #49709 from yima77/rgw_fix_mp_upload_obj_leak_due_to_reupload
Casey Bodley [Thu, 16 Feb 2023 17:22:41 +0000 (12:22 -0500)]
Merge pull request #49709 from yima77/rgw_fix_mp_upload_obj_leak_due_to_reupload

rgw: fix multipart upload object leaks due to re-upload

Reviewed-by: Casey Bodley <cbodley@redhat.com>
2 years agorgw/flight: don't access non-existant flight store during GetObj
J. Eric Ivancich [Thu, 16 Feb 2023 15:29:31 +0000 (10:29 -0500)]
rgw/flight: don't access non-existant flight store during GetObj

The front end must be configured via ceph.conf to start up both the
flight_server and the flight_store. RGWGetObj needs to check for the
existence of a flight_store prior to trying to use it.

Signed-off-by: J. Eric Ivancich <ivancich@redhat.com>
2 years agocrimson/os/seastore: In rewrite_extent, do not resolve addr if address is RANDOM_BLOCK
myoungwon oh [Tue, 14 Feb 2023 09:22:24 +0000 (18:22 +0900)]
crimson/os/seastore: In rewrite_extent, do not resolve addr if address is RANDOM_BLOCK

Signed-off-by: Myoungwon Oh <myoungwon.oh@samsung.com>
Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>
2 years agocrimson/os/seastore: determine whether lba_backref_node can be allocated to OOL
myoungwon oh [Tue, 13 Dec 2022 00:56:58 +0000 (09:56 +0900)]
crimson/os/seastore: determine whether lba_backref_node can be allocated to OOL

Signed-off-by: Myoungwon Oh <myoungwon.oh@samsung.com>
Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>
2 years agoquincy: qa: update test_rbd_wnbd.py to report latency in seconds
Lucian Petrut [Tue, 7 Feb 2023 13:13:54 +0000 (15:13 +0200)]
quincy: qa: update test_rbd_wnbd.py to report latency in seconds

At the moment, the latency results are reported in nanoseconds.
In order to improve readability, we'll convert it to seconds.

While at it, we'll fix the fio duration report, which we're
wrongfully dividing by 1000 twice.

Signed-off-by: Lucian Petrut <lpetrut@cloudbasesolutions.com>
2 years agoqa: Added latency to results table in test_rbd_wnbd.py
Stefan Chivu [Wed, 1 Feb 2023 15:38:52 +0000 (15:38 +0000)]
qa: Added latency to results table in test_rbd_wnbd.py

Added data regarding completion latency (ns) to the fio results
table, such as min, max, mean, stddev, etc.

Signed-off-by: Stefan Chivu <schivu@cloudbasesolutions.com>
2 years agoqa/workunits/rbd-nbd: work around "rbd feature disable" hang
Ilya Dryomov [Thu, 16 Feb 2023 11:53:02 +0000 (12:53 +0100)]
qa/workunits/rbd-nbd: work around "rbd feature disable" hang

"rbd feature disable" appears to reliably hang if the corresponding
remote request is proxied to rbd-nbd (because rbd-nbd happens to own
the exclusive lock after a series of blkdiscard calls) [1].  Work
around it here by enabling journaling before the image is mapped
and disabling it after the image is unmapped.

Also, don't assert on the output of "rbd journal inspect --verbose"
having a certain number of entries.  This is racy: if the script gets
delayed after the last blkdiscard call for some reason, there may be
fewer entries present in the journal or none at all.

[1] https://tracker.ceph.com/issues/58740

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2 years agotest/librbd: add LengthModifiedDiscardJournalAppendEnabled test
Ilya Dryomov [Thu, 16 Feb 2023 11:51:04 +0000 (12:51 +0100)]
test/librbd: add LengthModifiedDiscardJournalAppendEnabled test

Currently nothing triggers the length_modified case in
ImageDiscardRequest::prune_object_extents() in isolation. It's only
triggered in DiscardGranularityJournalAppendEnabled test together with
the prune_required case and a bad refactoring could easily break the
length_modified logic again.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
2 years agoMerge pull request #50109 from rhcs-dashboard/landing-page-v3
Nizamudeen A [Thu, 16 Feb 2023 10:51:12 +0000 (16:21 +0530)]
Merge pull request #50109 from rhcs-dashboard/landing-page-v3

mgr/dashboard: Landing page v3

Reviewed-by: bryanmontalvan <NOT@FOUND>
Reviewed-by: Nizamudeen A <nia@redhat.com>
2 years agoqa: add tests for cephfs-top
Jos Collin [Wed, 1 Feb 2023 12:14:47 +0000 (17:44 +0530)]
qa: add tests for cephfs-top

* Updated the existing tests
* Added new qa tests for --dump and --dumpfs options
* Updated the cluster configuration to 2 mds, 2 clients

Fixes: https://tracker.ceph.com/issues/57014
Signed-off-by: Jos Collin <jcollin@redhat.com>
2 years agodoc: updates --dump and --dumpfs
Jos Collin [Wed, 30 Nov 2022 09:20:28 +0000 (14:50 +0530)]
doc: updates --dump and --dumpfs

Fixes: https://tracker.ceph.com/issues/57014
Signed-off-by: Jos Collin <jcollin@redhat.com>
2 years agocephfs-top: dump values to stdout
Jos Collin [Mon, 21 Nov 2022 09:35:01 +0000 (15:05 +0530)]
cephfs-top: dump values to stdout

Fixes: https://tracker.ceph.com/issues/57014
Signed-off-by: Jos Collin <jcollin@redhat.com>
2 years agoMerge pull request #50114 from rhcs-dashboard/cephadm-prom-401-error
Nizamudeen A [Thu, 16 Feb 2023 06:50:51 +0000 (12:20 +0530)]
Merge pull request #50114 from rhcs-dashboard/cephadm-prom-401-error

mgr/dashboard: ignore the rules 400 error in dashboard kcli e2e

Reviewed-by: Pegonzal <NOT@FOUND>
Reviewed-by: Avan Thakkar <athakkar@redhat.com>
2 years agorbd-mirror: fix syncing_percent calculation logic in get_replay_status()
N Balachandran [Thu, 16 Feb 2023 04:57:02 +0000 (10:27 +0530)]
rbd-mirror: fix syncing_percent calculation logic in get_replay_status()

When a snapshot sync is resumed and the get_replay_status function
is called before handle_copy_image_progress, the syncing_percent
value may be greater than 100 as the m_local_object_count is still
set to zero. This commit sets the syncing_percent to 0 in such cases.

Fixes: https://tracker.ceph.com/issues/58706
Signed-off-by: N Balachandran <nibalach@redhat.com>
2 years agoMerge pull request #50111 from rzarzynski/wip-crimson-osd-ftbfs-fmtptr
Yingxin [Thu, 16 Feb 2023 02:30:56 +0000 (10:30 +0800)]
Merge pull request #50111 from rzarzynski/wip-crimson-osd-ftbfs-fmtptr

crimson/osd: fix FTBFS due to smart-ptr-to-bool conversion in fmt::ptr

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
2 years agoMerge pull request #50138 from zdover23/wip-doc-2023-02-16-radosgw-realms-refinement
Anthony D'Atri [Thu, 16 Feb 2023 01:55:25 +0000 (20:55 -0500)]
Merge pull request #50138 from zdover23/wip-doc-2023-02-16-radosgw-realms-refinement

doc/rgw: refine "Realms" section

2 years agoclient: switch to use 32 bits ext_num_fwd
Xiubo Li [Tue, 5 Jul 2022 04:59:11 +0000 (12:59 +0800)]
client: switch to use 32 bits ext_num_fwd

The MDS will increase the forward count, and if the forward count
is less than the one saved in request in client side, that means
the MDS is old version and it was overflowed. Then just stop
forwarding.

Fixes: https://tracker.ceph.com/issues/57854
Signed-off-by: Xiubo Li <xiubli@redhat.com>
2 years agoclient: switch to use 32 bits ext_num_retry
Xiubo Li [Tue, 5 Jul 2022 04:59:11 +0000 (12:59 +0800)]
client: switch to use 32 bits ext_num_retry

Check the CEPHFS_FEATURE_32BITS_RETRY_FWD feature bit and if not
set, that means it's connecting to an old MDS and will limit the
max retry to 256 times.

Fixes: https://tracker.ceph.com/issues/57854
Signed-off-by: Xiubo Li <xiubli@redhat.com>
2 years agoceph_fs.h: add 32 bits extended num_retry and num_fwd support
Xiubo Li [Tue, 5 Jul 2022 04:59:11 +0000 (12:59 +0800)]
ceph_fs.h: add 32 bits extended num_retry and num_fwd support

To make sure both the num_retry and num_fwd in all the structs have
the same type, which is 32 bits.

And also for old cephs and clients they still could use the 8 bits
members.

Fixes: https://tracker.ceph.com/issues/57854
Signed-off-by: Xiubo Li <xiubli@redhat.com>
2 years agoceph_fs.h: switch to use its own encode/decode helpers
Xiubo Li [Tue, 5 Jul 2022 03:47:16 +0000 (11:47 +0800)]
ceph_fs.h: switch to use its own encode/decode helpers

Prepare switching num_retry and num_fwd to __le32.

Fixes: https://tracker.ceph.com/issues/57854
Signed-off-by: Xiubo Li <xiubli@redhat.com>
2 years agoqa: add require_kernel_mount helper support
Xiubo Li [Tue, 31 Jan 2023 02:12:51 +0000 (10:12 +0800)]
qa: add require_kernel_mount helper support

This will simplify the code later.

Signed-off-by: Xiubo Li <xiubli@redhat.com>
2 years agoqa: rename test_fscrypt to test_fscrypt_encrypt
Xiubo Li [Tue, 31 Jan 2023 02:01:46 +0000 (10:01 +0800)]
qa: rename test_fscrypt to test_fscrypt_encrypt

The test_fscrypt_encrypt will only run the 'encrypt' related test
cases without 'test_dummy_encryption' option enabled. This will
test the filename and content verification.

After this I will add the whole test cases with 'test_dummy_encryption'
option.

Signed-off-by: Xiubo Li <xiubli@redhat.com>
2 years agodoc/rgw: refine "Realms" section
Zac Dover [Wed, 15 Feb 2023 23:56:53 +0000 (09:56 +1000)]
doc/rgw: refine "Realms" section

Make various syntactical and semantic improvements to "Realms" section.

Signed-off-by: Zac Dover <zac.dover@proton.me>
2 years agoMerge pull request #50119 from zdover23/wip-doc-2023-02-15-multisite-zonegroup-spelli...
zdover23 [Wed, 15 Feb 2023 23:37:13 +0000 (09:37 +1000)]
Merge pull request #50119 from zdover23/wip-doc-2023-02-15-multisite-zonegroup-spelling-correction

doc/rgw: s/[Zz]one [Gg]roup/zonegroup/g

2 years agoMerge pull request #50017 from guits/fix-simple-scan-dir
Guillaume Abrioux [Wed, 15 Feb 2023 19:09:30 +0000 (20:09 +0100)]
Merge pull request #50017 from guits/fix-simple-scan-dir

ceph-volume: do not raise RuntimeError in util.lsblk

2 years agorgw: Fix segfault due to concurrent socket use at timeout
Yixin Jin [Wed, 15 Feb 2023 17:08:19 +0000 (17:08 +0000)]
rgw: Fix segfault due to concurrent socket use at timeout

This commit fixes a potential segfault risk when
rgw timeout handler works on the socket in one
thread while it is concurrently used by another.
The details of the fix are:

1. Instead of calling socket close(), which resets
descriptor_data in boost::asio socket and risks
segfault due to concurrent use of the socket,
the timeout handler now calls cancel() to abort
all pending ops followed by shutdown() to disable
the underlying transport. The eventual closure of
the socket will be done in the socket destructor.

2. Expose the actual boost::asio socket via get_socket()
from Connection so that the timeout handler can call
cancel() and shutdown() on it, although the socket data
member is already accessible. It allows future expansion
that wants to hide the socket even though it renders the
existing close() less useful.

Fixes: https://tracker.ceph.com/issues/58670
Signed-off-by: Yixin Jin <yjin77@yahoo.ca>
2 years agoMerge pull request #50062 from phlogistonjohn/jjm-more-cephadm-tests
Adam King [Wed, 15 Feb 2023 16:04:38 +0000 (11:04 -0500)]
Merge pull request #50062 from phlogistonjohn/jjm-more-cephadm-tests

cephadm: a few more utility function tests

Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: Redouane Kachach <rkachach@redhat.com>
2 years agoMerge pull request #50084 from adk3798/cephadm-autopep8-error
Adam King [Wed, 15 Feb 2023 14:17:40 +0000 (09:17 -0500)]
Merge pull request #50084 from adk3798/cephadm-autopep8-error

cephadm: fix autopep8 arg parsing on newer tox versions

Reviewed-by: Redouane Kachach <rkachach@redhat.com>
2 years agoMerge PR #48053 into main
Venky Shankar [Wed, 15 Feb 2023 13:28:29 +0000 (18:58 +0530)]
Merge PR #48053 into main

* refs/pull/48053/head:
test/libcephfs: fix rebasing issues
libcephfs: replace errno.h errors with CEPHFS_E*
messages: avoid converting ceph errors on Windows
client: use CEPHFS_E*
test/libcephfs: use CEPHFS_E* errors
libcephfs: switch to CEPHFS_E* errors
test/libcephfs: disable flaky timestamp assertion on Windows
client: use _setattrx when changing timestamps
client: set nsec to 0 when converting stat struct on Windows
test/libcephfs: skip dirent inode check on Windows
client: avoid trimming inodes on Windows
test/libcephfs: address windows issues
test/libcephfs: include compat.h
test/libcephfs: enable the tests on Windows

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Anthony D Atri <anthony.datri@gmail.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Rishabh Dave <ridave@redhat.com>
2 years agomgr/dashboard: dashboard-v3: cluster-utilization card
Pedro Gonzalez Gomez [Tue, 11 Oct 2022 17:43:34 +0000 (19:43 +0200)]
mgr/dashboard: dashboard-v3: cluster-utilization card

mgr/dashboard: some improvements

mgr/dashboard: updated promqls and improved tooltips

mgr/dashboard: some fixes and improvements

mgr/dashboard: added square box plugin and improved/fixed code

mgr/dashboard: added maxValue option for graphs and added tooltips to text

mgr/dashboard: removed unnecessary variables

mgr/dashboard: fix openapi

tracker: https://tracker.ceph.com/issues/57863
Signed-off-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>
2 years agomgr/dashboard: dashboard-v3: inventory card
bryanmontalvan [Wed, 10 Aug 2022 18:22:02 +0000 (14:22 -0400)]
mgr/dashboard: dashboard-v3: inventory card

This commit starts the inventory-card which is one of the cards which
will be located in the landing-page revamp

tracker: https://tracker.ceph.com/issues/58065
Signed-off-by: bryanmontalvan <bmontalv@redhat.com>
Signed-off-by: Nizamudeen A <nia@redhat.com>
Signed-off-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>
2 years agomgr/dashboard: dashboard-v3: status card
bryanmontalvan [Wed, 3 Aug 2022 01:39:05 +0000 (21:39 -0400)]
mgr/dashboard: dashboard-v3: status card

This commit is the bare-bones work of the status card. The only logic
written in this commit is the Cluster health status icon.

tracker: https://tracker.ceph.com/issues/58728
Signed-off-by: bryanmontalvan <bmontalv@redhat.com>
mgr/dashboard: introduce active alerts to status cards

Signed-off-by: Nizamudeen A <nia@redhat.com>
2 years agoxmgr/dashboard: dashboard-v3: capacity card
Pedro Gonzalez Gomez [Mon, 1 Aug 2022 20:17:29 +0000 (22:17 +0200)]
xmgr/dashboard: dashboard-v3: capacity card

tracker: https://tracker.ceph.com/issues/57862
Signed-off-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>
mgr/dashboard: fix openapi tox error

Signed-off-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>
mgr/dashboard: capacity-card set interval and display bytes used

Signed-off-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>
mgr/dashboard: Landing Page v3

Signed-off-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>