git.apps.os.sepia.ceph.com Git - ceph-ci.git/log
ceph-ci.git
3 years agorgw: target_layout to be optional
Shilpa Manjarabad Jagannath [Mon, 6 Jul 2020 09:47:15 +0000 (15:17 +0530)]
rgw: target_layout to be optional

Signed-off-by: Shilpa Manjarabad Jagannath <smanjara@redhat.com>
3 years agorgw: use rgw::BucketReshardState for reshard state change
Shilpa Jagannath [Mon, 29 Jun 2020 19:17:53 +0000 (00:47 +0530)]
rgw: use rgw::BucketReshardState for reshard state change

Signed-off-by: Shilpa Jagannath <smanjara@redhat.com>
3 years agorgw: fetch updated bucketinfo rather than bucket_id in guard_reshard
Shilpa Jagannath [Wed, 10 Jun 2020 14:09:55 +0000 (19:39 +0530)]
rgw: fetch updated bucketinfo rather than bucket_id in guard_reshard

To be able to read the updated layout.

Signed-off-by: Shilpa Jagannath <smanjara@redhat.com>
3 years agorgw: use BucketReshardState states in appropriate places
Shilpa Jagannath [Tue, 9 Jun 2020 03:05:27 +0000 (08:35 +0530)]
rgw: use BucketReshardState states in appropriate places

This replaces cls_rgw_reshard_status.

Remove unused parameters in cls_rgw_bucket_instance_entry.
Other minor cleanup fixes.

Signed-off-by: Shilpa Jagannath <smanjara@redhat.com>
3 years agorgw/dynamic-resharding: Allow clean_index to take generation number
Shilpa Jagannath [Thu, 21 May 2020 12:59:15 +0000 (18:29 +0530)]
rgw/dynamic-resharding: Allow clean_index to take generation number

Signed-off-by: Shilpa Jagannath <smanjara@redhat.com>
3 years agorgw/dynamic-resharding: reshard bucket using the same instance id
Shilpa Jagannath [Thu, 21 May 2020 12:54:29 +0000 (18:24 +0530)]
rgw/dynamic-resharding: reshard bucket using the same instance id

Signed-off-by: Shilpa Jagannath <smanjara@redhat.com>
3 years agorgw/dynamic-resharding: remove creation of new bucket instance
Shilpa Jagannath [Thu, 21 May 2020 12:33:01 +0000 (18:03 +0530)]
rgw/dynamic-resharding: remove creation of new bucket instance

Signed-off-by: Shilpa Jagannath <smanjara@redhat.com>
3 years agotest/rgw: add --gen to radosgw-admin/help.t
Casey Bodley [Wed, 3 Feb 2021 15:39:18 +0000 (10:39 -0500)]
test/rgw: add --gen to radosgw-admin/help.t

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: add generations to error repo with binary format
Casey Bodley [Fri, 22 Jan 2021 23:28:50 +0000 (18:28 -0500)]
rgw: add generations to error repo with binary format

adds a backward-compatible binary encoding for error repo keys that can
contain a generation number along with the bucket and shard

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: add generation support to BILog admin APIs
Shilpa Jagannath [Thu, 3 Dec 2020 06:23:40 +0000 (11:53 +0530)]
rgw: add generation support to BILog admin APIs

Signed-off-by: Shilpa Jagannath <smanjara@redhat.com>
3 years agorgw: remove rgw_bucket_shard_sync_info::full_marker
Casey Bodley [Fri, 20 Nov 2020 17:32:48 +0000 (12:32 -0500)]
rgw: remove rgw_bucket_shard_sync_info::full_marker

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: disable backoff on data sync error_retry_time
Casey Bodley [Wed, 18 Nov 2020 21:24:11 +0000 (16:24 -0500)]
rgw: disable backoff on data sync error_retry_time

this backoff is triggered often by the per-bucket lease for full sync,
and causes tests to fail with checkpoint timeouts

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agoqa/rgw: rgw_multisite_tests task loads default plugins
Adam C. Emerson [Sat, 8 May 2021 00:01:11 +0000 (20:01 -0400)]
qa/rgw: rgw_multisite_tests task loads default plugins

the ability to filter tests by attribute is provided by the
nose.plugins.attrib plugin, which wasn't being loaded by default

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agoqa/rgw: disable multisite tests for 'bucket sync disable'
Casey Bodley [Tue, 10 Nov 2020 16:36:31 +0000 (11:36 -0500)]
qa/rgw: disable multisite tests for 'bucket sync disable'

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agoqa/rgw: add missing meta checkpoint to test_multipart_object_sync
Casey Bodley [Tue, 10 Nov 2020 21:38:10 +0000 (16:38 -0500)]
qa/rgw: add missing meta checkpoint to test_multipart_object_sync

if metadata sync hasn't finished, the 'bucket checkpoint' command may
not find its bucket info

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: RGWSyncBucketCR holds lease over Init state too
Casey Bodley [Tue, 10 Nov 2020 20:00:40 +0000 (15:00 -0500)]
rgw: RGWSyncBucketCR holds lease over Init state too

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: fix up BucketShardIncrementalSync log message
Casey Bodley [Tue, 6 Oct 2020 21:59:44 +0000 (17:59 -0400)]
rgw: fix up BucketShardIncrementalSync log message

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: move full sync from SyncBucketShard to SyncBucket
Casey Bodley [Tue, 6 Oct 2020 21:59:41 +0000 (17:59 -0400)]
rgw: move full sync from SyncBucketShard to SyncBucket

renamed ListBucketShardCR to ListRemoteBucketCR and removed the shard-id
parameter

renamed BucketFullSyncShardMarkerTrack to BucketFullSyncMarkerTrack,
which now updates the bucket-level rgw_bucket_sync_status

renamed BucketShardFullSyncCR to BucketFullSyncCR

BucketSyncCR now takes a bucket-wide lease during full sync

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agoradosgw-admin: 'bucket sync checkpoint' waits for full sync
Casey Bodley [Tue, 6 Oct 2020 21:59:39 +0000 (17:59 -0400)]
radosgw-admin: 'bucket sync checkpoint' waits for full sync

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agoradosgw-admin: 'bucket sync status' displays new full sync status
Casey Bodley [Tue, 6 Oct 2020 21:59:36 +0000 (17:59 -0400)]
radosgw-admin: 'bucket sync status' displays new full sync status

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: add rgw_read_bucket_full_sync_status()
Casey Bodley [Tue, 6 Oct 2020 21:59:34 +0000 (17:59 -0400)]
rgw: add rgw_read_bucket_full_sync_status()

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: InitBucketShardSyncStatus always sets state to Incremental
Casey Bodley [Tue, 6 Oct 2020 21:59:31 +0000 (17:59 -0400)]
rgw: InitBucketShardSyncStatus always sets state to Incremental

full sync happens at the bucket level, so the shards will always start
in StateIncrementalSync

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: split SyncBucket from SyncBucketShard
Casey Bodley [Tue, 6 Oct 2020 21:59:29 +0000 (17:59 -0400)]
rgw: split SyncBucket from SyncBucketShard

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: add InitBucketFullSyncStatusCR
Casey Bodley [Tue, 6 Oct 2020 21:59:28 +0000 (17:59 -0400)]
rgw: add InitBucketFullSyncStatusCR

a coroutine to initialize a bucket for full sync using a new bucket-wide
sync status object

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: RGWSimpleRadosWriteAttrsCR supports exclusive create
Casey Bodley [Tue, 13 Oct 2020 13:56:11 +0000 (09:56 -0400)]
rgw: RGWSimpleRadosWriteAttrsCR supports exclusive create

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: system objects can set exclusive on set_attrs()
Casey Bodley [Tue, 13 Oct 2020 13:54:56 +0000 (09:54 -0400)]
rgw: system objects can set exclusive on set_attrs()

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: add exclusive flag to RGWSimpleRadosWriteCR
Casey Bodley [Tue, 6 Oct 2020 21:59:26 +0000 (17:59 -0400)]
rgw: add exclusive flag to RGWSimpleRadosWriteCR

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: generalize error handling in RGWShardCollectCR
Casey Bodley [Tue, 6 Oct 2020 21:59:24 +0000 (17:59 -0400)]
rgw: generalize error handling in RGWShardCollectCR

RGWShardCollectCR was hard-coded to ignore ENOENT errors and print a
'failed to fetch log status' error message. this moves that logic into a
handle_result() virtual function. it also exposes the member variables
'status' and 'max_concurrent' as protected, so they can be consulted or
modified by overrides of handle_result() and spawn_next()

Signed-off-by: Casey Bodley <cbodley@redhat.com>
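
The refactoring described in the commit above resembles the template-method pattern: the base class keeps the collection loop and subclasses override a hook to decide what counts as an error. A minimal Python sketch of that idea follows. The real class is C++, and `ShardCollector`, `StrictCollector`, and `run` are hypothetical names used only for illustration; only `handle_result`, `status`, and `max_concurrent` are taken from the commit message.

```python
import errno


class ShardCollector:
    """Illustrative stand-in for a collector that gathers per-shard results."""

    def __init__(self, max_concurrent=4):
        # Visible to subclasses, mirroring the members the commit exposes.
        self.status = 0
        self.max_concurrent = max_concurrent

    def handle_result(self, ret):
        """Overridable hook. Default policy: treat ENOENT as success."""
        if ret == -errno.ENOENT:
            return 0
        return ret

    def run(self, results):
        """Pass every shard result through the hook and keep the last error."""
        for ret in results:
            ret = self.handle_result(ret)
            if ret < 0:
                self.status = ret
        return self.status


class StrictCollector(ShardCollector):
    """Hypothetical override that reports ENOENT instead of ignoring it."""

    def handle_result(self, ret):
        return ret
```

With this shape, a caller that needs different error handling subclasses the collector rather than editing the shared loop.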
3 years agorgw: add sync_bucket_shard_cr() factory function
Casey Bodley [Tue, 6 Oct 2020 21:59:22 +0000 (17:59 -0400)]
rgw: add sync_bucket_shard_cr() factory function

allows other code to spawn this coroutine without having the class
definition

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: rename to RGWSyncBucketShardCR
Casey Bodley [Tue, 6 Oct 2020 21:59:20 +0000 (17:59 -0400)]
rgw: rename to RGWSyncBucketShardCR

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: add full_status_oid() for buckets
Casey Bodley [Tue, 6 Oct 2020 21:59:18 +0000 (17:59 -0400)]
rgw: add full_status_oid() for buckets

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: use const for string constants in rgw_data_sync.cc
Casey Bodley [Tue, 6 Oct 2020 21:59:16 +0000 (17:59 -0400)]
rgw: use const for string constants in rgw_data_sync.cc

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: rename to inc_status_oid
Casey Bodley [Tue, 6 Oct 2020 21:59:14 +0000 (17:59 -0400)]
rgw: rename to inc_status_oid

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: add data structures for bucket sync status
Casey Bodley [Tue, 6 Oct 2020 21:59:11 +0000 (17:59 -0400)]
rgw: add data structures for bucket sync status

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: rename to rgw_read_bucket_inc_sync_status
Casey Bodley [Tue, 6 Oct 2020 21:59:09 +0000 (17:59 -0400)]
rgw: rename to rgw_read_bucket_inc_sync_status

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: remove unused RGWRunBucketsSyncBySourceCR
Casey Bodley [Tue, 6 Oct 2020 21:59:07 +0000 (17:59 -0400)]
rgw: remove unused RGWRunBucketsSyncBySourceCR

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: RGWSimpleRadosReadCR copies out objv_tracker
Casey Bodley [Tue, 6 Oct 2020 21:59:00 +0000 (17:59 -0400)]
rgw: RGWSimpleRadosReadCR copies out objv_tracker

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agoMerge pull request #46404 from phlogistonjohn/jjm-obj-format-pkg-fix
Adam King [Fri, 27 May 2022 12:14:58 +0000 (08:14 -0400)]
Merge pull request #46404 from phlogistonjohn/jjm-obj-format-pkg-fix

debian: include the new object_format.py file

Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: Venky Shankar <vshankar@redhat.com>
3 years agoMerge pull request #46356 from xxhdx1985126/wip-backref-gc-opt
Yingxin [Fri, 27 May 2022 08:23:22 +0000 (16:23 +0800)]
Merge pull request #46356 from xxhdx1985126/wip-backref-gc-opt

crimson/os/seastore/segment_cleaner: add dedicated backref trimming process

Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
3 years agoMerge pull request #45544 from wxypro/fix-typos-cephfs
Venky Shankar [Fri, 27 May 2022 05:43:43 +0000 (11:13 +0530)]
Merge pull request #45544 from wxypro/fix-typos-cephfs

common/options: fix typo

Reviewed-by: Venky Shankar <vshankar@redhat.com>
3 years agodebian: include the new object_format.py file
John Mulligan [Thu, 26 May 2022 14:56:44 +0000 (10:56 -0400)]
debian: include the new object_format.py file

This was missed when adding object_format.py to the RPM packaging.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
3 years agocrimson/os/seastore/segment_cleaner: add dedicated backref trimming process
Xuehan Xu [Fri, 20 May 2022 09:23:03 +0000 (17:23 +0800)]
crimson/os/seastore/segment_cleaner: add dedicated backref trimming process

Space reclamation needs to merge backrefs up to the point where the latest
release of extents within the scope of the reclamation process happened.
When the journal size is large, that merge may generate a transaction
record whose size exceeds the max record size threshold. So we need a
backref trimming process that merges most of the backrefs before the space
reclamation happens.

This commit also fixes https://tracker.ceph.com/issues/55692 by
repeating the inflight backref trimming transaction when it is
invalidated by other transactions on the ROOT block.

Fixes: https://tracker.ceph.com/issues/55692
Signed-off-by: Xuehan Xu <xxhdx1985126@gmail.com>
3 years agoMerge pull request #46271 from xxhdx1985126/wip-backref-new_mapping-opt
Yingxin [Thu, 26 May 2022 02:24:37 +0000 (10:24 +0800)]
Merge pull request #46271 from xxhdx1985126/wip-backref-new_mapping-opt

crimson/os/seastore: backref related clean ups

Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
3 years agoMerge commit 'b76b6ea16899861d6c266797b3c11d9f68f924d7' into main
Patrick Donnelly [Wed, 25 May 2022 20:01:57 +0000 (16:01 -0400)]
Merge commit 'b76b6ea16899861d6c266797b3c11d9f68f924d7' into main

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
3 years agoMerge pull request #46390 from gabriellasroman/inclusive-lang
David Galloway [Wed, 25 May 2022 19:47:07 +0000 (15:47 -0400)]
Merge pull request #46390 from gabriellasroman/inclusive-lang

teuthology-cronjobs: master->main

3 years agoteuthology-cronjobs: master->main
Gabriella [Wed, 25 May 2022 12:01:11 +0000 (08:01 -0400)]
teuthology-cronjobs: master->main

Signed-off-by: Gabriella Roman <gsroman@bu.edu>
3 years agoMerge pull request #44486 from batrick/fs-workload-kclient-switches
Venky Shankar [Wed, 25 May 2022 01:24:53 +0000 (06:54 +0530)]
Merge pull request #44486 from batrick/fs-workload-kclient-switches

qa: reduce fs:workload use of random selection

Reviewed-by: Venky Shankar <vshankar@redhat.com>
3 years agoMerge pull request #46385 from ceph/inclusive-rename-yaml
David Galloway [Tue, 24 May 2022 21:04:50 +0000 (17:04 -0400)]
Merge pull request #46385 from ceph/inclusive-rename-yaml

Rename/re-symlink whitelist_*.yaml

3 years agoRename/re-symlink whitelist_*.yaml
Zack Cerza [Tue, 24 May 2022 19:31:12 +0000 (13:31 -0600)]
Rename/re-symlink whitelist_*.yaml

whitelist_health.yaml -> ignorelist_health.yaml
whitelist_wrongly_marked_down.yaml -> ignore_wrongly_marked_down.yaml

This was mostly addressed in
https://github.com/ceph/ceph/commit/2ee9365d0bc220523632753f02c61eb5c2f4fbdd,
but the rename wasn't done there.

Signed-off-by: Zack Cerza <zack@cerza.org>
3 years agoMerge pull request #44967 from batrick/nested-subset-doc
Zack Cerza [Tue, 24 May 2022 16:23:48 +0000 (10:23 -0600)]
Merge pull request #44967 from batrick/nested-subset-doc

doc/dev: document nested subsets in teuthology

3 years agoMerge pull request #46099 from NitzanMordhai/wip-nitzan-rados-test-delete-pools-parallel
Yuri Weinstein [Tue, 24 May 2022 15:02:26 +0000 (08:02 -0700)]
Merge pull request #46099 from NitzanMordhai/wip-nitzan-rados-test-delete-pools-parallel

systest: fix race with pool delete after socket injection

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
3 years agoMerge pull request #45825 from NitzanMordhai/wip-nitzan-watch-notify-reconnect-107
Yuri Weinstein [Tue, 24 May 2022 15:01:45 +0000 (08:01 -0700)]
Merge pull request #45825 from NitzanMordhai/wip-nitzan-watch-notify-reconnect-107

librados/watch_notify: reconnect after socket injection

Reviewed-by: Brad Hubbard <bhubbard@redhat.com>
3 years agoMerge pull request #45805 from ifed01/wip-ifed-more-stats
Yuri Weinstein [Tue, 24 May 2022 15:01:19 +0000 (08:01 -0700)]
Merge pull request #45805 from ifed01/wip-ifed-more-stats

os/bluestore: more debug information from BlueStore

Reviewed-by: Adam Kupczyk <akupczyk@redhat.com>
3 years agoMerge pull request #45409 from NUABO/tancz
Yuri Weinstein [Tue, 24 May 2022 14:59:36 +0000 (07:59 -0700)]
Merge pull request #45409 from NUABO/tancz

mon: add proxy to cache tier options

Reviewed-by: Neha Ojha <nojha@redhat.com>
3 years agoMerge pull request #46324 from ceph/dgalloway-nodockerio
Adam King [Tue, 24 May 2022 14:27:03 +0000 (10:27 -0400)]
Merge pull request #46324 from ceph/dgalloway-nodockerio

qa: Pull from quay.io not docker.io

Reviewed-by: Adam King <adking@redhat.com>
3 years agoMerge pull request #46366 from kalebskeithley/rocksdb
kalebskeithley [Tue, 24 May 2022 11:06:50 +0000 (07:06 -0400)]
Merge pull request #46366 from kalebskeithley/rocksdb

rocksdb: build with rocksdb-7.y.z

3 years agoMerge pull request #46185 from dparmar18/fix_rm_error_output
Rishabh Dave [Tue, 24 May 2022 10:43:48 +0000 (16:13 +0530)]
Merge pull request #46185 from dparmar18/fix_rm_error_output

cephfs-shell: fix rm cmd's error output

Reviewed-by: Rishabh Dave <ridave@redhat.com>
Reviewed-by: Neeraj Pratap Singh <neesingh@redhat.com>
3 years agoMerge pull request #46160 from dparmar18/fix_tracker_40860
Rishabh Dave [Tue, 24 May 2022 10:40:26 +0000 (16:10 +0530)]
Merge pull request #46160 from dparmar18/fix_tracker_40860

cephfs-shell: fix rmdir cmd

Reviewed-by: Rishabh Dave <ridave@redhat.com>
3 years agoMerge pull request #46249 from rhcs-dashboard/rbd-mirroring-commands
Ernesto Puerta [Tue, 24 May 2022 10:38:14 +0000 (12:38 +0200)]
Merge pull request #46249 from rhcs-dashboard/rbd-mirroring-commands

mgr/dashboard: enable/disable/promote/demote/resync mirroring images

Reviewed-by: Avan Thakkar <athakkar@redhat.com>
Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Ilya Dryomov <idryomov@redhat.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>
3 years agocrimson/os/seastore/segment_cleaner: downgrade the level of unnecessary logs to DEBUG
Xuehan Xu [Sun, 15 May 2022 23:59:54 +0000 (07:59 +0800)]
crimson/os/seastore/segment_cleaner: downgrade the level of unnecessary logs to DEBUG

Signed-off-by: Xuehan Xu <xxhdx1985126@gmail.com>
3 years agocrimson/os/seastore/backref: rename BackrefManager::batch_insert_from_cache to Backre...
Xuehan Xu [Sun, 15 May 2022 07:24:03 +0000 (15:24 +0800)]
crimson/os/seastore/backref: rename BackrefManager::batch_insert_from_cache to BackrefManager::merge_cached_backrefs

Signed-off-by: Xuehan Xu <xxhdx1985126@gmail.com>
3 years agocrimson/os/seastore/segment_cleaner: don't repeat backref mapping retrieval when...
Xuehan Xu [Sun, 15 May 2022 06:45:51 +0000 (14:45 +0800)]
crimson/os/seastore/segment_cleaner: don't repeat backref mapping retrieval when gc transactions get invalidated

Signed-off-by: Xuehan Xu <xxhdx1985126@gmail.com>
3 years agocrimson/os/seastore: mandate all access to backrefs to go through backref manager
Xuehan Xu [Fri, 13 May 2022 08:50:10 +0000 (16:50 +0800)]
crimson/os/seastore: mandate all access to backrefs to go through backref manager

this would avoid other components' unnecessary dependency on Cache

Signed-off-by: Xuehan Xu <xxhdx1985126@gmail.com>
3 years agoMerge pull request #45728 from lxbsz/wip-55144
Venky Shankar [Tue, 24 May 2022 02:24:16 +0000 (07:54 +0530)]
Merge pull request #45728 from lxbsz/wip-55144

client: stop retrying the request when exceeding 256 times

Reviewed-by: Venky Shankar <vshankar@redhat.com>
3 years agoMerge pull request #46034 from mchangir/mgr-snap_schedule-remove-subvolume-support
Venky Shankar [Tue, 24 May 2022 01:43:32 +0000 (07:13 +0530)]
Merge pull request #46034 from mchangir/mgr-snap_schedule-remove-subvolume-support

mgr/snap_schedule: remove subvol interface

Reviewed-by: Venky Shankar <vshankar@redhat.com>
3 years agoMerge pull request #46351 from cyx1231st/wip-seastore-cleaner-cleanup
Yingxin [Tue, 24 May 2022 01:34:48 +0000 (09:34 +0800)]
Merge pull request #46351 from cyx1231st/wip-seastore-cleaner-cleanup

crimson/os/seastore/segment_cleaner: minor cleanups

Reviewed-by: Kefu Chai <tchaikov@gmail.com>
Reviewed-by: Xuehan Xu <xxhdx1985126@gmail.com>
3 years agoMerge pull request #46314 from rishabh-d-dave/qa-fs-fsid
Venky Shankar [Tue, 24 May 2022 01:26:13 +0000 (06:56 +0530)]
Merge pull request #46314 from rishabh-d-dave/qa-fs-fsid

qa/vstart_runner: store FSID in LocalContext instances

Reviewed-by: Venky Shankar <vshankar@redhat.com>
3 years agorocksdb: build with rocksdb-7.y.z
Kaleb S. KEITHLEY [Mon, 23 May 2022 11:41:26 +0000 (07:41 -0400)]
rocksdb: build with rocksdb-7.y.z

RocksDB 7, specifically 7.2.2, has landed in Fedora 37/rawhide.

https://tracker.ceph.com/issues/55730

Signed-off-by: Kaleb S. KEITHLEY <kkeithle@redhat.com>
3 years agoMerge pull request #46346 from ljflores/wip-lflores-testing-recovery
Neha Ojha [Mon, 23 May 2022 23:26:27 +0000 (16:26 -0700)]
Merge pull request #46346 from ljflores/wip-lflores-testing-recovery

qa/suites/rados/thrash-erasure-code-big/thrashers: add `osd max backfills` setting to mapgap and pggrow

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Sridhar Seshasayee <sseshasa@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
3 years agoMerge pull request #46124 from SMIL-Infra/reopen_cluster_logs
Vikhyat Umrao [Mon, 23 May 2022 19:49:35 +0000 (12:49 -0700)]
Merge pull request #46124 from SMIL-Infra/reopen_cluster_logs

mon/LogMonitor: reopen log files on SIGHUP

3 years agoMerge pull request #46233 from rhcs-dashboard/fix-overlapping-form-icons
Ernesto Puerta [Mon, 23 May 2022 18:30:08 +0000 (20:30 +0200)]
Merge pull request #46233 from rhcs-dashboard/fix-overlapping-form-icons

mgr/dashboard: form field validation icons overlap with other icons

Reviewed-by: Avan Thakkar <athakkar@redhat.com>
Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Pere Diaz Bou <pdiazbou@redhat.com>
3 years agoMerge pull request #45985 from soumyakoduri/wip-skoduri-dbstore-tests
Casey Bodley [Mon, 23 May 2022 16:55:41 +0000 (12:55 -0400)]
Merge pull request #45985 from soumyakoduri/wip-skoduri-dbstore-tests

rgw/qa: Run s3tests on dbstore backend in teuthology

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>
Reviewed-by: Casey Bodley <cbodley@redhat.com>
3 years agomgr/dashboard: expose image mirroring commands as endpoints
Pere Diaz Bou [Thu, 12 May 2022 18:29:01 +0000 (20:29 +0200)]
mgr/dashboard: expose image mirroring commands as endpoints

Expose:
  - enable/disable mirroring in image
  - promote/demote (primary and non-primary)
  - resync
  - snapshot mode:
    - mirror image snapshot (manual snapshot)
    - schedule

Fixes: https://tracker.ceph.com/issues/55645
Signed-off-by: Pere Diaz Bou <pdiazbou@redhat.com>
3 years agoclient: stop retrying the request when exceeding 256 times
Xiubo Li [Thu, 31 Mar 2022 07:16:49 +0000 (15:16 +0800)]
client: stop retrying the request when exceeding 256 times

The type of 'retry_attempt' in 'MetaRequest' is 'int', while in
'ceph_mds_request_head' the type of 'num_retry' is '__u8'. So if
a request is retried more than 256 times, the MDS will receive
an incorrect retry seq.

This case usually indicates a bug in the MDS, and continuing to retry
the request makes no sense. For now, limit it to 256. In the future
this could be fixed in the ceph code, so avoid hard-coding the limit here.

Fixes: https://tracker.ceph.com/issues/55144
Signed-off-by: Xiubo Li <xiubli@redhat.com>
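
The width mismatch described in the commit above can be illustrated with a short Python sketch. It is not the client code: `to_u8` and `may_retry` are hypothetical helpers, and the exact comparison used by the real fix is an assumption. The sketch only shows how a counter stored in an unsigned 8-bit field wraps once it passes 255.

```python
U8_MAX = 0xFF  # largest value an unsigned 8-bit field can hold


def to_u8(value):
    """Truncate a non-negative integer the way an unsigned 8-bit field would."""
    return value & U8_MAX


def may_retry(retry_attempt, limit=U8_MAX + 1):
    """Hypothetical guard: stop once the counter no longer fits in 8 bits."""
    return retry_attempt < limit


# Show the value the receiver would see for counters around the boundary.
for attempt in (255, 256, 257):
    print(attempt, to_u8(attempt), may_retry(attempt))
```

Deriving the limit from the field width, as `U8_MAX + 1` does here, avoids repeating the literal 256 if the field is widened later.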
3 years agoMerge pull request #45844 from lxbsz/wip-55253
Venky Shankar [Mon, 23 May 2022 05:57:33 +0000 (11:27 +0530)]
Merge pull request #45844 from lxbsz/wip-55253

client: switch AT_NO_ATTR_SYNC to AT_STATX_DONT_SYNC

Reviewed-by: Venky Shankar <vshankar@redhat.com>
3 years agoMerge pull request #46337 from rishabh-d-dave/missing-arg-exit-code
Venky Shankar [Mon, 23 May 2022 05:35:03 +0000 (11:05 +0530)]
Merge pull request #46337 from rishabh-d-dave/missing-arg-exit-code

cephfs-shell: fixes related to Cmd2ArgparseError

Reviewed-by: Venky Shankar <vshankar@redhat.com>
3 years agorgw/qa: Run s3tests on dbstore backend in teuthology
Soumya Koduri [Thu, 5 May 2022 18:33:56 +0000 (00:03 +0530)]
rgw/qa: Run s3tests on dbstore backend in teuthology

Add a new test-suite to run s3tests on dbstore backend.

Signed-off-by: Soumya Koduri <skoduri@redhat.com>
3 years agoMerge pull request #44264 from tchaikov/wip-mipel
Kefu Chai [Sat, 21 May 2022 12:27:06 +0000 (20:27 +0800)]
Merge pull request #44264 from tchaikov/wip-mipel

cmake: do not use GCC extension when detecting 16-byte atomic op

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
3 years agoMerge pull request #45835 from tchaikov/wip-55189
Kefu Chai [Sat, 21 May 2022 12:10:30 +0000 (20:10 +0800)]
Merge pull request #45835 from tchaikov/wip-55189

common/ceph_json: dump bool using f->dump_bool()

Reviewed-by: Casey Bodley <cbodley@redhat.com>
Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
3 years agoMerge pull request #46355 from tchaikov/wip-no-annobin
Kefu Chai [Sat, 21 May 2022 09:34:02 +0000 (17:34 +0800)]
Merge pull request #46355 from tchaikov/wip-no-annobin

ceph.spec.in: disable annobin plugin if compile with gcc-toolset

Reviewed-by: Yuri Weinstein <yweins@redhat.com>
3 years agocommon/ceph_json: dump bool using f->dump_bool()
Kefu Chai [Sun, 10 Apr 2022 01:23:59 +0000 (09:23 +0800)]
common/ceph_json: dump bool using f->dump_bool()

as per https://www.json.org/json-en.html, JSON encodes bool as
"true" or "false", without the quotes. before this change, the quotes
were always added when encoding boolean values.

but this change is not backward compatible.

encode_json()'s bool overload is used by rgw. it uses JSONObj
defined in common/ceph_json.h to decode JSON-encoded structs.
and it does not differentiate bool from str when decoding a boolean
value, even though it could have checked the "quoted" member variable
of JSONObj to validate the type of the value. so we should be fine.

Fixes: https://tracker.ceph.com/issues/55189
Signed-off-by: Kefu Chai <tchaikov@gmail.com>
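
The compatibility argument in the commit above can be sketched in Python with the standard `json` module. This is an illustration only: `decode_bool_lenient` is a hypothetical helper standing in for a decoder that does not distinguish quoted from unquoted booleans, and it is not the JSONObj implementation.

```python
import json


def decode_bool_lenient(value):
    """Accept a JSON boolean or the strings "true"/"false" (hypothetical helper)."""
    if isinstance(value, bool):
        return value
    if isinstance(value, str) and value in ("true", "false"):
        return value == "true"
    raise ValueError(f"not a boolean: {value!r}")


# Old-style output: the boolean is emitted as a quoted string.
old_style = '{"enabled": "true"}'
# New-style output: a standard, unquoted JSON boolean.
new_style = json.dumps({"enabled": True})
```

A lenient decoder reads both encodings as the same value, which is why readers of that kind keep working after the encoder changes. A strict consumer that expects a string would not.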
3 years agoMerge pull request #46293 from ronen-fr/wip-rf-sched-test
Ronen Friedman [Sat, 21 May 2022 07:03:17 +0000 (10:03 +0300)]
Merge pull request #46293 from ronen-fr/wip-rf-sched-test

test/osd: unit-tests for the scrubber scheduler

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Nitzan Mordechai <nmordech@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
3 years agoMerge pull request #45467 from phlogistonjohn/jjm-format-exp2
Adam King [Fri, 20 May 2022 22:36:24 +0000 (18:36 -0400)]
Merge pull request #45467 from phlogistonjohn/jjm-format-exp2

A module and decorator for generically handling format= in python mgr modules

Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Redouane Kachach <rkachach@redhat.com>
3 years agoMerge pull request #45786 from adk3798/staggered-upgrade
Adam King [Fri, 20 May 2022 22:33:36 +0000 (18:33 -0400)]
Merge pull request #45786 from adk3798/staggered-upgrade

mgr/cephadm: staggered upgrade

Reviewed-by: Anthony D'Atri <anthonyeleven@users.noreply.github.com>
Reviewed-by: Redouane Kachach <rkachach@redhat.com>
3 years agoceph.spec.in: disable annobin plugin if compile with gcc-toolset
Kefu Chai [Fri, 20 May 2022 06:25:10 +0000 (14:25 +0800)]
ceph.spec.in: disable annobin plugin if compile with gcc-toolset

we are seeing FTBFS failures when compiling the packages with seastar
enabled, like:

Package gcc-toolset-9-annobin-9.08-4.el8.x86_64 is already installed.
...
Package redhat-rpm-config-130-1.el8.noarch is already installed.
...
CFLAGS=-O2 -g -pipe -Wall -Werror=format-security  -Wp,-D_GLIBCXX_ASSERTIONS -fexceptions -fstack-protector-strong -grecord-gcc-switches -specs=/usr/lib/rpm/redhat/redhat-hardened-cc1 -specs=/usr/lib/rpm/redhat/redhat-annobin-cc1 -m64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection
...
-- The CXX compiler identification is GNU 9.2.1
....
    Building CXX object CMakeFiles/cmTC_386fb.dir/testCXXCompiler.cxx.o
    /opt/rh/gcc-toolset-9/root/usr/bin/c++   -O2 -g -pipe -Wall -Werror=format-security  -Wp,-D_GLIBCXX_ASSERTIONS -fexceptions -fstack-protector-strong -grecord-gcc-switches -specs=/usr/lib/rpm/redhat/redhat-hardened-cc1 -specs=/usr/lib/rpm/redhat/redhat-annobin-cc1 -m64 -mtune=generic -fasynchronous-unwind-tables -fstack-clash-protection -fcf-protection  -o CMakeFiles/cmTC_386fb.dir/testCXXCompiler.cxx.o -c /home/jenkins-build/build/workspace/ceph-dev-new-build/ARCH/x86_64/AVAILABLE_ARCH/x86_64/AVAILABLE_DIST/centos8/DIST/centos8/MACHINE_SIZE/gigantic/release/17.2.0-209-g3e01cd41/rpm/el8/BUILD/ceph-17.2.0-209-g3e01cd41/x86_64-redhat-linux-gnu/CMakeFiles/CMakeTmp/testCXXCompiler.cxx
    cc1plus: fatal error: inaccessible plugin file /opt/rh/gcc-toolset-9/root/usr/lib/gcc/x86_64-redhat-linux/9/plugin/gcc-annobin.so expanded from short plugin name gcc-annobin: No such file or directory

the root cause is that redhat-rpm-config expects gcc-annobin:

$ cat /usr/lib/rpm/redhat/redhat-annobin-cc1
*cc1_options:
+ %{!-fno-use-annobin:%{!iplugindir*:%:find-plugindir()} -fplugin=gcc-annobin}

while this plugin is not installed by gcc-toolset-9-annobin, which installs only annobin.so:

$ rpm -ql gcc-toolset-9-annobin| grep annobin.so
/opt/rh/gcc-toolset-9/root/usr/lib/gcc/x86_64-redhat-linux/9/plugin/annobin.so
/opt/rh/gcc-toolset-9/root/usr/lib/gcc/x86_64-redhat-linux/9/plugin/annobin.so.0
/opt/rh/gcc-toolset-9/root/usr/lib/gcc/x86_64-redhat-linux/9/plugin/annobin.so.0.0.0

but gcc-toolset-8-annobin provides this file. upgrading to
gcc-toolset-11 does not help. see https://centos.pkgs.org/8-stream/centos-appstream-x86_64/gcc-toolset-11-annobin-plugin-gcc-10.23-1.el8.x86_64.rpm.html
so, the interim solution is to disable the plugin if
we want to use gcc-toolset to build rpm packages.

in this change, _annotated_build is undefined to prevent the compiler
from adding extra information to the binary. in general this change
should be safe, although without this information it is hard to tell
whether the binary is hardened or what ABI version it expects. see
also https://fedoraproject.org/wiki/Changes/Annobin

Signed-off-by: Kefu Chai <tchaikov@gmail.com>
3 years agoqa/cephfs: remove temporary files
Rishabh Dave [Thu, 19 May 2022 18:29:25 +0000 (23:59 +0530)]
qa/cephfs: remove temporary files

These temporary files don't matter for test execution with teuthology
but they do matter for execution with vstart_runner.py since the test
fails if these files exist already. And tests are often run repeatedly
with vstart_runner.py, unlike with teuthology.

Fixes: https://tracker.ceph.com/issues/55719
Signed-off-by: Rishabh Dave <ridave@redhat.com>
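
The failure mode described in the commit above, a test that breaks on rerun because a file from the previous run still exists, can be reproduced with a small Python sketch. `fresh_path` and `write_exclusive` are hypothetical helpers, not code from the test suite. They only illustrate why removing leftovers makes repeated runs safe.

```python
import os
import tempfile


def fresh_path(directory, name):
    """Return a path inside `directory`, deleting any leftover file first."""
    path = os.path.join(directory, name)
    if os.path.exists(path):
        os.remove(path)
    return path


def write_exclusive(path, data):
    """Create `path`, raising FileExistsError if it is already there."""
    with open(path, "x") as f:
        f.write(data)


# Two consecutive "runs" in the same directory both succeed because
# the leftover file is removed before each one.
workdir = tempfile.mkdtemp()
for run in ("first", "second"):
    write_exclusive(fresh_path(workdir, "scratch.txt"), run)
```

Cleaning up at the end of a test has the same effect. Removing leftovers up front, as sketched here, also covers runs that were interrupted before cleanup.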
3 years agocrimson/os/seastore/cache: ignore maybe unused variables
Yingxin Cheng [Thu, 19 May 2022 07:44:16 +0000 (15:44 +0800)]
crimson/os/seastore/cache: ignore maybe unused variables

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>
3 years agocrimson/os/seastore/segment_cleaner: cleanup logs
Yingxin Cheng [Fri, 20 May 2022 03:04:09 +0000 (11:04 +0800)]
crimson/os/seastore/segment_cleaner: cleanup logs

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>
3 years agocrimson/os/seastore/segment_cleaner: move implementations to cc
Yingxin Cheng [Fri, 20 May 2022 02:30:11 +0000 (10:30 +0800)]
crimson/os/seastore/segment_cleaner: move implementations to cc

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>
3 years agoMerge pull request #46335 from cyx1231st/wip-seastore-onode-fix-unaligned-access
Samuel Just [Fri, 20 May 2022 02:16:29 +0000 (19:16 -0700)]
Merge pull request #46335 from cyx1231st/wip-seastore-onode-fix-unaligned-access

crimson/onode-staged-tree: fix unaligned reference to shard_pool_t::pool

Reviewed-by: Samuel Just <sjust@redhat.com>
3 years agoMerge pull request #46333 from athanatos/sjust/wip-55705
Samuel Just [Fri, 20 May 2022 02:14:29 +0000 (19:14 -0700)]
Merge pull request #46333 from athanatos/sjust/wip-55705

crimson: generalize journal_seq_t to tolerate different paddr_t types

Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
Reviewed-by: Myoungwon Oh <myoungwon.oh@samsung.com>
3 years agoMerge pull request #46310 from cyx1231st/wip-seastore-tune-cleaner
Samuel Just [Fri, 20 May 2022 02:11:44 +0000 (19:11 -0700)]
Merge pull request #46310 from cyx1231st/wip-seastore-tune-cleaner

crimson/os/seastore: increase journal size and decrease rewrite-dirty size

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Xuehan Xu <xxhdx1985126@gmail.com>
3 years agoMerge pull request #46122 from tchaikov/wip-pmem
Kefu Chai [Thu, 19 May 2022 23:55:39 +0000 (07:55 +0800)]
Merge pull request #46122 from tchaikov/wip-pmem

blk/pmem: refactor pmem_check_file_type() using std::filesystem

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
3 years agoqa/suites/rados/thrash-erasure-code-big/thrashers: add `osd max backfills` setting...
Laura Flores [Mon, 16 May 2022 22:59:42 +0000 (17:59 -0500)]
qa/suites/rados/thrash-erasure-code-big/thrashers: add `osd max backfills` setting to mapgap and pggrow

All `rados/thrash-erasure-code-big` tests that die due to the “wait_for_recovery” timeout have one thing in common: They contain either `thrashers/pggrow` or `thrashers/mapgap`.

The difference between pggrow and mapgap vs. all other non-offending thrashers (default, careful, fastread, and morepggrow) is that they lack an override setting for `osd max backfills`. `osd max backfills` is the max number of backfill operations allowed to/from an OSD. The higher the number, the quicker the recovery. By default, this value is 1. On all of the non-offending thrashers (default, careful, fastread, and morepggrow), the default 1 value gets overridden in their .yaml files with a value > 1. This is not the case for pggrow and mapgap, however, as they lack an `osd max backfills` override setting.

The mclock op scheduler is known to override `osd max backfills` with a high value, but all of the thrash-erasure-code-big thrashers have their op queue set to “debug_random”, which chooses randomly between op queues (the debug_random op queue is set to override the default mclock_scheduler in qa/config/rados.yaml). So, coupled with the “debug_random” op queue, the low `osd max backfills` setting is causing some tests to time out in recovery.

WITHOUT `osd max backfills`, as they are now, “mapgap” and “pggrow” tests die due to timed-out recovery about 17/100 times, as seen here with a pggrow test: http://pulpito.front.sepia.ceph.com/lflores-2022-05-18_14:24:29-rados:thrash-erasure-code-big-master-distro-default-smithi/

WITH `osd max backfills` specified, as I have suggested in this PR, 99/100 tests passed, with one test failing for a different reason:
http://pulpito.front.sepia.ceph.com/lflores-2022-05-17_22:40:27-rados:thrash-erasure-code-big-master-distro-default-smithi/

I also scheduled 145 tests WITH `osd max backfills` that are a mix of pggrow and mapgap thrashers. 144/145 tests passed, with one test failing for a different reason. http://pulpito.front.sepia.ceph.com/lflores-2022-05-17_15:27:54-rados:thrash-erasure-code-big-master-distro-default-smithi/

Fixes: https://tracker.ceph.com/issues/51076
Signed-off-by: Laura Flores <lflores@redhat.com>
3 years agoMerge pull request #45562 from tchaikov/wip-vstart-with-pmem
Radoslaw Zarzynski [Thu, 19 May 2022 22:08:57 +0000 (00:08 +0200)]
Merge pull request #45562 from tchaikov/wip-vstart-with-pmem

vstart.sh: add options to enable PMEM bluestore backend

Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
3 years agodoc/cephadm: staggered upgrade docs
Adam King [Tue, 19 Apr 2022 17:20:45 +0000 (13:20 -0400)]
doc/cephadm: staggered upgrade docs

Signed-off-by: Adam King <adking@redhat.com>
3 years agomgr/cephadm: unit test for staggered upgrade param validation
Adam King [Wed, 13 Apr 2022 04:36:02 +0000 (00:36 -0400)]
mgr/cephadm: unit test for staggered upgrade param validation

Signed-off-by: Adam King <adking@redhat.com>
3 years agoqa/suites/orch/cephadm: staggered upgrade test
Adam King [Tue, 12 Apr 2022 16:39:26 +0000 (12:39 -0400)]
qa/suites/orch/cephadm: staggered upgrade test

Signed-off-by: Adam King <adking@redhat.com>
3 years agomgr/cephadm: autopep8 changes
Adam King [Tue, 5 Apr 2022 21:38:20 +0000 (17:38 -0400)]
mgr/cephadm: autopep8 changes

Signed-off-by: Adam King <adking@redhat.com>
3 years agomgr/cephadm: make use of new upgrade control parameters
Adam King [Fri, 1 Apr 2022 13:41:01 +0000 (09:41 -0400)]
mgr/cephadm: make use of new upgrade control parameters

Fixes: https://tracker.ceph.com/issues/54135
Signed-off-by: Adam King <adking@redhat.com>