]> git.apps.os.sepia.ceph.com Git - ceph.git/log
ceph.git
3 years agorgw: Add generation to ChangeStatus
Adam C. Emerson [Tue, 8 Feb 2022 18:11:44 +0000 (13:11 -0500)]
rgw: Add generation to ChangeStatus

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
3 years agorgw: Compare log.gen to log.gen
Adam C. Emerson [Mon, 7 Feb 2022 22:00:25 +0000 (17:00 -0500)]
rgw: Compare log.gen to log.gen

And refuse to remove the only log.

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
3 years agorgw: Don't erase bucket attributes on trim
Adam C. Emerson [Wed, 2 Feb 2022 20:53:41 +0000 (15:53 -0500)]
rgw: Don't erase bucket attributes on trim

Writing bucket instance info is surprising, as if you pass a null
pointer for the attributes, it just erases all the attributes.

To avoid disturbing users and other 'system objects', make a special
case that we can pass in explicitly.

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
3 years agorgw/reshard: resolve inconsistent cache warnings
Yuval Lifshitz [Tue, 1 Feb 2022 09:04:06 +0000 (11:04 +0200)]
rgw/reshard: resolve inconsistent cache warnings

use an API that does not check for cache inconsistency
hence, "WARNING: The bucket info cache is inconsistent" warnings is removed from reshard

Signed-off-by: Yuval Lifshitz <ylifshit@redhat.com>
3 years agotest/rgw: test_bucket_reshard verifies that ACLs are preserved
Casey Bodley [Mon, 17 Jan 2022 22:14:54 +0000 (17:14 -0500)]
test/rgw: test_bucket_reshard verifies that ACLs are preserved

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: save bucket instance xattrs when resharding cancelled
J. Eric Ivancich [Fri, 7 Jan 2022 19:43:05 +0000 (14:43 -0500)]
rgw: save bucket instance xattrs when resharding cancelled

There appears to be a long-standing bug in RGW such that when
resharding is cancelled and the bucket instance is updated to reflect
the new resharding status, the xattrs were lost. The xattrs are used
to store metadata such as ACLs and LifeCycle policies.

This commit makes sure that all call paths that lead to a cancelled
reshard provide the xattrs, so they can be included when the bucket
instance info is updated.

Signed-off-by: J. Eric Ivancich <ivancich@redhat.com>
3 years agorgw: resharding causes bucket attributes to be lost
J. Eric Ivancich [Fri, 7 Jan 2022 17:32:48 +0000 (12:32 -0500)]
rgw: resharding causes bucket attributes to be lost

With the new resharding code, some bucket metadata that is stored as
xattrs (e.g., ACLs, life-cycle policies) were not sent with the
updated bucket instance data when resharding completed. As a result,
resharding has a regression where that metadata is lost after a
successful reshard.

This commit restores the variable in the RGWBucketReshard class that
maintains the bucket attributes, so they can be saved when the bucket
instance object is updated.

Signed-off-by: J. Eric Ivancich <ivancich@redhat.com>
3 years agorgw: add indexless bucket logic to "bucket radoslist"
J. Eric Ivancich [Mon, 17 Jan 2022 21:14:16 +0000 (16:14 -0500)]
rgw: add indexless bucket logic to "bucket radoslist"

The "bucket radoslist" sub-command of radosgw-admin is supposed to
list all rados objects tied to one or all directories and thereby
provide a way to determine orphaned rados objects.

But indexless buckets don't provide an index to employ for this
purpose. So warnings or errors should be provided depending on the
circumstances.

Signed-off-by: J. Eric Ivancich <ivancich@redhat.com>
3 years agorgw: update indexless bucket check for bucket stats
J. Eric Ivancich [Mon, 24 Jan 2022 21:10:57 +0000 (16:10 -0500)]
rgw: update indexless bucket check for bucket stats

The code for bucket stats was recently updated to check for an
indexless bucket before proceeding. The interface on RGWBucketInfo was
recently expanded to support these types of checks, so it is now used.

Signed-off-by: J. Eric Ivancich <ivancich@redhat.com>
3 years agorgw: add streamlined ways to handle indexless buckets correctly
J. Eric Ivancich [Mon, 24 Jan 2022 21:08:01 +0000 (16:08 -0500)]
rgw: add streamlined ways to handle indexless buckets correctly

Determining whether a bucket is indexless starting with an
RGWBucketInfo object requires traversing multiple data structures and
"inside knowledge" blurring the line between interface and
implementation. The same applies for retrieving the current index for
non-indexless buckets.

This commit adds to the RGWBucketInfo interface to make this
information readily accessible.

Signed-off-by: J. Eric Ivancich <ivancich@redhat.com>
3 years agorgw/multisite: add type to RGW_OP_SYNC_DATALOG_NOTIFY2
Yuval Lifshitz [Sun, 16 Jan 2022 16:35:23 +0000 (18:35 +0200)]
rgw/multisite: add type to RGW_OP_SYNC_DATALOG_NOTIFY2

without that the following errors are happening during sync:

ERROR: AWS4 completion for operation: 0, NOT IMPLEMENTED
op->ERRORHANDLER: err_no=-2201 new_err_no=-2201

Signed-off-by: Yuval Lifshitz <ylifshit@redhat.com>
3 years agorgw: "bucket check --fix" should delete damaged multipart uploads from bi
J. Eric Ivancich [Tue, 21 Dec 2021 22:27:37 +0000 (17:27 -0500)]
rgw: "bucket check --fix" should delete damaged multipart uploads from bi

As one of the steps in `radosgw-admin bucket check --fix ...` it looks
for bucket index entries for incomplete multipart uploads that do not
have a corresponding ".meta" entry in the same bucket index. It then
intends to delete those entries, however the function that it calls
to perform the bucket index deletions was flawed and did not direct
the removals to the appropriate shard(s), but instead a non-existant
oid.

This commit determines the appropriate shard for each of the entries
to be removed and asynchronously issues "dir suggest changes" to each
of the shards to remove the entries.

Signed-off-by: J. Eric Ivancich <ivancich@redhat.com>
3 years agorgw: `radosgw-admin bucket stats` on indexless bucket crashes
J. Eric Ivancich [Thu, 23 Dec 2021 21:25:26 +0000 (16:25 -0500)]
rgw: `radosgw-admin bucket stats` on indexless bucket crashes

The new bucket layout code didn't check whether the bucket is
indexless prior to asking for the last entry in the layout log. The
layout log appears to be empty for an indexless bucket, thereby
putting the runtime in an undefined state that later may cause a
failed assertion.

This commit adds two safety checks and returns -EINVAL along with
putting useful information on stderr when either stats are requested
on an indexless bucket or when the layout log is empty.

Signed-off-by: J. Eric Ivancich <ivancich@redhat.com>
3 years agorgw: fix reshard cancelling race condition
Yuval Lifshitz [Wed, 8 Dec 2021 19:35:25 +0000 (21:35 +0200)]
rgw: fix reshard cancelling race condition

this is happening when resharding while objects are uploaded
tests steps are here:
https://gist.github.com/yuvalif/060f66f03511bff881e952287df3087b

Signed-off-by: Yuval Lifshitz <ylifshit@redhat.com>
3 years agorgw: preserve 'bucket sync disable' over reshard
Casey Bodley [Wed, 8 Dec 2021 15:11:22 +0000 (10:11 -0500)]
rgw: preserve 'bucket sync disable' over reshard

if bucket sync is disabled, apply that flag to new index objects on
bucket reshard

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw/multisite: handle shard_progress correctly in RunBucketSources
Casey Bodley [Mon, 22 Nov 2021 19:23:01 +0000 (14:23 -0500)]
rgw/multisite: handle shard_progress correctly in RunBucketSources

we run bucket sync on each of the sync pipes, so size the vector
accordingly

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agoRevert "rgw: cr: add prealloc_stack()"
Casey Bodley [Mon, 22 Nov 2021 18:51:10 +0000 (13:51 -0500)]
Revert "rgw: cr: add prealloc_stack()"

This reverts commit 7970f355497f48ee5a18bf3a0bc034226c6d225c.

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agoRevert "rgw: bucket sync: track progress by stack id"
Casey Bodley [Mon, 22 Nov 2021 18:50:07 +0000 (13:50 -0500)]
Revert "rgw: bucket sync: track progress by stack id"

This reverts commit c0baf3eb34c6c1de7e4de2e35cb62e219c174b0b.

Signed-off-by: Casey Bodley <cbodley@redhat.com>
Conflicts:
src/rgw/rgw_data_sync.cc no longer loops over num_shards

3 years agorgw/multisite: RunBucketSourcesSync no longer takes optional target
Casey Bodley [Mon, 22 Nov 2021 18:05:40 +0000 (13:05 -0500)]
rgw/multisite: RunBucketSourcesSync no longer takes optional target

RGWDataSyncSingleEntryCR is the only caller of RGWRunBucketSourcesSyncCR

it always provides a source_bs, and never provides a target_bs. so remove
all the complexity related to target_bs, and the idea that we'd need to
sync several source bucket shards related to the target bucket

we now just have the single loop over the target buckets that use the
given bucket as a source

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agoradosgw-admin: allow reshard commands in multisite on secondary
Casey Bodley [Fri, 15 Oct 2021 14:57:55 +0000 (10:57 -0400)]
radosgw-admin: allow reshard commands in multisite on secondary

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: fix for uninitialized oldest_gen/latest_gen
Casey Bodley [Mon, 18 Oct 2021 18:15:33 +0000 (14:15 -0400)]
rgw: fix for uninitialized oldest_gen/latest_gen

when data sync queries RGWOp_BILog_Info from an un-upgraded gateway, it
doesn't include the oldest_gen/latest_gen fields. so initialize these
variables to 0 by default

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: enable RGWReshard thread on any zone that supports it
Casey Bodley [Tue, 12 Oct 2021 16:45:29 +0000 (12:45 -0400)]
rgw: enable RGWReshard thread on any zone that supports it

enable the background dynamic resharding thread based on
RGWSI_Zone::can_reshard(), which takes the zonegroup features into
account

Fixes: https://tracker.ceph.com/issues/52877
Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: prevent reshard from creating too many log generations
Casey Bodley [Tue, 28 Sep 2021 14:45:27 +0000 (10:45 -0400)]
rgw: prevent reshard from creating too many log generations

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: remove per-shard sync status object after incremental sync finishes
Shilpa Jagannath [Fri, 23 Jul 2021 06:58:24 +0000 (12:28 +0530)]
rgw: remove per-shard sync status object after incremental sync finishes

Signed-off-by: Shilpa Jagannath <smanjara@redhat.com>
Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agoradosgw-admin: bucket sync status guards against shard count mismatch
Casey Bodley [Fri, 20 Aug 2021 14:39:34 +0000 (10:39 -0400)]
radosgw-admin: bucket sync status guards against shard count mismatch

if the remote gives us more shards than we expect, just count those
shards as 'behind' and avoid out-of-bounds access of shard_status

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agoradosgw-admin: bucket sync status handles missing full status
Casey Bodley [Thu, 19 Aug 2021 21:21:37 +0000 (17:21 -0400)]
radosgw-admin: bucket sync status handles missing full status

if the full sync status object is missing, it's possible that we just
haven't started syncing it again after upgrading from just the per-shard
status objects

in this case, as long as we have a log generation 0, assume that we just
haven't initialized the full status object and try to read the gen=0
per-shard incremental status for comparison

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: rgw_read_bucket_inc_sync_status doesn't need bucket info
Casey Bodley [Thu, 19 Aug 2021 20:12:21 +0000 (16:12 -0400)]
rgw: rgw_read_bucket_inc_sync_status doesn't need bucket info

all we need to construct the per-shard bucket sync status object names
are the bucket names themselves, which we already have from
rgw_sync_bucket_pipe

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: resize status vector before reading inc_sync_status
Casey Bodley [Thu, 19 Aug 2021 20:30:49 +0000 (16:30 -0400)]
rgw: resize status vector before reading inc_sync_status

rgw_read_bucket_inc_sync_status() uses the size of this vector as the
'num_shards', so we need to resize it appropriately beforehand

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: RGWOp_BILog_Status reads full status unconditionally
Casey Bodley [Thu, 19 Aug 2021 20:21:55 +0000 (16:21 -0400)]
rgw: RGWOp_BILog_Status reads full status unconditionally

the calls to rgw_read_bucket_inc_sync_status() depend on
sync_status.incremental_gen, which we need to read via
rgw_read_bucket_full_sync_status() regardless of whether
we're returning it to the client (version > 1)

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: RGWCollectBucketSyncStatusCR doesn't need the shard count
Adam C. Emerson [Fri, 10 Sep 2021 16:28:28 +0000 (12:28 -0400)]
rgw: RGWCollectBucketSyncStatusCR doesn't need the shard count

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
3 years agorgw: RunBucketSourceSync uses num_shards from remote bilog info
Adam C. Emerson [Fri, 10 Sep 2021 16:06:53 +0000 (12:06 -0400)]
rgw: RunBucketSourceSync uses num_shards from remote bilog info

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
3 years agorgw: RGWListBucketIndexesCR only needs zero shard
Adam C. Emerson [Fri, 10 Sep 2021 15:38:05 +0000 (11:38 -0400)]
rgw: RGWListBucketIndexesCR only needs zero shard

We only need to check one shard, and everything has shard zero.

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
3 years agorgw: sync checkpoint gets num_shards from remote bilog info
Adam C. Emerson [Fri, 10 Sep 2021 15:00:07 +0000 (11:00 -0400)]
rgw: sync checkpoint gets num_shards from remote bilog info

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
3 years agorgw: RGWRemoteBucketManager constructor takes num_shards
Adam C. Emerson [Thu, 2 Sep 2021 21:58:58 +0000 (17:58 -0400)]
rgw: RGWRemoteBucketManager constructor takes num_shards

The logic for getting it was moved to its caller.

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
3 years agorgw: InitBucketFullSyncStatusCR gets num shards from remote
Adam C. Emerson [Thu, 2 Sep 2021 21:36:09 +0000 (17:36 -0400)]
rgw: InitBucketFullSyncStatusCR gets num shards from remote

As specified in rgw_bucket_index_marker_info, unless we're doing the
compatibility check, in which case we look at generation 0.

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
3 years agorgw: read shard count using remote bilog info during bucket sync
Shilpa Jagannath [Thu, 10 Jun 2021 16:51:21 +0000 (22:21 +0530)]
rgw: read shard count using remote bilog info during bucket sync

Signed-off-by: Shilpa Jagannath <smanjara@redhat.com>
3 years agodoc/rgw: document zone features
Casey Bodley [Mon, 8 Mar 2021 17:48:13 +0000 (12:48 -0500)]
doc/rgw: document zone features

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: RGWSI_Zone::can_reshard() respects zonegroup 'resharding' feature
Casey Bodley [Wed, 3 Mar 2021 21:58:33 +0000 (16:58 -0500)]
rgw: RGWSI_Zone::can_reshard() respects zonegroup 'resharding' feature

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agoradosgw-admin: 'sync status' shows enabled/disabled zonegroup features
Casey Bodley [Wed, 3 Mar 2021 21:55:23 +0000 (16:55 -0500)]
radosgw-admin: 'sync status' shows enabled/disabled zonegroup features

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agoradosgw-admin: zone/zonegroup commands support --enable-feature=x --disable-feature=y
Casey Bodley [Wed, 3 Mar 2021 21:54:58 +0000 (16:54 -0500)]
radosgw-admin: zone/zonegroup commands support --enable-feature=x --disable-feature=y

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: add set of 'features' to zone and zonegroup
Casey Bodley [Wed, 3 Mar 2021 19:13:05 +0000 (14:13 -0500)]
rgw: add set of 'features' to zone and zonegroup

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agojson: encoding for flat_set accepts all template params
Casey Bodley [Wed, 3 Mar 2021 19:05:24 +0000 (14:05 -0500)]
json: encoding for flat_set accepts all template params

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw/multisite: don't delete per shard status on init
Yuval Lifshitz [Wed, 16 Jun 2021 09:32:25 +0000 (12:32 +0300)]
rgw/multisite: don't delete per shard status on init

and pass correct generation and num shards when deleting
per shard status objects when disabling during incremental sync

Signed-off-by: Yuval Lifshitz <ylifshit@redhat.com>
3 years agorgw/multisite: support enable right after disable
Yuval Lifshitz [Mon, 14 Jun 2021 14:03:35 +0000 (17:03 +0300)]
rgw/multisite: support enable right after disable

Signed-off-by: Yuval Lifshitz <ylifshit@redhat.com>
3 years agorgw/multisite: remove the retry mechanism
Yuval Lifshitz [Thu, 27 May 2021 15:54:31 +0000 (18:54 +0300)]
rgw/multisite: remove the retry mechanism

when writign the sync status object

Signed-off-by: Yuval Lifshitz <ylifshit@redhat.com>
3 years agorgw/multisite: allow bucket sync disable/enable
Yuval Lifshitz [Tue, 18 May 2021 15:59:54 +0000 (18:59 +0300)]
rgw/multisite: allow bucket sync disable/enable

Signed-off-by: Yuval Lifshitz <ylifshit@redhat.com>
3 years agorgw/multisite: track shard sync status objects per generation
Yuval Lifshitz [Tue, 25 May 2021 18:11:25 +0000 (21:11 +0300)]
rgw/multisite: track shard sync status objects per generation

Signed-off-by: Yuval Lifshitz <ylifshit@redhat.com>
3 years agorgw: remove destination shard id from rgw_bucket_sync_pair_info
Casey Bodley [Wed, 9 Jun 2021 16:26:54 +0000 (12:26 -0400)]
rgw: remove destination shard id from rgw_bucket_sync_pair_info

the sync_pair is used as input to RGWBucketPipeSyncStatusManager::status_oid()
to generate the per-shard sync status object names

this sync status tracks incremental bucket sync, which reads changes
from a source bucket's bilog shard, and copies objects from the remote
source bucket to the local destination bucket

this doesn't require sync to know anything about the destination bucket
shards, so rgw_bucket_sync_pair_info and status_oid() now only track the
the destination's rgw_bucket instead of rgw_bucket_shard

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: Trim old generations in BucketTrimInstanceCR
Adam C. Emerson [Fri, 14 May 2021 22:59:48 +0000 (18:59 -0400)]
rgw: Trim old generations in BucketTrimInstanceCR

Only one generation per call.

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
3 years agorgw: Add RGWRadosRemoveOidCR
Adam C. Emerson [Tue, 18 May 2021 22:34:43 +0000 (18:34 -0400)]
rgw: Add RGWRadosRemoveOidCR

A more generally applicable way of removing objects in coroutines.

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
3 years agorgw: Add RGWAsyncPutBucketInstanceInfoCR
Adam C. Emerson [Tue, 18 May 2021 21:56:27 +0000 (17:56 -0400)]
rgw: Add RGWAsyncPutBucketInstanceInfoCR

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
3 years agorgw: Trim bilog with generation
Adam C. Emerson [Mon, 26 Apr 2021 22:45:09 +0000 (18:45 -0400)]
rgw: Trim bilog with generation

From the REST interface and radosgw-admin. Assume Generation 0 if none
provided and error if it doesn't exist.

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
3 years agorgw: Bilog trim takes markers as string view
Adam C. Emerson [Tue, 27 Apr 2021 23:31:57 +0000 (19:31 -0400)]
rgw: Bilog trim takes markers as string view

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
3 years agocommon: get_str_vec takes std::string_view
Adam C. Emerson [Tue, 27 Apr 2021 23:31:15 +0000 (19:31 -0400)]
common: get_str_vec takes std::string_view

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
3 years agorgw: BucketInstanceTrimCR knows about generations
Adam C. Emerson [Fri, 14 May 2021 19:44:01 +0000 (15:44 -0400)]
rgw: BucketInstanceTrimCR knows about generations

Fetch the current generation from remote peers and trim the minimum
marker on the minimum generation.

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
3 years agorgw: Add cast from bucket_index_log_layout
Adam C. Emerson [Fri, 14 May 2021 19:26:42 +0000 (15:26 -0400)]
rgw: Add cast from bucket_index_log_layout

To bucket_index_layout_generation

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
3 years agorgw: add sync_info to BILog_Status output
Adam C. Emerson [Thu, 13 May 2021 21:11:52 +0000 (17:11 -0400)]
rgw: add sync_info to BILog_Status output

Needed so we can get the incremental generation.

Guard this behind a version check and return the original output if
less than 2.

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
3 years agorgw: fix incremental sync by using the right generation for bilog listing
Shilpa Jagannath [Wed, 2 Jun 2021 17:57:06 +0000 (23:27 +0530)]
rgw: fix incremental sync by using the right generation for bilog listing

Signed-off-by: Shilpa Jagannath <smanjara@redhat.com>
3 years agorgw: on bucket reshard, write datalog entries for each shard of the previous generation
Shilpa Jagannath [Fri, 21 May 2021 06:47:49 +0000 (12:17 +0530)]
rgw: on bucket reshard, write datalog entries for each shard of the previous generation

Signed-off-by: Shilpa Jagannath <smanjara@redhat.com>
3 years agoqa/rgw: temporarily disable multisite reshard tests
Casey Bodley [Tue, 25 May 2021 18:37:26 +0000 (14:37 -0400)]
qa/rgw: temporarily disable multisite reshard tests

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agotest/rgw: add multisite test for full sync after reshard
Casey Bodley [Fri, 30 Apr 2021 20:22:33 +0000 (16:22 -0400)]
test/rgw: add multisite test for full sync after reshard

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agotest/rgw: add simple multisite reshard test
Casey Bodley [Fri, 30 Apr 2021 19:08:39 +0000 (15:08 -0400)]
test/rgw: add simple multisite reshard test

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agoradosgw-admin: 'bucket sync checkpoint' waits for generation to catch up
Casey Bodley [Tue, 18 May 2021 19:55:28 +0000 (15:55 -0400)]
radosgw-admin: 'bucket sync checkpoint' waits for generation to catch up

poll on rgw_read_bucket_full_sync_status() until
full_status.incremental_gen catches up to the latest_gen we got from
rgw_read_remote_bilog_info()

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: get_bucket_instance_ids() uses num_shards from layout
Casey Bodley [Wed, 19 May 2021 18:03:48 +0000 (14:03 -0400)]
rgw: get_bucket_instance_ids() uses num_shards from layout

knock out a TODO that was causing this assertion failure in
RGWRados::get_bucket_stats() after a reshard:

  ceph_assert(headers.size() == bucket_instance_ids.size());

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: rgw_read_remote_bilog_info() returns rgw_bucket_index_marker_info
Casey Bodley [Thu, 20 May 2021 15:15:26 +0000 (11:15 -0400)]
rgw: rgw_read_remote_bilog_info() returns rgw_bucket_index_marker_info

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: adding generation number to async notification
Shilpa Jagannath [Mon, 15 Feb 2021 14:46:29 +0000 (20:16 +0530)]
rgw: adding generation number to async notification

Signed-off-by: Shilpa Jagannath <smanjara@redhat.com>
3 years agorgw: add custom json encode/decode for the v1 notify API
Casey Bodley [Tue, 23 Feb 2021 19:55:50 +0000 (14:55 -0500)]
rgw: add custom json encode/decode for the v1 notify API

this adds wrapper structs rgw_data_notify_v1_encoder and
rgw_data_notify_v1_decoder that can encode/decode the v1 json format
directly on the v2 data structure

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agotest/rgw: fix python error on test failure
Casey Bodley [Fri, 14 May 2021 17:49:09 +0000 (13:49 -0400)]
test/rgw: fix python error on test failure

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: fix handling of bucket layout in metadata sync
Casey Bodley [Fri, 14 May 2021 16:30:21 +0000 (12:30 -0400)]
rgw: fix handling of bucket layout in metadata sync

clear the bucket layout we get from the metadata master, and overwrite it
with our zone's defaults

without clearing the layout, init_default_bucket_layout() was adding another
log layout in addition to the one from the master. this caused the bilog
list API to provide a 'next_log' when only gen=0 exists

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw/multisite: fix bucket shard state init function
Yuval Lifshitz [Fri, 14 May 2021 09:27:09 +0000 (12:27 +0300)]
rgw/multisite: fix bucket shard state init function

* make sure src/dest shard ids are the same in sync pair
* copy sync pair by value in coroutine loop

Signed-off-by: Yuval Lifshitz <ylifshit@redhat.com>
3 years agorgw: update bucket sync status after bucket shards finishes current gen
Shilpa Jagannath [Mon, 5 Apr 2021 20:15:45 +0000 (01:45 +0530)]
rgw: update bucket sync status after bucket shards finishes current gen

Signed-off-by: Shilpa Jagannath <smanjara@redhat.com>
3 years agorgw: reshard preserves old index in multisite
Casey Bodley [Fri, 26 Mar 2021 15:00:57 +0000 (11:00 -0400)]
rgw: reshard preserves old index in multisite

if the old index is still referenced by an InIndex log layout, we can't
call clean_index() to remove the index objects yet. log trimming will do
that later, once the bilogs are no longer needed

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: BILog_List handles requests for generation=0
Casey Bodley [Thu, 25 Feb 2021 20:39:26 +0000 (15:39 -0500)]
rgw: BILog_List handles requests for generation=0

previous logic treated requests for generation=0 as the latest gen

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: zero-initialize rgw_bucket_sync_status::incremental_gen
Casey Bodley [Tue, 9 Feb 2021 23:40:29 +0000 (18:40 -0500)]
rgw: zero-initialize rgw_bucket_sync_status::incremental_gen

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: handle older/newer generations after reading bucket sync status
Casey Bodley [Tue, 9 Feb 2021 23:00:14 +0000 (18:00 -0500)]
rgw: handle older/newer generations after reading bucket sync status

wait until we've read the bucket sync status and found that we're in
incremental sync before we start using incremental_gen for comparison

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: Handle entries of the wrong generation
Adam C. Emerson [Mon, 14 Dec 2020 05:56:23 +0000 (00:56 -0500)]
rgw: Handle entries of the wrong generation

Drop entries from past generations.

Send entries of future generations to the error repo for retry.

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
3 years agorgw: rgw_data_change can decode v1 format if gen was 0
Casey Bodley [Wed, 10 Feb 2021 00:04:19 +0000 (19:04 -0500)]
rgw: rgw_data_change can decode v1 format if gen was 0

but if gen>0, require decoders to understand the v2 format. this way,
old clients can't decode entries with gen>0, so they won't be able to
serve them to other zones

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: rename rgw_data_change::gen_id
Casey Bodley [Wed, 10 Feb 2021 00:03:43 +0000 (19:03 -0500)]
rgw: rename rgw_data_change::gen_id

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: add gen parameter to RGWDataChangesLog::add_entry
Adam C. Emerson [Mon, 14 Dec 2020 02:13:44 +0000 (21:13 -0500)]
rgw: add gen parameter to RGWDataChangesLog::add_entry

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
3 years agorgw: Add gen_id to rgw_data_change
Adam C. Emerson [Mon, 14 Dec 2020 01:30:52 +0000 (20:30 -0500)]
rgw: Add gen_id to rgw_data_change

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
3 years agorgw: vector<rgw_data_change_log_entry> not list
Adam C. Emerson [Sun, 13 Dec 2020 23:54:28 +0000 (18:54 -0500)]
rgw: vector<rgw_data_change_log_entry> not list

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
3 years agorgw: add json encoding of bucket layout types
Casey Bodley [Mon, 1 Feb 2021 19:39:39 +0000 (14:39 -0500)]
rgw: add json encoding of bucket layout types

adds a "layout" section to RGWBucketInfo

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agojson: add encode_json() overload for string_view
Casey Bodley [Mon, 1 Feb 2021 19:34:54 +0000 (14:34 -0500)]
json: add encode_json() overload for string_view

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: refactor per-entry reshard logic into separate function
Casey Bodley [Tue, 2 Feb 2021 17:51:14 +0000 (12:51 -0500)]
rgw: refactor per-entry reshard logic into separate function

this cuts down on nesting and avoids the need for goto

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: reshard adds a log layout for the new index
Casey Bodley [Mon, 1 Feb 2021 17:04:36 +0000 (12:04 -0500)]
rgw: reshard adds a log layout for the new index

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agoradosgw-admin: try reshard even if bucket is resharding
Casey Bodley [Mon, 1 Feb 2021 17:02:44 +0000 (12:02 -0500)]
radosgw-admin: try reshard even if bucket is resharding

allow reshard in case a previous reshard failed. if the reshard is
actually still in progress, we'll fail to get the reshard lock

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agotest: fix threading for FaultInjector death tests
Casey Bodley [Tue, 19 Jan 2021 14:27:13 +0000 (09:27 -0500)]
test: fix threading for FaultInjector death tests

addresses test timeout and warning message:

[WARNING] /home/jenkins-build/build/workspace/ceph-pull-requests/src/googletest/googletest/src/gtest-death-test.cc:1121:: Death tests use fork(), which is unsafe particularly in a threaded context. For this test, Google Test detected 3 threads. See https://github.com/google/googletest/blob/master/googletest/docs/advanced.md#death-tests-and-threads for more explanation and suggested solutions, especially if this is the last message you see before your test times out.

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agoradosgw-admin: remove fault injection options from usage
Casey Bodley [Fri, 18 Dec 2020 21:01:15 +0000 (16:01 -0500)]
radosgw-admin: remove fault injection options from usage

these are only used for testing, not administration

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: RGWBucketReshard doesn't need a friend
Casey Bodley [Fri, 18 Dec 2020 16:19:55 +0000 (11:19 -0500)]
rgw: RGWBucketReshard doesn't need a friend

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: fix decode of cls_rgw reshard types
Casey Bodley [Fri, 18 Dec 2020 15:46:49 +0000 (10:46 -0500)]
rgw: fix decode of cls_rgw reshard types

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: BucketReshardManager takes layouts
Casey Bodley [Fri, 18 Dec 2020 15:23:55 +0000 (10:23 -0500)]
rgw: BucketReshardManager takes layouts

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: clean up uses of BucketShard::init() without info
Casey Bodley [Fri, 18 Dec 2020 15:22:23 +0000 (10:22 -0500)]
rgw: clean up uses of BucketShard::init() without info

the rgw_bucket overload of BucketShard::init() has to look up the bucket
info. use the RGWBucketInfo overload when we have one

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agoqa/rgw: disable coredumps for reshard fault injection
Casey Bodley [Thu, 17 Dec 2020 17:19:12 +0000 (12:19 -0500)]
qa/rgw: disable coredumps for reshard fault injection

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agotest/rgw: add test_bucket_reshard() for fault injection testing
Casey Bodley [Wed, 16 Dec 2020 23:17:29 +0000 (18:17 -0500)]
test/rgw: add test_bucket_reshard() for fault injection testing

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agotest/rgw: test_rgw_reshard.py exec_cmd() can return error code
Casey Bodley [Wed, 16 Dec 2020 23:16:41 +0000 (18:16 -0500)]
test/rgw: test_rgw_reshard.py exec_cmd() can return error code

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agoradosgw-admin: 'bucket reshard' returns positive error codes
Casey Bodley [Wed, 16 Dec 2020 23:13:43 +0000 (18:13 -0500)]
radosgw-admin: 'bucket reshard' returns positive error codes

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: reshard first cleans up old-style reshards
Casey Bodley [Wed, 16 Dec 2020 18:57:28 +0000 (13:57 -0500)]
rgw: reshard first cleans up old-style reshards

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: refactor reshard init/cleanup with fault injection
Casey Bodley [Tue, 15 Dec 2020 18:57:31 +0000 (13:57 -0500)]
rgw: refactor reshard init/cleanup with fault injection

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: add typedef for ReshardFaultInjector
Casey Bodley [Tue, 15 Dec 2020 17:55:03 +0000 (12:55 -0500)]
rgw: add typedef for ReshardFaultInjector

Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agorgw: BucketReshardManager stores BucketReshardShards by value
Casey Bodley [Tue, 15 Dec 2020 17:07:14 +0000 (12:07 -0500)]
rgw: BucketReshardManager stores BucketReshardShards by value

Signed-off-by: Casey Bodley <cbodley@redhat.com>