git.apps.os.sepia.ceph.com Git - ceph.git/log

]> git.apps.os.sepia.ceph.com Git - ceph.git/log

projects / ceph.git / log

summary | shortlog | log | commit | commitdiff | tree
first ⋅ prev ⋅ next

commit | commitdiff | tree

Sridhar Seshasayee [Sat, 29 Apr 2023 04:48:11 +0000 (10:18 +0530)]

qa/: Override mClock profile to 'high_recovery_ops' for qa tests

The qa tests are not client I/O centric and mostly focus on triggering
recovery/backfills and monitor them for completion within a finite amount
of time. The same holds true for scrub operations.

Therefore, an mClock profile that optimizes background operations is a
better fit for qa related tests. The osd_mclock_profile is therefore
globally overriden to 'high_recovery_ops' profile for the Rados suite as
it fits the requirement.

Also, many standalone tests expect recovery and scrub operations to
complete within a finite time. To ensure this, the osd_mclock_profile
options is set to 'high_recovery_ops' as part of the run_osd() function
in ceph-helpers.sh.

A subset of standalone tests explicitly used 'high_recovery_ops' profile.
Since the profile is now set as part of run_osd(), the earlier overrides
are redundant and therefore removed from the tests.

Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>

commit | commitdiff | tree

Sridhar Seshasayee [Tue, 11 Apr 2023 17:57:05 +0000 (23:27 +0530)]

doc/: Modify mClock configuration documentation to reflect profile changes

Modify the relevant documentation to reflect:

- change in the default mClock profile to 'balanced'
- new allocations for ops across mClock profiles
- change in the osd_max_backfills limit
- miscellaneous changes related to warnings.

Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>

commit | commitdiff | tree

Sridhar Seshasayee [Tue, 11 Apr 2023 16:47:53 +0000 (22:17 +0530)]

common/options/osd.yaml.in: Change mclock max sequential bandwidth for SSDs

The osd_mclock_max_sequential_bandwidth_ssd is changed to 1200 MiB/s as
a reasonable middle ground considering the broad range of SSD capabilities.
This allows the mClock's cost model to extract the SSDs capability
depending on the cost of the IO being performed.

Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>

commit | commitdiff | tree

Sridhar Seshasayee [Tue, 11 Apr 2023 16:30:11 +0000 (22:00 +0530)]

osd/: Retain the default osd_max_backfills limit to 1 for mClock

The earlier limit of 3 was still aggressive enough to have an impact on
the client and other competing operations. Retain the current default
for mClock. This can be modified if necessary after setting the
osd_mclock_override_recovery_settings option.

Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>

commit | commitdiff | tree

Samuel Just [Tue, 11 Apr 2023 15:15:38 +0000 (08:15 -0700)]

common/options/osd.yaml.in: change mclock profile default to balanced

Let's use the middle profile as the default.
Modify the standalone tests accordingly.

Signed-off-by: Samuel Just <sjust@redhat.com>
Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>

commit | commitdiff | tree

Samuel Just [Tue, 11 Apr 2023 15:10:04 +0000 (08:10 -0700)]

osd/scheduler/mClockScheduler: avoid limits for recovery

Now that recovery operations are split between background_recovery and
background_best_effort, rebalance qos params to avoid penalizing
background_recovery while idle.

Signed-off-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Samuel Just [Mon, 10 Apr 2023 21:18:49 +0000 (14:18 -0700)]

osd/: add counters for ops delayed due to degraded|unreadable target

Signed-off-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Samuel Just [Thu, 6 Apr 2023 21:15:02 +0000 (14:15 -0700)]

osd/: add counters for queue latency for PGRecovery[Context]

Signed-off-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Samuel Just [Thu, 6 Apr 2023 20:50:48 +0000 (20:50 +0000)]

osd/: add per-op latency averages for each recovery related message

Signed-off-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Samuel Just [Thu, 6 Apr 2023 07:04:05 +0000 (00:04 -0700)]

osd/: differentiate priority for PGRecovery[Context]

PGs with degraded objects should be higher priority.

Signed-off-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Samuel Just [Thu, 6 Apr 2023 05:57:48 +0000 (22:57 -0700)]

osd/: add MSG_OSD_PG_(BACKFILL|BACKFILL_REMOVE|SCAN) as recovery messages

Otherwise, these end up as PGOpItem and therefore as immediate:

class PGOpItem : public PGOpQueueable {
...
  op_scheduler_class get_scheduler_class() const final {
    auto type = op->get_req()->get_type();
    if (type == CEPH_MSG_OSD_OP ||
  type == CEPH_MSG_OSD_BACKOFF) {
      return op_scheduler_class::client;
    } else {
      return op_scheduler_class::immediate;
    }
  }
...
};

This was probably causing a bunch of extra interference with client
ops.

Signed-off-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Samuel Just [Thu, 6 Apr 2023 05:57:42 +0000 (22:57 -0700)]

osd/: differentiate scheduler class for undersized/degraded vs data movement

Recovery operations on pgs/objects that have fewer than the configured
number of copies should be treated more urgently than operations on
pgs/objects that simply need to be moved to a new location.

Signed-off-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Samuel Just [Thu, 6 Apr 2023 04:30:18 +0000 (04:30 +0000)]

osd/.../OpSchedulerItem: add MSG_OSD_PG_PULL to is_recovery_msg

Signed-off-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Samuel Just [Thu, 6 Apr 2023 04:23:23 +0000 (04:23 +0000)]

osd/: move PGRecoveryMsg check from osd into PGRecoveryMsg::is_recovery_msg

Signed-off-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Samuel Just [Thu, 6 Apr 2023 03:45:19 +0000 (03:45 +0000)]

osd/: move get_recovery_op_priority into PeeringState next to get_*_priority

Consolidate methods governing recovery scheduling in PeeringState.

Signed-off-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Samuel Just [Tue, 4 Apr 2023 23:34:17 +0000 (23:34 +0000)]

osd/scheduler: simplify qos specific params in OpSchedulerItem

is_qos_item() was only used in operator<< for OpSchedulerItem. However,
it's actually useful to see priority for mclock items since it affects
whether it goes into the immediate queues and, for some types, the
class. Unconditionally display both class_id and priority.

Signed-off-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Samuel Just [Tue, 4 Apr 2023 23:22:59 +0000 (23:22 +0000)]

osd/scheduler: remove unused PGOpItem::maybe_get_mosd_op

Signed-off-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Samuel Just [Tue, 4 Apr 2023 23:13:41 +0000 (23:13 +0000)]

osd/scheduler: remove OpQueueable::get_order_locker() and supporting machinery

Apparently unused.

Signed-off-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Samuel Just [Tue, 4 Apr 2023 23:05:56 +0000 (23:05 +0000)]

osd/scheduler: remove OpQueueable::get_op_type() and supporting machinery

Apparently unused.

Signed-off-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Samuel Just [Mon, 3 Apr 2023 20:31:46 +0000 (13:31 -0700)]

PeeringState::clamp_recovery_priority: use std::clamp

Signed-off-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Sridhar Seshasayee [Sat, 25 Mar 2023 07:14:40 +0000 (12:44 +0530)]

doc: Modify mClock configuration documentation to reflect new cost model

Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>

commit | commitdiff | tree

Sridhar Seshasayee [Tue, 21 Feb 2023 12:24:36 +0000 (17:54 +0530)]

osd: Retain overridden mClock recovery settings across osd restarts

Fix an issue where an overridden mClock recovery setting (set prior to
an osd restart) could be lost after an osd restart.

For e.g., consider that prior to an osd restart, the option
'osd_max_backfill' was successfully set to a value different from the
mClock default. If the osd was restarted for some reason, the
boot-up sequence was incorrectly resetting the backfill value to the
mclock default within the async local/remote reservers. This fix
ensures that no change is made if the current overriden value is
different from the mClock default.

Modify an existing standalone test to verify that the local and remote
async reservers are updated to the desired number of backfills under
normal conditions and also across osd restarts.

Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>

commit | commitdiff | tree

Sridhar Seshasayee [Mon, 20 Mar 2023 12:29:17 +0000 (17:59 +0530)]

osd: Set default max active recovery and backfill limits for mClock

Client ops are sensitive to the recovery load and must be carefully
set for osds whose underlying device is HDD. Tests revealed that
recoveries with osd_max_backfills = 10 and osd_recovery_max_active_hdd = 5
were still aggressive and overwhelmed client ops. The built-in defaults
for mClock are now set to:

    1) osd_recovery_max_active_hdd = 3
    2) osd_recovery_max_active_ssd = 10
    3) osd_max_backfills = 3

The above may be modified if necessary by setting
osd_mclock_override_recovery_settings option.

Fixes: https://tracker.ceph.com/issues/58529
Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>

commit | commitdiff | tree

Samuel Just [Wed, 29 Mar 2023 06:29:58 +0000 (23:29 -0700)]

osd/scheduler/mClockScheduler: make is_rotational const

Signed-off-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Samuel Just [Wed, 29 Mar 2023 07:10:57 +0000 (00:10 -0700)]

osd/scheduler/mClockScheduler: simplify profile handling

Previously, setting default configs from the configured profile was
split across:
- enable_mclock_profile_settings
- set_mclock_profile - sets mclock_profile class member
- set_*_allocations - updates client_allocs class member
- set_profile_config - sets profile based on client_allocs class member

This made tracing the effect of changing the profile pretty challenging
due passing state through class member variables.

Instead, define a simple profile_t with three constexpr values
corresponding to the three profiles and handle it all in a single
set_config_defaults_from_profile() method.

Signed-off-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Sridhar Seshasayee [Thu, 9 Feb 2023 15:17:44 +0000 (20:47 +0530)]

osd: Modify mClock scheduler's cost model to represent cost in bytes

The mClock scheduler's cost model for HDDs/SSDs is modified and now
represents the cost of an IO in terms of bytes.

The cost parameters, namely, osd_mclock_cost_per_io_usec_[hdd|ssd]
and osd_mclock_cost_per_byte_usec_[hdd|ssd] which represent the cost
of an IO in secs are inaccurate and therefore removed.

The new model considers the following aspects of an osd to calculate
the cost of an IO:

- osd_mclock_max_capacity_iops_[hdd|ssd] (existing option)
   The measured random write IOPS at 4 KiB block size. This is
   measured during OSD boot-up using OSD bench tool.
- osd_mclock_max_sequential_bandwidth_[hdd|ssd] (new config option)
   The maximum sequential bandwidth of of the underlying device.
   For HDDs, 150 MiB/s is considered, and for SSDs 750 MiB/s is
   considered in the cost calculation.

The following important changes are made to arrive at the overall
cost of an IO,

1. Represent QoS reservation and limit config parameter as proportion:
The reservation and limit parameters are now set in terms of a
proportion of the OSD's max IOPS capacity. The earlier representation
was in terms of IOPS per OSD shard which required the user to perform
calculations before setting the parameter. Representing the
reservation and limit in terms of proportions is much more intuitive
and simpler for a user.

2. Cost per IO Calculation:
Using the above config options, osd_bandwidth_cost_per_io for the osd is
calculated and set. It is the ratio of the max sequential bandwidth and
the max random write iops of the osd. It is a constant and represents the
base cost of an IO in terms of bytes. This is added to the actual size of
the IO(in bytes) to represent the overall cost of the IO operation.See
mClockScheduler::calc_scaled_cost().

3. Cost calculation in Bytes:
The settings for reservation and limit in terms a fraction of the OSD's
maximum IOPS capacity is converted to Bytes/sec before updating the
mClock server's ClientInfo structure. This is done for each OSD op shard
using osd_bandwidth_capacity_per_shard shown below:

    (res|lim)  = (IOPS proportion) * osd_bandwidth_capacity_per_shard
    (Bytes/sec)   (unitless)             (bytes/sec)

The above result is updated within the mClock server's ClientInfo
structure for different op_scheduler_class operations. See
mClockScheduler::ClientRegistry::update_from_config().

The overall cost of an IO operation (in secs) is finally determined
during the tag calculations performed in the mClock server. See
crimson::dmclock::RequestTag::tag_calc() for more details.

4. Profile Allocations:
Optimize mClock profile allocations due to the change in the cost model
and lower recovery cost.

5. Modify standalone tests to reflect the change in the QoS config
parameter representation of reservation and limit options.

Fixes: https://tracker.ceph.com/issues/58529
Fixes: https://tracker.ceph.com/issues/59080
Signed-off-by: Samuel Just <sjust@redhat.com>
Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>

commit | commitdiff | tree

Sridhar Seshasayee [Thu, 2 Feb 2023 10:00:26 +0000 (15:30 +0530)]

osd: update PGRecovery queue item cost to reflect object size

Previously, we used a static value of osd_recovery_cost (20M
by default) for PGRecovery. For pools with relatively small
objects, this causes mclock to backfill very very slowly as
20M massively overestimates the amount of IO each recovery
queue operation requires. Instead, add a cost_per_object
parameter to OSDService::awaiting_throttle and set it to the
average object size in the PG being queued.

Fixes: https://tracker.ceph.com/issues/58606
Signed-off-by: Samuel Just <sjust@redhat.com>
Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>

commit | commitdiff | tree

Sridhar Seshasayee [Thu, 2 Feb 2023 08:12:39 +0000 (13:42 +0530)]

osd: update OSDService::queue_recovery_context to specify cost

Previously, we always queued this with cost osd_recovery_cost which
defaults to 20M. With mclock, this caused these items to be delayed
heavily. Instead, base the cost on the operation queued.

Fixes: https://tracker.ceph.com/issues/58606
Signed-off-by: Samuel Just <sjust@redhat.com>
Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>

commit | commitdiff | tree

Sridhar Seshasayee [Fri, 3 Feb 2023 05:36:06 +0000 (11:06 +0530)]

osd/osd_types: use appropriate cost value for PullOp

See included comments -- previous values did not account for object
size. This causes problems for mclock which is much more strict
in how it interprets costs.

Fixes: https://tracker.ceph.com/issues/58607
Signed-off-by: Samuel Just <sjust@redhat.com>
Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>

commit | commitdiff | tree

Sridhar Seshasayee [Wed, 25 Jan 2023 08:19:59 +0000 (13:49 +0530)]

osd/osd_types: use appropriate cost value for PushReplyOp

See included comments -- previous values did not account for object
size. This causes problems for mclock which is much more strict
in how it interprets costs.

Fixes: https://tracker.ceph.com/issues/58529
Signed-off-by: Samuel Just <sjust@redhat.com>
Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>

commit | commitdiff | tree

zdover23 [Thu, 27 Apr 2023 00:44:25 +0000 (10:44 +1000)]

Merge pull request #51239 from zdover23/wip-doc-2023-04-27-backport-51154-to-reef

reef: doc/rados/ops: edit user-management.rst (3 of x)

Reviewed-by: Cole Mitchell <cole.mitchell@gmail.com>

commit | commitdiff | tree

Zac Dover [Thu, 20 Apr 2023 08:25:00 +0000 (10:25 +0200)]

doc/rados/ops: edit user-management.rst (3 of x)

Line-edit doc/rados/user-management.rst (3 of x).

https://tracker.ceph.com/issues/58485

Follows https://github.com/ceph/ceph/pull/51140.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 97b751ed8f8917f9d4d9cbca03f224e6518836ef)

commit | commitdiff | tree

zdover23 [Thu, 27 Apr 2023 00:09:09 +0000 (10:09 +1000)]

Merge pull request #51155 from zdover23/wip-doc-2023-04-20-backport-51140-to-reef

reef: doc/rados: edit user-management (2 of x)

Reviewed-by: Cole Mitchell <cole.mitchell@gmail.com>

commit | commitdiff | tree

Anthony D'Atri [Wed, 26 Apr 2023 22:25:55 +0000 (18:25 -0400)]

Merge pull request #51235 from zdover23/wip-doc-2023-04-27-backport-51204-to-reef

reef: doc/cephfs: explain cephfs data and metadata set

commit | commitdiff | tree

Zac Dover [Tue, 25 Apr 2023 07:46:53 +0000 (17:46 +1000)]

doc/cephfs: explain cephfs data and metadata set

Explain how to set application metadata for the CephFS data pool and the
CephFS metadata pool.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 9152f9700420f9735533f276559af87dff97bd75)

commit | commitdiff | tree

Casey Bodley [Wed, 26 Apr 2023 15:18:00 +0000 (11:18 -0400)]

Merge pull request #51012 from cbodley/wip-59358

reef: rgw/keystone: use secret key from EC2 for sigv4 streaming mode

Reviewed-by: Shilpa Jagannath <smanjara@redhat.com>

commit | commitdiff | tree

Anthony D'Atri [Wed, 26 Apr 2023 00:21:53 +0000 (20:21 -0400)]

Merge pull request #51220 from zdover23/wip-doc-2023-04-26-backport-51193-to-reef

reef: doc/start: rewrite intro paragraph

commit | commitdiff | tree

Zac Dover [Mon, 24 Apr 2023 11:02:16 +0000 (13:02 +0200)]

doc/start: rewrite intro paragraph

Rewrite the first paragraph in doc/start/intro.rst.

Signed-off-by: Zac Dover <zac.dover@proton.me>
Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit bea01d5f1469030253a3403dbb9e2c9fa97806ac)

commit | commitdiff | tree

Casey Bodley [Tue, 25 Apr 2023 18:09:41 +0000 (14:09 -0400)]

Merge pull request #51022 from cbodley/wip-59151

reef: rgw: install rgw scripts with common files rather than radosgw files

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>

commit | commitdiff | tree

Casey Bodley [Tue, 25 Apr 2023 15:35:26 +0000 (11:35 -0400)]

Merge pull request #51019 from cbodley/wip-59273

reef: rgw/admin: 'data sync status' formats binary error repo entries

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>

commit | commitdiff | tree

Casey Bodley [Tue, 25 Apr 2023 15:35:01 +0000 (11:35 -0400)]

Merge pull request #51024 from cbodley/wip-59133

reef: rgw/s3: DeleteObjects response uses correct delete_marker flag

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>

commit | commitdiff | tree

Casey Bodley [Tue, 25 Apr 2023 15:11:45 +0000 (11:11 -0400)]

Merge pull request #51015 from cbodley/wip-59292

reef: qa/rgw: add rgw/upgrade suite

Reviewed-by: Ali Maredia <amaredia@redhat.com>

commit | commitdiff | tree

Casey Bodley [Tue, 25 Apr 2023 15:02:48 +0000 (11:02 -0400)]

Merge pull request #51014 from cbodley/wip-59280

reef: rgw: set init_check_compat when bucket sync status doesn't exist

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>

commit | commitdiff | tree

Casey Bodley [Tue, 25 Apr 2023 15:01:40 +0000 (11:01 -0400)]

Merge pull request #51020 from cbodley/wip-59275

reef: rgw/sts: Fixes get_cert_url improper url path concatenation

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>

commit | commitdiff | tree

Casey Bodley [Tue, 25 Apr 2023 14:10:17 +0000 (10:10 -0400)]

Merge pull request #51145 from cbodley/wip-59493

reef: cmake/rgw: librgw tests depend on ALLOC_LIBS

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>

commit | commitdiff | tree

Casey Bodley [Tue, 25 Apr 2023 14:09:49 +0000 (10:09 -0400)]

Merge pull request #51013 from cbodley/wip-59278

reef: rgw: fix CopyObj crash after admin override

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>

commit | commitdiff | tree

Casey Bodley [Tue, 25 Apr 2023 14:09:17 +0000 (10:09 -0400)]

Merge pull request #51017 from cbodley/wip-59360

reef: rgw: fix rgw cache invalidation after unregister_watch() error

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>

commit | commitdiff | tree

Casey Bodley [Tue, 25 Apr 2023 14:09:08 +0000 (10:09 -0400)]

Merge pull request #51018 from cbodley/wip-59377

reef: rgw/civetweb: handle old clients with transfer-encoding: chunked.

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>

commit | commitdiff | tree

Casey Bodley [Tue, 25 Apr 2023 14:08:33 +0000 (10:08 -0400)]

Merge pull request #51021 from cbodley/wip-59356

reef: rgw/sse-s3: fix bucket encryption of multipart upload

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>

commit | commitdiff | tree

Casey Bodley [Tue, 25 Apr 2023 14:08:02 +0000 (10:08 -0400)]

Merge pull request #51023 from cbodley/wip-59232

reef: rgw/notifications: support bucket notification with bucket policy

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>
Reviewed-by: Yuval Lifshitz <ylifshit@redhat.com>

commit | commitdiff | tree

Casey Bodley [Tue, 25 Apr 2023 14:07:43 +0000 (10:07 -0400)]

Merge pull request #51025 from cbodley/wip-59145

reef: rgw: Do not duplicate query-string in ops-log

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>

commit | commitdiff | tree

Casey Bodley [Tue, 25 Apr 2023 14:07:31 +0000 (10:07 -0400)]

Merge pull request #51026 from cbodley/wip-59028

reef: rgw: use unique_ptr for flat_map emplace in BucketTrimWatcher

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>
Reviewed-by: Yuval Lifshitz <ylifshit@redhat.com>

commit | commitdiff | tree

Casey Bodley [Tue, 25 Apr 2023 14:07:18 +0000 (10:07 -0400)]

Merge pull request #51027 from cbodley/wip-59013

reef: rgw/notifications: fetch object state to get size, in rgw_lc.cc

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>
Reviewed-by: Yuval Lifshitz <ylifshit@redhat.com>

commit | commitdiff | tree

Casey Bodley [Tue, 25 Apr 2023 14:06:46 +0000 (10:06 -0400)]

Merge pull request #51028 from cbodley/wip-59220

reef: qa/rgw: unpin centos for verify suite

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Tue, 25 Apr 2023 08:05:09 +0000 (10:05 +0200)]

Merge pull request #50880 from guits/wip-59310-reef

reef: ceph-volume: fix issue with fast device allocs when there are multiple PVs per VG

commit | commitdiff | tree

Anthony D'Atri [Sun, 23 Apr 2023 21:19:18 +0000 (23:19 +0200)]

Merge pull request #51181 from zdover23/wip-doc-2023-04-23-backport-51177-to-reef

reef: doc/start: edit first 150 lines of documenting-ceph

commit | commitdiff | tree

Anthony D'Atri [Sun, 23 Apr 2023 21:15:41 +0000 (23:15 +0200)]

Merge pull request #51184 from zdover23/wip-doc-2023-04-23-backport-51178-to-reef

reef: doc/glossary: add "Placement Groups" definition

commit | commitdiff | tree

Zac Dover [Sat, 22 Apr 2023 08:55:38 +0000 (10:55 +0200)]

doc/glossary: add "Placement Groups" definition

Add a definition of "Placement Groups" to the Glossary.

Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 9f37ea651f9ee2c51e0705b9b58ed356f1bc56e6)

commit | commitdiff | tree

Zac Dover [Sat, 22 Apr 2023 07:03:12 +0000 (09:03 +0200)]

doc/start: edit first 50 lines of documenting-ceph

Edit the first 150 lines of doc/start/documenting-ceph.rst. This is part
of an initiative to harvest the fruits of Cephalocon 2023, at which
documentation proved to be in demand to a surprising degree.

Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit dd37f94aa4f1de947b1eaf5d82cc529925f5823e)

commit | commitdiff | tree

Nizamudeen A [Thu, 20 Apr 2023 17:39:25 +0000 (23:09 +0530)]

Merge pull request #51151 from rhcs-dashboard/wip-59468-reef

reef: mgr/dashboard: skip Create OSDs step in Cluster expansion

Reviewed-by: Pegonzal <NOT@FOUND>

commit | commitdiff | tree

Zac Dover [Tue, 18 Apr 2023 20:59:09 +0000 (22:59 +0200)]

doc/rados: edit user-management (2 of x)

Line-edit doc/rados/user-management.rst (2 of x). Some internal
references had to be removed, but these will be repaired when the next
part of this file is updated in a future PR.

Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit e3575bb72f307a27d49fedf3692ca661e3d613a5)

commit | commitdiff | tree

Nizamudeen A [Fri, 14 Apr 2023 19:33:11 +0000 (01:03 +0530)]

mgr/dashboard: skip Create OSDs step in Cluster expansion

Its to ensure OSDs are not deployed on all hosts because that would make
the host draining impossible

Fixes: https://tracker.ceph.com/issues/59457
Signed-off-by: Nizamudeen A <nia@redhat.com>
(cherry picked from commit 0f6d23a7aa024495a79cdedc80f7a00902115b6b)

commit | commitdiff | tree

Casey Bodley [Thu, 13 Apr 2023 16:26:44 +0000 (09:26 -0700)]

cmake/rgw: librgw tests depend on ALLOC_LIBS

somehow this stops tcmalloc from crashing on ubuntu 20.04

Fixes: https://tracker.ceph.com/issues/59269
Signed-off-by: Casey Bodley <cbodley@redhat.com>
(cherry picked from commit d5f97e6543906d5be898b24a1c10269ace309c76)

commit | commitdiff | tree

zdover23 [Tue, 18 Apr 2023 14:07:39 +0000 (16:07 +0200)]

Merge pull request #51125 from zdover23/wip-doc-2023-04-17-backport-50639-to-reef

reef: doc: account for PG autoscaling being the default

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

commit | commitdiff | tree

Nizamudeen A [Tue, 18 Apr 2023 13:32:03 +0000 (19:02 +0530)]

Merge pull request #51121 from rhcs-dashboard/wip-59465-reef

reef: mgr/dashboard: remove unncessary hyperlink in landing page

Reviewed-by: Pegonzal <NOT@FOUND>

commit | commitdiff | tree

zdover23 [Tue, 18 Apr 2023 09:48:34 +0000 (11:48 +0200)]

Merge pull request #51123 from zdover23/wip-doc-2023-04-17-backport-49762-to-reef

reef: vstart: fix text format

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

commit | commitdiff | tree

Conrad Hoffmann [Wed, 22 Mar 2023 22:03:57 +0000 (23:03 +0100)]

doc: account for PG autoscaling being the default

The current documentation tries really hard to convince people to set
both `osd_pool_default_pg_num` and `osd_pool_default_pgp_num` in their
configs, but at least the latter has undesirable side effects on any
Ceph version that has PG autoscaling enabled by default (at least quincy
and beyond).

Assume a cluster with defaults of `64` for `pg_num` and `pgp_num`.
Starting `radosgw` will fail as it tries to create various pools without
providing values for `pg_num` or `pgp_num`. This triggers the following
in `OSDMonitor::prepare_new_pool()`:

- `pg_num` is set to `1`, because autoscaling is enabled
- `pgp_num` is set to `osd pool default pgp_num`, which we set to `64`
- This is an invalid setup, so the pool creation fails

Likewise, `ceph osd pool create mypool` (without providing values for
`pg_num` or `pgp_num`) does not work.

Following this rationale:

- Not providing a default value for `pgp_num` will always do the right
  thing, unless you use advanced features, in which case you can be
  expected to set both values on pool creation
- Setting `osd_pool_default_pgp_num` in your config breaks pool creation
  for various cases

This commit:

- Removes `osd_pool_default_pgp_num` from all example configs
- Adds mentions of the autoscaling and how it interacts with the default
  values in various places

For each file that was touched, the following maintenance was also
performed:

- Change interternal spaces to underscores for config values
- Remove mentions of filestore or any of its settings
- Fix minor inconsistencies, like indentation etc.

There is also a ticket which I think is very relevant and fixed by this,
though it only captures part of the broader issue addressed here:

Fixes: https://tracker.ceph.com/issues/47176
Signed-off-by: Conrad Hoffmann <ch@bitfehler.net>
(cherry picked from commit 402d2eacbc67f7a6d47d8f90d9ed757fc20931a6)

commit | commitdiff | tree

Rongqi Sun [Tue, 17 Jan 2023 05:55:01 +0000 (13:55 +0800)]

vstart: fix text format

Signed-off-by: Rongqi Sun <sunrongqi@huawei.com>
(cherry picked from commit 57dc8ce51602543d42a2f82cb829eda1c231b434)

commit | commitdiff | tree

Nizamudeen A [Mon, 17 Apr 2023 09:38:06 +0000 (15:08 +0530)]

mgr/dashboard: remove unncessary hyperlink in landing page

Fixes: https://tracker.ceph.com/issues/59462
Signed-off-by: Nizamudeen A <nia@redhat.com>
(cherry picked from commit 7e7da955445ecf37bc43fc296298bd91d0d8a140)

commit | commitdiff | tree

Nizamudeen A [Mon, 17 Apr 2023 13:38:33 +0000 (19:08 +0530)]

Merge pull request #51080 from rhcs-dashboard/wip-59452-reef

reef: mgr/dashboard: fix cephadm e2e expression changed error

Reviewed-by: Aashish Sharma <aasharma@redhat.com>

commit | commitdiff | tree

colemitchell [Mon, 17 Apr 2023 10:43:33 +0000 (12:43 +0200)]

Merge pull request #51116 from zdover23/wip-doc-2023-04-17-backport-51114-to-reef

reef: doc/radosgw: format part of s3select

Reviewed-by: Cole Mitchell <cole.mitchell.ceph@gmail.com>

commit | commitdiff | tree

Cole Mitchell [Mon, 17 Apr 2023 09:34:49 +0000 (05:34 -0400)]

doc/radosgw: format part of s3select

Partially format the 'Basic Workflow' section's introduction and 'Basic Functionalities' subsection in s3select. Nothing else is being fixed.

Signed-off-by: Cole Mitchell <cole.mitchell.ceph@gmail.com>
(cherry picked from commit 13cf134c0610509da52aa68e11e26f0740002bde)

commit | commitdiff | tree

Anthony D'Atri [Sun, 16 Apr 2023 20:25:39 +0000 (22:25 +0200)]

Merge pull request #51110 from zdover23/wip-doc-2023-04-16-backport-50941-to-reef

reef: doc/foundation: Update Foundation members for April 2023

commit | commitdiff | tree

zdover23 [Sun, 16 Apr 2023 16:24:30 +0000 (18:24 +0200)]

Merge pull request #51107 from zdover23/wip-doc-2023-04-16-backport-51099-to-reef

reef: doc/dev: format command in cephfs-mirroring

Reviewed-by: Cole Mitchell <cole.mitchell.ceph@gmail.com>

commit | commitdiff | tree

zdover23 [Sun, 16 Apr 2023 16:22:40 +0000 (18:22 +0200)]

Merge pull request #51096 from zdover23/wip-doc-2023-04-16-backport-51062-to-reef

reef: doc/glossary: add "Hybrid Storage"

Reviewed-by: Cole Mitchell <cole.mitchell.ceph@gmail.com>

commit | commitdiff | tree

zdover23 [Sun, 16 Apr 2023 16:18:43 +0000 (18:18 +0200)]

Merge pull request #51092 from zdover23/wip-doc-2023-04-16-backport-51091-to-reef

reef: doc/mgr/prometheus: fix confval reference

Reviewed-by: Cole Mitchell <cole.mitchell.ceph@gmail.com>

commit | commitdiff | tree

Mike Perez [Fri, 7 Apr 2023 21:00:36 +0000 (14:00 -0700)]

doc/foundation: Update Foundation members for April 2023

Signed-off-by: Mike Perez <thingee@gmail.com>
(cherry picked from commit 759f26a99f7a1b52954e12e080304b867af81418)

commit | commitdiff | tree

Zac Dover [Sun, 16 Apr 2023 09:11:27 +0000 (11:11 +0200)]

doc/dev: format command in cephfs-mirroring

Correctly format a command in doc/dev/cephfs-mirroring/#creating-users.

Reported by casanlin@init7.net at
https://pad.ceph.com/p/Report_Documentation_Bugs

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 408219bfca6b1e698229967e76d22d10028b7c20)

commit | commitdiff | tree

colemitchell [Sun, 16 Apr 2023 14:43:15 +0000 (10:43 -0400)]

Merge pull request #51104 from zdover23/wip-doc-2023-04-16-backport-51103-to-reef

reef: doc/radosgw: format part of s3select

Reviewed-by: Cole Mitchell <cole.mitchell.ceph@gmail.com>

commit | commitdiff | tree

Cole Mitchell [Sun, 16 Apr 2023 13:13:56 +0000 (09:13 -0400)]

doc/radosgw: format part of s3select

Format the first section of s3select. Nothing else is being fixed.

Signed-off-by: Cole Mitchell <cole.mitchell.ceph@gmail.com>
(cherry picked from commit a6a84471a7af154e7ccc93f51df2fc9744dc606c)

commit | commitdiff | tree

Zac Dover [Thu, 13 Apr 2023 12:01:44 +0000 (14:01 +0200)]

doc/glossary: add "Hybrid Storage"

Add "Hybrid Storage" to the glossary.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit dc8148d0727b307fb3baa30baf9dfee9bf8a247e)

commit | commitdiff | tree

Piotr Parczewski [Sat, 15 Apr 2023 21:16:35 +0000 (23:16 +0200)]

doc/mgr/prometheus: fix confval reference

Signed-off-by: Piotr Parczewski <piotr@stackhpc.com>
(cherry picked from commit b9b75dafe248e07b21f2958023697397094cc537)

commit | commitdiff | tree

Anthony D'Atri [Sat, 15 Apr 2023 09:46:55 +0000 (11:46 +0200)]

Merge pull request #51087 from zdover23/wip-doc-2023-04-15-backport-51086-to-reef

reef: doc/rados/ops: remove ceph-medic from monitoring

commit | commitdiff | tree

Zac Dover [Sat, 15 Apr 2023 07:42:31 +0000 (09:42 +0200)]

doc/rados/ops: remove ceph-medic from monitoring

Remove mention of ceph-medic from doc/rados/operations/monitoring.rst,
because it is no longer supported.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 42cd28a2a639e68a44838ae4e7f875cb6bd5d97b)

commit | commitdiff | tree

Nizamudeen A [Fri, 14 Apr 2023 06:03:16 +0000 (11:33 +0530)]

mgr/dashboard: fix cephadm e2e expression changed error

tried to fix this issue from the daemon component sometime ago several
times but it didn't work. So force ignoring the error

Fixes: https://tracker.ceph.com/issues/59444
Signed-off-by: Nizamudeen A <nia@redhat.com>
(cherry picked from commit f7e29e5ab85fabcf2524656bb456a2955fa8608d)

commit | commitdiff | tree

Nizamudeen A [Fri, 14 Apr 2023 08:24:57 +0000 (13:54 +0530)]

Merge pull request #51010 from rhcs-dashboard/wip-59420-reef

reef: mgr/dashboard: fix eviction of all FS clients

Reviewed-by: Pegonzal <NOT@FOUND>
Reviewed-by: Nizamudeen A <nia@redhat.com>

commit | commitdiff | tree

Anthony D'Atri [Thu, 13 Apr 2023 12:32:26 +0000 (08:32 -0400)]

Merge pull request #51063 from zdover23/wip-doc-2023-04-13-backport-50713-to-reef

reef: doc/glossary: improve "CephX" entry

commit | commitdiff | tree

Zac Dover [Tue, 28 Mar 2023 08:42:11 +0000 (18:42 +1000)]

doc/glossary: improve "CephX" entry

Improve the glossary entry for "CephX".

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 02e3a5cb763987eeaee2dd1a7543d2762aaad7fe)

commit | commitdiff | tree

Ilya Dryomov [Thu, 13 Apr 2023 12:03:55 +0000 (14:03 +0200)]

Merge pull request #50919 from idryomov/wip-rbd-reef-backports-1

reef: RBD backports (batch 1)

Reviewed-by: Mykola Golub <mgolub@suse.com>
Reviewed-by: Christopher Hoffman <choffman@redhat.com>

commit | commitdiff | tree

Nizamudeen A [Thu, 13 Apr 2023 10:10:20 +0000 (15:40 +0530)]

Merge pull request #51058 from rhcs-dashboard/wip-59436-reef

reef: mgr/dashboard: rbd-mirror force promotion

Reviewed-by: Nizamudeen A <nia@redhat.com>

commit | commitdiff | tree

Ilya Dryomov [Tue, 11 Apr 2023 20:43:58 +0000 (22:43 +0200)]

qa/suites/rbd: install qemu-utils in addition to qemu-block-extra on Ubuntu

qemu-utils is usually pre-installed but, due to what appears to be
a Ubuntu packaging bug, it's not upgraded when qemu-block-extra is
installed:

  The following NEW packages will be installed:
    qemu-block-extra
  The following packages will be upgraded:
    qemu-system-common qemu-system-data qemu-system-gui qemu-system-x86

However, the version of the block driver must match exactly the version
of the qemu-img tool, so the above leads to:

  $ qemu-img convert -f qcow2 -O raw /home/ubuntu/cephtest/qemu/base.client.0.0.qcow2 rbd:rbd/client.0.0
  Failed to initialize module: /usr/lib/x86_64-linux-gnu/qemu/block-rbd.so
  Note: only modules from the same build can be loaded.
  qemu: module block-block-rbd not found, do you want to install qemu-block-extra package?
  qemu-img: Unknown protocol 'rbd'

Fixes: https://tracker.ceph.com/issues/59431
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit c529fdd63a5aae2c598078df05fe9bbef40042dc)

commit | commitdiff | tree

Pedro Gonzalez Gomez [Wed, 5 Apr 2023 15:42:52 +0000 (17:42 +0200)]

mgr/dashboard: rbd-mirror force promotion

resolves: https://tracker.ceph.com/issues/59327
Signed-off-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>
(cherry picked from commit 9696b6a04830297c23c4cccd6e7c225f183ba0b2)

commit | commitdiff | tree

zdover23 [Wed, 12 Apr 2023 09:41:54 +0000 (19:41 +1000)]

Merge pull request #51035 from zdover23/wip-doc-2023-04-12-backport-50993-to-reef

reef: doc/rados/operations: edit monitoring.rst

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

commit | commitdiff | tree

Guillaume Abrioux [Wed, 12 Apr 2023 07:58:53 +0000 (09:58 +0200)]

Merge pull request #50994 from guits/cv-bkp-50473-reef

ceph-volume: update the OS before deploying Ceph (reef)

commit | commitdiff | tree

Zac Dover [Tue, 11 Apr 2023 04:15:47 +0000 (14:15 +1000)]

doc/rados/operations: edit monitoring.rst

Line-edit the final third of doc/rados/operations/monitoring.rst.

Follows https://github.com/ceph/ceph/pull/50834.

https://tracker.ceph.com/issues/58485

Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit b9ccad80608953fc0af779e8cad93971d47649b6)

commit | commitdiff | tree

Nizamudeen A [Tue, 11 Apr 2023 16:11:42 +0000 (21:41 +0530)]

Merge pull request #51006 from rhcs-dashboard/wip-59402-reef

reef: mgr/dashboard: fix create osd default selected as recommended not working

Reviewed-by: Pegonzal <NOT@FOUND>
Reviewed-by: Aashish Sharma <aasharma@redhat.com>

commit | commitdiff | tree

Casey Bodley [Wed, 22 Mar 2023 17:57:57 +0000 (13:57 -0400)]

qa/rgw: unpin centos for verify suite

use a random supported distro instead of centos

Fixes: https://tracker.ceph.com/issues/54102
Signed-off-by: Casey Bodley <cbodley@redhat.com>
(cherry picked from commit 4bc1f376b901f809748b751d45899e512738c934)

commit | commitdiff | tree

yuval Lifshitz [Fri, 16 Dec 2022 19:01:06 +0000 (14:01 -0500)]

rgwlc/notifications: also fix etag

Signed-off-by: Matt Benjamin <mbenjamin@redhat.com>
(cherry picked from commit 7c58b2a9f3cb3f31398065d862e264bb248760bf)

commit | commitdiff | tree

Matt Benjamin [Thu, 15 Dec 2022 19:55:16 +0000 (14:55 -0500)]

rgw/notifications: fetch object state to get size, in rgw_lc.cc

Failure to call get_obj_state() leaves object size and other members
uninitialized, and appears to result in in lc delete notifications
with 0 for object size.

Fixes: https://tracker.ceph.com/issues/58287
Signed-off-by: Matt Benjamin <mbenjamin@redhat.com>
(cherry picked from commit b20a66767f782c06258fb0a5551ee45d6dccb91c)

commit | commitdiff | tree

Vedansh Bhartia [Thu, 2 Mar 2023 13:04:53 +0000 (18:34 +0530)]

rgw: use unique_ptr for flat_map emplace in BucketTrimWatcher

When emplacing objects into the trim notify handler of
BucketTrimWatcher, use a unique_ptr for the handler so that it is
destroyed if the emplace fails.

Though the destructor is already called, this behaviour cannot be relied
upon. std::map does not exhibit the same behaviour, and would have
leaked memory had it been used instead.

Fixes: https://tracker.ceph.com/issues/57938
Signed-off-by: Vedansh Bhartia <vedanshbhartia@gmail.com>
(cherry picked from commit 43ef4753eb338781529a7dc8360eab13d56fce85)

Unnamed repository; edit this file 'description' to name the repository.