git.apps.os.sepia.ceph.com Git - ceph.git/log

]> git.apps.os.sepia.ceph.com Git - ceph.git/log

projects / ceph.git / log

summary | shortlog | log | commit | commitdiff | tree
first ⋅ prev ⋅ next

commit | commitdiff | tree

Sridhar Seshasayee [Tue, 11 Apr 2023 16:48:51 +0000 (22:18 +0530)]

common/options/osd.yaml.in: Change mclock max sequential bandwidth for SSDs

The osd_mclock_max_sequential_bandwidth_ssd is changed to 1200 MiB/s as
a reasonable middle ground considering the broad range of SSD capabilities.
This allows the mClock's cost model to extract the SSDs capability
depending on the cost of the IO being performed.

Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>

commit | commitdiff | tree

Sridhar Seshasayee [Tue, 11 Apr 2023 16:28:35 +0000 (21:58 +0530)]

osd/: Retain the default osd_max_backfills limit to 1 for mClock

The earlier limit of 3 was still aggressive enough to have an impact on
the client and other competing operations. Retain the current default
for mClock. This can be modified if necessary after setting the
osd_mclock_override_recovery_settings option.

Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>

commit | commitdiff | tree

Samuel Just [Tue, 11 Apr 2023 15:15:38 +0000 (08:15 -0700)]

common/options/osd.yaml.in: change mclock profile default to balanced

Let's use the middle profile as the default.
Modify the standalone tests accordingly.

Signed-off-by: Samuel Just <sjust@redhat.com>
Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>

commit | commitdiff | tree

Samuel Just [Tue, 11 Apr 2023 15:10:04 +0000 (08:10 -0700)]

osd/scheduler/mClockScheduler: avoid limits for recovery

Now that recovery operations are split between background_recovery and
background_best_effort, rebalance qos params to avoid penalizing
background_recovery while idle.

Signed-off-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Samuel Just [Mon, 10 Apr 2023 21:18:49 +0000 (14:18 -0700)]

osd/: add counters for ops delayed due to degraded|unreadable target

Signed-off-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Samuel Just [Thu, 6 Apr 2023 21:15:02 +0000 (14:15 -0700)]

osd/: add counters for queue latency for PGRecovery[Context]

Signed-off-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Samuel Just [Thu, 6 Apr 2023 20:50:48 +0000 (20:50 +0000)]

osd/: add per-op latency averages for each recovery related message

Signed-off-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Samuel Just [Thu, 6 Apr 2023 07:04:05 +0000 (00:04 -0700)]

osd/: differentiate priority for PGRecovery[Context]

PGs with degraded objects should be higher priority.

Signed-off-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Samuel Just [Thu, 6 Apr 2023 05:57:48 +0000 (22:57 -0700)]

osd/: add MSG_OSD_PG_(BACKFILL|BACKFILL_REMOVE|SCAN) as recovery messages

Otherwise, these end up as PGOpItem and therefore as immediate:

class PGOpItem : public PGOpQueueable {
...
  op_scheduler_class get_scheduler_class() const final {
    auto type = op->get_req()->get_type();
    if (type == CEPH_MSG_OSD_OP ||
  type == CEPH_MSG_OSD_BACKOFF) {
      return op_scheduler_class::client;
    } else {
      return op_scheduler_class::immediate;
    }
  }
...
};

This was probably causing a bunch of extra interference with client
ops.

Signed-off-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Samuel Just [Thu, 6 Apr 2023 05:57:42 +0000 (22:57 -0700)]

osd/: differentiate scheduler class for undersized/degraded vs data movement

Recovery operations on pgs/objects that have fewer than the configured
number of copies should be treated more urgently than operations on
pgs/objects that simply need to be moved to a new location.

Signed-off-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Samuel Just [Thu, 6 Apr 2023 04:30:18 +0000 (04:30 +0000)]

osd/.../OpSchedulerItem: add MSG_OSD_PG_PULL to is_recovery_msg

Signed-off-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Samuel Just [Thu, 6 Apr 2023 04:23:23 +0000 (04:23 +0000)]

osd/: move PGRecoveryMsg check from osd into PGRecoveryMsg::is_recovery_msg

Signed-off-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Samuel Just [Thu, 6 Apr 2023 03:45:19 +0000 (03:45 +0000)]

osd/: move get_recovery_op_priority into PeeringState next to get_*_priority

Consolidate methods governing recovery scheduling in PeeringState.

Signed-off-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Samuel Just [Tue, 4 Apr 2023 23:34:17 +0000 (23:34 +0000)]

osd/scheduler: simplify qos specific params in OpSchedulerItem

is_qos_item() was only used in operator<< for OpSchedulerItem. However,
it's actually useful to see priority for mclock items since it affects
whether it goes into the immediate queues and, for some types, the
class. Unconditionally display both class_id and priority.

Signed-off-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Samuel Just [Tue, 4 Apr 2023 23:22:59 +0000 (23:22 +0000)]

osd/scheduler: remove unused PGOpItem::maybe_get_mosd_op

Signed-off-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Samuel Just [Tue, 4 Apr 2023 23:13:41 +0000 (23:13 +0000)]

osd/scheduler: remove OpQueueable::get_order_locker() and supporting machinery

Apparently unused.

Signed-off-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Samuel Just [Tue, 4 Apr 2023 23:05:56 +0000 (23:05 +0000)]

osd/scheduler: remove OpQueueable::get_op_type() and supporting machinery

Apparently unused.

Signed-off-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Samuel Just [Mon, 3 Apr 2023 20:31:46 +0000 (13:31 -0700)]

PeeringState::clamp_recovery_priority: use std::clamp

Signed-off-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Sridhar Seshasayee [Sat, 25 Mar 2023 07:16:09 +0000 (12:46 +0530)]

doc: Modify mClock configuration documentation to reflect new cost model

Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>

commit | commitdiff | tree

Sridhar Seshasayee [Tue, 21 Feb 2023 13:01:32 +0000 (18:31 +0530)]

osd: Retain overridden mClock recovery settings across osd restarts

Fix an issue where an overridden mClock recovery setting (set prior to
an osd restart) could be lost after an osd restart.

For e.g., consider that prior to an osd restart, the option
'osd_max_backfill' was successfully set to a value different from the
mClock default. If the osd was restarted for some reason, the
boot-up sequence was incorrectly resetting the backfill value to the
mclock default within the async local/remote reservers. This fix
ensures that no change is made if the current overriden value is
different from the mClock default.

Modify an existing standalone test to verify that the local and remote
async reservers are updated to the desired number of backfills under
normal conditions and also across osd restarts.

Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>

commit | commitdiff | tree

Sridhar Seshasayee [Mon, 20 Mar 2023 13:24:57 +0000 (18:54 +0530)]

osd: Set default max active recovery and backfill limits for mClock

Client ops are sensitive to the recovery load and must be carefully
set for osds whose underlying device is HDD. Tests revealed that
recoveries with osd_max_backfills = 10 and osd_recovery_max_active_hdd = 5
were still aggressive and overwhelmed client ops. The built-in defaults
for mClock are now set to:

    1) osd_recovery_max_active_hdd = 3
    2) osd_recovery_max_active_ssd = 10
    3) osd_max_backfills = 3

The above may be modified if necessary by setting
osd_mclock_override_recovery_settings option.

Fixes: https://tracker.ceph.com/issues/58529
Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>

commit | commitdiff | tree

Sridhar Seshasayee [Wed, 29 Mar 2023 19:33:08 +0000 (01:03 +0530)]

osd/scheduler/mClockScheduler: make is_rotational const

Signed-off-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Sridhar Seshasayee [Wed, 29 Mar 2023 19:31:29 +0000 (01:01 +0530)]

osd/scheduler/mClockScheduler: simplify profile handling

Previously, setting default configs from the configured profile was
split across:
- enable_mclock_profile_settings
- set_mclock_profile - sets mclock_profile class member
- set_*_allocations - updates client_allocs class member
- set_profile_config - sets profile based on client_allocs class member

This made tracing the effect of changing the profile pretty challenging
due passing state through class member variables.

Instead, define a simple profile_t with three constexpr values
corresponding to the three profiles and handle it all in a single
set_config_defaults_from_profile() method.

Signed-off-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Sridhar Seshasayee [Thu, 9 Feb 2023 15:35:22 +0000 (21:05 +0530)]

osd: Modify mClock scheduler's cost model to represent cost in bytes

The mClock scheduler's cost model for HDDs/SSDs is modified and now
represents the cost of an IO in terms of bytes.

The cost parameters, namely, osd_mclock_cost_per_io_usec_[hdd|ssd]
and osd_mclock_cost_per_byte_usec_[hdd|ssd] which represent the cost
of an IO in secs are inaccurate and therefore removed.

The new model considers the following aspects of an osd to calculate
the cost of an IO:

- osd_mclock_max_capacity_iops_[hdd|ssd] (existing option)
   The measured random write IOPS at 4 KiB block size. This is
   measured during OSD boot-up using OSD bench tool.
- osd_mclock_max_sequential_bandwidth_[hdd|ssd] (new config option)
   The maximum sequential bandwidth of of the underlying device.
   For HDDs, 150 MiB/s is considered, and for SSDs 750 MiB/s is
   considered in the cost calculation.

The following important changes are made to arrive at the overall
cost of an IO,

1. Represent QoS reservation and limit config parameter as proportion:
The reservation and limit parameters are now set in terms of a
proportion of the OSD's max IOPS capacity. The earlier representation
was in terms of IOPS per OSD shard which required the user to perform
calculations before setting the parameter. Representing the
reservation and limit in terms of proportions is much more intuitive
and simpler for a user.

2. Cost per IO Calculation:
Using the above config options, osd_bandwidth_cost_per_io for the osd is
calculated and set. It is the ratio of the max sequential bandwidth and
the max random write iops of the osd. It is a constant and represents the
base cost of an IO in terms of bytes. This is added to the actual size of
the IO(in bytes) to represent the overall cost of the IO operation.See
mClockScheduler::calc_scaled_cost().

3. Cost calculation in Bytes:
The settings for reservation and limit in terms a fraction of the OSD's
maximum IOPS capacity is converted to Bytes/sec before updating the
mClock server's ClientInfo structure. This is done for each OSD op shard
using osd_bandwidth_capacity_per_shard shown below:

    (res|lim)  = (IOPS proportion) * osd_bandwidth_capacity_per_shard
    (Bytes/sec)   (unitless)             (bytes/sec)

The above result is updated within the mClock server's ClientInfo
structure for different op_scheduler_class operations. See
mClockScheduler::ClientRegistry::update_from_config().

The overall cost of an IO operation (in secs) is finally determined
during the tag calculations performed in the mClock server. See
crimson::dmclock::RequestTag::tag_calc() for more details.

4. Profile Allocations:
Optimize mClock profile allocations due to the change in the cost model
and lower recovery cost.

5. Modify standalone tests to reflect the change in the QoS config
parameter representation of reservation and limit options.

Fixes: https://tracker.ceph.com/issues/58529
Fixes: https://tracker.ceph.com/issues/59080
Signed-off-by: Samuel Just <sjust@redhat.com>
Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>

commit | commitdiff | tree

Sridhar Seshasayee [Fri, 3 Feb 2023 12:23:06 +0000 (17:53 +0530)]

osd: update PGRecovery queue item cost to reflect object size

Previously, we used a static value of osd_recovery_cost (20M
by default) for PGRecovery. For pools with relatively small
objects, this causes mclock to backfill very very slowly as
20M massively overestimates the amount of IO each recovery
queue operation requires. Instead, add a cost_per_object
parameter to OSDService::awaiting_throttle and set it to the
average object size in the PG being queued.

Fixes: https://tracker.ceph.com/issues/58606
Signed-off-by: Samuel Just <sjust@redhat.com>
Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>

commit | commitdiff | tree

Sridhar Seshasayee [Fri, 3 Feb 2023 12:17:38 +0000 (17:47 +0530)]

osd: update OSDService::queue_recovery_context to specify cost

Previously, we always queued this with cost osd_recovery_cost which
defaults to 20M. With mclock, this caused these items to be delayed
heavily. Instead, base the cost on the operation queued.

Fixes: https://tracker.ceph.com/issues/58606
Signed-off-by: Samuel Just <sjust@redhat.com>
Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>

commit | commitdiff | tree

Sridhar Seshasayee [Fri, 3 Feb 2023 12:12:46 +0000 (17:42 +0530)]

osd/osd_types: use appropriate cost value for PullOp

See included comments -- previous values did not account for object
size. This causes problems for mclock which is much more strict
in how it interprets costs.

Fixes: https://tracker.ceph.com/issues/58607
Signed-off-by: Samuel Just <sjust@redhat.com>
Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>

commit | commitdiff | tree

Sridhar Seshasayee [Thu, 2 Feb 2023 12:16:27 +0000 (17:46 +0530)]

osd/osd_types: use appropriate cost value for PushReplyOp

See included comments -- previous values did not account for object
size. This causes problems for mclock which is much more strict
in how it interprets costs.

Fixes: https://tracker.ceph.com/issues/58529
Signed-off-by: Samuel Just <sjust@redhat.com>
Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>

commit | commitdiff | tree

Anthony D'Atri [Sun, 7 May 2023 10:37:39 +0000 (06:37 -0400)]

Merge pull request #51378 from zdover23/wip-doc-2023-05-07-backport-51322-to-quincy

quincy: doc/rados: stretch-mode: stretch cluster issues

commit | commitdiff | tree

Zac Dover [Wed, 3 May 2023 05:16:07 +0000 (15:16 +1000)]

doc/rados: stretch-mode: stretch cluster issues

Edit "Stretch Cluster Issues", which might better be called "Netsplits"
or "Recognizing Netsplits".

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 6c1baffb85556120672b45cce89b93a20e7b09a2)

commit | commitdiff | tree

Nizamudeen A [Fri, 5 May 2023 15:19:51 +0000 (20:49 +0530)]

Merge pull request #51252 from rhcs-dashboard/fix-pg-imbalancy-quincy

quincy: mgr/dashboard: fix CephPGImbalance alert

Reviewed-by: Avan Thakkar <athakkar@redhat.com>

commit | commitdiff | tree

Nizamudeen A [Fri, 5 May 2023 09:45:45 +0000 (15:15 +0530)]

Merge pull request #51358 from rhcs-dashboard/wip-59655-quincy

quincy: mgr/dashboard: bump moment from 2.29.3 to 2.29.4 in /src/pybind/mgr/dashboard/frontend

Reviewed-by: Pegonzal <NOT@FOUND>

commit | commitdiff | tree

dependabot[bot] [Wed, 6 Jul 2022 19:10:21 +0000 (19:10 +0000)]

mgr/dashboard: bump moment in /src/pybind/mgr/dashboard/frontend

Bumps [moment](https://github.com/moment/moment) from 2.29.3 to 2.29.4.
- [Release notes](https://github.com/moment/moment/releases)
- [Changelog](https://github.com/moment/moment/blob/develop/CHANGELOG.md)
- [Commits](https://github.com/moment/moment/compare/2.29.3...2.29.4)

---
updated-dependencies:
- dependency-name: moment
dependency-type: direct:production
...

Signed-off-by: dependabot[bot] <support@github.com>
(cherry picked from commit 9e8245e328e56755f13bffbec4b0740850696f94)

commit | commitdiff | tree

Nizamudeen A [Fri, 5 May 2023 05:27:31 +0000 (10:57 +0530)]

Merge pull request #51149 from rhcs-dashboard/wip-59466-quincy

quincy: mgr/dashboard: skip Create OSDs step in Cluster expansion

Reviewed-by: Avan Thakkar <athakkar@redhat.com>

commit | commitdiff | tree

Nizamudeen A [Fri, 5 May 2023 05:26:16 +0000 (10:56 +0530)]

Merge pull request #51112 from rhcs-dashboard/wip-59459-quincy

quincy: mgr/dashboard: expose more grafana configs in service form

Reviewed-by: Avan Thakkar <athakkar@redhat.com>
Reviewed-by: Pere Diaz Bou <pdiazbou@redhat.com>

commit | commitdiff | tree

Anthony D'Atri [Fri, 5 May 2023 03:10:35 +0000 (23:10 -0400)]

Merge pull request #51350 from zdover23/wip-doc-2023-05-05-backport-51348-to-quincy

quincy: doc: Use `ceph osd crush tree` command to display weight set weights

commit | commitdiff | tree

James Lakin [Thu, 4 May 2023 17:02:36 +0000 (18:02 +0100)]

doc: Use `ceph osd crush tree` command to display weight set weights

The previous `ceph osd tree` doesn't show pool-defined weight-sets as the above documentation suggests.

Signed-off-by: James Lakin <james@jameslakin.co.uk>
(cherry picked from commit 15c3d72a43a37798de823b26f1429f7776f67aaa)

commit | commitdiff | tree

Nizamudeen A [Thu, 4 May 2023 15:29:09 +0000 (20:59 +0530)]

Merge pull request #51325 from rhcs-dashboard/wip-59623-quincy

quincy: mgr/dashboard: fix the rbd mirroring configure check

Reviewed-by: Avan Thakkar <athakkar@redhat.com>

commit | commitdiff | tree

Anthony D'Atri [Thu, 4 May 2023 02:19:12 +0000 (22:19 -0400)]

Merge pull request #51338 from zdover23/wip-doc-2023-05-04-backport-51292-to-quincy

quincy: doc/rados: edit stretch-mode.rst

commit | commitdiff | tree

Zac Dover [Sun, 30 Apr 2023 02:09:51 +0000 (12:09 +1000)]

doc/rados: edit stretch-mode.rst

Edit "Stretch Mode Limitations" (renamed "Limitations of Stretch Mode"
in this commit) in doc/rados/operations/stretch-mode.rst.

Co-authored-by: Greg Farnum <gfarnum@redhat.com>
Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 410e2a181c3247d13a1b20d80c4bcbbc1a5f84da)

commit | commitdiff | tree

Laura Flores [Wed, 3 May 2023 18:38:03 +0000 (13:38 -0500)]

Merge pull request #51335 from ljflores/wip-59625-quincy

quincy: mgr: add urllib3==1.26.15 to mgr/requirements.txt

commit | commitdiff | tree

Laura Flores [Mon, 1 May 2023 16:28:54 +0000 (16:28 +0000)]

mgr: add urllib3==1.26.15 to mgr/requirements.txt

We do not depend on any particular version of
urllib3, but as a workaround to the incompatibility
of urllib3 constraints between kubernetes and
requests, we need to pin it temporarily to
the version both are happy with.

Fixes: https://tracker.ceph.com/issues/59591
Signed-off-by: Laura Flores <lflores@redhat.com>
(cherry picked from commit 80d460005e44649191aa862fa78bd278644b5237)

commit | commitdiff | tree

Guillaume Abrioux [Wed, 3 May 2023 09:22:21 +0000 (11:22 +0200)]

Merge pull request #51210 from guits/wip-59257-quincy

quincy: ceph-volume: fix drive-group issue that expects the batch_args to be a string

commit | commitdiff | tree

Guillaume Abrioux [Wed, 3 May 2023 09:22:08 +0000 (11:22 +0200)]

Merge pull request #51206 from guits/wip-59518-quincy

quincy: ceph-volume: fix batch refactor issue

commit | commitdiff | tree

Guillaume Abrioux [Wed, 3 May 2023 09:21:53 +0000 (11:21 +0200)]

Merge pull request #51195 from guits/wip-59524-quincy

quincy: ceph-volume: quick fix in zap.py

commit | commitdiff | tree

Nizamudeen A [Thu, 27 Apr 2023 11:24:24 +0000 (16:54 +0530)]

mgr/dashboard: fix the rbd mirroring configure check

In one-way mirroring, the condition we are checking now for configuring
the mirroring will fail because only one cluster needs to have the
mirror daemon present. Thus even if mirroring is successfuly happening
the page won't load. For now relaxing the rule until we find a better
api call to check for the status

Fixes: https://tracker.ceph.com/issues/59573
Signed-off-by: Nizamudeen A <nia@redhat.com>
(cherry picked from commit 09de6be991c4240065bf5774e798b3d274443cff)

commit | commitdiff | tree

zdover23 [Tue, 2 May 2023 22:25:12 +0000 (08:25 +1000)]

Merge pull request #51310 from zdover23/wip-doc-2023-05-02-backport-51133-to-quincy

quincy: doc/mgr: update prompts in prometheus.rst

Reviewed-by: Cole Mitchell <cole.mitchell.ceph@gmail.com>

commit | commitdiff | tree

Zac Dover [Tue, 18 Apr 2023 14:28:50 +0000 (16:28 +0200)]

doc/mgr: update prompts in prometheus.rst

Update prompts in prometheus.rst so that they're unselectable.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 5a90d64b343f158d43397c70c267deb4e7ef0e00)

commit | commitdiff | tree

Anthony D'Atri [Mon, 1 May 2023 23:25:55 +0000 (19:25 -0400)]

Merge pull request #51306 from zdover23/wip-doc-2023-05-02-backport-51299-to-quincy

quincy: doc/radosgw: rabbitmq - push-endpoint edit

commit | commitdiff | tree

Zac Dover [Mon, 1 May 2023 17:14:01 +0000 (03:14 +1000)]

doc/radosgw: rabbitmq - push-endpoint edit

Remove a note that directed users to change "push-endpoint" (with a
hyphen) to "push_endpoint" (with an underscore) when using rabbitmq.

Re: https://github.com/ceph/ceph/pull/48486#issuecomment-1529925389

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit e4b35de2abf00d514c76f77645c587c562bab05d)

commit | commitdiff | tree

Anthony D'Atri [Mon, 1 May 2023 20:35:59 +0000 (16:35 -0400)]

Merge pull request #51303 from zdover23/wip-doc-2023-05-02-backport-51296-to-quincy

quincy: doc/rados: edit stretch-mode.rst

commit | commitdiff | tree

Zac Dover [Mon, 1 May 2023 02:29:07 +0000 (12:29 +1000)]

doc/rados: edit stretch-mode.rst

Refine and supplement the introductory and explanatory text at the top
of the /doc/rados/operations/stretch-mode.rst file.

Co-authored-by: Josh Durgin <jdurgin@redhat.com>
Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit b642900abc57302e62a5064dba510c3cc5743ac0)

commit | commitdiff | tree

Anthony D'Atri [Sat, 29 Apr 2023 20:01:07 +0000 (16:01 -0400)]

Merge pull request #51290 from zdover23/wip-doc-2023-04-30-backport-51285-to-quincy

quincy: doc/rados: edit stretch-mode procedure

commit | commitdiff | tree

Zac Dover [Sat, 29 Apr 2023 00:14:02 +0000 (10:14 +1000)]

doc/rados: edit stretch-mode procedure

Edit the "stretch mode" section in doc/rados/operations/stretch-mode.rst
so that the procedure is formatted as a procedure and the sentences
correctly have heads.

Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit a19ff7a5ea9bbd24365648a90abfa1b720c5b231)

commit | commitdiff | tree

zdover23 [Sat, 29 Apr 2023 17:32:11 +0000 (03:32 +1000)]

Merge pull request #51287 from zdover23/wip-doc-2023-04-29-backport-51276-to-quincy

quincy: docs: Update the Prometheus endpoint info

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

commit | commitdiff | tree

Paul Cuzner [Fri, 28 Apr 2023 05:21:39 +0000 (17:21 +1200)]

docs: Update the Prometheus endpoint info

This patch just tidies up some of the links and adds
an example showing how the http_sd_configs option
may be used.

Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
(cherry picked from commit 690d34ab08f22cd988828aa2097531627000907e)

commit | commitdiff | tree

Anthony D'Atri [Fri, 28 Apr 2023 00:52:26 +0000 (20:52 -0400)]

Merge pull request #51273 from zdover23/wip-doc-2023-04-28-backport-51271-to-quincy

quincy: doc/rados: m-config-ref: edit "background"

commit | commitdiff | tree

Zac Dover [Thu, 27 Apr 2023 22:35:17 +0000 (08:35 +1000)]

doc/rados: m-config-ref: edit "background"

Edit the "Background" section of doc/rados/monitor/config-ref.rst

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 9223863fc83095def59b416bf70f9a828a701ccc)

commit | commitdiff | tree

Aashish Sharma [Mon, 24 Apr 2023 06:14:11 +0000 (11:44 +0530)]

mgr/dashboard: fix CephPGImbalance alert

Fixes: https://tracker.ceph.com/issues/55568
Signed-off-by: Aashish Sharma <aasharma@redhat.com>
(cherry picked from commit 8b5c4d27c20bce82bb46064a2cd2928a0736e6cd)

commit | commitdiff | tree

zdover23 [Thu, 27 Apr 2023 00:44:56 +0000 (10:44 +1000)]

Merge pull request #51240 from zdover23/wip-doc-2023-04-27-backport-51154-to-quincy

quincy: doc/rados/ops: edit user-management.rst (3 of x)

Reviewed-by: Cole Mitchell <cole.mitchell@gmail.com>

commit | commitdiff | tree

Zac Dover [Thu, 20 Apr 2023 08:25:00 +0000 (10:25 +0200)]

doc/rados/ops: edit user-management.rst (3 of x)

Line-edit doc/rados/user-management.rst (3 of x).

https://tracker.ceph.com/issues/58485

Follows https://github.com/ceph/ceph/pull/51140.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 97b751ed8f8917f9d4d9cbca03f224e6518836ef)

commit | commitdiff | tree

zdover23 [Thu, 27 Apr 2023 00:09:17 +0000 (10:09 +1000)]

Merge pull request #51156 from zdover23/wip-doc-2023-04-20-backport-51140-to-quincy

quincy: doc/rados: edit user-management (2 of x)

Reviewed-by: Cole Mitchell <cole.mitchell@gmail.com>

commit | commitdiff | tree

Anthony D'Atri [Wed, 26 Apr 2023 22:26:46 +0000 (18:26 -0400)]

Merge pull request #51236 from zdover23/wip-doc-2023-04-27-backport-51204-to-quincy

quincy: doc/cephfs: explain cephfs data and metadata set

commit | commitdiff | tree

Zac Dover [Tue, 25 Apr 2023 07:46:53 +0000 (17:46 +1000)]

doc/cephfs: explain cephfs data and metadata set

Explain how to set application metadata for the CephFS data pool and the
CephFS metadata pool.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 9152f9700420f9735533f276559af87dff97bd75)

commit | commitdiff | tree

Anthony D'Atri [Wed, 26 Apr 2023 00:21:47 +0000 (20:21 -0400)]

Merge pull request #51221 from zdover23/wip-doc-2023-04-26-backport-51193-to-quincy

quincy: doc/start: rewrite intro paragraph

commit | commitdiff | tree

Zac Dover [Mon, 24 Apr 2023 11:02:16 +0000 (13:02 +0200)]

doc/start: rewrite intro paragraph

Rewrite the first paragraph in doc/start/intro.rst.

Signed-off-by: Zac Dover <zac.dover@proton.me>
Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit bea01d5f1469030253a3403dbb9e2c9fa97806ac)

commit | commitdiff | tree

Patrick Donnelly [Tue, 25 Apr 2023 15:00:09 +0000 (11:00 -0400)]

Merge PR #51029 into quincy

* refs/pull/51029/head:
Revert "qa/fs/mixed-clients: specify distros for tests"

Reviewed-by: Venky Shankar <vshankar@redhat.com>

commit | commitdiff | tree

Patrick Donnelly [Tue, 25 Apr 2023 14:56:57 +0000 (10:56 -0400)]

Merge PR #50783 into quincy

* refs/pull/50783/head:
tools/cephfs: include lost+found in scan_links

Reviewed-by: Venky Shankar <vshankar@redhat.com>

commit | commitdiff | tree

Patrick Donnelly [Tue, 25 Apr 2023 14:54:52 +0000 (10:54 -0400)]

Merge PR #50779 into quincy

* refs/pull/50779/head:
mds: add config to decide whether to mark dentry bad
qa: add missing scan_links step for data scan recovery
qa/tasks/cephfs: test damage to dentry's first is caught
qa/tasks/cephfs: use rank_asok and allow specifying rank
qa/tasks: allow specifying timeout command prefix to ceph
mds: provide test configs for creating first corruption
mds: catch damage to dentry's first field
mds: add debugging for pre_cow_old_inode
mds: cleanup code
mds: check for some dentry damage in scrub
mds: remove unused method
mds: note damaged dentry with first gt last
mds: cluster log scrub failure for dirfrag
mds: mark dirfrag good if repaired
mds: only dump past_parent_snap if non-empty

Reviewed-by: Venky Shankar <vshankar@redhat.com>

commit | commitdiff | tree

Patrick Donnelly [Tue, 25 Apr 2023 14:53:55 +0000 (10:53 -0400)]

Merge PR #50777 into quincy

* refs/pull/50777/head:
log: fix stderr handling on Windows
log: add tests for stderr writes to fifos
log: use non-blocking atomic writes to stderr fifos
log: invalidate m_fd on close
log: reorg header

Reviewed-by: Venky Shankar <vshankar@redhat.com>

commit | commitdiff | tree

Patrick Donnelly [Tue, 25 Apr 2023 14:53:05 +0000 (10:53 -0400)]

Merge PR #50774 into quincy

* refs/pull/50774/head:
qa: ignore expected scrub error

Reviewed-by: Venky Shankar <vshankar@redhat.com>

commit | commitdiff | tree

Patrick Donnelly [Tue, 25 Apr 2023 14:52:23 +0000 (10:52 -0400)]

Merge PR #50772 into quincy

* refs/pull/50772/head:
qa: output higher debugging for cephfs-journal-tool/cephfs-data-scan

Reviewed-by: Venky Shankar <vshankar@redhat.com>

commit | commitdiff | tree

Patrick Donnelly [Tue, 25 Apr 2023 14:51:46 +0000 (10:51 -0400)]

Merge PR #50768 into quincy

* refs/pull/50768/head:
qa: ignore MDS_TRIM warnings when osd thrashing

Reviewed-by: Venky Shankar <vshankar@redhat.com>

commit | commitdiff | tree

Patrick Donnelly [Tue, 25 Apr 2023 14:50:56 +0000 (10:50 -0400)]

Merge PR #50767 into quincy

* refs/pull/50767/head:
qa: simplify and use correct recovery procedure
doc: update alternate meta pool recovery
tools/cephfs/DataScan: add debugging for directory injection

Reviewed-by: Venky Shankar <vshankar@redhat.com>

commit | commitdiff | tree

Patrick Donnelly [Tue, 25 Apr 2023 14:50:07 +0000 (10:50 -0400)]

Merge PR #50766 into quincy

* refs/pull/50766/head:
qa: cleanup volumes on unwind

Reviewed-by: Venky Shankar <vshankar@redhat.com>

commit | commitdiff | tree

Pedro Gonzalez Gomez [Tue, 25 Apr 2023 10:59:29 +0000 (12:59 +0200)]

Merge pull request #51164 from rhcs-dashboard/wip-59501-quincy

quincy: mgr/dashboard: hide notification on force promote

Reviewed-by: Avan Thakkar <athakkar@redhat.com>

commit | commitdiff | tree

Mohan Sharma [Tue, 27 Dec 2022 06:01:04 +0000 (11:31 +0530)]

ceph-volume: fix drive-group issue

The drive-group expects the batch_args to be a string,
however in the current version it is passed as a list
of one element, thus calling the first item of the list solves the issue.

Fixes: https://tracker.ceph.com/issues/59203
Signed-off-by: Mohan Sharma <mohan7427@gmail.com>
(cherry picked from commit 7602a99f7a1308c684a7c1d619bb6d9f09c79af9)

commit | commitdiff | tree

Guillaume Abrioux [Thu, 13 Apr 2023 14:42:32 +0000 (16:42 +0200)]

ceph-volume: fix batch refactor regression

This makes sure `ceph-volume lvm batch` will recreate the db device
with the right size when coming from a cluster deployed prior to 14.2.13

Fixes: https://tracker.ceph.com/issues/59442
Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>
(cherry picked from commit 98c68f2c8f648a5b9faac295999de212d362744d)

commit | commitdiff | tree

Guillaume Abrioux [Wed, 29 Mar 2023 14:58:11 +0000 (16:58 +0200)]

ceph-volume: quick fix in zap.py

`api.get_single_pv(filters={'lv_uuid': lv.lv_uuid})` needs to be called
only if `--destroy` is passed in order to remove vg and pv when there's
nothing left.

With old deployments, it is possible that a lv_uuid matches more than 1
PV.
Given that `get_single_pv()` is only needed when `--destroy` is passed,
let's move this call where it is actually needed.

This makes `ceph-volume lvm zap` fail even though

Fixes: https://tracker.ceph.com/issues/59210
Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>
(cherry picked from commit a666f700f16937565484dffc90713f6c04d76313)

commit | commitdiff | tree

Anthony D'Atri [Sun, 23 Apr 2023 21:19:36 +0000 (23:19 +0200)]

Merge pull request #51182 from zdover23/wip-doc-2023-04-23-backport-51177-to-quincy

quincy: doc/start: edit first 150 lines of documenting-ceph

commit | commitdiff | tree

Anthony D'Atri [Sun, 23 Apr 2023 21:16:26 +0000 (23:16 +0200)]

Merge pull request #51185 from zdover23/wip-doc-2023-04-23-backport-51178-to-quincy

quincy: Wip doc 2023 04 23 backport 51178 to quincy

commit | commitdiff | tree

Zac Dover [Sat, 22 Apr 2023 08:55:38 +0000 (10:55 +0200)]

doc/glossary: add "Placement Groups" definition

Add a definition of "Placement Groups" to the Glossary.

Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 9f37ea651f9ee2c51e0705b9b58ed356f1bc56e6)

commit | commitdiff | tree

Zac Dover [Sat, 22 Apr 2023 07:03:12 +0000 (09:03 +0200)]

doc/start: edit first 50 lines of documenting-ceph

Edit the first 150 lines of doc/start/documenting-ceph.rst. This is part
of an initiative to harvest the fruits of Cephalocon 2023, at which
documentation proved to be in demand to a surprising degree.

Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit dd37f94aa4f1de947b1eaf5d82cc529925f5823e)

commit | commitdiff | tree

Svelar [Sun, 23 Apr 2023 02:50:48 +0000 (10:50 +0800)]

Merge pull request #51124 from zdover23/wip-doc-2023-04-17-backport-49762-to-quincy

quincy: vstart: fix text format

commit | commitdiff | tree

Pedro Gonzalez Gomez [Thu, 20 Apr 2023 15:29:57 +0000 (17:29 +0200)]

mgr/dashboard: hide notification on force promote

Fixes: https://tracker.ceph.com/issues/59500
Signed-off-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>
(cherry picked from commit abe1e5101cae0fc98ad9c6c404c6f7ce97a42137)

commit | commitdiff | tree

Zac Dover [Tue, 18 Apr 2023 20:59:09 +0000 (22:59 +0200)]

doc/rados: edit user-management (2 of x)

Line-edit doc/rados/user-management.rst (2 of x). Some internal
references had to be removed, but these will be repaired when the next
part of this file is updated in a future PR.

Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit e3575bb72f307a27d49fedf3692ca661e3d613a5)

commit | commitdiff | tree

Nizamudeen A [Fri, 14 Apr 2023 19:33:11 +0000 (01:03 +0530)]

mgr/dashboard: skip Create OSDs step in Cluster expansion

Its to ensure OSDs are not deployed on all hosts because that would make
the host draining impossible

Fixes: https://tracker.ceph.com/issues/59457
Signed-off-by: Nizamudeen A <nia@redhat.com>
(cherry picked from commit 0f6d23a7aa024495a79cdedc80f7a00902115b6b)

commit | commitdiff | tree

Nizamudeen A [Tue, 18 Apr 2023 13:27:49 +0000 (18:57 +0530)]

Merge pull request #51119 from rhcs-dashboard/wip-59464-quincy

quincy: mgr/dashboard: remove unncessary hyperlink in landing page

Reviewed-by: Pegonzal <NOT@FOUND>

commit | commitdiff | tree

Rongqi Sun [Tue, 17 Jan 2023 05:55:01 +0000 (13:55 +0800)]

vstart: fix text format

Signed-off-by: Rongqi Sun <sunrongqi@huawei.com>
(cherry picked from commit 57dc8ce51602543d42a2f82cb829eda1c231b434)

commit | commitdiff | tree

Nizamudeen A [Mon, 17 Apr 2023 09:38:06 +0000 (15:08 +0530)]

mgr/dashboard: remove unncessary hyperlink in landing page

Fixes: https://tracker.ceph.com/issues/59462
Signed-off-by: Nizamudeen A <nia@redhat.com>
(cherry picked from commit 7e7da955445ecf37bc43fc296298bd91d0d8a140)

commit | commitdiff | tree

Nizamudeen A [Mon, 17 Apr 2023 13:38:57 +0000 (19:08 +0530)]

Merge pull request #51079 from rhcs-dashboard/wip-59451-quincy

quincy: mgr/dashboard: fix cephadm e2e expression changed error

Reviewed-by: Aashish Sharma <aasharma@redhat.com>

commit | commitdiff | tree

colemitchell [Mon, 17 Apr 2023 10:43:34 +0000 (12:43 +0200)]

Merge pull request #51117 from zdover23/wip-doc-2023-04-17-backport-51114-to-quincy

quincy: doc/radosgw: format part of s3select

Reviewed-by: Cole Mitchell <cole.mitchell.ceph@gmail.com>

commit | commitdiff | tree

Cole Mitchell [Mon, 17 Apr 2023 09:34:49 +0000 (05:34 -0400)]

doc/radosgw: format part of s3select

Partially format the 'Basic Workflow' section's introduction and 'Basic Functionalities' subsection in s3select. Nothing else is being fixed.

Signed-off-by: Cole Mitchell <cole.mitchell.ceph@gmail.com>
(cherry picked from commit 13cf134c0610509da52aa68e11e26f0740002bde)

commit | commitdiff | tree

Nizamudeen A [Mon, 14 Nov 2022 02:20:54 +0000 (07:50 +0530)]

mgr/dashboard: expose more grafana configs in service form

Show the grafana_port and initial_admin_password in the form but disable
the password field in the edit option

Fixes: https://tracker.ceph.com/issues/58016
Signed-off-by: Nizamudeen A <nia@redhat.com>
(cherry picked from commit 6987191377e5ca623563ae851d87e0961416e3f4)

commit | commitdiff | tree

zdover23 [Sun, 16 Apr 2023 16:24:40 +0000 (18:24 +0200)]

Merge pull request #51108 from zdover23/wip-doc-2023-04-16-backport-51099-to-quincy

quincy: doc/dev: format command in cephfs-mirroring

Reviewed-by: Cole Mitchell <cole.mitchell.ceph@gmail.com>

commit | commitdiff | tree

zdover23 [Sun, 16 Apr 2023 16:22:48 +0000 (18:22 +0200)]

Merge pull request #51097 from zdover23/wip-doc-2023-04-16-backport-51062-to-quincy

quincy: doc/glossary: add "Hybrid Storage"

Reviewed-by: Cole Mitchell <cole.mitchell.ceph@gmail.com>

commit | commitdiff | tree

zdover23 [Sun, 16 Apr 2023 16:18:52 +0000 (18:18 +0200)]

Merge pull request #51093 from zdover23/wip-doc-2023-04-16-backport-51091-to-quincy

quincy: doc/mgr/prometheus: fix confval reference

Reviewed-by: Cole Mitchell <cole.mitchell.ceph@gmail.com>

commit | commitdiff | tree

Zac Dover [Sun, 16 Apr 2023 09:11:27 +0000 (11:11 +0200)]

doc/dev: format command in cephfs-mirroring

Correctly format a command in doc/dev/cephfs-mirroring/#creating-users.

Reported by casanlin@init7.net at
https://pad.ceph.com/p/Report_Documentation_Bugs

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 408219bfca6b1e698229967e76d22d10028b7c20)

commit | commitdiff | tree

colemitchell [Sun, 16 Apr 2023 14:43:17 +0000 (10:43 -0400)]

Merge pull request #51105 from zdover23/wip-doc-2023-04-16-backport-51103-to-quincy

quincy: doc/radosgw: format part of s3select

Reviewed-by: Cole Mitchell <cole.mitchell.ceph@gmail.com>

commit | commitdiff | tree

Cole Mitchell [Sun, 16 Apr 2023 13:13:56 +0000 (09:13 -0400)]

doc/radosgw: format part of s3select

Format the first section of s3select. Nothing else is being fixed.

Signed-off-by: Cole Mitchell <cole.mitchell.ceph@gmail.com>
(cherry picked from commit a6a84471a7af154e7ccc93f51df2fc9744dc606c)

Unnamed repository; edit this file 'description' to name the repository.