]> git.apps.os.sepia.ceph.com Git - ceph-ci.git/log
ceph-ci.git
3 years agomgr/telemetry: add `enable / disable channel all`
Yaarit Hatuka [Wed, 12 Jan 2022 05:57:21 +0000 (05:57 +0000)]
mgr/telemetry: add `enable / disable channel all`

Enable or disable all telemetry channels at once with:
    ceph telemetry enable channel all
    ceph telemetry disable channel all

Signed-off-by: Yaarit Hatuka <yaarit@redhat.com>
3 years agomgr/telemetry: do not restore channels default when opting-out
Yaarit Hatuka [Wed, 12 Jan 2022 05:32:01 +0000 (05:32 +0000)]
mgr/telemetry: do not restore channels default when opting-out

Other modules do not reset their configuration; keep telemetry module
consistent with this behavior.

Signed-off-by: Yaarit Hatuka <yaarit@redhat.com>
3 years agomgr/telemetry: verify there are new collections when nagging due to a major
Yaarit Hatuka [Wed, 12 Jan 2022 05:01:48 +0000 (05:01 +0000)]
mgr/telemetry: verify there are new collections when nagging due to a major
upgrade

When adding a new collection we define whether to nag the user about it.
We may add many collections and nag about none of them. However, in case
of a major upgrade, we wish to notify the user about these new
collections. This commit verifies there are indeed new collections when
nagging due to a major upgrade.

Signed-off-by: Yaarit Hatuka <yaarit@redhat.com>
3 years agomgr/telemetry: improve output of `ceph telemetry collection ls`
Yaarit Hatuka [Wed, 12 Jan 2022 04:36:27 +0000 (04:36 +0000)]
mgr/telemetry: improve output of `ceph telemetry collection ls`

STATUS column now indicates whether a collection is being reported, and
the reasons why it's not (either the user is not opted-in to this
collection, or its channel is off).

Also, removed the ENROLLED and DEFAULT columns due to potential
confusion they may cause.

In case a user is not opted-in to certain collections, a message will
appear above the table with the missing collections:

    New collections are available:
    ['basic_base', 'basic_mds_metadata', 'crash_base', 'device_base',
    'ident_base', 'perf_perf']
    Run `ceph telemetry on` to opt-in to these collections.

Signed-off-by: Yaarit Hatuka <yaarit@redhat.com>
3 years agomgr/telemetry: use dict lookup when traversing MODULE_COLLECTION
Yaarit Hatuka [Wed, 12 Jan 2022 02:08:52 +0000 (02:08 +0000)]
mgr/telemetry: use dict lookup when traversing MODULE_COLLECTION

Signed-off-by: Yaarit Hatuka <yaarit@redhat.com>
3 years agomgr/telemetry: add test coverage for telemetry upgrade
Yaarit Hatuka [Tue, 7 Dec 2021 23:17:13 +0000 (23:17 +0000)]
mgr/telemetry: add test coverage for telemetry upgrade

Test the behavior of the module after an upgrade, as we shift from our
revision design to Collections.

Signed-off-by: Yaarit Hatuka <yaarit@redhat.com>
3 years agodoc/mgr/telemetry: document new commands
Yaarit Hatuka [Tue, 7 Dec 2021 22:16:28 +0000 (22:16 +0000)]
doc/mgr/telemetry: document new commands

New commands:

  ceph telemetry enable channel <channel_name>
  ceph telemetry disable channel <channel_name>
  ceph telemetry channel ls
  ceph telemetry collection ls
  ceph telemetry collection diff
  ceph telemetry preview
  ceph telemetry preview-device
  ceph telemetry preview-all

Signed-off-by: Yaarit Hatuka <yaarit@redhat.com>
3 years agomgr/telemetry: add command to list all collections
Yaarit Hatuka [Tue, 7 Dec 2021 18:30:56 +0000 (18:30 +0000)]
mgr/telemetry: add command to list all collections

List all collections, their current enrollment state, status, default,
and description, with:

$ ceph telemetry collection ls

NAME                  ENROLLED    STATUS    DEFAULT    DESC
basic_base            TRUE        ON        ON         Basic information about the cluster (capacity, number and type of daemons, version, etc.)
basic_mds_metadata    TRUE        ON        ON         MDS metadata
crash_base            TRUE        ON        ON         Information about daemon crashes (daemon type and version, backtrace, etc.)
device_base           TRUE        ON        ON         Information about device health metrics
ident_base            TRUE        OFF       OFF        User-provided identifying information about the cluster
perf_perf             TRUE        OFF       OFF        Information about performance counters of the cluster

Please note:

NAME:
=====
Collection name; prefix indicates the channel the collection belongs to.

ENROLLED:
=========
Signifies the collections that were available in the module when the
user last opted-in to telemetry. Please note: Even if a collection is
'enrolled', its metrics will be reported only if its channel is enabled.

STATUS:
=======
Indicates whether the collection metrics are reported; this is
determined by the status (enabled / disabled) of the channel the
collection belongs to, along with the enrollment status of the
collection.

DEFAULT:
========
The default status (enabled / disabled) of the channel the collection
belongs to.

DESC:
=====
Collection description.

Signed-off-by: Yaarit Hatuka <yaarit@redhat.com>
3 years agomgr/telemetry: fix missing type annotations
Yaarit Hatuka [Tue, 30 Nov 2021 04:32:24 +0000 (04:32 +0000)]
mgr/telemetry: fix missing type annotations

Signed-off-by: Yaarit Hatuka <yaarit@redhat.com>
3 years agomgr/telemetry: add preview-device and preview-all commands
Yaarit Hatuka [Tue, 23 Nov 2021 21:28:47 +0000 (21:28 +0000)]
mgr/telemetry: add preview-device and preview-all commands

`ceph telemetry show` will show a sample cluster report if the user is
opted-in to telemetry. The report will be compiled of the collections
the user is opted-in to. To preview a report compiled of the most recent
collection available, use `ceph telemetry preview`.

The device channel is not included in the cluster report, since it's
being sent to a different endpoint, thus we use
`ceph telemetry show-device` in case the user is opted-in to telemetry
and the device channel is enabled. If not, it can also be previewed with
`ceph telemetry preview-device`.

If telemetry is on, and device channel is enabled, both reports can be
reviewed with `ceph telemetry show-all`, otherwise use
`ceph telemetry preview-all`.

Signed-off-by: Yaarit Hatuka <yaarit@redhat.com>
3 years agomgr/telemetry: add command to list all channels
Yaarit Hatuka [Tue, 23 Nov 2021 17:11:38 +0000 (17:11 +0000)]
mgr/telemetry: add command to list all channels

List all channels, their current state, default, and description, with:

$ ceph telemetry channel ls

NAME      ENABLED    DEFAULT    DESC
basic     ON         ON         Share basic cluster information (size, version)
ident     OFF        OFF        Share a user-provided description and/or contact email for the cluster
crash     ON         ON         Share metadata about Ceph daemon crashes (version, stack straces, etc)
device    ON         ON         Share device health metrics (e.g., SMART data, minus potentially identifying info like serial numbers)
perf      ON         OFF        Share perf counter metrics summed across the whole cluster

Signed-off-by: Yaarit Hatuka <yaarit@redhat.com>
3 years agomgr/telemetry: add commands to enable/disable channels
Yaarit Hatuka [Tue, 23 Nov 2021 00:12:10 +0000 (00:12 +0000)]
mgr/telemetry: add commands to enable/disable channels

Currently we enable/disable a telemetry channel via CLI with:
  `ceph config set mgr mgr/telemetry/channel_basic true`
  `ceph config set mgr mgr/telemetry/channel_crash false`

We can now do this with:
  `ceph telemetry enable channel basic`
  `ceph telemetry disable channel crash`

We allow enabling / disabling lists of channels:
  `ceph telemetry enable channel basic device crash perf`
  `ceph telemetry disable channel basic device crash perf`

Please note, telemetry should be on for these commands to take effect.

Signed-off-by: Yaarit Hatuka <yaarit@redhat.com>
3 years agomgr/telemetry: introduce new design for adding new data
Yaarit Hatuka [Mon, 15 Nov 2021 16:53:59 +0000 (16:53 +0000)]
mgr/telemetry: introduce new design for adding new data

The current design requires increasing the telemetry revision each time
we add new data to the report. As a result, users need to re-opt-in to
telemetry. This new design allows for adding new data to the report,
while allowing users to keep sending only what they already opted-in to,
hence no re-opt-in is required. In case users wish to report the new
data as well, they need to re-opt-in and enable any new channels.

Also, move formatting perf histograms to a function, so we can use it
both in `show` and `preview` commands.

Fix get_report call in dashboard to use get_report_locked.

Signed-off-by: Yaarit Hatuka <yaarit@redhat.com>
3 years agoMerge pull request #39440 from pdvian/wip-warn-filestore-osds
Yuri Weinstein [Wed, 12 Jan 2022 16:49:02 +0000 (08:49 -0800)]
Merge pull request #39440 from pdvian/wip-warn-filestore-osds

mon/OSDMonitor, osd: Add warning on filestore deprecation and force use of wpq scheduler for filestore OSDs

Reviewed-by: Neha Ojha <nojha@redhat.com>
3 years agoMerge pull request #41778 from felixhuettner/fix_subdir_name
Daniel Gryniewicz [Wed, 12 Jan 2022 14:22:04 +0000 (09:22 -0500)]
Merge pull request #41778 from felixhuettner/fix_subdir_name

rgw: RGWSwiftWebsiteHandler::is_web_dir checks empty subdir_name

3 years agoMerge pull request #38234 from inspur-wyq/wip-copy-obj-check-size
Daniel Gryniewicz [Wed, 12 Jan 2022 14:21:48 +0000 (09:21 -0500)]
Merge pull request #38234 from inspur-wyq/wip-copy-obj-check-size

rgw : check the object size when copy obj

3 years agoMerge pull request #38532 from Rjerk/wip-empty-tagset
Daniel Gryniewicz [Wed, 12 Jan 2022 14:21:35 +0000 (09:21 -0500)]
Merge pull request #38532 from Rjerk/wip-empty-tagset

rgw: an empty tagset is allowed by S3

3 years agoMerge pull request #40573 from Huber-ming/rgw_admin
Daniel Gryniewicz [Wed, 12 Jan 2022 14:21:15 +0000 (09:21 -0500)]
Merge pull request #40573 from Huber-ming/rgw_admin

rgw: delete abbreviation for option "--new-uid"

3 years agoMerge pull request #40575 from Huber-ming/rgw_admin-f
Daniel Gryniewicz [Wed, 12 Jan 2022 14:20:49 +0000 (09:20 -0500)]
Merge pull request #40575 from Huber-ming/rgw_admin-f

radosgw-admin: delete the abbreviation of option "--infile"

3 years agoMerge pull request #44500 from idryomov/wip-rbd-test-group-leak
Ilya Dryomov [Wed, 12 Jan 2022 11:56:09 +0000 (12:56 +0100)]
Merge pull request #44500 from idryomov/wip-rbd-test-group-leak

test/librbd: fix group_info.name leaks in TestGroup.add_image

Reviewed-by: Mykola Golub <mgolub@suse.com>
3 years agoMerge pull request #44490 from liu-chunmei/crimson-fix-aligned
Liu-Chunmei [Wed, 12 Jan 2022 06:05:22 +0000 (22:05 -0800)]
Merge pull request #44490 from liu-chunmei/crimson-fix-aligned

crimson: fix assert_aligned(size) in trim_data_reservation

reviewed by: Samuel Just <sjust@redhat.com> , Yingxin <yingxin.cheng@intel.com>

3 years agoMerge pull request #44532 from rzarzynski/wip-crimson-fix-test-runner
Yingxin [Wed, 12 Jan 2022 01:23:53 +0000 (09:23 +0800)]
Merge pull request #44532 from rzarzynski/wip-crimson-fix-test-runner

test/crimson: fix a race condition in SeastarRunner

Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
Reviewed-by: Chunmei Liu <chunmei.liu@intel.com>
Reviewed-by: Samuel Just <sjust@redhat.com>
3 years agoMerge pull request #43593 from ljflores/wip-rocksdb
Neha Ojha [Wed, 12 Jan 2022 00:31:41 +0000 (16:31 -0800)]
Merge pull request #43593 from ljflores/wip-rocksdb

mgr: expose rocksdb version number for use in telemetry

Reviewed-by: Kefu Chai <tchaikov@gmail.com>
3 years agocrimson: fix assert_aligned(size) in trim_data_reservation
chunmei-liu [Tue, 11 Jan 2022 03:44:43 +0000 (19:44 -0800)]
crimson: fix assert_aligned(size) in trim_data_reservation

Signed-off-by: chunmei-liu <chunmei.liu@intel.com>
signed-off-by: Samuel Just <sjust@redhat.com>

3 years agomgr/telemetry: add the rocksdb version number to telemetry
Laura Flores [Tue, 4 Jan 2022 22:54:33 +0000 (22:54 +0000)]
mgr/telemetry: add the rocksdb version number to telemetry

Capturing the RocksDB version number in Telemetry would allow us to check that users are using the appropriate RocksDB version for their Ceph cluster. For instance, if a user is working in a Pacific cluster, but their RocksDB version is meant for Nautilus, that might be a problem.

It is strucured as "rocksdb_stats" --> "version" in anticipation of more stats that can will be added under "rocksdb_stats".

Signed-off-by: Laura Flores <lflores@redhat.com>
3 years agomgr: expose rocksdb version number in the mgr module
Laura Flores [Wed, 22 Dec 2021 23:09:25 +0000 (23:09 +0000)]
mgr: expose rocksdb version number in the mgr module

It is only necessary here to link the rocksdb include directory
since the mgr simply needs access to the rocksdb version numbers.

Signed-off-by: Laura Flores <lflores@redhat.com>
Co-authored-by: Kefu Chai <tchaikov@gmail.com>
Co-authored-by: Adam Kupczyk <akupczyk@redhat.com>
3 years agoMerge pull request #43794 from aclamk/wip-bluefs-fine-grain-locking-4
Neha Ojha [Tue, 11 Jan 2022 21:59:16 +0000 (13:59 -0800)]
Merge pull request #43794 from aclamk/wip-bluefs-fine-grain-locking-4

os/bluestore: BlueFS fine grain locking

Reviewed-by: Igor Fedotov <igor.fedotov@croit.io>
3 years agoMerge pull request #44537 from pdvian/wip-typo-fix
Casey Bodley [Tue, 11 Jan 2022 20:50:34 +0000 (15:50 -0500)]
Merge pull request #44537 from pdvian/wip-typo-fix

rgw: Fix typo error for stat_remote_obj

Reviewed-by: Casey Bodley <cbodley@redhat.com>
3 years agoMerge pull request #44496 from idryomov/wip-diff-iterate-parent-fix
Ilya Dryomov [Tue, 11 Jan 2022 19:28:19 +0000 (20:28 +0100)]
Merge pull request #44496 from idryomov/wip-diff-iterate-parent-fix

librbd: restore diff-iterate include_parent functionality in fast-diff mode

Reviewed-by: Mykola Golub <mgolub@suse.com>
3 years agorgw: Fix typo error for stat_remote_obj
Prashant D [Tue, 11 Jan 2022 18:58:20 +0000 (13:58 -0500)]
rgw: Fix typo error for stat_remote_obj

Signed-off-by: Prashant D <pdhange@redhat.com>
3 years agoMerge pull request #44483 from idryomov/wip-diff-iterate-offset-fix
Ilya Dryomov [Tue, 11 Jan 2022 17:59:12 +0000 (18:59 +0100)]
Merge pull request #44483 from idryomov/wip-diff-iterate-offset-fix

librbd: diff-iterate reports incorrect offsets in fast-diff mode

Reviewed-by: Mykola Golub <mgolub@suse.com>
3 years agoMerge pull request #44355 from sseshasa/wip-fix-enhance-mclock-logging
Yuri Weinstein [Tue, 11 Jan 2022 17:32:12 +0000 (09:32 -0800)]
Merge pull request #44355 from sseshasa/wip-fix-enhance-mclock-logging

osd:  Enhance mclockscheduler logging and formatted dump information.

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
3 years agoMerge pull request #44482 from sebastian-philipp/nfs-upgrade-port-conflict
Yuri Weinstein [Tue, 11 Jan 2022 17:31:35 +0000 (09:31 -0800)]
Merge pull request #44482 from sebastian-philipp/nfs-upgrade-port-conflict

mgr/cephadm: nfs migration: avoid port conflicts

Reviewed-by: Adam King adking@redhat.com
3 years agoMerge pull request #36308 from linuxbox2/wip-rgwlc-bcktdel
Matt Benjamin [Tue, 11 Jan 2022 17:17:22 +0000 (12:17 -0500)]
Merge pull request #36308 from linuxbox2/wip-rgwlc-bcktdel

rgwlc:  remove lc entry on bucket delete

3 years agoMerge pull request #42891 from ofriedma/wip-ofriedma-rgw-qos-finale
Casey Bodley [Tue, 11 Jan 2022 16:35:05 +0000 (11:35 -0500)]
Merge pull request #42891 from ofriedma/wip-ofriedma-rgw-qos-finale

rgw: Add rgw rate limiting per user and per bucket

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>
Reviewed-by: Yuval Lifshitz <ylifshit@redhat.com>
Reviewed-by: Casey Bodley <cbodley@redhat.com>
3 years agoMerge pull request #38774 from linuxbox2/wip-doc-replsupport
Casey Bodley [Tue, 11 Jan 2022 16:13:24 +0000 (11:13 -0500)]
Merge pull request #38774 from linuxbox2/wip-doc-replsupport

rgw: document S3 bucket replication support

Reviewed-by: Casey Bodley <cbodley@redhat.com>
3 years agotest/crimson: fix a race condition in SeastarRunner
Radoslaw Zarzynski [Tue, 11 Jan 2022 12:09:28 +0000 (12:09 +0000)]
test/crimson: fix a race condition in SeastarRunner

This patch is supposed to fix the following problem:

```
        Start 234: unittest-seastar-errorator
216/258 Test #234: unittest-seastar-errorator ................Child aborted***Exception:   0.95 sec
WARNING: debug mode. Not for benchmarking or production
WARN  2022-01-11 10:01:32,973 [shard 0] seastar - Creation of perf_event based stall detector failed, falling back to posix timer: std::system_error (error system:13, perf_event_open() failed: Permission denied)
unittest-seastar-errorator: ../src/test/crimson/seastar_runner.h:45: int SeastarRunner::init(int, char **): Assertion `begin_signaled == true' failed.
Aborting.
Backtrace:
  0xb3397a
  0x1d19430
  0x1d1905d
  0x1b72332
  0x1b9dd95
  0x1c75fe9
  0x1c76231
  0x1c7605a
  0x7f20b7aee3bf
  /lib/x86_64-linux-gnu/libc.so.6+0x4618a
  /lib/x86_64-linux-gnu/libc.so.6+0x25858
  /lib/x86_64-linux-gnu/libc.so.6+0x25728
  /lib/x86_64-linux-gnu/libc.so.6+0x36f35
  0xc8bb88
  0xc7ec71
  /lib/x86_64-linux-gnu/libc.so.6+0x270b2
  0xafc95d
```

Signed-off-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
3 years agoMerge pull request #43404 from rhcs-dashboard/angular-12-upgradee
Ernesto Puerta [Tue, 11 Jan 2022 12:02:07 +0000 (13:02 +0100)]
Merge pull request #43404 from rhcs-dashboard/angular-12-upgradee

mgr/dashboard: Update Angular version to 12

Reviewed-by: Alfonso Martínez <almartin@redhat.com>
Reviewed-by: Avan Thakkar <athakkar@redhat.com>
Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>
Reviewed-by: Volker Theile <vtheile@suse.com>
3 years agoMerge pull request #44088 from ceph/feature-48388-cache
Ernesto Puerta [Tue, 11 Jan 2022 11:52:41 +0000 (12:52 +0100)]
Merge pull request #44088 from ceph/feature-48388-cache

mgr: TTL cache implementation

Reviewed-by: Alfonso Martínez <almartin@redhat.com>
Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Pere Diaz Bou <pdiazbou@redhat.com>
Reviewed-by: sebastian-philipp <NOT@FOUND>
3 years agoMerge pull request #44294 from rhcs-dashboard/feature-bluestore-onode
Ernesto Puerta [Tue, 11 Jan 2022 10:24:21 +0000 (11:24 +0100)]
Merge pull request #44294 from rhcs-dashboard/feature-bluestore-onode

mgr/dashboard: monitoring:Implement BlueStore onode hit/miss counters into the dashboard

Reviewed-by: Aashish Sharma <aasharma@redhat.com>
Reviewed-by: Alfonso Martínez <almartin@redhat.com>
Reviewed-by: Avan Thakkar <athakkar@redhat.com>
Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Laura Flores <lflores@redhat.com>
Reviewed-by: neha-ojha <NOT@FOUND>
Reviewed-by: Pere Diaz Bou <pdiazbou@redhat.com>
3 years agoMerge pull request #44456 from jtlayton/wip-53765
Venky Shankar [Tue, 11 Jan 2022 09:09:16 +0000 (14:39 +0530)]
Merge pull request #44456 from jtlayton/wip-53765

mount.ceph: fix the handling of new-syntax device names

Reviewed-by: Venky Shankar <vshankar@redhat.com>
3 years agoosd: Display scheduler specific info when dumping an OpSchedulerItem
Sridhar Seshasayee [Fri, 17 Dec 2021 12:30:05 +0000 (18:00 +0530)]
osd: Display scheduler specific info when dumping an OpSchedulerItem

Implement logic to dump information relevant to the scheduler type being
employed when dumping details about an OpSchedulerItem. For e.g., the
'priority' field is relevant for the 'wpq' scheduler, but for the
'mclock_scheduler', the 'qos_cost' gives more information during debugging.

A couple of additional fields called 'qos_cost' and 'is_qos_request' are
introduced in OpSchedulerItem class. These are mainly used to facilitate
dumping of relevant information depending on the scheduler type. The
interesting points are when an item is enqueued and dequeued.

For the 'mclock_scheduler', the 'class_id' and the 'qos_cost' fields are
dumped during enqueue and dequeue op respectively. For the 'wpq' scheduler
things remain the same as before.

An additional benefit of this change is to help immediately identify the
type of scheduler being used for a given shard depending on what is dumped
in the debug messages while debugging.

Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>
3 years agoosd: Add debug logs and formatted dumps in the mClockScheduler
Sridhar Seshasayee [Tue, 14 Dec 2021 10:41:14 +0000 (16:11 +0530)]
osd: Add debug logs and formatted dumps in the mClockScheduler

Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>
3 years agoosd: Change description of 'dump_op_pq_state' command to be more generic
Sridhar Seshasayee [Thu, 6 Jan 2022 09:58:21 +0000 (15:28 +0530)]
osd: Change description of 'dump_op_pq_state' command to be more generic

Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>
3 years agocommon: Introduce 'mclock' debug subsystem
Sridhar Seshasayee [Thu, 6 Jan 2022 09:56:20 +0000 (15:26 +0530)]
common: Introduce 'mclock' debug subsystem

Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>
3 years agoMerge pull request #44410 from tchaikov/wip-cmake-cleanup
Samuel Just [Mon, 10 Jan 2022 21:36:09 +0000 (13:36 -0800)]
Merge pull request #44410 from tchaikov/wip-cmake-cleanup

cmake: detect linux/blk/zoned support

Reviewed-by: Samuel Just <sjust@redhat.com>
3 years agoMerge pull request #44502 from yuvalif/fix-call-to-mrgw
Yuval Lifshitz [Mon, 10 Jan 2022 20:05:22 +0000 (22:05 +0200)]
Merge pull request #44502 from yuvalif/fix-call-to-mrgw

rgw/multisite/test: pass zero (no) ssl port to mrgw.sh

3 years agoMerge pull request #44495 from Matan-B/wip-matanb-crimson-flavor
Matan [Mon, 10 Jan 2022 18:30:18 +0000 (20:30 +0200)]
Merge pull request #44495 from Matan-B/wip-matanb-crimson-flavor

qa/suites: Crimson flavor configuration fix

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
Reviewed-by: chunmei-liu chunmei.liu@intel.com
3 years agotest/librbd: make diff-iterate clone tests exercise fast-diff mode
Ilya Dryomov [Fri, 7 Jan 2022 12:31:08 +0000 (13:31 +0100)]
test/librbd: make diff-iterate clone tests exercise fast-diff mode

The fast-diff feature wasn't propagated to the clone so these tests
were exercising the slow list_snaps path no matter what RBD_FEATURES
value was supplied to ceph_test_librbd.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
3 years agolibrbd: restore diff-iterate include_parent functionality in fast-diff mode
Ilya Dryomov [Wed, 5 Jan 2022 19:24:40 +0000 (20:24 +0100)]
librbd: restore diff-iterate include_parent functionality in fast-diff mode

Commit 4429ed4f3f4c ("librbd: switch diff iterate API to use new snaps
list dispatch methods") removed the recursive execute() call.  The new
list_snaps method does indeed handle parent diffs internally but it is
not used in fast-diff mode.  Nothing changed there -- we still need to
load the parent object map, calculate parent object_diff_state, etc.

Fixes: https://tracker.ceph.com/issues/53787
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
3 years agolibrbd: stash unmodified include_parent value in DiffContext
Ilya Dryomov [Wed, 5 Jan 2022 18:45:50 +0000 (19:45 +0100)]
librbd: stash unmodified include_parent value in DiffContext

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
3 years agotest/librbd: fix group_info.name leaks in TestGroup.add_image
Ilya Dryomov [Sat, 8 Jan 2022 13:18:32 +0000 (14:18 +0100)]
test/librbd: fix group_info.name leaks in TestGroup.add_image

Introduced in commit c04e8c3374e0 ("test/librbd: add get_group test").

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
3 years agodoc: rgw: document S3 bucket replication support
Matt Benjamin [Tue, 5 Jan 2021 20:30:23 +0000 (15:30 -0500)]
doc: rgw: document S3 bucket replication support

Support was added at Octopus.

Fixes: https://tracker.ceph.com/issues/48755
Signed-off-by: Matt Benjamin <mbenjamin@redhat.com>
3 years agorgw: Add admin ops API for rate limiting
Or Friedmann [Mon, 3 Jan 2022 16:32:16 +0000 (16:32 +0000)]
rgw: Add admin ops API for rate limiting

Add admin ops API for rate limiting and some bug fixes

Signed-off-by: Or Friedmann <ofriedma@redhat.com>
3 years agoMerge pull request #44221 from BenoitKnecht/ceph-mon-device-metrics
Yuri Weinstein [Mon, 10 Jan 2022 15:56:29 +0000 (07:56 -0800)]
Merge pull request #44221 from BenoitKnecht/ceph-mon-device-metrics

mon: Abort device health when device not found

Reviewed-by: Yaarit Hatuka <yaarit@redhat.com>
3 years agoMerge pull request #43362 from ybwang0211/pubsub_settingng_spell
Daniel Gryniewicz [Mon, 10 Jan 2022 15:10:51 +0000 (10:10 -0500)]
Merge pull request #43362 from ybwang0211/pubsub_settingng_spell

rgw:word spelling error in rgw/rgw_pubsub.h file

3 years agoMerge pull request #42115 from sandrobonazzola/patch-2
Sebastian Wagner [Mon, 10 Jan 2022 14:41:32 +0000 (15:41 +0100)]
Merge pull request #42115 from sandrobonazzola/patch-2

doc/install: ensure to open ceph-mon firewall ports

Reviewed-by: Sebastian Wagner <sewagner@redhat.com>
3 years agorgw/multisite/test: passzero (no) ssl port to mrgw.sh
Yuval Lifshitz [Sun, 9 Jan 2022 16:27:40 +0000 (18:27 +0200)]
rgw/multisite/test: passzero (no) ssl port to mrgw.sh

in multisite tests script.
this is to fix an issue from: 1f3fbf218faa4859f02917828dbdb48be41a579a

Signed-off-by: Yuval Lifshitz <ylifshit@redhat.com>
3 years agoMerge pull request #44301 from yuvalif/wip-yuval-cloudevents
Yuval Lifshitz [Sun, 9 Jan 2022 07:32:53 +0000 (09:32 +0200)]
Merge pull request #44301 from yuvalif/wip-yuval-cloudevents

rgw/notifications: add cloudevents support to HTTP endpoint

3 years agoMerge pull request #44278 from fengchunsong/dpdk-tso
Kefu Chai [Sat, 8 Jan 2022 11:28:00 +0000 (19:28 +0800)]
Merge pull request #44278 from fengchunsong/dpdk-tso

msg/async/dpdk: Add an option to enable/disable TSO

Reviewed-by: Kefu Chai <tchaikov@gmail.com>
3 years agoMerge pull request #44198 from wjwithagen/fix-bluestore-get-io-size
Kefu Chai [Sat, 8 Jan 2022 04:29:21 +0000 (12:29 +0800)]
Merge pull request #44198 from wjwithagen/fix-bluestore-get-io-size

common/blkdev: add missing stubs for OSes not Linux

Reviewed-by: Igor Fedotov <igor.fedotov@croit.io>
Reviewed-by: Kefu Chai <tchaikov@gmail.com>
3 years agoMerge pull request #44402 from inspur-wyq/wip-pybind-mgr
Yuri Weinstein [Fri, 7 Jan 2022 23:36:58 +0000 (15:36 -0800)]
Merge pull request #44402 from inspur-wyq/wip-pybind-mgr

pybind/mgr: fix typos

Reviewed-by: Kefu Chai <kchai@redhat.com>
3 years agoMerge pull request #44055 from ifed01/wip-ifed-cosmetics-for-optimize-rm
Yuri Weinstein [Fri, 7 Jan 2022 23:36:37 +0000 (15:36 -0800)]
Merge pull request #44055 from ifed01/wip-ifed-cosmetics-for-optimize-rm

test/store_test, os/bluestore: bunch of minor improvements/cleanups

Reviewed-by: Adam Kupczyk <akupczyk@redhat.com>
3 years agoMerge pull request #43993 from SMIL-Infra/cmake-systemd-cleanup
Yuri Weinstein [Fri, 7 Jan 2022 23:35:42 +0000 (15:35 -0800)]
Merge pull request #43993 from SMIL-Infra/cmake-systemd-cleanup

cmake: cleanups about systemd and install

Reviewed-by: Kefu Chai <kchai@redhat.com>
3 years agoMerge pull request #44423 from 5cs/fix-unshare-blob-test
Igor Fedotov [Fri, 7 Jan 2022 23:27:42 +0000 (02:27 +0300)]
Merge pull request #44423 from 5cs/fix-unshare-blob-test

test: fix bluestore unshare blob test case

Reviewed-by: Igor Fedotov <igor.fedotov@croit.io>
3 years agoMerge pull request #44416 from yuvalif/wip-yuval-rgw-params-in-multisite
Casey Bodley [Fri, 7 Jan 2022 20:19:44 +0000 (15:19 -0500)]
Merge pull request #44416 from yuvalif/wip-yuval-rgw-params-in-multisite

rgw/multisite/test: allow passing rgw parameter to multisite script

Reviewed-by: Casey Bodley <cbodley@redhat.com>
3 years agoMerge pull request #44020 from adk3798/host-ls-scale
Sebastian Wagner [Fri, 7 Jan 2022 10:59:04 +0000 (11:59 +0100)]
Merge pull request #44020 from adk3798/host-ls-scale

mgr/orchestrator: add filtering and count option for orch host ls

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Sebastian Wagner <sewagner@redhat.com>
3 years agorgw/notifications: add cloudevents support to HTTP endpoint
yuval Lifshitz [Mon, 13 Dec 2021 18:56:20 +0000 (20:56 +0200)]
rgw/notifications: add cloudevents support to HTTP endpoint

following the cloudevents HTTP spec:
https://github.com/cloudevents/spec/blob/v1.0.1/http-protocol-binding.md
and more specifically this aws-s3 spec:
https://github.com/cloudevents/spec/blob/main/cloudevents/adapters/aws-s3.md

Signed-off-by: yuval Lifshitz <ylifshit@redhat.com>
3 years agorgw/common: make "get" methods on RGWHTTPArgs const
yuval Lifshitz [Mon, 13 Dec 2021 16:45:19 +0000 (18:45 +0200)]
rgw/common: make "get" methods on RGWHTTPArgs const

Signed-off-by: yuval Lifshitz <ylifshit@redhat.com>
3 years agoMerge pull request #44412 from cyx1231st/wip-crimson-improve-log
Yingxin [Fri, 7 Jan 2022 07:04:46 +0000 (15:04 +0800)]
Merge pull request #44412 from cyx1231st/wip-crimson-improve-log

crimson: leverage --default-log-level to configure the global log level

Reviewed-by: Samuel Just <sjust@redhat.com>
3 years agoMerge pull request #44473 from johnbent/patch-1
Laura Flores [Thu, 6 Jan 2022 16:05:09 +0000 (10:05 -0600)]
Merge pull request #44473 from johnbent/patch-1

README.md: Update README.md to add link to tracker.ceph.com

3 years agoMerge pull request #43833 from melissa-kun-li/prevent-cephadm-stray-daemon-tcmurunner
Sebastian Wagner [Thu, 6 Jan 2022 16:00:00 +0000 (17:00 +0100)]
Merge pull request #43833 from melissa-kun-li/prevent-cephadm-stray-daemon-tcmurunner

mgr/cephadm: fix tcmu-runner cephadm_stray_daemon

3 years agoqa/suites: Fix crimson flavor configuration
Matan Breizman [Thu, 6 Jan 2022 13:32:33 +0000 (13:32 +0000)]
qa/suites: Fix crimson flavor configuration

Signed-off-by: Matan Breizman <mbreizma@redhat.com>
3 years agoMerge PR #44054 into master
Sage Weil [Thu, 6 Jan 2022 13:54:45 +0000 (08:54 -0500)]
Merge PR #44054 into master

* refs/pull/44054/head:
doc/rados/operations: document pg_num_max
mgr: set max of 32 pgs for .mgr pool
mgr/dashboard: expect pg_num_max property for pools
mon/OSDMonitor: add option --pg-num_max arg for create pool
mon/OSDMonitor: disallow setting pg_num < min or > max
mgr/pg_autoscaler: apply pg_num_max
mon: add pg_num_max pool property

Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
3 years agoMerge pull request #44157 from CongMinYin/pwl-small-fixes
Ilya Dryomov [Thu, 6 Jan 2022 13:54:31 +0000 (14:54 +0100)]
Merge pull request #44157 from CongMinYin/pwl-small-fixes

librbd/cache/pwl: code cleanup and some minor fixes

Reviewed-by: Mykola Golub <mgolub@suse.com>
Reviewed-by: Deepika Upadhyay <dupadhya@redhat.com>
Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
3 years agolibrbd: diff-iterate reports incorrect offsets in fast-diff mode
Ilya Dryomov [Tue, 4 Jan 2022 19:38:35 +0000 (20:38 +0100)]
librbd: diff-iterate reports incorrect offsets in fast-diff mode

If rbd_diff_iterate2() is called on an image offset that doesn't
correspond to an object boundary, the callback is invoked with an
incorrect image offset.  For example, assuming a fully allocated
image, a diff request for 806354944~57344 results in offs=807403520,
len=57344, exists=true invocation, which is ahead by 1048576 bytes.
This occurs only in fast-diff mode, for a diff request on an image
with the fast-diff feature disabled or if whole_object parameter is
set to false the invocation is correct.

This bug goes back to the introduction of fast-diff mode in commit
6d5b969d4206 ("librbd: add diff_iterate2 to API").

Fixes: https://tracker.ceph.com/issues/53784
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
3 years agomgr/cephadm: nfs migration: avoid port conflicts
Sebastian Wagner [Thu, 6 Jan 2022 11:35:38 +0000 (12:35 +0100)]
mgr/cephadm: nfs migration: avoid port conflicts

The nfs upgrade renames the nfs spec from `nfs.ganesha-{service_id}`
to `nfs.{service_id}`. Previously we used the orphan-daemon check
to remove the old `nfs.ganesha-{service_id}` daemons. This does not work
as sometimes serve() tries to deploy the new daemons before cleaning up the
old daemons. This results in a port conflict breaking the upgrade.

Fixes: https://tracker.ceph.com/issues/53424
Signed-off-by: Sebastian Wagner <sewagner@redhat.com>
3 years agolibrbd/cache/pwl: merge multiple output "<<" calls
Yin Congmin [Tue, 4 Jan 2022 08:09:27 +0000 (16:09 +0800)]
librbd/cache/pwl: merge multiple output "<<" calls

Follow the principle of fewer "<<" calls and higher performance.
Merge multiple output "<<" calls.

Signed-off-by: Yin Congmin <congmin.yin@intel.com>
3 years agoMerge pull request #44475 from athanatos/sjust/wip-53783
Yingxin [Thu, 6 Jan 2022 01:36:06 +0000 (09:36 +0800)]
Merge pull request #44475 from athanatos/sjust/wip-53783

crimson/os/seastore/cache: fix debug macro usage

Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
3 years agocrimson/os/seastore/cache: fix debug macro usage
Samuel Just [Wed, 5 Jan 2022 23:46:33 +0000 (23:46 +0000)]
crimson/os/seastore/cache: fix debug macro usage

Introduced via conflict between 277e57 and 632916.

Fixes: https://tracker.ceph.com/issues/53783
Signed-off-by: Samuel Just <sjust@redhat.com>
3 years agoREADME.md: Update README.md to add link to tracker.ceph.com
John Bent [Wed, 5 Jan 2022 16:04:40 +0000 (09:04 -0700)]
README.md: Update README.md to add link to tracker.ceph.com

I searched the existing documentation for a link to the tracker and had a hard time finding it. Other folks like myself might like it so prominently displayed.
Added a link to https://tracker.ceph.com/projects/ceph as https://tracker.ceph.com/ is basically blank and not totally intuitive how to get to issues.

Signed-off-by: John Bent <john.bent@seagate.com>
3 years agoMerge pull request #44429 from cyx1231st/wip-crimson-improve-log-2
Samuel Just [Wed, 5 Jan 2022 20:52:18 +0000 (12:52 -0800)]
Merge pull request #44429 from cyx1231st/wip-crimson-improve-log-2

crimson/os/seastore: differentiate loggers in seastore

Reviewed-by: Samuel Just <sjust@redhat.com>
3 years agoMerge pull request #44420 from aisakaki/lru-cache
Samuel Just [Wed, 5 Jan 2022 20:34:24 +0000 (12:34 -0800)]
Merge pull request #44420 from aisakaki/lru-cache

crimson/os/seastore/cache: add lru

Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
3 years agoMerge pull request #44474 from ljflores/wip-doc-integration-tests
Neha Ojha [Wed, 5 Jan 2022 18:33:55 +0000 (10:33 -0800)]
Merge pull request #44474 from ljflores/wip-doc-integration-tests

doc/.../testing_integration_tests: fix formatting of teuthology-suite option

Reviewed-by: Neha Ojha <nojha@redhat.com>
3 years agodoc/.../testing_integration_tests: fix formatting of teuthology-suite option
Laura Flores [Wed, 5 Jan 2022 16:48:53 +0000 (16:48 +0000)]
doc/.../testing_integration_tests: fix formatting of teuthology-suite option

The `--subset` teuthology-suite option should be formatted.

Signed-off-by: Laura Flores <lflores@redhat.com>
3 years agorgw: Add rgw rate limiting per user and per bucket
Or Friedmann [Tue, 1 Jun 2021 12:45:09 +0000 (15:45 +0300)]
rgw: Add rgw rate limiting per user and per bucket

Add rgw rate limiting per user and per bucket

Signed-off-by: Or Friedmann <ofriedma@redhat.com>
3 years agoRGW - Zipper add attr merge for User
Daniel Gryniewicz [Tue, 31 Aug 2021 15:59:45 +0000 (11:59 -0400)]
RGW - Zipper add attr merge for User

Signed-off-by: Daniel Gryniewicz <dang@redhat.com>
3 years agoMerge pull request #39192 from linuxbox2/wip-lc-notify
Matt Benjamin [Wed, 5 Jan 2022 14:38:47 +0000 (09:38 -0500)]
Merge pull request #39192 from linuxbox2/wip-lc-notify

rgwlc:  optionally support notifications on object expiration

3 years agomgr/dashboard: Update Angular version to 12
Nizamudeen A [Sun, 3 Oct 2021 18:56:45 +0000 (00:26 +0530)]
mgr/dashboard: Update Angular version to 12

A full changelog can be seen here: https://blog.angular.io/angular-v12-is-now-available-32ed51fbfd49

For us, the most I had to do is to take care of the min-max validation
and a small CSS change regarding the math()

Fixes: https://tracker.ceph.com/issues/53049
Signed-off-by: Nizamudeen A <nia@redhat.com>
3 years agoMerge pull request #44341 from sebastian-philipp/cleanup-snmp
Sebastian Wagner [Wed, 5 Jan 2022 13:34:48 +0000 (14:34 +0100)]
Merge pull request #44341 from sebastian-philipp/cleanup-snmp

mgr/cephadm: Cleanup snmp

Reviewed-by: Adam King <adking@redhat.com>
3 years agoMerge pull request #44000 from nmshelke/exception-handling-48473
Venky Shankar [Wed, 5 Jan 2022 12:25:39 +0000 (17:55 +0530)]
Merge pull request #44000 from nmshelke/exception-handling-48473

mgr/stats: exception handling for ceph fs perf stats command

Reviewed-by: Venky Shankar <vshankar@redhat.com>
3 years agoMerge pull request #44291 from vshankar/wip-ignore-invalid-client-op
Venky Shankar [Wed, 5 Jan 2022 12:25:06 +0000 (17:55 +0530)]
Merge pull request #44291 from vshankar/wip-ignore-invalid-client-op

mds: ignore unknown client op when tracking op latency

Reviewed-by: Venky Shankar <vshankar@redhat.com>
3 years agoMerge pull request #44279 from jtlayton/wip-50235
Venky Shankar [Wed, 5 Jan 2022 12:16:01 +0000 (17:46 +0530)]
Merge pull request #44279 from jtlayton/wip-50235

cephfs-shell: add the ability to mount a named filesystem

Reviewed-by: Venky Shankar <vshankar@redhat.com>
3 years agoMerge pull request #44158 from adk3798/agent-failed
Sebastian Wagner [Wed, 5 Jan 2022 11:28:34 +0000 (12:28 +0100)]
Merge pull request #44158 from adk3798/agent-failed

mgr/cephadm: don't include agents in CEPHADM_FAILED_DAEMON

Reviewed-by: Sebastian Wagner <sewagner@redhat.com>
3 years agoMerge pull request #44388 from rhcs-dashboard/02-host-inventory-fix
Alfonso Martínez [Wed, 5 Jan 2022 10:32:22 +0000 (11:32 +0100)]
Merge pull request #44388 from rhcs-dashboard/02-host-inventory-fix

mgr/dashboard: fix orchestrator/02-hosts-inventory.e2e failure

Reviewed-by: Waad Alkhoury <walkhour@redhat.com>
Reviewed-by: Aashish Sharma <aasharma@redhat.com>
Reviewed-by: Alfonso Martínez <almartin@redhat.com>
Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
3 years agodoc/rados/operations: Document health check OSD_FILESTORE
Prashant D [Tue, 21 Dec 2021 19:39:51 +0000 (14:39 -0500)]
doc/rados/operations: Document health check OSD_FILESTORE

Signed-off-by: Prashant D <pdhange@redhat.com>
3 years agoosd, mon/OSDMonitor: Force the scheduler type to 'wpq' for filestore OSDs
Sridhar Seshasayee [Wed, 8 Dec 2021 07:36:28 +0000 (13:06 +0530)]
osd, mon/OSDMonitor: Force the scheduler type to 'wpq' for filestore OSDs

The 'mclock_scheduler' is not supported for filestore OSDs. Enforce the
usage of 'wpq' scheduler for such OSDs to avoid issues.

Also, in this scenario, the override of various config settings for the
'mclock_scheduler' are not performed.

Signed-off-by: Sridhar Seshasayee <sseshasa@redhat.com>
3 years agomon/OSDMonitor: Raise health warning for filestore osds
Prashant D [Fri, 12 Nov 2021 13:44:27 +0000 (13:44 +0000)]
mon/OSDMonitor: Raise health warning for filestore osds

Filestore will be deprecated in Quincy, considering
that BlueStore has been the default objectstore for
quite some time.

Fixes: https://tracker.ceph.com/issues/49275
Signed-off-by: Prashant D <pdhange@redhat.com>
3 years agoMerge pull request #44193 from adk3798/agent-upgrade
Sebastian Wagner [Wed, 5 Jan 2022 09:27:21 +0000 (10:27 +0100)]
Merge pull request #44193 from adk3798/agent-upgrade

mgr/cephadm: speed up upgrade when using agent

Reviewed-by: John Mulligan <jmulligan@redhat.com>
Reviewed-by: Sebastian Wagner <sewagner@redhat.com>
3 years agoMerge pull request #44266 from adk3798/http-reconfig
Sebastian Wagner [Wed, 5 Jan 2022 09:26:01 +0000 (10:26 +0100)]
Merge pull request #44266 from adk3798/http-reconfig

mgr/cephadm: reconfig agents over http

Reviewed-by: Sebastian Wagner <sewagner@redhat.com>