]> git.apps.os.sepia.ceph.com Git - ceph.git/log
ceph.git
17 months agomgr/dashboard: snap schedule module detect/eneable 54853/head
Ivo Almeida [Sun, 10 Dec 2023 17:44:33 +0000 (17:44 +0000)]
mgr/dashboard: snap schedule module detect/eneable

Fixes: https://tracker.ceph.com/issues/63768
Signed-off-by: Ivo Almeida <ialmeida@redhat.com>
17 months agoMerge pull request #55304 from afreen23/fix-64150
Pedro Gonzalez Gomez [Mon, 29 Jan 2024 07:56:55 +0000 (08:56 +0100)]
Merge pull request #55304 from afreen23/fix-64150

mgr/dashboard: Fix issue with Object's Overview page

Reviewed-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>
17 months agoMerge pull request #54103 from ceph/dependabot-npm_and_yarn-src-pybind-mgr-dashboard...
Nizamudeen A [Mon, 29 Jan 2024 07:06:38 +0000 (12:36 +0530)]
Merge pull request #54103 from ceph/dependabot-npm_and_yarn-src-pybind-mgr-dashboard-frontend-babel-traverse-7.23.2

mgr/dashboard: bump @babel/traverse from 7.23.0 to 7.23.2 in /src/pybind/mgr/dashboard/frontend

Reviewed-by: Nizamudeen A <nia@redhat.com>
17 months agoMerge pull request #53988 from ljflores/wip-read-balancer-mgr-module
Laura Flores [Mon, 29 Jan 2024 05:40:01 +0000 (23:40 -0600)]
Merge pull request #53988 from ljflores/wip-read-balancer-mgr-module

17 months agoMerge pull request #55312 from afreen23/fix-55270
Nizamudeen A [Mon, 29 Jan 2024 04:44:27 +0000 (10:14 +0530)]
Merge pull request #55312 from afreen23/fix-55270

mgr/dashboard:  Code refactor rgw migrate component for using correctly the MIGRATE action verb

Reviewed-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>
17 months agoMerge pull request #55341 from zdover23/wip-doc-2024-01-29-architecture-rados
zdover23 [Mon, 29 Jan 2024 03:33:57 +0000 (13:33 +1000)]
Merge pull request #55341 from zdover23/wip-doc-2024-01-29-architecture-rados

doc/architecture.rst: improve rados definition

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
17 months agoMerge pull request #55342 from ljflores/wip-tox-mgr
Laura Flores [Mon, 29 Jan 2024 03:18:30 +0000 (21:18 -0600)]
Merge pull request #55342 from ljflores/wip-tox-mgr

17 months agomgr: pin pytest to version 7.4.4 55342/head
Laura Flores [Mon, 29 Jan 2024 00:58:25 +0000 (00:58 +0000)]
mgr: pin pytest to version 7.4.4

On 2024-01-27, pytest updated to 8.0.0,
which broke run-tox-mgr.

https://docs.pytest.org/en/stable/changelog.html

==================================== ERRORS ====================================
_____________________ ERROR collecting alerts/__init__.py ______________________
alerts/__init__.py:2: in <module>
    from .module import Alerts
alerts/module.py:6: in <module>
    from mgr_module import CLIReadCommand, HandleCommandResult, MgrModule, Option
mgr_module.py:1: in <module>
    import ceph_module  # noqa
E   ModuleNotFoundError: No module named 'ceph_module'
______________________ ERROR collecting alerts/module.py _______________________
alerts/module.py:6: in <module>
    from mgr_module import CLIReadCommand, HandleCommandResult, MgrModule, Option
mgr_module.py:1: in <module>
    import ceph_module  # noqa
E   ModuleNotFoundError: No module named 'ceph_module'
____________________ ERROR collecting balancer/__init__.py _____________________
balancer/__init__.py:2: in <module>
    from .module import Module
balancer/module.py:12: in <module>
    from mgr_module import CLIReadCommand, CLICommand, CommandResult, MgrModule, Option, OSDMap, CephReleases
mgr_module.py:1: in <module>
    import ceph_module  # noqa
E   ModuleNotFoundError: No module named 'ceph_module'
_____________________ ERROR collecting balancer/module.py ______________________
balancer/module.py:12: in <module>
    from mgr_module import CLIReadCommand, CLICommand, CommandResult, MgrModule, Option, OSDMap, CephReleases
mgr_module.py:1: in <module>
    import ceph_module  # noqa
E   ModuleNotFoundError: No module named 'ceph_module'

Fixes: https://tracker.ceph.com/issues/64200
Signed-off-by: Laura Flores <lflores@ibm.com>
17 months agodoc/architecture.rst: improve rados definition 55341/head
Zac Dover [Sun, 28 Jan 2024 19:33:58 +0000 (05:33 +1000)]
doc/architecture.rst: improve rados definition

Improve the definition of RADOS, and link to information about RADOS.

Signed-off-by: Zac Dover <zac.dover@proton.me>
17 months agoosd: clear out unneeded pending pg-upmap-primary mappings 53988/head
Laura Flores [Fri, 26 Jan 2024 17:32:43 +0000 (17:32 +0000)]
osd: clear out unneeded pending pg-upmap-primary mappings

If the score did not improve, we should clear out any
pending pg-upmap-primary mappings so they don't execute
in situations where the same incremental is used to balance
multiple pools (i.e. in the balancer mgr module).

Signed-off-by: Laura Flores <lflores@ibm.com>
17 months agoqa/suites/rados/thrash/d-balancer: add read and upmap-read modes to balancer testing
Laura Flores [Wed, 3 Jan 2024 20:52:00 +0000 (20:52 +0000)]
qa/suites/rados/thrash/d-balancer: add read and upmap-read modes to balancer testing

Here, we turn the balancer mgr module to read and upmap read mode
during various thrashing scenarios.

Signed-off-by: Laura Flores <lflores@ibm.com>
17 months agomgr/balancer: add pg_upmap_primaries to `balancer status detail`
Laura Flores [Tue, 2 Jan 2024 21:28:03 +0000 (21:28 +0000)]
mgr/balancer: add pg_upmap_primaries to `balancer status detail`

Followup to https://github.com/ceph/ceph/pull/54801/commits/8a5553597ca6a428cb8ffc9fc5bebde048fbd068.
Streamlines some of the logic so pg upmap activity is properly
initalized, and updated in offline mode as well as online.

Signed-off-by: Laura Flores <lflores@ibm.com>
17 months agoPendingReleaseNotes: add note about read balancer mgr module integration
Laura Flores [Fri, 22 Dec 2023 19:23:41 +0000 (19:23 +0000)]
PendingReleaseNotes: add note about read balancer mgr module integration

Signed-off-by: Laura Flores <lflores@ibm.com>
17 months agomgr: add read balancer support inside the balancer module
Laura Flores [Thu, 18 Jan 2024 18:57:24 +0000 (18:57 +0000)]
mgr: add read balancer support inside the balancer module

Read balancing may now be managed automatically via the balancer
manager module. Users may choose between two new modes: ``upmap-read``, which
offers upmap and read optimization simultaneously, or ``read``, which may be used
to only optimize reads. Existing balancer commands have also been added to
contain more information about read balancing.

Run the following commands to test the new automatic behavior:
`ceph balancer on` (on by default)
`ceph balancer mode <read|upmap-read>`
`ceph balancer status`

Run the following commands to test the new supervised behavior:
`ceph balancer off`
`ceph balancer mode <read|upmap-read>`
`ceph balancer eval` | `ceph balancer eval <pool-name>`
`ceph balancer eval-verbose` | `ceph balancer eval-verbose <pool-name>`
`ceph balancer optimize <plan-name>`
`ceph balancer show <plan-name>`
`ceph balancer eval <plan-name>`
`ceph balancer execute <plan-name>`

In the balancer module, there is also a new "self_test" function which tests
the module's basic functionality. This test can be triggered with the following
commands:
`ceph mgr module enable selftest`
`ceph mgr self-test module balancer`

Related Trello: https://trello.com/c/sWoKctzL/859-add-read-balancer-support-inside-the-balancer-module
Signed-off-by: Laura Flores <lflores@ibm.com>
17 months agoMerge pull request #55281 from Matan-B/wip-matanb-crimson-cyanstore-rmcoll
Matan Breizman [Sun, 28 Jan 2024 09:22:39 +0000 (11:22 +0200)]
Merge pull request #55281 from Matan-B/wip-matanb-crimson-cyanstore-rmcoll

crimson/os/cyanstore: support OP_RMCOLL

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: chunmei-liu <chunmei.liu@intel.com>
17 months agoMerge pull request #55333 from zdover23/wip-doc-2024-01-27-radosgw-index-verb-disagre...
zdover23 [Sun, 28 Jan 2024 08:17:52 +0000 (18:17 +1000)]
Merge pull request #55333 from zdover23/wip-doc-2024-01-27-radosgw-index-verb-disagreement

doc/radosgw: fix verb disagreement - index.html

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
17 months agodoc/radosgw: fix verb disagreement - index.html 55333/head
Zac Dover [Sat, 27 Jan 2024 08:40:20 +0000 (18:40 +1000)]
doc/radosgw: fix verb disagreement - index.html

Fix a tricky verb disagreement and rewrite a few sentences for what I
hope is greater clarity.

Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
17 months agoMerge pull request #54423 from guits/dmcrypt-optim
Guillaume Abrioux [Sat, 27 Jan 2024 11:27:42 +0000 (12:27 +0100)]
Merge pull request #54423 from guits/dmcrypt-optim

ceph-volume: use 'no workqueue' options with dmcrypt

17 months agoceph-volume: fix partitions support in disk.get_devices() 54423/head
Guillaume Abrioux [Fri, 26 Jan 2024 20:35:18 +0000 (21:35 +0100)]
ceph-volume: fix partitions support in disk.get_devices()

The following:
```
is_part = get_file_contents(os.path.join(_sys_dev_block_path, item, 'partition')) == "1"
```
assumes any `/sys/dev/block/x:y/partition` contains '1' which is wrong.
This file actually contains the corresponding partition number.

Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>
17 months agoMerge pull request #55323 from ceph/dependabot-github_actions-gregsdennis-dependencie...
Laura Flores [Fri, 26 Jan 2024 22:46:36 +0000 (16:46 -0600)]
Merge pull request #55323 from ceph/dependabot-github_actions-gregsdennis-dependencies-action-1.3.2

.github: Bump gregsdennis/dependencies-action from 1.2.3 to 1.3.2

17 months agomgr: add CephReleases class to sustainably compare releases
Laura Flores [Thu, 18 Jan 2024 18:55:28 +0000 (18:55 +0000)]
mgr: add CephReleases class to sustainably compare releases

Changes how the upmap balancer compares min_mon_release
to account for release names eventually wrapping around the alphabet.

Signed-off-by: Laura Flores <lflores@ibm.com>
17 months agoMerge pull request #55331 from ceph/revert-55096-sjust/for-review/wip-crush-msr
Laura Flores [Fri, 26 Jan 2024 22:15:46 +0000 (16:15 -0600)]
Merge pull request #55331 from ceph/revert-55096-sjust/for-review/wip-crush-msr

Revert "crush: add multistep retry rules"

17 months agoceph-volume: use 'no workqueue' options with dmcrypt
Guillaume Abrioux [Wed, 8 Nov 2023 16:43:46 +0000 (16:43 +0000)]
ceph-volume: use 'no workqueue' options with dmcrypt

CloudFlare engineers made some testing and realized that using
workqueues with encryption on flash devices has a bad effect.

See [1] for details.

With this patch it will make ceph-volume call crypsetup with
`--perf-no_read_workqueue` and `--perf-no_write_workqueue` options
when the device is not a rotational.

[1] https://blog.cloudflare.com/speeding-up-linux-disk-encryption/

Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>
Co-Authored-by: Stefan Kooman <stefan@kooman.org>
Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>
17 months agoRevert "crush: add multistep retry rules" 55331/head
Samuel Just [Fri, 26 Jan 2024 20:23:03 +0000 (12:23 -0800)]
Revert "crush: add multistep retry rules"

This PR was merged by accident before it was ready.
Let's revert for now and open a new PR.

Signed-off-by: Samuel Just <sjust@redhat.com>
17 months agoMerge pull request #55096 from athanatos/sjust/for-review/wip-crush-msr
Yuri Weinstein [Fri, 26 Jan 2024 19:57:53 +0000 (11:57 -0800)]
Merge pull request #55096 from athanatos/sjust/for-review/wip-crush-msr

crush: add multistep retry rules

Reviewed-by: Laura Flores <lflores@redhat.com>
17 months agoMerge pull request #55303 from cbodley/wip-63130-debug
Casey Bodley [Fri, 26 Jan 2024 14:18:48 +0000 (14:18 +0000)]
Merge pull request #55303 from cbodley/wip-63130-debug

cmake/arrow: don't treat warnings as errors

Reviewed-by: Leonid Usov <leonid.usov@ibm.com>
17 months agoMerge pull request #55234 from ajarr/wip-64063
Ilya Dryomov [Fri, 26 Jan 2024 11:37:52 +0000 (12:37 +0100)]
Merge pull request #55234 from ajarr/wip-64063

rbd-nbd: use netlink interface by default

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
17 months agoMerge pull request #55121 from zhscn/fix-ambiguous-error
Kefu Chai [Fri, 26 Jan 2024 06:42:06 +0000 (14:42 +0800)]
Merge pull request #55121 from zhscn/fix-ambiguous-error

common: fix ambiguous error when using gcc 13

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Kefu Chai <tchaikov@gmail.com>
17 months agoMerge pull request #54896 from cyx1231st/wip-crimson-save-conn-foreign-copy
Yingxin [Fri, 26 Jan 2024 05:47:37 +0000 (13:47 +0800)]
Merge pull request #54896 from cyx1231st/wip-crimson-save-conn-foreign-copy

crimson/osd: drop a foreign-copy to shard-0 for every pg operation

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Chunmei Liu <chunmei.liu@intel.com>
Reviewed-by: Matan Breizman <mbreizma@redhat.com>
17 months agoMerge pull request #52496 from adamemerson/wip-rgw-surface-neorados
Casey Bodley [Fri, 26 Jan 2024 02:43:44 +0000 (02:43 +0000)]
Merge pull request #52496 from adamemerson/wip-rgw-surface-neorados

rgw: Surface neorados

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>
Reviewed-by: Casey Bodley <cbodley@redhat.com>
17 months ago.github: Bump gregsdennis/dependencies-action from 1.2.3 to 1.3.2 55323/head
dependabot[bot] [Thu, 25 Jan 2024 23:57:13 +0000 (23:57 +0000)]
.github: Bump gregsdennis/dependencies-action from 1.2.3 to 1.3.2

Bumps [gregsdennis/dependencies-action](https://github.com/gregsdennis/dependencies-action) from 1.2.3 to 1.3.2.
- [Release notes](https://github.com/gregsdennis/dependencies-action/releases)
- [Commits](https://github.com/gregsdennis/dependencies-action/compare/80b5ffec566913b1494d5a8577ab0d60e476271d...f98d55eee1f66e7aaea4a60e71892736ae2548c7)

---
updated-dependencies:
- dependency-name: gregsdennis/dependencies-action
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
17 months agoMerge pull request #55308 from ljflores/wip-dependabot
Laura Flores [Thu, 25 Jan 2024 23:56:29 +0000 (17:56 -0600)]
Merge pull request #55308 from ljflores/wip-dependabot

17 months agoMerge pull request #55307 from zdover23/wip-doc-2024-01-25-radosgw-admin-usage
zdover23 [Thu, 25 Jan 2024 23:24:13 +0000 (09:24 +1000)]
Merge pull request #55307 from zdover23/wip-doc-2024-01-25-radosgw-admin-usage

doc/radosgw: edit "Usage" admin.rst

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
17 months agodoc/radosgw: edit "Usage" admin.rst 55307/head
Zac Dover [Wed, 24 Jan 2024 19:13:00 +0000 (05:13 +1000)]
doc/radosgw: edit "Usage" admin.rst

Edit "Usage" in doc/radosgw/admin.rst.

Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
17 months agoMerge pull request #55315 from cbodley/wip-moncommand-dencoder
Casey Bodley [Thu, 25 Jan 2024 17:12:07 +0000 (17:12 +0000)]
Merge pull request #55315 from cbodley/wip-moncommand-dencoder

mon: zero-initialize MonCommand::flags

Reviewed-by: Kefu Chai <tchaikov@gmail.com>
17 months agorbd-nbd: map using netlink interface by default 55234/head
Ramana Raja [Wed, 17 Jan 2024 18:24:36 +0000 (13:24 -0500)]
rbd-nbd: map using netlink interface by default

Mapping rbd images to nbd devices using ioctl interface is not
robust. It was discovered that the device size or the md5 checksum
of the nbd device was incorrect immediately after mapping using
ioctl method. When using the nbd netlink interface to map RBD images
the issue was not encountered. Switch to using nbd netlink interface
for mapping.

Fixes: https://tracker.ceph.com/issues/64063
Signed-off-by: Ramana Raja <rraja@redhat.com>
17 months agocrimson/os/cyanstore: support OP_RMCOLL 55281/head
Matan Breizman [Tue, 23 Jan 2024 12:18:05 +0000 (12:18 +0000)]
crimson/os/cyanstore: support OP_RMCOLL

Signed-off-by: Matan Breizman <mbreizma@redhat.com>
17 months agomon: zero-initialize MonCommand::flags 55315/head
Casey Bodley [Thu, 25 Jan 2024 14:45:08 +0000 (09:45 -0500)]
mon: zero-initialize MonCommand::flags

causing failures in check-generated.sh:

2 MonCommand
/tmp/typ-qmnDZ1ahR /tmp/typ-lIgJTZiUe differ: byte 100, line 6
**** MonCommand test 1 dump_json check failed ****
   ceph-dencoder type MonCommand select_test 1 dump_json > /tmp/typ-qmnDZ1ahR
   ceph-dencoder type MonCommand select_test 1 encode decode dump_json > /tmp/typ-lIgJTZiUe
6c6
<     "flags": 94661933599904
---
>     "flags": 94342467308704

Signed-off-by: Casey Bodley <cbodley@redhat.com>
17 months agoMerge pull request #55287 from ajarr/wip-64139
Ilya Dryomov [Thu, 25 Jan 2024 12:04:26 +0000 (13:04 +0100)]
Merge pull request #55287 from ajarr/wip-64139

rbd-nbd: fix resize of images mapped using netlink

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
17 months agomgr/dashboard: Code refactor rgw migrate component for using correctly the MIGRATE... 55312/head
Afreen [Thu, 25 Jan 2024 11:21:06 +0000 (16:51 +0530)]
mgr/dashboard:  Code refactor rgw migrate component for using correctly the MIGRATE action verb

fixes https://tracker.ceph.com/issues/64152

this.MIGRATE = $localize`Migrate to Multi-Site`;

Just like other action verbs we should set this.Migrate = "MIGRATE" only.
This will require rephrasing in the following places as well:
1. https://github.com/ceph/ceph/blob/d3256c484136a1b32b79a904861f681a9248ba3c/src/pybind/mgr/dashboard/frontend/src/app/ceph/rgw/rgw-multisite-details/rgw-multisite-details.component.ts#L223-L228

2. https://github.com/ceph/ceph/blob/d3256c484136a1b32b79a904861f681a9248ba3c/src/pybind/mgr/dashboard/frontend/src/app/ceph/rgw/rgw-multisite-migrate/rgw-multisite-migrate.component.ts#L180-L18

Signed-off-by: Afreen <afreen23.git@gmail.com>
17 months agoMerge pull request #55270 from afreen23/fix-cap-inconsistency-multisite
Nizamudeen A [Thu, 25 Jan 2024 10:10:43 +0000 (15:40 +0530)]
Merge pull request #55270 from afreen23/fix-cap-inconsistency-multisite

mgr/dashboard: Fix inconsistency in capitalisation of "Multi-site"

Reviewed-by: Ankush Behl <cloudbehl@gmail.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>
Reviewed-by: rosinL <NOT@FOUND>
17 months agoMerge pull request #55182 from rkachach/fix_issue_64029
Redouane Kachach [Thu, 25 Jan 2024 09:23:43 +0000 (10:23 +0100)]
Merge pull request #55182 from rkachach/fix_issue_64029

mgr/rook: adding some basic rook e2e testing

17 months agoMerge pull request #55266 from athanatos/sjust/wip-63996
Samuel Just [Thu, 25 Jan 2024 05:05:09 +0000 (21:05 -0800)]
Merge pull request #55266 from athanatos/sjust/wip-63996

crimson: retain map references in OSDSingletonState::store_maps

Reviewed-by: Xuehan Xu <xuxuehan@qianxin.com>
Reviewed-by: Matan Breizman <mbreizma@redhat.com>
17 months agocrimson/osd/shard_services: retain map references in OSDSingletonState::store_maps 55266/head
Samuel Just [Wed, 10 Jan 2024 17:43:45 +0000 (09:43 -0800)]
crimson/osd/shard_services: retain map references in OSDSingletonState::store_maps

Introduced: 3f11cd94
Fixes: https://tracker.ceph.com/issues/63996
Signed-off-by: Samuel Just <sjust@redhat.com>
17 months agocrimson/osd/shard_service.cc: convert to newer logging machinery
Samuel Just [Wed, 10 Jan 2024 17:16:49 +0000 (17:16 +0000)]
crimson/osd/shard_service.cc: convert to newer logging machinery

Signed-off-by: Samuel Just <sjust@redhat.com>
17 months agocrimson/osd/osd.cc: migrate logging to new style
Samuel Just [Sat, 6 Jan 2024 23:32:03 +0000 (15:32 -0800)]
crimson/osd/osd.cc: migrate logging to new style

Signed-off-by: Samuel Just <sjust@redhat.com>
17 months agoMerge pull request #55288 from athanatos/sjust/wip-64140
Samuel Just [Thu, 25 Jan 2024 01:23:47 +0000 (17:23 -0800)]
Merge pull request #55288 from athanatos/sjust/wip-64140

Revert "crimson/os/alienstore/alien_log: _flush concurrently"

Reviewed-by: Matan Breizman <mbreizma@redhat.com>
Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
17 months agorgw: Thread site config into RADOS driver 52496/head
Adam Emerson [Thu, 9 Nov 2023 20:23:53 +0000 (15:23 -0500)]
rgw: Thread site config into RADOS driver

Make this available so multisite sync doesn't have to rely on the zone
service.

Signed-off-by: Adam Emerson <aemerson@redhat.com>
17 months agorgw: Asio frontend shares `io_context` with the rest of RGW
Adam C. Emerson [Fri, 14 Apr 2023 19:45:04 +0000 (15:45 -0400)]
rgw: Asio frontend shares `io_context` with the rest of RGW

Pull the `io_context` and threads out of `AsioFrontend`, pass in a
reference to `io_context_pool` so it can be shut down at `AsioFrontend::join`

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
17 months agoMerge pull request #54987 from batrick/i63822
Yuri Weinstein [Wed, 24 Jan 2024 21:31:31 +0000 (13:31 -0800)]
Merge pull request #54987 from batrick/i63822

pybind/mgr/devicehealth: skip legacy objects that cannot be loaded

Reviewed-by: Nitzan Mordechai <nmordech@redhat.com>
Reviewed-by: Yaarit Hatuka <yaarithatuka@gmail.com>
17 months agoMerge pull request #54491 from jianwei1216/fix_osd_pg_stat_report_interval_max_cmain
Yuri Weinstein [Wed, 24 Jan 2024 21:30:50 +0000 (13:30 -0800)]
Merge pull request #54491 from jianwei1216/fix_osd_pg_stat_report_interval_max_cmain

fix: resolve inconsistent judgment of osd_pg_stat_report_interval_max

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Matan Breizman <Matan.Brz@gmail.com>
17 months agoMerge pull request #53250 from YiteGu/add-perfcount-for-allocator
Yuri Weinstein [Wed, 24 Jan 2024 21:30:07 +0000 (13:30 -0800)]
Merge pull request #53250 from YiteGu/add-perfcount-for-allocator

os/bluestore: add perfcount for bluestore/bluefs allocator

Reviewed-by: Igor Fedotov <ifedotov@suse.com>
17 months agoMerge pull request #52530 from amathuria/wip-amat-fix-59531
Yuri Weinstein [Wed, 24 Jan 2024 21:28:16 +0000 (13:28 -0800)]
Merge pull request #52530 from amathuria/wip-amat-fix-59531

osd: Add memstore to unsupported objstores for QoS

Reviewed-by: Sridhar Seshasayee <sseshasa@redhat.com>
Reviewed-by: Samuel Just <sjust@redhat.com>
17 months agorgw: SAL drivers take `boost::asio::io_context`
Adam C. Emerson [Fri, 14 Apr 2023 18:59:19 +0000 (14:59 -0400)]
rgw: SAL drivers take `boost::asio::io_context`

`RadosDriver` needs it, and since SAL generally uses `optional_yield`,
other stores are going to need it to implement that sensibly.

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
17 months agoceph/async: `io_context_pool` constructor/start takes init function
Adam C. Emerson [Fri, 14 Apr 2023 19:43:36 +0000 (15:43 -0400)]
ceph/async: `io_context_pool` constructor/start takes init function

Add overloads of the constructor and `start` that take an init
function, for setting up the thread before dropping into
`io_context::run`.

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
17 months agocommon/async: `io_context_pool` constructor takes int64_t
Adam C. Emerson [Fri, 14 Apr 2023 18:58:16 +0000 (14:58 -0400)]
common/async: `io_context_pool` constructor takes int64_t

Just because this is what Ceph's config uses and it saves a narrowing
conversion. If we want to set a max value on the thread count, we
should do it in config.

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
17 months agoinclude/inline_memory.h: Add missing includes to fix build
Adam Emerson [Wed, 10 Jan 2024 17:37:15 +0000 (12:37 -0500)]
include/inline_memory.h: Add missing includes to fix build

Signed-off-by: Adam Emerson <aemerson@redhat.com>
17 months ago.github: fix length of dependabot commit message prefix 55308/head
Laura Flores [Wed, 24 Jan 2024 20:37:04 +0000 (20:37 +0000)]
.github: fix length of dependabot commit message prefix

Followup to commit 265f727481c729fbe78dce7410def7de6fe869d1.

Fixes this error mesage under GitHub Insights (https://github.com/ceph/ceph/network/updates):

```
The property '#/updates/1/commit-message/prefix' was not of a maximum string length of 15
```

Signed-off-by: Laura Flores <lflores@ibm.com>
17 months agorbd-nbd: log errors during netlink_resize() using derr 55287/head
Ramana Raja [Tue, 23 Jan 2024 21:07:04 +0000 (16:07 -0500)]
rbd-nbd: log errors during netlink_resize() using derr

When using rbd CLI to map the images to NBD devices via netlink,
any errors that arose during image resizing in netlink_resize()
were not logged. Switching the error logging from using cerr to
derr helps log the errors from netlink_resize().

Signed-off-by: Ramana Raja <rraja@redhat.com>
17 months agorbd_nbd: fix resize of images mapped using netlink
Ramana Raja [Mon, 22 Jan 2024 22:06:58 +0000 (17:06 -0500)]
rbd_nbd: fix resize of images mapped using netlink

Include device identifier or cookie in the message sent to the kernel
to resize images mapped to NBD devices using netlink. Otherwise,
netlink_resize() fails and the size of the device isn't updated.

Fixes: https://tracker.ceph.com/issues/64139
Signed-off-by: Ramana Raja <rraja@redhat.com>
17 months agoMerge pull request #49462 from rzarzynski/wip-bug-53789
Laura Flores [Wed, 24 Jan 2024 20:00:03 +0000 (14:00 -0600)]
Merge pull request #49462 from rzarzynski/wip-bug-53789

osdc: fix the ENOCONN normalization in Objecter::_linger_reconnect()

17 months agoMerge pull request #55219 from samarahu/rgw_asio_frontend_asserts
Casey Bodley [Wed, 24 Jan 2024 18:39:14 +0000 (18:39 +0000)]
Merge pull request #55219 from samarahu/rgw_asio_frontend_asserts

rgw/asio: Add asserts to rgw_asio_frontend.cc

Reviewed-by: Adam C. Emerson <aemerson@redhat.com>
17 months agomgr/rook: increase minikube mem to 6GB to avoid stability issues 55182/head
Redouane Kachach [Wed, 24 Jan 2024 18:03:56 +0000 (19:03 +0100)]
mgr/rook: increase minikube mem to 6GB to avoid stability issues

Signed-off-by: Redouane Kachach <rkachach@redhat.com>
17 months agoMerge pull request #55192 from dparmar18/fix_docstrings_ceph_test_case
Gregory Farnum [Wed, 24 Jan 2024 17:37:34 +0000 (09:37 -0800)]
Merge pull request #55192 from dparmar18/fix_docstrings_ceph_test_case

qa: typo fixes in ceph_test_case docstrings

17 months agoMerge pull request #53320 from jzhu116-bloomberg/wip-62710
Casey Bodley [Wed, 24 Jan 2024 16:01:43 +0000 (16:01 +0000)]
Merge pull request #53320 from jzhu116-bloomberg/wip-62710

rgw/multisite: maintain endpoints connectable status and retry the requests to them when appropriate

Reviewed-by: Mark Kogan <mkogan@ibm.com>
17 months agoMerge pull request #54941 from samsungceph/vstart_network_v2
Adam King [Wed, 24 Jan 2024 15:25:23 +0000 (10:25 -0500)]
Merge pull request #54941 from samsungceph/vstart_network_v2

vstart: Pick only CIDR-formatted routes when cephadm enabled

Reviewed-by: Adam King <adking@redhat.com>
17 months agomgr/dashboard: Fix issue with Object's Overview page 55304/head
Afreen [Wed, 24 Jan 2024 15:09:54 +0000 (20:39 +0530)]
mgr/dashboard: Fix issue with Object's Overview page

Fixes https://tracker.ceph.com/issues/64150

Issue: Due to absence of null checks, the overview page was failing to
load.

Fix: Added, null check at relevant places to ensure data availability
before accessing the data.

Signed-off-by: Afreen <afreen23.git@gmail.com>
17 months agocmake/arrow: don't treat warnings as errors 55303/head
Casey Bodley [Wed, 24 Jan 2024 14:44:25 +0000 (09:44 -0500)]
cmake/arrow: don't treat warnings as errors

with arrow's default BUILD_WARNING_LEVEL, -Werror is added to cflags and
debug builds fail on a warning about _FORTIFY_SOURCE

Fixes: https://tracker.ceph.com/issues/63130
Signed-off-by: Casey Bodley <cbodley@redhat.com>
17 months agoMerge pull request #53668 from mdw-at-linuxbox/wip-master-update-kmip-1
Casey Bodley [Wed, 24 Jan 2024 13:30:25 +0000 (13:30 +0000)]
Merge pull request #53668 from mdw-at-linuxbox/wip-master-update-kmip-1

Update libkmip to pull in some portability changes.

Reviewed-by: Casey Bodley <cbodley@redhat.com>
17 months agomgr/rook: adding some basic rook e2e testing
Redouane Kachach [Mon, 15 Jan 2024 14:25:02 +0000 (15:25 +0100)]
mgr/rook: adding some basic rook e2e testing
Fixes: https://tracker.ceph.com/issues/64029
Signed-off-by: Redouane Kachach <rkachach@redhat.com>
17 months agoRevert "crimson/os/alienstore/alien_log: _flush concurrently" 55288/head
Samuel Just [Tue, 23 Jan 2024 21:47:27 +0000 (21:47 +0000)]
Revert "crimson/os/alienstore/alien_log: _flush concurrently"

While submitting the log line asyncronously is reasonable,
with this implementation the EntryVector &q parameter does
not necessarily outlive the submission continuation.

This reverts commit 511af83e2747361350b60ce0ce88e67a726d9343.

Fixes: https://tracker.ceph.com/issues/64140
Signed-off-by: Samuel Just <sjust@redhat.com>
17 months agoMerge pull request #55183 from galsalomon66/s3select_fixes_QE_bugs
Gal Salomon [Tue, 23 Jan 2024 21:59:15 +0000 (23:59 +0200)]
Merge pull request #55183 from galsalomon66/s3select_fixes_QE_bugs

rgw/s3select: bug fixes per QE recent defects

17 months agoUpdate libkmip submodule to pull in some portability changes. 53668/head
Marcus Watts [Tue, 26 Sep 2023 07:04:35 +0000 (03:04 -0400)]
Update libkmip submodule to pull in some portability changes.

Signed-off-by: Marcus Watts <mwatts@redhat.com>
17 months agoMerge pull request #55217 from ronen-fr/wip-rf-old-reserv
Ronen Friedman [Tue, 23 Jan 2024 19:39:35 +0000 (21:39 +0200)]
Merge pull request #55217 from ronen-fr/wip-rf-old-reserv

osd/scrub: check reservation replies for relevance

Reviewed-by: Samuel Just <sjust@redhat.com>-
17 months agoMerge pull request #55067 from yaarith/telemetry-pool-flags
Laura Flores [Tue, 23 Jan 2024 18:22:32 +0000 (12:22 -0600)]
Merge pull request #55067 from yaarith/telemetry-pool-flags

mgr/telemetry: add pool flags

17 months agoMerge pull request #55240 from rosinL/wip-fix-64032
Laura Flores [Tue, 23 Jan 2024 16:38:46 +0000 (10:38 -0600)]
Merge pull request #55240 from rosinL/wip-fix-64032

install-deps: Force remove ceph-libboost* packages

17 months agoMerge pull request #55278 from Himura2la/patch-2
zdover23 [Tue, 23 Jan 2024 15:58:23 +0000 (01:58 +1000)]
Merge pull request #55278 from Himura2la/patch-2

Docs: Specify correct fs type for mkfs on volume creation

Reviewed-by: Zac Dover <zac.dover@proton.me>
Reviewed-by: Ilya Dryomov <idryomov@redhat.com>
17 months agobug fixes per QE recent defects 55183/head
galsalomon66 [Mon, 15 Jan 2024 14:38:19 +0000 (16:38 +0200)]
bug fixes per QE recent defects
update for the engine_version message
s3select submodule

Signed-off-by: galsalomon66 <gal.salomon@gmail.com>
17 months agoMerge pull request #55277 from yuvalif/wip-yuval-63578
Casey Bodley [Tue, 23 Jan 2024 14:18:22 +0000 (14:18 +0000)]
Merge pull request #55277 from yuvalif/wip-yuval-63578

rgw/lua: fix compilation issue when lua packages are disabled

Reviewed-by: Casey Bodley <cbodley@redhat.com>
17 months agodoc: specify correct fs type for mkfs 55278/head
Himura Kazuto [Tue, 23 Jan 2024 12:59:10 +0000 (12:59 +0000)]
doc: specify correct fs type for mkfs

The default value is ext2, which is not supported (anymore?).

Signed-off-by: Vladislav Glagolev <vladislav.glagolev@devexpress.com>
17 months agorgw/lua: fix compilation issue when lua packages are disabled 55277/head
Yuval Lifshitz [Tue, 23 Jan 2024 11:09:26 +0000 (11:09 +0000)]
rgw/lua: fix compilation issue when lua packages are disabled

Fixes: https://tracker.ceph.com/issues/63578#change-253102
Signed-off-by: Yuval Lifshitz <ylifshit@redhat.com>
17 months agomgr/dashboard: Fix inconsistency in capitalisation of "Multi-site" 55270/head
Afreen [Tue, 23 Jan 2024 02:34:32 +0000 (08:04 +0530)]
mgr/dashboard: Fix inconsistency in capitalisation of "Multi-site"

fixes https://tracker.ceph.com/issues/64125

Across the dashboard, two instances are present: Multi-site and
Multi-Site.
Making it consistent all over by using Multi-site.

Signed-off-by: Afreen <afreen23.git@gmail.com>
17 months agoosd/scrub: check reservation replies for relevance 55217/head
Ronen Friedman [Wed, 17 Jan 2024 15:36:16 +0000 (09:36 -0600)]
osd/scrub: check reservation replies for relevance

Compare a token (nonce) carried in the reservation reply with the remembered
token of the reservation request.  If they don't match, the reply is
stale and should be ignored (and logged).

Fixes: https://tracker.ceph.com/issues/64052
Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
17 months agoMerge pull request #55107 from ronen-fr/wip-rf-rm-penaltyq
Ronen Friedman [Tue, 23 Jan 2024 05:57:39 +0000 (07:57 +0200)]
Merge pull request #55107 from ronen-fr/wip-rf-rm-penaltyq

osd/scrub: remove the 'penalty queue' from the scrubber

Reviewed-by: Samuel Just <sjust@redhat.com>-
17 months agoMerge pull request #55269 from zdover23/wip-doc-2024-01-23-radosgw-admin-read-write...
zdover23 [Tue, 23 Jan 2024 02:31:46 +0000 (12:31 +1000)]
Merge pull request #55269 from zdover23/wip-doc-2024-01-23-radosgw-admin-read-write-global-rate-limit-config

doc/radosgw: edit "read/write global rate limit" admin.rst

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
17 months agodoc/radosgw: edit "read/write global rate limit" admin.rst 55269/head
Zac Dover [Tue, 23 Jan 2024 02:13:10 +0000 (12:13 +1000)]
doc/radosgw: edit "read/write global rate limit" admin.rst

Edit "Reading/Writing Global Rate Limit Configuration" in
doc/radosgw/admin.rst.

Signed-off-by: Zac Dover <zac.dover@proton.me>
17 months agoMerge pull request #55223 from athanatos/sjust/wip-64055
Samuel Just [Mon, 22 Jan 2024 21:26:19 +0000 (13:26 -0800)]
Merge pull request #55223 from athanatos/sjust/wip-64055

crimson: clear obc_registry on interval change

Reviewed-by: Matan Breizman <mbreizma@redhat.com>
Reviewed-by: Samuel Just <sjust@redhat.com>
17 months agorgw/multisite: add multisite test cases with some rgw instances down 53320/head
Jane Zhu [Wed, 10 Jan 2024 05:40:35 +0000 (00:40 -0500)]
rgw/multisite: add multisite test cases with some rgw instances down

Signed-off-by: Juan Zhu <jzhu4@dev-10-34-20-139.pw1.bcc.bloomberg.com>
17 months agoMerge pull request #55070 from pdvian/wip-fix-progressevent
Yuri Weinstein [Mon, 22 Jan 2024 16:18:41 +0000 (08:18 -0800)]
Merge pull request #55070 from pdvian/wip-fix-progressevent

mon: initialize ProgressEvent::add_to_ceph_s

Reviewed-by: Laura Flores <lflores@redhat.com>
17 months agoMerge pull request #53154 from ifed01/wip-ifed-no-death-tests
Yuri Weinstein [Mon, 22 Jan 2024 16:17:35 +0000 (08:17 -0800)]
Merge pull request #53154 from ifed01/wip-ifed-no-death-tests

test/store_test: get rid off assert_death.

Reviewed-by: Adam Kupczyk <akupczyk@redhat.com>
Reviewed-by: Pere Diaz Bou <pdiazbou@redhat.com>
17 months agoMerge pull request #49415 from ljflores/wip-update-telemetry-upgrade
Yuri Weinstein [Mon, 22 Jan 2024 16:13:17 +0000 (08:13 -0800)]
Merge pull request #49415 from ljflores/wip-update-telemetry-upgrade

qa/workunits: update telemetry quincy workunits with `basic_pool_options_bluestore` collection

Reviewed-by: Yaarit Hatuka <yaarithatuka@gmail.com>
17 months agoosd/scrub: update job's NB on failure 55107/head
Ronen Friedman [Tue, 2 Jan 2024 16:09:06 +0000 (10:09 -0600)]
osd/scrub: update job's NB on failure

When a scrub job fails, update its NB to the current time plus a
fixed delay.  This prevents the job from being scheduled again
immediately.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
17 months agoosd/scrub: fix set_last_deep_scrub_stamp()
Ronen Friedman [Tue, 9 Jan 2024 14:15:33 +0000 (08:15 -0600)]
osd/scrub: fix set_last_deep_scrub_stamp()

The call should update last_scrub_stamp, too, without
requiring an extra call to on_scrub_schedule_input_change()

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
17 months agoosd/scrub: introduce a 'not before' attribute for scrub jobs
Ronen Friedman [Sun, 31 Dec 2023 16:18:09 +0000 (10:18 -0600)]
osd/scrub: introduce a 'not before' attribute for scrub jobs

The NB enables the OSD to delay the next attempt to schedule a specific
scrub job.  This is useful for jobs that have failed for whatever
reason, especially if the primary has failed to acquire the replicas.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
17 months agoosd/scrub: remove the 'penalized jobs' queue
Ronen Friedman [Sat, 30 Dec 2023 12:36:26 +0000 (06:36 -0600)]
osd/scrub: remove the 'penalized jobs' queue

The 'penalized jobs' queue was used to track scrub jobs that had failed
to acquire their replicas, and to prevent those jobs from being retried
too quickly.  This functionality will be replaced by a
simple 'not before' delay (see the next commits).

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
17 months agocrimson/osd/osd_operations/scrub_events: fix to use get_peering_request_pipeline() 54896/head
Yingxin Cheng [Mon, 22 Jan 2024 08:53:16 +0000 (16:53 +0800)]
crimson/osd/osd_operations/scrub_events: fix to use get_peering_request_pipeline()

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>
17 months agocrimson/osd: drop a foreign-copy to shard-0 for every pg operation
Yingxin Cheng [Mon, 11 Dec 2023 06:38:51 +0000 (14:38 +0800)]
crimson/osd: drop a foreign-copy to shard-0 for every pg operation

By using ConnectionRef before pg submission, and after that, change to
use ConnectionXcoreRef.

The intent is to drop the foreign copy of the connection to shard 0 at
pg submission time. This should remove two pairs of crosscore
communications in shard 0 for each I/O, one for connection-ref foreign
copy, another for connection-ref destruction.

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>
17 months agoMerge pull request #55262 from Matan-B/wip-matanb-crimson-bluestore-submit
Matan Breizman [Mon, 22 Jan 2024 08:44:23 +0000 (10:44 +0200)]
Merge pull request #55262 from Matan-B/wip-matanb-crimson-bluestore-submit

crimson/os/alienstore/alien_log: _flush concurrently

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
17 months agocrimson/osd/osd_operations/client_request: don't use and move pgref at the same time
Yingxin Cheng [Tue, 9 Jan 2024 06:39:41 +0000 (14:39 +0800)]
crimson/osd/osd_operations/client_request: don't use and move pgref at the same time

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>
17 months agocrimson/osd/osd_operations/client_request: cleanup shard_services
Yingxin Cheng [Mon, 8 Jan 2024 06:25:20 +0000 (14:25 +0800)]
crimson/osd/osd_operations/client_request: cleanup shard_services

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>