]> git.apps.os.sepia.ceph.com Git - ceph.git/log
ceph.git
3 months agocommon/pick_address: Add IPv6 support to is_addr_in_subnet 62814/head
Nitzan Mordechai [Thu, 28 Nov 2024 11:44:00 +0000 (11:44 +0000)]
common/pick_address: Add IPv6 support to is_addr_in_subnet

Updated the is_addr_in_subnet function to work with both
IPv4 and IPv6 addresses. Previously, it only supported IPv4,
which caused failures when IPv6 addresses were passed in.

Changes:
 - Use inet_pton to detect IPv4 (AF_INET) or IPv6 (AF_INET6).
 - Added sockaddr_in6 for IPv6 handling while keeping sockaddr_in for IPv4.
 - Adjust the family and ifa_addr dynamically based on the address type.

Fixes: https://tracker.ceph.com/issues/67517
Signed-off-by: Nitzan Mordechai <nmordech@redhat.com>
(cherry picked from commit d68857c1e57e93a68d9301b3beff7e652f327a9e)

3 months agoMerge pull request #62797 from zdover23/wip-doc-2025-04-14-backport-62784-to-reef
Anthony D'Atri [Mon, 14 Apr 2025 01:43:56 +0000 (21:43 -0400)]
Merge pull request #62797 from zdover23/wip-doc-2025-04-14-backport-62784-to-reef

reef: doc/cephadm: Add admonition re restarting an OSD service

3 months agodoc/cephadm: Add admonition re restarting an OSD service 62797/head
Anthony D'Atri [Fri, 11 Apr 2025 14:29:52 +0000 (10:29 -0400)]
doc/cephadm: Add admonition re restarting an OSD service

Signed-off-by: Anthony D'Atri <anthonyeleven@users.noreply.github.com>
(cherry picked from commit 524db4609f5f0b4639828b327cfc972243e08ce7)

3 months agoMerge pull request #62719 from ceph/reef-release
Yuri Weinstein [Thu, 10 Apr 2025 20:42:41 +0000 (13:42 -0700)]
Merge pull request #62719 from ceph/reef-release

v18.2.5

Reviewed-by: Ilya Dryomov <idryomov@redhat.com>
3 months agoMerge pull request #62721 from YiteGu/wip-reserved-by-bluefs-reef
Igor Fedotov [Thu, 10 Apr 2025 11:18:23 +0000 (14:18 +0300)]
Merge pull request #62721 from YiteGu/wip-reserved-by-bluefs-reef

reef: os/bluestore: make BlueFS an exclusive selector for volume reserved

Reviewed-by: Igor Fedotov <igor.fedotov@croit.io>
3 months agoMerge pull request #62751 from idryomov/wip-70831-reef
Ilya Dryomov [Thu, 10 Apr 2025 07:55:01 +0000 (09:55 +0200)]
Merge pull request #62751 from idryomov/wip-70831-reef

reef: test/librbd/test_notify.py: force line-buffered output

Reviewed-by: Mykola Golub <mykola.golub@clyso.com>
3 months agoMerge pull request #62326 from NitzanMordhai/wip-70495-reef
Naveen Naidu [Thu, 10 Apr 2025 02:07:16 +0000 (07:37 +0530)]
Merge pull request #62326 from NitzanMordhai/wip-70495-reef

reef: OSDMonitor: exclude destroyed OSDs from "ceph node ls" output

3 months agotest/librbd/test_notify.py: force line-buffered output 62751/head
Ilya Dryomov [Mon, 7 Apr 2025 10:40:50 +0000 (12:40 +0200)]
test/librbd/test_notify.py: force line-buffered output

"master" and "slave" invocations are intended to run in parallel and
coordinate between themselves.  Ensure that their respective output is
properly timestamped and ordered in teuthology.log file.

Fixes: https://tracker.ceph.com/issues/70831
Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit 49dbabcecec7c458a6d2ddfbd8e14a6ec419b953)

3 months agoMerge pull request #62152 from YiteGu/wip-70147-reef
Igor Fedotov [Wed, 9 Apr 2025 16:17:51 +0000 (19:17 +0300)]
Merge pull request #62152 from YiteGu/wip-70147-reef

reef: blk/kerneldevice: notify_all only required when discard_drain wait for condition

Reviewed-by: Igor Fedotov <igor.fedotov@croit.io>
3 months agoMerge pull request #62122 from ifed01/wip-ifed-some-recover-stuff-reef
Igor Fedotov [Wed, 9 Apr 2025 16:16:38 +0000 (19:16 +0300)]
Merge pull request #62122 from ifed01/wip-ifed-some-recover-stuff-reef

reef: tools/ceph-objectstore-tool: tricks to tolerate disk errors for "pg export" command

Reviewed-by: Adam Kupczyk <akupczyk@ibm.com>
3 months agoMerge pull request #62054 from k0ste/wip-67125-reef
Igor Fedotov [Wed, 9 Apr 2025 16:13:43 +0000 (19:13 +0300)]
Merge pull request #62054 from k0ste/wip-67125-reef

reef: os/bluestore: Fix ExtentDecoderPartial::_consume_new_blob

Reviewed-by: Igor Fedotov <igor.fedotov@croit.io>
3 months agoMerge pull request #62730 from afreen23/wip-70837-reef
afreen23 [Wed, 9 Apr 2025 11:02:36 +0000 (16:32 +0530)]
Merge pull request #62730 from afreen23/wip-70837-reef

reef: mgr/dashboard: Fix empty ceph version in GET api/hosts

Reviewed-by: Afreen Misbah <afreen@ibm.com>
Reviewed-by: Naman Munet <nmunet@redhat.com>
3 months agoos/bluestore: make BlueFS an exclusive selector for volume reserved 62721/head
Igor Fedotov [Wed, 28 Sep 2022 10:43:50 +0000 (13:43 +0300)]
os/bluestore: make BlueFS an exclusive selector for volume reserved
block size.

Signed-off-by: Igor Fedotov <ifedotov@croit.io>
(cherry picked from commit 3cc75e8670d4fb43062ce41744262f858116f764)

3 months agoMerge pull request #62216 from ifed01/wip-ifed-fix-expand-reef
Laura Flores [Tue, 8 Apr 2025 20:14:19 +0000 (15:14 -0500)]
Merge pull request #62216 from ifed01/wip-ifed-fix-expand-reef

reef: os/bluestore: fix bdev expansion and more

3 months agomgr/dashboard: Fix empty ceph version in GET api/hosts 62730/head
Afreen Misbah [Mon, 7 Apr 2025 19:00:08 +0000 (00:30 +0530)]
mgr/dashboard: Fix empty ceph version in GET api/hosts

Fixes https://tracker.ceph.com/issues/70821

Due to the pagination the host list is being fetched from orchestrator which caused a regression as via orchestrator list ceph version is always marked empty.
Caused by https://github.com/ceph/ceph/pull/52154

Also fixed tests , as the new version addition causing whole json object mock to fail in tests

Signed-off-by: Afreen Misbah <afreen@ibm.com>
(cherry picked from commit 1aa74cf3fce6394ad08e83e91a90923d983b03c5)

3 months ago18.2.5 62719/head v18.2.5
Ceph Release Team [Mon, 7 Apr 2025 16:49:17 +0000 (16:49 +0000)]
18.2.5

Signed-off-by: Ceph Release Team <ceph-maintainers@ceph.io>
3 months agoMerge pull request #62695 from zdover23/wip-doc-2025-04-07-backport-62687-to-reef
Anthony D'Atri [Sun, 6 Apr 2025 20:35:28 +0000 (16:35 -0400)]
Merge pull request #62695 from zdover23/wip-doc-2025-04-07-backport-62687-to-reef

reef: doc/cephadm/services: improve rgw.rst and snmp-gateway.rst

3 months agodoc/cephadm/services: improve rgw.rst and snmp-gateway.rst 62695/head
Anthony D'Atri [Sat, 5 Apr 2025 12:22:46 +0000 (08:22 -0400)]
doc/cephadm/services: improve rgw.rst and snmp-gateway.rst

Signed-off-by: Anthony D'Atri <anthonyeleven@users.noreply.github.com>
(cherry picked from commit 3d18222b46d65784335b4af0d651118b5e588d5e)

3 months agotest/librbd/test_notify.py: conditionally ignore some errors
Ilya Dryomov [Sat, 5 Apr 2025 10:26:23 +0000 (12:26 +0200)]
test/librbd/test_notify.py: conditionally ignore some errors

In 2020, commit 01ff1530544c ("librbd: make all maintenance op
notifications async") introduced a backwards compatibility issue where
if exclusive lock is held by an older (octopus and below) client and
a maintenance op is proxied to it from a newer client, the newer client
interprets the notification for the in-place completion of the op as
the notification for the acceptance of an async request and expects
another notification for the completion of the op which never comes.
In 2021, this bug was discovered and test_notify.py was amended to
ignore it in commit 9c0b239d70cd ("qa/upgrade: conditionally disable
update_features tests").

However the two update_features tests that started hanging and got
disabled weren't the only ones to misbehave.  Rename, create_snap and
remove_snap tests were affected too but didn't hang or fail because
librbd also filtered certain errors codes like EEXIST and EINVAL.
Taking rename is an example:

1. a rename request is sent to from a newer client (N) to an octopus
   client (O)
2. O successfully renames the image and sends a completion notification
   with result = 0
3. N mistakes it for async request acceptance
4. after a timeout, N resends the rename request to O
5. O sees that an image already has that name (after step 2) and sends
   a completion notification with result = EEXIST
6. N interprets it as async request denial and bubbles up EEXIST,
   however right before returning control from Operations::rename()
   EEXIST is filtered and 0 is returned to the user

So back then rename, create_snap and remove_snap tests continued to
pass but started taking 30+ seconds instead of completing immediately.
In 2025 we did away with filtering error codes in commit 66508cdaa190
("librbd: stop filtering async request error codes") and these tests
started to fail.  Following the approach taken in commit 9c0b239d70cd
("qa/upgrade: conditionally disable update_features tests"), let's
ignore these failures based on the same environment variable.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
(cherry picked from commit e7aeb7e325b8777251051dcc32e3baa70880257d)

3 months agoMerge pull request #62688 from idryomov/wip-test-notify-ignore-errors-reef
Ilya Dryomov [Sun, 6 Apr 2025 10:31:04 +0000 (12:31 +0200)]
Merge pull request #62688 from idryomov/wip-test-notify-ignore-errors-reef

reef: test/librbd/test_notify.py: conditionally ignore some errors

Reviewed-by: Yuri Weinstein <yweinste@redhat.com>
3 months agotest/librbd/test_notify.py: conditionally ignore some errors 62688/head
Ilya Dryomov [Sat, 5 Apr 2025 10:26:23 +0000 (12:26 +0200)]
test/librbd/test_notify.py: conditionally ignore some errors

In 2020, commit 01ff1530544c ("librbd: make all maintenance op
notifications async") introduced a backwards compatibility issue where
if exclusive lock is held by an older (octopus and below) client and
a maintenance op is proxied to it from a newer client, the newer client
interprets the notification for the in-place completion of the op as
the notification for the acceptance of an async request and expects
another notification for the completion of the op which never comes.
In 2021, this bug was discovered and test_notify.py was amended to
ignore it in commit 9c0b239d70cd ("qa/upgrade: conditionally disable
update_features tests").

However the two update_features tests that started hanging and got
disabled weren't the only ones to misbehave.  Rename, create_snap and
remove_snap tests were affected too but didn't hang or fail because
librbd also filtered certain errors codes like EEXIST and EINVAL.
Taking rename is an example:

1. a rename request is sent to from a newer client (N) to an octopus
   client (O)
2. O successfully renames the image and sends a completion notification
   with result = 0
3. N mistakes it for async request acceptance
4. after a timeout, N resends the rename request to O
5. O sees that an image already has that name (after step 2) and sends
   a completion notification with result = EEXIST
6. N interprets it as async request denial and bubbles up EEXIST,
   however right before returning control from Operations::rename()
   EEXIST is filtered and 0 is returned to the user

So back then rename, create_snap and remove_snap tests continued to
pass but started taking 30+ seconds instead of completing immediately.
In 2025 we did away with filtering error codes in commit 66508cdaa190
("librbd: stop filtering async request error codes") and these tests
started to fail.  Following the approach taken in commit 9c0b239d70cd
("qa/upgrade: conditionally disable update_features tests"), let's
ignore these failures based on the same environment variable.

Signed-off-by: Ilya Dryomov <idryomov@gmail.com>
3 months agoMerge pull request #62664 from afreen23/wip-70790-reef
afreen23 [Sat, 5 Apr 2025 00:51:32 +0000 (06:21 +0530)]
Merge pull request #62664 from afreen23/wip-70790-reef

reef: mgr/dashboard: Dashboard not showing Object/Overview correctly.

Reviewed-by: Afreen Misbah <afreen@ibm.com>
3 months agorgw: remove keep_tail from RGWObjState
Jane Zhu [Wed, 2 Apr 2025 15:53:47 +0000 (15:53 +0000)]
rgw: remove keep_tail from RGWObjState

Signed-off-by: Jane Zhu <jzhu116@bloomberg.net>
(cherry picked from commit fd76b6466c298121994ba65cce3c0e76f8568841)

Conflicts:
src/rgw/rgw_sal_store.h RGWObjState is in rgw_sal.h on reef
(cherry picked from commit b94fcdfb6a2e01fd471e8c6ebd34145bebb78e20)

3 months agorgw: keep the tails when copying object to itself
Jane Zhu [Tue, 1 Apr 2025 13:45:36 +0000 (13:45 +0000)]
rgw: keep the tails when copying object to itself

Signed-off-by: Jane Zhu <jzhu116@bloomberg.net>
(cherry picked from commit 333e4a9b0de745cf5be40c5f6c32df7a340b007a)

Conflicts:
src/rgw/driver/rados/rgw_rados.cc
src/rgw/driver/rados/rgw_rados.h
    _do_write_meta() no req_context arg
    complete_atomic_modification() no optional_yield arg
(cherry picked from commit fdea7f34829010aaf77e8bb7ae979b07887abe78)

3 months agoMerge pull request #62656 from cbodley/wip-70784-reef
Yuri Weinstein [Fri, 4 Apr 2025 17:41:48 +0000 (10:41 -0700)]
Merge pull request #62656 from cbodley/wip-70784-reef

reef: rgw: keep the tails when copying object to itself

Reviewed-by:Jane Zhu <jzhu116@bloomberg.net>

3 months agoMerge pull request #62667 from zdover23/wip-doc-2025-04-04-backport-62661-to-reef
Anthony D'Atri [Thu, 3 Apr 2025 20:46:03 +0000 (16:46 -0400)]
Merge pull request #62667 from zdover23/wip-doc-2025-04-04-backport-62661-to-reef

reef: doc/radosgw: Improve cloud-restore and cloud-transition

3 months agodoc/radosgw: Improve cloud-restore and cloud-transition 62667/head
Anthony D'Atri [Thu, 3 Apr 2025 17:58:49 +0000 (13:58 -0400)]
doc/radosgw: Improve cloud-restore and cloud-transition

Signed-off-by: Anthony D'Atri <anthonyeleven@users.noreply.github.com>
(cherry picked from commit 6a6807470850ea972c5c3786be3735d26875b221)

3 months agoMerge pull request #62261 from dvanders/dvanders_doc_fixes_reef
Anthony D'Atri [Thu, 3 Apr 2025 19:22:32 +0000 (15:22 -0400)]
Merge pull request #62261 from dvanders/dvanders_doc_fixes_reef

reef: doc/rados/configuration/bluestore-config-ref: Fix lowcase typo

3 months agomgr/dashboard: Dashboard not showing Object/Overview correctly. 62664/head
Aashish Sharma [Wed, 31 Jul 2024 11:51:02 +0000 (17:21 +0530)]
mgr/dashboard: Dashboard not showing Object/Overview correctly.

Add undefined checks wherever necessary to the updateChartData method in
dashboard-area-chart component

Fixes: https://tracker.ceph.com/issues/67009
Signed-off-by: Aashish Sharma <aasharma@redhat.com>
(cherry picked from commit f4b3943fb2e7e16835193b48b94d32d9eb575ecf)

Conflicts:
src/pybind/mgr/dashboard/frontend/src/app/ceph/dashboard-v3/dashboard-area-chart/dashboard-area-chart.component.ts

3 months agorgw: remove keep_tail from RGWObjState 62656/head
Jane Zhu [Wed, 2 Apr 2025 15:53:47 +0000 (15:53 +0000)]
rgw: remove keep_tail from RGWObjState

Signed-off-by: Jane Zhu <jzhu116@bloomberg.net>
(cherry picked from commit fd76b6466c298121994ba65cce3c0e76f8568841)

Conflicts:
src/rgw/rgw_sal_store.h RGWObjState is in rgw_sal.h on reef

3 months agorgw: keep the tails when copying object to itself
Jane Zhu [Tue, 1 Apr 2025 13:45:36 +0000 (13:45 +0000)]
rgw: keep the tails when copying object to itself

Signed-off-by: Jane Zhu <jzhu116@bloomberg.net>
(cherry picked from commit 333e4a9b0de745cf5be40c5f6c32df7a340b007a)

Conflicts:
src/rgw/driver/rados/rgw_rados.cc
src/rgw/driver/rados/rgw_rados.h
    _do_write_meta() no req_context arg
    complete_atomic_modification() no optional_yield arg

3 months agoMerge pull request #62569 from ronen-fr/wip-rf-61590-reef
Ronen Friedman [Thu, 3 Apr 2025 08:15:41 +0000 (11:15 +0300)]
Merge pull request #62569 from ronen-fr/wip-rf-61590-reef

reef: osd/scrub: discard repair_oinfo_oid()

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
3 months agoMerge pull request #62645 from zdover23/wip-doc-2025-04-03-backport-62643-to-reef
Anthony D'Atri [Thu, 3 Apr 2025 04:19:40 +0000 (00:19 -0400)]
Merge pull request #62645 from zdover23/wip-doc-2025-04-03-backport-62643-to-reef

reef: doc/cephadm: s/confg/config/

3 months agodoc/cephadm: s/confg/config/ 62645/head
Zac Dover [Thu, 3 Apr 2025 03:23:30 +0000 (13:23 +1000)]
doc/cephadm: s/confg/config/

Correct a misspelled command.

Re: https://github.com/ceph/ceph/pull/62367#discussion_r2025832937

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit 0f4d9a9ea29f12333609045928499963b37ba5f3)

3 months agoMerge pull request #62621 from zdover23/wip-doc-2025-04-02-backport-62617-to-reef
Anthony D'Atri [Wed, 2 Apr 2025 15:32:06 +0000 (11:32 -0400)]
Merge pull request #62621 from zdover23/wip-doc-2025-04-02-backport-62617-to-reef

reef: doc/rados/configuration: Correct admonition in ceph-conf.rst

3 months agoosd/scrub: discard repair_oinfo_oid() 62569/head
Ronen Friedman [Thu, 30 Jan 2025 09:27:58 +0000 (03:27 -0600)]
osd/scrub: discard repair_oinfo_oid()

repair_oinfo_oid(), called every scrub, has a very specific
functionality: fix the object ID specified in the Object Info
attribute, if different from the ID of the owning object.

This fix was added in 2017, as a response to a unique failure
scenario that was observed in Sepia - probably following a
filesystem bug. See https://tracker.ceph.com/issues/18409 &
https://tracker.ceph.com/issues/20471.

The limited functionality of repair_oinfo_oid() -
only repairing this one specific issue, and only if the OI_ATTR
exists and is decodable - does not justify the overhead of
running it every scrub.

(cherry picked from commit aa22f19831731185e3c115a2b4e5603e8ef2634f)
Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
3 months agoMerge pull request #62606 from cbodley/wip-70754-reef
Casey Bodley [Wed, 2 Apr 2025 13:46:54 +0000 (09:46 -0400)]
Merge pull request #62606 from cbodley/wip-70754-reef

reef: cephfs-top, qa: Remove unnecessary global statements in tests

Reviewed-by: Ronen Friedman <rfriedma@redhat.com>
3 months agodoc/rados/configuration: Correct admonition in ceph-conf.rst 62621/head
Anthony D'Atri [Tue, 1 Apr 2025 23:19:18 +0000 (19:19 -0400)]
doc/rados/configuration: Correct admonition in ceph-conf.rst

Signed-off-by: Anthony D'Atri <anthonyeleven@users.noreply.github.com>
(cherry picked from commit 5ff8e3e4381764822383ba769b2c4e73876fbc0c)

3 months agocephfs-top: Removes unused `global` statements 62606/head
Kefu Chai [Sun, 30 Mar 2025 03:59:12 +0000 (11:59 +0800)]
cephfs-top: Removes unused `global` statements

Recent flake8 runs were failing with:
```
py3: flake8==7.2.0,mccabe==0.7.0,pip==25.0.1,pycodestyle==2.13.0,pyflakes==3.3.0,setuptools==75.8.0,wheel==0.45.1
py3: commands[0] /home/jenkins-build/build/workspace/ceph-pull-requests/src/tools/cephfs/top> flake8 --ignore=W503 --max-line-length=100 cephfs-top
cephfs-top:344:9: F824 `global fs_list` is unused: name is never assigned in scope
cephfs-top:466:13: F824 `global current_states` is unused: name is never assigned in scope
cephfs-top:872:9: F824 `global metrics_dict` is unused: name is never assigned in scope
cephfs-top:872:9: F824 `global current_states` is unused: name is never assigned in scope
cephfs-top:911:9: F824 `global fs_list` is unused: name is never assigned in scope
cephfs-top:981:9: F824 `global current_states` is unused: name is never assigned in scope
cephfs-top:1126:13: F824 `global current_states` is unused: name is never assigned in scope
py3: exit 1 (0.77 seconds) /home/jenkins-build/build/workspace/ceph-pull-requests/src/tools/cephfs/top> flake8 --ignore=W503 --max-line-length=100 cephfs-top pid=2309605
  py3: FAIL code 1 (8.15=setup[7.38]+cmd[0.77] seconds)
  evaluation failed :( (8.24 seconds)
```

Since these variables are only being referenced and not assigned within
their scopes, the `global` declarations are unnecessary and can be
safely removed. This change:

- Removes all flagged `global` statements
- Fixes the failing flake8 checks in the CI pipeline
- Maintains the original code behavior as variable references still work without the `global` keyword

The `global` keyword is only needed when assigning to global variables
within a function scope, not when simply referencing them.

Signed-off-by: Kefu Chai <tchaikov@gmail.com>
(cherry picked from commit 39b262f7a6e1d69dc96255a1df62a68297d3f931)

3 months agoqa: Remove unnecessary global statements in tests
Kefu Chai [Sun, 30 Mar 2025 03:48:28 +0000 (11:48 +0800)]
qa: Remove unnecessary global statements in tests

Removes unused `global` statements from Python test files to fix flake8
F824 errors.

Recent flake8 runs were failing with:

```
./tasks/radosgw_admin.py:330:5: F824 `global log` is unused: name is never assigned in scope
./workunits/dencoder/test_readable.py:99:5: F824 `global incompat_paths` is unused: name is never assigned in scope
./workunits/dencoder/test_readable.py:164:5: F824 `global backward_compat` is unused: name is never assigned in scope
./workunits/dencoder/test_readable.py:165:5: F824 `global fast_shouldnt_skip` is unused: name is never assigned in scope
```

Since these variables are only being referenced and not assigned within
their scopes, the `global` declarations are unnecessary and can be
safely removed. This change:

- Removes all flagged `global` statements
- Fixes the failing flake8 checks in the CI pipeline
- Maintains the original code behavior as variable references still work
  without the `global` keyword

The `global` keyword is only needed when assigning to global variables
within a function scope, not when simply referencing them.

Signed-off-by: Kefu Chai <tchaikov@gmail.com>
(cherry picked from commit bcc275f98cb26c9432c320cd24ee93ff629e0574)

3 months agoMerge pull request #62503 from zdover23/wip-doc-2025-03-26-backport-62459-to-reef
Zac Dover [Thu, 27 Mar 2025 14:33:30 +0000 (00:33 +1000)]
Merge pull request #62503 from zdover23/wip-doc-2025-03-26-backport-62459-to-reef

reef: src/common: add guidance for deep-scrubbing ratio warning

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
3 months agoMerge pull request #61987 from zdover23/wip-doc-2025-02-25-backport-61984-to-reef
Zac Dover [Thu, 27 Mar 2025 12:41:57 +0000 (22:41 +1000)]
Merge pull request #61987 from zdover23/wip-doc-2025-02-25-backport-61984-to-reef

reef: doc/rgw: clarify path-style vs virtual-hosted-style access

Reviewed-by: Casey Bodley <cbodley@redhat.com>
Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
3 months agoMerge pull request #61818 from zdover23/wip-doc-2025-02-14-backport-61688-to-reef
Zac Dover [Thu, 27 Mar 2025 12:40:47 +0000 (22:40 +1000)]
Merge pull request #61818 from zdover23/wip-doc-2025-02-14-backport-61688-to-reef

reef: doc/dev/release-process.rst: release builds cannot build containers

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
3 months agoMerge pull request #61654 from zdover23/wip-doc-2025-02-05-backport-61254-to-reef
Zac Dover [Thu, 27 Mar 2025 12:39:58 +0000 (22:39 +1000)]
Merge pull request #61654 from zdover23/wip-doc-2025-02-05-backport-61254-to-reef

reef: doc/rados/operations/stretch-mode: Improve doc

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
3 months agoMerge pull request #61665 from zdover23/wip-doc-2025-02-06-backport-61616-to-reef
Zac Dover [Thu, 27 Mar 2025 12:39:20 +0000 (22:39 +1000)]
Merge pull request #61665 from zdover23/wip-doc-2025-02-06-backport-61616-to-reef

reef: doc/cephadm: clarify "Monitoring OSD State"

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: Sayalee Raut <saraut@redhat.com>
3 months agoMerge pull request #61615 from zdover23/wip-doc-2025-02-02-backport-61613-to-reef
Zac Dover [Thu, 27 Mar 2025 12:35:07 +0000 (22:35 +1000)]
Merge pull request #61615 from zdover23/wip-doc-2025-02-02-backport-61613-to-reef

reef: doc/architecture: remove sentence

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
3 months agoMerge pull request #62291 from zdover23/wip-doc-2025-03-14-backport-62258-to-reef
Zac Dover [Thu, 27 Mar 2025 12:33:30 +0000 (22:33 +1000)]
Merge pull request #62291 from zdover23/wip-doc-2025-03-14-backport-62258-to-reef

reef: doc/rados/configuration/bluestore-config-ref: Fix lowercase typos

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
3 months agoreef: qa/cephfs: switch to ubuntu 22.04 for stock kernel testing
Venky Shankar [Tue, 25 Mar 2025 13:24:08 +0000 (18:54 +0530)]
reef: qa/cephfs: switch to ubuntu 22.04 for stock kernel testing

This is for reef only since we don't have rhel8 images (which results
in failure to schedule fs suite run), so switch to using ubuntu 22.04.

Signed-off-by: Venky Shankar <vshankar@redhat.com>
(cherry picked from commit ccd60c09424fe7f5c4d4e364871c7be091d3c0d7)

3 months agoceph-volume: allow zapping partitions on multipath devices
Guillaume Abrioux [Fri, 7 Mar 2025 16:23:02 +0000 (17:23 +0100)]
ceph-volume: allow zapping partitions on multipath devices

ceph-volume refuses to zap a device if it is a partition on a multipath
device due to an overly strict condition. This change ensures that only
full mapper devices (excluding partitions) are blocked from being zapped,
allowing partitions on multipath devices to be processed correctly.

Fixes: https://tracker.ceph.com/issues/70363
Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>
(cherry picked from commit 16daa6a687c0536324b06536af12ce4e8fc04087)
(cherry picked from commit 29b6bcda3f69f594a751ec92b6985b3dfdd4d56b)

3 months agoMerge pull request #62492 from vshankar/wip-rm-rhel_8
Yuri Weinstein [Wed, 26 Mar 2025 21:17:51 +0000 (14:17 -0700)]
Merge pull request #62492 from vshankar/wip-rm-rhel_8

reef: qa/cephfs: switch to ubuntu 22.04 for stock kernel testing

Reviewed-by: Christopher Hoffman <choffman@redhat.com>
3 months agoMerge pull request #62345 from phlogistonjohn/wip-70153-reef
Adam King [Wed, 26 Mar 2025 19:40:17 +0000 (15:40 -0400)]
Merge pull request #62345 from phlogistonjohn/wip-70153-reef

reef: container: small container image improvements

Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: Dan Mick <dan.mick@redhat.com>
3 months agoreef: qa/cephfs: switch to ubuntu 22.04 for stock kernel testing 62492/head
Venky Shankar [Tue, 25 Mar 2025 13:24:08 +0000 (18:54 +0530)]
reef: qa/cephfs: switch to ubuntu 22.04 for stock kernel testing

This is for reef only since we don't have rhel8 images (which results
in failure to schedule fs suite run), so switch to using ubuntu 22.04.

Signed-off-by: Venky Shankar <vshankar@redhat.com>
3 months agoMerge pull request #62178 from guits/wip-70368-reef
Guillaume Abrioux [Wed, 26 Mar 2025 15:16:12 +0000 (16:16 +0100)]
Merge pull request #62178 from guits/wip-70368-reef

reef: ceph-volume: allow zapping partitions on multipath devices

3 months agoMerge pull request #62468 from ljflores/wip-reef-62422
Laura Flores [Wed, 26 Mar 2025 14:49:15 +0000 (09:49 -0500)]
Merge pull request #62468 from ljflores/wip-reef-62422

reef: PendingReleaseNotes; doc/rados/operations: document "rm-pg-upmap-primary-{all}" commands

3 months agoMerge pull request #62364 from athanatos/wip-70536-reef
SrinivasaBharathKanta [Tue, 25 Mar 2025 23:19:41 +0000 (04:49 +0530)]
Merge pull request #62364 from athanatos/wip-70536-reef

reef: osd: fix osd mclock queue item leak

3 months agosrc/common: add guidance for deep-scrubbing ratio warning 62503/head
Zac Dover [Mon, 24 Mar 2025 12:26:11 +0000 (22:26 +1000)]
src/common: add guidance for deep-scrubbing ratio warning

Add an explanation of how to set the value of
"mon_warn_pg_not_deep_scrubbed_ratio" to the confval definition of that
variable. Although this variable contains the string "mon", it is set on
the Manager. I have added a note to direct users to set this value on
the Manager.

This issue was pointed out by Petr Tlapa on Slack in late March of 2025.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit aeef59a50ee31072648ba0c7436b6522137614cd)

3 months agoPendingReleaseNotes; doc/rados/operations: document "rm-pg-upmap-primary-{all}" commands 62468/head
Laura Flores [Thu, 20 Mar 2025 21:19:30 +0000 (16:19 -0500)]
PendingReleaseNotes; doc/rados/operations: document "rm-pg-upmap-primary-{all}" commands

Added a release note for `ceph osd rm-pg-upmap-primary` and documentation to
complement #62190.

Signed-off-by: Laura Flores <lflores@ibm.com>
(cherry picked from commit 4dee7917bf010dc6d8e24f0ef437bd8a41c355c9)

*Note: PendingReleaseNotes changed to put the release note under v18.2.5.

3 months agoMerge pull request #62444 from zdover23/wip-doc-2025-03-22-backport-62422-to-reef
Anthony D'Atri [Sat, 22 Mar 2025 14:54:04 +0000 (10:54 -0400)]
Merge pull request #62444 from zdover23/wip-doc-2025-03-22-backport-62422-to-reef

reef: doc/cephadm/services: Add mention of --zap for OSD removal

3 months agodoc/cephadm/services: Add mention of --zap for OSD removal 62444/head
Anthony D'Atri [Fri, 21 Mar 2025 21:59:02 +0000 (17:59 -0400)]
doc/cephadm/services: Add mention of --zap for OSD removal

Signed-off-by: Anthony D'Atri <anthonyeleven@users.noreply.github.com>
(cherry picked from commit 20b621fceb865403e19e1d7876f81e670829ff59)

3 months agoMerge pull request #61653 from aclamk/wip-aclamk-pereman2-bluefs-expand-bug-reef
SrinivasaBharathKanta [Fri, 21 Mar 2025 21:23:25 +0000 (02:53 +0530)]
Merge pull request #61653 from aclamk/wip-aclamk-pereman2-bluefs-expand-bug-reef

[reef] os/bluestore: fix _extend_log seq advance

3 months agoMerge pull request #62428 from zdover23/wip-doc-2025-03-21-backport-62426-to-reef
Anthony D'Atri [Fri, 21 Mar 2025 16:07:05 +0000 (12:07 -0400)]
Merge pull request #62428 from zdover23/wip-doc-2025-03-21-backport-62426-to-reef

reef: doc/cephadm/services: Correct indentation in osd.rst

3 months agoMerge pull request #60901 from mohit84/wip-69086-reef
SrinivasaBharathKanta [Fri, 21 Mar 2025 14:12:15 +0000 (19:42 +0530)]
Merge pull request #60901 from mohit84/wip-69086-reef

reef: TEST_backfill_grow fails after finding "num_bytes mismatch" in osd log

3 months agoMerge pull request #57065 from batrick/wip-65623-reef
SrinivasaBharathKanta [Fri, 21 Mar 2025 14:11:49 +0000 (19:41 +0530)]
Merge pull request #57065 from batrick/wip-65623-reef

reef: mgr: process map before notifying clients

3 months agoMerge pull request #62212 from NitzanMordhai/wip-70379-reef
SrinivasaBharathKanta [Fri, 21 Mar 2025 14:06:59 +0000 (19:36 +0530)]
Merge pull request #62212 from NitzanMordhai/wip-70379-reef

reef: LogMonitor: set no_reply for forward MLog commands

3 months agoMerge pull request #62220 from ifed01/wip-ifed-limit-discard-qlen-reef
NitzanMordhai [Fri, 21 Mar 2025 08:51:13 +0000 (10:51 +0200)]
Merge pull request #62220 from ifed01/wip-ifed-limit-discard-qlen-reef

reef: blk/KernelDevice: Introduce a cap on the number of pending discards

3 months agoLogMonitor: set no_reply for forward MLog commands 62212/head
Nitzan Mordechai [Thu, 20 Feb 2025 07:37:45 +0000 (07:37 +0000)]
LogMonitor: set no_reply for forward MLog commands

On streach mod clusters we can see slow ops when
removing and adding osds with --zap --force when osds
connected to peon monitor and forwarding the MLog to leader.
the no_reply is set only when we are connected to the leader,
this fix will add also the other option - so no_reply set anyway.

Fixes: https://tracker.ceph.com/issues/54489
Signed-off-by: Nitzan Mordechai <nmordech@redhat.com>
(cherry picked from commit 56cb53db96380c917f4192c31cbdd0f0f3ef1fe9)

3 months agoMerge pull request #62210 from NitzanMordhai/wip-70382-reef
NitzanMordhai [Fri, 21 Mar 2025 08:42:06 +0000 (10:42 +0200)]
Merge pull request #62210 from NitzanMordhai/wip-70382-reef

reef: suites/rados: cache tier deprecated, no need to keep the tests for it

3 months agoMerge pull request #62145 from ifed01/wip-ifed-no-compression-plugin-reload-reef
NitzanMordhai [Fri, 21 Mar 2025 08:41:29 +0000 (10:41 +0200)]
Merge pull request #62145 from ifed01/wip-ifed-no-compression-plugin-reload-reef

reef: os/bluestore: do cache locally compressor engines ever used

3 months agodoc/cephadm/services: Correct indentation in osd.rst 62428/head
Anthony D'Atri [Fri, 21 Mar 2025 03:29:57 +0000 (23:29 -0400)]
doc/cephadm/services: Correct indentation in osd.rst

Signed-off-by: Anthony D'Atri <anthonyeleven@users.noreply.github.com>
(cherry picked from commit 7b5f73f4c17acc7cd0aab87db528e5232c8f05e1)

3 months agoos/bluestore: fix _extend_log seq advance 61653/head
Pere Diaz Bou [Fri, 29 Sep 2023 11:17:03 +0000 (13:17 +0200)]
os/bluestore: fix _extend_log seq advance

when extending the log, the sequence was left on a bad state because it would first create a transaction to update with the current seq number but leave the "real" transaction with the same sequence number which should be `extend_log_transaction.seq + 1`.

Signed-off-by: Pere Diaz Bou <pdiabou@redhat.com>
(cherry picked from commit 63f0a0df14c9d8e68be61e374438bc75cef45a1f)

Fixes: https://tracker.ceph.com/issues/69764
3 months agoos/bluestore: test _extend_log sequence advance
Pere Diaz Bou [Fri, 29 Sep 2023 11:16:42 +0000 (13:16 +0200)]
os/bluestore: test _extend_log sequence advance

Signed-off-by: Pere Diaz Bou <pdiabou@redhat.com>
(cherry picked from commit 8c68a503f1d87d4efedab53dd68d7c79529d27aa)

3 months agoMerge pull request #62405 from zdover23/wip-doc-2025-03-20-backport-62394-to-reef
Zac Dover [Thu, 20 Mar 2025 12:07:53 +0000 (22:07 +1000)]
Merge pull request #62405 from zdover23/wip-doc-2025-03-20-backport-62394-to-reef

reef: doc/radosgw /notifications: fix topic details

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
4 months agodoc/radosgw /notifications: fix topic details 62405/head
Laimis Juzeliūnas [Wed, 19 Mar 2025 12:33:04 +0000 (14:33 +0200)]
doc/radosgw /notifications: fix topic details

This commit fixes documentation about many-to-many topic relationship for notifications. The current sentence states the same fact twice instead of clarifying.

Fixes: https://tracker.ceph.com/issues/64956
Signed-off-by: Laimis Juzeliunas <laimis.juzeliunas@oxylabs.io>
(cherry picked from commit 058b95abe62893c5c13f02cdf7dd438b0a95a128)

4 months agoMerge pull request #62339 from phlogistonjohn/wip-70506-reef
Adam King [Wed, 19 Mar 2025 18:05:41 +0000 (14:05 -0400)]
Merge pull request #62339 from phlogistonjohn/wip-70506-reef

reef: build-with-container: two small fixes

Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: Dan Mick <dan.mick@redhat.com>
4 months agoMerge pull request #62380 from zdover23/wip-doc-2025-03-19-backport-62367-to-reef
Anthony D'Atri [Wed, 19 Mar 2025 14:11:33 +0000 (10:11 -0400)]
Merge pull request #62380 from zdover23/wip-doc-2025-03-19-backport-62367-to-reef

reef: doc/cephadm: Add PG autoscaler advice to upgrade.rst

4 months agoMerge pull request #62354 from aaSharma14/wip-70523-reef
Aashish Sharma [Wed, 19 Mar 2025 07:17:41 +0000 (12:47 +0530)]
Merge pull request #62354 from aaSharma14/wip-70523-reef

reef: mgr/dashboard: When configuring the RGW Multisite endpoints from the UI allow FQDN(Not only IP)

Reviewed-by: Naman Munet <naman.munet@ibm.com>
4 months agodoc/cephadm: Add PG autoscaler advice to upgrade.rst 62380/head
Anthony D'Atri [Tue, 18 Mar 2025 19:35:34 +0000 (15:35 -0400)]
doc/cephadm: Add PG autoscaler advice to upgrade.rst

Signed-off-by: Anthony D'Atri <anthonyeleven@users.noreply.github.com>
(cherry picked from commit ee69f52193996d528410a71c2c67fc27b47dee31)

4 months agoMerge pull request #62376 from zdover23/wip-doc-2025-03-19-backport-62371-to-reef
Zac Dover [Wed, 19 Mar 2025 05:30:50 +0000 (15:30 +1000)]
Merge pull request #62376 from zdover23/wip-doc-2025-03-19-backport-62371-to-reef

reef: doc/dev/developer_guide/essentials: update mailing lists

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
4 months agodoc/dev/developer_guide/essentials: update mailing lists 62376/head
Laimis Juzeliūnas [Tue, 18 Mar 2025 22:24:50 +0000 (00:24 +0200)]
doc/dev/developer_guide/essentials: update mailing lists

Update information for subscribing to Ceph development mailing lists as current documentation is outdated.

Fixes: https://tracker.ceph.com/issues/64580
Signed-off-by: Laimis Juzeliunas <laimis.juzeliunas@oxylabs.io>
(cherry picked from commit e7bf607269335ac40d91cb4b8f265064ffaac402)

4 months agoMerge pull request #62191 from ljflores/wip-reef-backport-69760
Laura Flores [Wed, 19 Mar 2025 03:13:30 +0000 (22:13 -0500)]
Merge pull request #62191 from ljflores/wip-reef-backport-69760

4 months agoMerge pull request #62369 from phlogistonjohn/jjm-reef-more-type-ignore
Laura Flores [Wed, 19 Mar 2025 02:32:07 +0000 (21:32 -0500)]
Merge pull request #62369 from phlogistonjohn/jjm-reef-more-type-ignore

4 months agoreef: mgr/diskprediction_local: avoid more mypy errors 62369/head
John Mulligan [Tue, 18 Mar 2025 19:56:25 +0000 (15:56 -0400)]
reef: mgr/diskprediction_local: avoid more mypy errors

Similar to c4111033172db28c4737e8438f27901811919ce4 this patch
suppresses mypy errors in the diskprediction_local mgr module.
I probably put the magic comment on more lines than needed but
mypy does not have a block-comment method to suppress checking
for just a region of code today.
This patch is not a backport as the issue is only impacting
reef CI jobs and so it is applied directly to the reef branch.

Signed-off-by: John Mulligan <phlogistonjohn@asynchrono.us>
4 months agodmclock/.../dmclock_server: do not clean clients with requests 62364/head
Samuel Just [Thu, 13 Feb 2025 04:16:47 +0000 (04:16 +0000)]
dmclock/.../dmclock_server: do not clean clients with requests

PriorityQueueBase::do_clean() shouldn't remove ClientRec instances which
still have queued requests.  Otherwise, very low priority clients might
end up having requests actually lost, which shouldn't be possible.

In the OSD, this resulted in PGRecovery items being lost if queued with
background_best_effort while expanding a cluster.  Such items can
legitimately sit in the queue for a long period of time as they
represent background data migration which is allowed to be starved by an
aggressive client workload.  Dropping the items broke an assumption in
the OSD that all items enqueued would eventually be dequeued resulting
in resources being leaked.

Fixes: https://tracker.ceph.com/issues/61594
Signed-off-by: Samuel Just <sjust@redhat.com>
(cherry picked from commit 35cf0f6c1355ff420c431340ac36d9a3c029d6c2)

4 months agotest/osd/TestMClockScheduler: TestMultiClientOrderedEnqueueDequeue use client class
Samuel Just [Thu, 13 Feb 2025 04:06:49 +0000 (04:06 +0000)]
test/osd/TestMClockScheduler: TestMultiClientOrderedEnqueueDequeue use client class

Otherwise, scheduler will yield a delay eventually.

Signed-off-by: Samuel Just <sjust@redhat.com>
(cherry picked from commit 0938a592e922c284cb939561dd1bd1589e154ae3)

4 months agotest/osd/TestMClockScheduler: create_item should pass prio < cutoff
Samuel Just [Thu, 13 Feb 2025 03:54:28 +0000 (03:54 +0000)]
test/osd/TestMClockScheduler: create_item should pass prio < cutoff

Cutoff is set to 12, so let's pass something < 12 rather than 12.
Comments in some tests suggest that the intent is for create_item
to create things in the mclock queue rather than the high_queue.

Signed-off-by: Samuel Just <sjust@redhat.com>
(cherry picked from commit 5562ecbcdb1b0ae43145cb7ea9c7a3c76e6449f4)

4 months agotest/osd/TestMClockScheduler: add test for very slow dequeue
Samuel Just [Thu, 13 Feb 2025 02:55:27 +0000 (02:55 +0000)]
test/osd/TestMClockScheduler: add test for very slow dequeue

Related: https://tracker.ceph.com/issues/61594
Signed-off-by: Samuel Just <sjust@redhat.com>
(cherry picked from commit b35589f7eb39e6bfabe7df1c55281f41925eca61)

4 months agoosd/scheduler/mClockScheduler: add constructor to specify idle/erase/check times
Samuel Just [Thu, 13 Feb 2025 03:45:42 +0000 (19:45 -0800)]
osd/scheduler/mClockScheduler: add constructor to specify idle/erase/check times

Useful for unit test, default times are *really* long.

Signed-off-by: Samuel Just <sjust@redhat.com>
(cherry picked from commit 8b7b0463792d85575996e65752087a25c9a3a65a)

4 months agoosd/scheduler/mClockScheduler: move constructor to header
Samuel Just [Thu, 13 Feb 2025 03:48:18 +0000 (19:48 -0800)]
osd/scheduler/mClockScheduler: move constructor to header

Signed-off-by: Samuel Just <sjust@redhat.com>
(cherry picked from commit 35b6176d90e1c48ba899b9ed0fd807ad8b290e0d)

4 months agosrc/dmclock: remove superfluous license files
Samuel Just [Sat, 15 Feb 2025 01:23:12 +0000 (01:23 +0000)]
src/dmclock: remove superfluous license files

Signed-off-by: Samuel Just <sjust@redhat.com>
(cherry picked from commit c95539ec3704c2372b4152bc8af5223a2fe7c55d)

4 months agosrc/dmclock: copy dmclock submodule into ceph.git
Samuel Just [Thu, 13 Feb 2025 04:15:25 +0000 (20:15 -0800)]
src/dmclock: copy dmclock submodule into ceph.git

Copies dmclock.git at e4ccdcfa828 into ceph.git.

Signed-off-by: Samuel Just <sjust@redhat.com>
(cherry picked from commit c625008dae9fb067e5f567031d5c8c355f2672c2)

4 months agoscript: ensure curl is always available in build containers 62339/head
John Mulligan [Thu, 13 Mar 2025 11:59:42 +0000 (07:59 -0400)]
script: ensure curl is always available in build containers

Ensure that curl is installed in all build containers regardless of
ceph's dependencies or other factors. This allows us to use curl in
any subsequent build steps/scripts.

Fixes: https://tracker.ceph.com/issues/70451
Signed-off-by: John Mulligan <jmulligan@redhat.com>
(cherry picked from commit b4e11f75bfa76036b9109485aa1cb4f9d633c8a2)

4 months agomgr/dashboard: When configuring the RGW Multisite endpoints from the UI allow FQDN... 62354/head
Aashish Sharma [Tue, 26 Nov 2024 09:56:38 +0000 (15:26 +0530)]
mgr/dashboard: When configuring the RGW Multisite endpoints from the UI allow FQDN(Not only IP)

When configuring the RGW Multisite endpoints from the UI allow FQDN, at the moment when using a FQDN it's not allowed

Fixes: https://tracker.ceph.com/issues/69055
Signed-off-by: Aashish Sharma <aasharma@redhat.com>
(cherry picked from commit 9f3619af9ae911955916195084d225928d4b2f43)

Conflicts:
src/pybind/mgr/dashboard/frontend/package-lock.json (conflicts
with typescript package version, kept the existing one)
src/pybind/mgr/dashboard/frontend/package.json (conflicts with
typescript package version, kept the existing one)
src/pybind/mgr/dashboard/frontend/src/app/ceph/rgw/rgw-multisite-migrate/rgw-multisite-migrate.component.ts (conflicts with automated system user creation in main)
src/pybind/mgr/dashboard/frontend/src/app/shared/forms/cd-validators.ts (conflicts with oauthAddressTest validator)

4 months agomon, osd: add command to remove invalid pg-upmap-primary entries 62191/head
Laura Flores [Fri, 7 Mar 2025 06:22:00 +0000 (06:22 +0000)]
mon, osd: add command to remove invalid pg-upmap-primary entries

The current rm-pg-upmap-primary command checks that the pgid exists
in the pgmap before continuing to remove it. Due to https://tracker.ceph.com/issues/66867,
some invalid pg-upmap-primary entires may exist for pools that have been removed.
Currently, these mappings are impossible to remove since the pgids no longer
exist in the pgmap.

This new command, rm-pg-upmap-primary-all, allows users the ability to remove
any and all pg-upmap-primary mappings in the osdmap at once, which includes
valid and invalid entries.

This command may also be helpful when upgrading from versions where users
are plagued by https://tracker.ceph.com/issues/61948. Users may use an upgraded
mon to remove all pg-upmap-primray entries (valid and invalid) so they continue
to upgrade to a safe version.

See manual testing for this patch here: https://tracker.ceph.com/issues/67179#note-12

Fixes: https://tracker.ceph.com/issues/67179
Fixes: https://tracker.ceph.com/issues/69760
Signed-off-by: Laura Flores <lflores@ibm.com>
(cherry picked from commit 6e9e2033bf0f4779bdfac9a3a4f29115459c8c0e)

Conflicts:
src/osd/OSDMap.cc
src/osd/OSDMap.h
The `rm_all_upmap_prims` per pool function is part of
https://github.com/ceph/ceph/commit/2953db8b58535605882dff2e1d4ff36e6075e122, which
is related to the "size optimized" read balancer feature that
is only included >= Squid.

4 months agocontainer: stop deleting python generated files 62345/head
John Mulligan [Sat, 8 Feb 2025 20:03:32 +0000 (15:03 -0500)]
container: stop deleting python generated files

Stop deleting the python generated files (pyc, pyo) that RPM packages
have installed. At some point in the misty past someone thought it would
be a good idea to remove these. This practice got carried over to the
new in-tree Containerfile. IMO this is probably due to a thought to save
space, but if that's the case then the RPMs should not be carrying them
either. Plus, not having them is going to slow python down as it needs
to compile every py file that gets loaded. Let's be consistent: if the
RPMs have pyc and pyo files then they should be in the image - if
they're bad or too big they should not be in the RPMs either, right?

This has the pleasant side effect of making `rpm -Va` inside the image
happier.

Fixes: https://tracker.ceph.com/issues/69869
Signed-off-by: John Mulligan <jmulligan@redhat.com>
(cherry picked from commit 0f178e61de52c6a0b757f8f6937340c002e66c73)

4 months agocontainer: avoid installing docs using the dnf configuration
John Mulligan [Sat, 8 Feb 2025 19:51:23 +0000 (14:51 -0500)]
container: avoid installing docs using the dnf configuration

Avoid installing docs by using the dnf configuration tsflags parameter,
passing the nodocs flag. This tells dnf and rpm not to install
documentation, such as manpages. Stop installing the docs just to delete
them later with an `rm -rf` type command.  Now the docs don't get
installed in the first place, saving space, but the rpm is happy
(`rpm -Va` no longer shows docs as 'missing').

Fixes: https://tracker.ceph.com/issues/69868
Signed-off-by: John Mulligan <jmulligan@redhat.com>
(cherry picked from commit bf9b8d36aba3c7a8c7a3ecfc4d00359985e745b6)

4 months agoscript/build-with-container: fix incorrect build invocation for tests
John Mulligan [Thu, 13 Mar 2025 11:48:28 +0000 (07:48 -0400)]
script/build-with-container: fix incorrect build invocation for tests

The `tests` build target was set to build the "normal" target but it
needs to build the tests.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
(cherry picked from commit 6b635d3ec080c9514442e2617d1ab868050a39d9)

4 months agoMerge pull request #62087 from aaSharma14/wip-70252-reef
Aashish Sharma [Mon, 17 Mar 2025 05:52:29 +0000 (11:22 +0530)]
Merge pull request #62087 from aaSharma14/wip-70252-reef

reef: mgr: fix subuser creation via dashboard

Reviewed-by: Nizamudeen A <nia@redhat.com>
4 months agoMerge branch 'reef' into wip-70147-reef 62152/head
Yite Gu [Mon, 17 Mar 2025 02:43:55 +0000 (10:43 +0800)]
Merge branch 'reef' into wip-70147-reef

Signed-off-by: Yite Gu <yitegu0@gmail.com>
4 months agoOSDMonitor: exclude destroyed OSDs from "ceph node ls" output 62326/head
Nitzan Mordechai [Tue, 11 Mar 2025 15:37:07 +0000 (15:37 +0000)]
OSDMonitor: exclude destroyed OSDs from "ceph node ls" output

After destroying an OSD, the command "ceph node ls" still listed the destroyed OSD,
causing confusion. This fix ensures that the status of the OSD is checked before
adding it to the output, aligning its behavior with "ceph osd tree," which already
displays the correct status.

Fixes: https://tracker.ceph.com/issues/70400
Signed-off-by: Nitzan Mordechai <nmordech@redhat.com>
(cherry picked from commit 97c43c728dc8e702ce19bc14f6aa9a4790c8415b)

4 months agoMerge pull request #62321 from zdover23/wip-doc-2025-03-15-backport-62319-to-reef
Zac Dover [Sat, 15 Mar 2025 03:23:53 +0000 (13:23 +1000)]
Merge pull request #62321 from zdover23/wip-doc-2025-03-15-backport-62319-to-reef

reef: doc/rados/troubleshooting: Improve troubleshooting-pg.rst

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>