]> git.apps.os.sepia.ceph.com Git - ceph.git/log
ceph.git
7 months agoMerge pull request #60747 from mohit84/crimson_device_class
Matan Breizman [Tue, 26 Nov 2024 15:22:30 +0000 (17:22 +0200)]
Merge pull request #60747 from mohit84/crimson_device_class

crimson: Set device class during spawn of a crimson osd

Reviewed-by: Matan Breizman <mbreizma@redhat.com>
7 months agoMerge pull request #60736 from guits/fix-stale-bot
Guillaume Abrioux [Tue, 26 Nov 2024 07:28:07 +0000 (08:28 +0100)]
Merge pull request #60736 from guits/fix-stale-bot

.github/workflows: Do not exempt PRs with milestone

7 months agoMerge pull request #59897 from avanthakkar/note-cephfs-earmark
Rishabh Dave [Tue, 26 Nov 2024 06:26:38 +0000 (11:56 +0530)]
Merge pull request #59897 from avanthakkar/note-cephfs-earmark

doc: add pendingreleasenotes for cephfs subvolume earmarking feature

Reviewed-by: Rishabh Dave <ridave@redhat.com>
Reviewed-by: Neeraj Pratap Singh <neesingh@redhat.com>
7 months agoMerge pull request #60768 from VallariAg/wip-prometheus-tested-limits
afreen23 [Tue, 26 Nov 2024 05:25:50 +0000 (10:55 +0530)]
Merge pull request #60768 from VallariAg/wip-prometheus-tested-limits

monitoring: Add alert NVMeoFTooManyNamespaces

Reviewed-by: Afreen Misbah <afreen@ibm.com>
7 months agoMerge pull request #60702 from ceph/wip-yuriw-release-17.2.8-main
Yuri Weinstein [Tue, 26 Nov 2024 02:14:30 +0000 (18:14 -0800)]
Merge pull request #60702 from ceph/wip-yuriw-release-17.2.8-main

doc: 17.2.8 Release Notes

Reviewed-by: Casey Bodley <cbodley@redhat.com>
Reviewed-by: Ilya Dryomov <idryomov@redhat.com>
7 months agodoc/releases: fix some formatting for the 17.2.8 release notes 60702/head
Laura Flores [Fri, 15 Nov 2024 16:53:14 +0000 (10:53 -0600)]
doc/releases: fix some formatting for the 17.2.8 release notes

Signed-off-by: Laura Flores <lflores@ibm.com>
Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
7 months agoMerge pull request #60756 from ArbitCode/wip-raja-rgw-fix-68211
Casey Bodley [Mon, 25 Nov 2024 22:35:39 +0000 (17:35 -0500)]
Merge pull request #60756 from ArbitCode/wip-raja-rgw-fix-68211

rgw:Wrong x-rgw-quota-bucket-objects header when bucket quota is disabled

Reviewed-by: Casey Bodley <cbodley@redhat.com>
7 months agoMerge pull request #60813 from anthonyeleven/improve-subsystem
Zac Dover [Mon, 25 Nov 2024 13:54:34 +0000 (23:54 +1000)]
Merge pull request #60813 from anthonyeleven/improve-subsystem

doc/rados/troubleshooting: Improve log-and-debug.rst

Reviewed-by: Zac Dover <zac.dover@proton.me>
7 months agoMerge pull request #60818 from gbregman/main
Gil Bregman [Mon, 25 Nov 2024 12:09:43 +0000 (14:09 +0200)]
Merge pull request #60818 from gbregman/main

Add key encryption support to nvmeof configuration

7 months agopython-common/ceph/deployment: Add key encryption support to nvmeof configuration 60818/head
Gil Bregman [Mon, 25 Nov 2024 08:36:06 +0000 (10:36 +0200)]
python-common/ceph/deployment: Add key encryption support to nvmeof configuration
Fixes https://tracker.ceph.com/issues/69036

Signed-off-by: Gil Bregman <gbregman@il.ibm.com>
7 months agomgr/cephadm: Add key encryption support to nvmeof configuration
Gil Bregman [Mon, 25 Nov 2024 08:35:24 +0000 (10:35 +0200)]
mgr/cephadm: Add key encryption support to nvmeof configuration
Fixes https://tracker.ceph.com/issues/69036

Signed-off-by: Gil Bregman <gbregman@il.ibm.com>
7 months agodoc/rados/troubleshooting: Improve log-and-debug.rst 60813/head
Anthony D'Atri [Mon, 25 Nov 2024 01:24:18 +0000 (20:24 -0500)]
doc/rados/troubleshooting: Improve log-and-debug.rst

Signed-off-by: Anthony D'Atri <anthonyeleven@users.noreply.github.com>
7 months agoMerge pull request #60805 from zdover23/wip-doc-2024-11-23-cephadm-install-curl-based...
Zac Dover [Sun, 24 Nov 2024 12:15:53 +0000 (22:15 +1000)]
Merge pull request #60805 from zdover23/wip-doc-2024-11-23-cephadm-install-curl-based-installation

doc/cephadm: Improve "Curl-based Installation"

:Reviewd-by: Anthony D'Atri <anthony.datri@gmail.com>

7 months agoMerge pull request #60804 from zdover23/wip-doc-2024-11-23-cephadm-install
Zac Dover [Sun, 24 Nov 2024 12:15:35 +0000 (22:15 +1000)]
Merge pull request #60804 from zdover23/wip-doc-2024-11-23-cephadm-install

doc/cephadm: Clarify "Deploying a new Cluster"

Reviewed-by: Anthony D'Atri<anthony.datri@gmail.com>
7 months agodoc/cephadm: Improve "Curl-based Installation" 60805/head
Zac Dover [Sat, 23 Nov 2024 13:42:47 +0000 (23:42 +1000)]
doc/cephadm: Improve "Curl-based Installation"

Change the name of the section "curl-based installation" to "Using Curl to Install Cephadm", and improve the English in that section.

Signed-off-by: Zac Dover <zac.dover@proton.me>
7 months agodoc/cephadm: Clarify "Deploying a new Cluster" 60804/head
Zac Dover [Sat, 23 Nov 2024 12:32:13 +0000 (22:32 +1000)]
doc/cephadm: Clarify "Deploying a new Cluster"

Change the title of the section "Deploying a new Ceph cluster" to "Using
cephadm to Deploy a New Ceph Cluster". This is part of the initiative to
separate package-related documentation from container-based
documenation.

Signed-off-by: Zac Dover <zac.dover@proton.me>
7 months agoMerge pull request #60775 from cbodley/wip-rgw-access-key-active-json
Casey Bodley [Fri, 22 Nov 2024 21:41:33 +0000 (16:41 -0500)]
Merge pull request #60775 from cbodley/wip-rgw-access-key-active-json

rgw: RGWAccessKey::decode_json() preserves default value of 'active'

Reviewed-by: Shilpa Jagannath <smanjara@redhat.com>
7 months agoMerge pull request #58645 from vshankar/wip-omap-commit-header-size
Igor Golikov [Fri, 22 Nov 2024 18:52:06 +0000 (20:52 +0200)]
Merge pull request #58645 from vshankar/wip-omap-commit-header-size

mds: account for header size during omap commit

7 months agoMerge pull request #60488 from jmolmo/exporter_manage_sigterm
Juan Miguel Olmo [Fri, 22 Nov 2024 12:53:00 +0000 (13:53 +0100)]
Merge pull request #60488 from jmolmo/exporter_manage_sigterm

exporter: SIGTERM/SIGINT/SIGHUP management

7 months agorgw:Wrong x-rgw-quota-bucket-objects header when bucket quota is disabled 60756/head
Raja Sharma [Sun, 17 Nov 2024 08:58:24 +0000 (14:28 +0530)]
rgw:Wrong x-rgw-quota-bucket-objects header when bucket quota is disabled

This commit introduces to get quota settings if bucket quota is enabled

Fixes: https://tracker.ceph.com/issues/68211
Signed-off-by: Raja Sharma <raja@ibm.com>
7 months agoMerge pull request #60787 from ronen-fr/wip-rf-reservStat
Ronen Friedman [Fri, 22 Nov 2024 06:33:57 +0000 (08:33 +0200)]
Merge pull request #60787 from ronen-fr/wip-rf-reservStat

osd/scrub: fixing reservations dump format

Reviewed-by: Nitzan Mordechai <nmordech@redhat.com>
7 months agoMerge pull request #60636 from mohit84/issue_68585
Yuri Weinstein [Thu, 21 Nov 2024 23:52:00 +0000 (15:52 -0800)]
Merge pull request #60636 from mohit84/issue_68585

TEST_backfill_grow fails after finding "num_bytes mismatch" in osd log

Reviewed-by: Ronen Friedman <rfriedma@redhat.com>
Reviewed-by: Samuel Just <sjust@redhat.com>
7 months agoMerge pull request #60253 from MaxKellermann/includes
Yuri Weinstein [Thu, 21 Nov 2024 23:51:23 +0000 (15:51 -0800)]
Merge pull request #60253 from MaxKellermann/includes

common: add missing include, reduce header dependencies

Reviewed-by: Ronen Friedman <rfriedma@redhat.com>
Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
7 months agoMerge pull request #59524 from liangmingyuanneo/wip-standalone-test-pg-repair
Yuri Weinstein [Thu, 21 Nov 2024 23:49:56 +0000 (15:49 -0800)]
Merge pull request #59524 from liangmingyuanneo/wip-standalone-test-pg-repair

qa/standalone: bugfix for wait_for_scrub

Reviewed-by: Ronen Friedman <rfriedma@redhat.com>
7 months agoMerge pull request #60523 from ljflores/wip-tracker-68657
Laura Flores [Thu, 21 Nov 2024 23:10:40 +0000 (17:10 -0600)]
Merge pull request #60523 from ljflores/wip-tracker-68657

mgr/balancer: optimize 'balancer status detail'

7 months agoMerge pull request #60542 from adk3798/sign-generated-rgw-certs
Adam King [Thu, 21 Nov 2024 21:08:33 +0000 (16:08 -0500)]
Merge pull request #60542 from adk3798/sign-generated-rgw-certs

mgr/cephadm: sign generated RGW certs

Reviewed-by: Redouane Kachach <rkachach@ibm.com>
7 months agoMerge pull request #60303 from adk3798/handle-failed-daemon-start
Adam King [Thu, 21 Nov 2024 21:07:16 +0000 (16:07 -0500)]
Merge pull request #60303 from adk3798/handle-failed-daemon-start

cephadm: handle "systemctl start" failures during deployment better

Reviewed-by: John Mulligan <jmulligan@redhat.com>
7 months agoMerge pull request #60531 from Kushal-deb/fix_issue_2124175
Adam King [Thu, 21 Nov 2024 21:04:43 +0000 (16:04 -0500)]
Merge pull request #60531 from Kushal-deb/fix_issue_2124175

mgr/cephadm: Added fix for Modifying Profiles(add-setting and rm-setting) to allow multiple setting changes

Reviewed-by: Adam King <adking@redhat.com>
7 months agoMerge pull request #60587 from phlogistonjohn/jjm-more-py312-fixes
Adam King [Thu, 21 Nov 2024 21:03:45 +0000 (16:03 -0500)]
Merge pull request #60587 from phlogistonjohn/jjm-more-py312-fixes

various python 3.12 fixes

Reviewed-by: Adam King <adking@redhat.com>
7 months agoMerge pull request #60319 from baum/no_huge
baum [Thu, 21 Nov 2024 15:35:45 +0000 (17:35 +0200)]
Merge pull request #60319 from baum/no_huge

cephadm: support no huge pages for nvmeof spdk

7 months agoosd/scrub: fixing reservations dump format 60787/head
Ronen Friedman [Thu, 21 Nov 2024 06:00:30 +0000 (00:00 -0600)]
osd/scrub: fixing reservations dump format

Headers were missing in the faulty dump output. This commit
fixes the section type to 'object section'.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
7 months agocephadm/nvmeof: support no huge pages for nvmeof spdk 60319/head
Alexander Indenbaum [Tue, 15 Oct 2024 12:48:41 +0000 (12:48 +0000)]
cephadm/nvmeof: support no huge pages for nvmeof spdk

depends on: https://github.com/ceph/ceph-nvmeof/pull/898

Signed-off-by: Alexander Indenbaum <aindenba@redhat.com>
7 months agoMerge pull request #60668 from ronen-fr/wip-rf-mconf
Ronen Friedman [Wed, 20 Nov 2024 06:08:45 +0000 (08:08 +0200)]
Merge pull request #60668 from ronen-fr/wip-rf-mconf

osd/scrub: cache frequently used configuration parameters

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
Reviewed-by: Samuel Just <sjust@redhat.com>
7 months agoMerge pull request #60769 from gbregman/main
Adam King [Tue, 19 Nov 2024 20:07:07 +0000 (15:07 -0500)]
Merge pull request #60769 from gbregman/main

mgr/cephadm: Add resource limits to nvmeof configuration

Reviewed-by: Adam King <adking@redhat.com>
7 months agoMerge pull request #60588 from AlexonOliveiraRH/wip-doc-templates
Anthony D'Atri [Tue, 19 Nov 2024 19:49:58 +0000 (14:49 -0500)]
Merge pull request #60588 from AlexonOliveiraRH/wip-doc-templates

doc/cephadm/services: Update monitoring.rst to add additional templates

7 months agorgw: RGWAccessKey::decode_json() preserves default value of 'active' 60775/head
Casey Bodley [Tue, 19 Nov 2024 19:02:02 +0000 (14:02 -0500)]
rgw: RGWAccessKey::decode_json() preserves default value of 'active'

squid added the new "bool active" member that defaults to true, but
RGWAccessKey::decode_json() sets it to false when "active" isn't present

this is an issue for multisite when a squid zone replicates user
metadata from a master zone running reef

Fixes: https://tracker.ceph.com/issues/68985
Signed-off-by: Casey Bodley <cbodley@redhat.com>
7 months agoMerge pull request #60598 from xxhdx1985126/wip-68808
Matan Breizman [Tue, 19 Nov 2024 16:23:00 +0000 (18:23 +0200)]
Merge pull request #60598 from xxhdx1985126/wip-68808

crimson/osd/replicate_backend: add the skipped newly created clone object to the push queue after the clone request completes

Reviewed-by: Matan Breizman <mbreizma@redhat.com>
Reviewed-by: Samuel Just <sjust@redhat.com>
7 months agocommon: rm the cast operator of md_config_cacher_t 60668/head
Ronen Friedman [Thu, 14 Nov 2024 12:12:23 +0000 (06:12 -0600)]
common: rm the cast operator of md_config_cacher_t

as its replacement - the type-deduced operator() - is now in use,
and the resulting code is more readable and less error-prone.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
7 months agoosd: rm all uses of the cast operator of md_config_cacher_t
Ronen Friedman [Thu, 14 Nov 2024 12:01:50 +0000 (06:01 -0600)]
osd: rm all uses of the cast operator of md_config_cacher_t

as this interface is to be removed in the next commit.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
7 months agoosd/scrub: cache frequently used configuration parameters
Ronen Friedman [Wed, 6 Nov 2024 14:33:12 +0000 (08:33 -0600)]
osd/scrub: cache frequently used configuration parameters

using the md_config_cacher_t, which is a cache object that registers
itself to the config observer and caches the up-to-date configuration
value.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
7 months agocommon: a simple API to extract md_config_cacher_t cached value
Ronen Friedman [Wed, 13 Nov 2024 09:02:00 +0000 (03:02 -0600)]
common: a simple API to extract md_config_cacher_t cached value

Using a type-deduced operator() (i.e. my_opt() ), instead of the
existing ValueT() operator (which requires a type-matching cast).

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
7 months agomgr/cephadm: Add resource limits to nvmeof configuration 60769/head
Gil Bregman [Tue, 19 Nov 2024 11:51:37 +0000 (13:51 +0200)]
mgr/cephadm: Add resource limits to nvmeof configuration
Fixes https://tracker.ceph.com/issues/68967

Signed-off-by: Gil Bregman <gbregman@il.ibm.com>
7 months agopython-common/ceph/deployment: Add resource limits to nvmeof configuration
Gil Bregman [Tue, 19 Nov 2024 11:51:01 +0000 (13:51 +0200)]
python-common/ceph/deployment: Add resource limits to nvmeof configuration
Fixes https://tracker.ceph.com/issues/68967

Signed-off-by: Gil Bregman <gbregman@il.ibm.com>
7 months agoMerge pull request #60572 from xxhdx1985126/wip-68787 60757/head
Matan Breizman [Tue, 19 Nov 2024 11:24:23 +0000 (13:24 +0200)]
Merge pull request #60572 from xxhdx1985126/wip-68787

crimson/osd/backfill_state: don't try to do object backfills when primary_bi is empty

Reviewed-by: Matan Breizman <mbreizma@redhat.com>
7 months agocrimson: Set device class during spawn of a crimson osd 60747/head
Mohit Agrawal [Fri, 15 Nov 2024 05:10:09 +0000 (10:40 +0530)]
crimson: Set device class during spawn of a crimson osd

Implement a wrapper for different backend storage to
set device_class during spawn of a process.

Fixes: https://tracker.ceph.com/issues/66627
Signed-off-by: Mohit Agrawal <moagrawa@redhat.com>
7 months agoMerge pull request #60661 from ronen-fr/wip-rf-scheduledeep
Ronen Friedman [Tue, 19 Nov 2024 08:47:26 +0000 (10:47 +0200)]
Merge pull request #60661 from ronen-fr/wip-rf-scheduledeep

osd/scrub: fix 'schedule-deepscrub' test asok command

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
7 months agomonitoring: Add alert NVMeoFTooManyNamespaces 60768/head
Vallari Agrawal [Mon, 18 Nov 2024 05:50:00 +0000 (11:20 +0530)]
monitoring: Add alert NVMeoFTooManyNamespaces

NVMeoFTooManyNamespaces helps to alert user if total
number of namespaces across subsystems are more than
1024.

Change NVMeoFTooManySubsystems limit to 128 from 16.

Fixes: https://github.com/ceph/ceph-nvmeof/issues/948
Signed-off-by: Vallari Agrawal <vallari.agrawal@ibm.com>
7 months agoMerge pull request #60674 from oshrey16/bugfix-rgw-ps-list-topics-op
Yuval Lifshitz [Tue, 19 Nov 2024 06:20:00 +0000 (08:20 +0200)]
Merge pull request #60674 from oshrey16/bugfix-rgw-ps-list-topics-op

rgw/notification: Fix segmentation fault in RGWPSListTopicsOp::execute() and correct topic listing to use get_topics_v2

Reviewed-By: Joseph Mundackal <jmundackal@bloomberg.net>, Yuval Lifshitz <ylifshit@ibm.com>
7 months agoMerge pull request #60765 from zdover23/wip-doc-2024-11-19-os-recommendations
Anthony D'Atri [Tue, 19 Nov 2024 01:04:55 +0000 (20:04 -0500)]
Merge pull request #60765 from zdover23/wip-doc-2024-11-19-os-recommendations

doc/start: update os-recommendations.rst

7 months agodoc/start: update os-recommendations.rst 60765/head
Zac Dover [Tue, 19 Nov 2024 00:37:56 +0000 (10:37 +1000)]
doc/start: update os-recommendations.rst

Remove information about the operating systems that support Ceph's
official container images from the "Platforms" table in
doc/start/os-recommendations.rst and add that information to the (new)
table that shows the operating systems that support Ceph's official
container images.

Credit for this change should go to Enrico Bocchi, who noticed a
discrepancy that motivated it.

Signed-off-by: Zac Dover <zac.dover@proton.me>
7 months agoMerge pull request #60309 from linuxbox2/wip-cksum-68537
Casey Bodley [Mon, 18 Nov 2024 16:51:32 +0000 (11:51 -0500)]
Merge pull request #60309 from linuxbox2/wip-cksum-68537

rgw_cksum:  treat mpu checksums w/o part count as valid in CompleteMu…

Reviewed-by: Casey Bodley <cbodley@redhat.com>
7 months agoMerge pull request #60711 from soumyakoduri/wip-skoduri-dbstore
Soumya Koduri [Mon, 18 Nov 2024 15:01:36 +0000 (20:31 +0530)]
Merge pull request #60711 from soumyakoduri/wip-skoduri-dbstore

rgw: Update README for DBStore and Posix drivers

Reviewed-by: Casey Bodley <cbodley@redhat.com>
8 months agomds: account for header size during omap commit 58645/head
Venky Shankar [Wed, 17 Jul 2024 10:05:45 +0000 (10:05 +0000)]
mds: account for header size during omap commit

fnode_t is set in the omap header during directory commit
operation which isn't accounted when tracking operation
size.

Fixes: http://tracker.ceph.com/issues/67597
Signed-off-by: Venky Shankar <vshankar@redhat.com>
8 months agorgw/notification: fix segmentation fault and topic listing logic 60674/head
Oshrey Avraham [Mon, 18 Nov 2024 10:06:22 +0000 (12:06 +0200)]
rgw/notification: fix segmentation fault and topic listing logic

- Fixed a segmentation fault caused by a null bucket pointer in RGWPSListTopicsOp::execute()
- Corrected logic to use get_topics_v2 when supported, with fallback otherwise

Fixes: https://tracker.ceph.com/issues/68756
Signed-off-by: Oshrey Avraham <oshrey16@gmail.com>
8 months agodoc: add pendingreleasenotes for cephfs subvolume earmarking feature 59897/head
Avan Thakkar [Fri, 20 Sep 2024 09:09:48 +0000 (14:39 +0530)]
doc: add pendingreleasenotes for cephfs subvolume earmarking feature

Signed-off-by: Avan Thakkar <athakkar@redhat.com>
8 months ago.github/workflows: Do not exempt PRs with milestone 60736/head
Guillaume Abrioux [Thu, 14 Nov 2024 15:49:32 +0000 (16:49 +0100)]
.github/workflows: Do not exempt PRs with milestone

The stale bot doesn't close certain PRs, even if they have been inactive
for several years, because almost all of these PRs have a milestone

Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>
8 months agocrimson/osd/replicate_backend: add the skipped newly created clone 60598/head
Xuehan Xu [Mon, 4 Nov 2024 09:30:10 +0000 (17:30 +0800)]
crimson/osd/replicate_backend: add the skipped newly created clone
object to the push queue after the clone request completes

Fixes: https://tracker.ceph.com/issues/68808
Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
8 months agoexporter: SIGTERM/SIGINT/SIGHUP management 60488/head
Juan Miguel Olmo Martínez [Fri, 25 Oct 2024 12:57:44 +0000 (14:57 +0200)]
exporter: SIGTERM/SIGINT/SIGHUP management

Ceph exporter manages SIGINT, SIGTERM and SIGHUP signals

Fixes: https://tracker.ceph.com/issues/68721
Signed-off-by: Juan Miguel Olmo Martínez <jolmomar@redhat.com>
8 months agoMerge pull request #60655 from xxhdx1985126/wip-seastore-move-out-root-meta
Yingxin [Fri, 15 Nov 2024 08:51:05 +0000 (16:51 +0800)]
Merge pull request #60655 from xxhdx1985126/wip-seastore-move-out-root-meta

crimson/os/seastore: move the root meta out of the root block

Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
8 months agoMerge pull request #60209 from myoungwon/wip-init-onode-layout
Yingxin [Fri, 15 Nov 2024 08:50:17 +0000 (16:50 +0800)]
Merge pull request #60209 from myoungwon/wip-init-onode-layout

crimson/os/seastore: initialize oi and ss in onode_layout_t to zero

Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
8 months agodoc: 17.2.8 Release Notes
Yuri Weinstein [Mon, 11 Nov 2024 21:58:29 +0000 (13:58 -0800)]
doc: 17.2.8 Release Notes

Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
8 months agoMerge pull request #60715 from ronen-fr/wip-rf-dumpsched
Ronen Friedman [Thu, 14 Nov 2024 13:55:36 +0000 (15:55 +0200)]
Merge pull request #60715 from ronen-fr/wip-rf-dumpsched

osd/scrub: list additional information when dumping the queue

Reviewed-by: Samuel Just <sjust@redhat.com>
8 months agoMerge pull request #60727 from guits/dmcrypt-activate
Guillaume Abrioux [Thu, 14 Nov 2024 13:02:16 +0000 (14:02 +0100)]
Merge pull request #60727 from guits/dmcrypt-activate

ceph-volume: fix dmcrypt activation regression

8 months agoMerge pull request #60404 from VallariAg/wip-nvmeof-listeners-prometheus-alerts
Vallari Agrawal [Thu, 14 Nov 2024 10:53:55 +0000 (16:23 +0530)]
Merge pull request #60404 from VallariAg/wip-nvmeof-listeners-prometheus-alerts

monitoring: add 2 nvmeof alerts to prometheus_alerts.yaml

8 months agoMerge pull request #51766 from zhsgao/mds_clear_session_failed
Venky Shankar [Thu, 14 Nov 2024 07:08:35 +0000 (12:38 +0530)]
Merge pull request #51766 from zhsgao/mds_clear_session_failed

mds: session in the importing state cannot be cleared if an export subtree task is interrupted while the state of importer is acking

Reviewed-by: Venky Shankar <vshankar@redhat.com>
8 months agoMerge pull request #60067 from VallariAg/wip-nvmeof-healthcheck
Vallari Agrawal [Thu, 14 Nov 2024 03:10:43 +0000 (08:40 +0530)]
Merge pull request #60067 from VallariAg/wip-nvmeof-healthcheck

mon: add nvmeof healthchecks

8 months agoceph-volume: fix dmcrypt activation regression 60727/head
Guillaume Abrioux [Wed, 13 Nov 2024 15:23:36 +0000 (16:23 +0100)]
ceph-volume: fix dmcrypt activation regression

`ceph.with_tpm` tag is actually a `str`, which means that
`bool('0')` will return `True`.

Fixes: https://tracker.ceph.com/issues/68944
Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>
8 months agoMerge pull request #60554 from jshen28/fix-legacy-addr-mismatch
Ilya Dryomov [Wed, 13 Nov 2024 12:25:08 +0000 (13:25 +0100)]
Merge pull request #60554 from jshen28/fix-legacy-addr-mismatch

Make mon_info_t consistent with mon_addr

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
8 months agomds: client is evicted when an export subtree task is interrupted 51766/head
Zhansong Gao [Mon, 3 Jul 2023 09:18:14 +0000 (17:18 +0800)]
mds: client is evicted when an export subtree task is interrupted

The importer will force open some sessions provided by the exporter but the client does not know about
the new sessions until the exporter notifies it, and the notifications cannot be sent if the exporter
is interrupted. The client does not renew the sessions regularly that it does not know about, so the client
will be evicted by the importer after `session_autoclose` seconds (300 seconds by default).

The sessions that are forced opened in the importer need to be closed when the import process is reversed.

Signed-off-by: Zhansong Gao <zhsgao@hotmail.com>
8 months agoqa: add test for importer's unexpected client eviction after an export subtree task...
Zhansong Gao [Mon, 3 Jul 2023 08:42:34 +0000 (16:42 +0800)]
qa: add test for importer's unexpected client eviction after an export subtree task is interrupted

Signed-off-by: Zhansong Gao <zhsgao@hotmail.com>
8 months agomds: session in the importing state cannot be cleared if an export subtree task is...
Zhansong Gao [Fri, 26 May 2023 04:20:17 +0000 (12:20 +0800)]
mds: session in the importing state cannot be cleared if an export subtree task is interrupted while the state of importer is acking

The related sessions in the importer are in the importing state(`Session::is_importing` return true) when the state of importer is `acking`,
`Migrator::import_reverse` called by `MDCache::handle_resolve` should reverse the process to clear the importing state if the exporter restarts
at this time, but it doesn't do that actually because of its bug. And it will cause these sessions to not be cleared when the client is
unmounted(evicted or timeout) until the mds is restarted.

The bug in `import_reverse` is that it contains the code to handle state `IMPORT_ACKING` but it will never be executed because
the state is modified to `IMPORT_ABORTING` at the beginning. Move `stat.state = IMPORT_ABORTING` to the end of import_reverse
so that it can handle the state `IMPORT_ACKING`.

Fixes: https://tracker.ceph.com/issues/61459
Signed-off-by: Zhansong Gao <zhsgao@hotmail.com>
8 months agoqa: add test for importer's session cleanup after an export subtree task is interrupted
Zhansong Gao [Fri, 26 May 2023 04:58:42 +0000 (12:58 +0800)]
qa: add test for importer's session cleanup after an export subtree task is interrupted

Signed-off-by: Zhansong Gao <zhsgao@hotmail.com>
8 months agoqa/suites/nvmeof: add nvmeof warnings to log-ignorelist 60067/head
Vallari Agrawal [Tue, 15 Oct 2024 12:40:20 +0000 (18:10 +0530)]
qa/suites/nvmeof: add nvmeof warnings to log-ignorelist

Add NVMEOF_SINGLE_GATEWAY and NVMEOF_GATEWAY_DOWN
warnings to nvmeof:thrash job's log-ignorelist

Signed-off-by: Vallari Agrawal <val.agl002@gmail.com>
8 months agomds: the assert should be before the journal entry submit otherwise it's racy
Zhansong Gao [Fri, 16 Jun 2023 04:30:40 +0000 (12:30 +0800)]
mds: the assert should be before the journal entry submit otherwise it's racy

Signed-off-by: Zhansong Gao <zhsgao@hotmail.com>
8 months agomds: add `importing_count` to session dump
Zhansong Gao [Fri, 26 May 2023 04:25:10 +0000 (12:25 +0800)]
mds: add `importing_count` to session dump

Signed-off-by: Zhansong Gao <zhsgao@hotmail.com>
8 months agoMerge PR #60464 into main
Patrick Donnelly [Wed, 13 Nov 2024 03:17:59 +0000 (22:17 -0500)]
Merge PR #60464 into main

* refs/pull/60464/head:
mds: add or update MDS thread names
log: cache recent threads up to a day
common: cache pthread names
log: concatenate thread names and print once per thread

Reviewed-by: Milind Changire <mchangir@redhat.com>
8 months agoMerge PR #60327 into main
Patrick Donnelly [Wed, 13 Nov 2024 03:16:54 +0000 (22:16 -0500)]
Merge PR #60327 into main

* refs/pull/60327/head:
mon/MDSMonitor: send reply to beacons with `state=DNE`

Reviewed-by: Patrick Donnelly <pdonnell@ibm.com>
8 months agoMerge PR #60381 into main
Patrick Donnelly [Wed, 13 Nov 2024 03:14:20 +0000 (22:14 -0500)]
Merge PR #60381 into main

* refs/pull/60381/head:
doc: remove refrences to `mds_log_major_segment_event_ratio`
mds: start a new major segment after reaching minor segment threshold
mds: make parts of mdlog reusable to be used by beacon

Reviewed-by: Anthony D Atri <anthony.datri@gmail.com>
Reviewed-by: Patrick Donnelly <pdonnell@ibm.com>
8 months agoMerge PR #60325 into main
Patrick Donnelly [Wed, 13 Nov 2024 03:13:46 +0000 (22:13 -0500)]
Merge PR #60325 into main

* refs/pull/60325/head:
mds/Beacon: wake up the thread in shutdown()

Reviewed-by: Patrick Donnelly <pdonnell@ibm.com>
Reviewed-by: Venky Shankar <vshankar@redhat.com>
8 months agoMerge PR #60283 into main
Patrick Donnelly [Wed, 13 Nov 2024 03:12:27 +0000 (22:12 -0500)]
Merge PR #60283 into main

* refs/pull/60283/head:
mds: add issue_seq to all cap messages
include/ceph_fs: correct ceph_mds_cap_peer field name
include/ceph_fs: correct ceph_mds_cap_item field name
messages/MClientCaps: use correct ceph_seq_t for cap sequence types
messages/MClientCaps: dump issue_seq for debugging
mds: remove dead code

Reviewed-by: Venky Shankar <vshankar@redhat.com>
8 months agoMerge PR #60226 into main
Patrick Donnelly [Wed, 13 Nov 2024 03:10:57 +0000 (22:10 -0500)]
Merge PR #60226 into main

* refs/pull/60226/head:
mds/QuiesceDbEncoding: add `inline` to work around linker error

Reviewed-by: Rishabh Dave <ridave@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@ibm.com>
8 months agoMerge PR #58586 into main
Patrick Donnelly [Wed, 13 Nov 2024 03:09:30 +0000 (22:09 -0500)]
Merge PR #58586 into main

* refs/pull/58586/head:
qa/suites/fs/thrash: add convolve operator for overrides
qa/cephfs: s/prefetch_dirfrags/prefetch_oft_dirfrags

Reviewed-by: Patrick Donnelly <pdonnell@ibm.com>
8 months agoMerge pull request #60675 from zmc/make-dist-quiet
Zack Cerza [Tue, 12 Nov 2024 19:21:23 +0000 (12:21 -0700)]
Merge pull request #60675 from zmc/make-dist-quiet

8 months agoosd/scrub: list additional information when dumping the queue 60715/head
Ronen Friedman [Tue, 12 Nov 2024 14:21:25 +0000 (08:21 -0600)]
osd/scrub: list additional information when dumping the queue

Extend the information provided for operator dump commands, to
include the basic identity and scheduling information of the
entries in the scrub queue.

This change mostly benefits automatic QA and our internal
testing.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
8 months agorgw: Update README for DBStore and Posix drivers 60711/head
Soumya Koduri [Tue, 12 Nov 2024 06:34:33 +0000 (12:04 +0530)]
rgw: Update README for DBStore and Posix drivers

Signed-off-by: Soumya Koduri <skoduri@redhat.com>
8 months agocrimson/osd/replicated_backend: add the dedicated method to construct
Xuehan Xu [Mon, 4 Nov 2024 07:19:26 +0000 (15:19 +0800)]
crimson/osd/replicated_backend: add the dedicated method to construct
MOSDRepOp

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
8 months agocrimson/osd/pg: add is_missing_on_peer()
Xuehan Xu [Mon, 4 Nov 2024 07:18:19 +0000 (15:18 +0800)]
crimson/osd/pg: add is_missing_on_peer()

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
8 months agocrimson/osd/pg: reuse already defined methods as many as possible to
Xuehan Xu [Mon, 4 Nov 2024 07:17:19 +0000 (15:17 +0800)]
crimson/osd/pg: reuse already defined methods as many as possible to
improve readability

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
8 months agocrimson/osd/ops_executor: call OpsExecutor::apply_stats() before 60596/head
Xuehan Xu [Sat, 9 Nov 2024 06:59:45 +0000 (14:59 +0800)]
crimson/osd/ops_executor: call OpsExecutor::apply_stats() before
OpsExecutor::obc is moved out.

OpsExecutor::apply_stats() accesses OpsExecutor::obc indirectly

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
8 months agocrimson/osd/pg: fix dangerous lambda captures
Xuehan Xu [Sat, 2 Nov 2024 02:01:52 +0000 (10:01 +0800)]
crimson/osd/pg: fix dangerous lambda captures

In PG::submit_transaction(), osd_op_p was first moved to
ReplicatedBackend::submit_transaction(), and later its at_version is
captured in another lambda. The latter may be already empty at the time
of the capture.

Signed-off-by: Xuehan Xu <xuxuehan@qianxin.com>
8 months agoMerge pull request #60703 from zdover23/wip-doc-2024-11-12-rados-operations-add-or... 59906/head
Anthony D'Atri [Tue, 12 Nov 2024 01:18:56 +0000 (20:18 -0500)]
Merge pull request #60703 from zdover23/wip-doc-2024-11-12-rados-operations-add-or-rm-osds

doc/rados: correct "full ratio" note

8 months agoMerge pull request #60704 from zdover23/wip-doc-2024-11-12-start-hardware-recommendations
Anthony D'Atri [Tue, 12 Nov 2024 01:17:26 +0000 (20:17 -0500)]
Merge pull request #60704 from zdover23/wip-doc-2024-11-12-start-hardware-recommendations

doc/start: fix "are are" typo

8 months agoMerge pull request #60705 from zdover23/wip-doc-2024-11-12-radosgw-account
Anthony D'Atri [Tue, 12 Nov 2024 01:10:55 +0000 (20:10 -0500)]
Merge pull request #60705 from zdover23/wip-doc-2024-11-12-radosgw-account

doc/radosgw: s/Poliicy/Policy/

8 months agodoc/radosgw: s/Poliicy/Policy/ 60705/head
Zac Dover [Mon, 11 Nov 2024 23:54:56 +0000 (09:54 +1000)]
doc/radosgw: s/Poliicy/Policy/

Correct "AttachGroupPoliicy" to "AttachGroupPolicy".

Fixes: https://tracker.ceph.com/issues/68902
Signed-off-by: Zac Dover <zac.dover@proton.me>
8 months agodoc/start: fix "are are" typo 60704/head
Zac Dover [Mon, 11 Nov 2024 23:47:21 +0000 (09:47 +1000)]
doc/start: fix "are are" typo

Fix typo reading "They are are single-threaded".
s/are are/are/

Fixes: https://tracker.ceph.com/issues/68901
Signed-off-by: Zac Dover <zac.dover@proton.me>
8 months agoMerge pull request #60092 from connorfawcett/vstart_runner_bytes_fix
Connor Fawcett [Mon, 11 Nov 2024 23:39:41 +0000 (23:39 +0000)]
Merge pull request #60092 from connorfawcett/vstart_runner_bytes_fix

qa/vstart-runner: Only remove non-ASCII characters from string-based command outputs

8 months agodoc/rados: correct "full ratio" note 60703/head
Zac Dover [Mon, 11 Nov 2024 23:31:28 +0000 (09:31 +1000)]
doc/rados: correct "full ratio" note

Correct a note that directed users not to add an OSD after the cluster
has reached its "full ratio". The note now says "Do not let your cluster
reach its full ratio before adding an OSD."

Hat tip: Oskar Berggren

Fixes: https://tracker.ceph.com/issues/68900
Co-authored-by: Oskar Berggren <oskar.berggren@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
8 months agomonitoring: add tests for 2 new nvmeof alerts 60404/head
Vallari Agrawal [Wed, 30 Oct 2024 14:04:01 +0000 (19:34 +0530)]
monitoring: add tests for 2 new nvmeof alerts

Add test for alerts NVMeoFMissingListener and
NVMeoFZeroListenerSubsystem to test_alerts.yml.

Signed-off-by: Vallari Agrawal <vallari.agrawal@ibm.com>
8 months agoMerge pull request #60529 from zdover23/wip-doc-2024-10-29-start-os-recommendations
Zac Dover [Mon, 11 Nov 2024 15:33:06 +0000 (01:33 +1000)]
Merge pull request #60529 from zdover23/wip-doc-2024-10-29-start-os-recommendations

doc/start: separate package chart from container chart

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
8 months agoMerge pull request #59328 from mkogan1/wip-d3n-umask
Mark Kogan [Mon, 11 Nov 2024 13:59:59 +0000 (15:59 +0200)]
Merge pull request #59328 from mkogan1/wip-d3n-umask

rgw: d3n: set access permissions for the D3N cache directory when creating