git.apps.os.sepia.ceph.com Git

]> git.apps.os.sepia.ceph.com Git - ceph-ci.git/log

projects / ceph-ci.git / log

commit | commitdiff | tree

skanta [Fri, 10 Oct 2025 23:31:22 +0000 (05:01 +0530)]

Merge branch 'wip-rf-minchunk' of https://github.com/ronen-fr/ceph into wip-bharath10-testing-2025-10-11-0501

commit | commitdiff | tree

skanta [Fri, 10 Oct 2025 23:31:18 +0000 (05:01 +0530)]

Merge branch 'wip-ksirivad-fix-msg-v2' of https://github.com/kamoltat/ceph into wip-bharath10-testing-2025-10-11-0501

commit | commitdiff | tree

Adam King [Fri, 10 Oct 2025 20:55:51 +0000 (16:55 -0400)]

Merge pull request #65855 from rhcs-dashboard/bump-monitoring-stack

mgr/cephadm: bump monitoring stack versions

Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: Afreen Misbah <afreen@ibm.com>

commit | commitdiff | tree

Kamoltat Sirivadhna [Wed, 30 Jul 2025 13:57:47 +0000 (13:57 +0000)]

doc/health-checks: update MON_NETSPLIT documentation

Update the MON_NETSPLIT health check documentation to reflect the
introduction of the configurable mon_netsplit_grace_period option.

Fixes: https://tracker.ceph.com/issues/71344
Signed-off-by: Kamoltat Sirivadhna <ksirivad@redhat.com>

commit | commitdiff | tree

Kamoltat Sirivadhna [Fri, 25 Jul 2025 04:18:46 +0000 (04:18 +0000)]

qa/suites/rados: white list + add MON_NETSPLIT tests

Some existing netsplit test in 3az + stretch mode
needed white listing and check for netsplit details

Make qa/tasks/mon_thrash.py set
mon_netsplit_grace_period to 30 seconds
when we try to freeze monitors instead of killing them.

Make qa/tasks/stretch_mode_disable_enable.py set
mon_netsplit_grace_period to 30 seconds
during `teardown` phase only.

Fixes: https://tracker.ceph.com/issues/71344
Signed-off-by: Kamoltat Sirivadhna <ksirivad@redhat.com>

commit | commitdiff | tree

Kamoltat Sirivadhna [Fri, 18 Jul 2025 05:00:18 +0000 (05:00 +0000)]

msg/async: ProtocolV2::send_server_ident update ProtocolV2::global_seq

In ProtocolV2::send_server_ident(), the global_seq was being fetched
from messenger->get_global_seq() and used in the ServerIdentFrame,
however, it is stored in a local var and not the private class var
ProtocolV2::global_seq. This causes problems like
where the receiving peer sees a peer_global_seq that
appears older than expected, triggering a false-positive reconnect logic:

```
2025-07-15T11:40:50.927+0000 mon.c handle_existing_connection client has clearly restarted
(peer_global_seq < ex_peer_global_seq && cookie changed), dropping existing connection=0x563ffe9a9000 in favor of new one
```

In this case, mon.c received a peer_global_seq=75, which was already logged by mon.d as gs=79 in
its send_server_ident()—but ProtocolV2::global_seq was never updated, resulting in inconsistent state and premature connection teardown.

This commit fixes the issue by assigning the freshly incremented messenger->get_global_seq() value to the local global_seq field in ProtocolV2 as well,
ensuring consistency in the protocol.

Fixes: https://tracker.ceph.com/issues/71344
Signed-off-by: Kamoltat Sirivadhna <ksirivad@redhat.com>

commit | commitdiff | tree

Casey Bodley [Fri, 10 Oct 2025 17:45:43 +0000 (13:45 -0400)]

Merge pull request #65723 from mheler/rgw_cloud_transition_header_fix

rgw/s3: Always include x-amz-content-sha256 header in AWS v4 signatures

Reviewed-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

Patrick Donnelly [Fri, 10 Oct 2025 17:40:59 +0000 (13:40 -0400)]

Merge PR #65839 into main

* refs/pull/65839/head:
pybind/cephfs: fix including of platform_errno.h

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Christopher Hoffman <choffman@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@ibm.com>

commit | commitdiff | tree

SrinivasaBharathKanta [Fri, 10 Oct 2025 16:01:52 +0000 (21:31 +0530)]

Merge pull request #65525 from ethanwu-syno/fix-snapper-parse-m

osd/SnapMapper: fix SnapMapper::Scrubber shard parsing

commit | commitdiff | tree

SrinivasaBharathKanta [Fri, 10 Oct 2025 16:01:17 +0000 (21:31 +0530)]

Merge pull request #65450 from tchaikov/wip-fix-out-of-bound-access

cpp-btree: fix array bounds warning in btree_node::swap()

commit | commitdiff | tree

SrinivasaBharathKanta [Fri, 10 Oct 2025 16:00:50 +0000 (21:30 +0530)]

Merge pull request #65033 from Nordix/fix-72380-sunnat

mgr/prometheus: Handle empty JSON from orch get-security-config

commit | commitdiff | tree

Adam King [Fri, 10 Oct 2025 15:08:26 +0000 (11:08 -0400)]

Merge pull request #65757 from phlogistonjohn/jjm-qa-task-smb

qa/tasks: move smb related task funcs out of cephadm.py to smb.py

Reviewed-by: Adam King <adking@redhat.com>

commit | commitdiff | tree

Rishabh Dave [Wed, 8 Oct 2025 17:11:25 +0000 (22:41 +0530)]

pybind/cephfs: fix including of platform_errno.h

Introduced-by: 2439bcb87398
Fixes: https://tracker.ceph.com/issues/73435
Signed-off-by: Rishabh Dave <ridave@redhat.com>

commit | commitdiff | tree

afreen23 [Fri, 10 Oct 2025 10:05:10 +0000 (15:35 +0530)]

Merge pull request #65852 from rhcs-dashboard/73447-fix-usage-bar-for-secondary-cluster

mgr/dashboard : Fixed usage bar for secondary site in rbd mirroing

Reviewed-by: Ankush Behl <cloudbehl@gmail.com>

commit | commitdiff | tree

Ilya Dryomov [Fri, 10 Oct 2025 08:41:24 +0000 (10:41 +0200)]

Merge pull request #63190 from adamemerson/wip-gcc15-rocksdb-cstdint

build: Modify rocksdb submodule

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
Reviewed-by: Casey Bodley <cbodley@redhat.com>
Reviewed-by: Kefu Chai <tchaikov@gmail.com>

commit | commitdiff | tree

afreen23 [Fri, 10 Oct 2025 08:27:59 +0000 (13:57 +0530)]

Merge pull request #65840 from rhcs-dashboard/fix-permissionsrt

mgr/dashboard: fix dashboard freeze on missing smb permissions

Reviewed-by: Afreen Misbah <afreen@ibm.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>

commit | commitdiff | tree

Raja [Fri, 10 Oct 2025 07:03:18 +0000 (12:33 +0530)]

Merge pull request #65734 from ArbitCode/wip-raja-get-caller-identity-issue

rgw/sts : get-caller-identity issue fix

commit | commitdiff | tree

Pritha Srivastava [Fri, 10 Oct 2025 04:52:24 +0000 (10:22 +0530)]

Merge pull request #65866 from samarahu/d4n-suite-redis-fix

qa/d4n: Fix redis scanning in workunit

commit | commitdiff | tree

Nizamudeen A [Fri, 10 Oct 2025 04:33:45 +0000 (10:03 +0530)]

Merge pull request #65857 from rhcs-dashboard/table-title-template

mgr/dashboard: support templates in table component title

Reviewed-by: Nizamudeen A <nia@redhat.com>

commit | commitdiff | tree

Nizamudeen A [Fri, 10 Oct 2025 04:01:46 +0000 (09:31 +0530)]

Merge pull request #65558 from rhcs-dashboard/rbd-form-fixes

mgr/dashboard: fix rbd form mirroring toggle

Reviewed-by: Afreen Misbah <afreen@ibm.com>
Reviewed-by: Ankush Behl <cloudbehl@gmail.com>

commit | commitdiff | tree

Nizamudeen A [Fri, 10 Oct 2025 03:51:06 +0000 (09:21 +0530)]

Merge pull request #65833 from rhcs-dashboard/custom-message-validator

mgr/dashboard: support custom validator messages in inline edit

Reviewed-by: Naman Munet <nmunet@redhat.com>

commit | commitdiff | tree

Joseph Mundackal [Thu, 9 Oct 2025 19:57:42 +0000 (15:57 -0400)]

Merge pull request #65865 from jmundack/doc-Remove-unsupposed-OS-from-packages-and-container-hosts-table

doc: Update os-recommendations.rst to remove empty rows

commit | commitdiff | tree

afreen23 [Thu, 9 Oct 2025 18:57:21 +0000 (00:27 +0530)]

Merge pull request #64587 from SundownRises/NotificationArea-component

mgr/dashboard: Carbonised Notification area component

Reviewed-by: Afreen Misbah <afreen@ibm.com>

commit | commitdiff | tree

Samarah [Wed, 8 Oct 2025 16:10:53 +0000 (16:10 +0000)]

qa/d4n: Fix redis scanning in workunit

Signed-off-by: Samarah <samarah.uriarte@ibm.com>

commit | commitdiff | tree

Joseph Mundackal [Thu, 9 Oct 2025 17:09:26 +0000 (13:09 -0400)]

doc: Update os-recommendations.rst to remove empty rows

- Removing rows in tables without any supported versions

Signed-off-by: Joseph Mundackal <jmundackal@bloomberg.net>

commit | commitdiff | tree

Nizamudeen A [Thu, 9 Oct 2025 09:12:03 +0000 (14:42 +0530)]

mgr/cephadm: bump monitoring stack versions

bumping prometheus, node-exporter, alertmanager, grafana.

Fixes: https://tracker.ceph.com/issues/73467
Signed-off-by: Nizamudeen A <nia@redhat.com>

commit | commitdiff | tree

Afreen Misbah [Thu, 9 Oct 2025 12:27:21 +0000 (17:57 +0530)]

mgr/dashboard: Remove unused css

Signed-off-by: Afreen Misbah <afreen@ibm.com>

commit | commitdiff | tree

Venky Shankar [Thu, 9 Oct 2025 13:13:26 +0000 (18:43 +0530)]

Merge PR #65713 into main

* refs/pull/65713/head:

Reviewed-by: Dhairya Parmar <dparmar@redhat.com>
Reviewed-by: Christopher Hoffman <choffman@redhat.com>

commit | commitdiff | tree

Aashish Sharma [Thu, 9 Oct 2025 11:43:00 +0000 (17:13 +0530)]

Merge pull request #62360 from cloudbehl/node-exporter-dashboard

mgr/dashboard: added node-exporter dashboard

Reviewed-by: Aashish Sharma <aasharma@redhat.com>

commit | commitdiff | tree

Afreen Misbah [Thu, 9 Oct 2025 11:37:47 +0000 (17:07 +0530)]

mgr/dashboard: Fix color issue of notification panel

The global header css of navigation panel was overriding the color of notifications.

Signed-off-by: Afreen Misbah <afreen@ibm.com>

commit | commitdiff | tree

Naman Munet [Thu, 9 Oct 2025 10:58:11 +0000 (16:28 +0530)]

mgr/dashboard: support templates in table component title

fixes: https://tracker.ceph.com/issues/73456

Signed-off-by: Naman Munet <naman.munet@ibm.com>

commit | commitdiff | tree

afreen23 [Thu, 9 Oct 2025 10:45:58 +0000 (16:15 +0530)]

Merge pull request #65787 from Hezko/nvmeof-cli-missing-cmnds2

mgr/dashboard: support gw get_stats and listener info

Reviewed-by: Nizamudeen A <nia@redhat.com>

commit | commitdiff | tree

Abhishek Desai [Thu, 9 Oct 2025 07:49:34 +0000 (13:19 +0530)]

mgr/dashboard : Fixed usage bar for secondary site in rbd mirroing
fixes : https://tracker.ceph.com/issues/73447
Signed-off-by: Abhishek Desai <abhishek.desai1@ibm.com>

commit | commitdiff | tree

Ronen Friedman [Thu, 9 Oct 2025 07:31:30 +0000 (02:31 -0500)]

osd/scrub: do not reduce min chunk on preemption

Existing logic handling scrub preemptions is halving both the
'max' and the 'min' values.
This isn't optimal: the 'min' values (used mainly to guarantee a minimal
number of objects to fetch from the backend in one operation) can and
should also be used to limit the effect of preemptions on the execution
of the scrub.

Fixes: https://tracker.ceph.com/issues/73410
Signed-off-by: Ronen Friedman <rfriedma@redhat.com>

commit | commitdiff | tree

Nizamudeen A [Wed, 8 Oct 2025 10:29:36 +0000 (15:59 +0530)]

mgr/dashboard: support custom validator messages in inline edit

we can add custom messages to the customTemplate like
```

        name: $localize`Name`,
        prop: 'name',
        flexGrow: 2,
        cellTemplate: this.removingStatTpl,
        cellTransformation: CellTemplate.editing,
        customTemplateConfig: {
          validators: [Validators.required, Validators.pattern(/^[A-Za-z ]+$/)],
          errorMessages: {
            required:  $localize`This field is required.`,
            pattern: $localize`The field format is invalid.`
          }
        }
      },
```

also extend support for async validator

Signed-off-by: Nizamudeen A <nia@redhat.com>

commit | commitdiff | tree

Ronen Friedman [Wed, 8 Oct 2025 16:54:40 +0000 (11:54 -0500)]

osd/scrub: new/modified perf counters for scrub preemption

A new 'scrub_[replicated/ec]_io_intersects' counter is added, to count
the times a client write op intersects the active scrub chunk range.
A new 'scrub_[replicated/ec]_io_blocked' counter counts the times
the write op did not preempt the scrub, and instead blocked for the
scrub to release the chunk.

The existing labeled counter <label>_write_blocked is still there,
but will be removed in 'Umbrella'. Its meaning was modified,
to mimic the new 'scrub_[replicated/ec]_io_blocked' counter.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>

commit | commitdiff | tree

Ankush Behl [Tue, 18 Mar 2025 15:26:45 +0000 (20:56 +0530)]

mgr/dashboard: added node-exporter dashboard

Fixes: https://tracker.ceph.com/issues/70534
Signed-off-by: Ankush Behl <cloudbehl@gmail.com>

commit | commitdiff | tree

Nizamudeen A [Thu, 9 Oct 2025 05:06:49 +0000 (10:36 +0530)]

Merge pull request #65704 from rhcs-dashboard/rename-sidenav-items

mgr/dashboard: Rename side-nav panel items

Reviewed-by: Ankush Behl <cloudbehl@gmail.com>
Reviewed-by: Puja Shahu <pshahu@redhat.com>
Reviewed-by: Afreen Misbah <afreen@ibm.com>
Reviewed-by: Dnyaneshwari talwekar <dtalweka@redhat.com>
Reviewed-by: Aashish Sharma <aasharma@redhat.com>

commit | commitdiff | tree

Gregory Farnum [Wed, 8 Oct 2025 23:17:59 +0000 (16:17 -0700)]

Merge pull request #65743 from edwinzrodriguez/ceph-clang-format-remainder

Update misc file tab settings, based on the discussions in this PR
and https://github.com/ceph/ceph/pull/65049

Reviewed-by: Ilya Dryomov <idryomov@redhat.com>
Reviewed-by: Anthony D Atri <anthony.datri@gmail.com>
Reviewed-by: John Mulligan <jmulligan@redhat.com>
Reviewed-by: Greg Farnum <gfarnum@redhat.com>

commit | commitdiff | tree

Gregory Farnum [Wed, 8 Oct 2025 23:17:02 +0000 (16:17 -0700)]

Merge pull request #65742 from edwinzrodriguez/ceph-clang-format-cc

Update cc file tab settings, based on the discussions in this PR
and https://github.com/ceph/ceph/pull/65049

Reviewed-by: Ilya Dryomov <idryomov@redhat.com>
Reviewed-by: Anthony D Atri <anthony.datri@gmail.com>
Reviewed-by: Greg Farnum <gfarnum@redhat.com>

commit | commitdiff | tree

Gregory Farnum [Wed, 8 Oct 2025 23:15:18 +0000 (16:15 -0700)]

Merge pull request #65741 from edwinzrodriguez/ceph-clang-format-h

Update header file tab settings, based on the discussions in this PR
and https://github.com/ceph/ceph/pull/65049

Reviewed-by: Ilya Dryomov <idryomov@redhat.com>
Reviewed-by: Anthony D Atri <anthony.datri@gmail.com>
Reviewed-by: Adam Emerson <aemerson@redhat.com>
Reviewed-by: Greg Farnum <gfarnum@redhat.com>

commit | commitdiff | tree

David Galloway [Wed, 8 Oct 2025 22:51:55 +0000 (18:51 -0400)]

Merge pull request #65835 from phlogistonjohn/jjm-bwc-naming-cleanup

build-with-container: naming cleanup

commit | commitdiff | tree

John Mulligan [Wed, 8 Oct 2025 20:41:36 +0000 (16:41 -0400)]

script/build-with-container: improve error handling for invalid distros

Instead of throwing a long obnoxious traceback at the user if the value
supplied to -d/--distro is invalid do something nicer. For example:
```
$ ./src/script/build-with-container.py -d trixy -e build
usage: build-with-container.py [-h] [--help-build-steps]
build-with-container.py: error: argument --distro/-d: unknown distro: 'trixy' not in centos10, centos10stream, centos8, centos9, centos9stream, rocky9, rockylinux9, rocky10, rockylinux10, fedora41, fc41, fedora42, fc42, fedora43, fc43, ubuntu20.04, ubuntu-focal, focal, ubuntu22.04, ubuntu-jammy, jammy, ubuntu24.04, ubuntu-noble, noble, debian12, debian-bookworm, bookworm, debian13, debian-trixie, trixie

```

Signed-off-by: John Mulligan <jmulligan@redhat.com>

commit | commitdiff | tree

John Mulligan [Wed, 8 Oct 2025 21:04:27 +0000 (17:04 -0400)]

script/build-with-container: add debian 13 (trixie)

Signed-off-by: John Mulligan <jmulligan@redhat.com>

commit | commitdiff | tree

John Mulligan [Wed, 8 Oct 2025 17:30:18 +0000 (13:30 -0400)]

script/build-with-container: add ubuntu 20.04 (focal)

Add ubuntu 20.04 (focal) to the available list of distro kinds.

Signed-off-by: John Mulligan <jmulligan@redhat.com>

commit | commitdiff | tree

John Mulligan [Wed, 8 Oct 2025 14:28:30 +0000 (10:28 -0400)]

script/build-with-container: add a pair of fedora distro versions

Add fedora 42 and the soon-to-be-released fedora 43.

Signed-off-by: John Mulligan <jmulligan@redhat.com>

commit | commitdiff | tree

John Mulligan [Wed, 8 Oct 2025 14:26:58 +0000 (10:26 -0400)]

script/build-with-container: lightly organize the distro kind aliases

Do a tiny reorg of the distro kind aliases and container images to keep
the EL distros together and comment out each "section".

Signed-off-by: John Mulligan <jmulligan@redhat.com>

commit | commitdiff | tree

John Mulligan [Wed, 8 Oct 2025 14:23:25 +0000 (10:23 -0400)]

script/build-with-container: be consistent with naming in distro kinds

Update the DistroKind enum and related items so that the naming is
applied consistently. That is: the canonical (no pun indented) form
of the name is "<name><version>" and codenames, such as "jammy" or
"bookworm" are aliases. This matches the previously existing code.

Signed-off-by: John Mulligan <jmulligan@redhat.com>

commit | commitdiff | tree

Pedro Gonzalez Gomez [Wed, 8 Oct 2025 17:25:29 +0000 (19:25 +0200)]

mgr/dashboard: fix dashboard freeze on missing smb permissions

Fixes: https://tracker.ceph.com/issues/73436
Signed-off-by: Pedro Gonzalez Gomez <pegonzal@ibm.com>

commit | commitdiff | tree

J. Eric Ivancich [Wed, 8 Oct 2025 14:14:57 +0000 (10:14 -0400)]

Merge pull request #65797 from ivancich/wip-bump-arrow

rgw: use later versions of Apache Arrow at utf8proc submodules

Reviewed-by: Edwin Rodriguez <edwin.rodriguez1@ibm.com>
Reviewed-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

Adam King [Wed, 8 Oct 2025 13:19:27 +0000 (09:19 -0400)]

Merge pull request #65644 from ShwetaBhosale1/fix_issue_73035_nfs_ganesha_tls_support

mgr/cephadm: Cephadm support for NFS-Ganesha TLS configuration

Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: Anthony D'Atri <anthonyeleven@users.noreply.github.com>
Reviewed-by: Redouane Kachach <rkachach@ibm.com>

commit | commitdiff | tree

David Galloway [Wed, 8 Oct 2025 12:48:03 +0000 (08:48 -0400)]

Merge pull request #65804 from dmick/add-bookworm-to-bwc

src/script: add bookworm to build scripts

commit | commitdiff | tree

Tomer Haskalovitch [Sun, 21 Sep 2025 18:42:49 +0000 (21:42 +0300)]

mgr/dashboard: support gw get_stats and listener info

Update nvmeof/gateway submodule to have the relevant protobuf objects and calls.

Signed-off-by: Tomer Haskalovitch <tomer.haska@ibm.com>

commit | commitdiff | tree

Naman Munet [Mon, 29 Sep 2025 04:51:06 +0000 (10:21 +0530)]

mgr/dashboard: Rename side-nav panel items

Fixes: https://tracker.ceph.com/issues/73252
Commit includes changes:
1) Renaming Topic to Notification destination
2) Renaming Tiering to Storage class
3) Renaming Users to User Management
4) fix storage class table refresh after delete
5) Also made changes to internal routing for topic and storage class

Signed-off-by: Naman Munet <naman.munet@ibm.com>

commit | commitdiff | tree

Nizamudeen A [Wed, 8 Oct 2025 04:38:22 +0000 (10:08 +0530)]

Merge pull request #65482 from rhcs-dashboard/table-refresh-pagination

mgr/dashboard: server side table rendering improvements

Reviewed-by: Ankush Behl <ankush.behl@ibm.com>
Reviewed-by: Pedro Gonzalez <pedro.gonzalez@redhat.com>

commit | commitdiff | tree

bluikko [Wed, 8 Oct 2025 03:54:57 +0000 (10:54 +0700)]

Merge pull request #65809 from bluikko/doc-scrub-admonition-typo-cephfs

doc/cephfs: Typos and small improvements in scrub.rst

commit | commitdiff | tree

bluikko [Wed, 8 Oct 2025 03:53:38 +0000 (10:53 +0700)]

Merge pull request #65811 from bluikko/doc-cache-config-label-warnings-cephfs

doc/cephfs: Add missing labels in cache-configuration.rst

commit | commitdiff | tree

Rishabh Dave [Wed, 8 Oct 2025 03:02:04 +0000 (08:32 +0530)]

Merge pull request #65135 from rishabh-d-dave/libcephfs-platform-errno

libcephfs: convert ceph errno to host-based errno

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>

commit | commitdiff | tree

Dan Mick [Tue, 7 Oct 2025 22:22:44 +0000 (15:22 -0700)]

Merge pull request #65699 from phlogistonjohn/jjm-py-cmake-labels

pybind/mgr: add default set of cmake/ctest labels to tests

commit | commitdiff | tree

David Galloway [Tue, 7 Oct 2025 20:04:22 +0000 (16:04 -0400)]

Merge pull request #65819 from ceph/wip-yuriw-release-doc

doc/dev: Fix typo in release-process.rst

commit | commitdiff | tree

Kamoltat Sirivadhna [Fri, 18 Jul 2025 04:58:26 +0000 (04:58 +0000)]

src/mon/Elector.cc: Improve logging for the elector

Fixes: https://tracker.ceph.com/issues/71344
Signed-off-by: Kamoltat Sirivadhna <ksirivad@redhat.com>

commit | commitdiff | tree

Kamoltat Sirivadhna [Mon, 14 Jul 2025 18:46:02 +0000 (18:46 +0000)]

src/mon/ConnectionTracker.cc: && HealthMonitor.cc Improve logging

Improve logging for Netsplit pairs and
other variables in HealthMonitor::check_netsplit

Fixes: https://tracker.ceph.com/issues/71344
Signed-off-by: Kamoltat Sirivadhna <ksirivad@redhat.com>

commit | commitdiff | tree

Kamoltat Sirivadhna [Sun, 13 Jul 2025 20:26:49 +0000 (20:26 +0000)]

src/mon/HealthMonitor: Add mon_netsplit_grace_period to suppress transient MON_NETSPLIT warnings

When a monitor is elected leader and begins evaluating connectivity,
it may detect temporary disconnections between monitors that have not
yet fully reconnected to each other—particularly after events like
monitor restarts, SIGSTOP/SIGCONT (as used in mon_thrash), or brief network blips.

This can result in false-positive MON_NETSPLIT health warnings that
quickly disappear within seconds as the cluster topology stabilizes.

This commit introduces a configurable option:
- mon_netsplit_grace_period (default: 9 seconds)

When the leader observes a netsplit between two monitors or locations,
it will wait for the grace period before raising a health warning.
If the split resolves within this window, no warning is emitted.

This reduces test flakiness and alert fatigue while preserving the
accuracy of persistent MON_NETSPLIT detection.

Fixes: https://tracker.ceph.com/issues/71344
Signed-off-by: Kamoltat Sirivadhna <ksirivad@redhat.com>
Conflicts:
src/common/options/mon.yaml.in - trivial fix

commit | commitdiff | tree

Kamoltat Sirivadhna [Wed, 9 Jul 2025 03:47:59 +0000 (03:47 +0000)]

src/msg/async: Improve logging and prefixes for global_seq

global_seq needs more visibility on how it gets updated,
decided to add more loggings in AsyncMessenger::get_global_seq
and also added the prefixes for global_seq in both
ProtocolV1 and ProtocolV2.

Fixes: https://tracker.ceph.com/issues/71344
Signed-off-by: Kamoltat Sirivadhna <ksirivad@redhat.com>

commit | commitdiff | tree

Kamoltat Sirivadhna [Wed, 21 May 2025 20:55:04 +0000 (20:55 +0000)]

qa/suites/rados: increase debug && msgr-failures/none && white list

bump mon debug level to 30 in RADOS
and bump debug_ms from mon in
rados/monthrash && rados/multimon.

Add msgr-failures/none scenario to multimon and monthrash suite
this is a control scenario, where MON_NETSPLIT can only be organically
generated due to actually monitor network partition.

Whitelist the MON_NETSPLIT health warning in msgr-failures cases (excluding none)
for both multimon and monthrash suites. This is because all other
msgr-failures that is not `none` will have ms_inject_socket_failures
which is not an organic case of MON_NETSPLIT.

Fixes: https://tracker.ceph.com/issues/71344
Signed-off-by: Kamoltat Sirivadhna <ksirivad@redhat.com>

commit | commitdiff | tree

Kamoltat Sirivadhna [Wed, 28 May 2025 22:54:08 +0000 (22:54 +0000)]

qa/tasks/mon_thrash: enhanced logging in thrash

Fixes: https://tracker.ceph.com/issues/71344
Signed-off-by: Kamoltat Sirivadhna <ksirivad@redhat.com>

commit | commitdiff | tree

Kamoltat Sirivadhna [Thu, 5 Jun 2025 20:14:57 +0000 (20:14 +0000)]

ProtocolV2.cc: Add more loggings to functions that needs them

Fixes: https://tracker.ceph.com/issues/71344
Signed-off-by: Kamoltat Sirivadhna <ksirivad@redhat.com>

commit | commitdiff | tree

Kamoltat Sirivadhna [Mon, 23 Jun 2025 19:55:23 +0000 (19:55 +0000)]

msg/async/ProtocolV2: Speed up connection logic when peer restarts

When a client is restarted, it loses its state including global_seq and
gets a new client_cookie. This creates an issue during reconnection
because the server has an existing connection with a higher
global_seq value, causing it to reject the new connection as "stale"
with the error:

  "this is a stale connection, peer_global_seq="

This commit adds detection logic in
ProtocolV2::handle_existing_connection() to identify client restarts
by checking:

  - exproto->peer_global_seq > peer_global_seq
    (global_seq should only increase during a session; a decrease
    strongly suggests a restart)

  - client_cookie has changed (client generated a new cookie)

  - existing connection state is not Ready or Standby

When all three conditions are met, the server drops the existing connection
and accepts the new one. This leads to faster reconnection, especially
in cases like Monitor restarts, helping clients rejoin the quorum
without delay and avoiding spurious MON_NETSPLIT warnings.

It also prevents reliance on server-side reconnect attempts or waiting
for the client's global_seq to catch up.

Note: without this patch, it'll take longer
for the reconnection to happen, eventually the server will
also act as a client and send_reconnect since it has server_cookie.
However, this could be 15 seconds later, since, the server probably
was hit with a _fault max_back_off when it was trying to connect
to the client when the client was still down and has not yet
restart. This patch basically does not wait for the reconnection
to happen and intelligently make decisions in dropping existing
connection or not based on 3 conditions.

Fixes: https://tracker.ceph.com/issues/71344
Signed-off-by: Kamoltat Sirivadhna <ksirivad@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Tue, 7 Oct 2025 19:46:25 +0000 (12:46 -0700)]

docs: Fixed typo

Signed-off-by: Yuri Weinstein <yweinste@redhat.com>

commit | commitdiff | tree

Dan Mick [Tue, 7 Oct 2025 11:06:37 +0000 (04:06 -0700)]

src/test/fs: add more --copy-dt-needed-entries

See comment in src/test/libcephfs/CMakeLists.txt. I don't
know why bookworm tweaks this issue so that it occurs on two
more test binaries, but this fix works around the link error
in the previously-discovered way

Signed-off-by: Dan Mick <dan.mick@redhat.com>

commit | commitdiff | tree

Dan Mick [Tue, 7 Oct 2025 03:50:17 +0000 (20:50 -0700)]

make-debs.sh: use ID instead of NAME for workdir

NAME is "for the user", and as such, Debian's is "Debian GNU/Linux",
which isn't friendly for making a pathname. ID is more like what
we want (lowercase, no spaces, limited special characters),
in the two cases we care most about, 'ubuntu' and 'debian'.

Signed-off-by: Dan Mick <dan.mick@redhat.com>

commit | commitdiff | tree

Dan Mick [Tue, 7 Oct 2025 01:48:42 +0000 (18:48 -0700)]

src/script: add bookworm to build-with-container.py

..and its friend buildcontainer-setup.sh

Signed-off-by: Dan Mick <dan.mick@redhat.com>

commit | commitdiff | tree

Zack Cerza [Tue, 7 Oct 2025 17:36:41 +0000 (11:36 -0600)]

Merge pull request #65803 from zmc/sccache-flex

Dockerfile.build: Make sccache repo configurable

commit | commitdiff | tree

J. Eric Ivancich [Mon, 6 Oct 2025 17:45:26 +0000 (13:45 -0400)]

rgw: update cmake files to refer to system Apach Arrow version 9

When built using WITH_SYSTEM_ARROW refer to version 9.

Signed-off-by: J. Eric Ivancich <ivancich@redhat.com>

commit | commitdiff | tree

Adam King [Tue, 7 Oct 2025 13:52:18 +0000 (09:52 -0400)]

Merge pull request #63169 from andreasstieger/fuse3

ceph.spec.in: build with fuse 3

Reviewed-by: Adam King <adking@redhat.com>

commit | commitdiff | tree

Connor Fawcett [Tue, 7 Oct 2025 12:39:35 +0000 (13:39 +0100)]

Merge pull request #65703 from connorfawcett/pool-migration-design-doc

docs/dev: Add design document with information on proposed design for pool migration

commit | commitdiff | tree

Rishabh Dave [Tue, 7 Oct 2025 11:27:05 +0000 (16:57 +0530)]

Merge pull request #63917 from rishabh-d-dave/vols-purge-trash-entry

pybind/cephfs, mgr/volumes: introduce non-recurisve rmtree(), refactor purge() to use it and add MDS optimizations

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>

commit | commitdiff | tree

Ville Ojamo [Tue, 7 Oct 2025 08:49:00 +0000 (15:49 +0700)]

doc/cephfs: Add missing labels in cache-configuration.rst

Fix two Sphinx warnings about :ref: links in troubleshooting.rst to
undefined labels.

Signed-off-by: Ville Ojamo <14869000+bluikko@users.noreply.github.com>

commit | commitdiff | tree

Ville Ojamo [Tue, 7 Oct 2025 07:45:12 +0000 (14:45 +0700)]

doc/cephfs: Typos and small improvements in scrub.rst

Don't use space in "subdirectory" or "kick-start".

Change command syntax introduction text to the common "To ..., run a
command of the following form:" when the syntax has placeholders.

Use double backticks consistently (same as sections in the beginning)
for CLI commands, parameters etc.

Linkify "cluster health warnings" to the CephFS health warnings doc.

Use admonition and fix a typo in article.

Use title case for section titles and trim overly long underline.

Minor improvements to language.

Don't use unordered lists in two cases when there's no real need,
especially when there is only a single item in the latter case.

Signed-off-by: Ville Ojamo <14869000+bluikko@users.noreply.github.com>

commit | commitdiff | tree

afreen23 [Tue, 7 Oct 2025 07:31:12 +0000 (13:01 +0530)]

Merge pull request #65676 from Hezko/nvmeof-cli-ns-add-size-params2

mgr/dashboard: raise exception if both size and rbd_image_size are being passed in ns add

Reviewed-by: Afreen Misbah <afreen@ibm.com>

commit | commitdiff | tree

bluikko [Tue, 7 Oct 2025 05:14:33 +0000 (12:14 +0700)]

Merge pull request #64874 from bluikko/doc-manual-freebsd-deployment-improvements-install

doc/install: Small improvements in manual-freebsd-deployment.rst

commit | commitdiff | tree

Zack Cerza [Fri, 3 Oct 2025 22:06:57 +0000 (16:06 -0600)]

Dockerfile.build: Make sccache repo configurable

CEPH-BUILD-BRANCH: sccache-stats
DISTROS: centos9 noble
ARCHS: x86_64
FLAVORS: default
CI-CONTAINER: false
Signed-off-by: Zack Cerza <zack@cerza.org>

commit | commitdiff | tree

J. Eric Ivancich [Mon, 6 Oct 2025 16:01:05 +0000 (12:01 -0400)]

rgw: bump up versions of Apache Arrow and utf8proc submodules

Arrow goes up to 17.0.0 and utf8proc to 2.11.0.

arrow submodule
utf8proc submodule

Signed-off-by: J. Eric Ivancich <ivancich@redhat.com>

commit | commitdiff | tree

Anthony D'Atri [Mon, 6 Oct 2025 15:48:28 +0000 (11:48 -0400)]

Merge pull request #64744 from bluikko/doc-services-formatting-cephadm

doc/cephadm/services: Improve formatting and small fixes in index.rst

commit | commitdiff | tree

Anthony D'Atri [Mon, 6 Oct 2025 15:01:52 +0000 (11:01 -0400)]

Merge pull request #64729 from bluikko/doc-ceph-dokan-warnings-plus-cephfs

doc: Fix Sphinx warnings&errors and use labels in 4 files for links

commit | commitdiff | tree

Ville Ojamo [Tue, 29 Jul 2025 06:23:07 +0000 (13:23 +0700)]

doc: Fix a Sphinx warning&error and use labels for links in 4 files

Fix Sphinx warning about missing empty line after a label in
cephfs/ceph-dokan.rst.

Fix Sphinx error about invalid indentation in releases/squid.rst.

Fix Sphinx warning about invalid bash prompt block in
dev/crimson/crimson.rst.
Also fix warnings about too short section title text underline and fix
an incorrect link name (and use "above" instead of "below").

Fix Sphinx warning about missing strong emphasis closure, put the
problematic star inside inline preformatted/code in dev/blkin.rst.

Fix Sphinx warning about explicit markup, add the required empty line in
cephfs/mds-config-ref.rst.

Fix Sphinx error about invalid table in radosgw/s3/bucketops.rst.

Use the label at the beginning of the document in cephfs/ceph-dokan.rst
for hyperlinking with :ref: instead of using "external links" feature
from cephfs/createfs.rst cephfs/mount-prerequisites.rst
install/windows-install.rst.

Also use existing label in cephfs/mount-using-kernel-driver.rst
similarly from cephfs/createfs.rst cephfs/mount-prerequisites.rst.

Also use existing label in man/8/cephfs-shell.rst similarly from
cephfs/index.rst.

Add a label in cephfs/mount-using-fuse.rst and use it from
cephfs/createfs.rst cephfs/mount-prerequisites.rst.

Remove space at the end of line in cephfs/createfs.rst
cephfs/mount-prerequisites.rst cephfs/mount-using-fuse.rst
cephfs/cephfs-fuse.rst cephfs/index.rst.

Signed-off-by: Ville Ojamo <14869000+bluikko@users.noreply.github.com>

commit | commitdiff | tree

Connor Fawcett [Mon, 29 Sep 2025 01:21:16 +0000 (02:21 +0100)]

docs/dev: Add design document with information on proposed design for pool migration

Signed-off-by: Connor Fawcett <connorfa@uk.ibm.com>

commit | commitdiff | tree

Nizamudeen A [Mon, 6 Oct 2025 10:03:03 +0000 (15:33 +0530)]

Merge pull request #65627 from rhcs-dashboard/fix-401-error

mgr/dashboard: fix prometheus API error when not configured

Reviewed-by: Afreen Misbah <afreen@ibm.com>

commit | commitdiff | tree

Shraddha Agrawal [Mon, 6 Oct 2025 09:56:41 +0000 (15:26 +0530)]

Merge pull request #65545 from shraddhaag/wip-shraddhaag-availability-default

options/mon: disable availability tracking by default

Reviewed-by: Radoslaw Zarzynski <Radoslaw.Adam.Zarzynski@ibm.com>

commit | commitdiff | tree

Nizamudeen A [Wed, 17 Sep 2025 11:10:48 +0000 (16:40 +0530)]

mgr/dashboard: type the rbd mirror modes

Fixes: https://tracker.ceph.com/issues/72458
Signed-off-by: Nizamudeen A <nia@redhat.com>

commit | commitdiff | tree

Shweta Bhosale [Tue, 23 Sep 2025 15:50:04 +0000 (21:20 +0530)]

mgr/cephadm: Cephadm support for NFS-Ganesha TLS configuration

Fixes: https://tracker.ceph.com/issues/73035
Signed-off-by: Shweta Bhosale <Shweta.Bhosale1@ibm.com>

commit | commitdiff | tree

Shweta Bhosale [Tue, 23 Sep 2025 12:55:11 +0000 (18:25 +0530)]

mgr/cephadm: Adding support for custom CA cert

Fixes: https://tracker.ceph.com/issues/73035
Signed-off-by: Shweta Bhosale <Shweta.Bhosale1@ibm.com>

commit | commitdiff | tree

Alex Ainscow [Mon, 6 Oct 2025 07:13:05 +0000 (08:13 +0100)]

Merge pull request #65662 from aainscow/ec_direct_reads_docs

docs: Split Reads design (EC Direct Reads and Replica splits)

commit | commitdiff | tree

anrao19 [Mon, 6 Oct 2025 05:50:59 +0000 (11:20 +0530)]

Merge pull request #65606 from VVoidV/fix-stringnotequals-logic

rgw/iam: fix NotEquals handling for multiple values

commit | commitdiff | tree

Ville Ojamo [Tue, 29 Jul 2025 15:56:38 +0000 (22:56 +0700)]

doc/cephadm/services: Improve formatting and small fixes in index.rst

Indent block formatting to the left margin of the current paragraph.
And indent with a consistent amount of spaces.
Fix some text paragraphs indented too much.

Remove spaces at the end of lines.
Remove a stray space preceding a comma and in the middle of a hostname.
Add some full stops and colons where suitable.

Use title case in section titles consistently.
Fix some capitalization errors in text: YAML, CPU, some words starting
with capital case in the middle of a sentence.
Always use "service specification" (capitalized normally) instead of
also sometimes "Service Specification" or "Services Specification".
Cephadm the concept was sometimes capitalized even in the middle of a
sentence, sometimes not capitalized even in the beginning of a sentence.
Only capitalize it at the beginning of a sentence.
Capitalize "MON" when it refers to daemon/service and not to a
configuration parameter, for consistency with other such services.

Use double backticks instead of single backtick or otherwise for data
like warning names, CLI commands, file names.

Don't direct to add a space after "regex:" for a pattern, which
contradicted the next sample.

Try to improve a sentence to also mention the host list after the count
in an example command.

Use privileged user prompt consistently for privileged example CLI
commands.

Signed-off-by: Ville Ojamo <14869000+bluikko@users.noreply.github.com>

commit | commitdiff | tree

anrao19 [Mon, 6 Oct 2025 05:47:32 +0000 (11:17 +0530)]

Merge pull request #64971 from edwinzrodriguez/ceph-wip-72480

rgw/s3select: Disable stringop truncation warning for s3select.h

commit | commitdiff | tree

Rishabh Dave [Tue, 19 Aug 2025 18:40:09 +0000 (00:10 +0530)]

pybind: convert ceph errno to host-based errno

Fixes: https://tracker.ceph.com/issues/72401
Signed-off-by: Rishabh Dave <ridave@redhat.com>

commit | commitdiff | tree

Rishabh Dave [Wed, 20 Aug 2025 07:41:04 +0000 (13:11 +0530)]

src/include: move ceph_to_hostos_errno() to separate header file

Including src/include/types.h in src/pybind/cephfs/types.pxd leads to
compilation error: "fatal error: acconfig.h: No such file or directory".
types.h as well as int_types.h include acconfig.h header file.

Move the code to be included in types.pxd to a separate file where
acconfig.h won't be included, thus preventing this error.

Signed-off-by: Rishabh Dave <ridave@redhat.com>

commit | commitdiff | tree

Rishabh Dave [Sun, 31 Aug 2025 18:50:19 +0000 (00:20 +0530)]

qa: set -x for qa/workunits/libcephfs/test.sh

LibCephFS unit tests are compiled into different binary files and run
after another but without logging name of the binary being executed,
which can make it bit difficult to find out which binary/test group is
being run. Therefore "set -x" in the script so that binary name/test
group is printed before tests run.

Signed-off-by: Rishabh Dave <ridave@redhat.com>

commit | commitdiff | tree

Rishabh Dave [Fri, 3 Oct 2025 11:24:40 +0000 (16:54 +0530)]

Merge pull request #65104 from edwinzrodriguez/ceph-wip-72556

mds: Return ceph.dir.subvolume vxattr

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-ny: Christopher Hoffman <choffman@redhat.com>
Reviewed-by: Rishabh Dave <ridave@redhat.com>

Unnamed repository; edit this file 'description' to name the repository.

RSS Atom