]>
git.apps.os.sepia.ceph.com Git - ceph.git/log
Paul Cuzner [Thu, 15 Feb 2024 22:56:03 +0000 (11:56 +1300)]
ceph-mixins: Update MIB to include nvmeof notification
Signed-off-by: Paul Cuzner <pcuzner@ibm.com>
Paul Cuzner [Tue, 13 Feb 2024 00:55:24 +0000 (13:55 +1300)]
ceph-mixins: Add test cases for nvmeof alerts
Signed-off-by: Paul Cuzner <pcuzner@ibm.com>
Paul Cuzner [Tue, 13 Feb 2024 00:54:48 +0000 (13:54 +1300)]
ceph-mixins: nvmeof alerts added
Signed-off-by: Paul Cuzner <pcuzner@ibm.com>
Paul Cuzner [Fri, 9 Feb 2024 04:00:28 +0000 (17:00 +1300)]
ceph-mixins: Add nvmeof alerts
Signed-off-by: Paul Cuzner <pcuzner@ibm.com>
Paul Cuzner [Fri, 9 Feb 2024 03:56:58 +0000 (16:56 +1300)]
ceph-mixins: Add vars to support nvmeof alerts
Signed-off-by: Paul Cuzner <pcuzner@ibm.com>
Adam Kupczyk [Tue, 6 Feb 2024 14:32:48 +0000 (15:32 +0100)]
Merge pull request #55054 from pereman2/zns-remove
os/bluestore: remove zoned namespace support
It has never been finished and now its in the way of future improvements.
Yuval Lifshitz [Tue, 6 Feb 2024 12:09:37 +0000 (14:09 +0200)]
Merge pull request #55275 from qiuxinyidian/rgw-noti-dev
rgw: add topic owner user check when creating
reviewed-by: cbodley, kchheda3
zdover23 [Tue, 6 Feb 2024 09:36:55 +0000 (19:36 +1000)]
Merge pull request #55447 from zdover23/wip-doc-2024-02-05-rados-operations-pools
doc/rados: update PG guidance
Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
Zhipeng Li [Tue, 23 Jan 2024 06:50:52 +0000 (14:50 +0800)]
rgw: modify topic owner check when creating
add tests to cover topic policies
as well as behavior when no policies are defined
Fixes: https://tracker.ceph.com/issues/64124
Signed-off-by: Zhipeng Li <qiuxinyidian@gmail.com>
Casey Bodley [Mon, 5 Feb 2024 21:21:11 +0000 (21:21 +0000)]
Merge pull request #54311 from gabriel-samfira/do-not-escape-slash
RGW: fix cloud-sync not being able to sync folders
Reviewed-by: Casey Bodley <cbodley@redhat.com>
Casey Bodley [Mon, 5 Feb 2024 21:17:31 +0000 (21:17 +0000)]
Merge pull request #55252 from jzhu116-bloomberg/wip-21800
rgw/multisite: avoid writing multipart parts to the bucket index log
Reviewed-by: Casey Bodley <cbodley@redhat.com>
Casey Bodley [Mon, 5 Feb 2024 21:16:56 +0000 (21:16 +0000)]
Merge pull request #55293 from jzhu116-bloomberg/wip-63178
rgw/lc: pass in flag as an argument when calling function complete(..)
Reviewed-by: Casey Bodley <cbodley@redhat.com>
Casey Bodley [Mon, 5 Feb 2024 21:16:27 +0000 (21:16 +0000)]
Merge pull request #54661 from lightmelodies/rgw-gc
rgw: optimize gc chain size calculation
Reviewed-by: Casey Bodley <cbodley@redhat.com>
Ilya Dryomov [Mon, 5 Feb 2024 19:12:51 +0000 (20:12 +0100)]
Merge pull request #55452 from nbalacha/wip-typo-fix
rbd-mirror: fix a typo in NotifyAckPayload::dump()
Reviewed-by: Ramana Raja <rraja@redhat.com>
Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
Casey Bodley [Mon, 5 Feb 2024 17:59:46 +0000 (17:59 +0000)]
Merge pull request #55406 from cbodley/wip-64255
rgw: SiteConfig::load() falls back to local zonegroup
Reviewed-by: Shilpa Jagannath <smanjara@redhat.com>
Reviewed-by: Adam Emerson <aemerson@redhat.com>
Adam King [Mon, 5 Feb 2024 13:28:42 +0000 (08:28 -0500)]
Merge pull request #55448 from guits/subclass-thread
node-proxy: refactor entrypoint
Reviewed-by: Adam King <adking@redhat.com>
Ronen Friedman [Mon, 5 Feb 2024 12:21:32 +0000 (14:21 +0200)]
Merge pull request #55280 from ronen-fr/wip-rf-extra-lg
osd: remove superfluous '>' in spg_t formatter
Reviewed-by: Samuel Just <sjust@redhat.com>
Yuval Lifshitz [Mon, 5 Feb 2024 12:16:55 +0000 (14:16 +0200)]
Merge pull request #55414 from yuvalif/wip_yuval_url_decode_sns
rgw/rest: fix url decode of post params passed as attributes
reviewed-by: cbodley
N Balachandran [Mon, 5 Feb 2024 09:46:17 +0000 (15:16 +0530)]
rbd-mirror: fixed a typo in instance_watcher
Fixes a typo in NotifyAckPayload::dump().
Signed-off-by: N Balachandran <nibalach@redhat.com>
Guillaume Abrioux [Sun, 4 Feb 2024 19:11:41 +0000 (19:11 +0000)]
node-proxy: refactor entrypoint
This commit introduces a major refactor of the main
entrypoint.
- subclass threading.Thread:
- Introduce a new class `BaseThread()` that is a
`threading.Thread()` abstraction class in order
to monitor the different threads.
- `BaseSystem()` inherits from `BaseThread()`.
- Handle `SIGTERM` signal in order to gracefully shutdown
node-proxy (make threads exit gracefully, log out from RedFish API, etc.)
Additionally, this:
- drops the class `Logger()` from util.py which
was not adding value. It is now replaced with a simple `get_logger()`
function.
- changes the node-proxy API port from 8080 to 9456
(8080 being widely used for frontend apps...)
- changes the container entrypoint in order to use the
`ceph-node-proxy` binary from the packaging
Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>
Zac Dover [Sun, 4 Feb 2024 15:36:10 +0000 (01:36 +1000)]
doc/rados: update PG guidance
Update the "Creating a Pool" section of doc/rados/operations/pools.rst
so that the documentation no longer insists that the user change the
values of "osd_pool_default_pg_num" and "osd_pool_default_pgp_num".
See also: https://github.com/ceph/ceph/pull/55419
Tracker: https://tracker.ceph.com/issues/64259
Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
Kefu Chai [Mon, 5 Feb 2024 02:20:07 +0000 (10:20 +0800)]
Merge pull request #55089 from cbodley/wip-boost-mirrors
make-dist: update boost mirrors
Reviewed-by: Kefu Chai <tchaikov@gmail.com>
zdover23 [Sun, 4 Feb 2024 14:44:35 +0000 (00:44 +1000)]
Merge pull request #55434 from anthonyeleven/rgwyamlin
src/common/options: Correct typo in rgw.yaml.in
Reviewed-by: Zac Dover <zac.dover@proton.me>
Matan Breizman [Sun, 4 Feb 2024 09:45:43 +0000 (11:45 +0200)]
Merge pull request #55276 from Matan-B/wip-matanb-crimson-alienstore-cores
crimson: rework CPU pinning
Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
zdover23 [Sat, 3 Feb 2024 22:25:50 +0000 (08:25 +1000)]
Merge pull request #55419 from zdover23/wip-doc-2024-02-02-rados-config-pool-pg-config-ref
doc/rados: update config for autoscaler
Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
Anthony D'Atri [Sat, 3 Feb 2024 07:18:25 +0000 (02:18 -0500)]
src/common/options: Correct typo in rgw.yaml.in
Signed-off-by: Anthony D'Atri <anthonyeleven@users.noreply.github.com>
Zac Dover [Fri, 2 Feb 2024 01:53:45 +0000 (11:53 +1000)]
doc/rados: update config for autoscaler
Update doc/rados/configuration/pool-pg-config-ref.rst to account for the
behavior of autoscaler.
Previously, this file was last meaningfully altered in 2013, prior to
the invention of autoscaler. A recent confusion was brought to my
attention on the Ceph Slack whereby a user attempted to alter the
default values of a Quincy cluster, as suggested in this documentation.
That alteration caused Ceph to throw the error "Error ERANGE: 'pgp_num'
must be greater than 0 and lower or equal than 'pg_num', which in this
case is one" and a related "rgw_init_ioctx ERROR" reading in part
"Numerical result out of range". The user removed the
"osd_pool_default_pgp_num" configuration line from ceph.conf and the
cluster worked as expected. I presume that this is because the removal
of this configuration line allowed autoscaler to work as intended.
Fixes: https://tracker.ceph.com/issues/64259
Co-authored-by: David Orman <ormandj@corenode.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
Casey Bodley [Wed, 31 Jan 2024 19:29:43 +0000 (14:29 -0500)]
rgw: SiteConfig::load() falls back to local zonegroup
allow radosgw-admin commands like 'user create' to operate on a new zone
that hasn't been committed to the period yet. this follows similar logic
in RGWSI_Zone::do_start()
Fixes: https://tracker.ceph.com/issues/64255
Signed-off-by: Casey Bodley <cbodley@redhat.com>
Adam King [Fri, 2 Feb 2024 18:57:49 +0000 (13:57 -0500)]
Merge pull request #54312 from dvanders/dvanders_mcltf_true
qa/tasks/cephadm: enable mon_cluster_log_to_file
Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
Casey Bodley [Fri, 2 Feb 2024 16:37:53 +0000 (16:37 +0000)]
Merge pull request #54837 from imtzw/urlencode
rgw: fix list obj v1 nextmarker
Reviewed-by: Matt Benjamin <mbenjamin@redhat.com>
Reviewed-by: Casey Bodley <cbodley@redhat.com>
Casey Bodley [Fri, 2 Feb 2024 15:42:00 +0000 (15:42 +0000)]
Merge pull request #54867 from BBoozmen/oozmen_lc_subsystem
rgw/common: add rgw lifecycle specific debug log subsystem
Reviewed-by: Matt Benjamin <mbenjamin@redhat.com>
Reviewed-by: Jiffin Tony Thottan <jthottan@redhat.com>
Casey Bodley [Fri, 2 Feb 2024 15:41:17 +0000 (15:41 +0000)]
Merge pull request #55162 from cfsnyder/wip-64014-cfsnyder-pacific
rgw: fix issue with concurrent versioned deletes leaving behind olh entries
Reviewed-by: J. Eric Ivancich <ivancich@redhat.com>
Casey Bodley [Fri, 2 Feb 2024 15:40:48 +0000 (15:40 +0000)]
Merge pull request #55229 from tobias-urdin/http-error-keystone-issue-admin-token-request
rgw: fix http error check in keystone issue_admin_token_request
Reviewed-by: Casey Bodley <cbodley@redhat.com>
baum [Fri, 2 Feb 2024 07:54:28 +0000 (09:54 +0200)]
Merge pull request #55330 from baum/centos9-install-build-deps
🧹build dependencies: centos9
Venky Shankar [Fri, 2 Feb 2024 05:05:00 +0000 (10:35 +0530)]
Merge PR #55233 into main
* refs/pull/55233/head:
qa: use centos 9.stream for cephfs stock kernel testing
Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Rishabh Dave <ridave@redhat.com>
Casey Bodley [Thu, 1 Feb 2024 19:40:57 +0000 (19:40 +0000)]
Merge pull request #53901 from cbodley/wip-qa-container-distros-s
qa/distros: bump container host distros from centos 8->9
Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
Reviewed-by: Venky Shankar <vshankar@redhat.com>
Ilya Dryomov [Thu, 1 Feb 2024 19:29:58 +0000 (20:29 +0100)]
Merge pull request #55413 from petrutlucian94/flipped_tid
msg: update MOSDOp() to use ceph_tid_t instead of long
Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
Matan Breizman [Tue, 30 Jan 2024 17:53:16 +0000 (17:53 +0000)]
crimson: introduce crimson_seastar_cpu_cores
seastar::smp::count is deduced from the cpuset provided.
from seastar/src/core/reactor:
```
if (smp_opts.smp) {
smp::count = smp_opts.smp.get_value();
} else {
smp::count = cpu_set.size();
}
```
Co-authored-by: Yingxin Cheng <yingxin.cheng@intel.com>
Signed-off-by: Matan Breizman <mbreizma@redhat.com>
Yuval Lifshitz [Thu, 1 Feb 2024 15:55:59 +0000 (15:55 +0000)]
rgw/rest: fix url decode of post params passed as attributes
this is fixing a regression with the SNS policies
cause by:
4bdc5d18dd68b95c6ccd4c0e77a1bd04ad86dbb8
the changes to the test code is to accomodate different
boto3 versions
Signed-off-by: Yuval Lifshitz <ylifshit@redhat.com>
Lucian Petrut [Thu, 1 Feb 2024 14:40:03 +0000 (14:40 +0000)]
msg: update MOSDOp() to use ceph_tid_t instead of long
The MOSDOp constructor receives the the transaction ID as a long
instead of ceph_tid_t.
The issue is that "long" uses 32b on Windows instead of 64 bits,
so it flips after about 2 billion requests. At that point, the OSD
replies are dropped because of transaction ID mismatches.
We'll solve the issue by using the correct type for the transaction
id, specifically ceph_tid_t.
Fixes: https://tracker.ceph.com/issues/64291
Signed-off-by: Lucian Petrut <lpetrut@cloudbasesolutions.com>
Ivo Almeida [Thu, 1 Feb 2024 13:48:17 +0000 (10:48 -0300)]
Merge pull request #55373 from ivoalmeida/snapshot-schedule-form-directory-typeahead
Snapshot schedule form directory typeahead
Reviewed-by: Nizamudeen A <nia@redhat.com>
Redouane Kachach [Thu, 1 Feb 2024 11:35:39 +0000 (12:35 +0100)]
Merge pull request #55393 from rkachach/fix_issue_64266
mgr/rook: adding deployment to ceph image built for rook e2e testing
Matan Breizman [Tue, 23 Jan 2024 14:08:48 +0000 (14:08 +0000)]
vstarh.sh: Crimson - set crimson_alien_thread_cpu_cores
Signed-off-by: Matan Breizman <mbreizma@redhat.com>
Matan Breizman [Tue, 23 Jan 2024 14:06:01 +0000 (14:06 +0000)]
crimson/os/alienstore: cleanup N_CORES_FOR_SEASTAR constant
No CPU pinning if crimson_alien_thread_cpu_cores is not set.
Signed-off-by: Matan Breizman <mbreizma@redhat.com>
Redouane Kachach [Wed, 31 Jan 2024 09:48:35 +0000 (10:48 +0100)]
mgr/rook: adding deployment to ceph image built for rook e2e testing
Fixes: https://tracker.ceph.com/issues/64266
Signed-off-by: Redouane Kachach <rkachach@redhat.com>
Casey Bodley [Wed, 31 Jan 2024 22:15:33 +0000 (22:15 +0000)]
Merge pull request #47723 from mdw-at-linuxbox/wip-master-tempurl
rgw: swift: tempurl fixes for ceph
Reviewed-by: Casey Bodley <cbodley@redhat.com>
Anthony D'Atri [Wed, 31 Jan 2024 20:08:26 +0000 (15:08 -0500)]
Merge pull request #55403 from cbodley/wip-doc-rgw-topic-attrs
doc/rgw: fix Attributes index in CreateTopic example
Adam King [Wed, 31 Jan 2024 18:28:50 +0000 (13:28 -0500)]
Merge pull request #54742 from guits/node-proxy
orch: implement hardware monitoring
Reviewed-by: Juan Miguel Olmo Martínez <jolmomar@redhat.com>
Ilya Dryomov [Wed, 31 Jan 2024 17:32:54 +0000 (18:32 +0100)]
Merge pull request #55397 from jagombar/fixInvalidSnapId2
librbd: return ENOENT from Snapshot::get_timestamp for nonexistent snap_id
Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
Casey Bodley [Wed, 31 Jan 2024 17:31:02 +0000 (12:31 -0500)]
doc/rgw: fix Attributes index in CreateTopic example
Signed-off-by: Casey Bodley <cbodley@redhat.com>
Ivo Almeida [Mon, 29 Jan 2024 10:58:43 +0000 (10:58 +0000)]
mgr/dashboard: added dir search to snap schdl form
Fixes: https://tracker.ceph.com/issues/64246
Signed-off-by: Ivo Almeida <ialmeida@redhat.com>
Nizamudeen A [Wed, 31 Jan 2024 16:30:59 +0000 (22:00 +0530)]
Merge pull request #54781 from rhcs-dashboard/nvmeof-integration
mgr/dashboard: implement APIs for nvmeof management
Reviewed-by: Avan Thakkar <athakkar@redhat.com>
Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Nizamudeen A [Wed, 31 Jan 2024 15:57:22 +0000 (21:27 +0530)]
Merge pull request #54900 from ivoalmeida/snapshot-schedule-create
added snap schedule form
Reviewed-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>
Reviewed-by: Ankush Behl <cloudbehl@gmail.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>
Ronen Friedman [Wed, 31 Jan 2024 15:56:43 +0000 (17:56 +0200)]
Merge pull request #55340 from ronen-fr/wip-rf-reserver2
osd/scrub: use an AsyncReserver to handle scrub reservations on the replica side
Reviewed-by: Samuel Just <sjust@redhat.com>-
Josh Salomon [Wed, 31 Jan 2024 15:26:48 +0000 (17:26 +0200)]
Merge pull request #55198 from JoshSalomon/balancer-perf
osd: Read balancer for devices with different sizes (performance)
Guillaume Abrioux [Wed, 31 Jan 2024 14:23:44 +0000 (15:23 +0100)]
doc: add node-proxy documentation
This commit adds some documentation about the
'hardware inventory / monitoring' feature (node-proxy agent).
Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>
Anthony D'Atri [Wed, 31 Jan 2024 13:49:55 +0000 (08:49 -0500)]
Merge pull request #55396 from zdover23/wip-doc-2024-01-31-arch-monitors
doc/architecture: improve some paragraphs
Ali Maredia [Wed, 31 Jan 2024 13:36:42 +0000 (08:36 -0500)]
Merge pull request #55383 from ceph/cpp-redis-cxx-standard
cpp_redis: update to a version with the CMAKE_CXX_STANDARD fixed
Reviewed-by: Ali Maredia <amaredia@redhat.com>
jagombar [Tue, 30 Jan 2024 18:25:53 +0000 (18:25 +0000)]
librbd: return ENOENT from Snapshot::get_timestamp for nonexistent snap_id
Fixes: https://tracker.ceph.com/issues/47287
Signed-off-by: John Agombar <agombar@uk.ibm.com>
Zac Dover [Tue, 30 Jan 2024 09:51:53 +0000 (19:51 +1000)]
doc/architecture: improve some paragraphs
Improve paragraphs under the heading "The Ceph Storage Cluster". Remove
a sentence that was pleonastic in its context in the paragraph.
Signed-off-by: Zac Dover <zac.dover@proton.me>
Ivo Almeida [Wed, 13 Dec 2023 01:08:52 +0000 (01:08 +0000)]
mgr/dashboard: added snap schedule form
Fixes: https://tracker.ceph.com/issues/63827
Signed-off-by: Ivo Almeida <ialmeida@redhat.com>
Matan Breizman [Wed, 31 Jan 2024 09:14:02 +0000 (11:14 +0200)]
Merge pull request #55305 from Matan-B/wip-matanb-crimson-do_delete_work
crimson/osd/pg: do_delete_work use claim_and_reset
Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Xuehan Xu <xuxuehan@qianxin.com>
Leonid Usov [Tue, 30 Jan 2024 22:55:45 +0000 (00:55 +0200)]
cpp_redis: update to a version with the CMAKE_CXX_STANDARD fixed
This updates the cpp_redis submodule to a version
which contains https://github.com/ceph/cpp_redis/pull/5
Signed-off-by: Leonid Usov <leonid.usov@ibm.com>
Ronen Friedman [Sun, 28 Jan 2024 16:48:18 +0000 (10:48 -0600)]
test: fix test_scrub_sched following recent scrub changes
Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
Ronen Friedman [Sun, 28 Jan 2024 15:35:07 +0000 (09:35 -0600)]
osd/scrub: restore 'dump_scrub_reservations' functionality
As ScrubResources is no longer involved in remote reservations, some
of the data listed by 'dump_scrub_reservations' is now collected by
OsdScrub itself (prior to this change, OsdScrub just forwarded the
request to ScrubResources).
Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
Ronen Friedman [Sun, 28 Jan 2024 11:45:50 +0000 (05:45 -0600)]
osd/scrub: control reservation queueing using a config option
Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
Ronen Friedman [Thu, 25 Jan 2024 19:05:01 +0000 (13:05 -0600)]
osd/scrub: directly manage remote reservations in the FSM
The FSM now interacts with the scrub_reserver directly.
Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
Nizamudeen A [Thu, 25 Jan 2024 07:43:41 +0000 (13:13 +0530)]
vstart: add nvmeof_gw to the vstart script
so that it can be later used by the dashboard to configure the nvmeof
through UI
and create rbd pool in UI
Fixes: https://tracker.ceph.com/issues/64201
Signed-off-by: Nizamudeen A <nia@redhat.com>
Laura Flores [Wed, 31 Jan 2024 03:43:35 +0000 (21:43 -0600)]
Merge pull request #55379 from ljflores/wip-revert-labler-changes
Laura Flores [Tue, 30 Jan 2024 22:06:03 +0000 (22:06 +0000)]
tools: add 'osd-size-aware' mode to osdmaptool
Sample command:
`osdmaptool om --read out.txt --read-pool cephfs.a.data --osd-size-aware`
Signed-off-by: Laura Flores <lflores@ibm.com>
Laura Flores [Tue, 30 Jan 2024 20:38:40 +0000 (20:38 +0000)]
mgr/balancer: add new keys to balancer mgr module
Signed-off-by: Laura Flores <lflores@ibm.com>
zdover23 [Wed, 31 Jan 2024 00:48:35 +0000 (10:48 +1000)]
Merge pull request #55372 from zdover23/wip-doc-2024-01-30-architecture-typo
doc/architecture.rst - fix typo
Reviewed-by: Cole Mitchell <cole.mitchell.ceph@gmail.com>
Josh Salomon [Wed, 24 Jan 2024 12:46:07 +0000 (14:46 +0200)]
osd: unit tests for read balance osd size aware score
Signed-off-by: Josh Salomon <41079547+JoshSalomon@users.noreply.github.com>
Josh Salomon [Wed, 24 Jan 2024 12:40:53 +0000 (14:40 +0200)]
osd: Add score for read balance osd size aware policy
This score works for pools in which the read_ratio
value is set.
Current limitations:
- This mechanism ignores osd read affinty
- There is a plan adding support for read affinity 0
in the next version.
- This mechanism works only when all PGs are full
- If read_ration is not set - the existing mechanism (named
fair score) is used.
Signed-off-by: Josh Salomon <41079547+JoshSalomon@users.noreply.github.com>
Signed-off-by: Josh Salomon <41079547+JoshSalomon@users.noreply.github.com>
Josh Salomon [Tue, 16 Jan 2024 18:45:32 +0000 (20:45 +0200)]
osd: Unit tests for read balancer on OSDs with different sizes
Test cases for the read balancer which takes osd sizes into account.
Some balancing code refactoring and reorg for code that is used in
multiple tests
Signed-off-by: Josh Salomon <41079547+JoshSalomon@users.noreply.github.com>
Josh Salomon [Tue, 16 Jan 2024 18:33:47 +0000 (20:33 +0200)]
osd: Read balancer for OSDs with different sizes
This commit adds calculation for desired primary distribution which
takes into account the osd size. This way smaller OSDs can take more
read operations (by adding more primaries) and the larger OSDs take less
primaries and the load of the cluater can increase. (This feature offset
a bit the weakest link in the chain effect under some conditions). In
order to calculate the loads correctly there is a need to know the
read/write ratio for the pool, and this commit assumes the read_ratio
parameter is available for the pool.
Signed-off-by: Josh Salomon <41079547+JoshSalomon@users.noreply.github.com>
Josh Salomon [Tue, 26 Dec 2023 08:41:18 +0000 (10:41 +0200)]
osd: Add 'read_ratio' pool parameterr
This parameter is used for better read balancing with non identical
devices.
- This parameter is controlled using the commands 'ceph osd pool set/get'
- This parameter is applicable only for replicated pools
- Valid values are integers in the range [0..100] and represent the
percentage of read IOs out of all IOs in the pool
- Value of 0 unsets this parameter and the value will be the default
value (this is the generic behavior of the command 'ceph osd pool
set'
- default value can be set by config parameter
`osd_pool_default_read_ratio`
Signed-off-by: Josh Salomon <41079547+JoshSalomon@users.noreply.github.com>
Laura Flores [Tue, 30 Jan 2024 17:32:20 +0000 (11:32 -0600)]
.github/workflows: revert version bump to labler action
Signed-off-by: Laura Flores <lflores@ibm.com>
Laura Flores [Tue, 30 Jan 2024 17:13:30 +0000 (11:13 -0600)]
Merge pull request #55364 from ljflores/wip-pr-triage
.github/workflows: disable sync-labels in pr-triage
Guillaume Abrioux [Tue, 30 Jan 2024 16:38:31 +0000 (16:38 +0000)]
agent/node-proxy: fix wrong host name used in data endpoint
data['cephx']['name'] will return something like:
node-proxy.hostname123
the prefix "node-proxy." has the be removed otherwise there will be
a mismatch between what is actually expected.
Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>
Guillaume Abrioux [Thu, 25 Jan 2024 14:34:26 +0000 (14:34 +0000)]
mgr/cephadm: update node-proxy unit tests
The recent migration to a separate daemon implied
some changes which have broken these tests.
This commit fixes them.
Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>
Guillaume Abrioux [Wed, 24 Jan 2024 15:08:14 +0000 (15:08 +0000)]
mgr/cephadm: add a new config option 'oob_default_addr'
So there's a default value (169.254.1.1) which is the default
address for the 'OS to iDrac pass-through' interface.
Given that node-proxy will reach the RedFish API through this interface,
we can make users avoid to pass that addr when providing the host spec
at bootstrap time.
Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>
Casey Bodley [Thu, 25 Jan 2024 17:41:39 +0000 (12:41 -0500)]
qa/distros: add supported-container-hosts for cephadm and upgrade suites
Signed-off-by: Casey Bodley <cbodley@redhat.com>
Adam Kupczyk [Tue, 30 Jan 2024 14:43:49 +0000 (15:43 +0100)]
Merge pull request #53610 from Patty8122/documentation-update-fsck-flagdeep
doc/man/8: Add valid options in ceph-bluestore-tool.rst
Venky Shankar [Tue, 30 Jan 2024 14:10:35 +0000 (19:40 +0530)]
Merge PR #54515 into main
* refs/pull/54515/head:
qa: get_global_id() returns a string (global_id)
qa: correct usage of DEBUGFS_META_DIR in dedent
Reviewed-by: Rishabh Dave <ridave@redhat.com>
Reviewed-by: Dhairya Parmar <dparmar@redhat.com>
Redouane Kachach [Tue, 30 Jan 2024 13:30:56 +0000 (14:30 +0100)]
Merge pull request #55313 from rkachach/fix_issue_64183
mgr/rook: adding metrics monitoring e2e testing
Zac Dover [Tue, 30 Jan 2024 12:24:11 +0000 (22:24 +1000)]
doc/architecture.rst - fix typo
s/requies/requires
Signed-off-by: Zac Dover <zac.dover@proton.me>
Ivo Almeida [Tue, 30 Jan 2024 11:30:51 +0000 (08:30 -0300)]
Merge pull request #55284 from rhcs-dashboard/cephfs-clone-create
mgr/dashboard: create cephfs snapshot clone
Nizamudeen A [Fri, 12 Jan 2024 08:41:18 +0000 (14:11 +0530)]
mgr/dashboard: adapt to new nvmeof APIs
Fixes: https://tracker.ceph.com/issues/64201
Signed-off-by: Nizamudeen A <nia@redhat.com>
avanthakkar [Mon, 11 Dec 2023 13:20:54 +0000 (18:50 +0530)]
mgr/dashboard: introduce APIs for NvmeOf management
Fixes: https://tracker.ceph.com/issues/64201
Signed-off-by: avanthakkar <avanjohn@gmail.com>
Nizamudeen A [Tue, 5 Dec 2023 10:16:12 +0000 (15:46 +0530)]
mgr/dashboard: gRPC client for nvmeof communication
Created a gRPC client by utilising the protobuf file available in nvmeof
repo
Copied the file to this repo and generated its output.
Fixes: https://tracker.ceph.com/issues/64201
Signed-off-by: Nizamudeen A <nia@redhat.com>
Nizamudeen A [Tue, 5 Dec 2023 10:11:01 +0000 (15:41 +0530)]
mgr/cephadm: configure the dashboard gateways
cephadm configures the nvmeof gateways and add the gateways to a config
store which dashboard will later on fetch to make the grpc calls.
Fixes: https://tracker.ceph.com/issues/64201
Signed-off-by: Nizamudeen A <nia@redhat.com>
Rongqi Sun [Tue, 30 Jan 2024 09:31:52 +0000 (17:31 +0800)]
Merge pull request #55351 from rosinL/wip-fix-64122
script/run-make: install lvm2 for make check cephadm test
Nizamudeen A [Tue, 30 Jan 2024 09:09:02 +0000 (14:39 +0530)]
Merge pull request #55228 from rhcs-dashboard/fix-64081-main
mgr/dashboard: Export rgw multisite status via the API
Reviewed-by: Avan Thakkar <athakkar@redhat.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>
Ronen Friedman [Sun, 28 Jan 2024 11:02:31 +0000 (05:02 -0600)]
osd/scrub: remove remote reservation code from ScrubResources
Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
Ronen Friedman [Tue, 30 Jan 2024 08:55:13 +0000 (02:55 -0600)]
common: allow AsyncReserver requests with no callbacks
As no callbacks are needed for request_reservation_or_fail(),
the synchronous request API.
Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
Ronen Friedman [Sun, 28 Jan 2024 07:25:38 +0000 (01:25 -0600)]
osd/scrub: add a general data-carrying event type to the FSM
Based on Sam's Crimson FSM's 'value-event'.
Co-authored-by: Sam Just <sjust@redhat.com>
Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
Ronen Friedman [Sun, 28 Jan 2024 07:42:09 +0000 (01:42 -0600)]
osd/scrub: define the scrub_reserver to track remote reservations
Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
Venky Shankar [Tue, 30 Jan 2024 07:40:19 +0000 (13:10 +0530)]
Merge PR #52652 into main
* refs/pull/52652/head:
PendingReleaseNotes: add note about new mdlog trimming configurations
mds: drive mdlog trimming via a separate thread
mds: allow runtime modification of mdlog trimming configuration
mds: remove a bunch of heuristics from MDLog::trim()
mds: add mdlog trimming threshold and decay counter
Reviewed-by: Leonid Usov <leonid.usov@ibm.com>
Venky Shankar [Tue, 30 Jan 2024 07:28:48 +0000 (12:58 +0530)]
Merge PR #55184 into main
* refs/pull/55184/head:
qa: remove redundant test test_volumes.TestRenameCmd.test_rename_when_fs_is_online
Reviewed-by: Christopher Hoffman <choffman@redhat.com>
Reviewed-by: Rishabh Dave <ridave@redhat.com>