git-server-git.apps.pok.os.sepia.ceph.com Git - ceph.git/log

Tobias Urdin [Tue, 6 Feb 2024 07:50:55 +0000 (07:50 +0000)]

rgw/auth: ignoring signatures for HTTP OPTIONS calls

Before [1] we always sent all HTTP OPTIONS requests to
the S3AnonymousEngine and ignored any provided AWSv4
credentials sent in the request.

That PR changed this so that if we got credentials in the
request we instead sent it through the authentication
code, so that HTTP OPTIONS requests for tenanted users
would start working (we need to resolve the tenant, also
called bucket tenant in the code, and we can't rely on the
bucket name alone since it will not be found).

We solved this by changing the canonical HTTP method used
when calculating the AWSv4 signature to the value of the
access-control-request-method header, which worked well.

This change did not take into account that when you generate
a presigned URL for a put_object request you can also pass in
extra parameters like a canned ACL [2] to the Params variable
in, for example, boto3's generate_presigned_url().

Doing that will cause the client to add the x-amz-acl header
to x-amz-signedheaders and also use that in their signature
calculation.

When doing an HTTP OPTIONS call for CORS on that presigned URL
the browser will never send an x-amz-acl header with the correct
data, since that is something the actual PUT request should
include later. That HTTP OPTIONS call should therefore pass even
though the signature can never be calculated correctly server-side,
as verified against AWS S3 in tracker [3].
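
The mismatch can be illustrated with a minimal sketch. This is not RGW's
or boto3's code: the canonical request is simplified, sign() replaces the
full AWSv4 key derivation with a single HMAC, and the host, path, and
secret are hypothetical. It only shows that a request lacking the signed
x-amz-acl header cannot reproduce the signature of the presigned PUT.

```python
import hashlib
import hmac


def canonical_request(method, path, query, headers, signed_headers):
    """Build a simplified AWSv4-style canonical request string."""
    lowered = {k.lower(): v.strip() for k, v in headers.items()}
    # Only headers listed in signed_headers take part; a missing header
    # contributes an empty value, which changes the resulting hash.
    canonical_headers = "".join(
        f"{name}:{lowered.get(name, '')}\n" for name in signed_headers
    )
    return "\n".join([
        method,
        path,
        query,
        canonical_headers,
        ";".join(signed_headers),
        "UNSIGNED-PAYLOAD",
    ])


def sign(secret, request):
    """Assumption: one HMAC-SHA256 stands in for the real key derivation."""
    digest = hashlib.sha256(request.encode()).hexdigest()
    return hmac.new(secret.encode(), digest.encode(), hashlib.sha256).hexdigest()


SIGNED = ["host", "x-amz-acl"]      # what the presigned URL declares as signed
SECRET = "hypothetical-secret-key"  # illustrative only

# Signature as computed by the client that generated the presigned PUT URL.
put_sig = sign(SECRET, canonical_request(
    "PUT", "/bucket/key", "",
    {"Host": "rgw.example.com", "x-amz-acl": "public-read"}, SIGNED))

# The CORS preflight carries no x-amz-acl header, so even with the method
# taken from access-control-request-method (PUT) the result differs.
preflight_sig = sign(SECRET, canonical_request(
    "PUT", "/bucket/key", "", {"Host": "rgw.example.com"}, SIGNED))

signatures_match = put_sig == preflight_sig
```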

As a result, this patch skips the signature calculation when doing
EC2 auth using the LocalEngine, but we still need to pass the request
there in order to look up the user to support buckets in a tenant.

For the Keystone EC2 auth we're pretty much out of luck, in the sense
that Keystone's API itself requires us to send the AWSv4 signature in
the request with the access_key in order to obtain a token, and we
cannot leave the signature out. We also cannot spoof the signature from
rgw -> keystone since we don't have access to the secret_key if it's
not in our cache.

For that case we simply tell get_access_token() that if it's
an HTTP OPTIONS request and we find the access_key in the cache, we pull
that entry, skip verifying the signature and pass it on for validation.
This means that the cache must be warm if using Keystone auth and adding
extra params to a presigned URL.

This makes some of the commits in [1] partly redundant for EC2
LocalEngine auth, but we still need them for tenanted bucket support.

[1] https://github.com/ceph/ceph/pull/52673
[2] https://docs.aws.amazon.com/AmazonS3/latest/userguide/acl-overview.html#canned-acl
[3] https://tracker.ceph.com/issues/64308

Fixes: https://tracker.ceph.com/issues/64308
Signed-off-by: Tobias Urdin <tobias.urdin@binero.se>

Casey Bodley [Mon, 5 Feb 2024 21:21:11 +0000 (21:21 +0000)]

Merge pull request #54311 from gabriel-samfira/do-not-escape-slash

RGW: fix cloud-sync not being able to sync folders

Reviewed-by: Casey Bodley <cbodley@redhat.com>

Casey Bodley [Mon, 5 Feb 2024 21:17:31 +0000 (21:17 +0000)]

Merge pull request #55252 from jzhu116-bloomberg/wip-21800

rgw/multisite: avoid writing multipart parts to the bucket index log

Reviewed-by: Casey Bodley <cbodley@redhat.com>

Casey Bodley [Mon, 5 Feb 2024 21:16:56 +0000 (21:16 +0000)]

Merge pull request #55293 from jzhu116-bloomberg/wip-63178

rgw/lc: pass in flag as an argument when calling function complete(..)

Reviewed-by: Casey Bodley <cbodley@redhat.com>

Casey Bodley [Mon, 5 Feb 2024 21:16:27 +0000 (21:16 +0000)]

Merge pull request #54661 from lightmelodies/rgw-gc

rgw: optimize gc chain size calculation

Reviewed-by: Casey Bodley <cbodley@redhat.com>

Ilya Dryomov [Mon, 5 Feb 2024 19:12:51 +0000 (20:12 +0100)]

Merge pull request #55452 from nbalacha/wip-typo-fix

rbd-mirror: fix a typo in NotifyAckPayload::dump()

Reviewed-by: Ramana Raja <rraja@redhat.com>
Reviewed-by: Ilya Dryomov <idryomov@gmail.com>

Casey Bodley [Mon, 5 Feb 2024 17:59:46 +0000 (17:59 +0000)]

Merge pull request #55406 from cbodley/wip-64255

rgw: SiteConfig::load() falls back to local zonegroup

Reviewed-by: Shilpa Jagannath <smanjara@redhat.com>
Reviewed-by: Adam Emerson <aemerson@redhat.com>

Adam King [Mon, 5 Feb 2024 13:28:42 +0000 (08:28 -0500)]

Merge pull request #55448 from guits/subclass-thread

node-proxy: refactor entrypoint

Reviewed-by: Adam King <adking@redhat.com>

Ronen Friedman [Mon, 5 Feb 2024 12:21:32 +0000 (14:21 +0200)]

Merge pull request #55280 from ronen-fr/wip-rf-extra-lg

osd: remove superfluous '>' in spg_t formatter

Reviewed-by: Samuel Just <sjust@redhat.com>

Yuval Lifshitz [Mon, 5 Feb 2024 12:16:55 +0000 (14:16 +0200)]

Merge pull request #55414 from yuvalif/wip_yuval_url_decode_sns

rgw/rest: fix url decode of post params passed as attributes

reviewed-by: cbodley

N Balachandran [Mon, 5 Feb 2024 09:46:17 +0000 (15:16 +0530)]

rbd-mirror: fixed a typo in instance_watcher

Fixes a typo in NotifyAckPayload::dump().

Signed-off-by: N Balachandran <nibalach@redhat.com>

Guillaume Abrioux [Sun, 4 Feb 2024 19:11:41 +0000 (19:11 +0000)]

node-proxy: refactor entrypoint

This commit introduces a major refactor of the main
entrypoint.

- subclass threading.Thread:
  - Introduce a new class `BaseThread()` that is a
    `threading.Thread()` abstraction class in order
    to monitor the different threads.
  - `BaseSystem()` inherits from `BaseThread()`.
  - Handle the `SIGTERM` signal in order to shut down node-proxy
    gracefully (make threads exit cleanly, log out from the RedFish API, etc.)

Additionally, this:
  - drops the class `Logger()` from util.py which
    was not adding value. It is now replaced with a simple
    `get_logger()` function.
  - changes the node-proxy API port from 8080 to 9456
    (8080 being widely used for frontend apps...)
  - changes the container entrypoint in order to use the
    `ceph-node-proxy` binary from the packaging
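
A minimal sketch of the thread pattern described in this commit message,
assuming one stop event per thread. It is not the node-proxy code: the
method names main() and shutdown() and the helper make_sigterm_handler()
are hypothetical, and the polling body is left empty.

```python
import threading


class BaseThread(threading.Thread):
    """Hypothetical base class: a worker thread that can be asked to stop."""

    def __init__(self):
        super().__init__()
        # Do not name this attribute "_stop"; threading.Thread uses that name.
        self.stop_event = threading.Event()

    def run(self):
        # Call main() repeatedly until a shutdown has been requested.
        while not self.stop_event.is_set():
            self.main()
            self.stop_event.wait(0.01)

    def main(self):
        raise NotImplementedError

    def shutdown(self):
        self.stop_event.set()


class BaseSystem(BaseThread):
    """Hypothetical subclass standing in for the hardware polling thread."""

    def main(self):
        pass  # a real implementation would query the hardware API here


def make_sigterm_handler(threads):
    """Return a signal handler that asks every thread to exit gracefully."""
    def handler(signum, frame):
        for thread in threads:
            thread.shutdown()
    return handler
```

In an entrypoint, the handler would be registered from the main thread
with `signal.signal(signal.SIGTERM, make_sigterm_handler(threads))`, and
each thread joined afterwards.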

Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>

Kefu Chai [Mon, 5 Feb 2024 02:20:07 +0000 (10:20 +0800)]

Merge pull request #55089 from cbodley/wip-boost-mirrors

make-dist: update boost mirrors

Reviewed-by: Kefu Chai <tchaikov@gmail.com>

zdover23 [Sun, 4 Feb 2024 14:44:35 +0000 (00:44 +1000)]

Merge pull request #55434 from anthonyeleven/rgwyamlin

src/common/options: Correct typo in rgw.yaml.in

Reviewed-by: Zac Dover <zac.dover@proton.me>

Matan Breizman [Sun, 4 Feb 2024 09:45:43 +0000 (11:45 +0200)]

Merge pull request #55276 from Matan-B/wip-matanb-crimson-alienstore-cores

crimson: rework CPU pinning

Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>

zdover23 [Sat, 3 Feb 2024 22:25:50 +0000 (08:25 +1000)]

Merge pull request #55419 from zdover23/wip-doc-2024-02-02-rados-config-pool-pg-config-ref

doc/rados: update config for autoscaler

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

Anthony D'Atri [Sat, 3 Feb 2024 07:18:25 +0000 (02:18 -0500)]

src/common/options: Correct typo in rgw.yaml.in

Signed-off-by: Anthony D'Atri <anthonyeleven@users.noreply.github.com>

Zac Dover [Fri, 2 Feb 2024 01:53:45 +0000 (11:53 +1000)]

doc/rados: update config for autoscaler

Update doc/rados/configuration/pool-pg-config-ref.rst to account for the
behavior of autoscaler.

Previously, this file was last meaningfully altered in 2013, prior to
the invention of autoscaler. A recent confusion was brought to my
attention on the Ceph Slack whereby a user attempted to alter the
default values of a Quincy cluster, as suggested in this documentation.
That alteration caused Ceph to throw the error "Error ERANGE: 'pgp_num'
must be greater than 0 and lower or equal than 'pg_num', which in this
case is one" and a related "rgw_init_ioctx ERROR" reading in part
"Numerical result out of range". The user removed the
"osd_pool_default_pgp_num" configuration line from ceph.conf and the
cluster worked as expected. I presume that this is because the removal
of this configuration line allowed autoscaler to work as intended.
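
The constraint quoted in the error message can be expressed as a small
check. This is a sketch of the rule only, not Ceph's validation code; the
function name and the example values other than pg_num = 1 are
hypothetical.

```python
def check_pgp_num(pgp_num: int, pg_num: int) -> None:
    """Raise ValueError unless 0 < pgp_num <= pg_num."""
    if not 0 < pgp_num <= pg_num:
        raise ValueError(
            f"'pgp_num' ({pgp_num}) must be greater than 0 and "
            f"lower or equal than 'pg_num' ({pg_num})"
        )


# A pool that starts with pg_num = 1 accepts pgp_num = 1 ...
check_pgp_num(1, 1)

# ... but a larger configured default for pgp_num is out of range.
try:
    check_pgp_num(8, 1)
    out_of_range = False
except ValueError:
    out_of_range = True
```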

Fixes: https://tracker.ceph.com/issues/64259
Co-authored-by: David Orman <ormandj@corenode.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>

Casey Bodley [Wed, 31 Jan 2024 19:29:43 +0000 (14:29 -0500)]

rgw: SiteConfig::load() falls back to local zonegroup

allow radosgw-admin commands like 'user create' to operate on a new zone
that hasn't been committed to the period yet. this follows similar logic
in RGWSI_Zone::do_start()

Fixes: https://tracker.ceph.com/issues/64255
Signed-off-by: Casey Bodley <cbodley@redhat.com>

Adam King [Fri, 2 Feb 2024 18:57:49 +0000 (13:57 -0500)]

Merge pull request #54312 from dvanders/dvanders_mcltf_true

qa/tasks/cephadm: enable mon_cluster_log_to_file

Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: Ilya Dryomov <idryomov@gmail.com>

Casey Bodley [Fri, 2 Feb 2024 16:37:53 +0000 (16:37 +0000)]

Merge pull request #54837 from imtzw/urlencode

rgw: fix list obj v1 nextmarker

Reviewed-by: Matt Benjamin <mbenjamin@redhat.com>
Reviewed-by: Casey Bodley <cbodley@redhat.com>

Casey Bodley [Fri, 2 Feb 2024 15:42:00 +0000 (15:42 +0000)]

Merge pull request #54867 from BBoozmen/oozmen_lc_subsystem

rgw/common: add rgw lifecycle specific debug log subsystem

Reviewed-by: Matt Benjamin <mbenjamin@redhat.com>
Reviewed-by: Jiffin Tony Thottan <jthottan@redhat.com>

Casey Bodley [Fri, 2 Feb 2024 15:41:17 +0000 (15:41 +0000)]

Merge pull request #55162 from cfsnyder/wip-64014-cfsnyder-pacific

rgw: fix issue with concurrent versioned deletes leaving behind olh entries

Reviewed-by: J. Eric Ivancich <ivancich@redhat.com>

Casey Bodley [Fri, 2 Feb 2024 15:40:48 +0000 (15:40 +0000)]

Merge pull request #55229 from tobias-urdin/http-error-keystone-issue-admin-token-request

rgw: fix http error check in keystone issue_admin_token_request

Reviewed-by: Casey Bodley <cbodley@redhat.com>

baum [Fri, 2 Feb 2024 07:54:28 +0000 (09:54 +0200)]

Merge pull request #55330 from baum/centos9-install-build-deps

🧹build dependencies: centos9

Venky Shankar [Fri, 2 Feb 2024 05:05:00 +0000 (10:35 +0530)]

Merge PR #55233 into main

* refs/pull/55233/head:
qa: use centos 9.stream for cephfs stock kernel testing

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Rishabh Dave <ridave@redhat.com>

Casey Bodley [Thu, 1 Feb 2024 19:40:57 +0000 (19:40 +0000)]

Merge pull request #53901 from cbodley/wip-qa-container-distros-s

qa/distros: bump container host distros from centos 8->9

Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
Reviewed-by: Venky Shankar <vshankar@redhat.com>

Ilya Dryomov [Thu, 1 Feb 2024 19:29:58 +0000 (20:29 +0100)]

Merge pull request #55413 from petrutlucian94/flipped_tid

msg: update MOSDOp() to use ceph_tid_t instead of long

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>

Matan Breizman [Tue, 30 Jan 2024 17:53:16 +0000 (17:53 +0000)]

crimson: introduce crimson_seastar_cpu_cores

seastar::smp::count is deduced from the cpuset provided.

from seastar/src/core/reactor:
```
    if (smp_opts.smp) {
        smp::count = smp_opts.smp.get_value();
    } else {
        smp::count = cpu_set.size();
    }
```

Co-authored-by: Yingxin Cheng <yingxin.cheng@intel.com>
Signed-off-by: Matan Breizman <mbreizma@redhat.com>

Yuval Lifshitz [Thu, 1 Feb 2024 15:55:59 +0000 (15:55 +0000)]

rgw/rest: fix url decode of post params passed as attributes

this fixes a regression with the SNS policies
caused by: 4bdc5d18dd68b95c6ccd4c0e77a1bd04ad86dbb8

the changes to the test code are to accommodate different
boto3 versions

Signed-off-by: Yuval Lifshitz <ylifshit@redhat.com>

Lucian Petrut [Thu, 1 Feb 2024 14:40:03 +0000 (14:40 +0000)]

msg: update MOSDOp() to use ceph_tid_t instead of long

The MOSDOp constructor receives the transaction ID as a long
instead of ceph_tid_t.

The issue is that "long" is 32 bits wide on Windows instead of 64 bits,
so it flips after about 2 billion requests. At that point, the OSD
replies are dropped because of transaction ID mismatches.

We'll solve the issue by using the correct type for the transaction
id, specifically ceph_tid_t.
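
The "flip" can be reproduced with fixed-width integers. This is a sketch
in Python using ctypes, under the assumptions that "long" on Windows
behaves as a signed 32-bit integer and that ceph_tid_t is an unsigned
64-bit integer; the helper names are hypothetical.

```python
import ctypes

INT32_MAX = 2**31 - 1  # about 2 billion


def next_tid_32(tid: int) -> int:
    """Increment a transaction ID stored in a signed 32-bit integer."""
    # ctypes does no overflow checking, so the value wraps around.
    return ctypes.c_int32(tid + 1).value


def next_tid_64(tid: int) -> int:
    """Increment a transaction ID stored in an unsigned 64-bit integer."""
    return ctypes.c_uint64(tid + 1).value


wrapped = next_tid_32(INT32_MAX)  # becomes negative after the wrap
widened = next_tid_64(INT32_MAX)  # keeps counting upwards
```

Once the sender's ID has wrapped, it no longer matches the 64-bit ID the
peer expects, which is the mismatch the commit message describes.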

Fixes: https://tracker.ceph.com/issues/64291
Signed-off-by: Lucian Petrut <lpetrut@cloudbasesolutions.com>

Ivo Almeida [Thu, 1 Feb 2024 13:48:17 +0000 (10:48 -0300)]

Merge pull request #55373 from ivoalmeida/snapshot-schedule-form-directory-typeahead

Snapshot schedule form directory typeahead

Reviewed-by: Nizamudeen A <nia@redhat.com>

Redouane Kachach [Thu, 1 Feb 2024 11:35:39 +0000 (12:35 +0100)]

Merge pull request #55393 from rkachach/fix_issue_64266

mgr/rook: adding deployment to ceph image built for rook e2e testing

Matan Breizman [Tue, 23 Jan 2024 14:08:48 +0000 (14:08 +0000)]

vstart.sh: Crimson - set crimson_alien_thread_cpu_cores

Signed-off-by: Matan Breizman <mbreizma@redhat.com>

Matan Breizman [Tue, 23 Jan 2024 14:06:01 +0000 (14:06 +0000)]

crimson/os/alienstore: cleanup N_CORES_FOR_SEASTAR constant

No CPU pinning if crimson_alien_thread_cpu_cores is not set.

Signed-off-by: Matan Breizman <mbreizma@redhat.com>

Redouane Kachach [Wed, 31 Jan 2024 09:48:35 +0000 (10:48 +0100)]

mgr/rook: adding deployment to ceph image built for rook e2e testing

Fixes: https://tracker.ceph.com/issues/64266
Signed-off-by: Redouane Kachach <rkachach@redhat.com>

Casey Bodley [Wed, 31 Jan 2024 22:15:33 +0000 (22:15 +0000)]

Merge pull request #47723 from mdw-at-linuxbox/wip-master-tempurl

rgw: swift: tempurl fixes for ceph

Reviewed-by: Casey Bodley <cbodley@redhat.com>

Anthony D'Atri [Wed, 31 Jan 2024 20:08:26 +0000 (15:08 -0500)]

Merge pull request #55403 from cbodley/wip-doc-rgw-topic-attrs

doc/rgw: fix Attributes index in CreateTopic example

Adam King [Wed, 31 Jan 2024 18:28:50 +0000 (13:28 -0500)]

Merge pull request #54742 from guits/node-proxy

orch: implement hardware monitoring

Reviewed-by: Juan Miguel Olmo Martínez <jolmomar@redhat.com>

Ilya Dryomov [Wed, 31 Jan 2024 17:32:54 +0000 (18:32 +0100)]

Merge pull request #55397 from jagombar/fixInvalidSnapId2

librbd: return ENOENT from Snapshot::get_timestamp for nonexistent snap_id

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>

Casey Bodley [Wed, 31 Jan 2024 17:31:02 +0000 (12:31 -0500)]

doc/rgw: fix Attributes index in CreateTopic example

Signed-off-by: Casey Bodley <cbodley@redhat.com>

Ivo Almeida [Mon, 29 Jan 2024 10:58:43 +0000 (10:58 +0000)]

mgr/dashboard: added dir search to snap schdl form

Fixes: https://tracker.ceph.com/issues/64246
Signed-off-by: Ivo Almeida <ialmeida@redhat.com>

Nizamudeen A [Wed, 31 Jan 2024 16:30:59 +0000 (22:00 +0530)]

Merge pull request #54781 from rhcs-dashboard/nvmeof-integration

mgr/dashboard: implement APIs for nvmeof management

Reviewed-by: Avan Thakkar <athakkar@redhat.com>
Reviewed-by: Ernesto Puerta <epuertat@redhat.com>

Nizamudeen A [Wed, 31 Jan 2024 15:57:22 +0000 (21:27 +0530)]

Merge pull request #54900 from ivoalmeida/snapshot-schedule-create

added snap schedule form

Reviewed-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>
Reviewed-by: Ankush Behl <cloudbehl@gmail.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>

Ronen Friedman [Wed, 31 Jan 2024 15:56:43 +0000 (17:56 +0200)]

Merge pull request #55340 from ronen-fr/wip-rf-reserver2

osd/scrub: use an AsyncReserver to handle scrub reservations on the replica side

Reviewed-by: Samuel Just <sjust@redhat.com>

Josh Salomon [Wed, 31 Jan 2024 15:26:48 +0000 (17:26 +0200)]

Merge pull request #55198 from JoshSalomon/balancer-perf

osd: Read balancer for devices with different sizes (performance)

Guillaume Abrioux [Wed, 31 Jan 2024 14:23:44 +0000 (15:23 +0100)]

doc: add node-proxy documentation

This commit adds some documentation about the
'hardware inventory / monitoring' feature (node-proxy agent).

Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>

Anthony D'Atri [Wed, 31 Jan 2024 13:49:55 +0000 (08:49 -0500)]

Merge pull request #55396 from zdover23/wip-doc-2024-01-31-arch-monitors

doc/architecture: improve some paragraphs

Ali Maredia [Wed, 31 Jan 2024 13:36:42 +0000 (08:36 -0500)]

Merge pull request #55383 from ceph/cpp-redis-cxx-standard

cpp_redis: update to a version with the CMAKE_CXX_STANDARD fixed

Reviewed-by: Ali Maredia <amaredia@redhat.com>

jagombar [Tue, 30 Jan 2024 18:25:53 +0000 (18:25 +0000)]

librbd: return ENOENT from Snapshot::get_timestamp for nonexistent snap_id

Fixes: https://tracker.ceph.com/issues/47287
Signed-off-by: John Agombar <agombar@uk.ibm.com>

Zac Dover [Tue, 30 Jan 2024 09:51:53 +0000 (19:51 +1000)]

doc/architecture: improve some paragraphs

Improve paragraphs under the heading "The Ceph Storage Cluster". Remove
a sentence that was pleonastic in its context in the paragraph.

Signed-off-by: Zac Dover <zac.dover@proton.me>

Ivo Almeida [Wed, 13 Dec 2023 01:08:52 +0000 (01:08 +0000)]

mgr/dashboard: added snap schedule form

Fixes: https://tracker.ceph.com/issues/63827
Signed-off-by: Ivo Almeida <ialmeida@redhat.com>

Matan Breizman [Wed, 31 Jan 2024 09:14:02 +0000 (11:14 +0200)]

Merge pull request #55305 from Matan-B/wip-matanb-crimson-do_delete_work

crimson/osd/pg: do_delete_work use claim_and_reset

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Xuehan Xu <xuxuehan@qianxin.com>

Leonid Usov [Tue, 30 Jan 2024 22:55:45 +0000 (00:55 +0200)]

cpp_redis: update to a version with the CMAKE_CXX_STANDARD fixed

This updates the cpp_redis submodule to a version
which contains https://github.com/ceph/cpp_redis/pull/5

Signed-off-by: Leonid Usov <leonid.usov@ibm.com>

Ronen Friedman [Sun, 28 Jan 2024 16:48:18 +0000 (10:48 -0600)]

test: fix test_scrub_sched following recent scrub changes

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>

Ronen Friedman [Sun, 28 Jan 2024 15:35:07 +0000 (09:35 -0600)]

osd/scrub: restore 'dump_scrub_reservations' functionality

As ScrubResources is no longer involved in remote reservations, some
of the data listed by 'dump_scrub_reservations' is now collected by
OsdScrub itself (prior to this change, OsdScrub just forwarded the
request to ScrubResources).

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>

Ronen Friedman [Sun, 28 Jan 2024 11:45:50 +0000 (05:45 -0600)]

osd/scrub: control reservation queueing using a config option

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>

Ronen Friedman [Thu, 25 Jan 2024 19:05:01 +0000 (13:05 -0600)]

osd/scrub: directly manage remote reservations in the FSM

The FSM now interacts with the scrub_reserver directly.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>

Nizamudeen A [Thu, 25 Jan 2024 07:43:41 +0000 (13:13 +0530)]

vstart: add nvmeof_gw to the vstart script

so that it can later be used by the dashboard to configure nvmeof
through the UI and create an rbd pool in the UI

Fixes: https://tracker.ceph.com/issues/64201
Signed-off-by: Nizamudeen A <nia@redhat.com>

Laura Flores [Wed, 31 Jan 2024 03:43:35 +0000 (21:43 -0600)]

Merge pull request #55379 from ljflores/wip-revert-labler-changes

Laura Flores [Tue, 30 Jan 2024 22:06:03 +0000 (22:06 +0000)]

tools: add 'osd-size-aware' mode to osdmaptool

Sample command:
`osdmaptool om --read out.txt --read-pool cephfs.a.data --osd-size-aware`

Signed-off-by: Laura Flores <lflores@ibm.com>

Laura Flores [Tue, 30 Jan 2024 20:38:40 +0000 (20:38 +0000)]

mgr/balancer: add new keys to balancer mgr module

Signed-off-by: Laura Flores <lflores@ibm.com>

zdover23 [Wed, 31 Jan 2024 00:48:35 +0000 (10:48 +1000)]

Merge pull request #55372 from zdover23/wip-doc-2024-01-30-architecture-typo

doc/architecture.rst - fix typo

Reviewed-by: Cole Mitchell <cole.mitchell.ceph@gmail.com>

Josh Salomon [Wed, 24 Jan 2024 12:46:07 +0000 (14:46 +0200)]

osd: unit tests for read balance osd size aware score

Signed-off-by: Josh Salomon <41079547+JoshSalomon@users.noreply.github.com>

Josh Salomon [Wed, 24 Jan 2024 12:40:53 +0000 (14:40 +0200)]

osd: Add score for read balance osd size aware policy

This score works for pools in which the read_ratio
value is set.
Current limitations:
- This mechanism ignores osd read affinity
  - There is a plan to add support for read affinity 0
    in the next version.
- This mechanism works only when all PGs are full
- If read_ratio is not set, the existing mechanism (named
  fair score) is used.

Signed-off-by: Josh Salomon <41079547+JoshSalomon@users.noreply.github.com>

Josh Salomon [Tue, 16 Jan 2024 18:45:32 +0000 (20:45 +0200)]

osd: Unit tests for read balancer on OSDs with different sizes

Test cases for the read balancer which takes osd sizes into account.
Some balancing code refactoring and reorg for code that is used in
multiple tests

Signed-off-by: Josh Salomon <41079547+JoshSalomon@users.noreply.github.com>

Josh Salomon [Tue, 16 Jan 2024 18:33:47 +0000 (20:33 +0200)]

osd: Read balancer for OSDs with different sizes

This commit adds a calculation of the desired primary distribution which
takes the osd size into account. This way smaller OSDs can take more
read operations (by adding more primaries), the larger OSDs take fewer
primaries, and the load of the cluster can increase. (This feature
partly offsets the weakest-link-in-the-chain effect under some
conditions.) In order to calculate the loads correctly there is a need
to know the read/write ratio for the pool, and this commit assumes the
read_ratio parameter is available for the pool.

Signed-off-by: Josh Salomon <41079547+JoshSalomon@users.noreply.github.com>

Josh Salomon [Tue, 26 Dec 2023 08:41:18 +0000 (10:41 +0200)]

osd: Add 'read_ratio' pool parameter

This parameter is used for better read balancing with non identical
devices.
- This parameter is controlled using the commands 'ceph osd pool set/get'
- This parameter is applicable only for replicated pools
- Valid values are integers in the range [0..100] and represent the
  percentage of read IOs out of all IOs in the pool
  - Value of 0 unsets this parameter and the value will be the default
    value (this is the generic behavior of the command 'ceph osd pool
    set')
  - The default value can be set by the config parameter
    `osd_pool_default_read_ratio`
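
The rules in the list above can be sketched as a small resolver. This is
an illustration only, not the OSD code: the function name is
hypothetical, and it assumes that a value of 0 simply falls back to
whatever default is configured (or to nothing if no default is set).

```python
from typing import Optional


def resolve_read_ratio(value: int, default: Optional[int] = None) -> Optional[int]:
    """Return the effective read ratio (percentage of read IOs) for a pool."""
    if not 0 <= value <= 100:
        raise ValueError("read_ratio must be an integer in the range [0..100]")
    if value == 0:
        # 0 unsets the pool-level value, so the configured default applies.
        return default
    return value


explicit = resolve_read_ratio(70, default=50)  # the pool-level value wins
unset = resolve_read_ratio(0, default=50)      # falls back to the default
```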

Signed-off-by: Josh Salomon <41079547+JoshSalomon@users.noreply.github.com>

Laura Flores [Tue, 30 Jan 2024 17:32:20 +0000 (11:32 -0600)]

.github/workflows: revert version bump to labeler action

Signed-off-by: Laura Flores <lflores@ibm.com>

Laura Flores [Tue, 30 Jan 2024 17:13:30 +0000 (11:13 -0600)]

Merge pull request #55364 from ljflores/wip-pr-triage

.github/workflows: disable sync-labels in pr-triage

Guillaume Abrioux [Tue, 30 Jan 2024 16:38:31 +0000 (16:38 +0000)]

agent/node-proxy: fix wrong host name used in data endpoint

data['cephx']['name'] will return something like:

node-proxy.hostname123

the prefix "node-proxy." has to be removed, otherwise it will not
match the host name that is actually expected.
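
A minimal sketch of the prefix handling, assuming the name always has the
form shown in this commit message; the function name is hypothetical and
this is not the agent's actual code.

```python
def host_from_cephx_name(name: str, prefix: str = "node-proxy.") -> str:
    """Strip the daemon-type prefix from a cephx name, if it is present."""
    if name.startswith(prefix):
        return name[len(prefix):]
    return name


hostname = host_from_cephx_name("node-proxy.hostname123")
```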

Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>

Guillaume Abrioux [Thu, 25 Jan 2024 14:34:26 +0000 (14:34 +0000)]

mgr/cephadm: update node-proxy unit tests

The recent migration to a separate daemon implied
some changes which have broken these tests.
This commit fixes them.

Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>

Guillaume Abrioux [Wed, 24 Jan 2024 15:08:14 +0000 (15:08 +0000)]

mgr/cephadm: add a new config option 'oob_default_addr'

There is a default value (169.254.1.1), which is the default
address for the 'OS to iDRAC pass-through' interface.
Given that node-proxy will reach the RedFish API through this interface,
users no longer have to pass that address when providing the host spec
at bootstrap time.

Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>

Casey Bodley [Thu, 25 Jan 2024 17:41:39 +0000 (12:41 -0500)]

qa/distros: add supported-container-hosts for cephadm and upgrade suites

Signed-off-by: Casey Bodley <cbodley@redhat.com>

Adam Kupczyk [Tue, 30 Jan 2024 14:43:49 +0000 (15:43 +0100)]

Merge pull request #53610 from Patty8122/documentation-update-fsck-flagdeep

doc/man/8: Add valid options in ceph-bluestore-tool.rst

Venky Shankar [Tue, 30 Jan 2024 14:10:35 +0000 (19:40 +0530)]

Merge PR #54515 into main

* refs/pull/54515/head:
qa: get_global_id() returns a string (global_id)
qa: correct usage of DEBUGFS_META_DIR in dedent

Reviewed-by: Rishabh Dave <ridave@redhat.com>
Reviewed-by: Dhairya Parmar <dparmar@redhat.com>

Redouane Kachach [Tue, 30 Jan 2024 13:30:56 +0000 (14:30 +0100)]

Merge pull request #55313 from rkachach/fix_issue_64183

mgr/rook: adding metrics monitoring e2e testing

Zac Dover [Tue, 30 Jan 2024 12:24:11 +0000 (22:24 +1000)]

doc/architecture.rst - fix typo

s/requies/requires

Signed-off-by: Zac Dover <zac.dover@proton.me>

Ivo Almeida [Tue, 30 Jan 2024 11:30:51 +0000 (08:30 -0300)]

Merge pull request #55284 from rhcs-dashboard/cephfs-clone-create

mgr/dashboard: create cephfs snapshot clone

Nizamudeen A [Fri, 12 Jan 2024 08:41:18 +0000 (14:11 +0530)]

mgr/dashboard: adapt to new nvmeof APIs

Fixes: https://tracker.ceph.com/issues/64201
Signed-off-by: Nizamudeen A <nia@redhat.com>

avanthakkar [Mon, 11 Dec 2023 13:20:54 +0000 (18:50 +0530)]

mgr/dashboard: introduce APIs for NvmeOf management

Fixes: https://tracker.ceph.com/issues/64201
Signed-off-by: avanthakkar <avanjohn@gmail.com>

Nizamudeen A [Tue, 5 Dec 2023 10:16:12 +0000 (15:46 +0530)]

mgr/dashboard: gRPC client for nvmeof communication

Created a gRPC client by utilising the protobuf file available in nvmeof
repo

Copied the file to this repo and generated its output.

Fixes: https://tracker.ceph.com/issues/64201
Signed-off-by: Nizamudeen A <nia@redhat.com>

Nizamudeen A [Tue, 5 Dec 2023 10:11:01 +0000 (15:41 +0530)]

mgr/cephadm: configure the dashboard gateways

cephadm configures the nvmeof gateways and add the gateways to a config
store which dashboard will later on fetch to make the grpc calls.

Fixes: https://tracker.ceph.com/issues/64201
Signed-off-by: Nizamudeen A <nia@redhat.com>

Rongqi Sun [Tue, 30 Jan 2024 09:31:52 +0000 (17:31 +0800)]

Merge pull request #55351 from rosinL/wip-fix-64122

script/run-make: install lvm2 for make check cephadm test

Nizamudeen A [Tue, 30 Jan 2024 09:09:02 +0000 (14:39 +0530)]

Merge pull request #55228 from rhcs-dashboard/fix-64081-main

mgr/dashboard: Export rgw multisite status via the API

Reviewed-by: Avan Thakkar <athakkar@redhat.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>

Ronen Friedman [Sun, 28 Jan 2024 11:02:31 +0000 (05:02 -0600)]

osd/scrub: remove remote reservation code from ScrubResources

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>

Ronen Friedman [Tue, 30 Jan 2024 08:55:13 +0000 (02:55 -0600)]

common: allow AsyncReserver requests with no callbacks

No callbacks are needed for request_reservation_or_fail(),
the synchronous request API.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>

Ronen Friedman [Sun, 28 Jan 2024 07:25:38 +0000 (01:25 -0600)]

osd/scrub: add a general data-carrying event type to the FSM

Based on Sam's Crimson FSM's 'value-event'.

Co-authored-by: Sam Just <sjust@redhat.com>
Signed-off-by: Ronen Friedman <rfriedma@redhat.com>

Ronen Friedman [Sun, 28 Jan 2024 07:42:09 +0000 (01:42 -0600)]

osd/scrub: define the scrub_reserver to track remote reservations

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>

Venky Shankar [Tue, 30 Jan 2024 07:40:19 +0000 (13:10 +0530)]

Merge PR #52652 into main

* refs/pull/52652/head:
PendingReleaseNotes: add note about new mdlog trimming configurations
mds: drive mdlog trimming via a separate thread
mds: allow runtime modification of mdlog trimming configuration
mds: remove a bunch of heuristics from MDLog::trim()
mds: add mdlog trimming threshold and decay counter

Reviewed-by: Leonid Usov <leonid.usov@ibm.com>

Venky Shankar [Tue, 30 Jan 2024 07:28:48 +0000 (12:58 +0530)]

Merge PR #55184 into main

* refs/pull/55184/head:
qa: remove redundant test test_volumes.TestRenameCmd.test_rename_when_fs_is_online

Reviewed-by: Christopher Hoffman <choffman@redhat.com>
Reviewed-by: Rishabh Dave <ridave@redhat.com>

Venky Shankar [Tue, 30 Jan 2024 06:56:32 +0000 (12:26 +0530)]

Merge PR #55346 into main

* refs/pull/55346/head:
qa: remove error string checks and check w/ return value

Reviewed-by: Dhairya Parmar <dparmar@redhat.com>
Reviewed-by: Rishabh Dave <ridave@redhat.com>

Nizamudeen A [Tue, 30 Jan 2024 05:48:39 +0000 (11:18 +0530)]

Merge pull request #55349 from afreen23/fix-64112

mgr/dashboard: Create subvol of same name in different group

Reviewed-by: Ankush Behl <cloudbehl@gmail.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>

Venky Shankar [Tue, 26 Sep 2023 12:22:03 +0000 (08:22 -0400)]

PendingReleaseNotes: add note about new mdlog trimming configurations

Signed-off-by: Venky Shankar <vshankar@redhat.com>

Venky Shankar [Tue, 8 Aug 2023 05:48:45 +0000 (01:48 -0400)]

mds: drive mdlog trimming via a separate thread

Signed-off-by: Venky Shankar <vshankar@redhat.com>

Venky Shankar [Mon, 24 Jul 2023 04:34:47 +0000 (00:34 -0400)]

mds: allow runtime modification of mdlog trimming configuration

Signed-off-by: Venky Shankar <vshankar@redhat.com>

Venky Shankar [Mon, 31 Jul 2023 05:59:24 +0000 (01:59 -0400)]

mds: remove a bunch of heuristics from MDLog::trim()

These were probably introduced to work around some sort of
resource overusage by the MDS during trimming, but now it
looks like they are not really needed, especially if we
introduce a dedicated thread for log trimming.

Signed-off-by: Venky Shankar <vshankar@redhat.com>

Venky Shankar [Mon, 24 Jul 2023 04:33:47 +0000 (00:33 -0400)]

mds: add mdlog trimming threshold and decay counter

Fixes: http://tracker.ceph.com/issues/61908
Signed-off-by: Venky Shankar <vshankar@redhat.com>

Venky Shankar [Thu, 25 Jan 2024 09:32:33 +0000 (15:02 +0530)]

qa: remove error string checks and check w/ return value

I ran into this failure once #54972 was merged. The test validates
the error string returned by the failed mount. There aren't any
return value checks, which are a _more_ important check. Generic error
string checks will fail once an error string is changed (typo, etc.).
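
The difference between the two kinds of check can be sketched as follows.
The Result type, fake_mount(), and the error messages are hypothetical
stand-ins, not the qa suite's helpers; the point is only that a reworded
message breaks the string check while the return value check still holds.

```python
from typing import NamedTuple


class Result(NamedTuple):
    returncode: int
    stderr: str


def fake_mount(message: str) -> Result:
    """Hypothetical stand-in for a mount attempt that always fails."""
    return Result(returncode=1, stderr=message)


def failed_by_string(result: Result) -> bool:
    # Brittle: depends on the exact wording of the error message.
    return "mount error: no mds server" in result.stderr


def failed_by_returncode(result: Result) -> bool:
    # Robust: depends only on the exit status.
    return result.returncode != 0


original = fake_mount("mount error: no mds server is up")
reworded = fake_mount("mount failed: no MDS available")

string_check = (failed_by_string(original), failed_by_string(reworded))
returncode_check = (failed_by_returncode(original), failed_by_returncode(reworded))
```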

Signed-off-by: Venky Shankar <vshankar@redhat.com>

Venky Shankar [Tue, 30 Jan 2024 04:33:57 +0000 (10:03 +0530)]

Merge PR #54808 into main

* refs/pull/54808/head:
client: fix copying bufferlist to iovec structures in Client::_read
src/test: test sync call providing nullptr as ctx to async api

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Milind Changire <mchangir@redhat.com>
Reviewed-by: Frank S. Filz <ffilzlnx@mindspring.com>
