git.apps.os.sepia.ceph.com Git - ceph.git/log

]> git.apps.os.sepia.ceph.com Git - ceph.git/log

projects / ceph.git / log

summary | shortlog | log | commit | commitdiff | tree
first ⋅ prev ⋅ next

commit | commitdiff | tree

Lucian Petrut [Wed, 12 Apr 2023 09:39:52 +0000 (09:39 +0000)]

common/tracer: fix decoding when jaeger tracing is disabled

We aren't currently using jaeger tracing on Windows. The issue is
that Windows hosts (or any other host that doesn't use jaeger)
are experiencing message decoding failures after a recent change [1].

This change updates the tracer encoding so that messages from
non-jaeger hosts may be decoded by services that use jaeger.

[1] https://github.com/ceph/ceph/pull/47457

Signed-off-by: Lucian Petrut <lpetrut@cloudbasesolutions.com>
This commit rebrings 3701ffa6733b001d4278a0b68395c5efe2382f25 which
got reverted due to an implicit dependency with other revert. Please
see https://github.com/ceph/ceph/pull/52114#issuecomment-1950288188.

Conflicts:
src/common/tracer.h
formatting conflict with 7179ac0037320480f089e1a9e291cdc37282d250

commit | commitdiff | tree

Omri Zeneva [Wed, 24 Aug 2022 13:57:11 +0000 (09:57 -0400)]

tracer/osd/librados/build/rgw: rgw and osd end2end tracing using opentelemetry

* build: add opentelemetry to cmake system
crimson targets that uses Message.cc/h are built before opentelemetry (o-tel), so we need to build o-tel eralier so we also add the library to the include path earlier
this shoud work for WITH_JAEGER flag both the ON/OFF cases, and for librados where the compilation flag is ignored

* msg/tracer: add o-tel trace to Messages with decode/encode function in tracer.h
some files that uses Message.cc/h just need the encode/decode functions and not all others functions.
some crimson targets does not link with ceph_context (common) which is required for tracer.cc file. so we just need to include that functions

* librados: Add opentelemtry trace param for aio_operate and operate methods
in order to propagate the trace info I added the otel-trace as an extra param.
in some places, there already was a blkin trace info, and since it is not used in other places we can safely change it to o-tel trace info.
this will be done in another commit, so the cleanup of blkin trace will be in a dedicated commit

* osd: use the o-tel trace of the msg as a parent span of the osd trace
if there is a valid span in the msg, we will add this op to the request
trace, otherwise it will start a new trace for the OSD op

* rgw: pass put obj trace info to librados
in order to make it possible, I saved the trace info inside the sal::Object, so we can use it later when writing the object to rados
it could be used also later for read ops.
note the trace field of req_state is initalized only in rgw_process, so it's also required in librgw request flow

* prevent breaking channges to kSize. make sure that changes between components built with
different versions of OTEL do not break message compatibility

Signed-off-by: Omri Zeneva <ozeneva@redhat.com>

commit | commitdiff | tree

Casey Bodley [Mon, 5 Feb 2024 21:21:11 +0000 (21:21 +0000)]

Merge pull request #54311 from gabriel-samfira/do-not-escape-slash

RGW: fix cloud-sync not being able to sync folders

Reviewed-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

Casey Bodley [Mon, 5 Feb 2024 21:17:31 +0000 (21:17 +0000)]

Merge pull request #55252 from jzhu116-bloomberg/wip-21800

rgw/multisite: avoid writing multipart parts to the bucket index log

Reviewed-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

Casey Bodley [Mon, 5 Feb 2024 21:16:56 +0000 (21:16 +0000)]

Merge pull request #55293 from jzhu116-bloomberg/wip-63178

rgw/lc: pass in flag as an argument when calling function complete(..)

Reviewed-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

Casey Bodley [Mon, 5 Feb 2024 21:16:27 +0000 (21:16 +0000)]

Merge pull request #54661 from lightmelodies/rgw-gc

rgw: optimize gc chain size calculation

Reviewed-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

Ilya Dryomov [Mon, 5 Feb 2024 19:12:51 +0000 (20:12 +0100)]

Merge pull request #55452 from nbalacha/wip-typo-fix

rbd-mirror: fix a typo in NotifyAckPayload::dump()

Reviewed-by: Ramana Raja <rraja@redhat.com>
Reviewed-by: Ilya Dryomov <idryomov@gmail.com>

commit | commitdiff | tree

Casey Bodley [Mon, 5 Feb 2024 17:59:46 +0000 (17:59 +0000)]

Merge pull request #55406 from cbodley/wip-64255

rgw: SiteConfig::load() falls back to local zonegroup

Reviewed-by: Shilpa Jagannath <smanjara@redhat.com>
Reviewed-by: Adam Emerson <aemerson@redhat.com>

commit | commitdiff | tree

Adam King [Mon, 5 Feb 2024 13:28:42 +0000 (08:28 -0500)]

Merge pull request #55448 from guits/subclass-thread

node-proxy: refactor entrypoint

Reviewed-by: Adam King <adking@redhat.com>

commit | commitdiff | tree

Ronen Friedman [Mon, 5 Feb 2024 12:21:32 +0000 (14:21 +0200)]

Merge pull request #55280 from ronen-fr/wip-rf-extra-lg

osd: remove superfluous '>' in spg_t formatter

Reviewed-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Yuval Lifshitz [Mon, 5 Feb 2024 12:16:55 +0000 (14:16 +0200)]

Merge pull request #55414 from yuvalif/wip_yuval_url_decode_sns

rgw/rest: fix url decode of post params passed as attributes

reviewed-by: cbodley

commit | commitdiff | tree

N Balachandran [Mon, 5 Feb 2024 09:46:17 +0000 (15:16 +0530)]

rbd-mirror: fixed a typo in instance_watcher

Fixes a typo in NotifyAckPayload::dump().

Signed-off-by: N Balachandran <nibalach@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Sun, 4 Feb 2024 19:11:41 +0000 (19:11 +0000)]

node-proxy: refactor entrypoint

This commit introduces a major refactor of the main
entrypoint.

- subclass threading.Thread:
  - Introduce a new class `BaseThread()` that is a
    `threading.Thread()` abstraction class in order
    to monitor the different threads.
  - `BaseSystem()` inherits from `BaseThread()`.
  - Handle `SIGTERM` signal in order to gracefully shutdown
    node-proxy (make threads exit gracefully, log out from RedFish API, etc.)

Additionally, this:
  - drops the class `Logger()` from util.py which
was not adding value. It is now replaced with a simple `get_logger()`
function.
  - changes the node-proxy API port from 8080 to 9456
    (8080 being widely used for frontend apps...)
  - changes the container entrypoint in order to use the
    `ceph-node-proxy` binary from the packaging

Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>

commit | commitdiff | tree

Kefu Chai [Mon, 5 Feb 2024 02:20:07 +0000 (10:20 +0800)]

Merge pull request #55089 from cbodley/wip-boost-mirrors

make-dist: update boost mirrors

Reviewed-by: Kefu Chai <tchaikov@gmail.com>

commit | commitdiff | tree

zdover23 [Sun, 4 Feb 2024 14:44:35 +0000 (00:44 +1000)]

Merge pull request #55434 from anthonyeleven/rgwyamlin

src/common/options: Correct typo in rgw.yaml.in

Reviewed-by: Zac Dover <zac.dover@proton.me>

commit | commitdiff | tree

Matan Breizman [Sun, 4 Feb 2024 09:45:43 +0000 (11:45 +0200)]

Merge pull request #55276 from Matan-B/wip-matanb-crimson-alienstore-cores

crimson: rework CPU pinning

Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

zdover23 [Sat, 3 Feb 2024 22:25:50 +0000 (08:25 +1000)]

Merge pull request #55419 from zdover23/wip-doc-2024-02-02-rados-config-pool-pg-config-ref

doc/rados: update config for autoscaler

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>

commit | commitdiff | tree

Anthony D'Atri [Sat, 3 Feb 2024 07:18:25 +0000 (02:18 -0500)]

src/common/options: Correct typo in rgw.yaml.in

Signed-off-by: Anthony D'Atri <anthonyeleven@users.noreply.github.com>

commit | commitdiff | tree

Zac Dover [Fri, 2 Feb 2024 01:53:45 +0000 (11:53 +1000)]

doc/rados: update config for autoscaler

Update doc/rados/configuration/pool-pg-config-ref.rst to account for the
behavior of autoscaler.

Previously, this file was last meaningfully altered in 2013, prior to
the invention of autoscaler. A recent confusion was brought to my
attention on the Ceph Slack whereby a user attempted to alter the
default values of a Quincy cluster, as suggested in this documentation.
That alteration caused Ceph to throw the error "Error ERANGE: 'pgp_num'
must be greater than 0 and lower or equal than 'pg_num', which in this
case is one" and a related "rgw_init_ioctx ERROR" reading in part
"Numerical result out of range". The user removed the
"osd_pool_default_pgp_num" configuration line from ceph.conf and the
cluster worked as expected. I presume that this is because the removal
of this configuration line allowed autoscaler to work as intended.

Fixes: https://tracker.ceph.com/issues/64259
Co-authored-by: David Orman <ormandj@corenode.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>

commit | commitdiff | tree

Casey Bodley [Wed, 31 Jan 2024 19:29:43 +0000 (14:29 -0500)]

rgw: SiteConfig::load() falls back to local zonegroup

allow radosgw-admin commands like 'user create' to operate on a new zone
that hasn't been committed to the period yet. this follows similar logic
in RGWSI_Zone::do_start()

Fixes: https://tracker.ceph.com/issues/64255
Signed-off-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

Adam King [Fri, 2 Feb 2024 18:57:49 +0000 (13:57 -0500)]

Merge pull request #54312 from dvanders/dvanders_mcltf_true

qa/tasks/cephadm: enable mon_cluster_log_to_file

Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: Ilya Dryomov <idryomov@gmail.com>

commit | commitdiff | tree

Casey Bodley [Fri, 2 Feb 2024 16:37:53 +0000 (16:37 +0000)]

Merge pull request #54837 from imtzw/urlencode

rgw: fix list obj v1 nextmarker

Reviewed-by: Matt Benjamin <mbenjamin@redhat.com>
Reviewed-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

Casey Bodley [Fri, 2 Feb 2024 15:42:00 +0000 (15:42 +0000)]

Merge pull request #54867 from BBoozmen/oozmen_lc_subsystem

rgw/common: add rgw lifecycle specific debug log subsystem

Reviewed-by: Matt Benjamin <mbenjamin@redhat.com>
Reviewed-by: Jiffin Tony Thottan <jthottan@redhat.com>

commit | commitdiff | tree

Casey Bodley [Fri, 2 Feb 2024 15:41:17 +0000 (15:41 +0000)]

Merge pull request #55162 from cfsnyder/wip-64014-cfsnyder-pacific

rgw: fix issue with concurrent versioned deletes leaving behind olh entries

Reviewed-by: J. Eric Ivancich <ivancich@redhat.com>

commit | commitdiff | tree

Casey Bodley [Fri, 2 Feb 2024 15:40:48 +0000 (15:40 +0000)]

Merge pull request #55229 from tobias-urdin/http-error-keystone-issue-admin-token-request

rgw: fix http error check in keystone issue_admin_token_request

Reviewed-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

baum [Fri, 2 Feb 2024 07:54:28 +0000 (09:54 +0200)]

Merge pull request #55330 from baum/centos9-install-build-deps

🧹build dependencies: centos9

commit | commitdiff | tree

Venky Shankar [Fri, 2 Feb 2024 05:05:00 +0000 (10:35 +0530)]

Merge PR #55233 into main

* refs/pull/55233/head:
qa: use centos 9.stream for cephfs stock kernel testing

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Rishabh Dave <ridave@redhat.com>

commit | commitdiff | tree

Casey Bodley [Thu, 1 Feb 2024 19:40:57 +0000 (19:40 +0000)]

Merge pull request #53901 from cbodley/wip-qa-container-distros-s

qa/distros: bump container host distros from centos 8->9

Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
Reviewed-by: Venky Shankar <vshankar@redhat.com>

commit | commitdiff | tree

Ilya Dryomov [Thu, 1 Feb 2024 19:29:58 +0000 (20:29 +0100)]

Merge pull request #55413 from petrutlucian94/flipped_tid

msg: update MOSDOp() to use ceph_tid_t instead of long

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>

commit | commitdiff | tree

Matan Breizman [Tue, 30 Jan 2024 17:53:16 +0000 (17:53 +0000)]

crimson: introduce crimson_seastar_cpu_cores

seastar::smp::count is deduced from the cpuset provided.

from seastar/src/core/reactor:
```
   if (smp_opts.smp) {
        smp::count = smp_opts.smp.get_value();
    } else {
        smp::count = cpu_set.size();
    }
```

Co-authored-by: Yingxin Cheng <yingxin.cheng@intel.com>
Signed-off-by: Matan Breizman <mbreizma@redhat.com>

commit | commitdiff | tree

Yuval Lifshitz [Thu, 1 Feb 2024 15:55:59 +0000 (15:55 +0000)]

rgw/rest: fix url decode of post params passed as attributes

this is fixing a regression with the SNS policies
cause by: 4bdc5d18dd68b95c6ccd4c0e77a1bd04ad86dbb8

the changes to the test code is to accomodate different
boto3 versions

Signed-off-by: Yuval Lifshitz <ylifshit@redhat.com>

commit | commitdiff | tree

Lucian Petrut [Thu, 1 Feb 2024 14:40:03 +0000 (14:40 +0000)]

msg: update MOSDOp() to use ceph_tid_t instead of long

The MOSDOp constructor receives the the transaction ID as a long
instead of ceph_tid_t.

The issue is that "long" uses 32b on Windows instead of 64 bits,
so it flips after about 2 billion requests. At that point, the OSD
replies are dropped because of transaction ID mismatches.

We'll solve the issue by using the correct type for the transaction
id, specifically ceph_tid_t.

Fixes: https://tracker.ceph.com/issues/64291
Signed-off-by: Lucian Petrut <lpetrut@cloudbasesolutions.com>

commit | commitdiff | tree

Ivo Almeida [Thu, 1 Feb 2024 13:48:17 +0000 (10:48 -0300)]

Merge pull request #55373 from ivoalmeida/snapshot-schedule-form-directory-typeahead

Snapshot schedule form directory typeahead

Reviewed-by: Nizamudeen A <nia@redhat.com>

commit | commitdiff | tree

Redouane Kachach [Thu, 1 Feb 2024 11:35:39 +0000 (12:35 +0100)]

Merge pull request #55393 from rkachach/fix_issue_64266

mgr/rook: adding deployment to ceph image built for rook e2e testing

commit | commitdiff | tree

Matan Breizman [Tue, 23 Jan 2024 14:08:48 +0000 (14:08 +0000)]

vstarh.sh: Crimson - set crimson_alien_thread_cpu_cores

Signed-off-by: Matan Breizman <mbreizma@redhat.com>

commit | commitdiff | tree

Matan Breizman [Tue, 23 Jan 2024 14:06:01 +0000 (14:06 +0000)]

crimson/os/alienstore: cleanup N_CORES_FOR_SEASTAR constant

No CPU pinning if crimson_alien_thread_cpu_cores is not set.

Signed-off-by: Matan Breizman <mbreizma@redhat.com>

commit | commitdiff | tree

Redouane Kachach [Wed, 31 Jan 2024 09:48:35 +0000 (10:48 +0100)]

mgr/rook: adding deployment to ceph image built for rook e2e testing
Fixes: https://tracker.ceph.com/issues/64266
Signed-off-by: Redouane Kachach <rkachach@redhat.com>

commit | commitdiff | tree

Casey Bodley [Wed, 31 Jan 2024 22:15:33 +0000 (22:15 +0000)]

Merge pull request #47723 from mdw-at-linuxbox/wip-master-tempurl

rgw: swift: tempurl fixes for ceph

Reviewed-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

Anthony D'Atri [Wed, 31 Jan 2024 20:08:26 +0000 (15:08 -0500)]

Merge pull request #55403 from cbodley/wip-doc-rgw-topic-attrs

doc/rgw: fix Attributes index in CreateTopic example

commit | commitdiff | tree

Adam King [Wed, 31 Jan 2024 18:28:50 +0000 (13:28 -0500)]

Merge pull request #54742 from guits/node-proxy

orch: implement hardware monitoring

Reviewed-by: Juan Miguel Olmo Martínez <jolmomar@redhat.com>

commit | commitdiff | tree

Ilya Dryomov [Wed, 31 Jan 2024 17:32:54 +0000 (18:32 +0100)]

Merge pull request #55397 from jagombar/fixInvalidSnapId2

librbd: return ENOENT from Snapshot::get_timestamp for nonexistent snap_id

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>

commit | commitdiff | tree

Casey Bodley [Wed, 31 Jan 2024 17:31:02 +0000 (12:31 -0500)]

doc/rgw: fix Attributes index in CreateTopic example

Signed-off-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

Ivo Almeida [Mon, 29 Jan 2024 10:58:43 +0000 (10:58 +0000)]

mgr/dashboard: added dir search to snap schdl form

Fixes: https://tracker.ceph.com/issues/64246
Signed-off-by: Ivo Almeida <ialmeida@redhat.com>

commit | commitdiff | tree

Nizamudeen A [Wed, 31 Jan 2024 16:30:59 +0000 (22:00 +0530)]

Merge pull request #54781 from rhcs-dashboard/nvmeof-integration

mgr/dashboard: implement APIs for nvmeof management

Reviewed-by: Avan Thakkar <athakkar@redhat.com>
Reviewed-by: Ernesto Puerta <epuertat@redhat.com>

commit | commitdiff | tree

Nizamudeen A [Wed, 31 Jan 2024 15:57:22 +0000 (21:27 +0530)]

Merge pull request #54900 from ivoalmeida/snapshot-schedule-create

added snap schedule form

Reviewed-by: Pedro Gonzalez Gomez <pegonzal@redhat.com>
Reviewed-by: Ankush Behl <cloudbehl@gmail.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>

commit | commitdiff | tree

Ronen Friedman [Wed, 31 Jan 2024 15:56:43 +0000 (17:56 +0200)]

Merge pull request #55340 from ronen-fr/wip-rf-reserver2

osd/scrub: use an AsyncReserver to handle scrub reservations on the replica side

Reviewed-by: Samuel Just <sjust@redhat.com>-

commit | commitdiff | tree

Josh Salomon [Wed, 31 Jan 2024 15:26:48 +0000 (17:26 +0200)]

Merge pull request #55198 from JoshSalomon/balancer-perf

osd: Read balancer for devices with different sizes (performance)

commit | commitdiff | tree

Guillaume Abrioux [Wed, 31 Jan 2024 14:23:44 +0000 (15:23 +0100)]

doc: add node-proxy documentation

This commit adds some documentation about the
'hardware inventory / monitoring' feature (node-proxy agent).

Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>

commit | commitdiff | tree

Anthony D'Atri [Wed, 31 Jan 2024 13:49:55 +0000 (08:49 -0500)]

Merge pull request #55396 from zdover23/wip-doc-2024-01-31-arch-monitors

doc/architecture: improve some paragraphs

commit | commitdiff | tree

Ali Maredia [Wed, 31 Jan 2024 13:36:42 +0000 (08:36 -0500)]

Merge pull request #55383 from ceph/cpp-redis-cxx-standard

cpp_redis: update to a version with the CMAKE_CXX_STANDARD fixed

Reviewed-by: Ali Maredia <amaredia@redhat.com>

commit | commitdiff | tree

jagombar [Tue, 30 Jan 2024 18:25:53 +0000 (18:25 +0000)]

librbd: return ENOENT from Snapshot::get_timestamp for nonexistent snap_id

Fixes: https://tracker.ceph.com/issues/47287
Signed-off-by: John Agombar <agombar@uk.ibm.com>

commit | commitdiff | tree

Zac Dover [Tue, 30 Jan 2024 09:51:53 +0000 (19:51 +1000)]

doc/architecture: improve some paragraphs

Improve paragraphs under the heading "The Ceph Storage Cluster". Remove
a sentence that was pleonastic in its context in the paragraph.

Signed-off-by: Zac Dover <zac.dover@proton.me>

commit | commitdiff | tree

Ivo Almeida [Wed, 13 Dec 2023 01:08:52 +0000 (01:08 +0000)]

mgr/dashboard: added snap schedule form

Fixes: https://tracker.ceph.com/issues/63827
Signed-off-by: Ivo Almeida <ialmeida@redhat.com>

commit | commitdiff | tree

Matan Breizman [Wed, 31 Jan 2024 09:14:02 +0000 (11:14 +0200)]

Merge pull request #55305 from Matan-B/wip-matanb-crimson-do_delete_work

crimson/osd/pg: do_delete_work use claim_and_reset

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Xuehan Xu <xuxuehan@qianxin.com>

commit | commitdiff | tree

Leonid Usov [Tue, 30 Jan 2024 22:55:45 +0000 (00:55 +0200)]

cpp_redis: update to a version with the CMAKE_CXX_STANDARD fixed

This updates the cpp_redis submodule to a version
which contains https://github.com/ceph/cpp_redis/pull/5

Signed-off-by: Leonid Usov <leonid.usov@ibm.com>

commit | commitdiff | tree

Ronen Friedman [Sun, 28 Jan 2024 16:48:18 +0000 (10:48 -0600)]

test: fix test_scrub_sched following recent scrub changes

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>

commit | commitdiff | tree

Ronen Friedman [Sun, 28 Jan 2024 15:35:07 +0000 (09:35 -0600)]

osd/scrub: restore 'dump_scrub_reservations' functionality

As ScrubResources is no longer involved in remote reservations, some
of the data listed by 'dump_scrub_reservations' is now collected by
OsdScrub itself (prior to this change, OsdScrub just forwarded the
request to ScrubResources).

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>

commit | commitdiff | tree

Ronen Friedman [Sun, 28 Jan 2024 11:45:50 +0000 (05:45 -0600)]

osd/scrub: control reservation queueing using a config option

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>

commit | commitdiff | tree

Ronen Friedman [Thu, 25 Jan 2024 19:05:01 +0000 (13:05 -0600)]

osd/scrub: directly manage remote reservations in the FSM

The FSM now interacts with the scrub_reserver directly.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>

commit | commitdiff | tree

Nizamudeen A [Thu, 25 Jan 2024 07:43:41 +0000 (13:13 +0530)]

vstart: add nvmeof_gw to the vstart script

so that it can be later used by the dashboard to configure the nvmeof
through UI

and create rbd pool in UI

Fixes: https://tracker.ceph.com/issues/64201
Signed-off-by: Nizamudeen A <nia@redhat.com>

commit | commitdiff | tree

Laura Flores [Wed, 31 Jan 2024 03:43:35 +0000 (21:43 -0600)]

Merge pull request #55379 from ljflores/wip-revert-labler-changes

commit | commitdiff | tree

Laura Flores [Tue, 30 Jan 2024 22:06:03 +0000 (22:06 +0000)]

tools: add 'osd-size-aware' mode to osdmaptool

Sample command:
`osdmaptool om --read out.txt --read-pool cephfs.a.data --osd-size-aware`

Signed-off-by: Laura Flores <lflores@ibm.com>

commit | commitdiff | tree

Laura Flores [Tue, 30 Jan 2024 20:38:40 +0000 (20:38 +0000)]

mgr/balancer: add new keys to balancer mgr module

Signed-off-by: Laura Flores <lflores@ibm.com>

commit | commitdiff | tree

zdover23 [Wed, 31 Jan 2024 00:48:35 +0000 (10:48 +1000)]

Merge pull request #55372 from zdover23/wip-doc-2024-01-30-architecture-typo

doc/architecture.rst - fix typo

Reviewed-by: Cole Mitchell <cole.mitchell.ceph@gmail.com>

commit | commitdiff | tree

Josh Salomon [Wed, 24 Jan 2024 12:46:07 +0000 (14:46 +0200)]

osd: unit tests for read balance osd size aware score

Signed-off-by: Josh Salomon <41079547+JoshSalomon@users.noreply.github.com>

commit | commitdiff | tree

Josh Salomon [Wed, 24 Jan 2024 12:40:53 +0000 (14:40 +0200)]

osd: Add score for read balance osd size aware policy

This score works for pools in which the read_ratio
value is set.
Current limitations:
- This mechanism ignores osd read affinty
  - There is a plan adding support for read affinity 0
    in the next version.
- This mechanism works only when all PGs are full
- If read_ration is not set - the existing mechanism (named
  fair score) is used.

Signed-off-by: Josh Salomon <41079547+JoshSalomon@users.noreply.github.com>
Signed-off-by: Josh Salomon <41079547+JoshSalomon@users.noreply.github.com>

commit | commitdiff | tree

Josh Salomon [Tue, 16 Jan 2024 18:45:32 +0000 (20:45 +0200)]

osd: Unit tests for read balancer on OSDs with different sizes

Test cases for the read balancer which takes osd sizes into account.
Some balancing code refactoring and reorg for code that is used in
multiple tests

Signed-off-by: Josh Salomon <41079547+JoshSalomon@users.noreply.github.com>

commit | commitdiff | tree

Josh Salomon [Tue, 16 Jan 2024 18:33:47 +0000 (20:33 +0200)]

osd: Read balancer for OSDs with different sizes

This commit adds calculation for desired primary distribution which
takes into account the osd size. This way smaller OSDs can take more
read operations (by adding more primaries) and the larger OSDs take less
primaries and the load of the cluater can increase. (This feature offset
a bit the weakest link in the chain effect under some conditions). In
order to calculate the loads correctly there is a need to know the
read/write ratio for the pool, and this commit assumes the read_ratio
parameter is available for the pool.

Signed-off-by: Josh Salomon <41079547+JoshSalomon@users.noreply.github.com>

commit | commitdiff | tree

Josh Salomon [Tue, 26 Dec 2023 08:41:18 +0000 (10:41 +0200)]

osd: Add 'read_ratio' pool parameterr

This parameter is used for better read balancing with non identical
devices.
- This parameter is controlled using the commands 'ceph osd pool set/get'
- This parameter is applicable only for replicated pools
- Valid values are integers in the range [0..100] and represent the
  percentage of read IOs out of all IOs in the pool
  - Value of 0 unsets this parameter and the value will be the default
    value (this is the generic behavior of the command 'ceph osd pool
    set'
  - default value can be set by config parameter
    `osd_pool_default_read_ratio`

Signed-off-by: Josh Salomon <41079547+JoshSalomon@users.noreply.github.com>

commit | commitdiff | tree

Laura Flores [Tue, 30 Jan 2024 17:32:20 +0000 (11:32 -0600)]

.github/workflows: revert version bump to labler action

Signed-off-by: Laura Flores <lflores@ibm.com>

commit | commitdiff | tree

Laura Flores [Tue, 30 Jan 2024 17:13:30 +0000 (11:13 -0600)]

Merge pull request #55364 from ljflores/wip-pr-triage

.github/workflows: disable sync-labels in pr-triage

commit | commitdiff | tree

Guillaume Abrioux [Tue, 30 Jan 2024 16:38:31 +0000 (16:38 +0000)]

agent/node-proxy: fix wrong host name used in data endpoint

data['cephx']['name'] will return something like:

node-proxy.hostname123

the prefix "node-proxy." has the be removed otherwise there will be
a mismatch between what is actually expected.

Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>

commit | commitdiff | tree

Guillaume Abrioux [Thu, 25 Jan 2024 14:34:26 +0000 (14:34 +0000)]

mgr/cephadm: update node-proxy unit tests

The recent migration to a separate daemon implied
some changes which have broken these tests.
This commit fixes them.

Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>

commit | commitdiff | tree

Guillaume Abrioux [Wed, 24 Jan 2024 15:08:14 +0000 (15:08 +0000)]

mgr/cephadm: add a new config option 'oob_default_addr'

So there's a default value (169.254.1.1) which is the default
address for the 'OS to iDrac pass-through' interface.
Given that node-proxy will reach the RedFish API through this interface,
we can make users avoid to pass that addr when providing the host spec
at bootstrap time.

Signed-off-by: Guillaume Abrioux <gabrioux@ibm.com>

commit | commitdiff | tree

Casey Bodley [Thu, 25 Jan 2024 17:41:39 +0000 (12:41 -0500)]

qa/distros: add supported-container-hosts for cephadm and upgrade suites

Signed-off-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

Adam Kupczyk [Tue, 30 Jan 2024 14:43:49 +0000 (15:43 +0100)]

Merge pull request #53610 from Patty8122/documentation-update-fsck-flagdeep

doc/man/8: Add valid options in ceph-bluestore-tool.rst

commit | commitdiff | tree

Venky Shankar [Tue, 30 Jan 2024 14:10:35 +0000 (19:40 +0530)]

Merge PR #54515 into main

* refs/pull/54515/head:
qa: get_global_id() returns a string (global_id)
qa: correct usage of DEBUGFS_META_DIR in dedent

Reviewed-by: Rishabh Dave <ridave@redhat.com>
Reviewed-by: Dhairya Parmar <dparmar@redhat.com>

commit | commitdiff | tree

Redouane Kachach [Tue, 30 Jan 2024 13:30:56 +0000 (14:30 +0100)]

Merge pull request #55313 from rkachach/fix_issue_64183

mgr/rook: adding metrics monitoring e2e testing

commit | commitdiff | tree

Zac Dover [Tue, 30 Jan 2024 12:24:11 +0000 (22:24 +1000)]

doc/architecture.rst - fix typo

s/requies/requires

Signed-off-by: Zac Dover <zac.dover@proton.me>

commit | commitdiff | tree

Ivo Almeida [Tue, 30 Jan 2024 11:30:51 +0000 (08:30 -0300)]

Merge pull request #55284 from rhcs-dashboard/cephfs-clone-create

mgr/dashboard: create cephfs snapshot clone

commit | commitdiff | tree

Nizamudeen A [Fri, 12 Jan 2024 08:41:18 +0000 (14:11 +0530)]

mgr/dashboard: adapt to new nvmeof APIs

Fixes: https://tracker.ceph.com/issues/64201
Signed-off-by: Nizamudeen A <nia@redhat.com>

commit | commitdiff | tree

avanthakkar [Mon, 11 Dec 2023 13:20:54 +0000 (18:50 +0530)]

mgr/dashboard: introduce APIs for NvmeOf management

Fixes: https://tracker.ceph.com/issues/64201
Signed-off-by: avanthakkar <avanjohn@gmail.com>

commit | commitdiff | tree

Nizamudeen A [Tue, 5 Dec 2023 10:16:12 +0000 (15:46 +0530)]

mgr/dashboard: gRPC client for nvmeof communication

Created a gRPC client by utilising the protobuf file available in nvmeof
repo

Copied the file to this repo and generated its output.

Fixes: https://tracker.ceph.com/issues/64201
Signed-off-by: Nizamudeen A <nia@redhat.com>

commit | commitdiff | tree

Nizamudeen A [Tue, 5 Dec 2023 10:11:01 +0000 (15:41 +0530)]

mgr/cephadm: configure the dashboard gateways

cephadm configures the nvmeof gateways and add the gateways to a config
store which dashboard will later on fetch to make the grpc calls.

Fixes: https://tracker.ceph.com/issues/64201
Signed-off-by: Nizamudeen A <nia@redhat.com>

commit | commitdiff | tree

Rongqi Sun [Tue, 30 Jan 2024 09:31:52 +0000 (17:31 +0800)]

Merge pull request #55351 from rosinL/wip-fix-64122

script/run-make: install lvm2 for make check cephadm test

commit | commitdiff | tree

Nizamudeen A [Tue, 30 Jan 2024 09:09:02 +0000 (14:39 +0530)]

Merge pull request #55228 from rhcs-dashboard/fix-64081-main

mgr/dashboard: Export rgw multisite status via the API

Reviewed-by: Avan Thakkar <athakkar@redhat.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>

commit | commitdiff | tree

Ronen Friedman [Sun, 28 Jan 2024 11:02:31 +0000 (05:02 -0600)]

osd/scrub: remove remote reservation code from ScrubResources

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>

commit | commitdiff | tree

Ronen Friedman [Tue, 30 Jan 2024 08:55:13 +0000 (02:55 -0600)]

common: allow AsyncReserver requests with no callbacks

As no callbacks are needed for request_reservation_or_fail(),
the synchronous request API.

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>

commit | commitdiff | tree

Ronen Friedman [Sun, 28 Jan 2024 07:25:38 +0000 (01:25 -0600)]

osd/scrub: add a general data-carrying event type to the FSM

Based on Sam's Crimson FSM's 'value-event'.

Co-authored-by: Sam Just <sjust@redhat.com>
Signed-off-by: Ronen Friedman <rfriedma@redhat.com>

commit | commitdiff | tree

Ronen Friedman [Sun, 28 Jan 2024 07:42:09 +0000 (01:42 -0600)]

osd/scrub: define the scrub_reserver to track remote reservations

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>

commit | commitdiff | tree

Venky Shankar [Tue, 30 Jan 2024 07:40:19 +0000 (13:10 +0530)]

Merge PR #52652 into main

* refs/pull/52652/head:
PendingReleaseNotes: add note about new mdlog trimming configurations
mds: drive mdlog trimming via a separate thread
mds: allow runtime modification of mdlog trimming configuration
mds: remove a bunch of heuristics from MDLog::trim()
mds: add mdlog trimming threshold and decay counter

Reviewed-by: Leonid Usov <leonid.usov@ibm.com>

commit | commitdiff | tree

Venky Shankar [Tue, 30 Jan 2024 07:28:48 +0000 (12:58 +0530)]

Merge PR #55184 into main

* refs/pull/55184/head:
qa: remove redundant test test_volumes.TestRenameCmd.test_rename_when_fs_is_online

Reviewed-by: Christopher Hoffman <choffman@redhat.com>
Reviewed-by: Rishabh Dave <ridave@redhat.com>

commit | commitdiff | tree

Venky Shankar [Tue, 30 Jan 2024 06:56:32 +0000 (12:26 +0530)]

Merge PR #55346 into main

* refs/pull/55346/head:
qa: remove error string checks and check w/ return value

Reviewed-by: Dhairya Parmar <dparmar@redhat.com>
Reviewed-by: Rishabh Dave <ridave@redhat.com>

commit | commitdiff | tree

Nizamudeen A [Tue, 30 Jan 2024 05:48:39 +0000 (11:18 +0530)]

Merge pull request #55349 from afreen23/fix-64112

mgr/dashboard: Create subvol of same name in different group

Reviewed-by: Ankush Behl <cloudbehl@gmail.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>

commit | commitdiff | tree

Venky Shankar [Tue, 26 Sep 2023 12:22:03 +0000 (08:22 -0400)]

PendingReleaseNotes: add note about new mdlog trimming configurations

Signed-off-by: Venky Shankar <vshankar@redhat.com>

commit | commitdiff | tree

Venky Shankar [Tue, 8 Aug 2023 05:48:45 +0000 (01:48 -0400)]

mds: drive mdlog trimming via a separate thread

Signed-off-by: Venky Shankar <vshankar@redhat.com>

commit | commitdiff | tree

Venky Shankar [Mon, 24 Jul 2023 04:34:47 +0000 (00:34 -0400)]

mds: allow runtime modification of mdlog trimming configuration

Signed-off-by: Venky Shankar <vshankar@redhat.com>

commit | commitdiff | tree

Venky Shankar [Mon, 31 Jul 2023 05:59:24 +0000 (01:59 -0400)]

mds: remove a bunch of heuristics from MDLog::trim()

These were probbaly introduced to workaround some sort of
resource overusage by the MDS during trimming, but now it
looks like they are not really neeeded, especially if we
introduce a dedicated thread for log trimming.

Signed-off-by: Venky Shankar <vshankar@redhat.com>

commit | commitdiff | tree

Venky Shankar [Mon, 24 Jul 2023 04:33:47 +0000 (00:33 -0400)]

mds: add mdlog trimming threshold and decay counter

Fixes: http://tracker.ceph.com/issues/61908
Signed-off-by: Venky Shankar <vshankar@redhat.com>

commit | commitdiff | tree

Venky Shankar [Thu, 25 Jan 2024 09:32:33 +0000 (15:02 +0530)]

qa: remove error string checks and check w/ return value

I ran into this failure once #54972 was merged. The test is validating
the error string returned due to the failed mount. There aren't any
return value checks - which is a _more_ important check. Generic error
string checks will fail once a (error) string is changed (typo, etc..).

Signed-off-by: Venky Shankar <vshankar@redhat.com>

Unnamed repository; edit this file 'description' to name the repository.