]>
git.apps.os.sepia.ceph.com Git - ceph.git/log
Samuel Just [Fri, 15 Oct 2021 22:04:22 +0000 (15:04 -0700)]
Merge pull request #43537 from xxhdx1985126/wip-seastore-iov-max
crimson/os/seastore: deal with scenarios in which bufferlist of do_writev has more than IOV_MAX buffers
Reviewed-by: Samuel Just <sjust@redhat.com>
Casey Bodley [Fri, 15 Oct 2021 18:05:05 +0000 (14:05 -0400)]
Merge pull request #43507 from cbodley/wip-rgw-log-spam
rgw: hide some unnecessary log spam
Reviewed-by: Daniel Gryniewicz <dang@redhat.com>
Casey Bodley [Fri, 15 Oct 2021 18:04:13 +0000 (14:04 -0400)]
Merge pull request #43488 from cbodley/wip-52873
radosgw-admin: 'reshard list' doesn't log ENOENT errors
Reviewed-by: Daniel Gryniewicz <dang@redhat.com>
Casey Bodley [Fri, 15 Oct 2021 18:03:37 +0000 (14:03 -0400)]
Merge pull request #43428 from cfsnyder/wip-52818-random-rgw-req-id
rgw/rgw_rados: make RGW request IDs non-deterministic
Reviewed-by: Matt Benjamin <mbenjami@redhat.com>
Reviewed-by: Adam C. Emerson <aemerson@redhat.com>
Casey Bodley [Fri, 15 Oct 2021 18:02:26 +0000 (14:02 -0400)]
Merge pull request #42981 from mkogan1/rgw-d3n-fix-lru-exp
rgw: d3n: fix d3n cached objects on disk lru eviction entering infini…
Reviewed-by: Casey Bodley <cbodley@redhat.com>
Casey Bodley [Fri, 15 Oct 2021 18:01:58 +0000 (14:01 -0400)]
Merge pull request #42208 from BryceCao/wip-add-return-when-access-key-exist-in-subuser-create
rgw : add return when access key exist in subuser create
Reviewed-by: Casey Bodley <cbodley@redhat.com>
Casey Bodley [Fri, 15 Oct 2021 17:59:58 +0000 (13:59 -0400)]
Merge pull request #40144 from hualongfeng/osd_compression_bypass
rgw: osd compression bypass after rgw compression
Reviewed-by: Casey Bodley <cbodley@redhat.com>
Kefu Chai [Fri, 15 Oct 2021 16:10:32 +0000 (00:10 +0800)]
Merge pull request #43533 from tchaikov/wip-cmake-less-chatty
cmake: only pass --verbose when VERBOSE env variable is set
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Casey Bodley [Fri, 15 Oct 2021 15:57:13 +0000 (11:57 -0400)]
rgw: silence error message for nonexistent realm
in a normal vstart cluster without a realm or period, every command
prints this error message:
0 RGWPeriod::init failed to init realm id : (2) No such file or directory
Signed-off-by: Casey Bodley <cbodley@redhat.com>
Sage Weil [Fri, 15 Oct 2021 15:48:29 +0000 (11:48 -0400)]
Merge PR #43558 into master
* refs/pull/43558/head:
doc/governance.rst: update clt membership
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Sage Weil <sage@redhat.com>
Patrick Donnelly [Fri, 15 Oct 2021 15:16:25 +0000 (11:16 -0400)]
Merge PR #43430 into master
* refs/pull/43430/head:
qa: skip c-v install on pacific start
Reviewed-by: Sage Weil <sage@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
Patrick Donnelly [Fri, 15 Oct 2021 15:15:41 +0000 (11:15 -0400)]
Merge PR #43426 into master
* refs/pull/43426/head:
qa/cephfs: update xfstests_dev for centos stream
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Patrick Donnelly [Fri, 15 Oct 2021 15:13:55 +0000 (11:13 -0400)]
Merge PR #43425 into master
* refs/pull/43425/head:
qa: import CommandFailedError from exceptions not run
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Jos Collin <jcollin@redhat.com>
Patrick Donnelly [Fri, 15 Oct 2021 15:13:05 +0000 (11:13 -0400)]
Merge PR #43420 into master
* refs/pull/43420/head:
qa: skip internal metadata directory when scanning ceph debugfs directory
Reviewed-by: Jeff Layton <jlayton@redhat.com>
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Patrick Donnelly [Fri, 15 Oct 2021 15:11:48 +0000 (11:11 -0400)]
Merge PR #43382 into master
* refs/pull/43382/head:
mds: skip journaling blocklisted clients when in `replay` state
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Daniel Gryniewicz [Fri, 15 Oct 2021 15:02:18 +0000 (11:02 -0400)]
Merge pull request #43120 from dang/wip-dang-user-load-attrs
RGW Zipper - load user attributes with user
Reviewed-by: Soumya Koduri <skoduri@redhat.com>
Reviewed-by: Casey Bodley <cbodley@redhat.com>
Kefu Chai [Fri, 15 Oct 2021 11:25:55 +0000 (19:25 +0800)]
Merge pull request #31909 from rosinL/wip-fix-dpdk-cond-wait
msg/async/dpdk: exit condition waiting when DPDKStack is destructed
Reviewed-by: Kefu Chai <kchai@redhat.com>
Xuehan Xu [Thu, 14 Oct 2021 09:43:23 +0000 (17:43 +0800)]
crimson/os/seastore: deal with scenarios in which bufferlist of do_writev has more than IOV_MAX buffers
Signed-off-by: Xuehan Xu <xxhdx1985126@gmail.com>
Chunsong Feng [Wed, 20 Nov 2019 01:42:11 +0000 (09:42 +0800)]
msg/async/dpdk: exit condition waiting when DPDKStack is destructed
exit() will call pthread_cond_destroy attempting to destroy dpdk::eal::cond
upon which other threads are currently blocked results in undefine
behavior. Link different libc version test, libc-2.17 can exit,
libc-2.27 will deadlock, the call stack is as follows:
Thread 3 (Thread 0xffff7e5749f0 (LWP 62213)):
#0 0x0000ffff7f3c422c in futex_wait_cancelable (private=<optimized out>, expected=0,
futex_word=0xaaaadc0e30f4 <dpdk::eal::cond+44>) at ../sysdeps/unix/sysv/linux/futex-internal.h:88
#1 __pthread_cond_wait_common (abstime=0x0, mutex=0xaaaadc0e30f8 <dpdk::eal::lock>, cond=0xaaaadc0e30c8 <dpdk::eal::cond>)
at pthread_cond_wait.c:502
#2 __pthread_cond_wait (cond=0xaaaadc0e30c8 <dpdk::eal::cond>, mutex=0xaaaadc0e30f8 <dpdk::eal::lock>)
at pthread_cond_wait.c:655
#3 0x0000ffff7f1f1f80 in std::condition_variable::wait(std::unique_lock<std::mutex>&) ()
from /usr/lib/aarch64-linux-gnu/libstdc++.so.6
#4 0x0000aaaad37f5078 in dpdk::eal::<lambda()>::operator()(void) const (__closure=<optimized out>, __closure=<optimized out>)
at ./src/msg/async/dpdk/dpdk_rte.cc:136
#5 0x0000ffff7f1f7ed4 in ?? () from /usr/lib/aarch64-linux-gnu/libstdc++.so.6
#6 0x0000ffff7f3be088 in start_thread (arg=0xffffe73e197f) at pthread_create.c:463
#7 0x0000ffff7efc74ec in thread_start () at ../sysdeps/unix/sysv/linux/aarch64/clone.S:78
Thread 1 (Thread 0xffff7ee3b010 (LWP 62200)):
#0 0x0000ffff7f3c3c38 in futex_wait (private=<optimized out>, expected=12, futex_word=0xaaaadc0e30ec <dpdk::eal::cond+36>)
at ../sysdeps/unix/sysv/linux/futex-internal.h:61
#1 futex_wait_simple (private=<optimized out>, expected=12, futex_word=0xaaaadc0e30ec <dpdk::eal::cond+36>)
at ../sysdeps/nptl/futex-internal.h:135
#2 __pthread_cond_destroy (cond=0xaaaadc0e30c8 <dpdk::eal::cond>) at pthread_cond_destroy.c:54
#3 0x0000ffff7ef2be34 in __run_exit_handlers (status=-6, listp=0xffff7f04a5a0 <__exit_funcs>, run_list_atexit=255,
run_list_atexit@entry=true, run_dtors=run_dtors@entry=true) at exit.c:108
#4 0x0000ffff7ef2bf6c in __GI_exit (status=<optimized out>) at exit.c:139
#5 0x0000ffff7ef176e4 in __libc_start_main (main=0x0, argc=0, argv=0x0, init=<optimized out>, fini=<optimized out>,
rtld_fini=<optimized out>, stack_end=<optimized out>) at ../csu/libc-start.c:344
#6 0x0000aaaad2939db0 in _start () at ./src/include/buffer.h:642
Fixes: https://tracker.ceph.com/issues/42890
Signed-off-by: Chunsong Feng <fengchunsong@huawei.com>
Signed-off-by: luo rixin <luorixin@huawei.com>
Kefu Chai [Fri, 15 Oct 2021 08:07:36 +0000 (16:07 +0800)]
Merge pull request #43555 from tchaikov/wip-dashboard-tsconfig
mgr/dashboard: s/hosts.clickHostTab/hosts.clickTab/
Reviewed-by: Pere Diaz Bou <pdiazbou@redhat.com>
Reviewed-by: Alfonso Martínez <almartin@redhat.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>
Kefu Chai [Thu, 14 Oct 2021 22:44:55 +0000 (06:44 +0800)]
mgr/dashboard: s/hosts.clickHostTab/hosts.clickTab/
this change is a follow-up fix of
3276147049511a9a8d45d84c95154a4ec014cd83
otherwise we have following test failure:
[lint:tsc ] cypress/integration/orchestrator/01-hosts.e2e-spec.ts(29,13): error TS2339: Property 'clickHostTab' does not exist on type 'HostsPageHelper'.
also change "host" to "hostname" to be more consistent
Fixes: https://tracker.ceph.com/issues/52645
Signed-off-by: Kefu Chai <tchaikov@gmail.com>
Xuehan Xu [Fri, 15 Oct 2021 05:42:41 +0000 (13:42 +0800)]
common: add prepare_iovs to bufferlist
Enable bufferlist that have more than IOV_MAX buffers to build a vector of iov.
Signed-off-by: Xuehan Xu <xxhdx1985126@gmail.com>
Xuehan Xu [Thu, 14 Oct 2021 09:41:18 +0000 (17:41 +0800)]
crimson/common: extract parallel_for_each into errorator-loop.h
otherwise, crimson::errorator<AllowedErrors...>::parallel_for_each would be
referencing incomplete crimson::parallel_for_each_state type
Signed-off-by: Xuehan Xu <xxhdx1985126@gmail.com>
Feng Hualong [Tue, 10 Aug 2021 08:22:33 +0000 (16:22 +0800)]
rgw: osd compression bypass after rgw compression
In particular, when rgw is itself doing compression,
rgw should send a incompressible hint to OSD.
Because the OSD has little effect on the data
compressed in the condition. So there bypassing the
osd compression is to avoid repeated compression
calculation.
Signed-off-by: Feng Hualong <hualong.feng@intel.com>
Kefu Chai [Fri, 15 Oct 2021 02:09:52 +0000 (10:09 +0800)]
Merge pull request #43554 from athanatos/sjust/wip-block-segment-manager-sync
crimson/os/seastore/segment_manager/block: open with dsync
Reviewed-by: Kefu Chai <tchaikov@gmail.com>
Reviewed-by: Chunmei Liu <chunmei.liu@intel.com>
Reviewed-by: Xuehan Xu <xxhdx1985126@gmail.com>
Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
Josh Durgin [Fri, 15 Oct 2021 00:52:50 +0000 (20:52 -0400)]
doc/governance.rst: update clt membership
This now matches the mailing list which Sage audited.
Signed-off-by: Josh Durgin <jdurgin@redhat.com>
Sage Weil [Fri, 15 Oct 2021 01:00:13 +0000 (21:00 -0400)]
Merge PR #43333 into master
* refs/pull/43333/head:
.github/CODEOWNERS: tag rook devs on rook related PRs
Reviewed-by: Sébastien Han <seb@redhat.com>
Reviewed-by: Sebastian Wagner <sewagner@redhat.com>
Samuel Just [Thu, 14 Oct 2021 21:51:38 +0000 (14:51 -0700)]
crimson/os/seastore/segment_manager/block: open with dsync
67efc4 appears to be simply incorrect, I don't see any calls
to flush(), so we do need to open with dsync until we
implement a smarter flushing scheme.
Also, refactor open_device to remove mode param -- we always
pass the same value.
Reverts:
67efc48332cebb69497c5676be6ed2a6e5365e03
Signed-off-by: Samuel Just <sjust@redhat.com>
Casey Bodley [Thu, 14 Oct 2021 20:13:28 +0000 (16:13 -0400)]
Merge pull request #43477 from anthonyeleven/anthonyeleven/fix-23027
ceph: document rgw_multipart_min_part_size
Reviewed-by: Casey Bodley <cbodley@redhat.com>
Sebastian Wagner [Thu, 14 Oct 2021 15:33:26 +0000 (17:33 +0200)]
Merge pull request #42997 from josephsawaya/wip-mgr-rook-orch-ls
mgr/rook: orch ls displays info about OSDs
Reviewed-by: Juan Miguel Olmo Martínez <jolmomar@redhat.com>
Sebastian Wagner [Thu, 14 Oct 2021 14:54:18 +0000 (16:54 +0200)]
Merge pull request #43332 from sebastian-philipp/sphinx-rgw-spec
doc/cephadm: use sphinx autoclass to document RGWSpec
Reviewed-by: Kefu Chai <kchai@redhat.com>
Kefu Chai [Thu, 14 Oct 2021 14:43:16 +0000 (22:43 +0800)]
Merge pull request #43239 from trociny/wip-48959
osd: handle inconsistent hash info during backfill and deep scrub gracefully
Reviewed-by: Samuel Just <sjust@redhat.com>
Kefu Chai [Thu, 14 Oct 2021 14:41:52 +0000 (22:41 +0800)]
Merge pull request #43498 from dheart-joe/bluestore_fix
os/bluestore: _do_write_small fix head_pad
Reviewed-by: Igor Fedotov <ifedotov@suse.com>
Kefu Chai [Thu, 14 Oct 2021 14:40:20 +0000 (22:40 +0800)]
Merge pull request #42099 from aclamk/wip-bluefs-fine-grain-locking-2
os/bluestore: BlueFS fine grain locking
Reviewed-by: Sage Weil <sage@newdream.net>
Kefu Chai [Thu, 14 Oct 2021 14:39:14 +0000 (22:39 +0800)]
Merge pull request #43374 from ljflores/wip-remove-pool-name
mgr/telemetry: remove pool name from telemetry perf report
Reviewed-by: Yaarit Hatuka <yaarithatuka@gmail.com>
Kefu Chai [Thu, 14 Oct 2021 14:38:39 +0000 (22:38 +0800)]
Merge pull request #43463 from Zhiwei-Dai/wip-enhance-qa-python3-compatibility
qa/tasks: replace iterkeys() with keys() for Python 3
Reviewed-by: Kefu Chai <tchaikov@gmail.com>
Kefu Chai [Thu, 14 Oct 2021 14:37:36 +0000 (22:37 +0800)]
Merge pull request #43405 from ljflores/wip-perfcounter-priorities
os/bluestore: update perf counter priorities
Reviewed-by: Igor Fedotov <ifedotov@suse.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
Kefu Chai [Thu, 14 Oct 2021 14:36:30 +0000 (22:36 +0800)]
Merge pull request #43422 from ronen-fr/wip-rf-exact-time-sec
common/ceph_time: fix wrong seconds output in exact_timespan_str()
Reviewed-by: Adam C. Emerson <aemerson@redhat.com>
Reviewed-by: Kefu Chai <tchaikov@gmail.com>
Kefu Chai [Thu, 14 Oct 2021 14:34:56 +0000 (22:34 +0800)]
Merge pull request #43515 from fengchunsong/dpdk-dev
msgs/async/dpdk: fix dpdk build error
Reviewed-by: Kefu Chai <tchaikov@gmail.com>
Kefu Chai [Thu, 14 Oct 2021 14:33:39 +0000 (22:33 +0800)]
Merge pull request #43531 from tchaikov/wip-pick-addr
common/pick_address: refactor pick_addresses()
Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
Sebastian Wagner [Thu, 14 Oct 2021 14:29:05 +0000 (16:29 +0200)]
Merge pull request #43073 from strenuous-life/wip-cephadm-osd-rm-status
mgr/cephadm: the display of 'orch osd rm status' is incorrect
Reviewed-by: Sebastian Wagner <sewagner@redhat.com>
Sebastian Wagner [Thu, 14 Oct 2021 14:28:30 +0000 (16:28 +0200)]
Merge pull request #43214 from batrick/i52654
pybind/mgr/cephadm: set allow_standby_replay during CephFS upgrade
Reviewed-by: Sage Weil <sage@newdream.net>
Reviewed-by: Sebastian Wagner <sewagner@redhat.com>
Sebastian Wagner [Thu, 14 Oct 2021 14:28:03 +0000 (16:28 +0200)]
Merge pull request #43253 from guits/fix_orch_ls_osd
cephadm: fix osd size reported in 'orch ls'
Reviewed-by: Sebastian Wagner <sewagner@redhat.com>
Sebastian Wagner [Thu, 14 Oct 2021 14:27:10 +0000 (16:27 +0200)]
Merge pull request #43339 from adk3798/volume-chunk
mgr/cephadm: use json-pretty for ceph-volume calls
Reviewed-by: Sage Weil <sage@newdream.net>
Patrick Donnelly [Thu, 14 Oct 2021 14:08:18 +0000 (10:08 -0400)]
Merge PR #43517 into master
* refs/pull/43517/head:
doc / cephfs: health message codes should be permalinks
Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Kefu Chai [Thu, 14 Oct 2021 14:00:27 +0000 (22:00 +0800)]
Merge pull request #43538 from rzarzynski/wip-crimson-net-actual-bind-failure
crimson/osd: print the actual reason on bind failure.
Reviewed-by: Kefu Chai <tchaikov@gmail.com>
Ernesto Puerta [Thu, 14 Oct 2021 13:12:42 +0000 (15:12 +0200)]
Merge pull request #42557 from ceph/feature-50336-cluster-creation-wizard
mgr/dashboard: Cluster Creation/Expansion Wizard
Reviewed-by: Alfonso Martínez <almartin@redhat.com>
Reviewed-by: Avan Thakkar <athakkar@redhat.com>
Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>
Reviewed-by: sebastian-philipp <NOT@FOUND>
Reviewed-by: Volker Theile <vtheile@suse.com>
Ernesto Puerta [Thu, 14 Oct 2021 11:26:44 +0000 (13:26 +0200)]
Merge pull request #43453 from rhcs-dashboard/e2e-host-fix-teuth
mgr/dashboard: Fix orchestrator/01-hosts.e2e-spec.ts failure
Reviewed-by: Alfonso Martínez <almartin@redhat.com>
Reviewed-by: Avan Thakkar <athakkar@redhat.com>
Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
Chunsong Feng [Wed, 13 Oct 2021 03:55:08 +0000 (03:55 +0000)]
src/msg/dpdk: reserve funcs capacity to avoid reallocation
When a new vector is added larger than then current vector capacity,
it reallocates space. lamda function accesses the previous adress will
cause a segment fault. Therefore, reserve sufficient funcs space to
avoid reallocation.
Fixes: https://tracker.ceph.com/issues/52907
Signed-off-by: Chunsong Feng <fengchunsong@huawei.com>
Chunsong Feng [Wed, 13 Oct 2021 03:33:02 +0000 (03:33 +0000)]
msgs/async/dpdk: fix dpdk build error
Signed-off-by: Chunsong Feng <fengchunsong@huawei.com>
Radoslaw Zarzynski [Wed, 13 Oct 2021 12:21:04 +0000 (12:21 +0000)]
crimson/osd: print the actual reason on bind failure.
Signed-off-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
Yuval Lifshitz [Thu, 14 Oct 2021 07:41:15 +0000 (10:41 +0300)]
Merge pull request #43308 from TRYTOBE8TME/wip-rgw-post-object-upload
src/rgw: HTTP POST object upload
Kefu Chai [Thu, 14 Oct 2021 06:39:55 +0000 (14:39 +0800)]
cmake: only pass --verbose when VERBOSE env variable is set
so that the setuptools is less chatty by default when building python
bindings.
Signed-off-by: Kefu Chai <tchaikov@gmail.com>
Kefu Chai [Wed, 13 Oct 2021 23:51:31 +0000 (07:51 +0800)]
common/pick_address: refactor pick_addresses()
* consolidate the logic handling CEPH_PICK_ADDRESS_PREFER_IPV4 using
std::sort(). this might be overkill. but it helps to explain
what CEPH_PICK_ADDRESS_PREFER_IPV4 is for, and helps to dedup
the code to order the addresses.
* let fill_in_one_address() return an optional<entity_addrvec_t>.
more readable this way
* early return if the required address is not found, instead of
checking variables like ipv4_r
* rename fill_in_one_address() to get_one_address() to reflect
the change of the function's return value's type
Signed-off-by: Kefu Chai <tchaikov@gmail.com>
Venky Shankar [Wed, 13 Oct 2021 05:32:15 +0000 (11:02 +0530)]
doc / cephfs: health message codes should be permalinks
... so that such links can be included in alert warnings.
Additionally, document some other health warnings. Credit to @pcuzner
to point out that not all health warnings have been documented.
Signed-off-by: Venky Shankar <vshankar@redhat.com>
Kefu Chai [Wed, 13 Oct 2021 23:38:09 +0000 (07:38 +0800)]
common/pick_address: use string_view when appropriate
we don't need a string when string_view is good enough.
Signed-off-by: Kefu Chai <tchaikov@gmail.com>
adk3798 [Wed, 13 Oct 2021 21:45:42 +0000 (17:45 -0400)]
Merge pull request #43526 from adk3798/disable-agent-default
mgr/cephadm: disable agent by default
Reviewed-by: Michael Fritch <mfritch@suse.com>
Sage Weil [Wed, 13 Oct 2021 21:08:20 +0000 (17:08 -0400)]
Merge PR #43049 into master
* refs/pull/43049/head:
mgr/rook: apply mds using placement spec and osd_pool_default_size
mgr/rook: factor out replica/failureDomain calc
Reviewed-by: Juan Miguel Olmo <jolmomar@redhat.com>
Adam King [Wed, 13 Oct 2021 16:33:47 +0000 (12:33 -0400)]
mgr/cephadm: disable agent by default
to avoid issues while it is stabilized
Signed-off-by: Adam King <adking@redhat.com>
Sebastian Wagner [Wed, 13 Oct 2021 16:11:36 +0000 (18:11 +0200)]
Merge pull request #43490 from Daniel-Pivonka/cephadm-doc-adopt-sshuser
doc/cephadm: document setting ssh-user during adopt
Reviewed-by: Adam King <adking@redhat.com>
Daniel Pivonka [Mon, 11 Oct 2021 21:15:35 +0000 (17:15 -0400)]
doc/cephadm: document setting ssh-user during adopt
Signed-off-by: Daniel Pivonka <dpivonka@redhat.com>
Laura Flores [Mon, 4 Oct 2021 04:41:10 +0000 (04:41 +0000)]
os/bluestore: update priorities and nicks of bluestore perf counters
These perf counters do not show up in telemetry unless they are set to a "useful" priority or higher. Fetching these counters in telemetry may help to diagnose problems with RocksDB / BlueFS prefetching / insufficient cache sizes.
Signed-off-by: Laura Flores <lflores@redhat.com>
Ernesto Puerta [Wed, 13 Oct 2021 11:37:51 +0000 (13:37 +0200)]
Merge pull request #43377 from rhcs-dashboard/fix-clients-connection-query
mgr/dashboard: replace "Ceph-cluster" Client connections with active-standby MGRs
Reviewed-by: Aashish Sharma <aasharma@redhat.com>
Reviewed-by: Avan Thakkar <athakkar@redhat.com>
Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Greg Farnum <gfarnum@redhat.com>
Reviewed-by: neha-ojha <NOT@FOUND>
Reviewed-by: Nizamudeen A <nia@redhat.com>
Nizamudeen A [Fri, 17 Sep 2021 13:13:01 +0000 (18:43 +0530)]
mgr/dashboard: Cluster expansion e2e cleanups
Fixes: https://tracker.ceph.com/issues/52645
Signed-off-by: Nizamudeen A <nia@redhat.com>
Avan Thakkar [Tue, 3 Aug 2021 09:01:57 +0000 (14:31 +0530)]
mgr/dashboard: introduce gather facts in host list
Fixes: https://tracker.ceph.com/issues/52017
Signed-off-by: Avan Thakkar <athakkar@redhat.com>
Aashish Sharma [Tue, 7 Sep 2021 06:30:45 +0000 (12:00 +0530)]
mgr/dashboard: Cluster Creation Add Services Section
Add Services section in cluster creation wizard
Create Cluster OSD Section Followups
1. The device preview disappearing when going to next step and coming back to the previous step
2. Even when clearing the device preview, the Storage Capacity count and the drive group spec doesn't get cleared.
3. Expanding the cluster without selecting any devices gives a 400
error.
4. Renamed "Delete Host" to "Remove Host"
5. Generalizing most of the sub component code
Fixes: https://tracker.ceph.com/issues/52499
Fixes: https://tracker.ceph.com/issues/51991
Signed-off-by: Nizamudeen A <nia@redhat.com>
Signed-off-by: Aashish Sharma <aasharma@redhat.com>
Nizamudeen A [Tue, 17 Aug 2021 13:34:31 +0000 (19:04 +0530)]
mgr/dashboard: Cluster Creation Create OSDs Section
Create OSDs section in cluster creation wizard
Fixes: https://tracker.ceph.com/issues/51991
Fixes: https://tracker.ceph.com/issues/52298
Signed-off-by: Nizamudeen A <nia@redhat.com>
Signed-off-by: Aashish Sharma <aasharma@redhat.com>
Avan Thakkar [Fri, 9 Jul 2021 13:27:35 +0000 (18:57 +0530)]
mgr/dashboard: Review Section for the Create Cluster Workflow
Fixes: https://tracker.ceph.com/issues/50566
Signed-off-by: Avan Thakkar <athakkar@redhat.com>
Nizamudeen A [Sun, 4 Jul 2021 13:16:45 +0000 (18:46 +0530)]
mgr/dashboard: Cluster Creation Add Host Section and e2es
Add host section of the cluster creation workflow.
1. Fix bug in the modal where going forward one step on the wizard and coming back opens up the add host modal.
2. Rename Create Cluster to Expand Cluster as per the discussions
3. A skip confirmation modal to warn the user when he tries to skip the
cluster creation
4. Adapted all the tests
5. Did some UI improvements like fixing and aligning the styles,
colors..
- Used routed modal for host Additon form
- Renamed the Create to Add in Host Form
Fixes: https://tracker.ceph.com/issues/51517
Fixes: https://tracker.ceph.com/issues/51640
Fixes: https://tracker.ceph.com/issues/50336
Fixes: https://tracker.ceph.com/issues/50565
Signed-off-by: Avan Thakkar <athakkar@redhat.com>
Signed-off-by: Aashish Sharma <aasharma@redhat.com>
Signed-off-by: Nizamudeen A <nia@redhat.com>
Avan Thakkar [Tue, 1 Jun 2021 12:55:15 +0000 (18:25 +0530)]
mgr/dashboard: Create Cluster Workflow welcome screen and e2e tests
A module option called CLUSTER_STATUS has two option. INSTALLED
AND POST_INSTALLED. When CLUSTER_STATUS is INSTALLED it will allow to show the
create-cluster-wizard after login the initial time. After the cluster
creation is succesfull this option is set to POST_INSTALLED
Also has the e2e codes for the Review Section
Fixes: https://tracker.ceph.com/issues/50336
Signed-off-by: Avan Thakkar <athakkar@redhat.com>
Signed-off-by: Nizamudeen A <nia@redhat.com>
Kalpesh Pandya [Wed, 22 Sep 2021 10:26:21 +0000 (15:56 +0530)]
src/test: HTTP POST object upload
This solves the tracker: https://tracker.ceph.com/issues/51724
Basically it is using 'generate_presigned_post()' boto3 API.
This is verified under AMQP endpoint.
Signed-off-by: Kalpesh Pandya <kapandya@redhat.com>
Samuel Just [Tue, 12 Oct 2021 21:39:38 +0000 (14:39 -0700)]
Merge pull request #43500 from rzarzynski/wip-crimson-interruptible-condalias
crimson: osd operations respect interruptor's InterruptCondition.
Reviewed-by: Samuel Just <sjust@redhat.com>
Samuel Just [Tue, 12 Oct 2021 21:38:47 +0000 (14:38 -0700)]
Merge pull request #43501 from rzarzynski/wip-crimson-parse-env
crimson: respect the CEPH_ARGS environment variable.
Reviewed-by: Kefu Chai <tchaikov@gmail.com>
Reviewed-by: Samuel Just <sjust@redhat.com>
Sebastian Wagner [Tue, 12 Oct 2021 20:29:06 +0000 (22:29 +0200)]
Merge pull request #43274 from pcuzner/add-mib
monitoring:Adding the Ceph MIB
Reviewed-by: Sebastian Wagner <sewagner@redhat.com>
Gregory Farnum [Tue, 12 Oct 2021 19:00:56 +0000 (12:00 -0700)]
Merge pull request #43509 from ceph/revert-43462-wip-test-erasure-code-for-aarch64
Revert "qa: support isal ec test for aarch64"
Reviewed-by: Greg Farnum <gfarnum@redhat.com>
Zack Cerza [Tue, 12 Oct 2021 18:43:34 +0000 (12:43 -0600)]
Revert "qa: support isal ec test for aarch64"
This commit has been causing scheduled jobs to request e.g. aarch64
smithi machines, which don't exist. The dispatcher then tries to find them forever, requiring the dispatcher to be killed and restarted. The queue
will sit idle until someone notices the problem.
Signed-off-by: Zack Cerza <zack@redhat.com>
Casey Bodley [Tue, 12 Oct 2021 18:30:28 +0000 (14:30 -0400)]
rgw: raise log level of http client headers
these messages are especially spammy because the http client's dout
prefix includes the entire request path:
2021-10-12T04:53:39.281+0000
7f66a0ff9700 10 http_client[POST/http://example.com/admin/log?type=data¬ify2&source-zone=
3a571642 -9f5e-46d8-8186-
9eca8cc79ac6 &rgwx-zonegroup=
8f3b29b1 -ffc6-4c90-9d0c-
9bd258028cd8 ]receive_http_header
2021-10-12T04:53:39.281+0000
7f66a0ff9700 10 http_client[POST/http://example.com/admin/log?type=data¬ify2&source-zone=
3a571642 -9f5e-46d8-8186-
9eca8cc79ac6 &rgwx-zonegroup=
8f3b29b1 -ffc6-4c90-9d0c-
9bd258028cd8 ]received header:HTTP/1.1 501 Not Implemented
2021-10-12T04:53:39.281+0000
7f66a0ff9700 10 http_client[POST/http://example.com/admin/log?type=data¬ify2&source-zone=
3a571642 -9f5e-46d8-8186-
9eca8cc79ac6 &rgwx-zonegroup=
8f3b29b1 -ffc6-4c90-9d0c-
9bd258028cd8 ]receive_http_header
2021-10-12T04:53:39.281+0000
7f66a0ff9700 10 http_client[POST/http://example.com/admin/log?type=data¬ify2&source-zone=
3a571642 -9f5e-46d8-8186-
9eca8cc79ac6 &rgwx-zonegroup=
8f3b29b1 -ffc6-4c90-9d0c-
9bd258028cd8 ]received header:Content-Length: 108
2021-10-12T04:53:39.281+0000
7f66a0ff9700 10 http_client[POST/http://example.com/admin/log?type=data¬ify2&source-zone=
3a571642 -9f5e-46d8-8186-
9eca8cc79ac6 &rgwx-zonegroup=
8f3b29b1 -ffc6-4c90-9d0c-
9bd258028cd8 ]receive_http_header
2021-10-12T04:53:39.281+0000
7f66a0ff9700 10 http_client[POST/http://example.com/admin/log?type=data¬ify2&source-zone=
3a571642 -9f5e-46d8-8186-
9eca8cc79ac6 &rgwx-zonegroup=
8f3b29b1 -ffc6-4c90-9d0c-
9bd258028cd8 ]received header:x-amz-request-id: tx00000000000000027e9ba-
00616514d3 -70a3-east
2021-10-12T04:53:39.281+0000
7f66a0ff9700 10 http_client[POST/http://example.com/admin/log?type=data¬ify2&source-zone=
3a571642 -9f5e-46d8-8186-
9eca8cc79ac6 &rgwx-zonegroup=
8f3b29b1 -ffc6-4c90-9d0c-
9bd258028cd8 ]receive_http_header
2021-10-12T04:53:39.281+0000
7f66a0ff9700 10 http_client[POST/http://example.com/admin/log?type=data¬ify2&source-zone=
3a571642 -9f5e-46d8-8186-
9eca8cc79ac6 &rgwx-zonegroup=
8f3b29b1 -ffc6-4c90-9d0c-
9bd258028cd8 ]received header:Accept-Ranges: bytes
2021-10-12T04:53:39.281+0000
7f66a0ff9700 10 http_client[POST/http://example.com/admin/log?type=data¬ify2&source-zone=
3a571642 -9f5e-46d8-8186-
9eca8cc79ac6 &rgwx-zonegroup=
8f3b29b1 -ffc6-4c90-9d0c-
9bd258028cd8 ]receive_http_header
2021-10-12T04:53:39.281+0000
7f66a0ff9700 10 http_client[POST/http://example.com/admin/log?type=data¬ify2&source-zone=
3a571642 -9f5e-46d8-8186-
9eca8cc79ac6 &rgwx-zonegroup=
8f3b29b1 -ffc6-4c90-9d0c-
9bd258028cd8 ]received header:Content-Type: application/json
2021-10-12T04:53:39.281+0000
7f66a0ff9700 10 http_client[POST/http://example.com/admin/log?type=data¬ify2&source-zone=
3a571642 -9f5e-46d8-8186-
9eca8cc79ac6 &rgwx-zonegroup=
8f3b29b1 -ffc6-4c90-9d0c-
9bd258028cd8 ]receive_http_header
2021-10-12T04:53:39.281+0000
7f66a0ff9700 10 http_client[POST/http://example.com/admin/log?type=data¬ify2&source-zone=
3a571642 -9f5e-46d8-8186-
9eca8cc79ac6 &rgwx-zonegroup=
8f3b29b1 -ffc6-4c90-9d0c-
9bd258028cd8 ]received header:Date: Tue, 12 Oct 2021 04:53:39 GMT
2021-10-12T04:53:39.281+0000
7f66a0ff9700 10 http_client[POST/http://example.com/admin/log?type=data¬ify2&source-zone=
3a571642 -9f5e-46d8-8186-
9eca8cc79ac6 &rgwx-zonegroup=
8f3b29b1 -ffc6-4c90-9d0c-
9bd258028cd8 ]receive_http_header
raise to 30 so they're only enabled for super-verbose logging
Signed-off-by: Casey Bodley <cbodley@redhat.com>
Casey Bodley [Tue, 12 Oct 2021 18:22:10 +0000 (14:22 -0400)]
rgw: raise log level of realm/zonegroup/zone output on startup
this output shows up in all radosgw-admin commands, and isn't relevant
to most of them
Signed-off-by: Casey Bodley <cbodley@redhat.com>
Samuel Just [Tue, 12 Oct 2021 15:09:58 +0000 (08:09 -0700)]
Merge pull request #43492 from xxhdx1985126/wip-52894
crimson/os/seastore: set ExtentPlacementManager::allocated_to before rolling segments
Reviewed-by: Chunmei Liu <chunmei.liu@intel.com>
Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
Nizamudeen A [Thu, 7 Oct 2021 15:36:29 +0000 (21:06 +0530)]
mgr/dashboard: Fix orchestrator/01-hosts.e2e-spec.ts failure
The test is failing on deleting a host because the agent daemon is
present in that host. Its not possible to simply delete a host. We need
to drain it first and then delete it.
Fixes: https://tracker.ceph.com/issues/52764
Signed-off-by: Nizamudeen A <nia@redhat.com>
Radoslaw Zarzynski [Tue, 12 Oct 2021 11:25:20 +0000 (11:25 +0000)]
crimson: respect the CEPH_ARGS environment variable.
Rook uses `CEPH_ARGS` to convey the `mon_host` configurable. Lack of
support for it was the root cause of inability to reach out to monitors.
See: https://gist.github.com/rzarzynski/
95746aa73a48e811749a3b0aaeb31680 #gistcomment-
3924233 .
Signed-off-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
Radoslaw Zarzynski [Tue, 12 Oct 2021 11:23:45 +0000 (11:23 +0000)]
crimson/osd: rectify a coding style issue in PglogBasedRecovery::do_recovery().
Signed-off-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
Radoslaw Zarzynski [Tue, 12 Oct 2021 09:38:23 +0000 (09:38 +0000)]
crimson: osd operations respect interruptor's InterruptCondition.
For the sake of DRY.
Signed-off-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
dheart [Tue, 12 Oct 2021 08:46:34 +0000 (16:46 +0800)]
os/bluestore: _do_write_small fix head_pad
Signed-off-by: dheart <dheart_joe@163.com>
Avan Thakkar [Fri, 11 Jun 2021 11:07:10 +0000 (16:37 +0530)]
cephadm: expose gather-facts api method
Fixes: https://tracker.ceph.com/issues/51209
This PR intends to expose host metadata(gather-facts) api method
Signed-off-by: Avan Thakkar <athakkar@redhat.com>
Signed-off-by: Aashish Sharma <aasharma@redhat.com>
Ali Maredia [Tue, 12 Oct 2021 05:31:00 +0000 (01:31 -0400)]
Merge pull request #42052 from TRYTOBE8TME/wip-rgw-dpp-3
rgw: DPP addition 4
Reviewed-by: Ali Maredia <amaredia@redhat.com>
Samuel Just [Tue, 12 Oct 2021 05:17:52 +0000 (22:17 -0700)]
Merge pull request #43181 from xxhdx1985126/wip-extent-placement-manager-multi-device-3
crimson/os/seastore: add multi-device support
Reviewed-by: Samuel Just <sjust@redhat.com>
Xuehan Xu [Tue, 12 Oct 2021 01:55:21 +0000 (09:55 +0800)]
crimson/os/seastore: set ExtentPlacementManager::allocated_to before rolling segments
There are circumstances in which a transaction that are supposed to roll the current segment
is invalidated after it finished writing and before it rolls the segment. If we don't set
ExtentPlacementManager::allocated_to in this situation, another transaction can try to write
to the old "allocated_to" position, which would cause an invalid write error
Fixes: https://tracker.ceph.com/issues/52894
Signed-off-by: Xuehan Xu <xxhdx1985126@gmail.com>
Samuel Just [Tue, 12 Oct 2021 01:28:42 +0000 (18:28 -0700)]
Merge pull request #43449 from rzarzynski/wip-crimson-interruptible-peeringevent
crimson/osd: implement interruptions in PeeringEvent.
Reviewed-by: Chunmei Liu <chunmei.liu@intel.com>
Reviewed-by: Samuel Just <sjust@redhat.com>
Samuel Just [Tue, 12 Oct 2021 01:25:35 +0000 (18:25 -0700)]
Merge pull request #43483 from cyx1231st/wip-seastore-record-header-fullness
crimson/os/seastore: measure inline/ool record header fullness
Reviewed-by: Chunmei Liu <chunmei.liu@intel.com>
Reviewed-by: Samuel Just <sjust@redhat.com>
Casey Bodley [Mon, 11 Oct 2021 18:25:37 +0000 (14:25 -0400)]
radosgw-admin: 'reshard list' doesn't log ENOENT errors
ENOENT errors are expected, especially in fresh clusters, before we've
written any entries to the reshard list shards. avoid logging these
non-fatal ERROR messages:
> -1 ERROR: failed to list reshard log entries, oid=reshard.
0000000000 marker= (2) No such file or directory
Fixes: https://tracker.ceph.com/issues/52873
Signed-off-by: Casey Bodley <cbodley@redhat.com>
Joseph Sawaya [Fri, 3 Sep 2021 17:30:43 +0000 (13:30 -0400)]
mgr/rook: apply mds using placement spec and osd_pool_default_size
This commit changes the apply_mds command in the rook orchestrator
to support some placement specs and also sets the replica size according
to the osd_pool_default_size ceph option.
This commit also adds `orch apply mds` to the QA to test if the command
runs.
Signed-off-by: Joseph Sawaya <jsawaya@redhat.com>
Sage Weil [Mon, 11 Oct 2021 16:57:40 +0000 (11:57 -0500)]
mgr/rook: factor out replica/failureDomain calc
Signed-off-by: Sage Weil <sage@newdream.net>
Sebastian Wagner [Mon, 11 Oct 2021 16:47:35 +0000 (18:47 +0200)]
Merge pull request #43471 from liewegas/fix-rook-1-node
qa/suites/orch/rook: replicate across osds for 1-node clusters
Reviewed-by: Sebastian Wagner <sewagner@redhat.com>
Avan Thakkar [Thu, 30 Sep 2021 22:26:42 +0000 (03:56 +0530)]
mgr/dashboard: replace Client connections with active-stdby mgrs
Fixes: https://tracker.ceph.com/issues/52121
Signed-off-by: Avan Thakkar <athakkar@redhat.com>
Sebastian Wagner [Mon, 11 Oct 2021 14:45:58 +0000 (16:45 +0200)]
Merge pull request #43376 from Daniel-Pivonka/cephadm-apply-health
mgr/cephadm: set HEALTH warnings during apply phase in serve
Reviewed-by: Melissa Li <mingkli@redhat.com>
Sebastian Wagner [Mon, 11 Oct 2021 14:43:10 +0000 (16:43 +0200)]
Merge pull request #43455 from liewegas/qa-nvme-loop
qa: use nvme_loop devices for (some) cephadm tests
Reviewed-by: Sebastian Wagner <sewagner@redhat.com>
Sebastian Wagner [Mon, 11 Oct 2021 14:40:39 +0000 (16:40 +0200)]
Merge pull request #43450 from guits/fix_selinux_apply_spec
cephadm: shell --mount shouldnt enforce ':z' option
Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: Michael Fritch <mfritch@suse.com>
Sebastian Wagner [Mon, 11 Oct 2021 14:38:35 +0000 (16:38 +0200)]
Merge pull request #43458 from mgfritch/cephadm-scp
mgr/cephadm: use `asyncssh.scp` to write remote files
Reviewed-by: Sebastian Wagner <sewagner@redhat.com>
Sage Weil [Fri, 8 Oct 2021 17:28:31 +0000 (12:28 -0500)]
mgr/rook: specify failureDomain on CephObjectStore CR
Signed-off-by: Sage Weil <sage@newdream.net>