git.apps.os.sepia.ceph.com Git

]> git.apps.os.sepia.ceph.com Git - ceph.git/log

Melissa [Wed, 21 Jul 2021 03:46:41 +0000 (23:46 -0400)]

mgr/cephadm: replace instances of remoto/execnet with asyncssh

Fixes: https://tracker.ceph.com/issues/44676
Signed-off-by: Melissa Li <li.melissa.kun@gmail.com>

commit | commitdiff | tree

Melissa [Wed, 21 Jul 2021 03:36:20 +0000 (23:36 -0400)]

mgr/cephadm: rewrite test_etc_ceph to apply to asyncssh

Fixes: https://tracker.ceph.com/issues/44676
Signed-off-by: Melissa Li <li.melissa.kun@gmail.com>

commit | commitdiff | tree

Melissa [Wed, 21 Jul 2021 03:33:37 +0000 (23:33 -0400)]

mgr/cephadm: remove test_stale_connections

remove test_stale_connections because it applies to remoto

Fixes: https://tracker.ceph.com/issues/44676
Signed-off-by: Melissa Li <li.melissa.kun@gmail.com>

commit | commitdiff | tree

Melissa [Wed, 21 Jul 2021 03:28:04 +0000 (23:28 -0400)]

mgr/cephadm: add test_offline, edit with_cephadm_module

add `event_loop` and `tkey` object to with_cephadm_module, and create MockEventLoopThread in fixtures.py to test async functions of ssh.py.
rewrite test_offline to be compatible with asyncssh

Fixes: https://tracker.ceph.com/issues/44676
Signed-off-by: Melissa Li <li.melissa.kun@gmail.com>

commit | commitdiff | tree

Melissa [Wed, 21 Jul 2021 03:19:33 +0000 (23:19 -0400)]

mgr/cephadm: simplify _deploy_cephadm_binary, remove _deploy_cephadm_binary_conn

_deploy_cephadm_binary can be simplified to call write_remote_file in ssh.py, and _deploy_cephadm_binary_conn is no longer needed

Fixes: https://tracker.ceph.com/issues/44676
Signed-off-by: Melissa Li <li.melissa.kun@gmail.com>

commit | commitdiff | tree

Melissa [Wed, 21 Jul 2021 03:12:21 +0000 (23:12 -0400)]

mgr/cephadm: use _remote_connection (ssh.py), _execute_command, _check_execute_command in _run_cephadm

remove _get_connection from module.py and _remote_connection in serve.py, replacing with _remote_connection in ssh.py.
also, replace remoto.process.check with _execute_command and _check_execute_command in ssh.py

Fixes: https://tracker.ceph.com/issues/44676
Signed-off-by: Melissa Li <li.melissa.kun@gmail.com>

commit | commitdiff | tree

Melissa [Wed, 21 Jul 2021 02:43:38 +0000 (22:43 -0400)]

mgr/cephadm: remove remotes.py, replace old _write_remote_file in serve.py with write_remote_file in ssh.py

remove remotes.py because it is specific to execnet/remoto.
_write_remote_file in ssh.py now fulfills the function of write_file in remotes.py and the old _write_remote_file in serve.py

Fixes: https://tracker.ceph.com/issues/44676
Signed-off-by: Melissa Li <li.melissa.kun@gmail.com>

commit | commitdiff | tree

Melissa [Wed, 21 Jul 2021 01:50:33 +0000 (21:50 -0400)]

mgr/cephadm: remove _executable_path from module.py

Fixes: https://tracker.ceph.com/issues/44676
Signed-off-by: Melissa Li <li.melissa.kun@gmail.com>

commit | commitdiff | tree

Melissa [Tue, 20 Jul 2021 23:11:29 +0000 (19:11 -0400)]

mgr/cephadm: make synchronous wrapper functions of the async functions in ssh.py

Fixes: https://tracker.ceph.com/issues/44676
Signed-off-by: Melissa Li <li.melissa.kun@gmail.com>

commit | commitdiff | tree

Melissa [Tue, 20 Jul 2021 23:02:57 +0000 (19:02 -0400)]

mgr/cephadm: create thread to start event loop for ssh.py, and return results of the async functions with get_result

The EventLoopThread class starts a thread and an event loop which runs forever. Coroutines are scheduled on the event loop by the `get_result` method which uses `run_coroutine_threadsafe` to return a concurrent.futures.Future, and ultimately the result with .result()

Fixes: https://tracker.ceph.com/issues/44676
Signed-off-by: Melissa Li <li.melissa.kun@gmail.com>

commit | commitdiff | tree

Melissa [Tue, 20 Jul 2021 22:24:26 +0000 (18:24 -0400)]

mgr/cephadm: move _reconfig_ssh, _reset_con, and _reset_cons to ssh.py

Fixes: https://tracker.ceph.com/issues/44676
Signed-off-by: Melissa Li <li.melissa.kun@gmail.com>

commit | commitdiff | tree

Melissa [Tue, 20 Jul 2021 21:22:00 +0000 (17:22 -0400)]

mgr/cephadm: create async function _write_remote_file to write files on remote host

_write_remote_file uses _check_execute_command in ssh.py which calls _execute_command which uses shlex quote. Thus, any commands with an int will need to be transformed into a str because shlex quote does not take int objects

Fixes: https://tracker.ceph.com/issues/44676
Signed-off-by: Melissa Li <li.melissa.kun@gmail.com>

commit | commitdiff | tree

Melissa [Tue, 20 Jul 2021 21:02:40 +0000 (17:02 -0400)]

mgr/cephadm: execute commands run over ssh via asyncssh

_execute_command will run commands over ssh using the asyncssh `run` method: https://asyncssh.readthedocs.io/en/latest/api.html#asyncssh.SSHClientConnection.run
_check_execute_command will check the output of _execute_command and raise OrchestratorError if command fails on the remote host.
All commands run over ssh are prepended with sudo in `_execute_command` and shell-escaped with shlex quote.
If the cached ssh connection is closed or broken, the connection object will be removed from the cache, added to the `offline_hosts`, and an OrchestratorError will be raised. On the next call, the connection object will attempt to be recreated.
Exceptions involving asyncssh methods should be handled otherwise errors like TypeError: __init__() missing 1 required positional argument: 'reason' could occur due to the asyncssh error interacting with `raise_if_exception`

Fixes: https://tracker.ceph.com/issues/44676
Signed-off-by: Melissa Li <li.melissa.kun@gmail.com>

commit | commitdiff | tree

Melissa [Tue, 20 Jul 2021 19:56:32 +0000 (15:56 -0400)]

mgr/cephadm: create and cache asyncssh connection objects, and handle asyncssh connection errors

Create asyncssh connection object in async `_remote_connection` function and cache in `self.cons`
Create a handler for asyncssh log redirection and output ssh log if a connection error occurs
Disable asyncssh logger from propagating because the asyncssh info messages are verbose

Fixes: https://tracker.ceph.com/issues/44676
Signed-off-by: Melissa Li <li.melissa.kun@gmail.com>

commit | commitdiff | tree

zdover23 [Thu, 12 Aug 2021 22:48:40 +0000 (08:48 +1000)]

Merge pull request #42167 from emenguy/nomad_ceph_integration_documentation

[docs]: RBD and Nomad integration

Reviewed-by: Zac Dover <zac.dover@gmail.com>

commit | commitdiff | tree

Sage Weil [Thu, 12 Aug 2021 20:04:21 +0000 (16:04 -0400)]

Merge PR #42771 into master

* refs/pull/42771/head:
.githubmap: fix format

Reviewed-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Ernesto Puerta [Thu, 12 Aug 2021 18:53:07 +0000 (20:53 +0200)]

Merge pull request #41570 from jhrcz-ls/wip-cephfs-overview-use-rate

mgr/dashboard: cephfs MDS Workload to use rate for counter type metric

commit | commitdiff | tree

Ernesto Puerta [Thu, 12 Aug 2021 18:52:03 +0000 (20:52 +0200)]

Merge pull request #42745 from rhcs-dashboard/52130-tox-cleanup

mgr/dashboard: tox.ini: delete useless env. 'apidocs'

commit | commitdiff | tree

Ernesto Puerta [Thu, 12 Aug 2021 18:50:51 +0000 (20:50 +0200)]

Merge pull request #42724 from rhcs-dashboard/52082-cephadm-e2e-improv

mgr/dashboard: run-cephadm-e2e-tests.sh improvements

commit | commitdiff | tree

Ernesto Puerta [Thu, 12 Aug 2021 18:22:39 +0000 (20:22 +0200)]

.githubmap: fix format

Issue introduced by merge commit
4b9a3b217127d003d4ae6addb1f0e1d8ce21b816

Signed-off-by: Ernesto Puerta <epuertat@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Thu, 12 Aug 2021 18:24:39 +0000 (11:24 -0700)]

Merge pull request #42630 from mark15213/licephsqlitefix

libcephsqlite: fix unconditional success bug in CheckReservedLock

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>

commit | commitdiff | tree

Yuri Weinstein [Thu, 12 Aug 2021 18:21:36 +0000 (11:21 -0700)]

Merge pull request #42345 from aclamk/wip-resharding-column-options

Add handling of block_cache option for resharding

Reviewed-by: Kefu Chai <kchai@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>

commit | commitdiff | tree

Daniel Gryniewicz [Thu, 12 Aug 2021 17:57:30 +0000 (13:57 -0400)]

Merge pull request #42767 from soumyakoduri/wip-skoduri-sqlite-freebsd

rgw/dbstore: Resolve library link issues on FreeBSD

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>
Reviewed-by: Willem Jan Withagen

commit | commitdiff | tree

Ernesto Puerta [Thu, 12 Aug 2021 16:39:25 +0000 (18:39 +0200)]

Merge pull request #42766 from rhcs-dashboard/fix-grafonnet-42194

mgr/dashboard: fix grafonnet build error

commit | commitdiff | tree

Neha Ojha [Thu, 12 Aug 2021 16:14:59 +0000 (09:14 -0700)]

Merge pull request #42725 from ifed01/wip-ifed-fix-deferred

os/bluestore: make deferred writes less aggressive for large writes

Reviewed-by: Adam Kupczyk <akupczyk@redhat.com>

commit | commitdiff | tree

Casey Bodley [Thu, 12 Aug 2021 14:06:51 +0000 (10:06 -0400)]

Merge pull request #42685 from BenoitKnecht/rgw-opa-100-continue

rgw: Use 100-continue in OPA requests to reduce latency

Reviewed-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

Soumya Koduri [Thu, 12 Aug 2021 13:52:08 +0000 (19:22 +0530)]

rgw/dbstore: Resolve library link issues on FreeBSD

Add "rgw_common" to the target_link_libraries list to
resolve issues with sqlite_db target build on FreeBSD

Reported-by: Willem Jan Withagen wjw@digiware.nl
Signed-off-by: Soumya Koduri <skoduri@redhat.com>

commit | commitdiff | tree

Kefu Chai [Thu, 12 Aug 2021 13:13:54 +0000 (21:13 +0800)]

Merge pull request #42657 from tchaikov/wip-optional-wheel

tools/setup-virtualenv: do not use wheel if wheelhouse does not exist

Reviewed-by: Deepika Upadhyay <dupadhya@redhat.com>

commit | commitdiff | tree

Aashish Sharma [Thu, 12 Aug 2021 08:58:23 +0000 (14:28 +0530)]

mgr/dashboard: fix grafonnet build error

This PR tends to fix the issue caused by #42194

Fixes:https://tracker.ceph.com/issues/52238
Signed-off-by: Aashish Sharma <aasharma@redhat.com>

commit | commitdiff | tree

Sebastian Wagner [Thu, 12 Aug 2021 11:06:11 +0000 (13:06 +0200)]

Merge pull request #42293 from zdover23/wip-doc-cephadm-client-setup-2021-07-12

doc/cephadm: rewrite client-setup.rst

Reviewed-by: Sebastian Wagner <sewagner@redhat.com>

commit | commitdiff | tree

Sage Weil [Wed, 11 Aug 2021 19:28:40 +0000 (15:28 -0400)]

Merge PR #42759 into master

* refs/pull/42759/head:
doc/mgr/nfs: add section on updating an nfs cluster

Reviewed-by: Varsha Rao <varao@redhat.com>

commit | commitdiff | tree

Neha Ojha [Wed, 11 Aug 2021 18:29:12 +0000 (11:29 -0700)]

Merge pull request #39871 from benhanokh/no_column_b

BlueStore: Remove Allocations from RocksDB

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Adam Kupczyk <akupczyk@redhat.com>

commit | commitdiff | tree

Casey Bodley [Wed, 11 Aug 2021 17:36:10 +0000 (13:36 -0400)]

Merge pull request #42122 from yehudasa/wip-rgw-realm-resolve

rgw: modify realm entities init resolution

Reviewed-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

Sage Weil [Wed, 11 Aug 2021 16:31:21 +0000 (11:31 -0500)]

doc/mgr/nfs: add section on updating an nfs cluster

Signed-off-by: Sage Weil <sage@newdream.net>

commit | commitdiff | tree

Ernesto Puerta [Wed, 11 Aug 2021 16:11:59 +0000 (18:11 +0200)]

Merge pull request #42194 from rhcs-dashboard/add-grafonnet-grafana

mgr/dashboard: monitoring: replace Grafana JSON with Grafonnet based code

commit | commitdiff | tree

Sage Weil [Wed, 11 Aug 2021 15:28:28 +0000 (11:28 -0400)]

Merge PR #42252 into master

* refs/pull/42252/head:
mgr/dashboard: set rgw credentials: fix api tests
mgr/dashboard: run-frontend-e2e-tests.sh: remove unneeded rgw setting
mgr/dashboard: rgw service creation form: add realm and zone to service spec.
mgr/dashboard: connect-rgw: rename to set-rgw-credentials; refactoring
mgr/dashboard: connect-rgw: adaptation and test coverage
mgr/cephadm: re-check dashboard <-> rgw creds when rgw daemons created/destroyed
mgr/dashboard: add 'dashboard connect-rgw' command
doc/mgr/dashboard: simplify dashboard+rgw config docs

Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Juan Miguel Olmo <jolmomar@redhat.com>

commit | commitdiff | tree

Sage Weil [Wed, 11 Aug 2021 14:58:39 +0000 (10:58 -0400)]

Merge PR #42682 into master

* refs/pull/42682/head:
cephadm: no need to explicitly enable prometheus module
mgr/cephadm: enable prometheus module before deploying prometheus
mgr/cephadm: drop daemon_id arg to CephadmService.config()
doc/cephadm: no need to manually enable the prometheus module

Reviewed-by: Sebastian Wagner <sewagner@redhat.com>

commit | commitdiff | tree

Aashish Sharma [Tue, 6 Jul 2021 11:02:20 +0000 (16:32 +0530)]

mgr/dashboard: monitoring: replace Grafana JSON with Grafonnet based Code

This PR intends to add grafonnet to generate grafana JSON files

Fixes: https://tracker.ceph.com/issues/45184
Signed-off-by: Aashish Sharma <aasharma@redhat.com>

commit | commitdiff | tree

Gabriel BenHanokh [Thu, 14 Jan 2021 06:59:35 +0000 (08:59 +0200)]

[BlueStore]: [Remove Allocations from RocksDB]

Currently BlueStore keeps its allocation info inside RocksDB.
BlueStore is committing all allocation information (alloc/release) into RocksDB (column-family B) before the client Write is performed causing a delay in write path and adding significant load to the CPU/Memory/Disk.
Committing all state into RocksDB allows Ceph to survive failures without losing the allocation state.

The new code skips the RocksDB updates on allocation time and instead perform a full desatge of the allocator object with all the OSD allocation state in a single step during umount().
This results with an 25% increase in IOPS and reduced latency in small random-write workloads, but exposes the system to losing allocation info in failure cases where we don't call umount.
We added code to perform a full allocation-map rebuild from information stored inside the ONode which is used in failure cases.
When we perform a graceful shutdown there is no need for recovery and we simply read the allocation-map from a flat file where the allocation-map was stored during umount() (in fact this mode is faster and shaves few seconds from boot time since reading a flat file is faster than iterating over RocksDB)

Open Issues:

There is a bug in the src/stop.sh script killing ceph without invoking umount() which means anyone using it will always invoke the recovery path.
Adam Kupczyk is fixing this issue in a separate PR.
A simple workaround is to add a call to 'killall -15 ceph-osd' before calling src/stop.sh

Fast-Shutdown and Ceph Suicide (done when the system underperforms) stop the system without a proper drain and a call to umount.
This will trigger a full recovery which can be long( 3 minutes in my testing, but your your mileage may vary).
We plan on adding a follow up PR doing the following in Fast-Shutdown and Ceph Suicide:

Block the OSD queues from accepting any new request
Delete all items in queue which we didn't start yet
Drain all in-flight tasks
call umount (and destage the allocation-map)
If drain didn't complete within a predefined time-limit (say 3 minutes) -> kill the OSD
Signed-off-by: Gabriel Benhanokh <gbenhano@redhat.com>
create allocator from on-disk onodes and BlueFS inodes
change allocator + add stat counters + report illegal physical-extents
compare allocator after rebuild from ONodes
prevent collection from being open twice
removed FSCK repo check for null-fm
Bug-Fix: don't add BlueFS allocation to shared allocator
add configuration option to commit to No-Column-B
Only invalidate allocation file after opening rocksdb in read-write mode
fix tests not to expect failure in cases unapplicable to null-allocator
accept non-existing allocation file and don't fail the invaladtion as it could happen legally
don't commit to null-fm when db is opened in repair-mode
add a reverse mechanism from null_fm to real_fm (using RocksDB)
Using Ceph encode/decode, adding more info to header/trailer, add crc protection
Code cleanup

some changes requested by Adam (cleanup and style changes)

Signed-off-by: Gabriel Benhanokh <gbenhano@redhat.com>

commit | commitdiff | tree

Sage Weil [Thu, 5 Aug 2021 14:31:09 +0000 (10:31 -0400)]

cephadm: no need to explicitly enable prometheus module

Signed-off-by: Sage Weil <sage@newdream.net>

commit | commitdiff | tree

Sage Weil [Thu, 5 Aug 2021 14:24:13 +0000 (10:24 -0400)]

mgr/cephadm: enable prometheus module before deploying prometheus

The mon will restart the mgr when the module is enabled, so we don't
really have to do anything here. The raise is there just in case the
mgr doesn't immediately get the new mgrmap and respawn, although there is
likely no harm done if we continue to deploy prometheus in the meantime,
even if we're interrupted partway through.

Signed-off-by: Sage Weil <sage@newdream.net>

commit | commitdiff | tree

Sage Weil [Thu, 5 Aug 2021 14:17:40 +0000 (10:17 -0400)]

mgr/cephadm: drop daemon_id arg to CephadmService.config()

Unused (and nonsensical since this is *service* config).

Signed-off-by: Sage Weil <sage@newdream.net>

commit | commitdiff | tree

Sage Weil [Thu, 5 Aug 2021 14:24:46 +0000 (10:24 -0400)]

doc/cephadm: no need to manually enable the prometheus module

Signed-off-by: Sage Weil <sage@newdream.net>

commit | commitdiff | tree

Kefu Chai [Wed, 11 Aug 2021 12:30:16 +0000 (20:30 +0800)]

Merge pull request #42748 from tchaikov/wip-crimson-cleanup

crimson/tools/store_nbd: do not capture unused variable

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>

commit | commitdiff | tree

Kefu Chai [Wed, 11 Aug 2021 11:22:57 +0000 (19:22 +0800)]

Merge pull request #42746 from tchaikov/wip-boost-j

do_cmake:sh: do not set BOOST_J

Reviewed-by: Deepika Upadhyay <dupadhya@redhat.com>
Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>

commit | commitdiff | tree

Kefu Chai [Wed, 11 Aug 2021 11:22:28 +0000 (19:22 +0800)]

Merge pull request #42743 from tchaikov/wip-install-dep

install-deps.sh: retry if dpkg was interrupted

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>

commit | commitdiff | tree

Kefu Chai [Wed, 11 Aug 2021 10:33:41 +0000 (18:33 +0800)]

crimson/tools/store_nbd: do not capture unused variable

Signed-off-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Kefu Chai [Wed, 11 Aug 2021 08:29:10 +0000 (16:29 +0800)]

script/run-make.sh: retry if dpkg was interrupted

there is chance that apt-get is interrupted in the middle when a new PR
cancels the running jenkins job, the next job running apt-get or dpkg
would run into issues like:

E: dpkg was interrupted, you must manually run 'sudo dpkg --configure -a' to correct the problem.
Build step 'Execute shell' marked build as failure

Signed-off-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Kefu Chai [Wed, 11 Aug 2021 09:32:20 +0000 (17:32 +0800)]

do_cmake:sh: do not set BOOST_J

do_cmake.sh is called by src/script/run-make.sh in configure() function,
in src/script/run-make.sh, BOOST_J is also set if it is not set. so we
can drop the code setting BOOST_J in do_cmake.sh.

this helps to silence the cmake warning like:

CMake Warning:
Manually-specified variables were not used by the project:

BOOST_J

Signed-off-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Alfonso Martínez [Wed, 11 Aug 2021 09:14:53 +0000 (11:14 +0200)]

mgr/dashboard: tox.ini: delete useless env. 'apidocs'

Fixes: https://tracker.ceph.com/issues/52130
Signed-off-by: Alfonso Martínez <almartin@redhat.com>

commit | commitdiff | tree

Alfonso Martínez [Wed, 11 Aug 2021 08:25:42 +0000 (10:25 +0200)]

mgr/dashboard: run-cephadm-e2e-tests.sh improvements

- Jenkins env.: make sure the cluster is always started.
- PR template: add trigger phrase to the jenkins commands list.
- Cypress: add --no-install flag; clean previous reports.

Fixes: https://tracker.ceph.com/issues/52082
Signed-off-by: Alfonso Martínez <almartin@redhat.com>

commit | commitdiff | tree

Alfonso Martínez [Wed, 11 Aug 2021 06:59:13 +0000 (08:59 +0200)]

mgr/dashboard: set rgw credentials: fix api tests

Fixes: https://tracker.ceph.com/issues/44605
Signed-off-by: Alfonso Martínez <almartin@redhat.com>

commit | commitdiff | tree

Sage Weil [Tue, 10 Aug 2021 20:47:34 +0000 (16:47 -0400)]

Merge PR #42318 into master

* refs/pull/42318/head:
mgr/rook: update DefaultFetcher device path to look at local and fix bug
mgr/rook: add node and PV name information to Device in DefaultFetcher
mgr/rook: fix typing errors in Fetcher classes
mgr/rook: create and use DefaultFetcher and LSOFetcher classes
mgr/rook: create KubernetesCustomResource class to fetch CRs
mgr/rook: fix device ls error handling
mgr/rook: change storage class module option name and default value
mgr/rook: fix typing errors related to storage_class_name and device ls
mgr/rook: make `device ls` only display pvs in specified storage class
mgr/rook: add StorageV1Api and storage_class_name to RookCluster
mgr/rook: add StorageV1Api to RookOrchestrator
mgr/rook: add mgr/rook/storage_class_name to ceph config
mgr/rook: ceph orch device ls fetch and display info about PVs
mgr/rook: add CustomObjectsApi to RookCluster

Reviewed-by: Juan Miguel Olmo <jolmomar@redhat.com>

commit | commitdiff | tree

Sage Weil [Tue, 10 Aug 2021 20:47:21 +0000 (16:47 -0400)]

Merge PR #42613 into master

* refs/pull/42613/head:
qa/suites/roch/rook/smoke: test rook 1.7.0, not 1.6.2
qa/tasks/rook: set storage_class to scratch

Reviewed-by: merge 42318

commit | commitdiff | tree

Sage Weil [Tue, 10 Aug 2021 20:37:38 +0000 (16:37 -0400)]

Merge PR #42691 into master

* refs/pull/42691/head:
mgr/nfs: add --port to 'nfs cluster create' and port to 'nfs cluster info'
qa/suites/orch/cephadm/smoke-roleless: test taking ganeshas offline
qa/tasks/vip: exec with bash -ex
qa/suites/orch/cephadm: separate test_nfs from test_orch_cli

Reviewed-by: Varsha Rao <varao@redhat.com>

commit | commitdiff | tree

Daniel Gryniewicz [Tue, 10 Aug 2021 17:49:17 +0000 (13:49 -0400)]

Merge pull request #42574 from soumyakoduri/wip-skoduri-dbstore-fixes

rgw/dbstore: Fix DBstore build conflicts

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>

commit | commitdiff | tree

Kefu Chai [Tue, 10 Aug 2021 16:45:00 +0000 (00:45 +0800)]

Merge pull request #42729 from cyx1231st/wip-seastore-interruptive-future

crimson/os/seastore: wrap up interruptive-futures in seastore

Reviewed-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Kefu Chai [Tue, 10 Aug 2021 15:29:42 +0000 (23:29 +0800)]

Merge pull request #42733 from rzarzynski/wip-script-run-cbt-with-cyanstore

script: run-cbt.sh tests crimson with CyanStore instead of MemStore.

Reviewed-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Josh Durgin [Tue, 10 Aug 2021 15:11:35 +0000 (08:11 -0700)]

Merge pull request #42525 from zdover23/wip-doc-rados-config-storage-devices-front-matter-2021-07-28

doc/rados: rewrite storage device front matter

Reviewed-by: Josh Durgin <jdurgin@redhat.com>

commit | commitdiff | tree

Sage Weil [Tue, 10 Aug 2021 14:36:37 +0000 (10:36 -0400)]

Merge PR #42680 into master

* refs/pull/42680/head:
src/pybind/mgr/nfs/tests: pass cluster_id to from_export_block()
src/pybind/mgr/nfs: remove `tag` option
src/pybind/mgr/nfs: remove per daemon config test
src/pybind/mgr/nfs: directly use cluster_id and remove daemon related stuff

Reviewed-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Tue, 10 Aug 2021 14:35:59 +0000 (10:35 -0400)]

Merge PR #42627 into master

* refs/pull/42627/head:
.github/labeler: add nfs dev doc
doc/dev/vstart-ganesha: update about RGW export
src/vstart: update ganesha pid dir location
src/vstart: create rgw export for nfs

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Radoslaw Zarzynski [Tue, 10 Aug 2021 12:34:45 +0000 (12:34 +0000)]

script: run-cbt.sh tests crimson with CyanStore instead of MemStore.

These tests were always supposed to run against CyanStore. However,
commit e6ed65db8b4e0a2f8026c2e35a12dd292c5f2b8c (PR #42437) changed
the meaning of `--memstore` and introduced `--cyanstore` to be used
instead. This commit makes `run-cbt.sh` aware about the new switch.

Signed-off-by: Radoslaw Zarzynski <rzarzyns@redhat.com>

commit | commitdiff | tree

Zac Dover [Mon, 12 Jul 2021 20:07:49 +0000 (06:07 +1000)]

doc/cephadm: rewrite client-setup.rst

This improves the text in client-setup.rst.

We should make certain that the technical details
in this file remain current in July 2021. This
file was origingally written in November 2019.

Signed-off-by: Zac Dover <zac.dover@gmail.com>

commit | commitdiff | tree

Zac Dover [Wed, 28 Jul 2021 14:37:46 +0000 (00:37 +1000)]

doc/rados: rewrite storage device front matter

This PR updates the text in the RADOS Guide
(the Ceph Storage Cluster Guide) that appears
at the beginning of the "Storage Devices"
chapter. I did the following:

- rewrote some of the sentences so that
  they read more like written text than like
  spoken language
- added "Ceph Manager" to the list of daemons
  that a Ceph cluster comprises
- that's about it.

Signed-off-by: Zac Dover <zac.dover@gmail.com>

commit | commitdiff | tree

Alfonso Martínez [Mon, 9 Aug 2021 11:14:20 +0000 (13:14 +0200)]

mgr/dashboard: run-frontend-e2e-tests.sh: remove unneeded rgw setting

Fixes: https://tracker.ceph.com/issues/44605
Signed-off-by: Alfonso Martínez <almartin@redhat.com>

commit | commitdiff | tree

Alfonso Martínez [Mon, 9 Aug 2021 10:12:52 +0000 (12:12 +0200)]

mgr/dashboard: rgw service creation form: add realm and zone to service spec.

Align rgw service id pattern with cephadm: https://github.com/ceph/ceph/pull/39877
- Update rgw pattern to allow service id for non-multisite config.
- Extract realm and zone from service id (when detected) and add them to the service spec.

Fixes: https://tracker.ceph.com/issues/44605
Signed-off-by: Alfonso Martínez <almartin@redhat.com>

commit | commitdiff | tree

Alfonso Martínez [Fri, 6 Aug 2021 06:57:47 +0000 (08:57 +0200)]

mgr/dashboard: connect-rgw: rename to set-rgw-credentials; refactoring

- Rename the dashboard command to better reflect its behavior.
- Rename '_radosgw_admin' method to 'send_rgwadmin_command' for consistency with
'send_mon_command' and move it to the mgr_module.py .
- Cleanup: remove unneeded rgw settings.
- Better error handling and test coverage.

Fixes: https://tracker.ceph.com/issues/44605
Signed-off-by: Alfonso Martínez <almartin@redhat.com>

commit | commitdiff | tree

Alfonso Martínez [Wed, 28 Jul 2021 07:48:18 +0000 (09:48 +0200)]

mgr/dashboard: connect-rgw: adaptation and test coverage

- Align Dashboard with cephadm: configure credentials using the same logic.
- Fix: create a 'dashboard' user per realm (before: only on 1st realm).
- Lint fixes, test coverage, method renaming to better reflect behavior and method visibility.

Fixes: https://tracker.ceph.com/issues/44605
Signed-off-by: Alfonso Martínez <almartin@redhat.com>

commit | commitdiff | tree

Sage Weil [Thu, 8 Jul 2021 17:22:59 +0000 (13:22 -0400)]

mgr/cephadm: re-check dashboard <-> rgw creds when rgw daemons created/destroyed

We don't always know when a realm is created/destroyed, but we can use
service config and purge to cover most such cases.

Signed-off-by: Sage Weil <sage@newdream.net>

commit | commitdiff | tree

Sage Weil [Thu, 8 Jul 2021 17:10:23 +0000 (13:10 -0400)]

mgr/dashboard: add 'dashboard connect-rgw' command

Signed-off-by: Sage Weil <sage@newdream.net>

commit | commitdiff | tree

Sage Weil [Thu, 8 Jul 2021 20:19:42 +0000 (16:19 -0400)]

doc/mgr/dashboard: simplify dashboard+rgw config docs

Signed-off-by: Sage Weil <sage@newdream.net>

commit | commitdiff | tree

Igor Fedotov [Mon, 9 Aug 2021 15:18:20 +0000 (18:18 +0300)]

test/store_test: more testing for deferred writes

Signed-off-by: Igor Fedotov <ifedotov@suse.com>

commit | commitdiff | tree

Igor Fedotov [Mon, 9 Aug 2021 16:14:33 +0000 (19:14 +0300)]

os/bluestore: enforce one more non-inclusive comparision against prefer_deferred_size

Signed-off-by: Igor Fedotov <ifedotov@suse.com>

commit | commitdiff | tree

Adam Kupczyk [Mon, 9 Aug 2021 13:59:46 +0000 (15:59 +0200)]

os/bluestore: Better handling of deferred write trigger

Now deferred write in _do_alloc_write does not depend on blob size,
but on size of extent allocated on disk.
It is now possible to set bluestore_prefer_deferred_size way larger than
bluestore_max_blob_size and still get desired behavior.
Example: for deferred=256K, blob=64K : when op write is 128K both blobs will be
written as deferred. When op write is 256K then all will go as regular write.

Signed-off-by: Adam Kupczyk <akupczyk@redhat.com>

commit | commitdiff | tree

Igor Fedotov [Mon, 9 Aug 2021 15:18:35 +0000 (18:18 +0300)]

os/bluestore: account for alignment with max_blob_size when splitting
write I/O into chunks.

Without the fix the following write seq:
0~4M
4096~4M

produces tons of deferred writes at the second stage.

Signed-off-by: Igor Fedotov <ifedotov@suse.com>

commit | commitdiff | tree

Igor Fedotov [Fri, 6 Aug 2021 23:22:12 +0000 (02:22 +0300)]

os/bluestore: introduce l_bluestore_write_deferred_bytes perf counter.

Signed-off-by: Igor Fedotov <ifedotov@suse.com>

commit | commitdiff | tree

Igor Fedotov [Fri, 6 Aug 2021 18:32:23 +0000 (21:32 +0300)]

os/bluestore: use non-inclusive comparision against prefer_deferred_size

Fixes: https://tracker.ceph.com/issues/52089
Signed-off-by: Igor Fedotov <ifedotov@suse.com>

commit | commitdiff | tree

Mark Kogan [Tue, 10 Aug 2021 11:08:44 +0000 (14:08 +0300)]

Merge pull request #42262 from wjwithagen/wjw-fix-d3n-cache

rgw: conditionalize d3n_datacache use of libAIO

commit | commitdiff | tree

Kefu Chai [Tue, 10 Aug 2021 09:50:15 +0000 (17:50 +0800)]

Merge pull request #42728 from cyx1231st/wip-crimson-msgr-fix-compression

crimson/net/ProtocolV2: disable COMPRESSION in crimson msgr features

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Chunmei Liu <chunmei.liu@intel.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Yingxin Cheng [Tue, 10 Aug 2021 01:40:56 +0000 (09:40 +0800)]

crimson/net/ProtocolV2: disable COMPRESSION in crimson msgr features

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Kefu Chai [Tue, 10 Aug 2021 04:16:19 +0000 (12:16 +0800)]

Merge pull request #42672 from aclamk/wip-bluestore-tool-conf

tools/ceph-bluestore-tool: Enable configuration options from monitor/ceph.conf

Reviewed-by: Igor Fedotov <ifedotov@suse.com>
Reviewed-by: Josh Durgin <jdurgin@redhat.com>

commit | commitdiff | tree

Yingxin Cheng [Fri, 6 Aug 2021 06:51:50 +0000 (14:51 +0800)]

crimson/os/seastore: drop retired_extent_gate_t

retired_extent_gate_t is no longer needed after integrating
interruptive-future.

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Yingxin Cheng [Fri, 6 Aug 2021 06:14:23 +0000 (14:14 +0800)]

crimson/os/seatore: drop the temporary InterruptedTransactionManager

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Yingxin Cheng [Thu, 5 Aug 2021 07:40:43 +0000 (15:40 +0800)]

crimson/os/seastore: wrap up interruptive-futures in seastore

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Yingxin Cheng [Tue, 3 Aug 2021 08:41:26 +0000 (16:41 +0800)]

crimson/os/seastore: convert ObjectDataHandler with interruptible-future

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Sage Weil [Mon, 9 Aug 2021 23:33:52 +0000 (19:33 -0400)]

Merge PR #42535 into master

* refs/pull/42535/head:
mgr/cephadm: quay.io for non-ceph images too
mgr/cephadm: DEFAULT_IMAGE from quay, not docker
doc/install/containers: quay.io!

Reviewed-by: Sebastian Wagner <sewagner@redhat.com>
Reviewed-by: Dimitri Savineau <dsavinea@redhat.com>

commit | commitdiff | tree

Sage Weil [Mon, 9 Aug 2021 23:33:41 +0000 (19:33 -0400)]

Merge PR #42726 into master

* refs/pull/42726/head:
cephadm: fix container name detection

Reviewed-by: Adam King <adking@redhat.com>

commit | commitdiff | tree

Neha Ojha [Mon, 9 Aug 2021 22:11:19 +0000 (15:11 -0700)]

Merge pull request #42568 from kamoltat/wip-autoscaler-profile-status-print

pybind/mgr/pg_autoscaler: Added PROFILE to autoscale-status

Reviewed-by: Neha Ojha <nojha@redhat.com>

commit | commitdiff | tree

Guillaume Abrioux [Mon, 9 Aug 2021 20:32:43 +0000 (22:32 +0200)]

Merge pull request #42469 from BlaineEXE/ceph-volume-work-around-atari-partitions

ceph-volume: work around phantom atari partitions

commit | commitdiff | tree

Sage Weil [Mon, 9 Aug 2021 19:23:11 +0000 (15:23 -0400)]

Merge PR #42709 into master

* refs/pull/42709/head:
qa/tasks/kubeadm: force docker cgroup engine to systemd

Reviewed-by: Travis Nielsen <tnielsen@redhat.com>

commit | commitdiff | tree

Neha Ojha [Mon, 9 Aug 2021 18:42:41 +0000 (11:42 -0700)]

Merge pull request #42722 from neha-ojha/wip-remove-rgw-perf

qa/suites/rados/perf/ceph.yaml: remove rgw

Reviewed-by: Sridhar Seshasayee <sseshasa@redhat.com>

commit | commitdiff | tree

Sage Weil [Mon, 9 Aug 2021 18:15:28 +0000 (14:15 -0400)]

cephadm: fix container name detection

'enter' was broken because we weren't correctly identifying the container
name. Strip the newline from the inspect result so that we can reliably
match against the 'running' state.

Signed-off-by: Sage Weil <sage@newdream.net>

commit | commitdiff | tree

Neha Ojha [Mon, 9 Aug 2021 17:31:50 +0000 (17:31 +0000)]

qa/suites/perf-basic/ceph.yaml: remove rgw

This is no longer required because we removed cosbench workloads in
fd350fd0150a2d4072f055658c20314a435a19ba.

Signed-off-by: Neha Ojha <nojha@redhat.com>

commit | commitdiff | tree

Igor Fedotov [Fri, 6 Aug 2021 14:50:13 +0000 (17:50 +0300)]

os/bluestore: improve logging around deferred writes

Signed-off-by: Igor Fedotov <ifedotov@suse.com>

commit | commitdiff | tree

Ernesto Puerta [Mon, 9 Aug 2021 16:19:26 +0000 (18:19 +0200)]

Merge pull request #42697 from rhcs-dashboard/52082-cephadm-e2e-improv

mgr/dashboard: cephadm e2e start script: add --expanded option

Reviewed-by: Alfonso Martínez <almartin@redhat.com>
Reviewed-by: Avan Thakkar <athakkar@redhat.com>
Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>

commit | commitdiff | tree

Ernesto Puerta [Mon, 9 Aug 2021 16:07:10 +0000 (18:07 +0200)]

Merge pull request #42701 from ceph/fix-dashboard_sso_doc-master

mgr/dashboard: clarify SSO documentation

Reviewed-by: Alfonso Martínez <almartin@redhat.com>
Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>

commit | commitdiff | tree

Casey Bodley [Mon, 9 Aug 2021 15:51:36 +0000 (11:51 -0400)]

Merge pull request #42688 from cbodley/wip-52069

qa/rgw: update apache-maven mirror for rgw/hadoop-s3a

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>

commit | commitdiff | tree

Casey Bodley [Mon, 9 Aug 2021 15:51:21 +0000 (11:51 -0400)]

Merge pull request #42689 from cbodley/wip-52070

qa/rgw: barbican and pykmip tasks upgrade pip before installing pytz

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>

commit | commitdiff | tree

Sage Weil [Thu, 5 Aug 2021 21:57:58 +0000 (17:57 -0400)]

mgr/nfs: add --port to 'nfs cluster create' and port to 'nfs cluster info'

Fixes: https://tracker.ceph.com/issues/51787
Signed-off-by: Sage Weil <sage@newdream.net>

commit | commitdiff | tree

Ilya Dryomov [Mon, 9 Aug 2021 15:12:29 +0000 (17:12 +0200)]

Merge pull request #42581 from CongMinYin/perf

librbd/cache/pwl: fix buf_persist and add writeback_lat perf counters

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>

Unnamed repository; edit this file 'description' to name the repository.

RSS Atom