]> git.apps.os.sepia.ceph.com Git - ceph.git/log
ceph.git
3 years agomgr/dashboard: fix grafonnet build error 42766/head
Aashish Sharma [Thu, 12 Aug 2021 08:58:23 +0000 (14:28 +0530)]
mgr/dashboard: fix grafonnet build error

This PR tends to fix the issue caused by #42194

Fixes:https://tracker.ceph.com/issues/52238
Signed-off-by: Aashish Sharma <aasharma@redhat.com>
3 years agoMerge PR #42759 into master
Sage Weil [Wed, 11 Aug 2021 19:28:40 +0000 (15:28 -0400)]
Merge PR #42759 into master

* refs/pull/42759/head:
doc/mgr/nfs: add section on updating an nfs cluster

Reviewed-by: Varsha Rao <varao@redhat.com>
3 years agoMerge pull request #39871 from benhanokh/no_column_b
Neha Ojha [Wed, 11 Aug 2021 18:29:12 +0000 (11:29 -0700)]
Merge pull request #39871 from benhanokh/no_column_b

BlueStore: Remove Allocations from RocksDB

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Adam Kupczyk <akupczyk@redhat.com>
3 years agoMerge pull request #42122 from yehudasa/wip-rgw-realm-resolve
Casey Bodley [Wed, 11 Aug 2021 17:36:10 +0000 (13:36 -0400)]
Merge pull request #42122 from yehudasa/wip-rgw-realm-resolve

rgw: modify realm entities init resolution

Reviewed-by: Casey Bodley <cbodley@redhat.com>
3 years agodoc/mgr/nfs: add section on updating an nfs cluster 42759/head
Sage Weil [Wed, 11 Aug 2021 16:31:21 +0000 (11:31 -0500)]
doc/mgr/nfs: add section on updating an nfs cluster

Signed-off-by: Sage Weil <sage@newdream.net>
3 years agoMerge pull request #42194 from rhcs-dashboard/add-grafonnet-grafana
Ernesto Puerta [Wed, 11 Aug 2021 16:11:59 +0000 (18:11 +0200)]
Merge pull request #42194 from rhcs-dashboard/add-grafonnet-grafana

mgr/dashboard: monitoring: replace Grafana JSON with Grafonnet based code

3 years agoMerge PR #42252 into master
Sage Weil [Wed, 11 Aug 2021 15:28:28 +0000 (11:28 -0400)]
Merge PR #42252 into master

* refs/pull/42252/head:
mgr/dashboard: set rgw credentials: fix api tests
mgr/dashboard: run-frontend-e2e-tests.sh: remove unneeded rgw setting
mgr/dashboard: rgw service creation form: add realm and zone to service spec.
mgr/dashboard: connect-rgw: rename to set-rgw-credentials; refactoring
mgr/dashboard: connect-rgw: adaptation and test coverage
mgr/cephadm: re-check dashboard <-> rgw creds when rgw daemons created/destroyed
mgr/dashboard: add 'dashboard connect-rgw' command
doc/mgr/dashboard: simplify dashboard+rgw config docs

Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Juan Miguel Olmo <jolmomar@redhat.com>
3 years agoMerge PR #42682 into master
Sage Weil [Wed, 11 Aug 2021 14:58:39 +0000 (10:58 -0400)]
Merge PR #42682 into master

* refs/pull/42682/head:
cephadm: no need to explicitly enable prometheus module
mgr/cephadm: enable prometheus module before deploying prometheus
mgr/cephadm: drop daemon_id arg to CephadmService.config()
doc/cephadm: no need to manually enable the prometheus module

Reviewed-by: Sebastian Wagner <sewagner@redhat.com>
3 years agomgr/dashboard: monitoring: replace Grafana JSON with Grafonnet based Code 42194/head
Aashish Sharma [Tue, 6 Jul 2021 11:02:20 +0000 (16:32 +0530)]
mgr/dashboard: monitoring: replace Grafana JSON with Grafonnet based Code

This PR intends to add grafonnet to generate grafana JSON files

Fixes: https://tracker.ceph.com/issues/45184
Signed-off-by: Aashish Sharma <aasharma@redhat.com>
3 years ago[BlueStore]: [Remove Allocations from RocksDB] 39871/head
Gabriel BenHanokh [Thu, 14 Jan 2021 06:59:35 +0000 (08:59 +0200)]
[BlueStore]: [Remove Allocations from RocksDB]

Currently BlueStore keeps its allocation info inside RocksDB.
BlueStore is committing all allocation information (alloc/release) into RocksDB (column-family B) before the client Write is performed causing a delay in write path and adding significant load to the CPU/Memory/Disk.
Committing all state into RocksDB allows Ceph to survive failures without losing the allocation state.

The new code skips the RocksDB updates on allocation time and instead perform a full desatge of the allocator object with all the OSD allocation state in a single step during umount().
This results with an 25% increase in IOPS and reduced latency in small random-write workloads, but exposes the system to losing allocation info in failure cases where we don't call umount.
We added code to perform a full allocation-map rebuild from information stored inside the ONode which is used in failure cases.
When we perform a graceful shutdown there is no need for recovery and we simply read the allocation-map from a flat file where the allocation-map was stored during umount() (in fact this mode is faster and shaves few seconds from boot time since reading a flat file is faster than iterating over RocksDB)

Open Issues:

There is a bug in the src/stop.sh script killing ceph without invoking umount() which means anyone using it will always invoke the recovery path.
Adam Kupczyk is fixing this issue in a separate PR.
A simple workaround is to add a call to 'killall -15 ceph-osd' before calling src/stop.sh

Fast-Shutdown and Ceph Suicide (done when the system underperforms) stop the system without a proper drain and a call to umount.
This will trigger a full recovery which can be long( 3 minutes in my testing, but your your mileage may vary).
We plan on adding a follow up PR doing the following in Fast-Shutdown and Ceph Suicide:

Block the OSD queues from accepting any new request
Delete all items in queue which we didn't start yet
Drain all in-flight tasks
call umount (and destage the allocation-map)
If drain didn't complete within a predefined time-limit (say 3 minutes) -> kill the OSD
Signed-off-by: Gabriel Benhanokh <gbenhano@redhat.com>
create allocator from on-disk onodes and BlueFS inodes
change allocator + add stat counters + report illegal physical-extents
compare allocator after rebuild from ONodes
prevent collection from being open twice
removed FSCK repo check for null-fm
Bug-Fix: don't add BlueFS allocation to shared allocator
add configuration option to commit to No-Column-B
Only invalidate allocation file after opening rocksdb in read-write mode
fix tests not to expect failure in cases unapplicable to null-allocator
accept non-existing allocation file and don't fail the invaladtion as it could happen legally
don't commit to null-fm when db is opened in repair-mode
add a reverse mechanism from null_fm to real_fm (using RocksDB)
Using Ceph encode/decode, adding more info to header/trailer, add crc protection
Code cleanup

some changes requested by Adam (cleanup and style changes)

Signed-off-by: Gabriel Benhanokh <gbenhano@redhat.com>
3 years agocephadm: no need to explicitly enable prometheus module 42682/head
Sage Weil [Thu, 5 Aug 2021 14:31:09 +0000 (10:31 -0400)]
cephadm: no need to explicitly enable prometheus module

Signed-off-by: Sage Weil <sage@newdream.net>
3 years agomgr/cephadm: enable prometheus module before deploying prometheus
Sage Weil [Thu, 5 Aug 2021 14:24:13 +0000 (10:24 -0400)]
mgr/cephadm: enable prometheus module before deploying prometheus

The mon will restart the mgr when the module is enabled, so we don't
really have to do anything here.  The raise is there just in case the
mgr doesn't immediately get the new mgrmap and respawn, although there is
likely no harm done if we continue to deploy prometheus in the meantime,
even if we're interrupted partway through.

Signed-off-by: Sage Weil <sage@newdream.net>
3 years agomgr/cephadm: drop daemon_id arg to CephadmService.config()
Sage Weil [Thu, 5 Aug 2021 14:17:40 +0000 (10:17 -0400)]
mgr/cephadm: drop daemon_id arg to CephadmService.config()

Unused (and nonsensical since this is *service* config).

Signed-off-by: Sage Weil <sage@newdream.net>
3 years agodoc/cephadm: no need to manually enable the prometheus module
Sage Weil [Thu, 5 Aug 2021 14:24:46 +0000 (10:24 -0400)]
doc/cephadm: no need to manually enable the prometheus module

Signed-off-by: Sage Weil <sage@newdream.net>
3 years agoMerge pull request #42748 from tchaikov/wip-crimson-cleanup
Kefu Chai [Wed, 11 Aug 2021 12:30:16 +0000 (20:30 +0800)]
Merge pull request #42748 from tchaikov/wip-crimson-cleanup

crimson/tools/store_nbd: do not capture unused variable

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
3 years agoMerge pull request #42746 from tchaikov/wip-boost-j
Kefu Chai [Wed, 11 Aug 2021 11:22:57 +0000 (19:22 +0800)]
Merge pull request #42746 from tchaikov/wip-boost-j

do_cmake:sh: do not set BOOST_J

Reviewed-by: Deepika Upadhyay <dupadhya@redhat.com>
Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
3 years agoMerge pull request #42743 from tchaikov/wip-install-dep
Kefu Chai [Wed, 11 Aug 2021 11:22:28 +0000 (19:22 +0800)]
Merge pull request #42743 from tchaikov/wip-install-dep

install-deps.sh: retry if dpkg was interrupted

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
3 years agocrimson/tools/store_nbd: do not capture unused variable 42748/head
Kefu Chai [Wed, 11 Aug 2021 10:33:41 +0000 (18:33 +0800)]
crimson/tools/store_nbd: do not capture unused variable

Signed-off-by: Kefu Chai <kchai@redhat.com>
3 years agoscript/run-make.sh: retry if dpkg was interrupted 42743/head
Kefu Chai [Wed, 11 Aug 2021 08:29:10 +0000 (16:29 +0800)]
script/run-make.sh: retry if dpkg was interrupted

there is chance that apt-get is interrupted in the middle when a new PR
cancels the running jenkins job, the next job running apt-get or dpkg
would run into issues like:

E: dpkg was interrupted, you must manually run 'sudo dpkg --configure -a' to correct the problem.
Build step 'Execute shell' marked build as failure

Signed-off-by: Kefu Chai <kchai@redhat.com>
3 years agodo_cmake:sh: do not set BOOST_J 42746/head
Kefu Chai [Wed, 11 Aug 2021 09:32:20 +0000 (17:32 +0800)]
do_cmake:sh: do not set BOOST_J

do_cmake.sh is called by src/script/run-make.sh in configure() function,
in src/script/run-make.sh, BOOST_J is also set if it is not set. so we
can drop the code setting BOOST_J in do_cmake.sh.

this helps to silence the cmake warning like:

CMake Warning:
  Manually-specified variables were not used by the project:

    BOOST_J

Signed-off-by: Kefu Chai <kchai@redhat.com>
3 years agomgr/dashboard: set rgw credentials: fix api tests 42252/head
Alfonso Martínez [Wed, 11 Aug 2021 06:59:13 +0000 (08:59 +0200)]
mgr/dashboard: set rgw credentials: fix api tests

Fixes: https://tracker.ceph.com/issues/44605
Signed-off-by: Alfonso Martínez <almartin@redhat.com>
3 years agoMerge PR #42318 into master
Sage Weil [Tue, 10 Aug 2021 20:47:34 +0000 (16:47 -0400)]
Merge PR #42318 into master

* refs/pull/42318/head:
mgr/rook: update DefaultFetcher device path to look at local and fix bug
mgr/rook: add node and PV name information to Device in DefaultFetcher
mgr/rook: fix typing errors in Fetcher classes
mgr/rook: create and use DefaultFetcher and LSOFetcher classes
mgr/rook: create KubernetesCustomResource class to fetch CRs
mgr/rook: fix device ls error handling
mgr/rook: change storage class module option name and default value
mgr/rook: fix typing errors related to storage_class_name and device ls
mgr/rook: make `device ls` only display pvs in specified storage class
mgr/rook: add StorageV1Api and storage_class_name to RookCluster
mgr/rook: add StorageV1Api to RookOrchestrator
mgr/rook: add mgr/rook/storage_class_name to ceph config
mgr/rook: ceph orch device ls fetch and display info about PVs
mgr/rook: add CustomObjectsApi to RookCluster

Reviewed-by: Juan Miguel Olmo <jolmomar@redhat.com>
3 years agoMerge PR #42613 into master
Sage Weil [Tue, 10 Aug 2021 20:47:21 +0000 (16:47 -0400)]
Merge PR #42613 into master

* refs/pull/42613/head:
qa/suites/roch/rook/smoke: test rook 1.7.0, not 1.6.2
qa/tasks/rook: set storage_class to scratch

Reviewed-by: merge 42318
3 years agoMerge PR #42691 into master
Sage Weil [Tue, 10 Aug 2021 20:37:38 +0000 (16:37 -0400)]
Merge PR #42691 into master

* refs/pull/42691/head:
mgr/nfs: add --port to 'nfs cluster create' and port to 'nfs cluster info'
qa/suites/orch/cephadm/smoke-roleless: test taking ganeshas offline
qa/tasks/vip: exec with bash -ex
qa/suites/orch/cephadm: separate test_nfs from test_orch_cli

Reviewed-by: Varsha Rao <varao@redhat.com>
3 years agoMerge pull request #42574 from soumyakoduri/wip-skoduri-dbstore-fixes
Daniel Gryniewicz [Tue, 10 Aug 2021 17:49:17 +0000 (13:49 -0400)]
Merge pull request #42574 from soumyakoduri/wip-skoduri-dbstore-fixes

rgw/dbstore: Fix DBstore build conflicts

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>
3 years agoMerge pull request #42729 from cyx1231st/wip-seastore-interruptive-future
Kefu Chai [Tue, 10 Aug 2021 16:45:00 +0000 (00:45 +0800)]
Merge pull request #42729 from cyx1231st/wip-seastore-interruptive-future

crimson/os/seastore: wrap up interruptive-futures in seastore

Reviewed-by: Samuel Just <sjust@redhat.com>
3 years agoMerge pull request #42733 from rzarzynski/wip-script-run-cbt-with-cyanstore
Kefu Chai [Tue, 10 Aug 2021 15:29:42 +0000 (23:29 +0800)]
Merge pull request #42733 from rzarzynski/wip-script-run-cbt-with-cyanstore

script: run-cbt.sh tests crimson with CyanStore instead of MemStore.

Reviewed-by: Kefu Chai <kchai@redhat.com>
3 years agoMerge pull request #42525 from zdover23/wip-doc-rados-config-storage-devices-front...
Josh Durgin [Tue, 10 Aug 2021 15:11:35 +0000 (08:11 -0700)]
Merge pull request #42525 from zdover23/wip-doc-rados-config-storage-devices-front-matter-2021-07-28

doc/rados: rewrite storage device front matter

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
3 years agoMerge PR #42680 into master
Sage Weil [Tue, 10 Aug 2021 14:36:37 +0000 (10:36 -0400)]
Merge PR #42680 into master

* refs/pull/42680/head:
src/pybind/mgr/nfs/tests: pass cluster_id to from_export_block()
src/pybind/mgr/nfs: remove `tag` option
src/pybind/mgr/nfs: remove per daemon config test
src/pybind/mgr/nfs: directly use cluster_id and remove daemon related stuff

Reviewed-by: Sage Weil <sage@redhat.com>
3 years agoMerge PR #42627 into master
Sage Weil [Tue, 10 Aug 2021 14:35:59 +0000 (10:35 -0400)]
Merge PR #42627 into master

* refs/pull/42627/head:
.github/labeler: add nfs dev doc
doc/dev/vstart-ganesha: update about RGW export
src/vstart: update ganesha pid dir location
src/vstart: create rgw export for nfs

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Sage Weil <sage@redhat.com>
3 years agoscript: run-cbt.sh tests crimson with CyanStore instead of MemStore. 42733/head
Radoslaw Zarzynski [Tue, 10 Aug 2021 12:34:45 +0000 (12:34 +0000)]
script: run-cbt.sh tests crimson with CyanStore instead of MemStore.

These tests were always supposed to run against CyanStore. However,
commit e6ed65db8b4e0a2f8026c2e35a12dd292c5f2b8c (PR #42437) changed
the meaning of `--memstore` and introduced `--cyanstore` to be used
instead. This commit makes `run-cbt.sh` aware about the new switch.

Signed-off-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
3 years agodoc/rados: rewrite storage device front matter 42525/head
Zac Dover [Wed, 28 Jul 2021 14:37:46 +0000 (00:37 +1000)]
doc/rados: rewrite storage device front matter

This PR updates the text in the RADOS Guide
(the Ceph Storage Cluster Guide) that appears
at the beginning of the "Storage Devices"
chapter. I did the following:

- rewrote some of the sentences so that
  they read more like written text than like
  spoken language
- added "Ceph Manager" to the list of daemons
  that a Ceph cluster comprises
- that's about it.

Signed-off-by: Zac Dover <zac.dover@gmail.com>
3 years agomgr/dashboard: run-frontend-e2e-tests.sh: remove unneeded rgw setting
Alfonso Martínez [Mon, 9 Aug 2021 11:14:20 +0000 (13:14 +0200)]
mgr/dashboard: run-frontend-e2e-tests.sh: remove unneeded rgw setting

Fixes: https://tracker.ceph.com/issues/44605
Signed-off-by: Alfonso Martínez <almartin@redhat.com>
3 years agomgr/dashboard: rgw service creation form: add realm and zone to service spec.
Alfonso Martínez [Mon, 9 Aug 2021 10:12:52 +0000 (12:12 +0200)]
mgr/dashboard: rgw service creation form: add realm and zone to service spec.

Align rgw service id pattern with cephadm: https://github.com/ceph/ceph/pull/39877
  - Update rgw pattern to allow service id for non-multisite config.
  - Extract realm and zone from service id (when detected) and add them to the service spec.

Fixes: https://tracker.ceph.com/issues/44605
Signed-off-by: Alfonso Martínez <almartin@redhat.com>
3 years agomgr/dashboard: connect-rgw: rename to set-rgw-credentials; refactoring
Alfonso Martínez [Fri, 6 Aug 2021 06:57:47 +0000 (08:57 +0200)]
mgr/dashboard: connect-rgw: rename to set-rgw-credentials; refactoring

- Rename the dashboard command to better reflect its behavior.
- Rename '_radosgw_admin' method to 'send_rgwadmin_command' for consistency with
  'send_mon_command' and move it to the mgr_module.py .
- Cleanup: remove unneeded rgw settings.
- Better error handling and test coverage.

Fixes: https://tracker.ceph.com/issues/44605
Signed-off-by: Alfonso Martínez <almartin@redhat.com>
3 years agomgr/dashboard: connect-rgw: adaptation and test coverage
Alfonso Martínez [Wed, 28 Jul 2021 07:48:18 +0000 (09:48 +0200)]
mgr/dashboard: connect-rgw: adaptation and test coverage

- Align Dashboard with cephadm: configure credentials using the same logic.
- Fix: create a 'dashboard' user per realm (before: only on 1st realm).
- Lint fixes, test coverage, method renaming to better reflect behavior and method visibility.

Fixes: https://tracker.ceph.com/issues/44605
Signed-off-by: Alfonso Martínez <almartin@redhat.com>
3 years agomgr/cephadm: re-check dashboard <-> rgw creds when rgw daemons created/destroyed
Sage Weil [Thu, 8 Jul 2021 17:22:59 +0000 (13:22 -0400)]
mgr/cephadm: re-check dashboard <-> rgw creds when rgw daemons created/destroyed

We don't always know when a realm is created/destroyed, but we can use
service config and purge to cover most such cases.

Signed-off-by: Sage Weil <sage@newdream.net>
3 years agomgr/dashboard: add 'dashboard connect-rgw' command
Sage Weil [Thu, 8 Jul 2021 17:10:23 +0000 (13:10 -0400)]
mgr/dashboard: add 'dashboard connect-rgw' command

Signed-off-by: Sage Weil <sage@newdream.net>
3 years agodoc/mgr/dashboard: simplify dashboard+rgw config docs
Sage Weil [Thu, 8 Jul 2021 20:19:42 +0000 (16:19 -0400)]
doc/mgr/dashboard: simplify dashboard+rgw config docs

Signed-off-by: Sage Weil <sage@newdream.net>
3 years agoMerge pull request #42262 from wjwithagen/wjw-fix-d3n-cache
Mark Kogan [Tue, 10 Aug 2021 11:08:44 +0000 (14:08 +0300)]
Merge pull request #42262 from wjwithagen/wjw-fix-d3n-cache

rgw: conditionalize d3n_datacache use of libAIO

3 years agoMerge pull request #42728 from cyx1231st/wip-crimson-msgr-fix-compression
Kefu Chai [Tue, 10 Aug 2021 09:50:15 +0000 (17:50 +0800)]
Merge pull request #42728 from cyx1231st/wip-crimson-msgr-fix-compression

crimson/net/ProtocolV2: disable COMPRESSION in crimson msgr features

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Chunmei Liu <chunmei.liu@intel.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
3 years agocrimson/net/ProtocolV2: disable COMPRESSION in crimson msgr features 42728/head
Yingxin Cheng [Tue, 10 Aug 2021 01:40:56 +0000 (09:40 +0800)]
crimson/net/ProtocolV2: disable COMPRESSION in crimson msgr features

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>
3 years agoMerge pull request #42672 from aclamk/wip-bluestore-tool-conf
Kefu Chai [Tue, 10 Aug 2021 04:16:19 +0000 (12:16 +0800)]
Merge pull request #42672 from aclamk/wip-bluestore-tool-conf

tools/ceph-bluestore-tool: Enable configuration options from monitor/ceph.conf

Reviewed-by: Igor Fedotov <ifedotov@suse.com>
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
3 years agocrimson/os/seastore: drop retired_extent_gate_t 42729/head
Yingxin Cheng [Fri, 6 Aug 2021 06:51:50 +0000 (14:51 +0800)]
crimson/os/seastore: drop retired_extent_gate_t

retired_extent_gate_t is no longer needed after integrating
interruptive-future.

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>
3 years agocrimson/os/seatore: drop the temporary InterruptedTransactionManager
Yingxin Cheng [Fri, 6 Aug 2021 06:14:23 +0000 (14:14 +0800)]
crimson/os/seatore: drop the temporary InterruptedTransactionManager

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>
3 years agocrimson/os/seastore: wrap up interruptive-futures in seastore
Yingxin Cheng [Thu, 5 Aug 2021 07:40:43 +0000 (15:40 +0800)]
crimson/os/seastore: wrap up interruptive-futures in seastore

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>
3 years agocrimson/os/seastore: convert ObjectDataHandler with interruptible-future
Yingxin Cheng [Tue, 3 Aug 2021 08:41:26 +0000 (16:41 +0800)]
crimson/os/seastore: convert ObjectDataHandler with interruptible-future

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>
3 years agoMerge PR #42535 into master
Sage Weil [Mon, 9 Aug 2021 23:33:52 +0000 (19:33 -0400)]
Merge PR #42535 into master

* refs/pull/42535/head:
mgr/cephadm: quay.io for non-ceph images too
mgr/cephadm: DEFAULT_IMAGE from quay, not docker
doc/install/containers: quay.io!

Reviewed-by: Sebastian Wagner <sewagner@redhat.com>
Reviewed-by: Dimitri Savineau <dsavinea@redhat.com>
3 years agoMerge PR #42726 into master
Sage Weil [Mon, 9 Aug 2021 23:33:41 +0000 (19:33 -0400)]
Merge PR #42726 into master

* refs/pull/42726/head:
cephadm: fix container name detection

Reviewed-by: Adam King <adking@redhat.com>
3 years agoMerge pull request #42568 from kamoltat/wip-autoscaler-profile-status-print
Neha Ojha [Mon, 9 Aug 2021 22:11:19 +0000 (15:11 -0700)]
Merge pull request #42568 from kamoltat/wip-autoscaler-profile-status-print

pybind/mgr/pg_autoscaler: Added PROFILE to autoscale-status

Reviewed-by: Neha Ojha <nojha@redhat.com>
3 years agoMerge pull request #42469 from BlaineEXE/ceph-volume-work-around-atari-partitions
Guillaume Abrioux [Mon, 9 Aug 2021 20:32:43 +0000 (22:32 +0200)]
Merge pull request #42469 from BlaineEXE/ceph-volume-work-around-atari-partitions

ceph-volume: work around phantom atari partitions

3 years agoMerge PR #42709 into master
Sage Weil [Mon, 9 Aug 2021 19:23:11 +0000 (15:23 -0400)]
Merge PR #42709 into master

* refs/pull/42709/head:
qa/tasks/kubeadm: force docker cgroup engine to systemd

Reviewed-by: Travis Nielsen <tnielsen@redhat.com>
3 years agoMerge pull request #42722 from neha-ojha/wip-remove-rgw-perf
Neha Ojha [Mon, 9 Aug 2021 18:42:41 +0000 (11:42 -0700)]
Merge pull request #42722 from neha-ojha/wip-remove-rgw-perf

qa/suites/rados/perf/ceph.yaml: remove rgw

Reviewed-by: Sridhar Seshasayee <sseshasa@redhat.com>
3 years agocephadm: fix container name detection 42726/head
Sage Weil [Mon, 9 Aug 2021 18:15:28 +0000 (14:15 -0400)]
cephadm: fix container name detection

'enter' was broken because we weren't correctly identifying the container
name.  Strip the newline from the inspect result so that we can reliably
match against the 'running' state.

Signed-off-by: Sage Weil <sage@newdream.net>
3 years agoqa/suites/perf-basic/ceph.yaml: remove rgw 42722/head
Neha Ojha [Mon, 9 Aug 2021 17:31:50 +0000 (17:31 +0000)]
qa/suites/perf-basic/ceph.yaml: remove rgw

This is no longer required because we removed cosbench workloads in
fd350fd0150a2d4072f055658c20314a435a19ba.

Signed-off-by: Neha Ojha <nojha@redhat.com>
3 years agoMerge pull request #42697 from rhcs-dashboard/52082-cephadm-e2e-improv
Ernesto Puerta [Mon, 9 Aug 2021 16:19:26 +0000 (18:19 +0200)]
Merge pull request #42697 from rhcs-dashboard/52082-cephadm-e2e-improv

mgr/dashboard: cephadm e2e start script: add --expanded option

Reviewed-by: Alfonso Martínez <almartin@redhat.com>
Reviewed-by: Avan Thakkar <athakkar@redhat.com>
Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>
3 years agoMerge pull request #42701 from ceph/fix-dashboard_sso_doc-master
Ernesto Puerta [Mon, 9 Aug 2021 16:07:10 +0000 (18:07 +0200)]
Merge pull request #42701 from ceph/fix-dashboard_sso_doc-master

mgr/dashboard: clarify SSO documentation

Reviewed-by: Alfonso Martínez <almartin@redhat.com>
Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>
3 years agoMerge pull request #42688 from cbodley/wip-52069
Casey Bodley [Mon, 9 Aug 2021 15:51:36 +0000 (11:51 -0400)]
Merge pull request #42688 from cbodley/wip-52069

qa/rgw: update apache-maven mirror for rgw/hadoop-s3a

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>
3 years agoMerge pull request #42689 from cbodley/wip-52070
Casey Bodley [Mon, 9 Aug 2021 15:51:21 +0000 (11:51 -0400)]
Merge pull request #42689 from cbodley/wip-52070

qa/rgw: barbican and pykmip tasks upgrade pip before installing pytz

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>
3 years agomgr/nfs: add --port to 'nfs cluster create' and port to 'nfs cluster info' 42691/head
Sage Weil [Thu, 5 Aug 2021 21:57:58 +0000 (17:57 -0400)]
mgr/nfs: add --port to 'nfs cluster create' and port to 'nfs cluster info'

Fixes: https://tracker.ceph.com/issues/51787
Signed-off-by: Sage Weil <sage@newdream.net>
3 years agoMerge pull request #42581 from CongMinYin/perf
Ilya Dryomov [Mon, 9 Aug 2021 15:12:29 +0000 (17:12 +0200)]
Merge pull request #42581 from CongMinYin/perf

librbd/cache/pwl: fix buf_persist and add writeback_lat perf counters

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
3 years agoqa/suites/rados/perf/ceph.yaml: remove rgw
Neha Ojha [Mon, 9 Aug 2021 14:35:01 +0000 (14:35 +0000)]
qa/suites/rados/perf/ceph.yaml: remove rgw

This is no longer required because we removed cosbench workloads in
fd350fd0150a2d4072f055658c20314a435a19ba. This is also required to prevent
failures like the following or any other changes that break the rgw task:

```
2021-08-06T20:13:25.812 INFO:teuthology.orchestra.run.smithi060.stderr:curl: (7) Failed to connect to smithi060.front.sepia.ceph.com port 80: Connection refused
2021-08-06T20:15:33.813 ERROR:teuthology.contextutil:Saw exception from nested tasks
Traceback (most recent call last):
  File "/home/teuthworker/src/git.ceph.com_git_teuthology_04c2febe7099917d97a71271f17abb5710030132/teuthology/contextutil.py", line 31, in nested
    vars.append(enter())
  File "/usr/lib/python3.6/contextlib.py", line 81, in __enter__
    return next(self.gen)
  File "/home/teuthworker/src/github.com_ceph_ceph-c_3c0f8c8164075af7aac4d1f2805d3f4580709461/qa/tasks/rgw.py", line 191, in start_rgw
    wait_for_radosgw(url, remote)
  File "/home/teuthworker/src/github.com_ceph_ceph-c_3c0f8c8164075af7aac4d1f2805d3f4580709461/qa/tasks/util/rgw.py", line 94, in wait_for_radosgw
    assert exit_status == 0
AssertionError
```

Signed-off-by: Neha Ojha <nojha@redhat.com>
3 years agoqa/suites/orch/cephadm/smoke-roleless: test taking ganeshas offline
Sage Weil [Thu, 5 Aug 2021 18:41:20 +0000 (14:41 -0400)]
qa/suites/orch/cephadm/smoke-roleless: test taking ganeshas offline

Signed-off-by: Sage Weil <sage@newdream.net>
3 years agoMerge pull request #42718 from mkogan1/rgw-fix-vstart-X
Casey Bodley [Mon, 9 Aug 2021 14:19:15 +0000 (10:19 -0400)]
Merge pull request #42718 from mkogan1/rgw-fix-vstart-X

rgw: fix vstart -X after default ms_mon_client_mode = secure

Reviewed-by: Casey Bodley <cbodley@redhat.com>
3 years agodoc/rados/operations/placement-groups: added bias + profile 42568/head
Kamoltat [Fri, 6 Aug 2021 04:23:29 +0000 (04:23 +0000)]
doc/rados/operations/placement-groups: added bias + profile

Added documentations on the autoscale profile and bias

Signed-off-by: Kamoltat <ksirivad@redhat.com>
3 years agoMerge pull request #42715 from tchaikov/wip-crimson-tls-int-cond
Kefu Chai [Mon, 9 Aug 2021 13:31:15 +0000 (21:31 +0800)]
Merge pull request #42715 from tchaikov/wip-crimson-tls-int-cond

crimson/common: instantiate interrupt_cond in .cc

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
3 years agomgr/dashboard: cephadm e2e start script: add --expanded option 42697/head
Alfonso Martínez [Mon, 9 Aug 2021 13:14:21 +0000 (15:14 +0200)]
mgr/dashboard: cephadm e2e start script: add --expanded option

- Additional improvements: clean npm cache in jenkins env.; display mgr logs on error.

Fixes: https://tracker.ceph.com/issues/52082
Signed-off-by: Alfonso Martínez <almartin@redhat.com>
3 years agoMerge pull request #42662 from trociny/wip-52063
Ilya Dryomov [Mon, 9 Aug 2021 12:58:25 +0000 (14:58 +0200)]
Merge pull request #42662 from trociny/wip-52063

rbd-mirror: fix potential async op tracker leak in start_image_replayers

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
3 years agoMerge pull request #42708 from cbodley/wip-rgw-meta-sync-behind-after-trim
Casey Bodley [Mon, 9 Aug 2021 12:55:57 +0000 (08:55 -0400)]
Merge pull request #42708 from cbodley/wip-rgw-meta-sync-behind-after-trim

radosgw-admin: 'sync status' is not behind if there are no mdlog entries

Reviewed-by: Adam C. Emerson <aemerson@redhat.com>
3 years agocrimson: build without "using namespace std" 42715/head
Kefu Chai [Mon, 9 Aug 2021 11:12:05 +0000 (19:12 +0800)]
crimson: build without "using namespace std"

* add "std::" prefix in headers
* add "using" declarations in .cc files.

so we don't reply on "using namespace std" in one or more included
headers.

Signed-off-by: Kefu Chai <kchai@redhat.com>
3 years agorgw: fix vstart -X after default ms_mon_client_mode = secure 42718/head
Mark Kogan [Mon, 9 Aug 2021 09:53:23 +0000 (09:53 +0000)]
rgw: fix vstart -X after default ms_mon_client_mode = secure

after merging of PR #42587 rgw is unable to start when cephx is disabled using vstart.sh -X

Fixes: https://tracker.ceph.com/issues/52103
Signed-off-by: Mark Kogan <mkogan@redhat.com>
3 years agoMerge pull request #42611 from s0nea/wip-doc-transifex-resource
Ernesto Puerta [Mon, 9 Aug 2021 08:59:32 +0000 (10:59 +0200)]
Merge pull request #42611 from s0nea/wip-doc-transifex-resource

doc/dev: Dashboard translations release update

Reviewed-by: aaryanporwal <NOT@FOUND>
Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>
Reviewed-by: Tatjana Dehler <tdehler@suse.com>
3 years agocrimson/common: instantiate interrupt_cond in .cc
Kefu Chai [Sun, 8 Aug 2021 17:21:38 +0000 (01:21 +0800)]
crimson/common: instantiate interrupt_cond in .cc

so we can explicitly instantiate it.

this should address the segfault when accessing interrupt_cond when
it is defined as a plain thread local storage template variable in the
header file.

it seems Clang is not able to identify the access to TLS variable and
the value of %fs segment register of the main thread is always zero if
interrupt_cond is defined as a plain global variable stored in
thread local storage.

Signed-off-by: Kefu Chai <kchai@redhat.com>
3 years agotest/crimson: drop copy assignment operator
Kefu Chai [Sun, 8 Aug 2021 13:41:34 +0000 (21:41 +0800)]
test/crimson: drop copy assignment operator

because of the const reference member variable of `parent`:

const delta_overlay_t &parent;

we are not allowed to mutate it. so the explicitly defaulted copy assignment
operator is implicitly deleted

this change silences the warnings from Clang like:

 ../src/test/crimson/seastore/test_transaction_manager.cc:159:12: warning: explicitly defaulted copy assignment operator is implicitly deleted [-Wdefaulted-function-deleted]
         iterator &operator=(const iterator &) = default;
                   ^
 ../src/test/crimson/seastore/test_transaction_manager.cc:98:25: note: copy assignment operator of 'iterator' is implicitly deleted because field 'parent' is of reference type 'const transaction_manager_$
         const delta_overlay_t &parent;
                                ^

Signed-off-by: Kefu Chai <kchai@redhat.com>
3 years agoMerge pull request #42711 from tchaikov/wip-seastar
Kefu Chai [Sun, 8 Aug 2021 13:24:33 +0000 (21:24 +0800)]
Merge pull request #42711 from tchaikov/wip-seastar

seastar: pick up change adding ioctl and fcntl support

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
3 years agoseastar: pick up change adding ioctl and fcntl support 42711/head
Kefu Chai [Sat, 7 Aug 2021 03:56:35 +0000 (11:56 +0800)]
seastar: pick up change adding ioctl and fcntl support

prepare for the better support of ZNS.

Signed-off-by: Kefu Chai <kchai@redhat.com>
3 years agoMerge pull request #42668 from tchaikov/wip-doc-ci
Kefu Chai [Sat, 7 Aug 2021 02:59:14 +0000 (10:59 +0800)]
Merge pull request #42668 from tchaikov/wip-doc-ci

doc/dev/continuous-integration: document chacra

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
3 years agodoc/dev/continuous-integration: document chacra 42668/head
Kefu Chai [Thu, 5 Aug 2021 08:21:31 +0000 (16:21 +0800)]
doc/dev/continuous-integration: document chacra

Signed-off-by: Kefu Chai <kchai@redhat.com>
3 years agolibrbd/cache/pwl/rwl: fix buf_persist and add writeback_lat perf counters 42581/head
Yin Congmin [Fri, 30 Jul 2021 05:55:28 +0000 (13:55 +0800)]
librbd/cache/pwl/rwl: fix buf_persist and add writeback_lat perf counters

initialize buf_persist_time, then change name buf_persist_time to
buf_persist_start_time, change flush to internal_flush. add
writeback_lat perf conters. update some print formats for perf.

Fixes: https://tracker.ceph.com/issues/52090
Signed-off-by: Yin Congmin <congmin.yin@intel.com>
3 years agoqa/suites/roch/rook/smoke: test rook 1.7.0, not 1.6.2 42613/head
Sage Weil [Fri, 6 Aug 2021 21:50:20 +0000 (17:50 -0400)]
qa/suites/roch/rook/smoke: test rook 1.7.0, not 1.6.2

The RBAC changes so that we can see the PVs etc are needed by mgr/rook.

Signed-off-by: Sage Weil <sage@newdream.net>
3 years agoMerge pull request #42149 from hualongfeng/fix_sync_point
Ilya Dryomov [Fri, 6 Aug 2021 21:09:44 +0000 (23:09 +0200)]
Merge pull request #42149 from hualongfeng/fix_sync_point

librbd/cache/pwl: avoid stack overflow caused by nested shared_ptr destruction

Reviewed-by: Jianpeng Ma <jianpeng.ma@intel.com>
Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
3 years agoradosgw-admin: 'sync status' is not behind if there are no mdlog entries 42708/head
Casey Bodley [Fri, 6 Aug 2021 19:14:26 +0000 (15:14 -0400)]
radosgw-admin: 'sync status' is not behind if there are no mdlog entries

if remote mdlogs are trimmed prematurely, sync status will report
that it's behind the remote's max-marker even if there are no mdlog
entries to sync

for each behind shard, we fetch the next mdlog entry from the remote. if
we get an empty listing, remove that shard from behind_shards. this
logic now has to run before we print "behind shards:" so that empty
shards aren't listed

Fixes: https://tracker.ceph.com/issues/52091
Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agolibrbd/cache/pwl: avoid stack overflow caused by nested shared_ptr destruction 42149/head
Hualong Feng [Fri, 2 Jul 2021 06:22:06 +0000 (14:22 +0800)]
librbd/cache/pwl: avoid stack overflow caused by nested shared_ptr destruction

Destruction of nested shared_ptr will cause stack overflow.
With the explicit assignment of nullptr, the deleted node
is completely disconnected from the current linked list

-------              *******               -------
|sync | <--earlier-- |sync | <--earlier-x- |sync |
|point| --later----> |point| --later----x> |point|
-------              *******               -------
   |                    |                     |
   V                    V                     V
-------              -------               -------
|log_ | ---next----> |log_ | ---next----x> |log_ |
|entry|              |entry|               |entry|
-------              -------               -------

earlier: earlier_sync_point
later:   later_sync_point
next:    next_sync_point_entry

Fixes: https://tracker.ceph.com/issues/51418
Signed-off-by: Feng Hualong <hualong.feng@intel.com>
3 years agoqa/tasks/kubeadm: force docker cgroup engine to systemd 42709/head
Sage Weil [Fri, 6 Aug 2021 19:21:08 +0000 (14:21 -0500)]
qa/tasks/kubeadm: force docker cgroup engine to systemd

Signed-off-by: Sage Weil <sage@newdream.net>
3 years agorgw, rgw-admin: add realm/zonegroup/zone id configurables 42122/head
Yehuda Sadeh [Tue, 29 Jun 2021 22:42:25 +0000 (15:42 -0700)]
rgw, rgw-admin: add realm/zonegroup/zone id configurables

Add the following configurables:
 - rgw_realm_id
 - rgw_zonegroup_id
 - rgw_zone_id

radosgw-admin already had --realm-id, --zonegroup-id, --zone-id config options,
however, there were no such config options for radosgw. Adding these new config
options so that we could start radosgw with specifying the appropriate entity id
(vs. the entity names that was required previously). These also apply for
radosgw-admin.
The old radosgw-admin params are still valid for backward compatibility.

Signed-off-by: Yehuda Sadeh <yehuda@redhat.com>
3 years agorgw-admin: don't init zonegroup on zone default command
Yehuda Sadeh [Tue, 29 Jun 2021 21:03:18 +0000 (14:03 -0700)]
rgw-admin: don't init zonegroup on zone default command

Signed-off-by: Yehuda Sadeh <yehuda@redhat.com>
3 years agorgw-admin: require explicit zone/zonegroup/realm param for removal ops
Yehuda Sadeh [Tue, 29 Jun 2021 20:54:00 +0000 (13:54 -0700)]
rgw-admin: require explicit zone/zonegroup/realm param for removal ops

Signed-off-by: Yehuda Sadeh <yehuda@redhat.com>
3 years agorgw-admin: try to resolve realm entities (raw storage ops)
Yehuda Sadeh [Fri, 25 Jun 2021 21:53:02 +0000 (14:53 -0700)]
rgw-admin: try to resolve realm entities (raw storage ops)

Signed-off-by: Yehuda Sadeh <yehuda@redhat.com>
3 years agorgw: refactor realm/zongegroup/zone resolving
Yehuda Sadeh [Fri, 25 Jun 2021 21:53:08 +0000 (14:53 -0700)]
rgw: refactor realm/zongegroup/zone resolving

Don't allow a zone to run under wrong zonegroup. Allow
starting with correct realm/period/zonegroup if only zone is provided,
and a realm exists for that zone.
Refactor svc.zone init and simplify logic complexity.

Signed-off-by: Yehuda Sadeh <yehuda@redhat.com>
3 years agoMerge pull request #42705 from tchaikov/wip-clang-fixes
Kefu Chai [Fri, 6 Aug 2021 18:05:30 +0000 (02:05 +0800)]
Merge pull request #42705 from tchaikov/wip-clang-fixes

crimson, test/crimson, msg: clang fixes

Reviewed-by: Samuel Just <sjust@redhat.com>
3 years agodoc,mgr/dashboard: clarify SSO documentation 42701/head
Ernesto Puerta [Fri, 6 Aug 2021 12:18:32 +0000 (14:18 +0200)]
doc,mgr/dashboard: clarify SSO documentation

Signed-off-by: Ernesto Puerta <epuertat@redhat.com>
3 years agoMerge pull request #42660 from zdover23/wip-doc-2021-08-04rados-config-storage-device...
Josh Durgin [Fri, 6 Aug 2021 16:59:48 +0000 (09:59 -0700)]
Merge pull request #42660 from zdover23/wip-doc-2021-08-04rados-config-storage-devices-filestore

doc/rados: rewrite "FileStore" section

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
3 years agocrimson/os/seastore: do not capture unused variable 42705/head
Kefu Chai [Fri, 6 Aug 2021 16:42:31 +0000 (00:42 +0800)]
crimson/os/seastore: do not capture unused variable

Signed-off-by: Kefu Chai <kchai@redhat.com>
3 years agotest/crimson/seastore: drop duplicated i++
Kefu Chai [Fri, 6 Aug 2021 16:39:11 +0000 (00:39 +0800)]
test/crimson/seastore: drop duplicated i++

for two reasons:

* to silence Clang warning
* it's very unusual to do this. and this is a bad smell.

Signed-off-by: Kefu Chai <kchai@redhat.com>
3 years agotest/crimson/seastore: capture labels by defining new variables
Kefu Chai [Fri, 6 Aug 2021 16:37:28 +0000 (00:37 +0800)]
test/crimson/seastore: capture labels by defining new variables

structured binding does not define variables, so we have to define
variables in capture list for capturing them.

Signed-off-by: Kefu Chai <kchai@redhat.com>
3 years agomsg/async/compression_onwire: include used header
Kefu Chai [Fri, 6 Aug 2021 16:36:39 +0000 (00:36 +0800)]
msg/async/compression_onwire: include used header

this fixes the FTBFS with Clang + libc++.

Signed-off-by: Kefu Chai <kchai@redhat.com>
3 years agoMerge pull request #42700 from tchaikov/wip-crimson-cleanup
Kefu Chai [Fri, 6 Aug 2021 15:44:35 +0000 (23:44 +0800)]
Merge pull request #42700 from tchaikov/wip-crimson-cleanup

crimson/os/seastore: do not capture unused variable

Reviewed-by: Samuel Just <sjust@redhat.com>
3 years agoMerge pull request #36517 from giladmaya/on_wire_compression
Neha Ojha [Fri, 6 Aug 2021 15:07:58 +0000 (08:07 -0700)]
Merge pull request #36517 from giladmaya/on_wire_compression

msgr: added support in on-wire compression for osd-osd communication

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
3 years agoMerge pull request #42686 from dsavineau/cephadm_almalinux_distro
Sebastian Wagner [Fri, 6 Aug 2021 14:51:32 +0000 (16:51 +0200)]
Merge pull request #42686 from dsavineau/cephadm_almalinux_distro

cephadm: add AlmaLinux to supported distro

Reviewed-by: Sebastian Wagner <sewagner@redhat.com>
3 years ago.github/labeler: add nfs dev doc 42627/head
Varsha Rao [Wed, 4 Aug 2021 11:03:43 +0000 (16:33 +0530)]
.github/labeler: add nfs dev doc

Signed-off-by: Varsha Rao <varao@redhat.com>