git.apps.os.sepia.ceph.com Git - ceph.git/log


commit | commitdiff | tree

Shyamsundar Ranganathan [Sun, 5 Jul 2020 23:17:02 +0000 (19:17 -0400)]

mgr/volumes: Introduce v2 subvolumes

Implements subvolume v2 version.

The following support is added:
- Ability to retain snapshots on subvolume deletion
- Change the directory where snapshots are created to the subvolume
- "features" field in the subvolume info output, specifically the ability
for a subvolume to retain snapshots
- Current state of the subvolume in the info output
- Auto upgrade to v2 from eligible v1 subvolumes
- Adjust other functions as needed to support the changes

Signed-off-by: Shyamsundar Ranganathan <srangana@redhat.com>

commit | commitdiff | tree

Shyamsundar Ranganathan [Thu, 2 Jul 2020 01:08:34 +0000 (21:08 -0400)]

mgr/volumes: Use operation type during subvolume open

Subvolume open currently takes 2 optional parameters that
denote the desired state and type. The open then allows the
operation to succeed based on the (type, state) tuple.

Instead, pass the type of operation to be performed on a subvolume
during open, and decide internally, per subvolume version, whether the
operation is allowed based on its state and type.

Also modifies the state machine code to be more amenable to
modification and improves readability.

Signed-off-by: Shyamsundar Ranganathan <srangana@redhat.com>

commit | commitdiff | tree

Patrick Donnelly [Wed, 29 Jul 2020 18:05:02 +0000 (11:05 -0700)]

Merge PR #24068 into master

* refs/pull/24068/head:
mds: rename {CDir,Migrator}::cache to mdcache
mds: make MDSCacheObject::is_ambiguous_auth() virtual
mds: make sure rename old inode's parent dirfrag is projected.
mds: track projected inode/fnode in Mutation
mds: use smart pointer to manage CDir::fnode
mds: use smart pointer to manage CInode::{inode,xattrs,old_inodes}
osdc/Filer: make layout pointer const

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>

commit | commitdiff | tree

Kefu Chai [Wed, 29 Jul 2020 14:48:04 +0000 (22:48 +0800)]

Merge pull request #36053 from tchaikov/wip-mkdir

kv: replace compat_mkdir with fs::create_directory

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>

commit | commitdiff | tree

Yan, Zheng [Tue, 23 Jul 2019 01:41:30 +0000 (09:41 +0800)]

mds: rename {CDir,Migrator}::cache to mdcache

Make it consistent with CInode::mdcache.

Signed-off-by: "Yan, Zheng" <zyan@redhat.com>

commit | commitdiff | tree

Yan, Zheng [Tue, 23 Jul 2019 01:14:41 +0000 (09:14 +0800)]

mds: make MDSCacheObject::is_ambiguous_auth() virtual

CInode overrides is_ambiguous_auth(). Locker calls is_ambiguous_auth()
from base class.

Signed-off-by: "Yan, Zheng" <zyan@redhat.com>
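
The effect of making is_ambiguous_auth() virtual can be illustrated with a minimal sketch. The types below are simplified, hypothetical stand-ins (not the real MDSCacheObject, CInode, or Locker); they only show that a caller holding a base-class reference reaches the derived override once the method is virtual.

```cpp
#include <cassert>

// Hypothetical stand-in for MDSCacheObject.
struct CacheObject {
  virtual ~CacheObject() = default;
  // Virtual, so calls through the base type dispatch to overrides.
  virtual bool is_ambiguous_auth() const { return false; }
};

// Hypothetical stand-in for CInode, which overrides the check.
struct Inode : CacheObject {
  bool ambiguous = false;
  bool is_ambiguous_auth() const override { return ambiguous; }
};

// Stand-in for a caller (such as Locker) that only sees the base type.
inline bool caller_sees_ambiguous(const CacheObject& obj) {
  return obj.is_ambiguous_auth();
}
```

Without `virtual` on the base method, the call in `caller_sees_ambiguous()` would bind statically to the base version and ignore the override.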

commit | commitdiff | tree

Yan, Zheng [Thu, 7 May 2020 02:33:12 +0000 (10:33 +0800)]

mds: make sure rename old inode's parent dirfrag is projected.

If the rename destination dentry is a remote dentry, Server::_rename_prepare()
only pre-dirties the old inode, but does not project the fnode for the old
inode's parent dirfrag. This will trigger an assertion (introduced by the
previous commit) in CDir::mark_dirty().

Signed-off-by: "Yan, Zheng" <zyan@redhat.com>

commit | commitdiff | tree

Yan, Zheng [Tue, 9 Jul 2019 10:15:35 +0000 (18:15 +0800)]

mds: track projected inode/fnode in Mutation

Signed-off-by: "Yan, Zheng" <zyan@redhat.com>

commit | commitdiff | tree

Yan, Zheng [Sat, 14 Jul 2018 08:33:19 +0000 (16:33 +0800)]

mds: use smart pointer to manage CDir::fnode

Signed-off-by: "Yan, Zheng" <zyan@redhat.com>

commit | commitdiff | tree

Yan, Zheng [Thu, 16 Jul 2020 03:19:10 +0000 (11:19 +0800)]

mds: use smart pointer to manage CInode::{inode,xattrs,old_inodes}

This avoids copying the whole inode_t and xattr map when journaling inodes.

Signed-off-by: "Yan, Zheng" <zyan@redhat.com>
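
As a rough illustration of why a smart pointer avoids the copy, here is a minimal sketch. `InodeData`, `JournalEntry`, and `journal()` are hypothetical names, not the MDS types; the point is only that handing out a `std::shared_ptr<const T>` shares one immutable object instead of duplicating it.

```cpp
#include <cassert>
#include <map>
#include <memory>
#include <string>

// Hypothetical stand-in for an inode plus its xattr map.
struct InodeData {
  long size = 0;
  std::map<std::string, std::string> xattrs;
};

using InodeConstPtr = std::shared_ptr<const InodeData>;

// Hypothetical journal record that references the inode state.
struct JournalEntry {
  InodeConstPtr inode;
};

// Journaling copies only the pointer (a reference-count bump),
// not the inode structure or the xattr map behind it.
inline JournalEntry journal(const InodeConstPtr& current) {
  return JournalEntry{current};
}
```

Because the pointee is `const`, a later modification has to build a new object and swap the pointer, which leaves entries that were already journaled untouched.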

commit | commitdiff | tree

Kefu Chai [Wed, 29 Jul 2020 12:35:13 +0000 (20:35 +0800)]

Merge pull request #36307 from petrutlucian94/rocksdb_lz4

cmake: fix lz4 params when building rocksdb

Reviewed-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Kefu Chai [Wed, 29 Jul 2020 12:30:11 +0000 (20:30 +0800)]

Merge pull request #36295 from majianpeng/bluefs-reduce-unnecessary-flush

os/bluestore/BlueFS: Don't flush unused device.

Reviewed-by: Igor Fedotov <ifedotov@suse.com>

commit | commitdiff | tree

Kefu Chai [Wed, 29 Jul 2020 12:29:01 +0000 (20:29 +0800)]

Merge pull request #36232 from mgfritch/cephadm-ok-to-stop

mgr/cephadm: add `orch ok-to-stop` commands

Reviewed-by: Sebastian Wagner <sebastian.wagner@suse.com>
Reviewed-by: Ricardo Marques <rimarques@suse.com>

commit | commitdiff | tree

Kefu Chai [Wed, 29 Jul 2020 12:25:56 +0000 (20:25 +0800)]

Merge pull request #34848 from changchengx/protocolv2

refine class member function implementation in ProtocolV2

Reviewed-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Tatjana Dehler [Wed, 29 Jul 2020 11:33:14 +0000 (13:33 +0200)]

Merge pull request #36007 from votdev/issue_46448_hosts_unit_tests

mgr/dashboard: Add hosts page unit tests

Reviewed-by: Sebastian Krah <skrah@suse.com>
Reviewed-by: Tatjana Dehler <tdehler@suse.com>
Reviewed-by: Tiago Melo <tmelo@suse.com>

commit | commitdiff | tree

Volker Theile [Wed, 29 Jul 2020 11:24:14 +0000 (13:24 +0200)]

Merge pull request #36320 from s0nea/wip-dashboard-46573

mgr/dashboard: wait longer for health status to be cleared

Reviewed-by: Ni-Feng Chang <kiefer.chang@suse.com>
Reviewed-by: Volker Theile <vtheile@suse.com>
Reviewed-by: Alfonso Martínez <almartin@redhat.com>
Reviewed-by: Stephan Müller <smueller@suse.com>

commit | commitdiff | tree

Kefu Chai [Wed, 29 Jul 2020 09:05:36 +0000 (17:05 +0800)]

Merge pull request #36323 from tchaikov/wip-crimson-msgr-v1-v2

crimson: picking peer addr of the compatible type

Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Jan Fajerski [Wed, 29 Jul 2020 08:56:57 +0000 (10:56 +0200)]

Merge pull request #35728 from jan--f/c-v-add-subcommand-parse-drive-groups

ceph-volume: add drive-group subcommand

commit | commitdiff | tree

Kefu Chai [Wed, 29 Jul 2020 08:48:23 +0000 (16:48 +0800)]

Merge pull request #36342 from tchaikov/wip-crimson-heartbeat-erase

crimson/osd: erase an element by iterator instead

Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>

commit | commitdiff | tree

Sebastian Wagner [Wed, 29 Jul 2020 08:19:42 +0000 (10:19 +0200)]

Merge pull request #36217 from sebastian-philipp/cephadm-common-mypy-ini

cephadm: use src/mypy.ini instead

Reviewed-by: Joshua Schmid <jschmid@suse.de>
Reviewed-by: Michael Fritch <mfritch@suse.com>

commit | commitdiff | tree

Sebastian Wagner [Wed, 29 Jul 2020 08:15:07 +0000 (10:15 +0200)]

Merge pull request #36235 from matthewoliver/cephadm_iscsi_tcmu_runner

cephadm: Add tcmu-runner container when deploying ceph-iscsi

Reviewed-by: Kefu Chai <kchai@redhat.com>
Reviewed-by: Sebastian Wagner <sebastian.wagner@suse.com>

commit | commitdiff | tree

Kefu Chai [Wed, 29 Jul 2020 07:33:59 +0000 (15:33 +0800)]

crimson/osd: implement cls_get_pool_stripe_width

Signed-off-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Kefu Chai [Wed, 29 Jul 2020 06:37:57 +0000 (14:37 +0800)]

crimson/osd: erase an element by iterator instead

We should not remove an element from a map while iterating over it, as erasing
the element invalidates the iterator, which causes a segfault when we
advance it after erasing the dereferenced element.

In this change, an iterator is used for walking through the map. In
comparison with creating a to-be-removed list, this approach is more
efficient and more idiomatic.

Signed-off-by: Kefu Chai <kchai@redhat.com>
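
The erase-by-iterator idiom described above can be sketched as follows. `PeerMap` and `erase_matching()` are hypothetical names chosen for illustration and are not taken from the crimson heartbeat code.

```cpp
#include <cassert>
#include <cstddef>
#include <map>
#include <string>

// Hypothetical stand-in for a map of peers keyed by OSD id.
using PeerMap = std::map<int, std::string>;

// Remove every entry whose value equals `victim` in a single pass.
// std::map::erase(iterator) returns the iterator following the erased
// element, so the loop never advances an invalidated iterator.
inline std::size_t erase_matching(PeerMap& peers, const std::string& victim) {
  std::size_t removed = 0;
  for (auto it = peers.begin(); it != peers.end();) {
    if (it->second == victim) {
      it = peers.erase(it);  // continue from the returned successor
      ++removed;
    } else {
      ++it;
    }
  }
  return removed;
}
```

The loop header deliberately has no increment expression: each branch decides how the iterator moves forward.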

commit | commitdiff | tree

Kefu Chai [Wed, 29 Jul 2020 06:18:56 +0000 (14:18 +0800)]

Merge pull request #36341 from tchaikov/wip-crimson-cls

crimson/osd: correct the function name of cls_cxx_map_get_vals_by_keys()

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>

commit | commitdiff | tree

Kefu Chai [Wed, 29 Jul 2020 04:32:26 +0000 (12:32 +0800)]

crimson/osd: correct the function name of cls_cxx_map_get_vals_by_keys()

it was an oversight in 7a4c6359e483f8c71276ece5cde16eb0771ac5d2

Signed-off-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Kefu Chai [Wed, 29 Jul 2020 01:44:46 +0000 (09:44 +0800)]

Merge pull request #36079 from winndows/superfluous_break6

msg: Remove superfluous breaks

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>

commit | commitdiff | tree

Neha Ojha [Wed, 29 Jul 2020 01:11:12 +0000 (18:11 -0700)]

Merge pull request #36297 from dvanders/dvanders_46443

osd: fix crash in _committed_osd_maps if incremental osdmap crc fails

Reviewed-by: Xiaoxi Chen <xiaoxchen@ebay.com>
Reviewed-by: David Zafman <dzafman@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>

commit | commitdiff | tree

Michael Fritch [Tue, 21 Jul 2020 21:06:19 +0000 (15:06 -0600)]

mgr/cephadm: add `orch host ok-to-stop` command

$ ceph orch host ok-to-stop host1
It is presumed safe to stop host host1

Signed-off-by: Michael Fritch <mfritch@suse.com>

commit | commitdiff | tree

Michael Fritch [Tue, 21 Jul 2020 21:26:43 +0000 (15:26 -0600)]

mgr/cephadm: return HandleCommandResult from ok_to_stop

- return output from the result of the ok_to_stop command
- log ok-to-stop result during all invocations

Signed-off-by: Michael Fritch <mfritch@suse.com>

commit | commitdiff | tree

Michael Fritch [Wed, 22 Jul 2020 23:43:05 +0000 (17:43 -0600)]

mgr/orch: add errno to OrchestratorError

add errno to OrchestratorError and ServiceSpecValidationError exceptions

Signed-off-by: Michael Fritch <mfritch@suse.com>

commit | commitdiff | tree

Neha Ojha [Tue, 28 Jul 2020 17:36:09 +0000 (10:36 -0700)]

qa/suites/rados/thrash/crc-failures: randomly inject bad incremental osdmap crc

Signed-off-by: Neha Ojha <nojha@redhat.com>

commit | commitdiff | tree

Dan van der Ster [Mon, 27 Jul 2020 15:40:27 +0000 (17:40 +0200)]

osd: don't write transaction when inc crc failed

80da5f9a987c6a48b93f25228fdac85890013520 exposed a flaw in how
handle_osd_map falls back to a full osdmap if the crc of an incremental
failed.

If the first message in a map message had a crc error, then the
loop would exit with last < start, which would then cause a null
dereference in _committed_osd_maps.

Fixes: https://tracker.ceph.com/issues/46443
Signed-off-by: Dan van der Ster <daniel.vanderster@cern.ch>

commit | commitdiff | tree

Dan van der Ster [Mon, 27 Jul 2020 12:23:54 +0000 (14:23 +0200)]

qa/standalone/osd: add bad-inc-map.sh

Test that the osd doesn't crash when it gets a bad incremental osdmap.

Related-to: https://tracker.ceph.com/issues/46443
Signed-off-by: Dan van der Ster <daniel.vanderster@cern.ch>

commit | commitdiff | tree

Mykola Golub [Tue, 28 Jul 2020 18:30:07 +0000 (21:30 +0300)]

Merge pull request #36287 from dillaman/wip-librbd-close

librbd: use task finisher thread for image open/close callbacks

Reviewed-by: Mykola Golub <mgolub@suse.com>

commit | commitdiff | tree

Jason Dillaman [Tue, 28 Jul 2020 17:33:17 +0000 (13:33 -0400)]

Merge pull request #36253 from changchengx/exclusive

doc: specify RBD_LOCK_MODE_EXCLUSIVE for exclusive-lock

Reviewed-by: Jason Dillaman <dillaman@redhat.com>

commit | commitdiff | tree

Kefu Chai [Tue, 28 Jul 2020 15:22:14 +0000 (23:22 +0800)]

Merge pull request #36328 from tchaikov/wip-crimson-cls_cxx_map_get_vals

crimson/osd: implement cls_cxx_map_get_vals()

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>

commit | commitdiff | tree

Kefu Chai [Tue, 28 Jul 2020 14:39:21 +0000 (22:39 +0800)]

crimson/osd: implement cls_cxx_map_get_vals()

Signed-off-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Changcheng Liu [Thu, 23 Jul 2020 03:09:46 +0000 (11:09 +0800)]

doc: specify RBD_LOCK_MODE_EXCLUSIVE for exclusive-lock

The exclusive lock can transition transparently between clients
after a write operation finishes. To disable this "transparent" transition,
the lock must be acquired with RBD_LOCK_MODE_EXCLUSIVE.

Signed-off-by: Changcheng Liu <changcheng.liu@aliyun.com>

commit | commitdiff | tree

Kefu Chai [Tue, 28 Jul 2020 12:49:58 +0000 (20:49 +0800)]

Merge pull request #34928 from p-se/wip-pse-revise-monitoring-doc

mgr/dashboard: revise monitoring documentation

Reviewed-by: Lenz Grimmer <lgrimmer@suse.com>
Reviewed-by: Stephan Müller <smueller@suse.com>

commit | commitdiff | tree

Kefu Chai [Tue, 28 Jul 2020 12:32:59 +0000 (20:32 +0800)]

crimson: use pick_addr() for picking peer addr

In teuthology tests, there is a good chance that we have a ceph.conf
containing:

mon host = 172.21.15.122

which is translated to two monitors

- a: 172.21.15.122:3300
- a-legacy: 172.21.15.122:6789

Both have a protocol type of "any". So, to enable crimson to use settings
like this, we should let crimson accept them and drop the connection
if the peer claims to be using an incompatible protocol when they are
exchanging banners.

Signed-off-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Tatjana Dehler [Tue, 28 Jul 2020 11:18:56 +0000 (13:18 +0200)]

mgr/dashboard: wait longer for health status to be cleared

The cluster needs more time to recover from
HEALTH_WARN while changes are made by `test_pool_update_metadata`.
Let's wait several times for the cluster status to be HEALTH_OK
again.

Fixes: https://tracker.ceph.com/issues/46573
Signed-off-by: Tatjana Dehler <tdehler@suse.com>

commit | commitdiff | tree

Kefu Chai [Tue, 28 Jul 2020 12:32:00 +0000 (20:32 +0800)]

crimson/mon: use mon with only v2 address

crimson msgr supports v2 protocol now, so we can connect to monitor
which only provides v2 addresses.

Signed-off-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Kefu Chai [Tue, 28 Jul 2020 12:29:29 +0000 (20:29 +0800)]

msg/msg_types.h: add pick_addr()

for picking an addr from an entity_addrvec_t by given protocol type.
so:
  - v2 => v2, any
  - v1 => v1, any
  - any => any, v1, v2

Also add an `addr_of_type()` helper to avoid repetition.

Signed-off-by: Kefu Chai <kchai@redhat.com>
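
The preference table above can be sketched as a small lookup. This is an illustration only: `AddrType` and `Addr` are simplified, hypothetical stand-ins for Ceph's address types, and the two functions merely reuse the names from the commit message without claiming to match the real signatures in msg/msg_types.h.

```cpp
#include <cassert>
#include <optional>
#include <vector>

// Hypothetical, simplified address model (not entity_addr_t).
enum class AddrType { any, v1, v2 };
struct Addr {
  AddrType type;
  int port;
};

// Return the first address of exactly the given type, if there is one.
inline std::optional<Addr> addr_of_type(const std::vector<Addr>& addrs,
                                        AddrType type) {
  for (const auto& a : addrs) {
    if (a.type == type) return a;
  }
  return std::nullopt;
}

// Apply the preference order from the commit message:
//   v2 => v2, any;  v1 => v1, any;  any => any, v1, v2
inline std::optional<Addr> pick_addr(const std::vector<Addr>& addrs,
                                     AddrType wanted) {
  std::vector<AddrType> order;
  switch (wanted) {
    case AddrType::v2: order = {AddrType::v2, AddrType::any}; break;
    case AddrType::v1: order = {AddrType::v1, AddrType::any}; break;
    case AddrType::any:
      order = {AddrType::any, AddrType::v1, AddrType::v2};
      break;
  }
  for (AddrType t : order) {
    if (auto a = addr_of_type(addrs, t)) return a;
  }
  return std::nullopt;
}
```

With a vector holding a v1 address on port 6789 and a v2 address on port 3300, asking for v2 yields the 3300 entry, while asking for v2 from a vector that only holds a v1 address yields no address at all.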

commit | commitdiff | tree

Sebastian Wagner [Tue, 28 Jul 2020 12:46:20 +0000 (14:46 +0200)]

Merge pull request #36285 from sebastian-philipp/orch-completion-generic

mgr/orch: Add some more type annotations

Reviewed-by: Michael Fritch <mfritch@suse.com>

commit | commitdiff | tree

Volker Theile [Tue, 28 Jul 2020 12:00:25 +0000 (14:00 +0200)]

Merge pull request #36258 from rhcs-dashboard/fix-cpu-stats

mgr/dashboard: cpu stats incorrectly displayed

Reviewed-by: Patrick Seidensal <pseidensal@suse.com>
Reviewed-by: Alfonso Martínez <almartin@redhat.com>
Reviewed-by: Volker Theile <vtheile@suse.com>

commit | commitdiff | tree

Sebastian Wagner [Fri, 24 Jul 2020 15:29:28 +0000 (17:29 +0200)]

mgr/orch: Add some more type annotations

Made `orch.Completion` a generic type

Signed-off-by: Sebastian Wagner <sebastian.wagner@suse.com>

commit | commitdiff | tree

Sebastian Wagner [Tue, 28 Jul 2020 09:54:12 +0000 (11:54 +0200)]

Merge pull request #36012 from adk3798/cephadm_44886

mgr/cephadm: allow use of authenticated registry

commit | commitdiff | tree

Sebastian Wagner [Tue, 28 Jul 2020 09:52:53 +0000 (11:52 +0200)]

Merge pull request #36262 from sebastian-philipp/orch-readd-apply_dg

mgr/cephadm: re-add `apply_drivegroups()`

Reviewed-by: Joshua Schmid <jschmid@suse.de>
Reviewed-by: Kiefer Chang <kiefer.chang@suse.com>

commit | commitdiff | tree

Nathan Cutler [Tue, 28 Jul 2020 09:33:30 +0000 (11:33 +0200)]

Merge pull request #36306 from smithfarm/wip-add-octopus-to-release-table

doc/releases: add "octopus" column to Release Timeline

Reviewed-by: Neha Ojha <nojha@redhat.com>

commit | commitdiff | tree

Sebastian Wagner [Tue, 28 Jul 2020 09:29:01 +0000 (11:29 +0200)]

Merge pull request #36301 from sebastian-philipp/doc-cephadm-status-no-progress

doc/cephadm: `status` doesn't show a progress

Reviewed-by: Michael Fritch <mfritch@suse.com>
Reviewed-by: Zac Dover <zac.dover@gmail.com>

commit | commitdiff | tree

Matthew Oliver [Wed, 22 Jul 2020 07:09:12 +0000 (17:09 +1000)]

cephadm: Add tcmu-runner container when deploying ceph-iscsi

Currently, when we deploy ceph-iscsi via cephadm, it doesn't include a
running tcmu-runner. This means initiators will be able to log in, but
you won't see the LUNs on the initiator.

This patch deploys an additional tcmu-runner container alongside the
ceph-iscsi container that just runs the tcmu-runner service.

Fixes: https://tracker.ceph.com/issues/46540
Signed-off-by: Matthew Oliver <moliver@suse.com>

commit | commitdiff | tree

Kefu Chai [Tue, 28 Jul 2020 02:21:24 +0000 (10:21 +0800)]

Merge pull request #36090 from inspur-wyq/wip-37532

mon: fix the 'Error ERANGE' message when conf "osd_objectstore" is filestore

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Kefu Chai [Tue, 28 Jul 2020 02:20:00 +0000 (10:20 +0800)]

Merge pull request #36283 from rzarzynski/wip-bl-raw-privatization

common/bl: don't access raw::data nor raw::len directly. Use getters instead.

Reviewed-by: Neha Ojha <nojha@redhat.com>

commit | commitdiff | tree

Lucian Petrut [Mon, 27 Jul 2020 13:57:59 +0000 (13:57 +0000)]

cmake: fix lz4 params when building rocksdb

Recent RocksDB versions use slightly different parameter names for
the LZ4 include/lib dirs, so we have to pass the right ones.

We also have to fix the "CMAKE_TOOLCHAIN_FILE" parameter,
which isn't passed properly.

Signed-off-by: Lucian Petrut <lpetrut@cloudbasesolutions.com>

commit | commitdiff | tree

Nathan Cutler [Mon, 27 Jul 2020 15:40:58 +0000 (17:40 +0200)]

doc/releases: add "octopus" column to Release Timeline

Octopus has been out for a while. I suppose this should have been done
earlier, but "better late than never".

Signed-off-by: Nathan Cutler <ncutler@suse.com>

commit | commitdiff | tree

Nathan Cutler [Mon, 27 Jul 2020 15:39:22 +0000 (17:39 +0200)]

Merge pull request #36245 from smithfarm/wip-mimic-is-eol

doc/releases: Mimic is EOL

Reviewed-by: Abhishek Lekshmanan <abhishek@suse.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>

commit | commitdiff | tree

Kefu Chai [Mon, 27 Jul 2020 14:53:07 +0000 (22:53 +0800)]

Merge pull request #36279 from tchaikov/wip-crimson-msgr-v2.1

crimson/net: enable on-wire-encrypt and v2.1 support

Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
Reviewed-by: Josh Durgin <jdurgin@redhat.com>

commit | commitdiff | tree

Sebastian Wagner [Mon, 27 Jul 2020 14:50:01 +0000 (16:50 +0200)]

doc/cephadm: `status` doesn't show a progress

Fixes: https://tracker.ceph.com/issues/45858
Signed-off-by: Sebastian Wagner <sebastian.wagner@suse.com>

commit | commitdiff | tree

Nathan Cutler [Mon, 27 Jul 2020 14:25:30 +0000 (16:25 +0200)]

Merge pull request #35852 from smithfarm/wip-opensuse-os-recommendations

doc/start/os-recommendations: current state of openSUSE

Reviewed-by: Tim Serong <tserong@suse.com>

commit | commitdiff | tree

Casey Bodley [Mon, 27 Jul 2020 13:54:46 +0000 (09:54 -0400)]

Merge pull request #36269 from dang/wip-dang-46692

RGW - fix bulkupload, broken by zipper

Reviewed-by: Casey Bodley <cbodley@redhat.com>

commit | commitdiff | tree

Sebastian Wagner [Mon, 20 Jul 2020 11:55:09 +0000 (13:55 +0200)]

cephadm: use src/mypy.ini instead

Signed-off-by: Sebastian Wagner <sebastian.wagner@suse.com>

commit | commitdiff | tree

Jason Dillaman [Fri, 24 Jul 2020 16:13:10 +0000 (12:13 -0400)]

librbd: use task finisher thread for image open/close callbacks

There was a potential race condition with utilizing the AsioEngine
to deliver asynchronous image open and close callbacks. This left
the potential for the io_context thread to attempt to destroy itself.

This commit changes the behavior of the image open and close callbacks
to always delete the ImageCtx (now matches the synchronous API behavior)
and it always invokes the callback in Finisher thread whose lifetime is
tied to the CephContext.

Signed-off-by: Jason Dillaman <dillaman@redhat.com>

commit | commitdiff | tree

Jan Fajerski [Mon, 27 Jul 2020 08:29:21 +0000 (10:29 +0200)]

Merge pull request #36219 from guits/guits-fix_zap_osdid_osdfsid

ceph-volume: filter by osd-id or osd-fsid when zapping

commit | commitdiff | tree

Jianpeng Ma [Mon, 27 Jul 2020 06:59:08 +0000 (14:59 +0800)]

os/bluestore/BlueFS: Don't flush unused device.

Signed-off-by: Jianpeng Ma <jianpeng.ma@intel.com>

commit | commitdiff | tree

Guillaume Abrioux [Mon, 20 Jul 2020 13:43:38 +0000 (15:43 +0200)]

ceph-volume: filter by osd-id or osd-fsid when zapping

2f5c10c12c37e6865ce54bb4940d3779353cba4f introduced a bug:

`ceph-volume lvm zap` command fails under certain conditions.

When passing `--osd-id` or `--osd-fsid` to the `ceph-volume lvm zap` command,
it tries to zap additional devices that have nothing to do with the OSD
being zapped.

When calling `api.get_lvs()` in `ensure_associated_lvs()`, we have to
pass the osd-id/osd-fsid information so that only related devices are
returned by the `get_lvs()` method.

Closes: https://tracker.ceph.com/issues/46627
Signed-off-by: Guillaume Abrioux <gabrioux@redhat.com>

commit | commitdiff | tree

Kefu Chai [Sat, 25 Jul 2020 09:22:14 +0000 (17:22 +0800)]

messages/MOSDBoot: pass OSDSuperblock by const ref

MOSDBoot's ctor does not change the parameter, so let's pass by const
reference.

Signed-off-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Kefu Chai [Sat, 25 Jul 2020 09:13:41 +0000 (17:13 +0800)]

crimson/os/alienstore: always use fsid in bluestore

alienstore should not be stateful in this respect; it should proxy
all access to fsid to bluestore.

There are a couple of issues in the existing implementation:

* During mkfs, bluestore tries to generate a new osd_fsid if the specified
  one is empty. But we explicitly pass the given uuid down to
  AlienStore::mkfs() so that bluestore can use it. So we should pass it
  down instead of storing it locally.
* When persisting the superblock in OSD::mkfs(), superblock.osd_fsid() is
  read from store->get_fsid(). If the user specifies an empty uuid, we
  should persist the generated uuid in the superblock.

In this change, all access to fsid is proxied to the underlying
bluestore.

Signed-off-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Kefu Chai [Sat, 25 Jul 2020 07:45:26 +0000 (15:45 +0800)]

stop.sh: stop osd before mon

osd sends a MOSDMarkMeDown message to monitor and waits for its ack
before timeout, so if we can stop osd before stopping mon, stop.sh can
return sooner without waiting until the timeout.

Signed-off-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Kefu Chai [Sat, 25 Jul 2020 05:04:58 +0000 (13:04 +0800)]

crimson/mgr: only pick the addr of the same type

This avoids attempts to connect an OSD which is bound to a v2
address to a v1 address of a mgr.

In general, an OSD is bound to both v1 and v2 addresses, but the crimson
msgr does not support multiple bound addresses at the time of writing. So,
to avoid failures when trying to connect to incompatible addresses,
let's filter them out when connecting to monitor. This change
silences warnings like:

peer_addr_for_me v1:172.21.15.106:60008/0 type doesn't match myaddr
v2:0.0.0.0:6802/26710

Signed-off-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Kefu Chai [Sat, 25 Jul 2020 05:02:49 +0000 (13:02 +0800)]

mon/MgrMap: let get_active_addrs() return a const ref

no need to create a temporary instance for referencing those addresses.

Signed-off-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Kefu Chai [Sat, 25 Jul 2020 04:38:04 +0000 (12:38 +0800)]

crimson/mon: only pick the addr of the same type

This avoids attempts to connect an OSD which is bound to a v2 address
to a v1 address of a monitor.

In general, an OSD is bound to both v1 and v2 addresses, but the crimson msgr
does not support multiple bound addresses at the time of writing. So, to
avoid failures when trying to connect to incompatible addresses,
let's filter them out when connecting to the monitor. This change silences
warnings like:

peer_addr_for_me v1:172.21.15.106:60008/0 type doesn't match myaddr
v2:0.0.0.0:6802/26710

Signed-off-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Kefu Chai [Fri, 24 Jul 2020 15:13:37 +0000 (23:13 +0800)]

crimson/osd: print out client caps

Signed-off-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Kefu Chai [Fri, 24 Jul 2020 15:10:51 +0000 (23:10 +0800)]

auth/cephx: implement random()->get_bytes() for crimson

Instead of using CryptoRandom, use the C++ standard library for
generating the secret.

Signed-off-by: Kefu Chai <kchai@redhat.com>
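
For orientation, one way to fill a buffer with random bytes using only the C++ standard library is sketched below. This is an assumption about the general approach, not the code from the commit: `random_bytes()` is a hypothetical helper, and `std::random_device` is not guaranteed by the standard to be cryptographically secure on every platform.

```cpp
#include <cassert>
#include <cstddef>
#include <random>
#include <vector>

// Hypothetical helper: return `n` bytes drawn from std::random_device.
inline std::vector<unsigned char> random_bytes(std::size_t n) {
  std::random_device rd;  // implementation-defined entropy source
  std::uniform_int_distribution<unsigned int> dist(0, 255);
  std::vector<unsigned char> out(n);
  for (auto& b : out) {
    b = static_cast<unsigned char>(dist(rd));
  }
  return out;
}
```

The distribution is declared over `unsigned int` because `std::uniform_int_distribution` is not specified for character types.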

commit | commitdiff | tree

Kefu Chai [Fri, 24 Jul 2020 13:11:08 +0000 (21:11 +0800)]

crimson/admin: catch thrown exception

If the socket file exists, a std::system_error is thrown, and we should
catch it.

Signed-off-by: Kefu Chai <kchai@redhat.com>
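
The pattern described here, catching std::system_error when a socket path is already taken, can be sketched like this. The function names and the in-memory set standing in for the filesystem are hypothetical; the crimson admin socket code is not reproduced.

```cpp
#include <cassert>
#include <set>
#include <string>
#include <system_error>

// Hypothetical bind step: throws if the path is already present.
// The set stands in for socket files that exist on disk.
inline void bind_admin_socket(std::set<std::string>& existing,
                              const std::string& path) {
  if (!existing.insert(path).second) {
    throw std::system_error(
        std::make_error_code(std::errc::address_in_use), path);
  }
}

// Caller that catches the exception instead of letting it propagate.
inline bool try_start_admin(std::set<std::string>& existing,
                            const std::string& path) {
  try {
    bind_admin_socket(existing, path);
    return true;
  } catch (const std::system_error&) {
    return false;  // report failure to the caller; no crash
  }
}
```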

commit | commitdiff | tree

Kefu Chai [Fri, 24 Jul 2020 09:00:06 +0000 (17:00 +0800)]

crimson/msgr: Revert "don't advertise the on-wire format v2.1."

This reverts commit a74948bc5095b32212189352c163030bfe10db71.

Signed-off-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Kefu Chai [Fri, 24 Jul 2020 10:03:16 +0000 (18:03 +0800)]

crimson/net: enable msgr v2.1 support

Signed-off-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Kefu Chai [Fri, 24 Jul 2020 10:01:12 +0000 (18:01 +0800)]

crimson/net: enable on_wire encryption support

Signed-off-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Kefu Chai [Fri, 24 Jul 2020 08:59:58 +0000 (16:59 +0800)]

crimson/net: set is_rev1 for messenger v2.1 support

Signed-off-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Kefu Chai [Fri, 24 Jul 2020 08:55:36 +0000 (16:55 +0800)]

crimson/net: keep rx_preamble for msgr v2.1 support

Signed-off-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Kefu Chai [Fri, 24 Jul 2020 08:45:46 +0000 (16:45 +0800)]

crimson/net: drop stale TODO

Signed-off-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Kefu Chai [Fri, 24 Jul 2020 08:33:22 +0000 (16:33 +0800)]

crimson/net: use rx_frame_asm for handling data read from wire

By leveraging FrameAssembler, it's much simpler, and it also paves the
way to better messenger v2.0 and v2.1 protocol support.

Signed-off-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Kefu Chai [Fri, 24 Jul 2020 08:30:44 +0000 (16:30 +0800)]

crimson/net: mark abort_ functions [[noreturn]]

Otherwise the compiler complains that control may reach the end of a
non-void function.

Signed-off-by: Kefu Chai <kchai@redhat.com>
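
A minimal sketch of the [[noreturn]] point follows. `abort_protocol()` and `frame_len()` are hypothetical names, and throwing is only one way such a function can avoid returning; the attribute tells the compiler that control never comes back from the call, so no trailing `return` is needed in the caller.

```cpp
#include <cassert>
#include <stdexcept>

// Hypothetical abort helper: it always throws, so it never returns.
[[noreturn]] inline void abort_protocol(const char* why) {
  throw std::runtime_error(why);
}

// Non-void function whose final statement is a [[noreturn]] call.
// Without the attribute, compilers may warn that control can reach
// the end of a non-void function.
inline int frame_len(int tag) {
  switch (tag) {
    case 1: return 16;
    case 2: return 32;
  }
  abort_protocol("unknown tag");
}
```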

commit | commitdiff | tree

Kefu Chai [Fri, 24 Jul 2020 03:51:57 +0000 (11:51 +0800)]

msg/async/crypto_onwire: drop unused member variable

Signed-off-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Changcheng Liu [Thu, 30 Apr 2020 03:18:36 +0000 (11:18 +0800)]

msg: correct read result check

The "r >= 0" is checked under "r <= 0", so the right condition
is "r == 0".

Signed-off-by: Changcheng Liu <changcheng.liu@aliyun.com>
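
The reasoning behind the corrected condition can be shown with a small sketch. `ReadOutcome` and `classify_read()` are hypothetical, and the meaning attached to a zero result (end of stream) assumes POSIX read semantics rather than anything stated in the commit.

```cpp
#include <cassert>

// Hypothetical classification of a read()-style return value.
enum class ReadOutcome { data, closed, error };

inline ReadOutcome classify_read(long r) {
  if (r <= 0) {
    // Inside this branch r is zero or negative, so "r >= 0" can only
    // ever mean "r == 0"; writing it that way states the intent.
    if (r == 0) {
      return ReadOutcome::closed;
    }
    return ReadOutcome::error;
  }
  return ReadOutcome::data;
}
```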

commit | commitdiff | tree

Changcheng Liu [Thu, 30 Apr 2020 02:34:01 +0000 (10:34 +0800)]

msg: remove undefined/unused interface from ProtocolV2

There's no need to keep read_message_data_prepare since ProtocolV2
uses a segmented buffer.

Signed-off-by: Changcheng Liu <changcheng.liu@aliyun.com>

commit | commitdiff | tree

Kefu Chai [Sat, 25 Jul 2020 18:03:02 +0000 (02:03 +0800)]

Merge pull request #36259 from majianpeng/bluefs-reduce-ceph_clock_now

os/bluestore/BlueFS: reduce unnecessary ceph_clock_now().

Reviewed-by: Igor Fedotov <ifedotov@suse.com>

commit | commitdiff | tree

Kefu Chai [Sat, 25 Jul 2020 18:00:15 +0000 (02:00 +0800)]

Merge pull request #33899 from rs-fabrica/rados_generic_options_usage_message

rados: include generic options in usage message

Reviewed-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Kefu Chai [Sat, 25 Jul 2020 17:57:49 +0000 (01:57 +0800)]

Merge pull request #36115 from BenoitKnecht/diskprediction-local-array-shape

mgr/diskprediction_local: Fix array size error

Reviewed-by: Josh Durgin <jdurgin@redhat.com>

commit | commitdiff | tree

Kefu Chai [Sat, 25 Jul 2020 17:57:11 +0000 (01:57 +0800)]

Merge pull request #35306 from changchengx/blk

blk: add option to set device type to select blk driver

Reviewed-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Kefu Chai [Sat, 25 Jul 2020 17:54:29 +0000 (01:54 +0800)]

Merge pull request #36274 from xiexingguo/wip-peer-num-objects

osd/PeeringState: prevent peer's num_objects going negative

Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: yanjun <yan.jun8@zte.com.cn>

commit | commitdiff | tree

Kefu Chai [Sat, 25 Jul 2020 10:22:08 +0000 (18:22 +0800)]

Merge pull request #36236 from tchaikov/wip-std-bind

test/librados_test_stub: use std::bind

Reviewed-by: Adam C. Emerson <aemerson@redhat.com>

commit | commitdiff | tree

Kefu Chai [Sat, 25 Jul 2020 06:41:33 +0000 (14:41 +0800)]

Merge pull request #36071 from rzarzynski/wip-crimson-errorator-assert-cleanup

crimson: improve assertions in errorator

Reviewed-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

xie xingguo [Fri, 24 Jul 2020 01:57:40 +0000 (09:57 +0800)]

osd/PeeringState: prevent peer's num_objects going negative

Saw it in a teuthology run:

-5645> 2020-07-20 04:34:32.067 7f351e329700 5 osd.5 pg_epoch: 667 ... exit Started/Primary/Active/Backfilling
-5642> 2020-07-20 04:34:32.067 7f351e329700 5 osd.5 pg_epoch: 667 ... enter Started/Primary/Active/Recovered
-5633> 2020-07-20 04:34:32.067 7f351e329700 20 osd.5 pg_epoch: 667 ... _update_calc_stats shard 5 primary objects 0 missing 0
-5632> 2020-07-20 04:34:32.067 7f351e329700 20 osd.5 pg_epoch: 667 ... _update_calc_stats shard 3 objects -1 missing 1
-5631> 2020-07-20 04:34:32.067 7f351e329700 20 osd.5 pg_epoch: 667 ... _update_calc_stats shard 6 objects 0 missing 0

This will crash the choose_acting() procedure as it will mistakenly
think that peer 3 should continue to perform asynchronous recovery
(e.g., due to num_objects_missing = 1) in contrast to fully
backfill-recovered.

While I did not dig into the real cause, there are a couple of
possible explanations of how num_objects can be off. I think that
if a roll forward or log replay could delete something twice, maybe
there would be an undercount. Or maybe something as simple as a
corruption.

Since _update_calc_stats() is going to fix num_objects_missing
for that peer anyway, let's make sure it always starts with a
clean state.

Fixes: https://tracker.ceph.com/issues/46705
Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>

commit | commitdiff | tree

Neha Ojha [Fri, 24 Jul 2020 23:13:55 +0000 (16:13 -0700)]

Merge pull request #36121 from aclamk/wip-bluefs-log-replay-rescue

Rescue procedure for extremely large bluefs log

Reviewed-by: Neha Ojha <nojha@redhat.com>

commit | commitdiff | tree

Neha Ojha [Fri, 24 Jul 2020 21:47:44 +0000 (14:47 -0700)]

Merge pull request #35909 from dzafman/wip-46275

osd: Cancel in-progress scrubs (not user requested)

Reviewed-by: Neha Ojha <nojha@redhat.com>

commit | commitdiff | tree

Daniel Gryniewicz [Thu, 23 Jul 2020 16:38:11 +0000 (12:38 -0400)]

RGW - fix bulkupload, broken by zipper

Bulkupload depended on the existence of an empty bucketinfo. Fix that to
avoid a crash. In addition, the error handler for swift used buckets.

Fixes 46692

Signed-off-by: Daniel Gryniewicz <dang@redhat.com>

commit | commitdiff | tree

David Zafman [Tue, 7 Jul 2020 01:02:08 +0000 (18:02 -0700)]

test: Check for interruption of scrubs with noscrub/nodeep_scrub

Signed-off-by: David Zafman <dzafman@redhat.com>

commit | commitdiff | tree

David Zafman [Thu, 2 Jul 2020 17:05:57 +0000 (10:05 -0700)]

osd: Cancel in-progress scrubs (not user requested)

This change adds new scrubber.req_scrub to track user
requested scrubs, deep_scrub or repair.

Fixes: https://tracker.ceph.com/issues/46275
Signed-off-by: David Zafman <dzafman@redhat.com>

commit | commitdiff | tree

David Zafman [Thu, 23 Jul 2020 16:40:54 +0000 (09:40 -0700)]

osd: Arrange code so that it is clearer; should not cause any change

Signed-off-by: David Zafman <dzafman@redhat.com>

commit | commitdiff | tree

David Zafman [Tue, 21 Jul 2020 20:58:42 +0000 (13:58 -0700)]

test: mon-last-epoch-clean.sh fixed to avoid shell globbing

Signed-off-by: David Zafman <dzafman@redhat.com>
