]> git.apps.os.sepia.ceph.com Git - ceph.git/log
ceph.git
3 years agomgr/dashboard: cephadm e2e job: improvements 44362/head
Alfonso Martínez [Mon, 20 Dec 2021 08:12:37 +0000 (09:12 +0100)]
mgr/dashboard: cephadm e2e job: improvements

- Avoid jenkins env. error: Input file /workdir/./src/pybind/mgr/dashboard/ci/cephadm/ceph_cluster.yml not found.Leaving...
- More meaningful log output.

Fixes: https://tracker.ceph.com/issues/53666
Signed-off-by: Alfonso Martínez <almartin@redhat.com>
3 years agoMerge pull request #44336 from sebastian-philipp/cephadm-fix-test-facts
Sebastian Wagner [Fri, 17 Dec 2021 10:20:59 +0000 (11:20 +0100)]
Merge pull request #44336 from sebastian-philipp/cephadm-fix-test-facts

mgr/cephadm: Fix test_facts

Reviewed-by: Adam King <adking@redhat.com>
3 years agoMerge pull request #44338 from rzarzynski/wip-crimson-abort_on_LOST_REVERT
Samuel Just [Fri, 17 Dec 2021 09:58:58 +0000 (01:58 -0800)]
Merge pull request #44338 from rzarzynski/wip-crimson-abort_on_LOST_REVERT

crimson/osd: explicitly abort on LOST_REVERT in PG log.

Reviewed-by: Samuel Just <sjust@redhat.com>
3 years agoMerge pull request #44340 from rzarzynski/wip-crimson-backfill-discarding
Samuel Just [Fri, 17 Dec 2021 09:57:45 +0000 (01:57 -0800)]
Merge pull request #44340 from rzarzynski/wip-crimson-backfill-discarding

crimson/osd: honor discarding old backfill messages

Reviewed-by: Samuel Just <sjust@redhat.com>
3 years agoMerge pull request #44345 from xxhdx1985126/wip-onode-root-fix
Samuel Just [Fri, 17 Dec 2021 09:55:42 +0000 (01:55 -0800)]
Merge pull request #44345 from xxhdx1985126/wip-onode-root-fix

crimson/os/seastore: reset onode in 'SeaStore::repeat_with_onode' before the transaction gets destroyed

Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
Reviewed-by: Samuel Just <sjust@redhat.com>
3 years agocrimson/os/seastore: reset onode in 'SeaStore::repeat_with_onode' before the transact... 44345/head
Xuehan Xu [Fri, 17 Dec 2021 05:20:35 +0000 (13:20 +0800)]
crimson/os/seastore: reset onode in 'SeaStore::repeat_with_onode' before the transaction gets destroyed

Onodes hold references to the onode tree extents. And if it's referencing the root extent, that root
extent is cached in the onode trees root_tracker which caches onode tree roots by transaction address.
Than root_tracker entry only gets removed when the onode(or the corresponding "super") is destroyed.
On the other hand, two non-concurrent transactions can occupy the same address. So if an onode gets destroyed
after its transaction is destroyed, there will be a chance that another transaction occupying the same
address get that not-yet-destroyed and may-be-outdated onode.

BTW, Since we already cache extents in transactions, might want to drop onode tree root_tracker later?

Fixes: https://tracker.ceph.com/issues/53651
Signed-off-by: Xuehan Xu <xxhdx1985126@gmail.com>
3 years agocrimson/os/seastore/transaction: make sure the transaction's write_set is a subset...
Xuehan Xu [Fri, 17 Dec 2021 05:53:16 +0000 (13:53 +0800)]
crimson/os/seastore/transaction: make sure the transaction's write_set is a subset of its read_set

Signed-off-by: Xuehan Xu <xxhdx1985126@gmail.com>
3 years agosrc/vstart.sh: correct shell syntax error
Xuehan Xu [Fri, 17 Dec 2021 05:31:40 +0000 (13:31 +0800)]
src/vstart.sh: correct shell syntax error

Signed-off-by: Xuehan Xu <xxhdx1985126@gmail.com>
3 years agoMerge PR #44228 into master
Sage Weil [Fri, 17 Dec 2021 04:54:25 +0000 (23:54 -0500)]
Merge PR #44228 into master

* refs/pull/44228/head:
qa/suites/orch/cephadm/osds: test 'ceph cephadm osd activate'
mgr/cephadm/services/osd: skip found osds that already have daemons
mgr/cephadm: allow activation of OSDs that have previously started

Reviewed-by: Sebastian Wagner <sewagner@redhat.com>
3 years agoqa/suites/orch/cephadm/osds: test 'ceph cephadm osd activate' 44228/head
Sage Weil [Thu, 16 Dec 2021 15:00:05 +0000 (10:00 -0500)]
qa/suites/orch/cephadm/osds: test 'ceph cephadm osd activate'

Make sure this command behaves when the /var/lib/ceph osd.NNN dir is
removed.

Signed-off-by: Sage Weil <sage@newdream.net>
3 years agoMerge pull request #42009 from pritha-srivastava/wip-rgw-sts-correct-read-obj-perm
Casey Bodley [Thu, 16 Dec 2021 18:19:33 +0000 (13:19 -0500)]
Merge pull request #42009 from pritha-srivastava/wip-rgw-sts-correct-read-obj-perm

rgw/sts: fix read_obj_policy permission evaluation

Reviewed-by: Casey Bodley <cbodley@redhat.com>
Reviewed-by: Adam C. Emerson <aemerson@redhat.com>
3 years agoMerge pull request #44331 from ronen-fr/wip-rf-tips187
Ronen Friedman [Thu, 16 Dec 2021 17:10:17 +0000 (19:10 +0200)]
Merge pull request #44331 from ronen-fr/wip-rf-tips187

crimson/osd: removing an unneeded make_unique()

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
3 years agocrimson/osd: implement op discarding for backfill. 44340/head
Radoslaw Zarzynski [Thu, 16 Dec 2021 16:46:59 +0000 (16:46 +0000)]
crimson/osd: implement op discarding for backfill.

Signed-off-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
3 years agocrimson/osd: generalize PG::old_peering_msg() to handle query_epoch.
Radoslaw Zarzynski [Thu, 16 Dec 2021 16:45:00 +0000 (16:45 +0000)]
crimson/osd: generalize PG::old_peering_msg() to handle query_epoch.

Signed-off-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
3 years agocrimson/osd: dissect the lpr check into PG::old_peering_msg().
Radoslaw Zarzynski [Thu, 16 Dec 2021 16:31:35 +0000 (16:31 +0000)]
crimson/osd: dissect the lpr check into PG::old_peering_msg().

Signed-off-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
3 years agoMerge pull request #44280 from chrisphoffman/warn_pg
Neha Ojha [Thu, 16 Dec 2021 16:33:13 +0000 (08:33 -0800)]
Merge pull request #44280 from chrisphoffman/warn_pg

mon: Omit MANY_OBJECTS_PER_PG warning when autoscaler is on

Reviewed-by: Neha Ojha <nojha@redhat.com>
3 years agocrimson/osd: explicitly abort on LOST_REVERT in PG log. 44338/head
Radoslaw Zarzynski [Thu, 16 Dec 2021 16:01:32 +0000 (16:01 +0000)]
crimson/osd: explicitly abort on LOST_REVERT in PG log.

Signed-off-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
3 years agoMerge pull request #43901 from pcuzner/snmp-notifier
Sebastian Wagner [Thu, 16 Dec 2021 15:53:55 +0000 (16:53 +0100)]
Merge pull request #43901 from pcuzner/snmp-notifier

mgr/cephadm: Add snmp-gateway service support

Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: Sebastian Wagner <sewagner@redhat.com>
3 years agomgr/cephadm: Fix test_facts 44336/head
Sebastian Wagner [Thu, 16 Dec 2021 15:40:08 +0000 (16:40 +0100)]
mgr/cephadm: Fix test_facts

Wasn't executed before

Signed-off-by: Sebastian Wagner <sewagner@redhat.com>
3 years agomgr/cephadm/services/osd: skip found osds that already have daemons
Sage Weil [Mon, 6 Dec 2021 15:19:57 +0000 (10:19 -0500)]
mgr/cephadm/services/osd: skip found osds that already have daemons

If we are trying to deploy new or newly-found osds, we can skip the ones
that already have cephadm daemons deployed.

Fixes: https://tracker.ceph.com/issues/53491
Signed-off-by: Sage Weil <sage@newdream.net>
3 years agomgr/cephadm: allow activation of OSDs that have previously started
Sage Weil [Mon, 6 Dec 2021 15:19:16 +0000 (10:19 -0500)]
mgr/cephadm: allow activation of OSDs that have previously started

When this code was introduced way back in ea987a0e56db106f7c76d11f86b3e602257f365e,
for some reason I was focused only on freshly created OSDs.  The
get_osd_uuid_map() helper is used by deploy_osd_daemons_for_existing_osds()
which is called not only by OSD creation but also by 'ceph cephadm
osd activate', which is meant to instantiate daemons for existing OSD
devices (e.g., devices that were reattached to a new server, or whose
/var/lib/ceph/$fsid/osd.$id directory was lost for some other reason.
However, if we ignore OSDs with up_from > 0, then we can't recreate a
daemon instance for such existing OSDs--arguably the most important ones,
since they may hold real data.

Fixes: https://tracker.ceph.com/issues/53491
Signed-off-by: Sage Weil <sage@newdream.net>
3 years agoMerge pull request #44293 from sebastian-philipp/extract-uid-gid-readable
Sebastian Wagner [Thu, 16 Dec 2021 15:22:31 +0000 (16:22 +0100)]
Merge pull request #44293 from sebastian-philipp/extract-uid-gid-readable

cephadm: make extract_uid_gid errors more readable

Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: Michael Fritch <mfritch@suse.com>
3 years agoMerge PR #44208 into master
Sage Weil [Thu, 16 Dec 2021 14:53:19 +0000 (09:53 -0500)]
Merge PR #44208 into master

* refs/pull/44208/head:
mgr/progress: avoid inefficient dump of all pg stats

Reviewed-by: Kamoltat Sirivadhna <ksirivad@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
3 years agoMerge PR #44211 into master
Sage Weil [Thu, 16 Dec 2021 14:52:24 +0000 (09:52 -0500)]
Merge PR #44211 into master

* refs/pull/44211/head:
mon: increase mon_down_mkfs_grace to 2m

Reviewed-by: Kefu Chai <kchai@redhat.com>
3 years agocrimson/osd: make PG::can_discard_replica_op() reusable for RecoveryBackend.
Radoslaw Zarzynski [Thu, 16 Dec 2021 09:57:24 +0000 (09:57 +0000)]
crimson/osd: make PG::can_discard_replica_op() reusable for RecoveryBackend.

Signed-off-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
3 years agocrimson/osd: removing an unneeded make_unique() 44331/head
Ronen Friedman [Thu, 16 Dec 2021 10:49:57 +0000 (10:49 +0000)]
crimson/osd: removing an unneeded make_unique()

As the desired lifetime of the object matches the lifetime if
it is allocated on the stack, and as no ownership is transferred,
there is no point in using a unique_ptr here.

And see Google's guidance (https://abseil.io/tips/187),
under "Common Anti-Pattern: Avoiding &".

Signed-off-by: Ronen Friedman <rfriedma@redhat.com>
3 years agoMerge pull request #43889 from Daniel-Pivonka/cephadm-reg-cred
Sebastian Wagner [Thu, 16 Dec 2021 10:08:04 +0000 (11:08 +0100)]
Merge pull request #43889 from Daniel-Pivonka/cephadm-reg-cred

mgr/cephadm: store contianer registry credentials in config-key

Reviewed-by: Sage Weil <sage@newdream.net>
Reviewed-by: Sebastian Wagner <sewagner@redhat.com>
3 years agoMerge pull request #44031 from adk3798/agent-thrash
Sebastian Wagner [Thu, 16 Dec 2021 10:07:03 +0000 (11:07 +0100)]
Merge pull request #44031 from adk3798/agent-thrash

 mgr/cephadm: agent: simplify handling of agent reports

Reviewed-by: Sebastian Wagner <sewagner@redhat.com>
3 years agoMerge pull request #44267 from phlogistonjohn/jjm-ceph-orch-yaml-issue206961
Sebastian Wagner [Thu, 16 Dec 2021 10:05:16 +0000 (11:05 +0100)]
Merge pull request #44267 from phlogistonjohn/jjm-ceph-orch-yaml-issue206961

python-common: add int value validation for count and count_per_host

Reviewed-by: Adam King <adking@redhat.com>
3 years agoMerge pull request #44269 from SUSE/wip-fix-mgr-cephadm-suse-cherrypy-requires
Tim Serong [Thu, 16 Dec 2021 05:15:13 +0000 (16:15 +1100)]
Merge pull request #44269 from SUSE/wip-fix-mgr-cephadm-suse-cherrypy-requires

ceph.spec.in: fix mgr-cephadm CherryPy requirement for SUSE builds

Reviewed-by: Nathan Cutler <ncutler@suse.com>
Reviewed-by: Michael Fritch <mfritch@suse.com>
Reviewed-by: Adam King <adking@redhat.com>
3 years agomgr/cephadm: Add snmp-gateway service support 43901/head
Paul Cuzner [Fri, 12 Nov 2021 03:16:59 +0000 (16:16 +1300)]
mgr/cephadm: Add snmp-gateway service support

Add a new snmp-gateway service to provide a bridge between
Prometheus and an SNMP management platform. The gateway
service uses https://github.com/maxwo/snmp_notifier to provide
an SNMP v2c and SNMP V3 support.

The SNMP V3 support mandates at least authentication, and also
offers authentication and privacy (encryption).

Fixes: https://tracker.ceph.com/issues/52920
Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
3 years agomgr/cephadm: Add unit tests for snmp-gateway support
Paul Cuzner [Fri, 12 Nov 2021 03:19:00 +0000 (16:19 +1300)]
mgr/cephadm: Add unit tests for snmp-gateway support

Adds tests to validate the deployed configuration given a known
input context, and check the parameters created based on input
various input scenarios

Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
3 years agoMerge pull request #44197 from josephsawaya/zns-segment-manager
Samuel Just [Thu, 16 Dec 2021 02:30:08 +0000 (18:30 -0800)]
Merge pull request #44197 from josephsawaya/zns-segment-manager

crimson/os/seastore: add ZNSSegmentManager

Reviewed-by: Samuel Just <sjust@redhat.com>
3 years agoMerge pull request #44323 from cbodley/wip-53252
Casey Bodley [Wed, 15 Dec 2021 19:23:40 +0000 (14:23 -0500)]
Merge pull request #44323 from cbodley/wip-53252

rgw/amqp: add default case to silence compiler warning

Reviewed-by: Yuval Lifshitz <ylifshit@redhat.com>
3 years agomgr/cephadm: agent: simplify handling of agent reports 44031/head
Adam King [Fri, 19 Nov 2021 18:58:02 +0000 (13:58 -0500)]
mgr/cephadm: agent: simplify handling of agent reports

Don't try to do extra things like checking other agents
or updating health checks when agents report. Rely on
serve loop for that

This also should help with thrashing we've been seeing

Signed-off-by: Adam King <adking@redhat.com>
3 years agorgw/amqp: add default case to silence compiler warning 44323/head
Casey Bodley [Wed, 15 Dec 2021 16:07:49 +0000 (11:07 -0500)]
rgw/amqp: add default case to silence compiler warning

Fixes: https://tracker.ceph.com/issues/53252
Signed-off-by: Casey Bodley <cbodley@redhat.com>
3 years agoMerge pull request #44131 from ceph/wip-yuriw-release-16.2.7-master
David Galloway [Wed, 15 Dec 2021 16:23:16 +0000 (11:23 -0500)]
Merge pull request #44131 from ceph/wip-yuriw-release-16.2.7-master

doc: 16.2.7 Release Notes

3 years agoMerge pull request #44136 from adk3798/agent-logging
Sebastian Wagner [Wed, 15 Dec 2021 13:38:45 +0000 (14:38 +0100)]
Merge pull request #44136 from adk3798/agent-logging

mgr/cephadm: agent: log response from mgr

Reviewed-by: Michael Fritch <mfritch@suse.com>
Reviewed-by: Sebastian Wagner <sewagner@redhat.com>
3 years agoMerge pull request #44249 from DimitriPapadopoulos/codespell
Sebastian Wagner [Wed, 15 Dec 2021 12:26:12 +0000 (13:26 +0100)]
Merge pull request #44249 from DimitriPapadopoulos/codespell

doc,man: Typos found by codespell

Reviewed-by: Laura Paduano <lpaduano@suse.com>
Reviewed-by: Sebastian Wagner <sewagner@redhat.com>
3 years agoMerge pull request #44309 from foadlind/fix_cephadm_doc
Sebastian Wagner [Wed, 15 Dec 2021 12:22:18 +0000 (13:22 +0100)]
Merge pull request #44309 from foadlind/fix_cephadm_doc

doc/cephadm/upgrade: correct example command

Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: Sebastian Wagner <sewagner@redhat.com>
3 years agodoc,man: typos found by codespell 44249/head
Dimitri Papadopoulos [Wed, 8 Dec 2021 11:43:27 +0000 (12:43 +0100)]
doc,man: typos found by codespell

Signed-off-by: Dimitri Papadopoulos <3234522+DimitriPapadopoulos@users.noreply.github.com>
3 years agomgr/cephadm: Updated docs for snmp-gateway support
Paul Cuzner [Fri, 12 Nov 2021 03:17:52 +0000 (16:17 +1300)]
mgr/cephadm: Updated docs for snmp-gateway support

Updated docs to show snmp-gateway usage. docs provide
guidance on SNMP versions supported and show CLI and
yaml deployment examples.

Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
3 years agomgr/cephadm: provide initial snmp gateway support
Paul Cuzner [Wed, 13 Oct 2021 23:35:31 +0000 (12:35 +1300)]
mgr/cephadm: provide initial snmp gateway support

This patch enables the cephadm binary
to deploy an SNMP gateway based on -
https://hub.docker.com/r/maxwo/snmp-notifier

Fixes: https://tracker.ceph.com/issues/52920
Signed-off-by: Paul Cuzner <pcuzner@redhat.com>
3 years agoMerge pull request #44216 from locallocal/master
Yuri Weinstein [Tue, 14 Dec 2021 23:21:24 +0000 (15:21 -0800)]
Merge pull request #44216 from locallocal/master

os/bluestore: don't need separate variable to mark hits when lookup oid.

Reviewed-by: Igor Fedotov <ifedotov@suse.com>
3 years agoMerge pull request #43770 from ifed01/wip-ifed-fix-53002
Yuri Weinstein [Tue, 14 Dec 2021 23:20:41 +0000 (15:20 -0800)]
Merge pull request #43770 from ifed01/wip-ifed-fix-53002

os/bluestore: avoid premature onode release.

Reviewed-by: Adam Kupczyk <akupczyk@redhat.com>
3 years agoMerge pull request #43412 from yaarith/devicehealth_fix_tz
Yuri Weinstein [Tue, 14 Dec 2021 23:19:24 +0000 (15:19 -0800)]
Merge pull request #43412 from yaarith/devicehealth_fix_tz

mgr/devicehealth: fix missing timezone from time delta calculation

Reviewed-by: Laura Flores <lflores@redhat.com>
3 years agocrimson/os/seastore: add ZNSSegmentManager 44197/head
Joseph Sawaya [Fri, 3 Dec 2021 17:28:21 +0000 (17:28 +0000)]
crimson/os/seastore: add ZNSSegmentManager

This commit adds the ZNSSegmentManager, so SeaStore can
interface with ZNS devices using the linux/blkzoned
library.

Signed-off-by: Joseph Sawaya <jsawaya@redhat.com>
3 years agodoc/releases/pacific.rst: Instructions for notable change 44131/head
David Galloway [Mon, 13 Dec 2021 21:54:30 +0000 (16:54 -0500)]
doc/releases/pacific.rst: Instructions for notable change

Signed-off-by: David Galloway <dgallowa@redhat.com>
3 years agoMerge pull request #44163 from rhcs-dashboard/fix-53451-master
Ernesto Puerta [Tue, 14 Dec 2021 16:42:56 +0000 (17:42 +0100)]
Merge pull request #44163 from rhcs-dashboard/fix-53451-master

mgr/dashboard: disable Promql test in ARM

Reviewed-by: Alfonso Martínez <almartin@redhat.com>
Reviewed-by: David Galloway <dgallowa@redhat.com>
Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: p-se <NOT@FOUND>
Reviewed-by: Pere Diaz Bou <pdiazbou@redhat.com>
Reviewed-by: sebastian-philipp <NOT@FOUND>
3 years agoMerge pull request #44295 from chrisphoffman/add-email-contact
Neha Ojha [Tue, 14 Dec 2021 15:22:38 +0000 (07:22 -0800)]
Merge pull request #44295 from chrisphoffman/add-email-contact

mailmap: Add Christopher Hoffman

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
3 years agoMerge pull request #44299 from kalebskeithley/dencode2
Casey Bodley [Tue, 14 Dec 2021 15:04:42 +0000 (10:04 -0500)]
Merge pull request #44299 from kalebskeithley/dencode2

rgw:cleanup/refactor json and xml encoders and decoders

Reviewed-by: Casey Bodley <cbodley@redhat.com>
Reviewed-by: Daniel Gryniewicz <dang@redhat.com>
3 years agoos/bluestore: avoid premature onode release. 43770/head
Igor Fedotov [Tue, 2 Nov 2021 12:03:39 +0000 (15:03 +0300)]
os/bluestore: avoid premature onode release.

This was observed when onode's removal is followed by reading
and the latter causes object release before the removal is finalized.
The root cause is an improper 'pinned' state assessment in Onode::get

More detailed overview is:
At some point Onode::get() might face the case when nref == 2 and pinned = true
which means parallel incomplete put is running on the onode - ref count is
decremented but pinned state is still unmodified (and even lock hasn't been
acquired yet).
This might finally result in two puts racing over the same onode with nref == 2
which finally results in a premature onode release:
  // nref =3, pinned = 1
  // Thread 1                   Thread 2
  //   o->put()                   o->get()
  //   --nref(n = 2, pinned=1)
  //                              nref++ (n=3, pinned = 1)
  //                              return
  //                              ...
  //                              o->put()
  //                              --nref(n = 2)
  //                              pinned = 0,
  //                              --nref(n = 1)
  //                              ocs->_unpin_and_rm(o) -> o->put()
  //                                ...
  //                                --nref(n = 0)
  //                                release o
  //  o->c->get_onode_cache()
  //  FAULT!
  //
The suggested fix is to introduce additional atomic counter tracking
running put() functions. And permit onode release when both regular
nref and put_nref are both equal to zero.

Fixes: https://tracker.ceph.com/issues/53002
Signed-off-by: Igor Fedotov <igor.fedotov@croit.io>
3 years agoMerge pull request #44029 from cbodley/wip-rgw-beast-header-limit
Casey Bodley [Tue, 14 Dec 2021 13:21:05 +0000 (08:21 -0500)]
Merge pull request #44029 from cbodley/wip-rgw-beast-header-limit

rgw/beast: add max_header_size option with 16k default, up from 4k

Reviewed-by: Mark Kogan <mkogan@redhat.com>
3 years agocephadm: make extract_uid_gid errors more readable 44293/head
Sebastian Wagner [Mon, 13 Dec 2021 11:54:22 +0000 (12:54 +0100)]
cephadm: make extract_uid_gid errors more readable

Avoid dumping a traceback

Signed-off-by: Sebastian Wagner <sewagner@redhat.com>
3 years agodoc/cephadm/upgrade: correct example command 44309/head
Foad Lind [Tue, 14 Dec 2021 13:01:58 +0000 (14:01 +0100)]
doc/cephadm/upgrade: correct example command

Update the ceph version used in the example upgrade command to match the one mentioned in the text above it.

Signed-off-by: Foad Lind <foad.lind@citynetwork.eu>
3 years agoMerge pull request #44235 from xxhdx1985126/wip-onode-omap-hint-optimization
Samuel Just [Tue, 14 Dec 2021 08:10:31 +0000 (00:10 -0800)]
Merge pull request #44235 from xxhdx1985126/wip-onode-omap-hint-optimization

crimson/os/seastore: avoid onode/omap laddr hint conflicts as much as possible

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
Reviewed-by: Chunmei Liu <chunmei.liu@intel.com>
3 years agocrimson/os/seastore: make onode data/metadata laddr space reservation configurable 44235/head
Xuehan Xu [Sun, 12 Dec 2021 08:43:30 +0000 (16:43 +0800)]
crimson/os/seastore: make onode data/metadata laddr space reservation configurable

Signed-off-by: Xuehan Xu <xxhdx1985126@gmail.com>
3 years agocrimson/os/seastore: randomize metadata laddr hints
Samuel Just [Fri, 10 Dec 2021 06:22:44 +0000 (06:22 +0000)]
crimson/os/seastore: randomize metadata laddr hints

This should prevent omap and xattr extent allocations from clumping near
the onode's hint.  Additionally, only generate them past the default
16MB object_data_handler reservation.

Signed-off-by: Samuel Just <sjust@redhat.com>
3 years agocrimson/common: DONOT call crimson::get_logger() if NDEBUG is defined
Xuehan Xu [Tue, 7 Dec 2021 06:55:51 +0000 (14:55 +0800)]
crimson/common: DONOT call crimson::get_logger() if NDEBUG is defined

Avoid debug related perf degredation

Signed-off-by: Xuehan Xu <xxhdx1985126@gmail.com>
3 years agoMerge pull request #44141 from xxhdx1985126/wip-53409
Samuel Just [Tue, 14 Dec 2021 06:15:36 +0000 (22:15 -0800)]
Merge pull request #44141 from xxhdx1985126/wip-53409

crimson/os/seastore/segment_cleaner: correct available space calculation

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
Reviewed-by: Chunmei Liu <chunmei.liu@intel.com>
3 years agoMerge pull request #44290 from liu-chunmei/crimson-fix-no-pg
Samuel Just [Tue, 14 Dec 2021 06:08:10 +0000 (22:08 -0800)]
Merge pull request #44290 from liu-chunmei/crimson-fix-no-pg

crimson/osd: fix interruptor assert when no pg in peering_event

Reviewed-by: Samuel Just <sjust@redhat.com>
3 years agocrimson/os/seastore/segment_cleaner: correct available space calculation 44141/head
Xuehan Xu [Fri, 26 Nov 2021 06:39:06 +0000 (14:39 +0800)]
crimson/os/seastore/segment_cleaner: correct available space calculation

Current available space calculation is wrong, it just counts the space occupied
by extents, deltas and other stuff are not taken into account.

Fixes: https://tracker.ceph.com/issues/53409
Signed-off-by: Xuehan Xu <xxhdx1985126@gmail.com>
3 years agocrimson/common: redirect interruptible future debug output to default subsys
Xuehan Xu [Tue, 30 Nov 2021 04:43:34 +0000 (12:43 +0800)]
crimson/common: redirect interruptible future debug output to default subsys

Signed-off-by: Xuehan Xu <xxhdx1985126@gmail.com>
3 years agocrimson/os/seastore/segment_cleaner: add perf metrics for better monitoring
Xuehan Xu [Tue, 9 Nov 2021 02:39:23 +0000 (10:39 +0800)]
crimson/os/seastore/segment_cleaner: add perf metrics for better monitoring

Signed-off-by: Xuehan Xu <xxhdx1985126@gmail.com>
3 years agoMerge pull request #44015 from liewegas/fix-44012
Yuri Weinstein [Mon, 13 Dec 2021 23:20:51 +0000 (15:20 -0800)]
Merge pull request #44015 from liewegas/fix-44012

osd/PeeringState: separate history's pruub from pg's

Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: Samuel Just <sjust@redhat.com>
3 years agoMerge pull request #43864 from yaarith/fix-config-notify
Yuri Weinstein [Mon, 13 Dec 2021 23:20:11 +0000 (15:20 -0800)]
Merge pull request #43864 from yaarith/fix-config-notify

mgr/telemetry: fix waiting for mgr to warm up

Reviewed-by: Sage Weil <sage@redhat.com>
3 years agoMerge pull request #43857 from aclamk/wip-aclamk-omap-clone-assert
Yuri Weinstein [Mon, 13 Dec 2021 23:19:14 +0000 (15:19 -0800)]
Merge pull request #43857 from aclamk/wip-aclamk-omap-clone-assert

os/bluestore: Protect _clone against sudden omap format changes

Reviewed-by: Igor Fedotov <ifedotov@suse.com>
3 years agodoc/releases/pacific.rst: add core updates for 16.2.7
Neha Ojha [Tue, 7 Dec 2021 17:47:22 +0000 (17:47 +0000)]
doc/releases/pacific.rst: add core updates for 16.2.7

16.2.7 fixes https://tracker.ceph.com/issues/53062, so remove the
"big scary warning" from the top of the pacific release page. We continue
to warn about this bug under the 16.2.6 section and in
https://docs.ceph.com/en/latest/releases/pacific/#upgrading-from-octopus-or-nautilus.

Signed-off-by: Neha Ojha <nojha@redhat.com>
3 years agodoc/releases/index.rst: change ref to 16.2.7
Neha Ojha [Tue, 7 Dec 2021 17:19:03 +0000 (17:19 +0000)]
doc/releases/index.rst: change ref to 16.2.7

Signed-off-by: Neha Ojha <nojha@redhat.com>
3 years agodoc: 16.2.7 change log => 3 PRs added
Yuri Weinstein [Mon, 6 Dec 2021 21:57:53 +0000 (13:57 -0800)]
doc: 16.2.7 change log => 3 PRs added

Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
3 years agodoc: 16.2.7 Release Notes (dashboard)
Ernesto Puerta [Wed, 1 Dec 2021 20:32:17 +0000 (21:32 +0100)]
doc: 16.2.7 Release Notes (dashboard)

Signed-off-by: Ernesto Puerta <epuertat@redhat.com>
3 years agodoc: 16.2.7 Release Notes
Yuri Weinstein [Mon, 29 Nov 2021 16:36:57 +0000 (08:36 -0800)]
doc: 16.2.7 Release Notes

Signed-off-by: Yuri Weinstein <yweinste@redhat.com>
3 years agodoc/rados/operations: Updated rados docs to include 44280/head
Christopher Hoffman [Mon, 13 Dec 2021 21:15:08 +0000 (21:15 +0000)]
doc/rados/operations: Updated rados docs to include
changes to health-checks in MANY_OBJECTS_PER_PG
warning when autoscale is on.

Signed-off-by: Christopher Hoffman <choffman@redhat.com>
3 years agoMerge pull request #44298 from adamemerson/wip-leveldb-release-note
Neha Ojha [Mon, 13 Dec 2021 21:15:59 +0000 (13:15 -0800)]
Merge pull request #44298 from adamemerson/wip-leveldb-release-note

doc: Add PendingReleaseNote for LevelDB removal

Reviewed-by: Neha Ojha <nojha@redhat.com>
3 years agomon: Omit MANY_OBJECTS_PER_PG warning when autoscaler is on
Christopher Hoffman [Fri, 10 Dec 2021 14:54:19 +0000 (14:54 +0000)]
mon: Omit MANY_OBJECTS_PER_PG warning when autoscaler is on
Add a conditional statement when autoscaler is
set to ON to omit message when about pool having
many more objects per pg than cluster average.

Fixes: https://tracker.ceph.com/issues/53516
Signed-off-by: Christopher Hoffman <choffman@redhat.com>
3 years agomgr/cephadm: agent: log response from mgr 44136/head
Adam King [Mon, 29 Nov 2021 20:58:26 +0000 (15:58 -0500)]
mgr/cephadm: agent: log response from mgr

Signed-off-by: Adam King <adking@redhat.com>
3 years agorgw:cleanup/refactor json and xml encoders and decoders 44299/head
Kaleb S. KEITHLEY [Mon, 13 Dec 2021 19:33:52 +0000 (14:33 -0500)]
rgw:cleanup/refactor json and xml encoders and decoders

move the encoder and decoder methods into their associated class
files to eliminate undefined references to the class vtable

https://tracker.ceph.com/issues/53596

Signed-off-by: Kaleb S. KEITHLEY <kkeithle@redhat.com>
3 years agodoc: Add PendingReleaseNote for LevelDB removal 44298/head
Adam C. Emerson [Mon, 13 Dec 2021 18:04:52 +0000 (13:04 -0500)]
doc: Add PendingReleaseNote for LevelDB removal

Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
3 years agomgr/dashboard: disable Promql test in ARM 44163/head
Ernesto Puerta [Wed, 1 Dec 2021 18:51:44 +0000 (19:51 +0100)]
mgr/dashboard: disable Promql test in ARM

Temporarily disable this test while debugging the issue (since https://github.com/ceph/ceph/pull/43669
originally passed the ARM check).

Fixes: https://tracker.ceph.com/issues/53451
Signed-off-by: Ernesto Puerta <epuertat@redhat.com>
3 years agoMerge pull request #44009 from cbodley/wip-qa-cls-rgw-gc
Casey Bodley [Mon, 13 Dec 2021 16:18:49 +0000 (11:18 -0500)]
Merge pull request #44009 from cbodley/wip-qa-cls-rgw-gc

qa/rgw: run ceph_test_cls_rgw_gc in rgw/verify suite

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>
3 years agomailmap: Add Christopher Hoffman 44295/head
Christopher Hoffman [Mon, 13 Dec 2021 14:40:15 +0000 (14:40 +0000)]
mailmap: Add Christopher Hoffman

Signed-off-by: Christopher Hoffman <choffman@redhat.com>
3 years agoMerge pull request #42905 from sebastian-philipp/service_spec_no_redundant_placement
Sebastian Wagner [Mon, 13 Dec 2021 11:11:09 +0000 (12:11 +0100)]
Merge pull request #42905 from sebastian-philipp/service_spec_no_redundant_placement

python-common: improve OSD spec error messages

Reviewed-by: Adam King <adking@redhat.com>
Reviewed-by: Alfonso Martínez <almartin@redhat.com>
Reviewed-by: Michael Fritch <mfritch@suse.com>
3 years agoMerge pull request #44218 from guits/guits-issue-44356
Guillaume Abrioux [Mon, 13 Dec 2021 08:47:01 +0000 (09:47 +0100)]
Merge pull request #44218 from guits/guits-issue-44356

ceph-volume: fix error 'KeyError' with inventory

3 years agoMerge pull request #44219 from guits/guits-issue-53425
Guillaume Abrioux [Mon, 13 Dec 2021 08:46:44 +0000 (09:46 +0100)]
Merge pull request #44219 from guits/guits-issue-53425

ceph-volume: fix tags dict output in `lvm list`

3 years agoMerge pull request #44281 from athanatos/sjust/wip-53555
Samuel Just [Mon, 13 Dec 2021 08:43:34 +0000 (00:43 -0800)]
Merge pull request #44281 from athanatos/sjust/wip-53555

crimson/os/seastore: index lba pins atomically with addition to cache

Reviewed-by: Xuehan Xu <xuxuehan@360.cn>
Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
3 years agocrimson/osd: fix interruptor assert when no pg in peering_event 44290/head
chunmei-liu [Sun, 12 Dec 2021 06:16:36 +0000 (22:16 -0800)]
crimson/osd: fix interruptor assert when no pg in peering_event

when no pg created, can't use interruptor

Signed-off-by: chunmei-liu <chunmei.liu@intel.com>
3 years agoceph.spec.in: fix mgr-cephadm CherryPy requirement for SUSE builds 44269/head
Tim Serong [Fri, 10 Dec 2021 07:43:25 +0000 (18:43 +1100)]
ceph.spec.in: fix mgr-cephadm CherryPy requirement for SUSE builds

Commit 78983ad0d0c added cherrypy to ceph-mgr-cephadm's Requires,
but this needs to be split out into distro-specific sections due
to subtle/irritating naming differences.

Fixes: 78983ad0d0cce422da32dc4876ac186f6d32c3f5
Signed-off-by: Tim Serong <tserong@suse.com>
3 years agoMerge pull request #44225 from liewegas/fix-53506
Yuri Weinstein [Fri, 10 Dec 2021 23:35:42 +0000 (15:35 -0800)]
Merge pull request #44225 from liewegas/fix-53506

osd/OSDMapMapping: fix spurious threadpool timeout errors

Reviewed-by: xie xingguo <xie.xingguo@zte.com.cn>
3 years agoMerge pull request #44025 from ljflores/wip-remove-aggregated-perf-data
Yuri Weinstein [Fri, 10 Dec 2021 23:35:09 +0000 (15:35 -0800)]
Merge pull request #44025 from ljflores/wip-remove-aggregated-perf-data

mgr/telemetry: remove aggregated perf metrics from the perf channel

Reviewed-by: Nizamudeen A <nia@redhat.com>
Reviewed-by: Yaarit Hatuka <yaarit@redhat.com>
Reviewed-by: Alfonso Martínez <almartin@redhat.com>
3 years agoMerge pull request #43612 from adamemerson/wip-unleveling
Yuri Weinstein [Fri, 10 Dec 2021 23:33:05 +0000 (15:33 -0800)]
Merge pull request #43612 from adamemerson/wip-unleveling

build: Remove LevelDB support

Reviewed-by: Kefu Chai <kchai@redhat.com>
3 years agocrimson/os/seastore/cache: init extents prior to read 44281/head
Samuel Just [Fri, 10 Dec 2021 22:31:00 +0000 (14:31 -0800)]
crimson/os/seastore/cache: init extents prior to read

Thus should ensure that any captured members of extent_init_func are
still valid at the cost of not being able to access the contents of the
extent at invocation time.  With this, we should be able to rely on any
logical extents/lba extents in the cache having validly initialized lba
pins.

Fixes: https://tracker.ceph.com/issues/53555
Signed-off-by: Samuel Just <sjust@redhat.com>
3 years agocrimson/os/seatore/.../lba_btree: update get_*_node to add_pin without reading node...
Samuel Just [Fri, 10 Dec 2021 22:14:23 +0000 (14:14 -0800)]
crimson/os/seatore/.../lba_btree: update get_*_node to add_pin without reading node contents

This will allow us to do add_pin before we perform the actual extent read.

Signed-off-by: Samuel Just <sjust@redhat.com>
3 years agocrimson/os/seastore: pass depth/begin/end to get_*_node
Samuel Just [Fri, 10 Dec 2021 21:51:31 +0000 (13:51 -0800)]
crimson/os/seastore: pass depth/begin/end to get_*_node

We'll need this to populate the pin fields prior to read.

Signed-off-by: Samuel Just <sjust@redhat.com>
3 years agocrimson/os/seastore/transaction_manager: clarify that init lambda only runs on new...
Samuel Just [Fri, 10 Dec 2021 21:42:03 +0000 (21:42 +0000)]
crimson/os/seastore/transaction_manager: clarify that init lambda only runs on new extents

Signed-off-by: Samuel Just <sjust@redhat.com>
3 years agocrimson/os/seastore/cache: remove unnecessary forward
Samuel Just [Thu, 9 Dec 2021 08:37:24 +0000 (00:37 -0800)]
crimson/os/seastore/cache: remove unnecessary forward

Signed-off-by: Samuel Just <sjust@redhat.com>
3 years agopython-common: add test inputs verifying count & count-per-host >= 1 44267/head
John Mulligan [Fri, 10 Dec 2021 13:19:59 +0000 (08:19 -0500)]
python-common: add test inputs verifying count & count-per-host >= 1

This adds unit new test inputs, local to python-common that verify the
correct error messages are raised when count == 0 and count_per_host ==
0.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
3 years agopython-common: make count & count-per-host >= 1 checks consistent
John Mulligan [Fri, 10 Dec 2021 13:16:19 +0000 (08:16 -0500)]
python-common: make count & count-per-host >= 1 checks consistent

The previous version of the validate function had a incorrect error
statement that suggested the count must be >1  when it should have
been >=1. This confusion was possibly due to using "n < 1" on
one line and "n <= 0" on another line. Since both values are supposed
to be integers this change corrects the error message and makes
the comparisons on the lines both use "n < 1" (since I find it easier
to see that the check "n < 1" is the inverse of the error text
asserting "n >= 1").

Signed-off-by: John Mulligan <jmulligan@redhat.com>
3 years agopython-common: add unit test func for invalid yaml inputs
John Mulligan [Wed, 8 Dec 2021 20:37:11 +0000 (15:37 -0500)]
python-common: add unit test func for invalid yaml inputs

I didn't find a preexisting test function for this so I added a
new test that is fed yaml snippets and expected error messages.
This verifies some of the recently added validation for
count and cound_per_host under the placement spec.

Signed-off-by: John Mulligan <jmulligan@redhat.com>
3 years agopython-common: add int value validation for count and count_per_host
John Mulligan [Wed, 8 Dec 2021 20:33:54 +0000 (15:33 -0500)]
python-common: add int value validation for count and count_per_host

Add additional validation for the count and count_per_host fields
sourced from YAML.

Fixes: https://tracker.ceph.com/issues/50524
Signed-off-by: John Mulligan <jmulligan@redhat.com>
3 years agoMerge pull request #44262 from cbodley/wip-53325
Casey Bodley [Fri, 10 Dec 2021 16:48:39 +0000 (11:48 -0500)]
Merge pull request #44262 from cbodley/wip-53325

test/rgw: disable cls_rgw_gc test cases with defer_gc()

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>