]> git.apps.os.sepia.ceph.com Git - ceph.git/log
ceph.git
5 years agomgr/insights: make 'insights prune-health-history 0' zap current state 35214/head
Sage Weil [Thu, 30 Jan 2020 17:40:48 +0000 (11:40 -0600)]
mgr/insights: make 'insights prune-health-history 0' zap current state

The self._health_slot value needs to be updated too if the current key
is removed.

Fixes: https://tracker.ceph.com/issues/43886
Signed-off-by: Sage Weil <sage@redhat.com>
(cherry picked from commit 04f25b26c5aea5b4cf5cb94e939d6fba74228b27)

5 years agoMerge pull request #35180 from smithfarm/wip-45642-nautilus
Yuri Weinstein [Fri, 22 May 2020 18:58:48 +0000 (11:58 -0700)]
Merge pull request #35180 from smithfarm/wip-45642-nautilus

nautilus: tests: src/test/compressor: Add missing gtest

Reviewed-by: Kefu Chai <kchai@redhat.com>
Reviewed-by: Willem Jan Withagen <wjw@digiware.nl>
5 years agoMerge pull request #35047 from smithfarm/wip-44686-nautilus
Yuri Weinstein [Fri, 22 May 2020 18:53:02 +0000 (11:53 -0700)]
Merge pull request #35047 from smithfarm/wip-44686-nautilus

nautilus: qa/standalone/osd/osd-backfill-stats.sh: get_latest_osdmap to propagate map change

Reviewed-by: David Zafman <dzafman@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: xie xingguo <xie.xingguo@zte.com.cn>
5 years agoMerge pull request #35049 from smithfarm/wip-45224-nautilus
Yuri Weinstein [Fri, 22 May 2020 18:46:57 +0000 (11:46 -0700)]
Merge pull request #35049 from smithfarm/wip-45224-nautilus

nautilus: ceph_test_rados_api_{watch_notify,misc}: tolerate some timeouts

Reviewed-by: Brad Hubbard <bhubbard@redhat.com>
5 years agoMerge pull request #35048 from smithfarm/wip-44689-nautilus
Yuri Weinstein [Fri, 22 May 2020 18:45:50 +0000 (11:45 -0700)]
Merge pull request #35048 from smithfarm/wip-44689-nautilus

nautilus: test: Add flush_pg_stats to avoid race with getting num_shards_repaired

Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: David Zafman <dzafman@redhat.com>
5 years agoMerge pull request #35028 from shyukri/wip-44163-nautilus
Yuri Weinstein [Fri, 22 May 2020 18:42:28 +0000 (11:42 -0700)]
Merge pull request #35028 from shyukri/wip-44163-nautilus

nautilus:test/fio: bump to fio 3.15

Reviewed-by: Igor Fedotov <ifedotov@suse.com>
5 years agoMerge pull request #35024 from smithfarm/wip-43919-nautilus
Yuri Weinstein [Fri, 22 May 2020 18:40:25 +0000 (11:40 -0700)]
Merge pull request #35024 from smithfarm/wip-43919-nautilus

nautilus: osd: dispatch_context and queue split finish on early bail-out

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
5 years agoMerge pull request #34957 from smithfarm/wip-44841-nautilus
Yuri Weinstein [Fri, 22 May 2020 18:39:30 +0000 (11:39 -0700)]
Merge pull request #34957 from smithfarm/wip-44841-nautilus

nautilus: osd/PeeringState: do not trim pg log past last_update_ondisk

Reviewed-by: Samuel Just <sjust@redhat.com>
Reviewed-by: xie xingguo <xie.xingguo@zte.com.cn>
5 years agoMerge pull request #34823 from votdev/wip-45323-nautilus
Yuri Weinstein [Fri, 22 May 2020 18:38:11 +0000 (11:38 -0700)]
Merge pull request #34823 from votdev/wip-45323-nautilus

nautilus: mgr/dashboard: monitoring menu entry should indicate firing alerts

Reviewed-by: Patrick Seidensal <pnawracay@suse.com>
Reviewed-by: Stephan Müller <smueller@suse.com>
5 years agoMerge pull request #35050 from smithfarm/wip-43773-nautilus
Yuri Weinstein [Fri, 22 May 2020 15:31:20 +0000 (08:31 -0700)]
Merge pull request #35050 from smithfarm/wip-43773-nautilus

nautilus: qa: log warning on scrub error

Reviewed-by: Ramana Raja <rraja@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
5 years agoMerge pull request #35000 from smithfarm/wip-45496-nautilus
Yuri Weinstein [Fri, 22 May 2020 15:30:41 +0000 (08:30 -0700)]
Merge pull request #35000 from smithfarm/wip-45496-nautilus

nautilus: cephfs: Client: fix Finisher assert failure

Reviewed-by: Xiubo Li <xiubli@redhat.com>
Reviewed-by: Ramana Raja <rraja@redhat.com>
5 years agoMerge pull request #34784 from Vicente-Cheng/wip-45221-nautilus
Yuri Weinstein [Fri, 22 May 2020 15:29:21 +0000 (08:29 -0700)]
Merge pull request #34784 from Vicente-Cheng/wip-45221-nautilus

nautilus: cephfs-journal-tool: correctly parse --dry_run argument

Reviewed-by: Ramana Raja <rraja@redhat.com>
5 years agoMerge pull request #34783 from Vicente-Cheng/wip-45217-nautilus
Yuri Weinstein [Fri, 22 May 2020 15:28:40 +0000 (08:28 -0700)]
Merge pull request #34783 from Vicente-Cheng/wip-45217-nautilus

nautilus: qa/cephfs: update TestClusterFull:test_barrier

Reviewed-by: Ramana Raja <rraja@redhat.com>
5 years agoMerge pull request #34435 from Vicente-Cheng/wip-44483-nautilus
Yuri Weinstein [Fri, 22 May 2020 15:28:12 +0000 (08:28 -0700)]
Merge pull request #34435 from Vicente-Cheng/wip-44483-nautilus

nautilus: mds: handle ceph_assert on blacklisting

Reviewed-by: Ramana Raja <rraja@redhat.com>
5 years agoMerge pull request #34338 from Vicente-Cheng/wip-44478-nautilus
Yuri Weinstein [Fri, 22 May 2020 15:24:44 +0000 (08:24 -0700)]
Merge pull request #34338 from Vicente-Cheng/wip-44478-nautilus

nautilus: mds: Using begin() and empty() to iterate the xlist

Reviewed-by: Yan, Zheng <zyan@redhat.com>
Reviewed-by: Ramana Raja <rraja@redhat.com>
5 years agosrc/test/compressor: Add missing gtest 35180/head
Willem Jan Withagen [Wed, 4 Mar 2020 21:50:44 +0000 (22:50 +0100)]
src/test/compressor: Add missing gtest

Signed-off-by: Willem Jan Withagen <wjw@digiware.nl>
(cherry picked from commit a11c72d6d6c72b6801cf58c202ea92f77af14ebf)

5 years agoMerge pull request #35093 from neha-ojha/wip-too-few-pgs-nautilus
Yuri Weinstein [Thu, 21 May 2020 15:44:04 +0000 (08:44 -0700)]
Merge pull request #35093 from neha-ojha/wip-too-few-pgs-nautilus

nautilus: qa/suites/upgrade: disable more min pg per osd warnings

Reviewed-by: Brad Hubbard <bhubbard@redhat.com>
Reviewed-by: Yuri Weinstein <yweinste@redhat.com>
5 years agoqa/suites/upgrade: disable more min pg per osd warnings 35093/head
Neha Ojha [Fri, 15 May 2020 21:23:31 +0000 (21:23 +0000)]
qa/suites/upgrade: disable more min pg per osd warnings

This follows 58eb3edc8478c993c5446475df58d659d3f6d356.

This change is not cherry-picked from master since it already has
1ac34a5ea3d1aca299b02e574b295dd4bf6167f4.

Signed-off-by: Neha Ojha <nojha@redhat.com>
5 years agoMerge pull request #35004 from smithfarm/wip-44490-nautilus
Yuri Weinstein [Wed, 20 May 2020 15:20:41 +0000 (08:20 -0700)]
Merge pull request #35004 from smithfarm/wip-44490-nautilus

nautilus: compressor/lz4: work around bug in liblz4 versions <1.8.2

Reviewed-by: Neha Ojha <nojha@redhat.com>
5 years agoMerge pull request #34983 from jecluis/wip-45402-nautilus
Yuri Weinstein [Wed, 20 May 2020 15:20:12 +0000 (08:20 -0700)]
Merge pull request #34983 from jecluis/wip-45402-nautilus

nautilus: mon/OSDMonitor: allow trimming maps even if osds are down

Reviewed-by: Neha Ojha <nojha@redhat.com>
5 years agoMerge pull request #34916 from sseshasa/wip-45391-nautilus
Yuri Weinstein [Wed, 20 May 2020 15:19:27 +0000 (08:19 -0700)]
Merge pull request #34916 from sseshasa/wip-45391-nautilus

nautilus: mon/OSDMonitor: Always tune priority cache manager memory on all mons

Reviewed-by: Neha Ojha <nojha@redhat.com>
5 years agoMerge pull request #34882 from hillpd/wip-45359-nautilus
Yuri Weinstein [Wed, 20 May 2020 15:18:56 +0000 (08:18 -0700)]
Merge pull request #34882 from hillpd/wip-45359-nautilus

nautilus: rados: prevent ShardedOpWQ suicide_grace drop when waiting for work.

Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
5 years agoMerge pull request #34829 from rhcs-dashboard/wip-45329-nautilus
Yuri Weinstein [Wed, 20 May 2020 15:17:48 +0000 (08:17 -0700)]
Merge pull request #34829 from rhcs-dashboard/wip-45329-nautilus

nautilus: monitoring: fix decimal precision in Grafana %percentages

Reviewed-by: Lenz Grimmer <lgrimmer@suse.com>
Reviewed-by: Stephan Müller <smueller@suse.com>
5 years agoMerge pull request #34711 from smithfarm/wip-45126-nautilus
Yuri Weinstein [Wed, 20 May 2020 15:16:03 +0000 (08:16 -0700)]
Merge pull request #34711 from smithfarm/wip-45126-nautilus

nautilus: os/bluestore: fix extent leak after main device expand.

Reviewed-by: Igor Fedotov <ifedotov@suse.com>
5 years agoMerge pull request #34692 from votdev/wip-44475-nautilus
Yuri Weinstein [Wed, 20 May 2020 15:15:12 +0000 (08:15 -0700)]
Merge pull request #34692 from votdev/wip-44475-nautilus

nautilus: mgr/dashboard: Not able to restrict bucket creation for new user

Reviewed-by: Alfonso Martínez <almartin@redhat.com>
Reviewed-by: Laura Paduano <lpaduano@suse.com>
5 years agoMerge pull request #34794 from smithfarm/wip-45064-nautilus
Yuri Weinstein [Tue, 19 May 2020 15:32:51 +0000 (08:32 -0700)]
Merge pull request #34794 from smithfarm/wip-45064-nautilus

nautilus: os/bluestore: fix 'unused' calculation

Reviewed-by: xie xingguo <xie.xingguo@zte.com.cn>
Reviewed-by: Igor Fedotov <ifedotov@suse.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
5 years agoMerge pull request #34400 from votdev/wip-44203-nautilus
Yuri Weinstein [Tue, 19 May 2020 15:31:47 +0000 (08:31 -0700)]
Merge pull request #34400 from votdev/wip-44203-nautilus

nautilus: mgr/dashboard: RGW port autodetection does not support "Beast" RGW frontend

Reviewed-by: Alfonso Martínez <almartin@redhat.com>
Reviewed-by: Laura Paduano <lpaduano@suse.com>
5 years agoMerge pull request #35045 from smithfarm/wip-45501-nautilus
Yuri Weinstein [Mon, 18 May 2020 15:37:11 +0000 (08:37 -0700)]
Merge pull request #35045 from smithfarm/wip-45501-nautilus

nautilus: rgw: anonomous swift to obj that dont exist should 401

Reviewed-by: Casey Bodley <cbodley@redhat.com>
5 years agoMerge pull request #35019 from smithfarm/wip-45517-nautilus
Yuri Weinstein [Mon, 18 May 2020 15:36:40 +0000 (08:36 -0700)]
Merge pull request #35019 from smithfarm/wip-45517-nautilus

nautilus: rgw: fix multipart upload's error response

Reviewed-by: Casey Bodley <cbodley@redhat.com>
5 years agoMerge pull request #34780 from Vicente-Cheng/wip-44370-nautilus
Yuri Weinstein [Mon, 18 May 2020 14:44:32 +0000 (07:44 -0700)]
Merge pull request #34780 from Vicente-Cheng/wip-44370-nautilus

nautilus: msg/async/rdma: fix bug event center is blocked by rdma construct connection for transport ib sync msg

Reviewed-by: Kefu Chai <kchai@redhat.com>
5 years agoMerge pull request #34761 from smithfarm/wip-44674-nautilus
Yuri Weinstein [Fri, 15 May 2020 23:16:10 +0000 (16:16 -0700)]
Merge pull request #34761 from smithfarm/wip-44674-nautilus

nautilus: mgr/balancer: tolerate pgs outside of target weight map

Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: xie xingguo <xie.xingguo@zte.com.cn>
5 years agoMerge pull request #34516 from shyukri/wip-43920-nautilus
Yuri Weinstein [Fri, 15 May 2020 23:14:44 +0000 (16:14 -0700)]
Merge pull request #34516 from shyukri/wip-43920-nautilus

nautilus: common/bl: fix memory corruption in bufferlist::claim_append()

Reviewed-by: Radoslaw Zarzynski <rzarzyns@redhat.com>
5 years agoMerge pull request #34515 from shyukri/wip-43087-nautilus
Yuri Weinstein [Fri, 15 May 2020 23:13:56 +0000 (16:13 -0700)]
Merge pull request #34515 from shyukri/wip-43087-nautilus

nautilus: bluestore: prevent BlueFS::dirty_files from being leaked when syncing metadata

Reviewed-by: Igor Fedotov <ifedotov@suse.com>
5 years agoMerge pull request #34475 from rhcs-dashboard/wip-44997-nautilus
Yuri Weinstein [Fri, 15 May 2020 23:12:38 +0000 (16:12 -0700)]
Merge pull request #34475 from rhcs-dashboard/wip-44997-nautilus

nautilus: rpm: add python3-saml as install dependency

Reviewed-by: Alfonso Martínez <almartin@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
5 years agoMerge pull request #35046 from smithfarm/wip-45365-nautilus
Yuri Weinstein [Fri, 15 May 2020 18:20:17 +0000 (11:20 -0700)]
Merge pull request #35046 from smithfarm/wip-45365-nautilus

nautilus: qa/workunits/rbd: fix list-mapped filter in unmap_device

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
Reviewed-by: Mykola Golub <mgolub@mirantis.com>
5 years agotest/objectstore/test_bluestore_types: add tests for unused bitmap 34794/head
Igor Fedotov [Mon, 3 Feb 2020 20:21:28 +0000 (23:21 +0300)]
test/objectstore/test_bluestore_types: add tests for unused bitmap

Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>
Signed-off-by: Igor Fedotov <ifedotov@suse.com>
(cherry picked from commit 985497227acbdd46fefc4de5aef75aaa9630514b)

5 years agoos/bluestore: do not use 'unused' bitmap if makes no sense.
Igor Fedotov [Mon, 3 Feb 2020 15:50:50 +0000 (18:50 +0300)]
os/bluestore: do not use 'unused' bitmap if makes no sense.

The processing logic which relies on 'unused' bitmap makes sense for
bluestore setup where min alloc size is different from device block
size. Now omitting if that's not true.

Signed-off-by: Igor Fedotov <ifedotov@suse.com>
(cherry picked from commit f960a018f938c70ea3d7f031611cdc5a6b9955b1)

5 years agoos/bluestore: fix unused 'tail' calculation.
Igor Fedotov [Mon, 3 Feb 2020 15:36:21 +0000 (18:36 +0300)]
os/bluestore: fix unused 'tail' calculation.

Fixes: https://tracker.ceph.com/issues/41901
Signed-off-by: Igor Fedotov <ifedotov@suse.com>
(cherry picked from commit c91cc3a8d689995e8554c41c9b0f652d9a3458da)

Conflicts:
    src/test/objectstore/store_test.cc
- omitted test case "TEST_P(StoreTestSpecificAUSize, ReproBug41901Test)"
  from the backport, because nautilus does not have the
  "bluestore_debug_enforce_settings" option

5 years agoMerge pull request #34986 from smithfarm/wip-45499-nautilus
Yuri Weinstein [Wed, 13 May 2020 18:58:28 +0000 (11:58 -0700)]
Merge pull request #34986 from smithfarm/wip-45499-nautilus

nautilus: rgw: fix some list buckets handle leak

Reviewed-by: Casey Bodley <cbodley@redhat.com>
5 years agoMerge pull request #35003 from smithfarm/wip-44651-nautilus
Yuri Weinstein [Wed, 13 May 2020 18:56:29 +0000 (11:56 -0700)]
Merge pull request #35003 from smithfarm/wip-44651-nautilus

nautilus: qa/suites/rbd: removed OpenStack tempest test cases

Reviewed-by: Jason Dillaman <dillaman@redhat.com>
5 years agoMerge pull request #35002 from smithfarm/wip-45502-nautilus
Yuri Weinstein [Wed, 13 May 2020 18:55:28 +0000 (11:55 -0700)]
Merge pull request #35002 from smithfarm/wip-45502-nautilus

nautilus: rgw: fix lc does not delete objects that do not have exactly the same tags as the rule

Reviewed-by: Casey Bodley <cbodley@redhat.com>
5 years agoMerge pull request #35001 from smithfarm/wip-45486-nautilus
Yuri Weinstein [Wed, 13 May 2020 18:54:46 +0000 (11:54 -0700)]
Merge pull request #35001 from smithfarm/wip-45486-nautilus

nautilus: rgw: radosgw-admin: fix infinite loops in 'datalog list'

Reviewed-by: Casey Bodley <cbodley@redhat.com>
5 years agoMerge pull request #34954 from smithfarm/wip-45436-nautilus
Yuri Weinstein [Wed, 13 May 2020 18:53:42 +0000 (11:53 -0700)]
Merge pull request #34954 from smithfarm/wip-45436-nautilus

nautilus: rgw: dmclock: wait until the request is handled

Reviewed-by: Casey Bodley <cbodley@redhat.com>
5 years agoMerge pull request #34815 from vumrao/wip-vumrao-45316
Yuri Weinstein [Wed, 13 May 2020 18:53:07 +0000 (11:53 -0700)]
Merge pull request #34815 from vumrao/wip-vumrao-45316

nautilus: radosgw-admin: add support for --bucket-id in bucket stats command

Reviewed-by: Casey Bodley <cbodley@redhat.com>
5 years agoMerge pull request #33675 from ivancich/nautilus-tune-sharded-bucket-listing
Yuri Weinstein [Wed, 13 May 2020 18:52:19 +0000 (11:52 -0700)]
Merge pull request #33675 from ivancich/nautilus-tune-sharded-bucket-listing

nautilus: rgw: tune sharded bucket listing

Reviewed-by: Matt Benjamin <mbenjami@redhat.com>
Reviewed-by: Casey Bodley <cbodley@redhat.com>
5 years agoqa: log warning on scrub error 35050/head
Patrick Donnelly [Mon, 20 Jan 2020 19:23:09 +0000 (11:23 -0800)]
qa: log warning on scrub error

Instead of printing the (useless) traceback, just print a warning about
ignoring the failure. The traceback makes it harder to search for the
real problem in the teuthology log.

Fixes: https://tracker.ceph.com/issues/43718
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit b7454e423620e829e7417cdfca1faf5cd91dec3f)

Conflicts:
qa/tasks/mon_thrash.py
- whereas master has "self.manager.raw_cluster_cmd('mon', 'scrub')" in
  the try block, in nautilus it is only "self.manager.raw_cluster_cmd('scrub')"

5 years agoceph_test_watch_notify: try notify 10x if ALLOW_TIMEOUTS is set 35049/head
Sage Weil [Tue, 17 Mar 2020 16:51:56 +0000 (11:51 -0500)]
ceph_test_watch_notify: try notify 10x if ALLOW_TIMEOUTS is set

See https://tracker.ceph.com/issues/44062

Signed-off-by: Sage Weil <sage@redhat.com>
(cherry picked from commit 2183edcc65c76bff075f3650ea4ebc73edc55b16)

5 years agoceph_test_rados_api_misc: ShutdownRace timeout if ALLOW_TIMEOUTS is set
Sage Weil [Tue, 17 Mar 2020 16:52:32 +0000 (11:52 -0500)]
ceph_test_rados_api_misc: ShutdownRace timeout if ALLOW_TIMEOUTS is set

See https://tracker.ceph.com/issues/44582

Signed-off-by: Sage Weil <sage@redhat.com>
(cherry picked from commit 3a57e70028e6226a41ef295217160c038441cd70)

5 years agoqa/suites/rados/verify: set ALLOW_TIMEOUTS for workunits
Sage Weil [Tue, 17 Mar 2020 16:45:17 +0000 (11:45 -0500)]
qa/suites/rados/verify: set ALLOW_TIMEOUTS for workunits

When running under valgrind (and thrashing) things can be slow.  Tell
tests in case they need to tolerate timeouts.

Signed-off-by: Sage Weil <sage@redhat.com>
(cherry picked from commit 40a7bcea706b72388056e254db752a44d1a59fcc)

5 years agoqa/suites/rados/verify/validater/valgrind: tolerate SLOW_OPS
Sage Weil [Tue, 17 Mar 2020 23:10:37 +0000 (18:10 -0500)]
qa/suites/rados/verify/validater/valgrind: tolerate SLOW_OPS

Signed-off-by: Sage Weil <sage@redhat.com>
(cherry picked from commit 7c19c1534badf00adf756e3a4e342e3d009a63ca)

5 years agoqa/suites/rados/verify: whitelist MON_DOWN when using valgrind
Sage Weil [Sun, 29 Sep 2019 15:27:01 +0000 (10:27 -0500)]
qa/suites/rados/verify: whitelist MON_DOWN when using valgrind

Signed-off-by: Sage Weil <sage@redhat.com>
(cherry picked from commit 52d706c75f2c70c80bfc4d8182217d5831bfbf4e)

5 years agoqa/suites/rados/verify/validater/valgrind: less bluestore logging
Sage Weil [Tue, 17 Mar 2020 21:36:46 +0000 (16:36 -0500)]
qa/suites/rados/verify/validater/valgrind: less bluestore logging

Signed-off-by: Sage Weil <sage@redhat.com>
(cherry picked from commit baeb051910744bfd7a279776afc44c8f9aa6a6e6)

5 years agoqa/suites/rados/verify/validater: increase heartbeat grace
Sage Weil [Tue, 17 Mar 2020 21:35:21 +0000 (16:35 -0500)]
qa/suites/rados/verify/validater: increase heartbeat grace

Signed-off-by: Sage Weil <sage@redhat.com>
(cherry picked from commit 4fda9d50f09d527262fd65eab9b9cff3fd700aad)

5 years agoRevert "qa/suites/rados/verify/validator/valgrind: debug refs = 5"
Nathan Cutler [Wed, 13 May 2020 15:45:27 +0000 (17:45 +0200)]
Revert "qa/suites/rados/verify/validator/valgrind: debug refs = 5"

This reverts commit 65e81e6eb4f136bf21b67e5de10ab49f028f9e95.

Signed-off-by: Nathan Cutler <ncutler@suse.com>
Conflicts:
    reverted the nautilus commit instead of cherry-picking the revert of
    the master commit

5 years agotest: Add flush_pg_stats to avoid race with getting num_shards_repaired 35048/head
David Zafman [Fri, 6 Mar 2020 04:25:37 +0000 (04:25 +0000)]
test: Add flush_pg_stats to avoid race with getting num_shards_repaired

Fixes: https://tracker.ceph.com/issues/44439
Signed-off-by: David Zafman <dzafman@redhat.com>
(cherry picked from commit e509b7c7d0d762c20927720b08d3a5b1f8980fb4)

5 years agoqa/standalone/osd/osd-backfill-stats.sh: get_latest_osdmap to propagate map change 35047/head
Neha [Wed, 18 Mar 2020 22:57:41 +0000 (22:57 +0000)]
qa/standalone/osd/osd-backfill-stats.sh: get_latest_osdmap to propagate map change

Fixes: https://tracker.ceph.com/issues/44518
Signed-off-by: Neha Ojha <nojha@redhat.com>
(cherry picked from commit 6edd1cb68654702aa0698b50708514105ac01632)

5 years agoqa/workunits/rbd: fix list-mapped filter in unmap_device 35046/head
Mykola Golub [Tue, 28 Apr 2020 10:45:33 +0000 (11:45 +0100)]
qa/workunits/rbd: fix list-mapped filter in unmap_device

In recent versions `rbd list-mapped` does not print the white space
at the end of the line.

Fixes: https://tracker.ceph.com/issues/45305
Signed-off-by: Mykola Golub <mgolub@suse.com>
(cherry picked from commit d667e385a12ea4125dbf74da97a9ba4911a3f7bc)

5 years agoosd: dispatch_context and queue split finish on early bail-out 35024/head
Sage Weil [Tue, 28 Jan 2020 19:33:49 +0000 (13:33 -0600)]
osd: dispatch_context and queue split finish on early bail-out

If we bail out of advance_pg early because there is an upcoming merge, we
still need to dispatch_context() on rctx before we drop the PG lock.  And
the rctx that we submit needs to include the on_applied finisher comit
to call _finish_splits.

This is noticeable (at least) when there is a split and merge that are
both known.  When we process the split, the new child is added to new_pgs.
When we get to the merge epoch, we stop early and take the bail-out
path.

Fix by adding a dispatch_context call for this path.  And further make sure
that both dispatch_context callers in this function queue up the
new_pgs event.

Fixes: https://tracker.ceph.com/issues/43825
Signed-off-by: Sage Weil <sage@redhat.com>
(cherry picked from commit 14c4dbb03fe3c7fb4a0a6fa40dcb89e89fb66b3c)

Conflicts:
src/osd/OSD.cc
- where master says "rctx.transaction.register_on_applied", replaced
  with nautilus equivalent: "rctx->transaction->register_on_applied"
- where master says "dispatch_context(rctx, pg, pg->get_osdmap(),
  &handle)", replaced with nautilus equivalent:
  "dispatch_context_transaction(*rctx, pg, &handle)"

5 years agorgw: anonomous swift to obj that dont exist should 401 35045/head
Matthew Oliver [Wed, 26 Feb 2020 06:15:22 +0000 (06:15 +0000)]
rgw: anonomous swift to obj that dont exist should 401

Currently, if you attempt to GET and object in the Swift API that
doesn't exist and you don't pass a `X-Auth-Token` it will 404 instead of
401.

This is actually a rather big problem as it means someone can leak data
out of the cluster, not object data itself, but if an object exists or
not.

This is caused by the SwiftAnonymousEngine's, frankly wide open
is_applicable acceptance. When we get to checking the bucket or object
for user acceptance we  deal with it properly, but if the object doesn't
exsit, because the user has been "authorised" rgw returns a 404.

Why? Because we always override the user with the Swift account.
Meaning as far as checks are concerned the auth user is the user, not
and anonymous user.

I assume this is because a swift container could have world readable
reads or writes and in slight s3 and swift api divergents can make these
interesting edge cases leak in.

This patch doesn't change the user to the swift account if they are
anonymous. So we can do some anonymous checks when it suits later in the
request processing path.

Fixes: https://tracker.ceph.com/issues/43617
Signed-off-by: Matthew Oliver <moliver@suse.com>
(cherry picked from commit b03d9754e113d24221f1ce0bac17556ab0017a8a)

Conflicts:
src/rgw/rgw_swift_auth.h
- where master has "rgw_user(s->account_name)", nautilus has
  "s->account_name" only

5 years agoMerge pull request #34640 from rhcs-dashboard/wip-44327-nautilus
Laura Paduano [Wed, 13 May 2020 12:20:39 +0000 (14:20 +0200)]
Merge pull request #34640 from rhcs-dashboard/wip-44327-nautilus

nautilus: mgr/dashboard: fix backport #33764

5 years agoMerge pull request #34450 from rhcs-dashboard/wip-44980-nautilus
Laura Paduano [Wed, 13 May 2020 12:16:57 +0000 (14:16 +0200)]
Merge pull request #34450 from rhcs-dashboard/wip-44980-nautilus

nautilus: monitoring: Fix pool capacity incorrect

Reviewed-by: Alfonso Martínez <almartin@redhat.com>
Reviewed-by: Patrick Seidensal <pseidensal@suse.com>
Reviewed-by: Laura Paduano <lpaduano@suse.com>
5 years agoMerge pull request #34847 from bk201/wip-45273-nautilus
Laura Paduano [Wed, 13 May 2020 12:15:05 +0000 (14:15 +0200)]
Merge pull request #34847 from bk201/wip-45273-nautilus

nautilus: mgr/dashboard: fixing RBD purge error in backend

Reviewed-by: Alfonso Martínez <almartin@redhat.com>
Reviewed-by: Ernesto Puertat <epuertat@redhat.com>
Reviewed-by: Laura Paduano <lpaduano@suse.com>
5 years agoMerge pull request #35016 from rhcs-dashboard/45515-fix-py2-strptime-error
Lenz Grimmer [Wed, 13 May 2020 07:47:41 +0000 (09:47 +0200)]
Merge pull request #35016 from rhcs-dashboard/45515-fix-py2-strptime-error

Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Laura Paduano <lpaduano@suse.com>
Reviewed-by: Volker Theile <vtheile@suse.com>
5 years agoMerge pull request #31586 from bk201/fix-42651
Lenz Grimmer [Wed, 13 May 2020 07:46:02 +0000 (09:46 +0200)]
Merge pull request #31586 from bk201/fix-42651

Reviewed-by: Alfonso Martínez <almartin@redhat.com>
Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Laura Paduano <lpaduano@suse.com>
5 years agomgr/dashboard: Fix I18N errors 34823/head
Volker Theile [Wed, 13 May 2020 06:08:48 +0000 (08:08 +0200)]
mgr/dashboard: Fix I18N errors

We were just checking if the translation sources were correct and ignore if the
extraction simply failed.

Fixes: https://tracker.ceph.com/issues/45428
Signed-off-by: Tiago Melo <tmelo@suse.com>
(cherry picked from commit 897e6a352d1623a045801fabcd3ddaa34bf0e498)

Conflicts:
src/pybind/mgr/dashboard/frontend/src/app/core/navigation/navigation/navigation.component.html
src/pybind/mgr/dashboard/run-frontend-unittests.sh

Removed because that files are not available in Nautilus:
src/pybind/mgr/dashboard/frontend/src/app/shared/services/password-policy.service.ts
src/pybind/mgr/dashboard/frontend/src/app/ceph/cluster/telemetry/telemetry.component.ts
src/pybind/mgr/dashboard/frontend/src/app/ceph/shared/smart-list/smart-list.component.ts

5 years agomgr/dashboard: monitoring menu entry should indicate firing alerts
Volker Theile [Fri, 24 Apr 2020 10:18:25 +0000 (12:18 +0200)]
mgr/dashboard: monitoring menu entry should indicate firing alerts

Fixes: https://tracker.ceph.com/issues/43081
Signed-off-by: Volker Theile <vtheile@suse.com>
(cherry picked from commit f88fb46d680127963c0d04b797b5742ada6024b4)

Conflicts:
src/pybind/mgr/dashboard/frontend/src/app/core/navigation/navigation/navigation.component.html
src/pybind/mgr/dashboard/frontend/src/app/core/navigation/navigation/navigation.component.scss
src/pybind/mgr/dashboard/frontend/src/app/core/navigation/navigation/navigation.component.spec.ts

5 years agotest/fio: bump to fio 3.15 35028/head
Igor Fedotov [Mon, 11 Nov 2019 22:19:29 +0000 (01:19 +0300)]
test/fio: bump to fio 3.15

Signed-off-by: Igor Fedotov <ifedotov@suse.com>
(cherry picked from commit 710e5da85b3066247f2541c88127a28e1ce5fa08)

5 years agorgw: fix multipart upload's error response 35019/head
xxcs [Wed, 22 Jan 2020 02:55:51 +0000 (10:55 +0800)]
rgw: fix multipart upload's error response
Fixes: https://tracker.ceph.com/issues/43751
Signed-off-by: GaryHyg <huygbj@inspur.com>
(cherry picked from commit 8c2a867c4b97e47ff707f148d692e32237fb7b43)

5 years agomgr/dashboard: fix py2 strptime ImportError (not thread safe) 35016/head
Alfonso Martínez [Tue, 12 May 2020 12:04:41 +0000 (14:04 +0200)]
mgr/dashboard: fix py2 strptime ImportError (not thread safe)

Fixes: https://tracker.ceph.com/issues/45515
Signed-off-by: Alfonso Martínez <almartin@redhat.com>
5 years agoMerge pull request #34884 from tchaikov/nautilus-upgrade-mimic-retry
Yuri Weinstein [Mon, 11 May 2020 21:54:14 +0000 (14:54 -0700)]
Merge pull request #34884 from tchaikov/nautilus-upgrade-mimic-retry

nautilus: qa/tasks/ceph.py: do not use option mimic does not understand

Reviewed-by: Yuri Weinstein <yweinste@redhat.com>
Reviewed-by: Neha Ojha <nojha@redhat.com>
5 years agoMerge pull request #34873 from tchaikov/nautilus-qa-upgrade-from-rados-wo-object...
Yuri Weinstein [Mon, 11 May 2020 21:53:31 +0000 (14:53 -0700)]
Merge pull request #34873 from tchaikov/nautilus-qa-upgrade-from-rados-wo-object-size

nautilus: qa/suites/rados: use default objectsize for upgrade tests

Reviewed-by: Yuri Weinstein <yweinste@redhat.com>
Reviewed-by: Brad Hubbard <bhubbard@redhat.com>
5 years agoMerge pull request #34870 from badone/wip-ms_dispatch-kcore-selinux-denial
Yuri Weinstein [Mon, 11 May 2020 21:51:45 +0000 (14:51 -0700)]
Merge pull request #34870 from badone/wip-ms_dispatch-kcore-selinux-denial

nautilus: selinux: Allow getattr access to /proc/kcore

Reviewed-by: Neha Ojha <nojha@redhat.com>
Reviewed-by: Yuri Weinstein <yweinste@redhat.com>
5 years agorgw: fix some list buckets handle leak 34986/head
Tianshan Qu [Wed, 21 Aug 2019 04:33:37 +0000 (12:33 +0800)]
rgw: fix some list buckets handle leak

Fixes: https://tracker.ceph.com/issues/44283
Signed-off-by: Tianshan Qu <tianshan@xsky.com>
(cherry picked from commit 8955ca2c0c804f6c43bb33c97bbe263bac1cc36d)

Conflicts:
    src/rgw/rgw_bucket.cc
- changed two instances of "store->ctl()->meta.mgr->" to "store->meta_mgr->"
  to align with nautilus usage

5 years agocompressor/lz4: rebuild if buffer is not contiguous 35004/head
Sage Weil [Thu, 27 Feb 2020 15:30:27 +0000 (09:30 -0600)]
compressor/lz4: rebuild if buffer is not contiguous

In older versions of lz4 (specifically < 1.8.2) bit errors
can be introduced when compressing from fragmented memory.  The lz4
bug was fixed by this lz4 commit:

commit af127334670a5e7b710bbd6adb71aa7c3ef0cd72
Author: Yann Collet <cyan@fb.com>
Date:   Sat May 5 18:24:11 2018 -0700

    fixed frametest error

    The error can be reproduced using following command :
    ./frametest -v -i100000000 -s1659 -t31096808

    It's actually a bug in the stream LZ4 API,
    when starting a new stream
    and providing a first chunk to complete with size < MINMATCH.
    In which case, the chunk becomes a dictionary.
    No hash was generated and stored,
    but the chunk is accessible as default position 0 points to dictStart,
    and position 0 is still within MAX_DISTANCE.
    Then, next attempt to read 32-bits from position 0 fails.

    The issue would have been mitigated by starting from index 64 KB,
    effectively eliminating position 0 as too far away.

    The proper fix is to eliminate such "dictionary" as too small.
    Which is what this patch does.

This is a workaround to rebuild our input buffer into a continguos buffer
if it is not already contiguous.

Fixes: https://tracker.ceph.com/issues/39525
Signed-off-by: Sage Weil <sage@redhat.com>
(cherry picked from commit 8e80f5f678915f18bdf873502283bf3e2020f904)

5 years agoqa/objectstore: test all bluestore compression algorithms
Sage Weil [Thu, 27 Feb 2020 15:55:22 +0000 (09:55 -0600)]
qa/objectstore: test all bluestore compression algorithms

Signed-off-by: Sage Weil <sage@redhat.com>
(cherry picked from commit 8c8fb59dcf02da7e2af8f3f5b1e49d73bb1dbe28)

5 years agotest/compressor: test round trip of an osdmap
Dan van der Ster [Wed, 26 Feb 2020 20:50:07 +0000 (21:50 +0100)]
test/compressor: test round trip of an osdmap

Check if the compressors can compress/decompress a bufferlist which is not word
aligned, such as a freshly-encoded osdmap.

Related-to: https://tracker.ceph.com/issues/39525
Signed-off-by: Dan van der Ster <daniel.vanderster@cern.ch>
(cherry picked from commit 1b1c71a2c28c38d3e28f006b1cb164435a653c02)

5 years agoqa/suites/rbd: removed OpenStack tempest test cases 35003/head
Jason Dillaman [Wed, 11 Mar 2020 19:34:26 +0000 (15:34 -0400)]
qa/suites/rbd: removed OpenStack tempest test cases

The OpenStack tempests tests do not stay stable and break approximately
every six months. Remove the test suite for now.

Signed-off-by: Jason Dillaman <dillaman@redhat.com>
(cherry picked from commit 232d48f511a0f39221bd51e012cd60ed6a1240db)

Conflicts:
qa/suites/rbd/openstack/workloads/devstack-tempest-gate.yaml
- some difference compared to master, but the entire test is being deleted so
  I didn't examine it further

5 years agofix rgw lc does not delete objects that do not have exactly the same tags as the... 35002/head
Or Friedmann [Wed, 4 Sep 2019 13:34:52 +0000 (16:34 +0300)]
fix rgw lc does not delete objects that do not have exactly the same tags as the rule

It is possible that object will have multiple tags more than the rule that applied on.
Object is not being deleted if not all tags exactly the same as in the rule.

S3-tests: ceph/s3-tests#303
Fixes: https://tracker.ceph.com/issues/41652
Signed-off-by: Or Friedmann <ofriedma@redhat.com>
(cherry picked from commit ebb806ba83fa9d68f14194b1f9886f21f7195a3d)

5 years agoradosgw-admin: fix infinite loop in 'datalog list' 35001/head
Casey Bodley [Tue, 24 Mar 2020 19:01:31 +0000 (15:01 -0400)]
radosgw-admin: fix infinite loop in 'datalog list'

affects datalog shards with more than 1000 entries

Fixes: https://tracker.ceph.com/issues/44751
Signed-off-by: Casey Bodley <cbodley@redhat.com>
(cherry picked from commit 166f38fe70e6afd2831bdb87a27c8fe757899b95)

Conflicts:
src/rgw/rgw_admin.cc
- nautilus has "log->list_entries" where master has
  "datalog_svc->list_entries"

5 years agoClient: make sure the Finisher's mutex lock not held during it being distructed 35000/head
Xiubo Li [Tue, 31 Mar 2020 09:09:45 +0000 (05:09 -0400)]
Client: make sure the Finisher's mutex lock not held during it being distructed

The objecter_finisher is already started in Client::Client(), but
in the failure path when initializing and starting the Client object,
we may not get a chance to call the Client::shutdown() to stop the
Finisher thread, which maybe still holding the mutex lock in it. Then
when destrucing the Finisher object the pthread_mutex_destroy() will
fail.

This fix will delay the objecter_finisher thread to start in ::init()
until we're ready to call Client::shutdown on any errors instead.

Fixes: https://tracker.ceph.com/issues/44389
Signed-off-by: Xiubo Li <xiubli@redhat.com>
(cherry picked from commit fbff4ee153f17da51c3b6675eb0616f2b2062f5e)

Conflicts:
src/client/Client.cc
- nautilus uses client_lock.Lock()/client_lock.Unlock() instead of
  std::lock_guard in a code block

5 years agoMerge pull request #34767 from smithfarm/wip-45212-nautilus
Yuri Weinstein [Mon, 11 May 2020 14:54:51 +0000 (07:54 -0700)]
Merge pull request #34767 from smithfarm/wip-45212-nautilus

nautilus: client: reset requested_max_size if file write is not wanted

Reviewed-by: Ramana Raja <rraja@redhat.com>
Reviewed-by: Greg Farnum <gfarnum@redhat.com>
Reviewed-by: Yan, Zheng <zyan@redhat.com>
5 years agoMerge pull request #34679 from kotreshhr/nautilus-subvolume-info
Yuri Weinstein [Mon, 11 May 2020 14:53:39 +0000 (07:53 -0700)]
Merge pull request #34679 from kotreshhr/nautilus-subvolume-info

nautilus: mgr/volumes: Add interface to get subvolume metadata

Reviewed-by: Ramana Raja <rraja@redhat.com>
5 years agoMerge pull request #34470 from Vicente-Cheng/wip-44655-nautilus
Yuri Weinstein [Mon, 11 May 2020 14:52:55 +0000 (07:52 -0700)]
Merge pull request #34470 from Vicente-Cheng/wip-44655-nautilus

nautilus: qa: fix various py3 cephfs qa bugs x2

Reviewed-by: Kefu Chai <kchai@redhat.com>
Reviewed-by: Ramana Raja <rraja@redhat.com>
5 years agomon/OSDMonitor: ensure lec only accounts for up osds 34983/head
Joao Eduardo Luis [Tue, 21 Nov 2017 10:18:24 +0000 (10:18 +0000)]
mon/OSDMonitor: ensure lec only accounts for up osds

If we also consider down osds, we may very well be in a healthy state
but keeping maps as far back as the last epoch when a given osd went
down. If said osd stays down for eons, we will be keeping bajillions of
maps that we shouldn't.

Signed-off-by: Joao Eduardo Luis <joao@suse.de>
(cherry picked from commit e62269c8929e414284ad0773c4a3c82e43735e4e)

5 years agoqa/standalone: exercise osdmon's last epoch clean
Joao Eduardo Luis [Mon, 20 Nov 2017 15:57:10 +0000 (15:57 +0000)]
qa/standalone: exercise osdmon's last epoch clean

Signed-off-by: Joao Eduardo Luis <joao@suse.de>
(cherry picked from commit 3d682c21f6246276fa30d179766a49f0eb78d73a)

5 years agomon/OSDMonitor: dump last epoch clean info on report
Joao Eduardo Luis [Tue, 21 Nov 2017 10:15:10 +0000 (10:15 +0000)]
mon/OSDMonitor: dump last epoch clean info on report

Signed-off-by: Joao Eduardo Luis <joao@suse.de>
(cherry picked from commit bd2e5c62757c17d00f30620c043ed06abb8a0bca)

5 years agosuites/rados/thrash: add rados-high-concurrency 34957/head
Samuel Just [Fri, 13 Mar 2020 22:58:44 +0000 (15:58 -0700)]
suites/rados/thrash: add rados-high-concurrency

Signed-off-by: Samuel Just <sjust@redhat.com>
(cherry picked from commit ceb8e457e7c3bd1ce1c14833c091f517a024d6d5)

Conflicts:
qa/tasks/radosbench.py
- size is treated differently in nautilus

5 years agoPrimaryLogPG: add assert that we aren't trimming past last_update_ondisk
Samuel Just [Fri, 13 Mar 2020 22:43:58 +0000 (15:43 -0700)]
PrimaryLogPG: add assert that we aren't trimming past last_update_ondisk

Signed-off-by: Samuel Just <sjust@redhat.com>
(cherry picked from commit d4b1cc61e6526d325fd759f98e13e5a10523f5f7)

Conflicts:
src/osd/PeeringState.h
- not introducing this function since it is just a getter and it's not
  clear where it should go in nautilus
        src/osd/PrimaryLogPG.h
- use last_update_ondisk directly instead of via getter function

5 years agoqa/osd-recovery: pass osd_pg_log_trim_min = 0 to exercise short pg logs
xie xingguo [Fri, 13 Mar 2020 00:45:52 +0000 (08:45 +0800)]
qa/osd-recovery: pass osd_pg_log_trim_min = 0 to exercise short pg logs

we have osd_min_pg_log_entries to 2 (good) but not osd_pg_log_trim_min
which defaults to 100. Thus, even on those tests we're only rarely vulnerable.
Reset osd_min_pg_log_entries to 0 to make sure we really
would keep a minimal pg log in hand.

Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>
(cherry picked from commit a9412ba25cb65816e85f8e414b399ff5e4db905c)

5 years agoqa/short_pg_log: pass osd_pg_log_trim_min = 0 to exercise short pg logs
xie xingguo [Thu, 12 Mar 2020 23:59:07 +0000 (07:59 +0800)]
qa/short_pg_log: pass osd_pg_log_trim_min = 0 to exercise short pg logs

we have osd_min_pg_log_entries to 2 (good) but not osd_pg_log_trim_min
which defaults to 100. Thus, even on those tests we're only rarely vulnerable.
Reset osd_min_pg_log_entries to 0 to make sure we really
keep a minimal pg log in hand.

Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>
(cherry picked from commit b73b93c6f0a9317fbed4a311931d159c9a0af9d7)

5 years agoosd/PeeringState: do not trim pg log past last_update_ondisk
xie xingguo [Thu, 12 Mar 2020 10:01:45 +0000 (18:01 +0800)]
osd/PeeringState: do not trim pg log past last_update_ondisk

Trimming past last_update_ondisk would be really bad, e.g.,
a new interval change would cancel&redo a previous op, and if
we trim past last_update_ondisk, there could be potential
object inconsistencies as log merging won't necessarily be able
to find all divergent entries later (we lost track of the unfinished
op that should really be reverted).

Fixes: https://tracker.ceph.com/issues/44532
Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>
(cherry picked from commit eef1928e0153f09dc2522bb885b0ad9f5c65f4bd)

Conflicts:
src/osd/PeeringState.cc
- changes applied manually in src/osd/PrimaryLogPG.cc

5 years agoMerge pull request #34107 from yuvalif/notification-backports-nautilus
Yuri Weinstein [Thu, 7 May 2020 15:58:05 +0000 (08:58 -0700)]
Merge pull request #34107 from yuvalif/notification-backports-nautilus

nautilus: rgw/notifications: backporting features and bug fix

Reviewed-by: Casey Bodley <cbodley@redhat.com>
5 years agorgw: dmclock: wait until the request is handled. 34954/head
GaryHyg [Mon, 7 Oct 2019 02:17:34 +0000 (10:17 +0800)]
rgw: dmclock: wait until the request is handled.

Fixes: https://tracker.ceph.com/issues/42217
Signed-off-by: GaryHyg <huygbj@inspur.com>
(cherry picked from commit ec507626624a66a3a81d7ea65d0b8d78d9b8a221)

5 years agoMerge pull request #34915 from tchaikov/nautilus-mimic-x-disable-min-pg-per-osd-warning
Yuri Weinstein [Wed, 6 May 2020 22:42:12 +0000 (15:42 -0700)]
Merge pull request #34915 from tchaikov/nautilus-mimic-x-disable-min-pg-per-osd-warning

nautilus: qa/suites/upgrade: disable min pg per osd warning

Reviewed-by: Yuri Weinstein <yweinste@redhat.com>
5 years agoMerge pull request #34771 from smithfarm/wip-45210-nautilus
Ramana Raja [Wed, 6 May 2020 17:23:35 +0000 (22:53 +0530)]
Merge pull request #34771 from smithfarm/wip-45210-nautilus

nautilus: ceph-fuse: link to libfuse3 and pass "-o big_writes" to libfuse if libfuse < 3.0.0

Reviewed-by: Zheng Yan <zyan@redhat.com>
Reviewed-by: Xiubo Li <xiubli@redhat.com>
Reviewed-by: Ramana Raja <rraja@redhat.com>
5 years agorgw: clean up address 0-length listing results... 33675/head
J. Eric Ivancich [Fri, 10 Jan 2020 19:12:35 +0000 (14:12 -0500)]
rgw: clean up address 0-length listing results...

Some minor clean-ups to the previous commit, including adjust logging
messages, rename variable, convert a #define to a constexpr (and
adjust its scope).

Signed-off-by: J. Eric Ivancich <ivancich@redhat.com>
(cherry picked from commit 4470ca8a3ff22aeb140e0c3be396e5f1bb25aa26)

5 years agorgw: address 0-length listing results when non-vis entries dominate
J. Eric Ivancich [Thu, 13 Feb 2020 01:38:44 +0000 (20:38 -0500)]
rgw: address 0-length listing results when non-vis entries dominate

A change to advance the marker in RGWRados::cls_bucket_list_ordered to
the last entry visited rather than the final entry in list to push
progress as far as possible.

Since non-vis entries tend to cluster on the same shard, such as
during incomplete multipart uploads, this can severely limit the
number of entries returned by a call to
RGWRados::cls_bucket_list_ordered since once that shard has provided
all its members, we must stop. This interacts with a recent
optimization to reduce the number of entries requested from each
shard. To address this the number of attempts is sent as a parameter,
so the number of entries requested from each shard can grow with each
attempt. Currently the growth is linear but perhaps exponential growth
(capped at number of entries requested) should be considered.

Previously RGWRados::Bucket::List::list_objects_ordered was capped at
2 attempts, but now we keep attempting to insure we make forward
progress and return entries when some exist. If we fail to make
forward progress, we log the error condition and stop looping.

Additional logging, mostly at level 20, is added to the two key
functions involved in ordered bucket listing to make it easier to
follow the logic and address potential future issues that might arise.

Additionally modify attempt number based on how many results were
received.

Change the per-shard request number, so it grows exponentially rather
than linearly as the attempts go up.

Signed-off-by: J. Eric Ivancich <ivancich@redhat.com>
(cherry picked from commit 28bd8bada4236060db6d5aed6b1eb345ab507890)

5 years agorgw: reduce per-shard entry count during ordered bucket listing
J. Eric Ivancich [Mon, 14 Oct 2019 20:21:35 +0000 (16:21 -0400)]
rgw: reduce per-shard entry count during ordered bucket listing

Currently, if a client requests the 1000 next entries from a bucket,
each bucket index shard will receive a request for the 1000 next
entries. When there are hundreds, thousands, or tens of thousands of
bucket index shards, this results in a huge amplification of the
request, even though only 1000 entries will be returned.

These changes reduce the per-bucket index shard requests. These also
allow re-requests in edge cases where all of one shard's returned
entries are consumed. Finally these changes improve the determination
of whether the resulting list is truncated.

Signed-off-by: J. Eric Ivancich <ivancich@redhat.com>
(cherry picked from commit 729c0ba1361b76ecace94793b956354486b3dcd8)

5 years agoMerge pull request #34817 from tchaikov/nautilus-44668
Ramana Raja [Wed, 6 May 2020 08:40:07 +0000 (14:10 +0530)]
Merge pull request #34817 from tchaikov/nautilus-44668

nautilus: qa: use small default pg count for CephFS pools

Reviewed-by: Ramana Raja <rraja@redhat.com>
5 years agoMerge pull request #34563 from Vicente-Cheng/wip-45050-nautilus
Ramana Raja [Wed, 6 May 2020 08:38:56 +0000 (14:08 +0530)]
Merge pull request #34563 from Vicente-Cheng/wip-45050-nautilus

nautilus: mgr: force purge normal ceph entities from service map

Reviewed-by: Ramana Raja <rraja@redhat.com>