The calls to remove a bucket had parameters to specify a prefix and
delimiter, which does not make sense; buckets are removed irrespective
of prefix and delimiter. Those parameters were a carry-over from
existing Swift protocol logic, so the functions and calls are adjusted
to remove them. Additionally, those same parameters were removed for
aborting incomplete multipart uploads.
Additionally, a bug is fixed in which, during bucket removal, multipart
uploads were removed only when the prefix was non-empty.
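A minimal sketch of the shape of the change, with illustrative names
only (the actual rgw_sal interfaces differ):

    #include <string>

    // Illustrative declarations; not the actual rgw_sal signatures.
    // Before: prefix/delimiter were threaded through even though bucket
    // removal ignores them.
    int remove_bucket_old(const std::string& bucket,
                          const std::string& prefix,
                          const std::string& delimiter,
                          bool delete_children);

    // After: the meaningless parameters are dropped, here and in the
    // path that aborts incomplete multipart uploads. The abort must
    // also run unconditionally, not only when a prefix is non-empty.
    int remove_bucket(const std::string& bucket, bool delete_children);
    int abort_bucket_multiparts(const std::string& bucket);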
Conflicts:
src/rgw/rgw_sal_rados.cc
src/rgw/rgw_sal.h
src/rgw/rgw_sal_rados.h
- Alterations due to Zipper 7 code refactoring
src/rgw/rgw_sal_dbstore.cc
src/rgw/rgw_sal_dbstore.h
- Did not exist before Zipper 7 code refactoring
ceph-volume: fix bug with miscalculation of required db/wal slot size for VGs with multiple PVs
Previous logic for calculating db/wal slot sizes made the assumption that there would only be
a single PV backing each db/wal VG. This wasn't the case for OSDs deployed prior to v15.2.8,
since ceph-volume previously deployed multiple SSDs in the same VG. This fix removes the
assumption and does the correct calculation in either case.
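ceph-volume itself is Python; as a language-neutral illustration, here
is a minimal C++ sketch of the per-VG arithmetic (all names are
hypothetical):

    #include <cstdint>

    // Hypothetical model of the fix: derive the per-slot size from the
    // VG's aggregate free space, however many PVs back the VG.
    struct VolumeGroup {
      uint64_t free_extents;   // summed across *all* backing PVs
      uint64_t extent_size;    // bytes per extent
    };

    uint64_t slot_size_bytes(const VolumeGroup& vg, uint64_t slots) {
      // The old logic assumed one PV per VG, so per-PV math happened
      // to match per-VG math; with several PVs in one VG it miscounted.
      return vg.free_extents / slots * vg.extent_size;
    }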
Sage Weil [Fri, 5 Nov 2021 15:39:07 +0000 (11:39 -0400)]
mgr/cephadm: allow osd spec removal
OSD specs/drivegroups are essentially templates for OSD creation but do
not map to the full lifecycle of the OSDs that they create. When a user
removes a spec, remove it immediately rather than tying its removal to
the OSDs it created.
If --force is not provided, the command fails with an error listing the
OSDs that would be left behind. If --force is passed, the service is
removed.
This leaves behind a few oddities:
- When you list services, OSDs that were created by the drivegroup may
still exist, causing the drivegroup to appear in the list as
unmanaged services.
- If you create a new drivegroup with the same name, the prior OSDs will
appear to belong to the new spec instance, regardless of whether the
spec/drivegroup parameters are the same.
AndrewSharapov [Fri, 29 Oct 2021 15:10:20 +0000 (18:10 +0300)]
mgr/cephadm: Fix duplication of the IP address list for the public network interface.
Every call of find_ip_on_host() actually duplicates the list of public
IP addresses in self.networks, while it should NOT change it. As a
result, the value of the key mgr/cephadm/host.<hostname> in the kv
store becomes very large and may crash the ceph-mgr.
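The cephadm code is Python, but the underlying aliasing pattern is
language-neutral; a minimal C++ sketch of it (hypothetical names and
addresses):

    #include <map>
    #include <string>
    #include <vector>

    std::map<std::string, std::vector<std::string>> networks;  // shared state

    // Buggy shape: grabbing a reference into the shared map and
    // appending to it, so every lookup permanently grows the stored list.
    void find_ip_on_host_buggy(const std::string& host) {
      std::vector<std::string>& ips = networks[host];  // reference, not copy
      ips.push_back("10.0.0.1");  // hypothetical candidate leaks into `networks`
    }

    // Fixed shape: work on a local copy; the shared list stays unchanged.
    void find_ip_on_host(const std::string& host) {
      std::vector<std::string> ips = networks[host];   // copy
      ips.push_back("10.0.0.1");                       // local only
    }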
fix tox test: AttributeError: 'HostCache' object has no
attribute 'update_host_networks', which was introduced in 78983ad0d0cce422da32dc4876ac186f6d32c3f5 (not yet in pacific)
Venky Shankar [Tue, 10 Aug 2021 07:04:51 +0000 (03:04 -0400)]
tasks/cephfs_mirror: optionally run in foreground
The cephfs mirror daemon thrasher needs to send SIGTERM to mirror
daemons. A mirror daemon must run in the foreground for it to
receive the signal via `daemon.signal`.
Yin Congmin [Fri, 12 Nov 2021 08:54:31 +0000 (16:54 +0800)]
qa/suites/rbd/persistent-writeback-cache: add test case
Add a test case whose cache size is 8GiB, so that problems that occur
only in scenarios above 4GiB may be found by this test. For example,
a 32-bit variable may end up with an unexpected value when it is used
in arithmetic with a 64-bit value.
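The class of bug such a test can expose, as a standalone C++
illustration:

    #include <cstdint>
    #include <iostream>

    int main() {
      const uint64_t cache_size = 8ULL << 30;  // 8 GiB, deliberately > 4 GiB
      uint32_t off32 = static_cast<uint32_t>(cache_size);  // truncates to 0
      uint64_t off64 = cache_size;                         // correct

      // Prints "0 vs 8589934592": the 32-bit variable silently lost the
      // high bits, which only shows up once sizes exceed 4 GiB.
      std::cout << off32 << " vs " << off64 << '\n';
    }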
Jianpeng Ma [Mon, 8 Nov 2021 06:33:28 +0000 (14:33 +0800)]
librbd/cache/pwl: fix reorder issue between calls to process_writeback_dirty_entries
We must keep ops ordered not only within a single invocation of
process_writeback_dirty_entries(), but also across separate invocations
of it.
mds/FSMap: assign v16.2.4 compat to pre-v16.2.5 standby daemons
With v16.2.5, the monitors store an MDS's CompatSet with its mds_info_t
in the MDSMap. If an older MDS fails and rejoins the cluster, it gets
assigned the empty CompatSet. This is problematic during upgrades as an
MDS failure may prevent the upgrade process from continuing and cause
file system unavailability.
This patch makes the mons assign a reasonable default: the CompatSet
used from v14.2.0 until v16.2.5.
Fixes: https://tracker.ceph.com/issues/53150
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
(cherry picked from commit 74e3f5ec5a49ce99b56c305624e9110fcb4b787c)
cmake/modules/Findpmem: always set pmem_VERSION_STRING
before this change, `pmem_VERSION_STRING` was not set if pmem could not
fulfill the specified version requirement. the intention was to check
whether the found version satisfies the requirement, but passing an
empty `pmem_VERSION_STRING` to `find_package_handle_standard_args()` as
its `VERSION_VAR` option does not fail this check. on the contrary, it
prints
-- Found pmem: pmem_pmemobj_INCLUDE_DIR;pmem_pmem_INCLUDE_DIR (Required
is at least version "1.17")
if we require pmem 1.17 while the found version is, for instance, 1.10.
if the required version is 1.7, and the found version is 1.10, the
output from cmake is:
-- Found pmem: pmem_pmemobj_INCLUDE_DIR;pmem_pmem_INCLUDE_DIR (found
suitable version "1.10", minimum required is "1.7")
in this change, the version spec is not specified when calling
`pkg_check_modules()`, so `PKG_${component}_VERSION` is always set and
we can always delegate the version checking to
`find_package_handle_standard_args()`. please note, we use the lower of
the versions returned by pkg-config if multiple components are required
and both pkg-config checks return a version.
ceph.spec.in: do not build with system pmdk by default
we need to use libpmem 1.10 in #40493. without enabling the module
stream offering libpmem 1.9.2, we only have access to libpmem 1.6.1,
and fedora 33 only has libpmem 1.9 packaged. the same applies to
openSUSE Tumbleweed and openSUSE Leap. so let's stop using the libpmem
packaged by the distro by default, until these distros include libpmem
1.10.
Igor Fedotov [Thu, 15 Jul 2021 12:10:14 +0000 (15:10 +0300)]
os/bluestore: fix improper offset calculation when repairing.
While repairing misreferenced blobs, BlueStore could improperly
calculate an offset within the blob being fixed. This could happen when
a single physical extent had been replaced by multiple ones: the
following pextent (if any) in the current blob would be treated with an
improper offset within the blob, because the offset calculation
accounted for only the last of the new pextents instead of all of them.
Fixes: https://tracker.ceph.com/issues/51682
Signed-off-by: Igor Fedotov <ifedotov@suse.com>
(cherry picked from commit ca4b6675fc3fd2f4cadad58044c97c5bb23d5938)
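A minimal sketch of the corrected accounting in the repair path
described above, under assumed names (the real BlueStore repair code
differs):

    #include <cstdint>
    #include <vector>

    struct PExtent { uint64_t offset; uint64_t length; };

    // When one physical extent is replaced by several, the intra-blob
    // offset for the next pextent must advance by the total length of
    // all replacements; the bug effectively counted only the last one.
    uint64_t next_blob_offset(uint64_t blob_off,
                              const std::vector<PExtent>& replacements) {
      for (const auto& pe : replacements) {
        blob_off += pe.length;  // accumulate every replacement, not just the last
      }
      return blob_off;
    }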
The marker was not being handled correctly as segments of the bucket
index were listed in order to shut down any incomplete multipart
uploads. This fixes the marker, so it is maintained properly across
iterations.
Signed-off-by: J. Eric Ivancich <ivancich@redhat.com>
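A sketch of the pagination pattern the fix restores, with hypothetical
types (the actual rgw listing API differs):

    #include <string>
    #include <vector>

    struct Listing {
      std::vector<std::string> uploads;
      std::string next_marker;  // where the next listing should resume
      bool truncated = false;
    };

    // Hypothetical listing call; stubbed so the sketch is self-contained.
    Listing list_multiparts(const std::string& bucket,
                            const std::string& marker) {
      (void)bucket; (void)marker;
      return {};
    }

    // The fixed loop carries the marker forward on every iteration;
    // dropping it would re-list the same bucket-index segment forever.
    void abort_incomplete_multiparts(const std::string& bucket) {
      std::string marker;
      for (;;) {
        Listing l = list_multiparts(bucket, marker);
        // ... abort each upload in l.uploads ...
        if (!l.truncated) break;
        marker = l.next_marker;  // the fix: maintain the marker across iterations
      }
    }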
Venky Shankar [Fri, 1 Oct 2021 08:55:40 +0000 (04:55 -0400)]
mds: skip journaling blocklisted clients when in `replay` state
When a standby MDS is transitioning to active, it passes through
`replay` state. When the MDS is in this state, there are no journal
segments available for recording journal updates. If the MDS receives
an OSDMap update in this state, journaling blocklisted clients causes
a crash since no journal segments are available. This is a bit hard
to reproduce as it requires correct timing of an OSDMap update along
with various other factors.
Note that, when the MDS reaches `reconnect` state, it will journal
the blocklisted clients anyway.
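A minimal sketch of the guard this implies, with hypothetical names:

    #include <cstdint>
    #include <set>

    enum class MDSStateRank { REPLAY, RECONNECT, ACTIVE };

    // In `replay` there are no journal segments to write to, so
    // journaling blocklisted clients would crash; `reconnect` journals
    // them anyway, so skipping here is safe.
    void maybe_journal_blocklisted(MDSStateRank state,
                                   const std::set<uint64_t>& blocklisted) {
      if (state == MDSStateRank::REPLAY) {
        return;  // no journal segments yet; defer to reconnect
      }
      (void)blocklisted;  // ... record the blocklisted clients in the journal ...
    }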
This partially fixes tracker: https://tracker.ceph.com/issues/51589
which mentions a similar crash but in `reconnect` state. However,
that crash was seen in nautilus.
A couple of minor changes include removing hardcoded function names
and carving out reusable parts into a separate function.
Adam C. Emerson [Tue, 2 Nov 2021 16:46:15 +0000 (12:46 -0400)]
rgw: Ensure buckets too old to decode a layout have layout logs
When decoding `RGWBucketInfo` data from before Pacific, we won't call
`rgw::BucketLayout::decode`, but will instead synthesize the layout
information. This leaves the `rgw::BucketLayout::logs` empty, as the
fallback to populate it only applies to old versions of
`rgw::BucketLayout`.
Add a check at the end of `RGWBucketInfo::decode` to populate it if
empty.
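The shape of the fallback, as a hedged sketch (type and member names
are assumptions):

    #include <vector>

    struct BucketLogLayout { /* ... */ };
    struct BucketLayout {
      std::vector<BucketLogLayout> logs;
    };

    // At the end of decode: pre-Pacific encodings synthesize the layout
    // with no log entries, so make sure at least one exists.
    void ensure_layout_logs(BucketLayout& layout) {
      if (layout.logs.empty()) {
        layout.logs.emplace_back();  // synthesize an initial layout log
      }
    }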
Fixes: https://tracker.ceph.com/issues/53132
Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
(cherry picked from commit 3279509127e65314c07963a3e127e926308bd76a)
Fixes: https://tracker.ceph.com/issues/53160
Signed-off-by: Adam C. Emerson <aemerson@redhat.com>
librbd/cache/pwl: cancel advance dispatch of external flush request
For an external flush request, a new syncpoint is created after the
request passes the guarded-request stage and before it is dispatched,
and dispatch then bypasses the deferred queue. But the last write
request may still be sitting in the deferred queue: it has not been
dispatched and is not associated with any syncpoint. The external flush
request would thus overtake that earlier write request, which does not
conform to the semantics of external flush requests. External flush
requests must strictly follow the order of dispatch.
An internal flush request, by contrast, is dispatched only after all
write requests associated with the previous syncpoint have persisted in
the cache; a C_Gather guarantees this.
It is therefore necessary to distinguish between external and internal
flush requests. An internal flush can and should be dispatched in
advance, bypassing the deferred queue, while the order of external
requests must be kept unchanged. So cancel the advance dispatch of
external flush requests.
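A rough sketch of the resulting dispatch rule, with assumed types:

    #include <deque>
    #include <functional>
    #include <utility>

    struct Request {
      bool internal_flush = false;  // internal flushes are ordered via C_Gather
      std::function<void()> dispatch;
    };

    std::deque<Request> deferred_queue;

    // Only internal flushes may bypass the deferred queue; an external
    // flush must queue behind any writes still waiting there so that
    // dispatch order is preserved.
    void handle_flush(Request req) {
      if (req.internal_flush || deferred_queue.empty()) {
        req.dispatch();  // safe to dispatch in advance
      } else {
        deferred_queue.push_back(std::move(req));  // keep external ordering
      }
    }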
librbd/cache/pwl: fix assert in _aio_stop() during shutdown
For wait_for_ops(next_ctx), next_ctx may run in the aio thread, and
then the code that follows runs on the aio thread as well.
remove_pool_file() calls bdev->close(), which calls _aio_stop(), which
executes aio_thread.join() and hits the assert: a thread cannot join
itself. Fix it by adding the close context to m_work_queue, so close()
runs in a work-queue thread.
At the same time, correct the order of wait_for_ops():
flush_dirty_entries(next_ctx) may call wake_up() and start_op(), so
moving wait_for_ops() behind flush_dirty_entries(next_ctx) is more
appropriate.
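The self-join hazard and the work-queue handoff, reduced to a
standalone C++ sketch (names are illustrative):

    #include <functional>
    #include <thread>
    #include <vector>

    struct WorkQueue {
      void queue(std::function<void()> fn) { items.push_back(std::move(fn)); }
      void drain() { for (auto& f : items) f(); items.clear(); }
      std::vector<std::function<void()>> items;
    };

    int main() {
      WorkQueue wq;
      std::thread aio_thread([&wq] {
        // A completion context firing on the aio thread must not call
        // close() directly: close() would join aio_thread from itself.
        wq.queue([] { /* bdev->close() runs here, off the aio thread */ });
      });
      aio_thread.join();  // joined from another thread, never from itself
      wq.drain();         // the queued close() executes safely
    }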
Yin Congmin [Fri, 27 Aug 2021 15:41:49 +0000 (15:41 +0000)]
librbd/cache/pwl/ssd: move finish_op() to the end of callback function
finish_op() in the ssd cache is not at the end of the callback in
append_op_log_entries(), and the operations after finish_op() still
need to take m_lock. So, during shutdown, wait_for_ops() decides that
all ops are over and that no thread will acquire m_lock again. The
subsequent shutdown steps then take m_lock, and _aio_stop() inside
bdev->close() waits for all aio_write() and aio_submit() calls to end
while m_lock is held, but the callback of aio_write() is itself
waiting for m_lock, causing a deadlock. Move finish_op() to the end of
the callback to fix the deadlock.
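In sketch form, with hypothetical names: the op must stay "in flight"
until its callback is done with the lock.

    #include <mutex>

    std::mutex m_lock;
    void finish_op() {}  // hypothetical: marks one in-flight op complete

    // Callback for an appended log entry: everything that needs m_lock
    // happens first; only then is the op marked finished, so shutdown
    // cannot conclude "all ops done" while the lock is still needed.
    void on_append_complete() {
      {
        std::lock_guard<std::mutex> l(m_lock);
        // ... update write-log state under the lock ...
      }
      finish_op();  // moved to the very end of the callback
    }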
Jianpeng Ma [Tue, 7 Sep 2021 02:19:55 +0000 (10:19 +0800)]
librbd/cache/pwl/ssd: Remove unused parameter.
Fix the following compiler warning:
>[38/80] Building CXX object
src/librbd/CMakeFiles/librbd_plugin_pwl_cache.dir/cache/pwl/ssd/WriteLog.cc.o
>../src/librbd/cache/pwl/ssd/WriteLog.cc:37:25: warning: unused variable
'ops_appended_together' [-Wunused-const-variable]
>const unsigned long int ops_appended_together = MAX_WRITES_PER_SYNC_POINT;
Jianpeng Ma [Tue, 7 Sep 2021 02:00:53 +0000 (10:00 +0800)]
librbd/cache/pwl/ssd: Fix a race between get_cache_bl() and remove_cache_bl()
Although get_cache_bl() takes a lock, the lock cannot protect the
underlying `list& operator= (const list& other)`, which shares the
buffers rather than deep-copying them, so it can still race with
remove_cache_bl(). So we should use copy_cache_bl() instead.
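The race, reduced to a standalone sketch (a shared_ptr stands in for
the bufferlist's shared buffers; names mirror the commit but the
semantics here are illustrative):

    #include <memory>
    #include <mutex>
    #include <vector>

    // A shared_ptr stands in for bufferlist's internally shared buffers.
    using Buffer = std::shared_ptr<std::vector<char>>;

    struct Cache {
      std::mutex lock;
      Buffer bl = std::make_shared<std::vector<char>>();

      Buffer get_cache_bl_shallow() {      // buggy shape: shares state
        std::lock_guard<std::mutex> l(lock);
        return bl;                         // caller still races with remove
      }
      Buffer copy_cache_bl() {             // fixed shape: private deep copy
        std::lock_guard<std::mutex> l(lock);
        return std::make_shared<std::vector<char>>(*bl);
      }
      void remove_cache_bl() {
        std::lock_guard<std::mutex> l(lock);
        bl->clear();                       // mutates what shallow copies share
      }
    };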
Fixes: https://tracker.ceph.com/issues/52400
Signed-off-by: Jianpeng Ma <jianpeng.ma@intel.com>
(cherry picked from commit fe72b3953735329441397f257d5dd18f6819187d)