From aa48afe7e9703773c44860f7cfeba646db72b5e4 Mon Sep 17 00:00:00 2001
From: Laura Flores
Date: Mon, 18 Dec 2023 15:09:43 -0600
Subject: [PATCH] doc/releases: specify dashboard improvements

...and address other nits

Signed-off-by: Laura Flores
(cherry picked from commit d5ab720dc54e19904267fbde5ff54260eff9bd60)
Signed-off-by: Zac Dover
---
 doc/releases/reef.rst | 493 ++++++++++++++++++++++++++++++++++++++++++
 1 file changed, 493 insertions(+)

diff --git a/doc/releases/reef.rst b/doc/releases/reef.rst
index 6f4f827830b79..839d48ff61ffe 100644
--- a/doc/releases/reef.rst
+++ b/doc/releases/reef.rst
@@ -5,6 +5,499 @@ Reef
 Reef is the 18th stable release of Ceph. It is named after the reef squid
 (Sepioteuthis).

+v18.2.1 Reef
+============
+
+This is the first backport release in the Reef series, and the first with Debian packages,
+for Debian Bookworm. We recommend that all users update to this release.
+
+Notable Changes
+---------------
+
+* RGW: S3 multipart uploads using Server-Side Encryption now replicate correctly in
+  a multi-site deployment. Previously, the replicas of such objects were corrupted on
+  decryption. A new command, ``radosgw-admin bucket resync encrypted multipart``, can be
+  used to identify these original multipart uploads. The ``LastModified`` timestamp of
+  any identified object is incremented by 1ns to cause peer zones to replicate it again.
+  For multi-site deployments that make any use of Server-Side Encryption, we
+  recommend running this command against every bucket in every zone after all
+  zones have upgraded.
+
+* CEPHFS: The MDS now evicts clients that are not advancing their request tids (transaction IDs).
+  Such clients cause a large buildup of session metadata, which results in the MDS going read-only
+  because the RADOS operation exceeds the size threshold. The ``mds_session_metadata_threshold``
+  config option controls the maximum size to which (encoded) session metadata can grow.
+
+* RGW: New tools have been added to ``radosgw-admin`` for identifying and
+  correcting issues with versioned bucket indexes. Historical bugs with the
+  versioned bucket index transaction workflow made it possible for the index
+  to accumulate extraneous "book-keeping" olh (object logical head) entries
+  and plain placeholder entries. In some specific scenarios where clients made
+  concurrent requests referencing the same object key, it was likely that many
+  extra index entries would accumulate. When a significant number of these entries are
+  present in a single bucket index shard, they can cause high bucket listing
+  latencies and lifecycle processing failures. To check whether a versioned
+  bucket has unnecessary olh entries, users can now run ``radosgw-admin
+  bucket check olh``. If the ``--fix`` flag is used, the extra entries will
+  be safely removed. Separately from the issue described above, it is
+  also possible that some versioned buckets are maintaining extra unlinked
+  objects that are not listable from the S3/Swift APIs. These extra objects
+  are typically a result of PUT requests that exited abnormally in the middle
+  of a bucket index transaction, so the client would not have received a
+  successful response. Bugs in prior releases made these unlinked objects easy
+  to reproduce with any PUT request that was made on a bucket that was actively
+  resharding. Besides the extra space that these hidden, unlinked objects
+  consume, the failure mode that produced them can have another side effect in
+  certain scenarios: a client of a bucket that was affected by this bug may
+  find the object associated with the key to
+  be in an inconsistent state. To check whether a versioned bucket has unlinked
+  entries, users can now run ``radosgw-admin bucket check unlinked``. If the
+  ``--fix`` flag is used, the unlinked objects will be safely removed.
+  Finally, a third issue made it possible for versioned bucket index stats to be
+  accounted for inaccurately. The tooling for recalculating versioned bucket stats
+  also had a bug, and was not previously capable of fixing these inaccuracies.
+  This release resolves those issues, and users can now expect that the existing
+  ``radosgw-admin bucket check`` command will produce correct results. We
+  recommend that users with versioned buckets, especially those that existed
+  on prior releases, use these new tools to check whether their buckets are
+  affected and to clean them up accordingly.
+
+* mgr/snap-schedule: For clusters with multiple CephFS file systems, all the
+  snap-schedule commands now expect the ``--fs`` argument.
+
+* RADOS: A POOL_APP_NOT_ENABLED health warning will now be reported if
+  the application is not enabled for the pool, irrespective of whether
+  the pool is in use or not. Always tag a pool with an application
+  using the ``ceph osd pool application enable`` command to avoid the
+  POOL_APP_NOT_ENABLED health warning for that pool.
+  Users can temporarily mute this warning using
+  ``ceph health mute POOL_APP_NOT_ENABLED``.
+
+* Dashboard: Added an overview page for RGW that shows the overall status of RGW components.
+
+* Dashboard: Added management support for RGW Multi-site and CephFS Subvolumes and groups.
+
+* Dashboard: Fixed a few bugs and issues around the new dashboard page, including the broken layout
+  and some metrics giving wrong values, and introduced a popover to display details
+  when there are HEALTH_WARN or HEALTH_ERR.
+
+* Dashboard: Fixed several issues in Ceph dashboard on Rook-backed clusters,
+  and improved the user experience on the Rook environment.
+ +Changelog +--------- + +* .github: Clarify checklist details (`pr#54130 `_, Anthony D'Atri) +* [CVE-2023-43040] rgw: Fix bucket validation against POST policies (`pr#53756 `_, Joshua Baergen) +* Adding rollback mechanism to handle bootstrap failures (`pr#53864 `_, Adam King, Redouane Kachach) +* backport of rook orchestrator fixes and e2e automated testing (`pr#54224 `_, Redouane Kachach) +* Bluestore: fix bluestore collection_list latency perf counter (`pr#52950 `_, Wangwenjuan) +* build: Remove ceph-libboost\* packages in install-deps (`pr#52769 `_, Adam Emerson) +* ceph-volume/cephadm: support lv devices in inventory (`pr#53286 `_, Guillaume Abrioux) +* ceph-volume: add --osd-id option to raw prepare (`pr#52927 `_, Guillaume Abrioux) +* ceph-volume: fix a regression in `raw list` (`pr#54521 `_, Guillaume Abrioux) +* ceph-volume: fix mpath device support (`pr#53539 `_, Guillaume Abrioux) +* ceph-volume: fix raw list for lvm devices (`pr#52619 `_, Guillaume Abrioux) +* ceph-volume: fix raw list for lvm devices (`pr#52980 `_, Guillaume Abrioux) +* ceph-volume: Revert "ceph-volume: fix raw list for lvm devices" (`pr#54429 `_, Matthew Booth, Guillaume Abrioux) +* ceph: allow xlock state to be LOCK_PREXLOCK when putting it (`pr#53661 `_, Xiubo Li) +* ceph_fs.h: add separate owner\_{u,g}id fields (`pr#53138 `_, Alexander Mikhalitsyn) +* ceph_volume: support encrypted volumes for lvm new-db/new-wal/migrate commands (`pr#52875 `_, Igor Fedotov) +* cephadm batch backport Aug 23 (`pr#53124 `_, Adam King, Luis Domingues, John Mulligan, Redouane Kachach) +* cephadm: add a --dry-run option to cephadm shell (`pr#54220 `_, John Mulligan) +* cephadm: add tcmu-runner to logrotate config (`pr#53122 `_, Adam King) +* cephadm: Adding support to configure public_network cfg section (`pr#53110 `_, Redouane Kachach) +* cephadm: delete /tmp/cephadm- when removing the cluster (`pr#53109 `_, Redouane Kachach) +* cephadm: Fix extra_container_args for iSCSI (`pr#53010 `_, Raimund 
Sacherer) +* cephadm: fix haproxy version with certain containers (`pr#53751 `_, Adam King) +* cephadm: make custom_configs work for tcmu-runner container (`pr#53404 `_, Adam King) +* cephadm: run tcmu-runner through script to do restart on failure (`pr#53866 `_, Adam King) +* cephadm: support for CA signed keys (`pr#53121 `_, Adam King) +* cephfs-journal-tool: disambiguate usage of all keyword (in tool help) (`pr#53646 `_, Manish M Yathnalli) +* cephfs-mirror: do not run concurrent C_RestartMirroring context (`issue#62072 `_, `pr#53638 `_, Venky Shankar) +* cephfs: implement snapdiff (`pr#53229 `_, Igor Fedotov, Lucian Petrut, Denis Barahtanov) +* cephfs_mirror: correctly set top level dir permissions (`pr#53271 `_, Milind Changire) +* client: always refresh mds feature bits on session open (`issue#63188 `_, `pr#54146 `_, Venky Shankar) +* client: correct quota check in Client::_rename() (`pr#52578 `_, Rishabh Dave) +* client: do not send metrics until the MDS rank is ready (`pr#52501 `_, Xiubo Li) +* client: force sending cap revoke ack always (`pr#52507 `_, Xiubo Li) +* client: issue a cap release immediately if no cap exists (`pr#52850 `_, Xiubo Li) +* client: move the Inode to new auth mds session when changing auth cap (`pr#53666 `_, Xiubo Li) +* client: trigger to flush the buffer when making snapshot (`pr#52497 `_, Xiubo Li) +* client: wait rename to finish (`pr#52504 `_, Xiubo Li) +* cmake: ensure fmtlib is at least 8.1.1 (`pr#52970 `_, Abhishek Lekshmanan) +* Consider setting "bulk" autoscale pool flag when automatically creating a data pool for CephFS (`pr#52899 `_, Leonid Usov) +* crimson/admin/admin_socket: remove path file if it exists (`pr#53964 `_, Matan Breizman) +* crimson/ertr: assert on invocability of func provided to safe_then() (`pr#53958 `_, Radosław Zarzyński) +* crimson/mgr: Fix config show command (`pr#53954 `_, Aishwarya Mathuria) +* crimson/net: consolidate messenger implementations and enable multi-shard UTs (`pr#54095 `_, Yingxin 
Cheng) +* crimson/net: set TCP_NODELAY according to ms_tcp_nodelay (`pr#54063 `_, Xuehan Xu) +* crimson/net: support connections in multiple shards (`pr#53949 `_, Yingxin Cheng) +* crimson/os/object_data_handler: splitting right side doesn't mean splitting only one extent (`pr#54061 `_, Xuehan Xu) +* crimson/os/seastore/backref_manager: scan backref entries by journal seq (`pr#53939 `_, Zhang Song) +* crimson/os/seastore/btree: should add left's size when merging levels… (`pr#53946 `_, Xuehan Xu) +* crimson/os/seastore/cache: don't add EXIST_CLEAN extents to lru (`pr#54098 `_, Xuehan Xu) +* crimson/os/seastore/cached_extent: add prepare_commit interface (`pr#53941 `_, Xuehan Xu) +* crimson/os/seastore/cbj: fix a potential overflow bug on segment_seq (`pr#53968 `_, Myoungwon Oh) +* crimson/os/seastore/collection_manager: fill CollectionNode::decoded on clean reads (`pr#53956 `_, Xuehan Xu) +* crimson/os/seastore/journal/cbj: generalize scan_valid_records() (`pr#53961 `_, Myoungwon Oh, Yingxin Cheng) +* crimson/os/seastore/omap_manager: correct editor settings (`pr#53947 `_, Zhang Song) +* crimson/os/seastore/omap_manager: fix the entry leak issue in BtreeOMapManager::omap_list() (`pr#53962 `_, Xuehan Xu) +* crimson/os/seastore/onode_manager: populate value recorders of onodes to be erased (`pr#53966 `_, Xuehan Xu) +* crimson/os/seastore/rbm: make rbm support multiple shards (`pr#53952 `_, Myoungwon Oh) +* crimson/os/seastore/transaction_manager: data loss issues (`pr#53955 `_, Xuehan Xu) +* crimson/os/seastore/transaction_manager: move intermediate_key by "remap_offset" when remapping the "back" half of the original pin (`pr#54140 `_, Xuehan Xu) +* crimson/os/seastore/zbd: zbdsegmentmanager write path fixes (`pr#54062 `_, Aravind Ramesh) +* crimson/os/seastore: add metrics about total invalidated transactions (`pr#53953 `_, Zhang Song) +* crimson/os/seastore: create page aligned bufferptr in copy ctor of CachedExtent (`pr#54097 `_, Zhang Song) +* 
crimson/os/seastore: enable SMR HDD (`pr#53935 `_, Aravind Ramesh) +* crimson/os/seastore: fix ceph_assert in segment_manager.h (`pr#53938 `_, Aravind Ramesh) +* crimson/os/seastore: fix daggling reference of oid in SeaStore::Shard::stat() (`pr#53960 `_, Xuehan Xu) +* crimson/os/seastore: fix in check_node (`pr#53945 `_, Xinyu Huang) +* crimson/os/seastore: OP_CLONE in seastore (`pr#54092 `_, xuxuehan, Xuehan Xu) +* crimson/os/seastore: realize lazy read in split overwrite with overwrite refactor (`pr#53951 `_, Xinyu Huang) +* crimson/os/seastore: retire_extent_addr clean up (`pr#53959 `_, Xinyu Huang) +* crimson/osd/heartbeat: Improve maybe_share_osdmap behavior (`pr#53940 `_, Samuel Just) +* crimson/osd/lsan_suppressions.cc: Add MallocExtension::Initialize() (`pr#54057 `_, Mark Nelson, Matan Breizman) +* crimson/osd/lsan_suppressions: add MallocExtension::Register (`pr#54139 `_, Matan Breizman) +* crimson/osd/object_context: consider clones found as long as they're in SnapSet::clones (`pr#53965 `_, Xuehan Xu) +* crimson/osd/osd_operations: add pipeline to LogMissingRequest to sync it (`pr#53957 `_, Xuehan Xu) +* crimson/osd/osd_operations: consistent naming to pipeline users (`pr#54060 `_, Matan Breizman) +* crimson/osd/pg: check if backfill_state exists when judging objects' (`pr#53963 `_, Xuehan Xu) +* crimson/osd/watch: Add logs around Watch/Notify (`pr#53950 `_, Matan Breizman) +* crimson/osd: add embedded suppression ruleset for LSan (`pr#53937 `_, Radoslaw Zarzynski) +* crimson/osd: cleanup and drop OSD::ShardDispatcher (`pr#54138 `_, Yingxin Cheng) +* Crimson/osd: Disable concurrent MOSDMap handling (`pr#53944 `_, Matan Breizman) +* crimson/osd: don't ignore start_pg_operation returned future (`pr#53948 `_, Matan Breizman) +* crimson/osd: fix ENOENT on accessing RadosGW user's index of buckets (`pr#53942 `_, Radoslaw Zarzynski) +* crimson/osd: fix Notify life-time mismanagement in Watch::notify_ack (`pr#53943 `_, Radoslaw Zarzynski) +* crimson/osd: fixes 
and cleanups around multi-core OSD (`pr#54091 `_, Yingxin Cheng) +* Crimson/osd: support multicore osd (`pr#54058 `_, chunmei) +* crimson/tools/perf_crimson_msgr: integrate multi-core msgr with various improvements (`pr#54059 `_, Yingxin Cheng) +* crimson/tools/perf_crimson_msgr: randomize client nonce (`pr#54093 `_, Yingxin Cheng) +* crimson/tools/perf_staged_fltree: fix compile error (`pr#54096 `_, Myoungwon Oh) +* crimson/vstart: default seastore_device_size will be out of space f… (`pr#53969 `_, chunmei) +* crimson: Enable tcmalloc when using seastar (`pr#54105 `_, Mark Nelson, Matan Breizman) +* debian/control: add docker-ce as recommends for cephadm package (`pr#52908 `_, Adam King) +* Debian: update to dh compat 12, fix more serious packaging errors, correct copyright syntax (`pr#53654 `_, Matthew Vernon) +* doc/architecture.rst - edit a sentence (`pr#53372 `_, Zac Dover) +* doc/architecture.rst - edit up to "Cluster Map" (`pr#53366 `_, Zac Dover) +* doc/architecture: "Edit HA Auth" (`pr#53619 `_, Zac Dover) +* doc/architecture: "Edit HA Auth" (one of several) (`pr#53585 `_, Zac Dover) +* doc/architecture: "Edit HA Auth" (one of several) (`pr#53491 `_, Zac Dover) +* doc/architecture: edit "Calculating PG IDs" (`pr#53748 `_, Zac Dover) +* doc/architecture: edit "Cluster Map" (`pr#53434 `_, Zac Dover) +* doc/architecture: edit "Data Scrubbing" (`pr#53730 `_, Zac Dover) +* doc/architecture: Edit "HA Auth" (`pr#53488 `_, Zac Dover) +* doc/architecture: edit "HA Authentication" (`pr#53632 `_, Zac Dover) +* doc/architecture: edit "High Avail. 
Monitors" (`pr#53451 `_, Zac Dover) +* doc/architecture: edit "OSD Membership and Status" (`pr#53727 `_, Zac Dover) +* doc/architecture: edit "OSDs service clients directly" (`pr#53686 `_, Zac Dover) +* doc/architecture: edit "Peering and Sets" (`pr#53871 `_, Zac Dover) +* doc/architecture: edit "Replication" (`pr#53738 `_, Zac Dover) +* doc/architecture: edit "SDEH" (`pr#53659 `_, Zac Dover) +* doc/architecture: edit several sections (`pr#53742 `_, Zac Dover) +* doc/architecture: repair RBD sentence (`pr#53877 `_, Zac Dover) +* doc/ceph-volume: explain idempotence (`pr#54233 `_, Zac Dover) +* doc/ceph-volume: improve front matter (`pr#54235 `_, Zac Dover) +* doc/cephadm/services: remove excess rendered indentation in osd.rst (`pr#54323 `_, Ville Ojamo) +* doc/cephadm: add ssh note to install.rst (`pr#53199 `_, Zac Dover) +* doc/cephadm: edit "Adding Hosts" in install.rst (`pr#53224 `_, Zac Dover) +* doc/cephadm: edit sentence in mgr.rst (`pr#53164 `_, Zac Dover) +* doc/cephadm: edit troubleshooting.rst (1 of x) (`pr#54283 `_, Zac Dover) +* doc/cephadm: edit troubleshooting.rst (2 of x) (`pr#54320 `_, Zac Dover) +* doc/cephadm: fix typo in cephadm initial crush location section (`pr#52887 `_, John Mulligan) +* doc/cephadm: fix typo in set ssh key command (`pr#54388 `_, Piotr Parczewski) +* doc/cephadm: update cephadm reef version (`pr#53162 `_, Rongqi Sun) +* doc/cephfs: edit mount-using-fuse.rst (`pr#54353 `_, Jaanus Torp) +* doc/cephfs: write cephfs commands fully in docs (`pr#53402 `_, Rishabh Dave) +* doc/config: edit "ceph-conf.rst" (`pr#54463 `_, Zac Dover) +* doc/configuration: edit "bg" in mon-config-ref.rst (`pr#53347 `_, Zac Dover) +* doc/dev/release-checklist: check telemetry validation (`pr#52805 `_, Yaarit Hatuka) +* doc/dev: Fix typos in files cephfs-mirroring.rst and deduplication.rst (`pr#53519 `_, Daniel Parkes) +* doc/dev: remove cache-pool (`pr#54007 `_, Zac Dover) +* doc/glossary: add "primary affinity" to glossary (`pr#53427 `_, Zac Dover) +* 
doc/glossary: add "Quorum" to glossary (`pr#54509 `_, Zac Dover) +* doc/glossary: improve "BlueStore" entry (`pr#54265 `_, Zac Dover) +* doc/man/8/ceph-monstore-tool: add documentation (`pr#52872 `_, Matan Breizman) +* doc/man/8: improve radosgw-admin.rst (`pr#53267 `_, Anthony D'Atri) +* doc/man: edit ceph-monstore-tool.rst (`pr#53476 `_, Zac Dover) +* doc/man: radosgw-admin.rst typo (`pr#53315 `_, Zac Dover) +* doc/man: remove docs about support for unix domain sockets (`pr#53312 `_, Zac Dover) +* doc/man: s/kvstore-tool/monstore-tool/ (`pr#53536 `_, Zac Dover) +* doc/rados/configuration: Avoid repeating "support" in msgr2.rst (`pr#52998 `_, Ville Ojamo) +* doc/rados: add bulk flag to pools.rst (`pr#53317 `_, Zac Dover) +* doc/rados: edit "troubleshooting-mon" (`pr#54502 `_, Zac Dover) +* doc/rados: edit memory-profiling.rst (`pr#53932 `_, Zac Dover) +* doc/rados: edit operations/add-or-rm-mons (1 of x) (`pr#52889 `_, Zac Dover) +* doc/rados: edit operations/add-or-rm-mons (2 of x) (`pr#52825 `_, Zac Dover) +* doc/rados: edit ops/control.rst (1 of x) (`pr#53811 `_, zdover23, Zac Dover) +* doc/rados: edit ops/control.rst (2 of x) (`pr#53815 `_, Zac Dover) +* doc/rados: edit t-mon "common issues" (1 of x) (`pr#54418 `_, Zac Dover) +* doc/rados: edit t-mon "common issues" (2 of x) (`pr#54421 `_, Zac Dover) +* doc/rados: edit t-mon "common issues" (3 of x) (`pr#54438 `_, Zac Dover) +* doc/rados: edit t-mon "common issues" (4 of x) (`pr#54443 `_, Zac Dover) +* doc/rados: edit t-mon "common issues" (5 of x) (`pr#54455 `_, Zac Dover) +* doc/rados: edit t-mon.rst text (`pr#54349 `_, Zac Dover) +* doc/rados: edit t-shooting-mon.rst (`pr#54427 `_, Zac Dover) +* doc/rados: edit troubleshooting-mon.rst (2 of x) (`pr#52839 `_, Zac Dover) +* doc/rados: edit troubleshooting-mon.rst (3 of x) (`pr#53879 `_, Zac Dover) +* doc/rados: edit troubleshooting-mon.rst (4 of x) (`pr#53897 `_, Zac Dover) +* doc/rados: edit troubleshooting-osd (1 of x) (`pr#53982 `_, Zac Dover) +* 
doc/rados: Edit troubleshooting-osd (2 of x) (`pr#54000 `_, Zac Dover) +* doc/rados: Edit troubleshooting-osd (3 of x) (`pr#54026 `_, Zac Dover) +* doc/rados: edit troubleshooting-pg (2 of x) (`pr#54114 `_, Zac Dover) +* doc/rados: edit troubleshooting-pg.rst (`pr#54228 `_, Zac Dover) +* doc/rados: edit troubleshooting-pg.rst (1 of x) (`pr#54073 `_, Zac Dover) +* doc/rados: edit troubleshooting.rst (`pr#53837 `_, Zac Dover) +* doc/rados: edit troubleshooting/community.rst (`pr#53881 `_, Zac Dover) +* doc/rados: format "initial troubleshooting" (`pr#54477 `_, Zac Dover) +* doc/rados: format Q&A list in t-mon.rst (`pr#54345 `_, Zac Dover) +* doc/rados: format Q&A list in tshooting-mon.rst (`pr#54366 `_, Zac Dover) +* doc/rados: improve "scrubbing" explanation (`pr#54270 `_, Zac Dover) +* doc/rados: parallelize t-mon headings (`pr#54461 `_, Zac Dover) +* doc/rados: remove cache-tiering-related keys (`pr#54227 `_, Zac Dover) +* doc/rados: remove FileStore material (in Reef) (`pr#54008 `_, Zac Dover) +* doc/rados: remove HitSet-related key information (`pr#54217 `_, Zac Dover) +* doc/rados: update monitoring-osd-pg.rst (`pr#52958 `_, Zac Dover) +* doc/radosgw: Improve dynamicresharding.rst (`pr#54368 `_, Anthony D'Atri) +* doc/radosgw: Improve language and formatting in config-ref.rst (`pr#52835 `_, Ville Ojamo) +* doc/radosgw: multisite - edit "migrating a single-site" (`pr#53261 `_, Qi Tao) +* doc/radosgw: update rate limit management (`pr#52910 `_, Zac Dover) +* doc/README.md - edit "Building Ceph" (`pr#53057 `_, Zac Dover) +* doc/README.md - improve "Running a test cluster" (`pr#53258 `_, Zac Dover) +* doc/rgw: correct statement about default zone features (`pr#52833 `_, Casey Bodley) +* doc/rgw: pubsub capabilities reference was removed from docs (`pr#54137 `_, Yuval Lifshitz) +* doc/rgw: several response headers are supported (`pr#52803 `_, Casey Bodley) +* doc/start: correct ABC test chart (`pr#53256 `_, Dmitry Kvashnin) +* doc/start: edit os-recommendations.rst 
(`pr#53179 `_, Zac Dover) +* doc/start: fix typo in hardware-recommendations.rst (`pr#54480 `_, Anthony D'Atri) +* doc/start: Modernize and clarify hardware-recommendations.rst (`pr#54071 `_, Anthony D'Atri) +* doc/start: refactor ABC test chart (`pr#53094 `_, Zac Dover) +* doc/start: update "platforms" table (`pr#53075 `_, Zac Dover) +* doc/start: update linking conventions (`pr#52912 `_, Zac Dover) +* doc/start: update linking conventions (`pr#52841 `_, Zac Dover) +* doc/troubleshooting: edit cpu-profiling.rst (`pr#53059 `_, Zac Dover) +* doc: Add a note on possible deadlock on volume deletion (`pr#52946 `_, Kotresh HR) +* doc: add note for removing (automatic) partitioning policy (`pr#53569 `_, Venky Shankar) +* doc: Add Reef 18.2.0 release notes (`pr#52905 `_, Zac Dover) +* doc: Add warning on manual CRUSH rule removal (`pr#53420 `_, Alvin Owyong) +* doc: clarify upmap balancer documentation (`pr#53004 `_, Laura Flores) +* doc: correct option name (`pr#53128 `_, Patrick Donnelly) +* doc: do not recommend pulling cephadm from git (`pr#52997 `_, John Mulligan) +* doc: Documentation about main Ceph metrics (`pr#54111 `_, Juan Miguel Olmo Martínez) +* doc: edit README.md - contributing code (`pr#53049 `_, Zac Dover) +* doc: expand and consolidate mds placement (`pr#53146 `_, Patrick Donnelly) +* doc: Fix doc for mds cap acquisition throttle (`pr#53024 `_, Kotresh HR) +* doc: improve submodule update command - README.md (`pr#53000 `_, Zac Dover) +* doc: make instructions to get an updated cephadm common (`pr#53260 `_, John Mulligan) +* doc: remove egg fragment from dev/developer_guide/running-tests-locally (`pr#53853 `_, Dhairya Parmar) +* doc: Update dynamicresharding.rst (`pr#54329 `_, Aliaksei Makarau) +* doc: Update mClock QOS documentation to discard osd_mclock_cost_per\_\* (`pr#54079 `_, tanchangzhi) +* doc: update rados.cc (`pr#52967 `_, Zac Dover) +* doc: update test cluster commands in README.md (`pr#53349 `_, Zac Dover) +* exporter: add ceph_daemon labels 
to labeled counters as well (`pr#53695 `_, avanthakkar) +* exposed the open api and telemetry links in details card (`pr#53142 `_, cloudbehl, dpandit) +* libcephsqlite: fill 0s in unread portion of buffer (`pr#53101 `_, Patrick Donnelly) +* librbd: kick ExclusiveLock state machine on client being blocklisted when waiting for lock (`pr#53293 `_, Ramana Raja) +* librbd: kick ExclusiveLock state machine stalled waiting for lock from reacquire_lock() (`pr#53919 `_, Ramana Raja) +* librbd: make CreatePrimaryRequest remove any unlinked mirror snapshots (`pr#53276 `_, Ilya Dryomov) +* MClientRequest: properly handle ceph_mds_request_head_legacy for ext_num_retry, ext_num_fwd, owner_uid, owner_gid (`pr#54407 `_, Alexander Mikhalitsyn) +* MDS imported_inodes metric is not updated (`pr#51698 `_, Yongseok Oh) +* mds/FSMap: allow upgrades if no up mds (`pr#53851 `_, Patrick Donnelly) +* mds/Server: mark a cap acquisition throttle event in the request (`pr#53168 `_, Leonid Usov) +* mds: acquire inode snaplock in open (`pr#53183 `_, Patrick Donnelly) +* mds: add event for batching getattr/lookup (`pr#53558 `_, Patrick Donnelly) +* mds: adjust pre_segments_size for MDLog when trimming segments for st… (`issue#59833 `_, `pr#54035 `_, Venky Shankar) +* mds: blocklist clients with "bloated" session metadata (`issue#62873 `_, `issue#61947 `_, `pr#53329 `_, Venky Shankar) +* mds: do not send split_realms for CEPH_SNAP_OP_UPDATE msg (`pr#52847 `_, Xiubo Li) +* mds: drop locks and retry when lock set changes (`pr#53241 `_, Patrick Donnelly) +* mds: dump locks when printing mutation ops (`pr#52975 `_, Patrick Donnelly) +* mds: fix deadlock between unlinking and linkmerge (`pr#53497 `_, Xiubo Li) +* mds: fix stray evaluation using scrub and introduce new option (`pr#50813 `_, Dhairya Parmar) +* mds: Fix the linkmerge assert check (`pr#52724 `_, Kotresh HR) +* mds: log message when exiting due to asok command (`pr#53548 `_, Patrick Donnelly) +* mds: MDLog::_recovery_thread: handle the 
errors gracefully (`pr#52512 `_, Jos Collin) +* mds: session ls command appears twice in command listing (`pr#52515 `_, Neeraj Pratap Singh) +* mds: skip forwarding request if the session were removed (`pr#52846 `_, Xiubo Li) +* mds: update mdlog perf counters during replay (`pr#52681 `_, Patrick Donnelly) +* mds: use variable g_ceph_context directly in MDSAuthCaps (`pr#52819 `_, Rishabh Dave) +* mgr/cephadm: Add "networks" parameter to orch apply rgw (`pr#53120 `_, Teoman ONAY) +* mgr/cephadm: add ability to zap OSDs' devices while draining host (`pr#53869 `_, Adam King) +* mgr/cephadm: add is_host\_ functions to HostCache (`pr#53118 `_, Adam King) +* mgr/cephadm: Adding sort-by support for ceph orch ps (`pr#53867 `_, Redouane Kachach) +* mgr/cephadm: allow draining host without removing conf/keyring files (`pr#53123 `_, Adam King) +* mgr/cephadm: also don't write client files/tuned profiles to maintenance hosts (`pr#53111 `_, Adam King) +* mgr/cephadm: ceph orch add fails when ipv6 address is surrounded by square brackets (`pr#53870 `_, Teoman ONAY) +* mgr/cephadm: don't use image tag in orch upgrade ls (`pr#53865 `_, Adam King) +* mgr/cephadm: fix default image base in reef (`pr#53922 `_, Adam King) +* mgr/cephadm: fix REFRESHED column of orch ps being unpopulated (`pr#53741 `_, Adam King) +* mgr/cephadm: fix upgrades with nvmeof (`pr#53924 `_, Adam King) +* mgr/cephadm: removing double quotes from the generated nvmeof config (`pr#53868 `_, Redouane Kachach) +* mgr/cephadm: show meaningful messages when failing to execute cmds (`pr#53106 `_, Redouane Kachach) +* mgr/cephadm: storing prometheus/alertmanager credentials in monstore (`pr#53119 `_, Redouane Kachach) +* mgr/cephadm: validate host label before removing (`pr#53112 `_, Redouane Kachach) +* mgr/dashboard: add e2e tests for cephfs management (`pr#53190 `_, Nizamudeen A) +* mgr/dashboard: Add more decimals in latency graph (`pr#52727 `_, Pedro Gonzalez Gomez) +* mgr/dashboard: add port and zone endpoints 
to import realm token form in rgw multisite (`pr#54118 `_, Aashish Sharma) +* mgr/dashboard: add validator for size field in the forms (`pr#53378 `_, Nizamudeen A) +* mgr/dashboard: align charts of landing page (`pr#53543 `_, Pedro Gonzalez Gomez) +* mgr/dashboard: allow PUT in CORS (`pr#52705 `_, Nizamudeen A) +* mgr/dashboard: allow tls 1.2 with a config option (`pr#53780 `_, Nizamudeen A) +* mgr/dashboard: Block Ui fails in angular with target es2022 (`pr#54260 `_, Aashish Sharma) +* mgr/dashboard: cephfs volume and subvolume management (`pr#53017 `_, Pedro Gonzalez Gomez, Nizamudeen A, Pere Diaz Bou) +* mgr/dashboard: cephfs volume rm and rename (`pr#53026 `_, avanthakkar) +* mgr/dashboard: cleanup rbd-mirror process in dashboard e2e (`pr#53220 `_, Nizamudeen A) +* mgr/dashboard: cluster upgrade management (batch backport) (`pr#53016 `_, avanthakkar, Nizamudeen A) +* mgr/dashboard: Dashboard RGW multisite configuration (`pr#52922 `_, Aashish Sharma, Pedro Gonzalez Gomez, Avan Thakkar, avanthakkar) +* mgr/dashboard: disable hosts field while editing the filesystem (`pr#54069 `_, Nizamudeen A) +* mgr/dashboard: disable promote on mirroring not enabled (`pr#52536 `_, Pedro Gonzalez Gomez) +* mgr/dashboard: disable protect if layering is not enabled on the image (`pr#53173 `_, avanthakkar) +* mgr/dashboard: display the groups in cephfs subvolume tab (`pr#53394 `_, Pedro Gonzalez Gomez) +* mgr/dashboard: empty grafana panels for performance of daemons (`pr#52774 `_, Avan Thakkar, avanthakkar) +* mgr/dashboard: enable protect option if layering enabled (`pr#53795 `_, avanthakkar) +* mgr/dashboard: fix cephfs create form validator (`pr#53219 `_, Nizamudeen A) +* mgr/dashboard: fix cephfs form validator (`pr#53778 `_, Nizamudeen A) +* mgr/dashboard: fix cephfs forms validations (`pr#53831 `_, Nizamudeen A) +* mgr/dashboard: fix image columns naming (`pr#53254 `_, Pedro Gonzalez Gomez) +* mgr/dashboard: fix progress bar color visibility (`pr#53209 `_, Nizamudeen A) +* 
mgr/dashboard: fix prometheus queries subscriptions (`pr#53669 `_, Pedro Gonzalez Gomez) +* mgr/dashboard: fix rgw multi-site import form helper (`pr#54395 `_, Aashish Sharma) +* mgr/dashboard: fix rgw multisite error when no rgw entity is present (`pr#54261 `_, Aashish Sharma) +* mgr/dashboard: fix rgw page issues when hostname not resolvable (`pr#53214 `_, Nizamudeen A) +* mgr/dashboard: fix rgw port manipulation error in dashboard (`pr#53392 `_, Nizamudeen A) +* mgr/dashboard: fix the landing page layout issues (`issue#62961 `_, `pr#53835 `_, Nizamudeen A) +* mgr/dashboard: Fix user/bucket count in rgw overview dashboard (`pr#53818 `_, Aashish Sharma) +* mgr/dashboard: fixed edit user quota form error (`pr#54223 `_, Ivo Almeida) +* mgr/dashboard: images -> edit -> disable checkboxes for layering and deef-flatten (`pr#53388 `_, avanthakkar) +* mgr/dashboard: minor usability improvements (`pr#53143 `_, cloudbehl) +* mgr/dashboard: n/a entries behind primary snapshot mode (`pr#53223 `_, Pere Diaz Bou) +* mgr/dashboard: Object gateway inventory card incorrect Buckets and user count (`pr#53382 `_, Aashish Sharma) +* mgr/dashboard: Object gateway sync status cards keeps loading when multisite is not configured (`pr#53381 `_, Aashish Sharma) +* mgr/dashboard: paginate hosts (`pr#52918 `_, Pere Diaz Bou) +* mgr/dashboard: rbd image hide usage bar when disk usage is not provided (`pr#53810 `_, Pedro Gonzalez Gomez) +* mgr/dashboard: remove empty popover when there are no health warns (`pr#53652 `_, Nizamudeen A) +* mgr/dashboard: remove green tick on old password field (`pr#53386 `_, Nizamudeen A) +* mgr/dashboard: remove unnecessary failing hosts e2e (`pr#53458 `_, Pedro Gonzalez Gomez) +* mgr/dashboard: remove used and total used columns in favor of usage bar (`pr#53304 `_, Pedro Gonzalez Gomez) +* mgr/dashboard: replace sync progress bar with last synced timestamp in rgw multisite sync status card (`pr#53379 `_, Aashish Sharma) +* mgr/dashboard: RGW Details card 
cleanup (`pr#53020 `_, Nizamudeen A, cloudbehl) +* mgr/dashboard: Rgw Multi-site naming improvements (`pr#53806 `_, Aashish Sharma) +* mgr/dashboard: rgw multisite topology view shows blank table for multisite entities (`pr#53380 `_, Aashish Sharma) +* mgr/dashboard: set CORS header for unauthorized access (`pr#53201 `_, Nizamudeen A) +* mgr/dashboard: show a message to restart the rgw daemons after moving from single-site to multi-site (`pr#53805 `_, Aashish Sharma) +* mgr/dashboard: subvolume rm with snapshots (`pr#53233 `_, Pedro Gonzalez Gomez) +* mgr/dashboard: update rgw multisite import form helper info (`pr#54253 `_, Aashish Sharma) +* mgr/dashboard: upgrade angular v14 and v15 (`pr#52662 `_, Nizamudeen A) +* mgr/rbd_support: fix recursive locking on CreateSnapshotRequests lock (`pr#54289 `_, Ramana Raja) +* mgr/snap_schedule: allow retention spec 'n' to be user defined (`pr#52748 `_, Milind Changire, Jakob Haufe) +* mgr/snap_schedule: make fs argument mandatory if more than one filesystem exists (`pr#54094 `_, Milind Changire) +* mgr/volumes: Fix pending_subvolume_deletions in volume info (`pr#53572 `_, Kotresh HR) +* mgr: register OSDs in ms_handle_accept (`pr#53187 `_, Patrick Donnelly) +* mon, qa: issue pool application warning even if pool is empty (`pr#53041 `_, Prashant D) +* mon/ConfigMonitor: update crush_location from osd entity (`pr#52466 `_, Didier Gazen) +* mon/MDSMonitor: plug paxos when maybe manipulating osdmap (`pr#52246 `_, Patrick Donnelly) +* mon/MonClient: resurrect original client_mount_timeout handling (`pr#52535 `_, Ilya Dryomov) +* mon/OSDMonitor: do not propose on error in prepare_update (`pr#53186 `_, Patrick Donnelly) +* mon: fix iterator mishandling in PGMap::apply_incremental (`pr#52554 `_, Oliver Schmidt) +* msgr: AsyncMessenger add faulted connections metrics (`pr#53033 `_, Pere Diaz Bou) +* os/bluestore: don't require bluestore_db_block_size when attaching new (`pr#52942 `_, Igor Fedotov) +* os/bluestore: get rid off 
resulting lba alignment in allocators (`pr#54772 `_, Igor Fedotov) +* osd/OpRequest: Add detail description for delayed op in osd log file (`pr#53688 `_, Yite Gu) +* osd/OSDMap: Check for uneven weights & != 2 buckets post stretch mode (`pr#52457 `_, Kamoltat) +* osd/scheduler/mClockScheduler: Use same profile and client ids for all clients to ensure allocated QoS limit consumption (`pr#53093 `_, Sridhar Seshasayee) +* osd: fix logic in check_pg_upmaps (`pr#54276 `_, Laura Flores) +* osd: fix read balancer logic to avoid redundant primary assignment (`pr#53820 `_, Laura Flores) +* osd: fix use-after-move in build_incremental_map_msg() (`pr#54267 `_, Ronen Friedman) +* osd: fix: slow scheduling when item_cost is large (`pr#53861 `_, Jrchyang Yu) +* Overview graph improvements (`pr#53090 `_, cloudbehl) +* pybind/mgr/devicehealth: do not crash if db not ready (`pr#52213 `_, Patrick Donnelly) +* pybind/mgr/pg_autoscaler: Cut back osdmap.get_pools calls (`pr#52767 `_, Kamoltat) +* pybind/mgr/pg_autoscaler: fix warn when not too few pgs (`pr#53674 `_, Kamoltat) +* pybind/mgr/pg_autoscaler: noautoscale flag retains individual pool configs (`pr#53658 `_, Kamoltat) +* pybind/mgr/pg_autoscaler: Reordered if statement for the func: _maybe_adjust (`pr#53429 `_, Kamoltat) +* pybind/mgr/pg_autoscaler: Use bytes_used for actual_raw_used (`pr#53534 `_, Kamoltat) +* pybind/mgr/volumes: log mutex locks to help debug deadlocks (`pr#53918 `_, Kotresh HR) +* pybind/mgr: reopen database handle on blocklist (`pr#52460 `_, Patrick Donnelly) +* pybind/rbd: don't produce info on errors in aio_mirror_image_get_info() (`pr#54055 `_, Ilya Dryomov) +* python-common/drive_group: handle fields outside of 'spec' even when 'spec' is provided (`pr#53115 `_, Adam King) +* python-common/drive_selection: lower log level of limit policy message (`pr#53114 `_, Adam King) +* python-common: drive_selection: fix KeyError when osdspec_affinity is not set (`pr#53159 `_, Guillaume Abrioux) +* qa/cephfs: fix
build failure for mdtest project (`pr#53827 `_, Rishabh Dave) +* qa/cephfs: fix ior project build failure (`pr#53825 `_, Rishabh Dave) +* qa/cephfs: switch to python3 for centos stream 9 (`pr#53624 `_, Xiubo Li) +* qa/rgw: add new POOL_APP_NOT_ENABLED failures to log-ignorelist (`pr#53896 `_, Casey Bodley) +* qa/smoke,orch,perf-basic: add POOL_APP_NOT_ENABLED to ignorelist (`pr#54376 `_, Prashant D) +* qa/standalone/osd/divergent-prior.sh: Divergent test 3 with pg_autoscale_mode on pick divergent osd (`pr#52721 `_, Nitzan Mordechai) +* qa/suites/crimson-rados: add centos9 to supported distros (`pr#54020 `_, Matan Breizman) +* qa/suites/crimson-rados: bring backfill testing (`pr#54021 `_, Radoslaw Zarzynski, Matan Breizman) +* qa/suites/crimson-rados: Use centos8 for testing (`pr#54019 `_, Matan Breizman) +* qa/suites/krbd: stress test for recovering from watch errors (`pr#53786 `_, Ilya Dryomov) +* qa/suites/rbd: add test to check rbd_support module recovery (`pr#54291 `_, Ramana Raja) +* qa/suites/rbd: drop cache tiering workload tests (`pr#53996 `_, Ilya Dryomov) +* qa/suites/upgrade: enable default RBD image features (`pr#53352 `_, Ilya Dryomov) +* qa/suites/upgrade: fix env indentation in stress-split upgrade tests (`pr#53921 `_, Laura Flores) +* qa/suites/{rbd,krbd}: disable POOL_APP_NOT_ENABLED health check (`pr#53599 `_, Ilya Dryomov) +* qa/tests: added - \(POOL_APP_NOT_ENABLED\) to the ignore list (`pr#54436 `_, Yuri Weinstein) +* qa: add POOL_APP_NOT_ENABLED to ignorelist for cephfs tests (`issue#62482 `_, `issue#62508 `_, `pr#54380 `_, Venky Shankar, Patrick Donnelly) +* qa: assign file system affinity for replaced MDS (`issue#61764 `_, `pr#54037 `_, Venky Shankar) +* qa: decrease pgbench scale factor to 32 for postgresql database test (`pr#53627 `_, Xiubo Li) +* qa: fix cephfs-mirror unwinding and 'fs volume create/rm' order (`pr#52656 `_, Jos Collin) +* qa: fix keystone in rgw/crypt/barbican.yaml (`pr#53412 `_, Ali Maredia) +* qa: ignore expected
cluster warning from damage tests (`pr#53484 `_, Patrick Donnelly) +* qa: lengthen shutdown timeout for thrashed MDS (`pr#53553 `_, Patrick Donnelly) +* qa: move nfs (mgr/nfs) related tests to fs suite (`pr#53906 `_, Dhairya Parmar, Venky Shankar) +* qa: wait for file to have correct size (`pr#52742 `_, Patrick Donnelly) +* qa: wait for MDSMonitor tick to replace daemons (`pr#52235 `_, Patrick Donnelly) +* RadosGW API: incorrect bucket quota in response to HEAD /{bucket}/?usage (`pr#53437 `_, shreyanshjain7174) +* rbd-mirror: fix image replayer shut down description on force promote (`pr#52880 `_, Prasanna Kumar Kalever) +* rbd-mirror: fix race preventing local image deletion (`pr#52627 `_, N Balachandran) +* rbd-nbd: fix stuck with disable request (`pr#54254 `_, Prasanna Kumar Kalever) +* read balancer documentation (`pr#52777 `_, Laura Flores) +* Rgw overview dashboard backport (`pr#53065 `_, Aashish Sharma) +* rgw/amqp: remove possible race conditions with the amqp connections (`pr#53516 `_, Yuval Lifshitz) +* rgw/amqp: skip idleness tests since it needs to sleep longer than 30s (`pr#53506 `_, Yuval Lifshitz) +* rgw/crypt: apply rgw_crypt_default_encryption_key by default (`pr#52796 `_, Casey Bodley) +* rgw/crypt: don't deref null manifest_bl (`pr#53590 `_, Casey Bodley) +* rgw/kafka: failed to reconnect to broker after idle timeout (`pr#53513 `_, Yuval Lifshitz) +* rgw/kafka: make sure that destroy is called after connection is removed (`pr#53515 `_, Yuval Lifshitz) +* rgw/keystone: EC2Engine uses reject() for ERR_SIGNATURE_NO_MATCH (`pr#53762 `_, Casey Bodley) +* rgw/multisite[archive zone]: fix storing of bucket instance info in the new bucket entrypoint (`pr#53466 `_, Shilpa Jagannath) +* rgw/notification: pass in bytes_transferred to populate object_size in sync notification (`pr#53377 `_, Juan Zhu) +* rgw/notification: remove non x-amz-meta-\* attributes from bucket notifications (`pr#53375 `_, Juan Zhu) +* rgw/notifications: allow cross tenant 
notification management (`pr#53510 `_, Yuval Lifshitz) +* rgw/s3: ListObjectsV2 returns correct object owners (`pr#54161 `_, Casey Bodley) +* rgw/s3select: fix per QE defect (`pr#54163 `_, galsalomon66) +* rgw/s3select: s3select fixes related to Trino/TPCDS benchmark and QE tests (`pr#53034 `_, galsalomon66) +* rgw/sal: get_placement_target_names() returns void (`pr#53584 `_, Casey Bodley) +* rgw/sync-policy: Correct "sync status" & "sync group" commands (`pr#53395 `_, Soumya Koduri) +* rgw/upgrade: point upgrade suites to ragweed ceph-reef branch (`pr#53797 `_, Shilpa Jagannath) +* RGW: add admin interfaces to get and delete notifications by bucket (`pr#53509 `_, Ali Masarwa) +* rgw: add radosgw-admin bucket check olh/unlinked commands (`pr#53823 `_, Cory Snyder) +* rgw: add versioning info to radosgw-admin bucket stats output (`pr#54191 `_, Cory Snyder) +* RGW: bucket notification - hide auto generated topics when listing topics (`pr#53507 `_, Ali Masarwa) +* rgw: don't dereference nullopt in DeleteMultiObj (`pr#54124 `_, Casey Bodley) +* rgw: fetch_remote_obj() preserves original part lengths for BlockDecrypt (`pr#52816 `_, Casey Bodley) +* rgw: fetch_remote_obj() uses uncompressed size for encrypted objects (`pr#54371 `_, Casey Bodley) +* rgw: fix 2 null versionID after convert_plain_entry_to_versioned (`pr#53398 `_, rui ma, zhuo li) +* rgw: fix multipart upload object leaks due to re-upload (`pr#52615 `_, J. 
Eric Ivancich) +* rgw: fix rgw rate limiting RGWRateLimitInfo class decode_json max_rea… (`pr#53765 `_, xiangrui meng) +* rgw: fix SignatureDoesNotMatch when extra headers start with 'x-amz' (`pr#53770 `_, rui ma) +* rgw: fix unwatch crash at radosgw startup (`pr#53760 `_, lichaochao) +* rgw: handle http options CORS with v4 auth (`pr#53413 `_, Tobias Urdin) +* rgw: improve buffer list utilization in the chunkupload scenario (`pr#53773 `_, liubingrun) +* rgw: pick http_date in case of http_x_amz_date absence (`pr#53440 `_, Seena Fallah, Mohamed Awnallah) +* rgw: retry metadata cache notifications with INVALIDATE_OBJ (`pr#52798 `_, Casey Bodley) +* rgw: s3 object lock avoids overflow in retention date (`pr#52604 `_, Casey Bodley) +* rgw: s3website doesn't prefetch for web_dir() check (`pr#53767 `_, Casey Bodley) +* RGW: Solving the issue of not populating etag in Multipart upload result (`pr#51447 `_, Ali Masarwa) +* RGW:notifications: persistent topics are not deleted via radosgw-admin (`pr#53514 `_, Ali Masarwa) +* src/mon/Monitor: Fix set_elector_disallowed_leaders (`pr#54003 `_, Kamoltat) +* test/crimson/seastore/rbm: add sub-tests regarding RBM to the existing tests (`pr#53967 `_, Myoungwon Oh) +* test/TestOSDMap: don't use the deprecated std::random_shuffle method (`pr#52737 `_, Leonid Usov) +* valgrind: UninitCondition under __run_exit_handlers suppression (`pr#53681 `_, Mark Kogan) +* xfstests_dev: install extra packages from powertools repo for xfsprogs (`pr#52843 `_, Xiubo Li) + v18.2.0 Reef ============ -- 2.39.5