crimson/os/seastore/rbm: improve large sequential write by issuing writes in parallel
In large chunk workload, for instance, when 4MB data is divided into 64KB chunks,
the current implementation writes sequentially one by one. On general NVMe SSD
writing large chunk (>=64KB) can lead to relatively higher latency compared to a small write case,
resulting in performance degradation. Although NVMe SSD shows the increased
latency in such cases, this does not mean that thay reach their performance limit,
as they can handle multiple concurrent requests.
This commit improves parallelism by using parallel_for_each() to issue writes in parallel.