git.apps.os.sepia.ceph.com Git

]> git.apps.os.sepia.ceph.com Git - ceph.git/log

projects / ceph.git / log

commit | commitdiff | tree

Sage Weil [Wed, 26 Aug 2015 18:57:28 +0000 (14:57 -0400)]

rocksdb: update to 3.11.2

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Wed, 26 Aug 2015 18:54:00 +0000 (14:54 -0400)]

os/RocksDBStore: make other rmkey match

No need for Slice() here; it can take a string.

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Wed, 26 Aug 2015 18:52:56 +0000 (14:52 -0400)]

os/RocksDBStore: fix rmkey()

This took way too long to debug!

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Wed, 26 Aug 2015 17:55:45 +0000 (13:55 -0400)]

ceph_test_keyvaluedb: some simple KeyValueDB unit tests

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Mon, 24 Aug 2015 21:59:34 +0000 (17:59 -0400)]

os/newstore: fix end bound on collection_list

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Sat, 22 Aug 2015 14:33:40 +0000 (10:33 -0400)]

os/newstore: flush object before doing omap reads

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Tue, 18 Aug 2015 21:22:32 +0000 (17:22 -0400)]

os/newstore: add 'newstore backend options' to pass options to e.g. rocksdb

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Tue, 18 Aug 2015 19:33:39 +0000 (15:33 -0400)]

os/newstore: change escaping chars

# is lowest besides space and !, except for " (which would be too
confusing).

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Tue, 18 Aug 2015 19:09:21 +0000 (15:09 -0400)]

os/newstore: trim overlay when zeroing extent

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Tue, 18 Aug 2015 19:08:55 +0000 (15:08 -0400)]

os/newstore: tolerate null pnext to collection_list()

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Tue, 18 Aug 2015 18:57:47 +0000 (14:57 -0400)]

os/newstore: fix collection range for temp objects

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Xiaoxi Chen [Thu, 7 May 2015 07:41:20 +0000 (15:41 +0800)]

os/newstore: Implement fiemap

For simplicity we ignore holes inside an fragment now.

Signed-off-by: Xiaoxi Chen <xiaoxi.chen@intel.com>

commit | commitdiff | tree

Sage Weil [Mon, 4 May 2015 18:05:27 +0000 (11:05 -0700)]

os/newstore: make sync/async submit_transaction optional

It seems doing this synchronously may be better for SSDs?

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Sat, 2 May 2015 23:29:24 +0000 (16:29 -0700)]

os/newstore: renamed TransContext::fds -> sync_items

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Sat, 2 May 2015 00:22:57 +0000 (17:22 -0700)]

os/newstore: queue kv transactions in kv_sync_thread

It appears that db->submit_transaction() will block if there is a sync
commit that is in progress instead of simply queueing the new txn for
later. To work around this, submit these to the backend in the
kv_sync_thread prior to the synchronous submit_transaction_sync().

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Sat, 2 May 2015 00:21:23 +0000 (17:21 -0700)]

os/newstore: process multiple aio completions at a time

This isn't affecting things for a slow disk, but it will matter for faster
backends.

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Wed, 29 Apr 2015 22:00:46 +0000 (15:00 -0700)]

os/newstore: clean up kv commit debug output

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Wed, 29 Apr 2015 21:51:00 +0000 (14:51 -0700)]

os/newstore: only ftruncate if i_size is incorrect

Even a no-op ftruncate can block in the kernel. Prior to this change I
could frequently see ftruncate wait for an aio completion on the same
file.

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Wed, 29 Apr 2015 20:57:40 +0000 (13:57 -0700)]

Revert "os/newstore: avoid sync append for small ios"

This reverts commit 69baab2f7eaca7688ce1d45802a82fc3539cd906.

This is slower. :(

commit | commitdiff | tree

Sage Weil [Wed, 29 Apr 2015 18:52:55 +0000 (11:52 -0700)]

os/newstore: avoid sync append for small ios

An append is expensive in terms of latency (write, fdatasync, kv commit),
while a wal write is just the kv commit and the write and fdatasync are
async. For small IOs doing the wal may improve performance.

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Tue, 28 Apr 2015 23:11:05 +0000 (16:11 -0700)]

rocksdb: fallocate_with_keep_size = false

This improves my 4k random writes on hdd by about 25%.

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Thu, 23 Apr 2015 23:10:32 +0000 (16:10 -0700)]

Revert "os/NewStore: data_map shouldn't be empty when writing all overlays"

This reverts commit 0d9cce462fec61f754ddcd17cf9a3cf69581d7c5.

We may want to write an overlay if hte object is new and the write is small to defer the cost
of the fsync.

commit | commitdiff | tree

Zhiqiang Wang [Wed, 29 Apr 2015 06:32:25 +0000 (14:32 +0800)]

os/NewStore: delay the read of all the overlays until wal applying

The read of all the overlays can be delayed until applying the wal. If
we are doing async wal apply, this can reduce write op latency by
eliminating unnecessary reads in the write code path.

Signed-off-by: Zhiqiang Wang <zhiqiang.wang@intel.com>

commit | commitdiff | tree

Xiaoxi Chen [Wed, 29 Apr 2015 05:59:16 +0000 (13:59 +0800)]

os/newstore: fix deadlock when newstore_sync_transaction=true

There is a deadlock issue in Newstore when newstore_sync_transaction = true.
With sync_transaction to true, the txc state machine will go all the way down
from STATE_IO_DONE to STATE_FINISHING in the same thread, while holding the osr->qlock().
The deadlock is caused in _txc_finish and _osr_reap_done, when trying to
lock osr->qlock again.

Since the _txc_finish can be called with(in sync transaction mode) or without
(in async transaction mode) holding the qlock, so fix this by setting the qlock
to PTHREAD_MUTEX_RECURSIVE, thus we can recursive acquire the qlock.

Signed-off-by: Xiaoxi Chen <xiaoxi.chen@intel.com>

commit | commitdiff | tree

Zhiqiang Wang [Wed, 29 Apr 2015 06:10:51 +0000 (14:10 +0800)]

os/NewStore: fix the append of the later overlays when doing combination

The data of the later contiguous overlays should be claim_append to
'op->data', instead of 'bl'.

Signed-off-by: Zhiqiang Wang <zhiqiang.wang@intel.com>

commit | commitdiff | tree

Xiaoxi Chen [Wed, 29 Apr 2015 05:45:52 +0000 (13:45 +0800)]

os/Newstore: flush_commit return true on STATE_KV_DONE

There is a racing condition here, if the flush_commit() call
happened after _txc_finish_kv and before next state, the context
was pushed to on_commits but no one will handle the context since
we already pass _txc_finish_kv. This bug can be easily reproduce
by putting a sleep(5) after _txc_finish_kv, and trigger the bug by
ceph-osd -i 0 --mkfs.

Fix this bug by return true directly when state >= STATE_KV_DONE(instead
of > in previous code). We already persist the data in STATE_KV_DONE so
it's safe for us to do this.

Signed-off-by: Xiaoxi Chen <xiaoxi.chen@intel.com>

commit | commitdiff | tree

Zhiqiang Wang [Tue, 28 Apr 2015 08:24:16 +0000 (16:24 +0800)]

os/NewStore: avoid dup the data of the overlays in the WAL

When writing all the overlays, there is no need to dup the data in WAL.
Instead, we can reference the overlays in the WAL, and remove these
overlays after commiting them to the fs. When replaying, we can get
these data from the referenced overlays. Doing this way, we can save a
write and a deletion for each of the overlay data in the db.

Signed-off-by: Zhiqiang Wang <zhiqiang.wang@intel.com>

commit | commitdiff | tree

Sage Weil [Tue, 28 Apr 2015 16:47:09 +0000 (09:47 -0700)]

os/newstore: fix multiple aio case

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Tue, 28 Apr 2015 16:28:13 +0000 (09:28 -0700)]

os/newstore: more conservative default for aio queue depth

There appears to be a kernel aio bug when the queue depth is small.

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Xiaoxi Chen [Tue, 28 Apr 2015 12:56:13 +0000 (20:56 +0800)]

os/newstore:close fd after writting with O_DIRECT

fix bug in 2b4c60e0a521ad10b94bbc82865b49f2d28c2ac9

Signed-off-by: Xiaoxi Chen <xiaoxi.chen@intel.com>

commit | commitdiff | tree

Zhiqiang Wang [Tue, 28 Apr 2015 08:41:39 +0000 (16:41 +0800)]

os/NewStore: need to increase the wal op length when combining overlays

Need to add the length of the combining overlays to the length of the
wal op.

Signed-off-by: Zhiqiang Wang <zhiqiang.wang@intel.com>

commit | commitdiff | tree

Xiaoxi Chen [Fri, 17 Apr 2015 08:14:41 +0000 (16:14 +0800)]

os/Newstore:Fix collection_list_range

We need to rule out hobject_t::max before calling get_object_key
(in which will call get_filestore_key_u32 and get an assert failure)

Signed-off-by: Xiaoxi Chen <xiaoxi.chen@intel.com>

commit | commitdiff | tree

Sage Weil [Mon, 27 Apr 2015 21:42:55 +0000 (14:42 -0700)]

os/newstore: fix race in _txc_aio_submit

We cannot rely on the iterator pointers being valid after we submit the
aio because we are racing with the completion. Make our loop decision
before submitting and avoid dereferencing txc after that point.

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Xiaoxi Chen [Mon, 27 Apr 2015 08:28:33 +0000 (16:28 +0800)]

os/newstore : Do not need to call fdatasync if using direct.

skip ::fdatasync if in direct mode.

Signed-off-by: Xiaoxi Chen <xiaoxi.chen@intel.com>

commit | commitdiff | tree

Zhiqiang Wang [Mon, 27 Apr 2015 08:27:21 +0000 (16:27 +0800)]

osd/NewStore: fix for skipping the overlay in _do_overlay_trim

When the offset of the write starts at the end of the overlay, that is,
p->first + p->second.length == offset, the overlay could be skipped as
well.

Signed-off-by: Zhiqiang Wang <zhiqiang.wang@intel.com>

commit | commitdiff | tree

Zhiqiang Wang [Mon, 27 Apr 2015 08:15:26 +0000 (16:15 +0800)]

os/NewStore: combine contiguous overlays when writing all the overlays

Combine contiguous overlay writes to reduce the numbers of WAL writes
and fs writes.

Signed-off-by: Zhiqiang Wang <zhiqiang.wang@intel.com>

commit | commitdiff | tree

Xiaoxi Chen [Mon, 27 Apr 2015 07:49:28 +0000 (15:49 +0800)]

os/Nestore: batch cleanup

batch cleanup wal.

Signed-off-by: Xiaoxi Chen <xiaoxi.chen@intel.com>

commit | commitdiff | tree

Sage Weil [Fri, 24 Apr 2015 20:41:35 +0000 (13:41 -0700)]

os/newstore: fix _txc_aio_submit

The aios may complete before _txc_aio_submit completes. In fact, the aio
may complete, commit to the kv store, and then queue more wal aio's before
we finish the loop. Move aios to a separate list to ensure we only submit
them once and do not right another CPU adjusting the list.

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Thu, 23 Apr 2015 21:51:51 +0000 (14:51 -0700)]

os/newstore: throttle over entire write lifecycle

Take a global throttle when we submit ops and release when they complete.
The first throttles cover the period from submit to commit, while the wal
ones also cover the async post-commit wal work. The configs are additive
since the wal ones cover both periods; this should make them reasonably
idiot-proof.

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Zhiqiang Wang [Thu, 23 Apr 2015 02:34:37 +0000 (10:34 +0800)]

os/NewStore: data_map shouldn't be empty when writing all overlays

This should be an assert instead of creating new data_map.

Signed-off-by: Zhiqiang Wang <zhiqiang.wang@intel.com>

commit | commitdiff | tree

Zhiqiang Wang [Thu, 23 Apr 2015 02:25:15 +0000 (10:25 +0800)]

os/NewStore: clear the shared_overlays after writing all the overlays

Signed-off-by: Zhiqiang Wang <zhiqiang.wang@intel.com>

commit | commitdiff | tree

Zhiqiang Wang [Thu, 23 Apr 2015 01:41:35 +0000 (09:41 +0800)]

os/NewStore: don't clear overlay in the create/append case of write

Shouldn't clear the overlay in the create/append case of write.
Otherwise, this removes the overlay data and leads to data loss.

Signed-off-by: Zhiqiang Wang <zhiqiang.wang@intel.com>

commit | commitdiff | tree

Sage Weil [Tue, 28 Jul 2015 17:24:00 +0000 (13:24 -0400)]

os/newstore: debug io_submit EAGAIN

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Thu, 23 Apr 2015 00:22:32 +0000 (17:22 -0700)]

os/newstore: release wal throttle when wal completes, not when queued

If we take the aio path, the io is queued immediately and the resources
are released back to the pool. Instead release them when wal completes.

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Tue, 28 Jul 2015 17:22:50 +0000 (13:22 -0400)]

os/newstore: todo

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Tue, 21 Apr 2015 00:10:19 +0000 (17:10 -0700)]

os/newstore: move toward state-machine

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Mon, 20 Apr 2015 22:33:11 +0000 (15:33 -0700)]

os/newstore: use aio for wal writes, too

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Mon, 20 Apr 2015 19:48:38 +0000 (12:48 -0700)]

os/newstore: a few comments about wal

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Mon, 20 Apr 2015 19:49:46 +0000 (12:49 -0700)]

os/newstore: combined O_DSYNC with O_DIRECT

This avoids the need for an explicit fdatasync when doing O_DIRECT.

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Mon, 20 Apr 2015 17:34:04 +0000 (10:34 -0700)]

os/newstore: basic aio support

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Mon, 20 Apr 2015 17:33:14 +0000 (10:33 -0700)]

os/Newstore: add newstore_db_path option

The load of Keyvalue DB is heavy, allow user to put
DB to a seperate(fast) device.

Signed-off-by: Xiaoxi Chen <xiaoxi.chen@intel.com>

commit | commitdiff | tree

Sage Weil [Thu, 16 Apr 2015 23:30:31 +0000 (16:30 -0700)]

os/newstore: throttle wal work

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Thu, 16 Apr 2015 23:01:12 +0000 (16:01 -0700)]

os/newstore: show # o_direct buffers in debug output

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Thu, 16 Apr 2015 22:01:20 +0000 (15:01 -0700)]

os/newstore: use a threadpool for applying wal events

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Thu, 16 Apr 2015 21:08:55 +0000 (14:08 -0700)]

os/newstore: rebuild buffers to be page-aligned for O_DIRECT

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Thu, 16 Apr 2015 00:10:48 +0000 (17:10 -0700)]

ceph_test_objectstore: fix omap test cleanup

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Wed, 15 Apr 2015 22:35:16 +0000 (15:35 -0700)]

os/newstore: use fdatasync instead of fsync

On XFS at least, fdatasync is sufficient to make data readable.

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Wed, 15 Apr 2015 17:35:26 +0000 (10:35 -0700)]

os/newstore: update todo

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Xiaoxi Chen [Wed, 15 Apr 2015 16:10:08 +0000 (00:10 +0800)]

os/Newstore: Check onode.omap_head in valid() and next()

The db iter will be set to KeyValueDB::Iterator() if onode.omap_head
not present. In that case if we touch the db iter we will get a segmentation
fault.

Prevent to touch the db iter when onode.omap_head is invalid(equals to 0).

Signed-off-by: Xiaoxi Chen <xiaoxi.chen@intel.com>

commit | commitdiff | tree

Xiaoxi Chen [Wed, 15 Apr 2015 08:22:28 +0000 (16:22 +0800)]

Use .str() to output a stringstream.

a nit.

Signed-off-by: Xiaoxi Chen <xiaoxi.chen@intel.com>

commit | commitdiff | tree

Xiaoxi Chen [Mon, 13 Apr 2015 08:21:00 +0000 (16:21 +0800)]

os/Newstore: Allow gap in _do_write append mode

We can allow some gap so we only need to ensure
onode.size <= offset.

Signed-off-by: Xiaoxi Chen <xiaoxi.chen@intel.com>

commit | commitdiff | tree

Xiaoxi Chen [Wed, 8 Apr 2015 15:35:50 +0000 (23:35 +0800)]

Implement get_omap_iterator

implemented get_omap_iterator

Signed-off-by: Xiaoxi Chen <xiaoxi.chen@intel.com>

commit | commitdiff | tree

Xiaoxi Chen [Sun, 12 Apr 2015 03:28:13 +0000 (11:28 +0800)]

os/KeyValueDB: Add raw_key() interface for IteratorImpl

raw_key() is useful to split out the prefix.

Signed-off-by: Xiaoxi Chen <xiaoxi.chen@intel.com>

commit | commitdiff | tree

Xiaoxi Chen [Fri, 10 Apr 2015 01:39:44 +0000 (09:39 +0800)]

test/store_test Add get_omap_iterator test cases

omap iterator test cases include:
  iter aganist omap
  lower_bound
  upper_bound

Signed-off-by: Xiaoxi Chen <xiaoxi.chen@intel.com>

commit | commitdiff | tree

Sage Weil [Mon, 17 Aug 2015 19:23:19 +0000 (15:23 -0400)]

os/newstore: drop sync()

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Mon, 17 Aug 2015 19:23:03 +0000 (15:23 -0400)]

os/newstore: drop sync()

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Mon, 17 Aug 2015 19:22:42 +0000 (15:22 -0400)]

os/newstore: drop flush

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Mon, 17 Aug 2015 19:22:26 +0000 (15:22 -0400)]

os/newstore: drop sync_and_flush

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Fri, 10 Apr 2015 23:55:18 +0000 (16:55 -0700)]

os/newstore: use FS::zero()

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Fri, 10 Apr 2015 23:49:07 +0000 (16:49 -0700)]

os/newstore: use O_DIRECT is write is page-aligned

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Fri, 10 Apr 2015 23:35:40 +0000 (16:35 -0700)]

os/newstore: pass flags to _{open,create}_fid

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Fri, 10 Apr 2015 23:33:00 +0000 (16:33 -0700)]

os/newstore: drop unused FragmentHandle

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Fri, 10 Apr 2015 22:29:16 +0000 (15:29 -0700)]

os/newstore: do not call completions from kv thread

Reads may call wait_wal() holding user locks, and so we cannot block
progress on WAL completion/flushing by calling callbacks that may take
user locks.

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Fri, 10 Apr 2015 21:28:13 +0000 (14:28 -0700)]

os/newstore: let wal cleanup kv txn get batched

No need to trigger another sync kv commit here; just let the next KV
commit catch it.

We could possibly do a bit better here by not waking up the kv thread at
all...

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Fri, 10 Apr 2015 18:54:01 +0000 (11:54 -0700)]

os/newstore: fix off-by-one on overlay_max_length

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Thu, 9 Apr 2015 17:52:06 +0000 (10:52 -0700)]

os/newstore: use lower_bound for finding overlay extents in map

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Thu, 9 Apr 2015 16:16:03 +0000 (09:16 -0700)]

os/newstore: use overlay even if it is a new object or append

This avoids the fsync for small writes.

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Xiaoxi Chen [Wed, 8 Apr 2015 16:13:10 +0000 (00:13 +0800)]

os/Newstore:Change assert in get_onode

db->get will return negtive when key is not found.

Signed-off-by: Xiaoxi Chen <xiaoxi.chen@intel.com>

commit | commitdiff | tree

Sage Weil [Tue, 7 Apr 2015 22:24:16 +0000 (15:24 -0700)]

os/newstore: open by handle

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Tue, 7 Apr 2015 22:22:09 +0000 (15:22 -0700)]

os/newstore: use fs abstaction layer

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Xiaoxi Chen [Tue, 7 Apr 2015 08:41:27 +0000 (16:41 +0800)]

os/newstore: cap fid_max below newstore_max_dir_size

Prevent fid_max over the max_dir_size when preallocation.

Signed-off-by: Xiaoxi Chen <xiaoxi.chen@intel.com>

commit | commitdiff | tree

Sage Weil [Tue, 7 Apr 2015 18:25:00 +0000 (11:25 -0700)]

os/newstore: keep smallish overlay extents in kv db

If we have a small overwrite, keep the extent in the key/value database.
Only write it back to the file/fragment later, and when we do, write them
all at once.

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Tue, 18 Aug 2015 14:10:54 +0000 (10:10 -0400)]

os/newstore: assigned unique nid to each new object

Use this as the key for omap (omap_head), but keep the omap_head field
so that we can tell when no omap data is present.

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Tue, 28 Jul 2015 16:59:39 +0000 (12:59 -0400)]

os/newstore: consolite collection_list to a single implementation

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Xiaoxi Chen [Fri, 3 Apr 2015 01:48:43 +0000 (09:48 +0800)]

Clear removed_collections after reap

Previous code forgot to clear the removed_collections queues
after reaped the collections in _reap_collection.

Signed-off-by: Xiaoxi Chen <xiaoxi.chen@intel.com>

commit | commitdiff | tree

Sage Weil [Wed, 1 Apr 2015 21:44:45 +0000 (14:44 -0700)]

os/newstore: ref count OpSequencer

Our OpSequencer may live longer than the ObjectStore::Sequencer interface
object does.

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Wed, 1 Apr 2015 16:46:53 +0000 (09:46 -0700)]

os/newstore: send complete overwrite to a new fid

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Tue, 18 Aug 2015 14:10:25 +0000 (10:10 -0400)]

os/newstore: clone omap

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Tue, 18 Aug 2015 14:09:20 +0000 (10:09 -0400)]

newstore: initial version

This includes a bunch of new ceph_test_objectstore tests, and a ton of fixes
to existing tests so that objects actually live inside the collections they
are written to.

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Sage Weil [Fri, 27 Mar 2015 23:07:10 +0000 (16:07 -0700)]

vstart.sh: debug newstore

commit | commitdiff | tree

Sage Weil [Tue, 1 Sep 2015 17:37:46 +0000 (13:37 -0400)]

Revert "os/Makefile.am: add os/fs/XFS.cc"

This reverts commit 32331ede41ef5b1dc4eb85304d2e86d7c027c75c.

Doh, this is in a conditional below.

commit | commitdiff | tree

David Zafman [Tue, 1 Sep 2015 17:25:08 +0000 (10:25 -0700)]

Merge pull request #5173 from ceph/wip-12000-12200

Fast read for erasure coding pool and erasure code error handling

Error handling
Reviewed-by: Loic Dachary <ldachary@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
Fast Read
Reviewed-by: David Zafman <dzafman@redhat.com>
Reviewed-by: Samuel Just <sjust@redhat.com>

commit | commitdiff | tree

Sage Weil [Tue, 1 Sep 2015 17:18:21 +0000 (13:18 -0400)]

os/Makefile.am: add os/fs/XFS.cc

Signed-off-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Kefu Chai [Tue, 1 Sep 2015 11:00:46 +0000 (19:00 +0800)]

Merge pull request #5695 from tchaikov/wip-12012

osd: translate sparse_read to read for ecpool

Reviewed-by: Sage Weil <sage@redhat.com>

commit | commitdiff | tree

Kefu Chai [Fri, 28 Aug 2015 03:36:49 +0000 (11:36 +0800)]

ceph_test_rados_api_aio: add a test for aio_sparse_read

Signed-off-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Kefu Chai [Thu, 27 Aug 2015 14:57:16 +0000 (22:57 +0800)]

ceph_test_rados_api_io: add tests for sparse_read

Signed-off-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Kefu Chai [Wed, 26 Aug 2015 07:41:13 +0000 (15:41 +0800)]

ceph_test_rados: also send sparse_read in ReadOp

Signed-off-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Kefu Chai [Fri, 28 Aug 2015 06:27:53 +0000 (14:27 +0800)]

osd: should use ec_pool() when checking for an ecpool

we were using pool.info.require_rollback() in do_osd_ops() when
handling OP_SPARSE_READ to tell if a pool is an ecpool. should
use pool.info.ec_pool() instead.

Signed-off-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

Kefu Chai [Thu, 27 Aug 2015 14:57:49 +0000 (22:57 +0800)]

osd: translate sparse_read to read for ecpool

Fixes: #12012
Signed-off-by: Kefu Chai <kchai@redhat.com>

commit | commitdiff | tree

David Zafman [Tue, 1 Sep 2015 02:28:18 +0000 (19:28 -0700)]

osd: Fix the diagnostic logging mostly to dout(20)

Signed-off-by: David Zafman <dzafman@redhat.com>

Unnamed repository; edit this file 'description' to name the repository.

RSS Atom