Samuel Just [Fri, 29 Aug 2014 21:04:04 +0000 (14:04 -0700)]
PG::init: clear rollback info for backfill as well
Otherwise, we won't remove the old rollback objects from a resurrected pg. In
rare cases, this can cause us to get an EEXIST if we happen to reuse the same
rename id on the same object in a subsequent interval.
Samuel Just [Wed, 27 Aug 2014 23:21:41 +0000 (16:21 -0700)]
PG::can_discard_op: do discard old subopreplies
Otherwise, a sub_op_reply from a previous interval can stick around
until we either one day go active again and get rid of it or delete the
pg which is holding it on its waiting_for_active list. While it sticks
around futily waiting for the pg to once more go active, it will cause
harmless slow request warnings.
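A minimal sketch of the discard rule described above, with illustrative names (not Ceph's actual API): a reply carries the map epoch of the interval it was sent in, and if that epoch predates the PG's current interval the reply is stale and can be dropped instead of parked on `waiting_for_active`.

```python
def can_discard_reply(reply_epoch: int, same_interval_since: int) -> bool:
    """A subop reply sent before the PG's current interval began is stale."""
    return reply_epoch < same_interval_since

# A reply from epoch 40 against a PG whose interval began at epoch 45
# is discarded; a reply from the current interval is kept.
stale = can_discard_reply(40, 45)
fresh = can_discard_reply(45, 45)
```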
Fixes: #9259
Backport: firefly
Signed-off-by: Samuel Just <sam.just@inktank.com>
Yehuda Sadeh [Fri, 22 Aug 2014 04:53:38 +0000 (21:53 -0700)]
rgw: clear bufferlist if write_data() successful
Fixes: #9201
Backport: firefly
We sometimes need to call RGWPutObjProcessor::handle_data() again,
so that we send the pending data. However, we failed to clear the buffer
that was already sent, and thus it was resent. This triggers when using
non-default pool alignments.
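A toy model of the fix, with illustrative names rather than the real RGWPutObjProcessor interface: the processor accumulates pending data, `write_data()` sends some aligned prefix, and the sent portion must be dropped from the buffer or the next `handle_data()` call resends it.

```python
class PutObjProcessor:
    """Illustrative stand-in for RGWPutObjProcessor (not the real class)."""
    def __init__(self, align: int):
        self.align = align
        self.pending = b""
        self.written = b""

    def write_data(self, data: bytes) -> int:
        # send only whole aligned chunks; return how many bytes went out
        n = (len(data) // self.align) * self.align
        self.written += data[:n]
        return n

    def handle_data(self, data: bytes):
        self.pending += data
        sent = self.write_data(self.pending)
        self.pending = self.pending[sent:]  # the fix: clear what was sent

proc = PutObjProcessor(align=4)
proc.handle_data(b"abcdef")   # sends "abcd"; "ef" stays pending
proc.handle_data(b"gh")       # sends "efgh"; nothing is resent
```

Without the slice after `write_data()`, the second call would send the already-written prefix again.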
Then, we begin to flush 15 with a delete with snapc 4:[4] leaving the
backing pool with:
4:[4]:[4(4)]
Then, we finish flushing 15 with snapc 9:[4], leaving the backing
pool with:
9:[4]:[4(4)]+head
Next, snaps 10 and 15 are removed causing clone 10 to be removed leaving
the cache with:
30:[29,21,20,4]:[22(21),4(4)]+head
We next begin to flush 22 by sending a delete with snapc 4:[4] since
prev_snapc is 4 <---------- here is the bug
The backing pool ignores this request since 4 < 9 (ORDERSNAP) leaving it
with:
9:[4]:[4(4)]
Then, we complete flushing 22 with snapc 19:[4] leaving the backing pool
with:
19:[4]:[4(4)]+head
Then, we begin to flush head by deleting with snapc 22:[21,20,4] leaving
the backing pool with:
22:[21,20,4]:[22(21,20),4(4)]
Finally, we flush head leaving the backing pool with:
30:[29,21,20,4]:[22(21*,20*),4(4)]+head
When we go to flush clone 22, all we know is that 22 is dirty, has snaps
[21], and 4 is clean. As part of flushing 22, we need to do two things:
1) Ensure that the current head is cloned as cloneid 4 with snaps [4] by
sending a delete at snapc 4:[4].
2) Flush the data at snap sequence < 21 by sending a copyfrom with snapc
20:[20,4].
Unfortunately, it is possible that 1, 1&2, or 1 and part of the flush
process for some other now non-existent clone have already been
performed. Because of that, between 1) and 2), we need to send
a second delete ensuring that the object does not exist at 20.
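The ORDERSNAP behavior the walkthrough relies on can be modeled with a toy backing pool (a deliberate simplification; cloning and real snap sets are elided): the pool tracks the highest snapc seq it has applied and silently ignores any op whose snapc seq is older, which is exactly why the stale delete at snapc 4 has no effect once the pool has seen seq 9.

```python
class BackingPool:
    """Toy model of ORDERSNAP ordering on the base pool."""
    def __init__(self):
        self.seq = 0

    def delete(self, snapc_seq: int) -> bool:
        if snapc_seq < self.seq:   # ORDERSNAP: stale snap context, ignore
            return False
        self.seq = snapc_seq
        return True

pool = BackingPool()
pool.delete(4)            # initial delete at snapc 4 applies
pool.seq = 9              # a later flush moved the pool's seq to 9
ignored = not pool.delete(4)   # the buggy resend at snapc 4 is dropped (4 < 9)
```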
Fixes: #9054
Backport: firefly
Signed-off-by: Samuel Just <sam.just@inktank.com>
Sage Weil [Tue, 26 Aug 2014 15:16:29 +0000 (08:16 -0700)]
osd/OSDMap: encode blacklist in deterministic order
When we use an unordered_map the encoding order is non-deterministic,
which is problematic for OSDMap. Construct an ordered map<> on encode
and use that. This lets us keep the hash table for lookups in the general
case.
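The approach can be illustrated in miniature (Python dicts preserve insertion order, unlike C++'s `unordered_map`, so treat the two differently-ordered inputs as standing in for two hash-table iteration orders): build a sorted copy only at encode time, and keep the hash map for fast lookups the rest of the time.

```python
def encode_blacklist(blacklist: dict) -> bytes:
    """Encode in a deterministic order regardless of insertion/iteration order."""
    ordered = sorted(blacklist.items())   # ordered map built only for encoding
    return repr(ordered).encode()

a = encode_blacklist({"10.0.0.2:0/123": 5, "10.0.0.1:0/456": 3})
b = encode_blacklist({"10.0.0.1:0/456": 3, "10.0.0.2:0/123": 5})
# a == b: the two encodings match byte-for-byte
```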
Fixes: #9211
Backport: firefly
Signed-off-by: Sage Weil <sage@redhat.com>
We need to identify whether an object is just composed of a head, or
also has a tail. Test for pre-firefly objects ("explicit objs") was
broken as it was just looking at the number of explicit objs in the
manifest. However, this is insufficient, as we might have empty head,
and in this case it wouldn't appear, so we need to check whether the
sole object is actually pointing at the head.
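A hedged sketch of the corrected test, with an invented manifest shape (the real RGW manifest is richer): counting explicit objs alone misclassifies an object whose head is empty, so the check must also confirm that the single entry actually points at the head.

```python
def head_only(manifest: dict, head_name: str) -> bool:
    """Object has no tail iff the sole explicit obj is the head itself."""
    objs = manifest["explicit_objs"]
    return len(objs) == 1 and objs[0] == head_name

just_head = head_only({"explicit_objs": ["myobj"]}, "myobj")
tail_only = head_only({"explicit_objs": ["myobj_tail.1"]}, "myobj")
```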
Somnath Roy [Mon, 18 Aug 2014 23:59:36 +0000 (16:59 -0700)]
CollectionIndex: Collection name is added to the access_lock name
The CollectionIndex constructor is changed to accept the coll_t so
that the collection name can be used to form the access_lock (RWLock)
name. This is needed because lockdep requires a unique lock name for
each Index object; otherwise it reports a recursive lock error and
asserts.
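The naming scheme can be sketched in one line (the format string is illustrative, not the exact one used): deriving the lock name from the collection id gives each Index's lock a distinct identity as far as lockdep is concerned.

```python
def access_lock_name(coll: str) -> str:
    """Per-collection lock name so lockdep sees distinct locks."""
    return f"CollectionIndex::access_lock({coll})"

a = access_lock_name("1.0_head")
b = access_lock_name("1.1_head")
```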
Sage Weil [Thu, 21 Aug 2014 20:05:35 +0000 (13:05 -0700)]
mon: fix occasional message leak after session reset
Consider:
- we get a message, put it on a wait list
- the client session resets
- we go back to process the message later and discard
- _ms_dispatch returns false, but nobody drops the msg ref
Since we call _ms_dispatch() a lot internally, we need to always return
true when we are an internal caller.
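A loose toy model of the ownership rule (refcounting and dispatch are heavily simplified; the real monitor code differs): whoever returns true from dispatch owns the message and must drop its reference, so an internal caller that returns false on the discard path leaves the ref dangling.

```python
class Msg:
    """Stand-in for a refcounted Message."""
    def __init__(self):
        self.nref = 1
    def put(self):
        self.nref -= 1

def ms_dispatch(msg: Msg, internal: bool) -> bool:
    handled = False            # e.g. session was reset, message discarded
    if handled or internal:
        msg.put()              # we own the ref; drop it
        return True
    return False

leaked = Msg()
ms_dispatch(leaked, internal=False)  # buggy path: nobody drops the ref
fixed = Msg()
ms_dispatch(fixed, internal=True)    # fixed path: internal caller drops it
```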
Fixes: #9176
Backport: firefly, dumpling
Signed-off-by: Sage Weil <sage@redhat.com>
Boris Ranto [Fri, 15 Aug 2014 17:34:27 +0000 (19:34 +0200)]
Fix -Wno-format and -Werror=format-security options clash
This causes a build failure in the latest Fedora builds: ceph_test_librbd_fsx adds the -Wno-format cflag, but the default AM_CFLAGS already contain -Werror=format-security. Previous releases tolerated this combination, but the latest Fedora rawhide no longer does. ceph_test_librbd_fsx builds fine without -Wno-format on x86_64, so there is likely no need for the flag anymore.
Signed-off-by: Boris Ranto <branto@redhat.com>
Reviewed-by: Sage Weil <sage@redhat.com>
Sage Weil [Fri, 15 Aug 2014 15:55:10 +0000 (08:55 -0700)]
osd: only require crush features for rules that are actually used
Often there will be a CRUSH rule present for erasure coding that uses the
new CRUSH steps or indep mode. If these rules are not referenced by any
pool, we do not need clients to support the mapping behavior. This is true
because the encoding has not changed; only the expected CRUSH output.
Fixes: #8963
Backport: firefly
Signed-off-by: Sage Weil <sage@redhat.com>
Sage Weil [Wed, 13 Aug 2014 23:17:02 +0000 (16:17 -0700)]
mon/Paxos: share state and verify contiguity early in collect phase
We verify peons are contiguous and share new paxos states to catch peons
up at the end of the round. Do this each time we (potentially) get new
states via a collect message. This will allow peons to be pulled forward
and remain contiguous when they otherwise would not have been able to.
For example, if we got mon.1 first and then mon.2 second, we would store the new txns
and then boot mon.1 out at the end because 15..25 is not contiguous with
28..40. However, with this change, we share 26..30 to mon.1 when we get
the collect, and then 31..40 when we get mon.2's collect, pulling them
both into the final quorum.
It also breaks the 'catch-up' work into smaller pieces, which ought to
smooth out latency a bit.
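The sharing step can be sketched with the numbers from the example above (version ranges as plain integers; the real code ships serialized paxos transactions): when a collect arrives from a peon whose history overlaps or abuts ours, send it the committed states it is missing, pulling it forward so it stays contiguous with the final quorum.

```python
def share(our_first: int, our_last: int, peon_last: int) -> list:
    """Versions to send so a contiguous/overlapping peon catches up."""
    if peon_last + 1 >= our_first:            # can be caught up with commits
        return list(range(peon_last + 1, our_last + 1))
    return []                                  # gap: commits alone won't help

# mon.1 has ..25; when we hold 26..30 we share 26..30 on its collect
sent = share(26, 30, 25)
```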
Sage Weil [Thu, 14 Aug 2014 23:55:58 +0000 (16:55 -0700)]
mon/Paxos: verify all new peons are still contiguous at end of round
During the collect phase we verify that each peon has versions overlapping
or contiguous with ours (and can therefore be caught up with some
series of transactions). However, we *also* assimilate any new states we
get from those peers, and that may move our own first_committed forward
in time. This means that an early responder might have originally been
contiguous, but a later one moved us forward, and when the round finished
they were not contiguous any more. This leads to a crash on the peon
when they get our first begin message.
For example:
- we have 10..20
- first peon has 5..15
- ok!
- second peon has 18..30
- we apply this state
- we are now 18..30
- we finish the round
- send commit to first peon (empty.. we aren't contiguous)
- send no commit to second peon (we match)
- we send a begin for state 31
- first peon crashes (its lc is still 15)
Prevent this by checking at the end of the round if we are still
contiguous. If not, bootstrap. This is similar to the check we do above,
but reverse to make sure *we* aren't too far ahead of *them*.
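The recheck described above reduces to a one-line predicate (a simplification; real paxos state carries more than two integers): after assimilating peer states our first_committed may have advanced, and any peon whose last_committed no longer reaches it cannot be caught up by commits alone, so we bootstrap instead of sending it a begin it cannot apply.

```python
def still_contiguous(our_first: int, peon_last: int) -> bool:
    """Peon can be caught up iff its history overlaps or abuts ours."""
    return peon_last + 1 >= our_first

# From the example: we start at 10..20 and the first peon has 5..15 -> ok.
before = still_contiguous(10, 15)
# After assimilating 18..30 from the second peon, it no longer is.
after = still_contiguous(18, 15)
```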
Fixes: #9053
Signed-off-by: Sage Weil <sage@redhat.com>
Loic Dachary [Tue, 3 Jun 2014 20:20:29 +0000 (22:20 +0200)]
erasure-code: parse function for the mapping parameter
Each D letter is a data chunk. For instance:
_DDD_DDD
is going to parse into:
[ 1, 2, 3, 5, 6, 7 ]
the 0 and 4 positions are not used by chunks and do not show in the
mapping. Implement ErasureCode::parse to support a reasonable default
for the mapping parameter.
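The parse described above fits in a one-line sketch (illustrative Python, not the C++ ErasureCode::parse itself): each 'D' marks a position used by a chunk, and every other character marks an unused position that is skipped.

```python
def parse_mapping(mapping: str) -> list:
    """Positions marked 'D' hold chunks; other positions are unused."""
    return [i for i, c in enumerate(mapping) if c == 'D']

result = parse_mapping("_DDD_DDD")   # -> [1, 2, 3, 5, 6, 7]
```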
Add support for erasure code plugins that do not sequentially map the
chunks encoded to the corresponding index. This is mostly transparent to
the caller, except when it comes to retrieving the data chunks when
reading. For this purpose there needs to be a remapping function so the
caller has a way to figure out which chunks actually contain the data
and reorder them.
Somnath Roy [Mon, 30 Jun 2014 08:28:07 +0000 (01:28 -0700)]
FileStore: Index caching is introduced for performance improvement
IndexManager now has an Index cache; an Index is created only if it is
not found in the cache. Previously, each op created an Index object,
and other ops requesting the same index had to wait until the previous
op was done; after the lookup finished, the Index object was destroyed.
Creating and destroying these objects on every op was a major
performance hit, so an Index cache has been implemented to persist
them. An RWLock has been introduced in the CollectionIndex class and is
responsible for synchronizing lookup and create.
Also, since these Index objects are now persistent, there is no need to
use smart pointers, so Index is now a wrapper class around a
CollectionIndex*. It is now the responsibility of the users of Index to
lock explicitly before using it. The Index object alone is sufficient
for locking; there is no need to hold an IndexPath. The function
interfaces of lfn_open and lfn_find are changed accordingly.
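The caching scheme can be sketched as follows (illustrative API; the stand-in dict takes the place of a real CollectionIndex object): the manager keeps one Index per collection and creates it only on a miss, so concurrent ops share the cached object instead of rebuilding it.

```python
import threading

class IndexManager:
    """Illustrative cache: one Index per collection, created on miss."""
    def __init__(self):
        self._lock = threading.Lock()
        self._cache = {}

    def get_index(self, coll: str):
        with self._lock:
            idx = self._cache.get(coll)
            if idx is None:
                # stand-in for a CollectionIndex, with its own access lock
                idx = {"coll": coll, "access_lock": threading.Lock()}
                self._cache[coll] = idx
            return idx

mgr = IndexManager()
a = mgr.get_index("1.0_head")
b = mgr.get_index("1.0_head")   # cache hit: same object, not a new one
```

Mirroring the commit's rule, callers would take `idx["access_lock"]` explicitly before using the index.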
Signed-off-by: Somnath Roy <somnath.roy@sandisk.com>