git.apps.os.sepia.ceph.com Git

]> git.apps.os.sepia.ceph.com Git - ceph.git/log

Greg Farnum [Thu, 2 Feb 2012 00:28:35 +0000 (16:28 -0800)]

osd: d'oh again! Make this real exponential, not...ever-linear.

Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>

commit | commitdiff | tree

Greg Farnum [Thu, 2 Feb 2012 00:28:18 +0000 (16:28 -0800)]

osd: OpRequest currently_* needs to look at latest, not hit.

D'oh!

Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>

commit | commitdiff | tree

Greg Farnum [Thu, 2 Feb 2012 00:05:32 +0000 (16:05 -0800)]

Merge remote branch 'origin/master' into wip-osd-op-tracking

Conflicts:
src/osd/ReplicatedPG.h

commit | commitdiff | tree

Greg Farnum [Wed, 1 Feb 2012 21:25:37 +0000 (13:25 -0800)]

osd: add check_ops_in_flight()

By default it warns on requests that are more than 30 seconds old,
using an exponential backoff of that interval.
Also add state name retrieval to OpRequest.

Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>

commit | commitdiff | tree

Greg Farnum [Mon, 30 Jan 2012 22:50:28 +0000 (14:50 -0800)]

osd: "mark" OpRequests as they move through the system.

Right now these are just informational flags which can be read out. Later
they might extend to timing information, separate lists for more precise
control over latency warnings, etc.

Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>

commit | commitdiff | tree

Greg Farnum [Thu, 26 Jan 2012 01:30:07 +0000 (17:30 -0800)]

PG: switch op passing interface to use OpRequest

This is all the PG/ReplicatedPG internals and the few remaining OSD callers.

Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>

commit | commitdiff | tree

Greg Farnum [Wed, 25 Jan 2012 23:51:58 +0000 (15:51 -0800)]

osd: switch op passing interface to use OpRequest instead of raw Messages

This doesn't handle the PG internals yet.

Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>

commit | commitdiff | tree

Greg Farnum [Wed, 25 Jan 2012 23:48:44 +0000 (15:48 -0800)]

osd: add new OpRequest struct and an xlist to track it

Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>

commit | commitdiff | tree

Yehuda Sadeh [Wed, 1 Feb 2012 20:55:52 +0000 (12:55 -0800)]

cls_rgw: update bucket index when deleting object (with pending)

Bug #2012. Racing delete with other operations (update or another
delete) failed to update the bucket index.

Signed-off-by: Yehuda Sadeh <yehuda.sadeh@dreamhost.com>

commit | commitdiff | tree

Sage Weil [Wed, 1 Feb 2012 18:55:45 +0000 (10:55 -0800)]

Merge remote branch 'gh/wip-divergent-backfill'

Reviewed-by: Samuel Just <samuel.just@dreamhost.com>

commit | commitdiff | tree

Sage Weil [Wed, 1 Feb 2012 04:06:27 +0000 (20:06 -0800)]

osd: fix assignment in PG::rewind_divergent_log()

Signed-off-by: Sage Weil <sage.weil@dreamhost.com>

commit | commitdiff | tree

Sage Weil [Wed, 1 Feb 2012 00:18:52 +0000 (16:18 -0800)]

Merge remote-tracking branch 'gh/wip-journal-crc'

Reviewed-by: Josh Durgin <josh.durgin@dreamhost.com>

commit | commitdiff | tree

Greg Farnum [Thu, 12 Jan 2012 20:42:21 +0000 (12:42 -0800)]

msgr: Document recv_stamp and add a dispatch_stamp and throttle_wait.

Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>

commit | commitdiff | tree

Sage Weil [Tue, 31 Jan 2012 21:00:45 +0000 (13:00 -0800)]

qa: test_backfill.sh: take osd.0 down

Mark this down to
1- trigger the WaitActingChange vs osd down race, and
2- help trigger a divergnet log when osd.2 is blackholed+restarted during
backfill. e.g.,

./ceph -- tell osd.1 injectargs '--filestore-blackhole' ; sleep 10 ; ./init-ceph restart osd.1

Signed-off-by: Sage Weil <sage@newdream.net>

commit | commitdiff | tree

Sage Weil [Tue, 31 Jan 2012 17:53:32 +0000 (09:53 -0800)]

osd: restart peering if requesting acting osd goes down

If we request an acting set, we need to restart peering if one of the
requested nodes goes down. This prevents a deadlock where we get stuck
in WaitActingChange because we have [a,b], want [a,b,c], but c is down and
our up and acting don't actually change.

Signed-off-by: Sage Weil <sage@newdream.net>

commit | commitdiff | tree

Sage Weil [Tue, 31 Jan 2012 17:40:23 +0000 (09:40 -0800)]

osd: rename recovery event NeedNewMap -> NeedActingChange

This is more precise.

Signed-off-by: Sage Weil <sage@newdream.net>

commit | commitdiff | tree

Sage Weil [Tue, 31 Jan 2012 15:23:10 +0000 (07:23 -0800)]

osd: use RecoveryContext transaction, finishers on recovery completion

We should use the enclosing transaction and finisher list here.

Signed-off-by: Sage Weil <sage.weil@dreamhost.com>

commit | commitdiff | tree

Sage Weil [Tue, 31 Jan 2012 15:16:37 +0000 (07:16 -0800)]

qa: test_backfill.sh: limit pg log length so we trigger backfill

Signed-off-by: Sage Weil <sage.weil@dreamhost.com>

commit | commitdiff | tree

Sage Weil [Tue, 31 Jan 2012 15:25:04 +0000 (07:25 -0800)]

osd: fix divergent backfill targets

During peering, a previous backfill target may have a slightly newer
last_update than the other options, but it will not be chosen because it
is incomplete. That caused a failed assert during activate() (#1983).

To fix, we remove the bad assert, and then fix merge_log() so that the
replica/backfill target will trim its divergent entries when it gets the
activation MLogRec. We also fix the handling of MInfoRec, as that can
trigger the same analogous condition.

Signed-off-by: Sage Weil <sage.weil@dreamhost.com>

commit | commitdiff | tree

Sage Weil [Tue, 31 Jan 2012 01:39:23 +0000 (17:39 -0800)]

filestore: implement filestore_blackhole hook

If true, we'll drop any new transactions on the floor. Useful for
triggering failure conditions (e.g., prior to killing ceph-osd itself, to
ensure some operations don't reach the local disk).

Signed-off-by: Sage Weil <sage.weil@dreamhost.com>

commit | commitdiff | tree

Yehuda Sadeh [Tue, 31 Jan 2012 01:00:37 +0000 (17:00 -0800)]

rgw: should remove bucket dir instead of sending intent

that was really useless, and also bucket cleanup was broken anyway.

Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>

commit | commitdiff | tree

Yehuda Sadeh [Tue, 31 Jan 2012 00:48:15 +0000 (16:48 -0800)]

librados: fix a leak

watch notification message was missing a ->put()

Signed-off-by: Yehuda Sadeh <yehuda.sadeh@dreamhost.com>

commit | commitdiff | tree

Sage Weil [Mon, 30 Jan 2012 22:27:24 +0000 (14:27 -0800)]

osd: disable clone overlap for push/pull

There is a bug in the push/pull code. Disable the recovery smarts by
default until we fix #2002.

There is currently a race (in the callers) where:
- an adjacent clone is missing
- we (calculate some clone overlap? and) start pulling
- we get adjacent clone
- we get push, calc a different overlap, and then get confused.

Signed-off-by: Sage Weil <sage@newdream.net>

commit | commitdiff | tree

Sage Weil [Mon, 30 Jan 2012 21:42:45 +0000 (13:42 -0800)]

Merge remote branch 'gh/wip-warnings'

commit | commitdiff | tree

Sage Weil [Mon, 30 Jan 2012 05:46:53 +0000 (21:46 -0800)]

mon: make 'osd [out|in|down]' succeed if already whatever

If we want something out and it is already out, succeed. This makes the
client command succeed if there is a transient error and it gets resent.

Signed-off-by: Sage Weil <sage@newdream.net>

commit | commitdiff | tree

Sage Weil [Mon, 30 Jan 2012 05:05:08 +0000 (21:05 -0800)]

qa: encoding: silence warning

This is cheating, but we always use this class with int types, so it makes
this go away:

warning: test/encoding.cc:79:20: ‘*((void*)(& tu)+4).ConstructorCounter::data’ may be used uninitialized in this function [-Wuninitialized]

Signed-off-by: Sage Weil <sage.weil@dreamhost.com>

commit | commitdiff | tree

Sage Weil [Mon, 30 Jan 2012 04:56:03 +0000 (20:56 -0800)]

qa: test/gather fix warning

warning: test/gather.cc:29:222: passing NULL to non-pointer argument 3 of ‘static testing::AssertionResult testing::internal::EqHelper::Compare(const char*, const char*, const T1&, T2*) [with T1 = long int, T2 = C_Gather]’ [-Wconversion-null]

Signed-off-by: Sage Weil <sage.weil@dreamhost.com>

commit | commitdiff | tree

Sage Weil [Mon, 30 Jan 2012 04:54:18 +0000 (20:54 -0800)]

qa: test/rados-api/list fix warning

warning: test/rados-api/list.cc:43:156: converting ‘false’ to pointer type for argument 1 of ‘char testing::internal::IsNullLiteralHelper(testing::internal::Secret*)’ [-Wconversion-null]

Signed-off-by: Sage Weil <sage.weil@dreamhost.com>

commit | commitdiff | tree

Sage Weil [Mon, 30 Jan 2012 04:36:46 +0000 (20:36 -0800)]

test_ipaddr: reverse ASSERT_EQ order

Make these warnings go away:

warning: test/test_ipaddr.cc:217:156: converting ‘false’ to pointer type for argument 1 of ‘char testing::internal::IsNullLiteralHelper(testing::internal::Secret*)’ [-Wconversion-null]

Signed-off-by: Sage Weil <sage.weil@dreamhost.com>

commit | commitdiff | tree

Sage Weil [Mon, 30 Jan 2012 01:26:55 +0000 (17:26 -0800)]

osd: remove unused var

warning: osd/PG.cc:1331:20: variable 'plu' set but not used [-Wunused-but-set-variable]

Signed-off-by: Sage Weil <sage.weil@dreamhost.com>

commit | commitdiff | tree

Sage Weil [Mon, 30 Jan 2012 01:26:14 +0000 (17:26 -0800)]

admin_socket: fix uninit warning

warning: common/admin_socket_client.cc:166:19: 'socket_fd' may be used uninitialized in this function [-Wuninitialized]

Signed-off-by: Sage Weil <sage.weil@dreamhost.com>

commit | commitdiff | tree

Sage Weil [Sun, 29 Jan 2012 17:26:28 +0000 (09:26 -0800)]

mon: trim old auth states

These aren't exposed outside the monitor, so we really only keep them
around to assist in mon recovery. Give ourselves a healthy margin over
the max join drift for that.

Fixes: #2000
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>

commit | commitdiff | tree

Sage Weil [Sun, 29 Jan 2012 16:48:22 +0000 (08:48 -0800)]

filestore: fix rollback when current/ missing entirely

This can happen when we are starting, rolling back, remove current/, and
then fail before we snapshot a snap_ into place.

Most of the logic was already in place for this; we tried to fix it in
cd2dedd7d190a43a6be50a7f18849fe0123c72bc but missed this piece.

Fixes: #1999
Signed-off-by: Sage Weil <sage@newdream.net>

commit | commitdiff | tree

Sage Weil [Sat, 28 Jan 2012 01:32:28 +0000 (17:32 -0800)]

osd: reset pgstats timer when we reopen monitor session

Otherwise we'll reopen every second from here on out, without giving the
new session a chance to start up and do it's thing.

Signed-off-by: Sage Weil <sage@newdream.net>

commit | commitdiff | tree

Sage Weil [Sat, 28 Jan 2012 19:40:08 +0000 (11:40 -0800)]

clock: ignore clock_offset if cct is NULL

This is helpful e.g. from assert.

Signed-off-by: Sage Weil <sage@newdream.net>

commit | commitdiff | tree

Sage Weil [Thu, 26 Jan 2012 01:35:49 +0000 (17:35 -0800)]

filejournal: add corruption test to check crc checking code

Verify that the journal replay rejects a corrupted journal entry.

Signed-off-by: Sage Weil <sage.weil@dreamhost.com>

commit | commitdiff | tree

Sage Weil [Thu, 26 Jan 2012 00:37:34 +0000 (16:37 -0800)]

filejournal: assume gibberish flags imply none

Old journals didn't properly initialize the flags (oops). Assume that
any bits besides the first 2 imply no flags.

Make note that this hack needs to be removed after some time has passed,
but well before these new flags are used. Or, such use should be
accompanied by a full header format rev and incompatibility.

Signed-off-by: Sage Weil <sage.weil@dreamhost.com>

commit | commitdiff | tree

Sage Weil [Thu, 26 Jan 2012 00:36:17 +0000 (16:36 -0800)]

filejournal: include crc in entry header/footer

Use the unused flags field for this. Previously it was always 0, so this
lets us skip old entries on old journals and only worry about missing one
out of 2^32 corruptions. New journals get a flag that strictly enforces
the crc check.

Signed-off-by: Sage Weil <sage.weil@dreamhost.com>

commit | commitdiff | tree

Sage Weil [Mon, 23 Jan 2012 20:03:32 +0000 (12:03 -0800)]

qa: test_filejournal: test lots of small writes too

Signed-off-by: Sage Weil <sage.weil@dreamhost.com>

commit | commitdiff | tree

Sage Weil [Sat, 28 Jan 2012 19:08:52 +0000 (11:08 -0800)]

qa: add test_filejournal

Signed-off-by: Sage Weil <sage.weil@dreamhost.com>

commit | commitdiff | tree

Sage Weil [Thu, 26 Jan 2012 00:12:42 +0000 (16:12 -0800)]

filejournal: fix header initialization

Make sure it's zeros to start with. Currently flags might be gibberish!

Signed-off-by: Sage Weil <sage.weil@dreamhost.com>

commit | commitdiff | tree

Sage Weil [Tue, 24 Jan 2012 01:00:28 +0000 (17:00 -0800)]

filejournal: clean up some errno checks

Signed-off-by: Sage Weil <sage.weil@dreamhost.com>

commit | commitdiff | tree

Sage Weil [Thu, 19 Jan 2012 00:24:52 +0000 (16:24 -0800)]

filejournal: assert submit_entry gets >0 bytes

Signed-off-by: Sage Weil <sage.weil@dreamhost.com>

commit | commitdiff | tree

Sage Weil [Thu, 19 Jan 2012 00:24:38 +0000 (16:24 -0800)]

filejournal: initialize header before writing

Avoid writing uninitialized crap.

Signed-off-by: Sage Weil <sage.weil@dreamhost.com>

commit | commitdiff | tree

Sage Weil [Thu, 19 Jan 2012 00:21:35 +0000 (16:21 -0800)]

filejournal: move zero_buf allocation

We need header.alignment to be defined.

Signed-off-by: Sage Weil <sage.weil@dreamhost.com>

commit | commitdiff | tree

Sage Weil [Sat, 28 Jan 2012 17:38:46 +0000 (09:38 -0800)]

client: do not send release to down mds

We can have a session with state where the mds is not up; don't blindly
send a message or we can get

./mds/MDSMap.h: In function 'const entity_inst_t MDSMap::get_inst(int)', in thread '0x7f092aad1910'
./mds/MDSMap.h: 465: FAILED assert(up.count(m))
ceph version 0.35-6-g6eb8862 (commit:6eb8862e91d142451e256aaa02b34c81a4f21dea)
1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x70) [0x71f11a]
2: (MDSMap::get_inst(int)+0x4b) [0x6dc191]
3: (Client::flush_cap_releases()+0x94) [0x677e60]
4: (Client::tick()+0x1f0) [0x690adc]
5: (C_C_Tick::finish(int)+0x1c) [0x6f3fbe]
6: (SafeTimer::timer_thread()+0x2c5) [0x6fbfe5]
7: (SafeTimerThread::entry()+0x19) [0x6fe399]
8: (Thread::_entry_func(void*)+0x20) [0x72e944]
9: /lib/libpthread.so.0 [0x7f092dea573a]
10: (clone()+0x6d) [0x7f092cba169d]

with a map like

$ ./ceph mds dump 85
2012-01-28 09:37:19.251946 mon <- [mds,dump,85]
2012-01-28 09:37:19.252618 mon.1 -> 'dumped mdsmap epoch 85' (0)
epoch   85
flags   0
created 2012-01-28 09:24:42.411202
modified        2012-01-28 09:28:45.093301
tableserver     0
root    0
session_timeout 60
session_autoclose       300
last_failure    0
last_failure_osd_epoch  18
compat  compat={},rocompat={},incompat={1=base v0.20,2=client writeable ranges,3=default file layouts on dirs,4=dir inode in separate object}
max_mds 1
in      0
up      {}
failed  0
stopped
data_pools      [0]
metadata_pool   1

Signed-off-by: Sage Weil <sage.weil@dreamhost.com>

commit | commitdiff | tree

Sage Weil [Sat, 28 Jan 2012 18:04:45 +0000 (10:04 -0800)]

Merge branch 'stable'

commit | commitdiff | tree

Sage Weil [Sat, 28 Jan 2012 17:26:46 +0000 (09:26 -0800)]

signal: use _exit() on SIGTERM

No need to call onexit handlers, static dtors, whatever.

This may help with #1996 and #1549.

Signed-off-by: Sage Weil <sage.weil@dreamhost.com>

commit | commitdiff | tree

Josh Durgin [Fri, 27 Jan 2012 19:45:26 +0000 (11:45 -0800)]

test: add script for checking admin socket 'objecter_requests' output

Just a couple internal consistency checks for now. More specific ones
would depend on workload.

Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com>

commit | commitdiff | tree

Josh Durgin [Mon, 23 Jan 2012 21:04:14 +0000 (13:04 -0800)]

objecter: add an admin socket command to get in-flight requests

Fixes: #1881
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com>

commit | commitdiff | tree

Josh Durgin [Mon, 23 Jan 2012 23:04:17 +0000 (15:04 -0800)]

admin socket: increase debug level for successful requests

Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com>

commit | commitdiff | tree

Josh Durgin [Sat, 21 Jan 2012 01:02:52 +0000 (17:02 -0800)]

admin socket: add include guard

Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com>

commit | commitdiff | tree

Josh Durgin [Fri, 20 Jan 2012 23:58:37 +0000 (15:58 -0800)]

CephContext: add method for retrieving admin socket

This is needed to allow higher layers in the stack to add admin socket
commands.

Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com>

commit | commitdiff | tree

Sage Weil [Sat, 28 Jan 2012 00:40:53 +0000 (16:40 -0800)]

Merge branch 'wip-pg-stale'

commit | commitdiff | tree

Sage Weil [Fri, 27 Jan 2012 21:27:27 +0000 (13:27 -0800)]

mon: stale pgs -> HEALTH_WARN

Signed-off-by: Sage Weil <sage.weil@dreamhost.com>

commit | commitdiff | tree

Sage Weil [Fri, 27 Jan 2012 21:21:39 +0000 (13:21 -0800)]

mon: mark pgs stale in pg_map if primary osd is down

This alerts the administrator when all OSDs for a PG have failed and the
monitor doesn't receive any further updates. Otherwise we may continue
to think a pg is active+clean when it is in fact offline.

Fixes: #1993
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>

commit | commitdiff | tree

Sage Weil [Fri, 27 Jan 2012 21:02:28 +0000 (13:02 -0800)]

osd: add STALE pg state bit

Signed-off-by: Sage Weil <sage.weil@dreamhost.com>

commit | commitdiff | tree

Sage Weil [Fri, 27 Jan 2012 18:42:21 +0000 (10:42 -0800)]

v0.41

commit | commitdiff | tree

Sage Weil [Fri, 27 Jan 2012 20:23:33 +0000 (12:23 -0800)]

objector: document Objecter::init_ops()

Signed-off-by: Sage Weil <sage.weil@dreamhost.com>

commit | commitdiff | tree

Sage Weil [Fri, 27 Jan 2012 20:23:23 +0000 (12:23 -0800)]

objecter: fix out_* initialization

This looks more like the real cause for #1986. Op ctor gets a vector of
ops but out_* aren't initialized to match.

Signed-off-by: Sage Weil <sage.weil@dreamhost.com>

commit | commitdiff | tree

Greg Farnum [Thu, 12 Jan 2012 19:27:55 +0000 (11:27 -0800)]

Revert "common/Throttle: Remove unused return type on Throttle::get()"

This reverts commit 4549501c9b0968ce4243e06ff7e9ef03b19de667.
We're about to use it to avoid a time lookup if possible.

Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>

commit | commitdiff | tree

Greg Farnum [Wed, 25 Jan 2012 23:58:49 +0000 (15:58 -0800)]

osd: remove unused PG::block_if_wrlocked declaration

Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>

commit | commitdiff | tree

Sage Weil [Fri, 27 Jan 2012 18:41:50 +0000 (10:41 -0800)]

filestore: dump offending transaction on any error

Clean this code up to explicitly whitelist what is ok so that the flow is
less annoying to follow/maintain, and so that we dump the transaction
contents on whitelisted errors.

Signed-off-by: Sage Weil <sage@newdream.net>
Reviewed-by: Josh Durgin <josh.durgin@dreamhost.com>

commit | commitdiff | tree

Sage Weil [Fri, 27 Jan 2012 18:40:14 +0000 (10:40 -0800)]

objecter: warn when OSD returns mismatched op vector

The osd shouldn't do this (even though we should tolerate it).

Signed-off-by: Sage Weil <sage@newdream.net>
Reviewed-by: Greg Farnum <gregory.farnum@dreamhost.com>

commit | commitdiff | tree

Sage Weil [Fri, 27 Jan 2012 18:39:49 +0000 (10:39 -0800)]

objecter: fix bounds checking on op reply demuxing

We can't assume that the size of out_ops (from the reply) matches the
op->out_* vectors from our request state. In particular, the out_ops might
be shorter than what we sent the OSD if the OSD was sloppy. Check them.

We can assume that op->ops and op->out_* all match; assert as much in
op_submit().

Fixes: #1986
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Reviewed-by: Greg Farnum <gregory.farnum@dreamhost.com>

commit | commitdiff | tree

Sage Weil [Fri, 27 Jan 2012 18:01:47 +0000 (10:01 -0800)]

mds: remove test assert

Grr!

Signed-off-by: Sage Weil <sage@newdream.net>

commit | commitdiff | tree

Sage Weil [Fri, 27 Jan 2012 14:32:29 +0000 (06:32 -0800)]

assert: include timestamp

Also drop quotes around thread id.

Signed-off-by: Sage Weil <sage.weil@dreamhost.com>

commit | commitdiff | tree

Greg Farnum [Wed, 25 Jan 2012 23:33:28 +0000 (15:33 -0800)]

osd: remove the unused require_current_map

Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>

commit | commitdiff | tree

Sage Weil [Wed, 25 Jan 2012 22:07:06 +0000 (14:07 -0800)]

filestore: fix typo

Grr

Signed-off-by: Sage Weil <sage.weil@dreamhost.com>

commit | commitdiff | tree

Sage Weil [Wed, 25 Jan 2012 22:03:18 +0000 (14:03 -0800)]

Merge branch 'wip-kb'

Reviewed-by: Samuel Just <samuel.just@dreamhost.com>

commit | commitdiff | tree

Sage Weil [Wed, 25 Jan 2012 21:52:32 +0000 (13:52 -0800)]

filestore: zero btrfs vol_args prior to ioctl

Just to be paranoid. Nothing we haven't set *should* affect the ABI,
but...

Always do this immediately after declaration so that we catch everything.

Signed-off-by: Sage Weil <sage.weil@dreamhost.com>

commit | commitdiff | tree

Samuel Just [Wed, 25 Jan 2012 21:58:58 +0000 (13:58 -0800)]

Merge remote branch 'upstream/wip-osd-clone-obc'

commit | commitdiff | tree

Sage Weil [Wed, 25 Jan 2012 20:38:59 +0000 (12:38 -0800)]

mon: num_kb -> num_bytes in cluster perfcounters

Signed-off-by: Sage Weil <sage.weil@dreamhost.com>

commit | commitdiff | tree

Sage Weil [Wed, 25 Jan 2012 20:38:06 +0000 (12:38 -0800)]

osd: remove num_kb from object_stat_sum_t stats

This is redundant--we can just use num_bytes. If we're worried about the
per-object overhead or rounding, we can factor in some overhead based on
num_objects.

And, the kb accounting has a bug (#1988).

Avoid changing the encoding at all for now. Next time the encoding changes
we'll drop the old field.

Signed-off-by: Sage Weil <sage.weil@dreamhost.com>

commit | commitdiff | tree

Sage Weil [Wed, 25 Jan 2012 17:56:58 +0000 (09:56 -0800)]

osd: improve object context debug output

Include pointer. This may help with #1979.

Signed-off-by: Sage Weil <sage@newdream.net>

commit | commitdiff | tree

Sage Weil [Wed, 25 Jan 2012 06:03:51 +0000 (22:03 -0800)]

osd: track obc for clone from log replay

We need to keep an in-memory obc to track the state of the in-flight io
to disk. This is analogous to when an object is pushed + written, and we
can share the same completion function.

Signed-off-by: Sage Weil <sage@newdream.net>

commit | commitdiff | tree

Sage Weil [Wed, 25 Jan 2012 05:34:27 +0000 (21:34 -0800)]

osd: set object_info_t::oid properly when recovering clones

I saw a case (#1973) where the clone had the oid set to the head. That is
clearly wrong. Not sure what damage this caused.

Signed-off-by: Sage Weil <sage@newdream.net>

commit | commitdiff | tree

Sage Weil [Wed, 25 Jan 2012 05:19:44 +0000 (21:19 -0800)]

Merge remote branch 'gh/wip-filestore-errors'

commit | commitdiff | tree

Alexandre Oliva [Tue, 17 Jan 2012 19:22:17 +0000 (17:22 -0200)]

package *.py* files

Some post-install rpmbuild defaults byte-compile all packaged python
files, so don't bother removing the .pyc files, and package .py* to
get both .pyo and .pyc. It wastes a tiny little bit of space, but it
makes the spec file portable across a wider range of rpm and python
configurations.

Signed-off-by: Alexandre Oliva <oliva@lsd.ic.unicam.br>
Signed-off-by: Sage Weil <sage@newdream.net>

commit | commitdiff | tree

Josh Durgin [Wed, 25 Jan 2012 00:52:27 +0000 (16:52 -0800)]

librbd: don't infinite loop when header is too large

Since snapshots are currently stored at the end of the header, having
many snapshots made the header larger than the read size, resulting in
an infinite loop when the offset was not changed.

Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com>
Reviewed-by: Samuel Just <samuel.just@dreamhost.com>

commit | commitdiff | tree

Samuel Just [Tue, 24 Jan 2012 22:57:07 +0000 (14:57 -0800)]

ReplicatedPG: data_subset may be empty during sub_op_push

Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
Reviewed-by: Josh Durgin <josh.durgin@dreamhost.com>

commit | commitdiff | tree

Josh Durgin [Tue, 24 Jan 2012 21:23:21 +0000 (13:23 -0800)]

filestore: fix non-::-prefixed close

Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com>

commit | commitdiff | tree

Josh Durgin [Tue, 24 Jan 2012 21:20:20 +0000 (13:20 -0800)]

filestore: add debugging to each error case in lfn_open

Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com>

commit | commitdiff | tree

Sage Weil [Tue, 24 Jan 2012 21:16:30 +0000 (13:16 -0800)]

filestore: TEMP_FAILURE_RETRY on ::close(2)

Signed-off-by: Sage Weil <sage.weil@dreamhost.com>

commit | commitdiff | tree

Sage Weil [Tue, 24 Jan 2012 17:31:39 +0000 (09:31 -0800)]

filestore: return -errno from lfn_open

Signed-off-by: Sage Weil <sage.weil@dreamhost.com>

commit | commitdiff | tree

Sage Weil [Tue, 24 Jan 2012 17:31:33 +0000 (09:31 -0800)]

filestore: audit + clean up error checks

- use temp var for errno
- in general return -errno from helpers

Signed-off-by: Sage Weil <sage.weil@dreamhost.com>

commit | commitdiff | tree

Sage Weil [Mon, 23 Jan 2012 21:50:19 +0000 (13:50 -0800)]

Merge commit '9dc7b9233b985bf859751fc89a5b02253e829836'

Reviewed-by: Greg Farnum <gregory.farnum@dreamhost.com>

commit | commitdiff | tree

Sage Weil [Mon, 23 Jan 2012 20:48:46 +0000 (12:48 -0800)]

rgw: fix warning

rgw/rgw_rest.cc:258: warning: comparison between signed and unsigned integer expressions

Signed-off-by: Sage Weil <sage@newdream.net>

commit | commitdiff | tree

Sage Weil [Mon, 23 Jan 2012 20:43:19 +0000 (12:43 -0800)]

ceph: bail out on first failing command

Signed-off-by: Sage Weil <sage@newdream.net>

commit | commitdiff | tree

Sage Weil [Mon, 23 Jan 2012 20:43:03 +0000 (12:43 -0800)]

ceph: don't write output on error

Accumulate all output, and write it at the end. This way we can avoid
writing it if any of the commands fail.

Fixes: #1954
Signed-off-by: Sage Weil <sage@newdream.net>

commit | commitdiff | tree

Sage Weil [Mon, 23 Jan 2012 18:21:04 +0000 (10:21 -0800)]

osd: ignore MInfoRec, MNotifyRec in WaitActingChange

We should ignore logs, infos, and notifies while we are waiting for the
map to change. Peering has reached a dead-end (we need acting to change)
and we will redo our work when that happens. That includes the replicas
resending notifies.

Fixes: #1958
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Reviewed-by: Samuel Just <samuel.just@dreamhost.com>

commit | commitdiff | tree

Yehuda Sadeh [Mon, 23 Jan 2012 17:50:56 +0000 (09:50 -0800)]

rgw: fix warning in 32bit arch

commit | commitdiff | tree

Josh Durgin [Thu, 19 Jan 2012 01:34:50 +0000 (17:34 -0800)]

pg: unindex entries when clearing or removing from the log

Leaving the index around could cause use of the indexes to access
freed memory.

Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com>

commit | commitdiff | tree

Sage Weil [Thu, 19 Jan 2012 02:01:09 +0000 (18:01 -0800)]

osd: do not clobber log on backfill progress update

This is unnecessary and counterproductive, since the log is used to detect
dup ops. It's an artifact of an earlier backfill iteration that didn't
preserve the log on the backfill target.

Signed-off-by: Sage Weil <sage.weil@dreamhost.com>

commit | commitdiff | tree

Yehuda Sadeh [Fri, 20 Jan 2012 20:54:14 +0000 (12:54 -0800)]

rgw: read_user_buckets() fix redone

The problem with the original fix is that it wasn't atomic. Going back
to the original inefficient (though atomic) method. We should limit
the number of buckets per user anyway, and shouldn't get into a point
where this code is actually execised.

Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>

commit | commitdiff | tree

Sage Weil [Sun, 15 Jan 2012 05:15:02 +0000 (21:15 -0800)]

osd: implement --dump-journal

Dump the contents of the journal to stdout in text form. Useful for
debugging.

Signed-off-by: Sage Weil <sage.weil@dreamhost.com>

commit | commitdiff | tree

Yehuda Sadeh [Fri, 20 Jan 2012 18:46:31 +0000 (10:46 -0800)]

rgw: read large bucket directory correctly

Issue #1955. When there wre too many buckets, we failed reading
the bucket directory.

Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>

commit | commitdiff | tree

Yehuda Sadeh [Thu, 19 Jan 2012 17:11:09 +0000 (09:11 -0800)]

rgw: fix warning

Signed-off-by: Yehuda Sadeh <yehuda.sadeh@dreamhost.com>

commit | commitdiff | tree

Sage Weil [Thu, 19 Jan 2012 04:41:04 +0000 (20:41 -0800)]

Merge remote branch 'gh/wip-op-data-mux'

Reviewed-by: Greg Farnum <greg.farnum@dreamhost.com>
Reviewed-by: Yehuda Sadeh <yehuda.sadeh@dreamhost.com>

commit | commitdiff | tree

Neil Horman [Wed, 18 Jan 2012 17:00:14 +0000 (12:00 -0500)]

Convert mount.ceph to use KEY_SPEC_PROCESS_KEYRING

having mount.ceph use KEY_SPEC_USER_KEYRING to pass keys to the kernel has
several disadvantages:

1) It leaves the key setting in the uid_keyring, which is reachable from the
session keyring via a link (see keyctl list <root session keyring ref>).  This
means its accessible to other processes in the same session that don't need
access to it, even after the kernel is done with it.

2) The user keyring has some very counter-intuitive semantics as far as keyring
permissions goes.  The user keyring is access via a link from the session
keyring, which a process may not have permission to access in some situations.
For instance if mount.ceph is executed via su without having started a new
session, mount.ceph will not have access to the uid keyring unless the calling
proces (in this case su) has granted access permission.  The result is a -EPERM
error when executing mount.ceph to a cephx enabled server.  If the same command
is attempted in a new root session (e.g. su - or su -l), the mount command will
work fine

Switching the mount.ceph command to use the KEY_SPEC_PROCESS_KEYRING solves both
of these problems.  By using this keyring, accessibility is guaranteed because
its added and accessed in the same process context both in user space and the
kernel, assuring aceesability, despite the session specifics.  It also ensures
that the key will get cleaned up after the mount.ceph process exits
automatically, since there is no longer a need for it (the kernel clones the key
during the mount process and releases it on unmount).

I've tested this here on my local ceph cluster, and it works properly under both
su and su -l .

Signed-off-by: Neil Horman <nhorman@tuxdriver.com>
CC: Josh Durgin <josh.durgin@dreamhost.com>

Unnamed repository; edit this file 'description' to name the repository.

RSS Atom