git.apps.os.sepia.ceph.com Git - ceph.git/log

commit | commitdiff | tree

Sage Weil [Sat, 21 Jul 2012 06:26:56 +0000 (23:26 -0700)]

v0.49

commit | commitdiff | tree

Samuel Just [Fri, 20 Jul 2012 20:09:39 +0000 (13:09 -0700)]

test/store_test.cc: verify collection_list_partial results are sorted

Synthetic test now also varies snapshots and uses a small variety of
hashes.

Signed-off-by: Samuel Just <sam.just@inktank.com>

commit | commitdiff | tree

Samuel Just [Fri, 20 Jul 2012 19:00:42 +0000 (12:00 -0700)]

os/HashIndex: use set<pair<string, hobject_t>> rather than multimap

Multimap does not make any guarantees about ordering of different
values with the same key. list_by_hash, however, assumes that
the iterator order matches hobject_t order. Thus, we use
set<pair<string, hobject_t> > to get the proper ordering.

Backport: stable

Signed-off-by: Samuel Just <sam.just@inktank.com>
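
The ordering argument in this message can be illustrated with a small sketch. It is written in Python rather than the C++ of the actual change, and the keys and object names are made up. It only shows that ordering by the whole (key, value) pair is deterministic, while ordering by key alone leaves entries with equal keys in whatever order they happened to arrive (one possible outcome of a multimap).

```python
# Hypothetical (hash_key, object_name) entries; two of them share a key.
entries = [("00A1", "obj_c"), ("00A1", "obj_a"), ("0003", "obj_b")]


def multimap_like_order(pairs):
    """Order by key only; entries with equal keys keep insertion order."""
    return sorted(pairs, key=lambda kv: kv[0])


def set_of_pairs_order(pairs):
    """Order by the whole (key, value) pair, as a set of pairs would."""
    return sorted(set(pairs))
```

With the sample data, only `set_of_pairs_order` places `obj_a` before `obj_c` under the shared key, which is the property the commit says list_by_hash relies on.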

commit | commitdiff | tree

Sage Weil [Thu, 19 Jul 2012 02:49:58 +0000 (19:49 -0700)]

add CRUSH_TUNABLES feature bit

Signed-off-by: Sage Weil <sage@inktank.com>

commit | commitdiff | tree

Josh Durgin [Wed, 18 Jul 2012 17:24:58 +0000 (10:24 -0700)]

ObjectCacher: fix cache_bytes_hit accounting

Misses are not hits!

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>

commit | commitdiff | tree

Sage Weil [Wed, 18 Jul 2012 02:19:39 +0000 (19:19 -0700)]

client: fix readdir locking

Several of the readdir-related methods were not taking client_lock.

Fixes: #1737
Backport: argonaut
Signed-off-by: Sage Weil <sage@inktank.com>

commit | commitdiff | tree

Sage Weil [Tue, 17 Jul 2012 19:38:50 +0000 (12:38 -0700)]

client: fix leak of client_lock when not initialized

Backport: argonaut
Signed-off-by: Sage Weil <sage@inktank.com>

commit | commitdiff | tree

Pascal de Bruijn | Unilogic Networks B.V [Wed, 11 Jul 2012 13:23:16 +0000 (15:23 +0200)]

Robustify ceph-rbdnamer and adapt udev rules

Below is a patch which makes the ceph-rbdnamer script more robust and
fixes a problem with the rbd udev rules.

On our setup we encountered a symlink which was linked to the wrong rbd:

  /dev/rbd/mypool/myrbd -> /dev/rbd1

While that link should have gone to /dev/rbd3 (on which a
partition /dev/rbd3p1 was present).

The old udev rule passes %n to the ceph-rbdnamer script. The problem
with %n is that it yields 3 for rbd3 but 1 for rbd3p1, so it cannot be
depended upon for naming rbd devices.

In the patch below the ceph-rbdnamer script is made more robust, and it
can now be called in various ways:

  /usr/bin/ceph-rbdnamer /dev/rbd3
  /usr/bin/ceph-rbdnamer /dev/rbd3p1
  /usr/bin/ceph-rbdnamer rbd3
  /usr/bin/ceph-rbdnamer rbd3p1
  /usr/bin/ceph-rbdnamer 3

Whichever of these calling styles is used, the modified script should
now return the same rbdname. This change must be combined with calling
the script from udev with %k, though.

With that fixed, we hit the second problem. We ended up with:

  /dev/rbd/mypool/myrbd -> /dev/rbd3p1

So the rbdname was symlinked to the partition on the rbd instead of the
rbd itself. What probably went wrong is that udev discovered the disk
and ran ceph-rbdnamer, which resolved it to myrbd, so the following
symlink was created:

  /dev/rbd/mypool/myrbd -> /dev/rbd3

However partitions would be discovered next and ceph-rbdnamer would be
run with rbd3p1 (%k) as parameter, resulting in the name myrbd too, with
the previous correct symlink being overwritten with a faulty one:

  /dev/rbd/mypool/myrbd -> /dev/rbd3p1

The solution to the problem is in differentiating between disks and
partitions in udev and handling them slightly differently. So with the
patch below partitions now get their own symlinks in the following style
(which is fairly consistent with other udev rules):

  /dev/rbd/mypool/myrbd-part1 -> /dev/rbd3p1

Please let me know any feedback you have on this patch or the approach
used.

Regards,
Pascal de Bruijn
Unilogic B.V.

Signed-off-by: Pascal de Bruijn <pascal@unilogicnetworks.net>
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
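
The naming rules described in this message can be sketched as follows. This is not the ceph-rbdnamer script itself (which is a shell script); it is a Python illustration, and the function names `parse_rbd_name` and `symlink_name` are hypothetical. It accepts the five calling styles listed above and distinguishes disks from partitions.

```python
import re

# Accepts "/dev/rbd3", "/dev/rbd3p1", "rbd3", "rbd3p1" or "3".
_RBD_RE = re.compile(r"^(?:/dev/)?(?:rbd)?(\d+)(?:p(\d+))?$")


def parse_rbd_name(arg):
    """Return (device_number, partition_number or None) for an rbd name."""
    m = _RBD_RE.match(arg)
    if m is None:
        raise ValueError("not an rbd device name: %r" % arg)
    dev = int(m.group(1))
    part = int(m.group(2)) if m.group(2) else None
    return dev, part


def symlink_name(image, part):
    """Disks keep the image name; partitions get a -partN suffix."""
    return image if part is None else "%s-part%d" % (image, part)
```

All five inputs resolve to device 3, so they would map to the same image, and a partition produces a separate `myrbd-part1` link instead of overwriting the disk's link.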

commit | commitdiff | tree

Sage Weil [Mon, 16 Jul 2012 23:02:14 +0000 (16:02 -0700)]

log: apply log_level to stderr/syslog logic

In non-crash situations, we want to make sure the message is both below the
syslog/stderr threshold and also below the normal log threshold. Otherwise
we get anything we gather on those channels, even when the log level is
low.

Signed-off-by: Sage Weil <sage@inktank.com>

commit | commitdiff | tree

Sage Weil [Mon, 16 Jul 2012 22:40:53 +0000 (15:40 -0700)]

log: fix event gather condition

We should gather an event if it is below the log or gather threshold.

Previously we were only gathering if we were going to print it, which makes
the dump no more useful than what was already logged.

Signed-off-by: Sage Weil <sage@inktank.com>
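
A minimal sketch of the condition described in this message, in Python rather than the project's C++. It assumes that lower numbers mean more important messages and that "below" a threshold means less than or equal to it; the function and parameter names are hypothetical.

```python
def should_gather(level, log_level, gather_level):
    """Keep the event in memory if either threshold admits it."""
    return level <= log_level or level <= gather_level


def should_gather_old(level, log_level, gather_level):
    """Behaviour as described before the fix: gather only what is printed."""
    return level <= log_level
```

With a log level of 1 and a gather level of 20, an event at level 10 is now gathered even though it is not printed, so a later dump can contain more than the log already did.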

commit | commitdiff | tree

Samuel Just [Mon, 16 Jul 2012 20:11:24 +0000 (13:11 -0700)]

PG::RecoveryState::Stray::react(LogEvt&): reset last_pg_scrub

We need to reset the last_pg_scrub data in the osd since we
are replacing the info.

Probably fixes #2453

In cases like 2453, we hit the following backtrace:

0> 2012-05-19 17:24:09.113684 7fe66be3d700 -1 osd/OSD.h: In function 'void OSD::unreg_last_pg_scrub(pg_t, utime_t)' thread 7fe66be3d700 time 2012-05-19 17:24:09.095719
osd/OSD.h: 840: FAILED assert(last_scrub_pg.count(p))

ceph version 0.46-313-g4277d4d (commit:4277d4d3378dde4264e2b8d211371569219c6e4b)
1: (OSD::unreg_last_pg_scrub(pg_t, utime_t)+0x149) [0x641f49]
2: (PG::proc_primary_info(ObjectStore::Transaction&, pg_info_t const&)+0x5e) [0x63383e]
3: (PG::RecoveryState::ReplicaActive::react(PG::RecoveryState::MInfoRec const&)+0x4a) [0x633eda]
4: (boost::statechart::detail::reaction_result boost::statechart::simple_state<PG::RecoveryState::ReplicaActive, PG::RecoveryState::Started, boost::mpl::list<mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na>, (boost::statechart::history_mode)0>::local_react_impl_non_empty::local_react_impl<boost::mpl::list3<boost::statechart::custom_reaction<PG::RecoveryState::MQuery>, boost::statechart::custom_reaction<PG::RecoveryState::MInfoRec>, boost::statechart::custom_reaction<PG::RecoveryState::MLogRec> >, boost::statechart::simple_state<PG::RecoveryState::ReplicaActive, PG::RecoveryState::Started, boost::mpl::list<mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na>, (boost::statechart::history_mode)0> >(boost::statechart::simple_state<PG::RecoveryState::ReplicaActive, PG::RecoveryState::Started, boost::mpl::list<mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na>, (boost::statechart::history_mode)0>&, boost::statechart::event_base const&, void const*)+0x130) [0x6466a0]
5: (boost::statechart::simple_state<PG::RecoveryState::ReplicaActive, PG::RecoveryState::Started, boost::mpl::list<mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na, mpl_::na>, (boost::statechart::history_mode)0>::react_impl(boost::statechart::event_base const&, void const*)+0x81) [0x646791]
6: (boost::statechart::state_machine<PG::RecoveryState::RecoveryMachine, PG::RecoveryState::Initial, std::allocator<void>, boost::statechart::null_exception_translator>::send_event(boost::statechart::event_base const&)+0x5b) [0x63dfcb]
7: (boost::statechart::state_machine<PG::RecoveryState::RecoveryMachine, PG::RecoveryState::Initial, std::allocator<void>, boost::statechart::null_exception_translator>::process_event(boost::statechart::event_base const&)+0x11) [0x63e0f1]
8: (PG::RecoveryState::handle_info(int, pg_info_t&, PG::RecoveryCtx*)+0x177) [0x616987]
9: (OSD::handle_pg_info(std::tr1::shared_ptr<OpRequest>)+0x665) [0x5d3d15]
10: (OSD::dispatch_op(std::tr1::shared_ptr<OpRequest>)+0x2a0) [0x5d7370]
11: (OSD::_dispatch(Message*)+0x191) [0x5dd4a1]
12: (OSD::ms_dispatch(Message*)+0x153) [0x5ddda3]
13: (SimpleMessenger::dispatch_entry()+0x863) [0x77fbc3]
14: (SimpleMessenger::DispatchThread::entry()+0xd) [0x746c5d]
15: (()+0x7efc) [0x7fe679b1fefc]
16: (clone()+0x6d) [0x7fe67815089d]
NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.

Because we don't clear the scrub state before resetting info,
the last_scrub_stamp state in the info.history structure
changes without updating the osd state, resulting in the
above assert failure.

Backport: stable

Signed-off-by: Samuel Just <sam.just@inktank.com>

commit | commitdiff | tree

Sage Weil [Sat, 14 Jul 2012 21:31:34 +0000 (14:31 -0700)]

osd: base misdirected op role calc on acting set

We want to look at the acting set here, nothing else. This was causing us
to erroneously queue ops for later (wasting memory) and to erroneously
print out a 'misdirected op' message in the cluster log (confusion and
incorrect [but ignored] -ENXIO reply).

Fixes: #2022
Signed-off-by: Sage Weil <sage@inktank.com>

commit | commitdiff | tree

Sage Weil [Mon, 16 Jul 2012 03:30:34 +0000 (20:30 -0700)]

mon/MonitorStore: always O_TRUNC when writing states

It is possible for a .new file to already exist, potentially with a
larger size. This would happen if:

- we were proposing a different value
- we crashed (or were stopped) before it got renamed into place
- after restarting, a different value was proposed and accepted.

This isn't so unlikely for the log state machine, where we're
aggregating random messages. O_TRUNC ensures we avoid getting the tail
end of some previous junk.

I observed #2593 and found that a logm state value had a larger size on
one mon (after slurping) than the others, pointing to put_bl_sn_map().

While we are at it, O_TRUNC put_int() too; the same type of bug is
possible there, too.

Fixes: #2593
Signed-off-by: Sage Weil <sage@inktank.com>
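
The failure mode described in this message can be reproduced with plain POSIX file flags. The sketch below is Python, not the MonitorStore code; the file name `state.new` and the helper names are made up for illustration.

```python
import os
import tempfile


def write_state(path, data, truncate):
    """Write data to path, optionally truncating an existing file first."""
    flags = os.O_WRONLY | os.O_CREAT
    if truncate:
        flags |= os.O_TRUNC
    fd = os.open(path, flags, 0o644)
    try:
        os.write(fd, data)
    finally:
        os.close(fd)


def demo():
    """Write a short value over a longer stale file, with and without O_TRUNC."""
    results = {}
    with tempfile.TemporaryDirectory() as d:
        path = os.path.join(d, "state.new")
        for truncate in (False, True):
            # Simulate a leftover .new file from an earlier, larger proposal.
            write_state(path, b"OLD-LONGER-VALUE", truncate=True)
            write_state(path, b"new", truncate=truncate)
            with open(path, "rb") as f:
                results[truncate] = f.read()
    return results
```

Without O_TRUNC the new value is followed by the tail of the stale one; with it, only the new value remains.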

commit | commitdiff | tree

Josh Durgin [Fri, 13 Jul 2012 16:42:20 +0000 (09:42 -0700)]

qa: download tests from specified branch

These python tests aren't installed, so they need to be downloaded

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>

commit | commitdiff | tree

Yehuda Sadeh [Mon, 25 Jun 2012 16:47:37 +0000 (09:47 -0700)]

rgw: don't override subuser perm mask if perm not specified

Bug #2650. We were overriding subuser perm mask whenever subuser
was modified, even if perm mask was not passed.

Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
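
The intended behaviour can be sketched with an optional argument. This is a Python illustration only; `modify_subuser` and the `perm_mask` dictionary key are hypothetical names, not the radosgw-admin code.

```python
def modify_subuser(subuser, perm_mask=None):
    """Return an updated copy; keep the old mask when none is passed."""
    updated = dict(subuser)
    if perm_mask is not None:
        updated["perm_mask"] = perm_mask
    return updated
```

Testing for `None` rather than truthiness matters here, because a mask of 0 is a legitimate value that the caller may pass explicitly.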

commit | commitdiff | tree

James Page [Wed, 11 Jul 2012 18:34:21 +0000 (11:34 -0700)]

debian: fix ceph-fs-common-dbg depends

Signed-off-by: James Page <james.page@ubuntu.com>

commit | commitdiff | tree

Sage Weil [Thu, 12 Jul 2012 01:54:30 +0000 (18:54 -0700)]

rados: more usage cleanup

Signed-off-by: Sage Weil <sage@inktank.com>

commit | commitdiff | tree

Dan Mick [Wed, 11 Jul 2012 22:26:30 +0000 (15:26 -0700)]

rados: usage message

Bad linebreaks, wrapping, stringification, missing doc for bench args

Signed-off-by: Dan Mick <dan.mick@inktank.com>
Reviewed-by: Samuel Just <sam.just@inktank.com>

commit | commitdiff | tree

Yehuda Sadeh [Wed, 11 Jul 2012 18:52:24 +0000 (11:52 -0700)]

rados tool: remove -t param option for target pool

Bug #2772. This fixes an issue that was introduced when we
added the 'rados cp' command. The -t param was already used
for rados bench. With this change the only way to specify
a target pool is using --target-pool.
Though this problem is post argonaut, the 'rados cp' command
has been backported, so we need this fix there too.

Backport: argonaut

Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>

commit | commitdiff | tree

Sage Weil [Wed, 11 Jul 2012 16:19:00 +0000 (09:19 -0700)]

Makefile: don't install crush headers

This is leftover from when we built a libcrush.so. We can re-add when we
start doing that again.

Reported-by: Laszlo Boszormenyi <gcs@debian.hu>
Signed-off-by: Sage Weil <sage@inktank.com>

commit | commitdiff | tree

Sage Weil [Wed, 11 Jul 2012 01:21:29 +0000 (18:21 -0700)]

Merge branch 'stable' into next

commit | commitdiff | tree

Sage Weil [Mon, 9 Jul 2012 20:22:42 +0000 (13:22 -0700)]

osd: guard class call decoding

Backport: argonaut
Signed-off-by: Sage Weil <sage@inktank.com>

commit | commitdiff | tree

Sage Weil [Tue, 10 Jul 2012 03:54:19 +0000 (20:54 -0700)]

test_stress_watch: just one librados instance

This was creating a new cluster connection/session per iteration, and
along with it a few service threads and sockets and so forth.

Unfortunately, librados leaks like a sieve, starting with CephContext
and ceph::crypto::init(). See #845 and #2067.

Signed-off-by: Sage Weil <sage@inktank.com>

commit | commitdiff | tree

Samuel Just [Tue, 10 Jul 2012 00:57:03 +0000 (17:57 -0700)]

ReplicatedPG: don't warn if backfill peer stats don't match

pinfo.stats might be wrong if we did log-based recovery on the
backfilled portion in addition to continuing backfill.

bug #2750

Signed-off-by: Samuel Just <sam.just@inktank.com>

commit | commitdiff | tree

Sage Weil [Fri, 6 Jul 2012 01:08:58 +0000 (18:08 -0700)]

librados: take lock when signaling notify cond

When we are signaling the cond to indicate that a notify is complete,
take the appropriate lock. This removes the possibility of a race
that loses our signal. (That would be very difficult given that there
are network round trips involved, but this makes the lock/cond usage
"correct.")

Signed-off-by: Sage Weil <sage@inktank.com>
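
The lock/cond pattern this message refers to looks roughly like the following. It is a generic Python sketch, not the librados code, and the `Notify` class is hypothetical. The flag is set and the condition signalled while the lock is held, and the waiter checks the flag under the same lock.

```python
import threading


class Notify:
    """Hypothetical completion flag guarded by a lock and a condition."""

    def __init__(self):
        self.lock = threading.Lock()
        self.cond = threading.Condition(self.lock)
        self.done = False

    def complete(self):
        # Set the flag and signal while holding the lock, so a waiter
        # cannot check the flag and then miss the signal.
        with self.lock:
            self.done = True
            self.cond.notify_all()

    def wait(self, timeout=5.0):
        # Re-check the predicate under the lock; returns True once done.
        with self.lock:
            return self.cond.wait_for(lambda: self.done, timeout=timeout)
```

Python's `threading.Condition` refuses to notify without the lock held, so the race described above cannot be written by accident there; in C++ the same discipline has to be followed by hand.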

commit | commitdiff | tree

Sage Weil [Wed, 4 Jul 2012 22:11:21 +0000 (15:11 -0700)]

client: fix locking for SafeCond users

Need to wait on flock, not client_lock.

Signed-off-by: Sage Weil <sage@inktank.com>

commit | commitdiff | tree

Sage Weil [Mon, 9 Jul 2012 03:33:12 +0000 (20:33 -0700)]

debian: include librados-config in librados-dev

Reported-by: Laszlo Boszormenyi <gcs@debian.hu>
Signed-off-by: Sage Weil <sage@inktank.com>

commit | commitdiff | tree

Sage Weil [Tue, 3 Jul 2012 20:04:28 +0000 (13:04 -0700)]

lockdep: increase max locks

Hit this limit with the rados api tests.

Signed-off-by: Sage Weil <sage@inktank.com>

commit | commitdiff | tree

Sage Weil [Tue, 3 Jul 2012 19:07:28 +0000 (12:07 -0700)]

config: add unlocked version of get_my_sections; use it internally

Signed-off-by: Sage Weil <sage@inktank.com>

commit | commitdiff | tree

Sage Weil [Tue, 3 Jul 2012 15:20:06 +0000 (08:20 -0700)]

config: fix lock recursion in get_val_from_conf_file()

Introduce a private, already-locked version.

Signed-off-by: Sage Weil <sage@inktank.com>
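
The "private, already-locked version" pattern can be sketched as below. This is a Python illustration with hypothetical names (`Config`, `get_val`, `_get_val`, `expand`), not the md_config_t code. The lock is non-recursive, so a locked method must call the private helper instead of the public one.

```python
import threading


class Config:
    """Hypothetical config holder protected by a non-recursive lock."""

    def __init__(self, values):
        self._lock = threading.Lock()
        self._values = dict(values)

    def get_val(self, key):
        """Public entry point: takes the lock, then delegates."""
        with self._lock:
            return self._get_val(key)

    def _get_val(self, key):
        """Private helper: the caller must already hold self._lock."""
        return self._values.get(key)

    def expand(self, key):
        """Another locked method; it calls the helper, not get_val()."""
        with self._lock:
            val = self._get_val(key)
            return None if val is None else val.upper()
```

If `expand` called `get_val` instead, it would try to take a lock it already holds and block forever, which is the kind of recursion the commit removes.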

commit | commitdiff | tree

Sage Weil [Tue, 3 Jul 2012 15:15:08 +0000 (08:15 -0700)]

config: fix recursive lock in parse_config_files()

The _impl() helper is only called from parse_config_files(); don't retake
the lock.

Signed-off-by: Sage Weil <sage@inktank.com>

commit | commitdiff | tree

Yehuda Sadeh [Fri, 6 Jul 2012 20:14:53 +0000 (13:14 -0700)]

rgw: handle response-* params

Handle response-* params that set response header field values.
Fixes #2734, #2735.
Backport: argonaut

Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>

commit | commitdiff | tree

Sage Weil [Wed, 4 Jul 2012 01:51:02 +0000 (18:51 -0700)]

rgw: initialize fields of RGWObjEnt

This fixes various valgrind warnings triggered by the s3test
test_object_create_unreadable.

Signed-off-by: Sage Weil <sage@inktank.com>

commit | commitdiff | tree

Yehuda Sadeh [Fri, 6 Jul 2012 20:14:53 +0000 (13:14 -0700)]

rgw: handle response-* params

Handle response-* params that set response header field values.
Fixes #2734, #2735.
Backport: argonaut

Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>

commit | commitdiff | tree

Sage Weil [Wed, 4 Jul 2012 20:59:04 +0000 (13:59 -0700)]

osd: add missing formatter close_section() to scrub status

Also add braces to make the open/close matchups easier to see. Broken
by f36617392710f9b3538bfd59d45fd72265993d57.

Signed-off-by: Sage Weil <sage@inktank.com>

commit | commitdiff | tree

Mike Ryan [Wed, 27 Jun 2012 21:14:30 +0000 (14:14 -0700)]

pg: report scrub status

Signed-off-by: Mike Ryan <mike.ryan@inktank.com>

commit | commitdiff | tree

Mike Ryan [Wed, 27 Jun 2012 20:30:45 +0000 (13:30 -0700)]

pg: track who we are waiting for maps from

Signed-off-by: Mike Ryan <mike.ryan@inktank.com>

commit | commitdiff | tree

Mike Ryan [Tue, 26 Jun 2012 23:25:27 +0000 (16:25 -0700)]

pg: reduce scrub write lock window

Wait for all replicas to construct the base scrub map before finalizing
the scrub and locking out writes.

Signed-off-by: Mike Ryan <mike.ryan@inktank.com>

commit | commitdiff | tree

Yehuda Sadeh [Thu, 5 Jul 2012 22:52:51 +0000 (15:52 -0700)]

rgw: don't store bucket info indexed by bucket_id

Issue #2701. This info wasn't really used anywhere and we weren't
removing it. It was also sharing the same pool namespace as the
info indexed by bucket name, which is bad.

Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>

commit | commitdiff | tree

Yehuda Sadeh [Thu, 5 Jul 2012 22:52:51 +0000 (15:52 -0700)]

commit | commitdiff | tree

Yehuda Sadeh [Fri, 6 Jul 2012 17:16:07 +0000 (10:16 -0700)]

Merge branch 'stable' into next

commit | commitdiff | tree

Yehuda Sadeh [Thu, 5 Jul 2012 21:59:22 +0000 (14:59 -0700)]

test_rados_tool.sh: test copy pool

Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>

commit | commitdiff | tree

Yehuda Sadeh [Thu, 5 Jul 2012 20:42:23 +0000 (13:42 -0700)]

rados tool: copy object in chunks

Instead of reading the entire object and then writing it,
we read it in chunks.

Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
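
A chunked copy loop of the kind described here might look like the following. It is a Python sketch over file-like objects, not the rados tool's implementation, and the 4 MiB default chunk size is an assumption chosen for the example.

```python
import io


def copy_in_chunks(src, dst, chunk_size=4 * 1024 * 1024):
    """Copy src to dst one chunk at a time; return the bytes copied."""
    total = 0
    while True:
        chunk = src.read(chunk_size)
        if not chunk:  # empty read means end of input
            break
        dst.write(chunk)
        total += len(chunk)
    return total


def demo():
    """Copy ten bytes through a four-byte buffer."""
    src, dst = io.BytesIO(b"x" * 10), io.BytesIO()
    return copy_in_chunks(src, dst, chunk_size=4), dst.getvalue()
```

Memory use is bounded by the chunk size rather than by the size of the object being copied.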

commit | commitdiff | tree

Yehuda Sadeh [Fri, 29 Jun 2012 21:43:00 +0000 (14:43 -0700)]

rados tool: copy entire pool

A new rados tool command that copies an entire pool
into another existing pool.

Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>

commit | commitdiff | tree

Yehuda Sadeh [Fri, 29 Jun 2012 21:09:08 +0000 (14:09 -0700)]

rados tool: copy object

New rados command: rados cp <src-obj> [dest-obj]

Requires specifying source pool. Target pool and locator can be specified.
The new command preserves object xattrs and omap data.

Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>

commit | commitdiff | tree

Yehuda Sadeh [Fri, 6 Jul 2012 17:12:23 +0000 (10:12 -0700)]

Merge remote-tracking branch 'origin/stable' into next

commit | commitdiff | tree

Sage Weil [Fri, 6 Jul 2012 15:47:44 +0000 (08:47 -0700)]

ceph.spec.in: add ceph-disk-{activate,prepare}

Reported-by: Jimmy Tang <jtang@tchpc.tcd.ie>
Signed-off-by: Sage Weil <sage@inktank.com>

commit | commitdiff | tree

Wido den Hollander [Thu, 5 Jul 2012 13:29:54 +0000 (15:29 +0200)]

Allow URL-safe base64 cephx keys to be decoded.

In these cases + and / are replaced by - and _ to prevent problems when using
the base64 strings in URLs.

Signed-off-by: Wido den Hollander <wido@widodh.nl>
Signed-off-by: Sage Weil <sage@inktank.com>
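
The character substitution described in this message can be sketched in a few lines. This is a Python illustration, not the cephx decoder; `decode_key` is a hypothetical name. It maps the URL-safe characters back to the standard alphabet and then decodes as usual.

```python
import base64


def decode_key(text):
    """Decode base64 that uses either the standard or URL-safe alphabet."""
    standard = text.replace("-", "+").replace("_", "/")
    return base64.b64decode(standard)
```

Since `-` and `_` never occur in standard base64, the substitution is harmless for keys that were not URL-safe to begin with.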

commit | commitdiff | tree

Sage Weil [Wed, 4 Jul 2012 20:59:04 +0000 (13:59 -0700)]

commit | commitdiff | tree

Sage Weil [Wed, 4 Jul 2012 16:30:21 +0000 (09:30 -0700)]

Merge branch 'stable'

Conflicts:
src/test/cli/radosgw-admin/help.t

commit | commitdiff | tree

Wido den Hollander [Wed, 4 Jul 2012 13:46:04 +0000 (15:46 +0200)]

librados: Bump the version to 0.48

Signed-off-by: Wido den Hollander <wido@widodh.nl>
Signed-off-by: Sage Weil <sage@inktank.com>

commit | commitdiff | tree

Samuel Just [Tue, 3 Jul 2012 19:00:32 +0000 (12:00 -0700)]

librados: add assert_version as an operation on an ObjectOperation

Signed-off-by: Samuel Just <sam.just@inktank.com>

commit | commitdiff | tree

Samuel Just [Tue, 3 Jul 2012 22:35:29 +0000 (15:35 -0700)]

ReplicatedPG: do not set reply version to last_update

The version should be oi.user_version as set above.

Signed-off-by: Samuel Just <sam.just@inktank.com>

commit | commitdiff | tree

Sage Weil [Wed, 4 Jul 2012 01:51:02 +0000 (18:51 -0700)]

rgw: initialize fields of RGWObjEnt

This fixes various valgrind warnings triggered by the s3test
test_object_create_unreadable.

Signed-off-by: Sage Weil <sage@inktank.com>

commit | commitdiff | tree

Sage Weil [Tue, 3 Jul 2012 23:49:29 +0000 (16:49 -0700)]

Merge remote-tracking branch 'gh/wip-crush'

commit | commitdiff | tree

Yehuda Sadeh [Wed, 27 Jun 2012 00:28:51 +0000 (17:28 -0700)]

rgw-admin: use correct modifier with strptime

Bug #2658: used %I (12h) instead of %H (24h)

Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
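
The difference between the two modifiers is easy to demonstrate. The sketch below uses Python's `time.strptime`, which follows the same format conventions as the C function; the helper names are hypothetical.

```python
import time


def parse_24h(text):
    """Parse with %H, which accepts hours 00-23."""
    return time.strptime(text, "%Y-%m-%d %H:%M:%S")


def parse_12h(text):
    """Parse with %I, which only accepts hours 01-12."""
    return time.strptime(text, "%Y-%m-%d %I:%M:%S")
```

In Python, an afternoon time such as 17:28:51 parses with `%H` but raises `ValueError` with `%I`. How the C function reports the mismatch is not stated in the commit.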

commit | commitdiff | tree

Yehuda Sadeh [Thu, 21 Jun 2012 22:40:27 +0000 (15:40 -0700)]

rgw: send both swift x-storage-token and x-auth-token

Older clients need x-storage-token; newer clients need x-auth-token.

Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>

commit | commitdiff | tree

Yehuda Sadeh [Thu, 21 Jun 2012 22:17:19 +0000 (15:17 -0700)]

rgw: radosgw-admin date params now also accept time

The date format now is "YYYY-MM-DD[ hh:mm:ss]". Got rid of
the --time param for the old ops log stuff.

Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Conflicts:

src/test/cli/radosgw-admin/help.t
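
Parsing a date with an optional time part, in the "YYYY-MM-DD[ hh:mm:ss]" format this commit mentions, can be sketched by trying the longer format first. This is a Python illustration, and `parse_date_param` is a hypothetical name rather than anything in radosgw-admin.

```python
from datetime import datetime


def parse_date_param(text):
    """Parse 'YYYY-MM-DD' with an optional ' hh:mm:ss' suffix."""
    for fmt in ("%Y-%m-%d %H:%M:%S", "%Y-%m-%d"):
        try:
            return datetime.strptime(text, fmt)
        except ValueError:
            continue
    raise ValueError("expected YYYY-MM-DD[ hh:mm:ss], got %r" % text)
```

A date without a time is taken as midnight in this sketch; the commit does not say what the real tool does in that case.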

commit | commitdiff | tree

Yehuda Sadeh [Thu, 21 Jun 2012 20:14:47 +0000 (13:14 -0700)]

rgw-admin: fix usage help

s/show/trim

Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>

commit | commitdiff | tree

Tommi Virtanen [Tue, 3 Jul 2012 22:24:26 +0000 (15:24 -0700)]

ceph-disk-prepare: Partition and format OSD data disks automatically.

Uses gdisk, as it seems to be the only tool that can automate GPT uuid
changes. Needs to run as root.

Adds Recommends: gdisk to ceph.deb.

Closes: #2547
Signed-off-by: Tommi Virtanen <tv@inktank.com>

commit | commitdiff | tree

John Wilkins [Tue, 3 Jul 2012 21:20:34 +0000 (14:20 -0700)]

doc: removed /srv/osd.$id.journal from ceph.conf example.

Signed-off-by: John Wilkins <john.wilkins@inktank.com>

commit | commitdiff | tree

caleb miles [Tue, 3 Jul 2012 20:05:48 +0000 (13:05 -0700)]

CrushTester.cc: remove BOOST dependencies.

Remove calls to Boost libraries for computing chi-squared statistics and
producing discrete random variables with a given probability distribution.

Signed-off-by: caleb miles <caleb.miles@inktank.com>

commit | commitdiff | tree

John Wilkins [Tue, 3 Jul 2012 21:14:42 +0000 (14:14 -0700)]

doc: Updates to 5-minute quick start.

Signed-off-by: John Wilkins <john.wilkins@inktank.com>

commit | commitdiff | tree

Sage Weil [Tue, 3 Jul 2012 21:07:16 +0000 (14:07 -0700)]

radosgw-admin: fix cli test

Signed-off-by: Sage Weil <sage@inktank.com>

commit | commitdiff | tree

Sage Weil [Tue, 3 Jul 2012 20:04:36 +0000 (13:04 -0700)]

Merge branch 'wip-config'

commit | commitdiff | tree

Sage Weil [Tue, 3 Jul 2012 20:04:28 +0000 (13:04 -0700)]

lockdep: increase max locks

Hit this limit with the rados api tests.

Signed-off-by: Sage Weil <sage@inktank.com>

commit | commitdiff | tree

Sage Weil [Tue, 3 Jul 2012 19:07:28 +0000 (12:07 -0700)]

config: add unlocked version of get_my_sections; use it internally

Signed-off-by: Sage Weil <sage@inktank.com>

commit | commitdiff | tree

Sage Weil [Tue, 3 Jul 2012 18:32:57 +0000 (11:32 -0700)]

ceph: fix cli help test

Signed-off-by: Sage Weil <sage@inktank.com>

commit | commitdiff | tree

John Wilkins [Tue, 3 Jul 2012 18:48:31 +0000 (11:48 -0700)]

Merge branch 'master' of github.com:ceph/ceph

commit | commitdiff | tree

John Wilkins [Tue, 3 Jul 2012 18:48:15 +0000 (11:48 -0700)]

doc: Clean up of 5-minute quick start.

Signed-off-by: John Wilkins <john.wilkins@inktank.com>

commit | commitdiff | tree

Samuel Just [Tue, 3 Jul 2012 18:23:16 +0000 (11:23 -0700)]

ReplicatedPG: remove faulty scrub assert in sub_op_modify_applied

This assert assumed that all ops submitted before MOSDRepScrub was
submitted were processed by the time that MOSDRepScrub was
processed. In fact, MOSDRepScrub's scrub_to may refer to a
last_update yet to be seen by the replica.

Bug #2693

Signed-off-by: Samuel Just <sam.just@inktank.com>

commit | commitdiff | tree

Samuel Just [Tue, 3 Jul 2012 18:23:16 +0000 (11:23 -0700)]

commit | commitdiff | tree

John Wilkins [Tue, 3 Jul 2012 18:21:43 +0000 (11:21 -0700)]

doc: Updating Getting Started with 5-minute quick start.

Signed-off-by: John Wilkins <john.wilkins@inktank.com>

commit | commitdiff | tree

Kyle Bader [Tue, 3 Jul 2012 18:20:38 +0000 (11:20 -0700)]

ceph: better usage

Signed-off-by: Kyle Bader <kyle.bader@dreamhost.com>

commit | commitdiff | tree

John Wilkins [Tue, 3 Jul 2012 18:18:11 +0000 (11:18 -0700)]

Merge branch 'master' of github.com:ceph/ceph

commit | commitdiff | tree

John Wilkins [Tue, 3 Jul 2012 18:17:50 +0000 (11:17 -0700)]

doc: restructuring quick start section.

Signed-off-by: John Wilkins <john.wilkins@inktank.com>

commit | commitdiff | tree

Samuel Just [Tue, 3 Jul 2012 18:10:54 +0000 (11:10 -0700)]

IoCtxImpl: pass objver pointer to aio_operate_read

Signed-off-by: Samuel Just <sam.just@inktank.com>

commit | commitdiff | tree

Tommi Virtanen [Tue, 3 Jul 2012 16:22:28 +0000 (09:22 -0700)]

ceph-disk-prepare: Take fsid from config file.

Closes: #2546.
Signed-off-by: Tommi Virtanen <tv@inktank.com>

commit | commitdiff | tree

Sage Weil [Tue, 3 Jul 2012 13:46:10 +0000 (06:46 -0700)]

config: remove bad argparse_flag argument in parse_option()

This is wrong, and thankfully valgrind picks it up.

Signed-off-by: Sage Weil <sage@inktank.com>

commit | commitdiff | tree

Sage Weil [Tue, 3 Jul 2012 16:20:35 +0000 (09:20 -0700)]

debian: strip new ceph-mds package

Reported-by: Amon Ott <a.ott@m-privacy.de>
Signed-off-by: Sage Weil <sage@inktank.com>

commit | commitdiff | tree

John Wilkins [Tue, 3 Jul 2012 15:46:14 +0000 (08:46 -0700)]

doc: Cleaned up rbd snapshots.

Signed-off-by: John Wilkins <john.wilkins@inktank.com>

commit | commitdiff | tree

Sage Weil [Tue, 3 Jul 2012 15:20:06 +0000 (08:20 -0700)]

config: fix lock recursion in get_val_from_conf_file()

Introduce a private, already-locked version.

Signed-off-by: Sage Weil <sage@inktank.com>

commit | commitdiff | tree

Sage Weil [Tue, 3 Jul 2012 15:15:08 +0000 (08:15 -0700)]

config: fix recursive lock in parse_config_files()

The _impl() helper is only called from parse_config_files(); don't retake
the lock.

Signed-off-by: Sage Weil <sage@inktank.com>

commit | commitdiff | tree

Sage Weil [Tue, 3 Jul 2012 13:46:10 +0000 (06:46 -0700)]

config: remove bad argparse_flag argument in parse_option()

This is wrong, and thankfully valgrind picks it up.

Signed-off-by: Sage Weil <sage@inktank.com>

commit | commitdiff | tree

Sage Weil [Tue, 3 Jul 2012 04:08:27 +0000 (21:08 -0700)]

client: improve dump_cache output

Hunting #1737.

Signed-off-by: Sage Weil <sage@inktank.com>

commit | commitdiff | tree

Sage Weil [Tue, 3 Jul 2012 03:13:51 +0000 (20:13 -0700)]

doc: release notes for 0.48

Signed-off-by: Sage Weil <sage@inktank.com>

commit | commitdiff | tree

Sage Weil [Tue, 3 Jul 2012 01:03:02 +0000 (18:03 -0700)]

doc: 'Configuring a Storage Cluster' -> 'Configuration'

Signed-off-by: Sage Weil <sage@inktank.com>

commit | commitdiff | tree

Sage Weil [Tue, 3 Jul 2012 04:24:56 +0000 (21:24 -0700)]

Merge tag 'v0.48argonaut'

v0.48argonaut

commit | commitdiff | tree

Sage Weil [Tue, 3 Jul 2012 00:54:35 +0000 (17:54 -0700)]

Merge branch 'wip-msgr'

commit | commitdiff | tree

Sage Weil [Thu, 28 Jun 2012 23:23:30 +0000 (16:23 -0700)]

lockdep: enable in common_init

Signed-off-by: Sage Weil <sage@inktank.com>

commit | commitdiff | tree

Sage Weil [Mon, 2 Jul 2012 00:23:28 +0000 (17:23 -0700)]

msgr: restart_queue when replacing existing pipe and taking over the queue

The queue may have been previously stopped (by discard_queue()), and needs
to be restarted.

Fixes consistent failures from the mon_recovery.py integration tests.

Signed-off-by: Sage Weil <sage@inktank.com>

commit | commitdiff | tree

Sage Weil [Sun, 1 Jul 2012 22:37:31 +0000 (15:37 -0700)]

msgr: choose incoming connection if ours is STANDBY

If the connect_seq matches, but our existing connection is in STANDBY, take
the incoming one.  Otherwise, the other end will wait indefinitely for us
to connect but we won't.

Alternatively, we could "win" the race and trigger a connection by sending
a keepalive (or similar), but that is more work; we may as well accept the
incoming connection we have now.

This removes STANDBY from the acceptable WAIT case states.  It also keeps
responsibility squarely on the shoulders of the peer with something to
deliver.

Without this patch, a 3-osd vstart cluster with
'ms inject socket failures = 100' and rados bench write -b 4096 would start
generating slow request warnings after a few minutes due to the osds
failing to connect to each other.  With the patch, I complete a 10 minute
run without problems.

Signed-off-by: Sage Weil <sage@inktank.com>

commit | commitdiff | tree

Sage Weil [Fri, 29 Jun 2012 00:50:47 +0000 (17:50 -0700)]

msgr: preserve incoming message queue when replacing pipes

If we replace an existing pipe with a new one, move the incoming queue
of messages that have not yet been dispatched over to the new Pipe so that
they are not lost.

Alternatively, we could set in_seq = existing->in_seq - existing->in_qlen,
but that would make the other end resend those messages, which is a waste
of bandwidth.

Very easy to reproduce the original bug with 'ms inject socket failures'.

Signed-off-by: Sage Weil <sage@inktank.com>
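
The idea of carrying undispatched messages over to the replacement can be sketched with two queues. This is a Python illustration with hypothetical names (`Pipe`, `in_q`, `replace_pipe`); it does not model sequence numbers or locking in the real messenger.

```python
from collections import deque


class Pipe:
    """Hypothetical stand-in for a connection with an incoming queue."""

    def __init__(self):
        self.in_q = deque()


def replace_pipe(existing, new):
    """Move undispatched messages from existing to new, oldest first."""
    while existing.in_q:
        new.in_q.append(existing.in_q.popleft())
    return new


def demo():
    """Replace a pipe holding two queued messages."""
    old, new = Pipe(), Pipe()
    old.in_q.extend(["m1", "m2"])
    replace_pipe(old, new)
    return list(old.in_q), list(new.in_q)
```

The messages keep their original order and nothing has to be resent by the peer, which is the bandwidth argument made in the message.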

commit | commitdiff | tree

Sage Weil [Fri, 29 Jun 2012 00:45:24 +0000 (17:45 -0700)]

msgr: move dispatch_entry into DispatchQueue class

A bit cleaner.

Signed-off-by: Sage Weil <sage@inktank.com>

commit | commitdiff | tree

Sage Weil [Fri, 29 Jun 2012 00:38:34 +0000 (17:38 -0700)]

msgr: move incoming queue to separate class

This extricates the incoming queue and its funky relationship with
DispatchQueue from Pipe and moves it into IncomingQueue. There is now a
single IncomingQueue attached to each Pipe. DispatchQueue is now no
longer tied to Pipe.

This modularizes the code a bit better (though that is still a work in
progress) and (more importantly) will make it possible to move the
incoming messages from one pipe to another in accept().

Signed-off-by: Sage Weil <sage@inktank.com>

commit | commitdiff | tree

Sage Weil [Thu, 28 Jun 2012 00:06:40 +0000 (17:06 -0700)]

msgr: make D_CONNECT constant non-zero, fix ms_handle_connect() callback

A while ago we inadvertently broke ms_handle_connect() callbacks because
of a check for m being non-zero in the dispatch_entry() thread. Adjust the
enums so that they get delivered again.

This fixes hangs when, for example, the ceph tool sends a command, gets a
connection reset, and doesn't get the connect callback to resend after
reconnecting to a new monitor.

Signed-off-by: Sage Weil <sage@inktank.com>
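
The class of bug described here, a zero-valued code being swallowed by a non-zero check, can be shown in isolation. This is a Python sketch; the constants and the `dispatch` function are hypothetical and only mimic the shape of the problem.

```python
from collections import deque

D_CONNECT_OLD = 0  # hypothetical old value: tests as false, so it is skipped
D_CONNECT_NEW = 1  # hypothetical new value: tests as true, so it is delivered


def dispatch(queue):
    """Deliver queued items, skipping any that test as zero or empty."""
    delivered = []
    while queue:
        m = queue.popleft()
        if m:  # the kind of check that hides a zero-valued code
            delivered.append(m)
    return delivered
```

Giving every event code a non-zero value keeps the check intact while letting all codes through.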

commit | commitdiff | tree

Sage Weil [Wed, 27 Jun 2012 00:10:40 +0000 (17:10 -0700)]

msgr: fix pipe replacement assert

We may replace an existing pipe in the STANDBY state if the previous
attempt failed during accept() (see previous patches).

This might fix #1378.

Signed-off-by: Sage Weil <sage@inktank.com>

commit | commitdiff | tree

Sage Weil [Wed, 27 Jun 2012 00:07:31 +0000 (17:07 -0700)]

msgr: do not try to reconnect con with CLOSED pipe

If we have a con with a closed pipe, drop the message. For lossless
sessions, the state will be STANDBY if we should reconnect. For lossy
sessions, we will end up with CLOSED and we *should* drop the message.

Signed-off-by: Sage Weil <sage@inktank.com>

commit | commitdiff | tree

Sage Weil [Wed, 27 Jun 2012 00:06:41 +0000 (17:06 -0700)]

msgr: move to STANDBY if we replace during accept and then fail

If we replace an existing pipe during accept() and then fail, move to
STANDBY so that our connection state (connect_seq, etc.) is preserved.
Otherwise, we will throw out that information and falsely trigger a
RESETSESSION on the next connection attempt.

Signed-off-by: Sage Weil <sage@inktank.com>

commit | commitdiff | tree

Sage Weil [Sat, 30 Jun 2012 21:50:20 +0000 (14:50 -0700)]

v0.48argonaut
