git.apps.os.sepia.ceph.com Git - ceph.git/log
ceph.git
12 years agodoc: Minor updates for usage.
John Wilkins [Fri, 14 Jun 2013 23:06:06 +0000 (16:06 -0700)]
doc: Minor updates for usage.

Signed-off-by: John Wilkins <john.wilkins@inktank.com>
12 years agolibrados: add tests for too-large objects
Sage Weil [Fri, 14 Jun 2013 17:17:31 +0000 (10:17 -0700)]
librados: add tests for too-large objects

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoosd: fix types for size checks
Sage Weil [Fri, 14 Jun 2013 17:14:54 +0000 (10:14 -0700)]
osd: fix types for size checks

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoremove RELEASE_CHECKLIST
Sage Weil [Fri, 14 Jun 2013 16:42:08 +0000 (09:42 -0700)]
remove RELEASE_CHECKLIST

This ancient document has long since been replaced by
doc/dev/release-process.rst.

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoosd: EINVAL from truncate causes osd to crash
David Zafman [Fri, 14 Jun 2013 01:15:39 +0000 (18:15 -0700)]
osd: EINVAL from truncate causes osd to crash

The maximum object size is 100GB, configurable with osd_max_object_size.
Return EFBIG on an attempt to WRITE/WRITEFULL/TRUNCATE beyond osd_max_object_size.
Return EINVAL if length < 1 for WRITE/WRITEFULL/ZERO.
Make ZERO beyond the existing size a no-op.
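
The size rules in this message can be read as a small validation table. The sketch below is an illustration only, not the OSD code: the function name, the tuple return value, and the reading of "beyond existing size" as "offset at or past the current size" are all assumptions.

```python
import errno

# Default from the commit message: 100 GB (configurable via osd_max_object_size).
OSD_MAX_OBJECT_SIZE = 100 * 1024**3


def check_op(op, offset, length, object_size, max_size=OSD_MAX_OBJECT_SIZE):
    """Return (result, apply): result is 0 or a negative errno,
    apply says whether the op should actually touch the object."""
    # Zero-length (or negative-length) data ops are rejected outright.
    if op in ("write", "writefull", "zero") and length < 1:
        return (-errno.EINVAL, False)
    # Writes may not extend the object past the configured maximum.
    if op in ("write", "writefull") and offset + length > max_size:
        return (-errno.EFBIG, False)
    # For truncate, `offset` is the requested new size.
    if op == "truncate" and offset > max_size:
        return (-errno.EFBIG, False)
    # Assumption: a ZERO that starts at or past the end is skipped.
    if op == "zero" and offset >= object_size:
        return (0, False)
    return (0, True)
```

Returning an error code rather than asserting is the point of the fix: an oversized request is reported to the client instead of crashing the daemon.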

Fixes: #5252
Fixes: #5340
Signed-off-by: David Zafman <david.zafman@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
12 years agoceph_test_rados: add --pool <name> arg
Sage Weil [Fri, 14 Jun 2013 05:08:36 +0000 (22:08 -0700)]
ceph_test_rados: add --pool <name> arg

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoMerge remote-tracking branch 'gh/next'
Sage Weil [Fri, 14 Jun 2013 04:33:25 +0000 (21:33 -0700)]
Merge remote-tracking branch 'gh/next'

12 years agoMerge pull request #362 from ceph/wip-4984
Dan Mick [Fri, 14 Jun 2013 02:37:37 +0000 (19:37 -0700)]
Merge pull request #362 from ceph/wip-4984

ceph-disk: udev/partprobe redo, zap command, activate-journal command

12 years agoceph-fuse: fix uninitialized variable
Sage Weil [Fri, 14 Jun 2013 01:13:34 +0000 (18:13 -0700)]
ceph-fuse: fix uninitialized variable

There is a delete call in the out_mc_start_failed path.

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoceph-disk: implement 'activate-journal' 362/head
Sage Weil [Thu, 13 Jun 2013 22:54:58 +0000 (15:54 -0700)]
ceph-disk: implement 'activate-journal'

Activate an osd via its journal device.  udev populates its symlinks and
triggers events in an order that is not related to whether the device is
an osd data partition or a journal.  That means that triggering
'ceph-disk activate' can happen before the journal (or journal symlink)
is present and then fail.

Similarly, it may be that they are on different disks that are hotplugged
with the journal second.

This can be wired up to the journal partition type to ensure that osds are
started when the journal appears second.

Include the udev rules to trigger this.

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoceph-disk: call partprobe outside of the prepare lock; drop udevadm settle
Sage Weil [Wed, 12 Jun 2013 01:35:01 +0000 (18:35 -0700)]
ceph-disk: call partprobe outside of the prepare lock; drop udevadm settle

After we change the final partition type, sgdisk may or may not trigger a
udev event, depending on how well udev is behaving (it varies between
distros, it seems).  The old code would often settle and wait for udev to
activate the device, and then partprobe would uselessly fail because it
was already mounted.

Call partprobe only at the very end, after prepare is done.  This ensures
that if partprobe calls udevadm settle (which it sometimes does) we do not
get stuck.

Drop the udevadm settle.  I'm not sure what this accomplishes; take it out,
at least until we determine we need it.

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoceph-disk: add 'zap' command
Sage Weil [Thu, 13 Jun 2013 18:03:37 +0000 (11:03 -0700)]
ceph-disk: add 'zap' command

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoMerge pull request #363 from dmick/wip-cli-help
Sage Weil [Fri, 14 Jun 2013 00:47:41 +0000 (17:47 -0700)]
Merge pull request #363 from dmick/wip-cli-help

Reviewed-by: Sage Weil <sage@inktank.com>
12 years agoceph.in: allow args with -h to limit help to cmds that match partially 363/head
Dan Mick [Fri, 14 Jun 2013 00:40:02 +0000 (17:40 -0700)]
ceph.in: allow args with -h to limit help to cmds that match partially

Enables "ceph -h pg" to see just the pg commands
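
A rough sketch of how partial matching for help output could work. This is not the code in ceph.in; the function name, the word-by-word prefix rule, and the sample command list are assumptions made for illustration.

```python
def matching_help(commands, partial_args):
    """Return the commands whose leading words start with the given partial words."""
    if not partial_args:
        return list(commands)
    hits = []
    for cmd in commands:
        words = cmd.split()
        # Every partial word must prefix-match the word in the same position.
        if all(
            i < len(words) and words[i].startswith(arg)
            for i, arg in enumerate(partial_args)
        ):
            hits.append(cmd)
    return hits
```

With a command table of "pg dump", "pg stat" and "osd tree", `matching_help(table, ["pg"])` would keep only the two pg entries.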

Signed-off-by: Dan Mick <dan.mick@inktank.com>
12 years agoceph.in: better global description of tool
Dan Mick [Fri, 14 Jun 2013 00:38:50 +0000 (17:38 -0700)]
ceph.in: better global description of tool

Signed-off-by: Dan Mick <dan.mick@inktank.com>
12 years agoceph.in: less verbosity on error
Dan Mick [Fri, 14 Jun 2013 00:38:26 +0000 (17:38 -0700)]
ceph.in: less verbosity on error

Only show 'did you mean?' when in verbose mode
Only show first ten closest matches on error
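
One possible reading of these two rules, sketched with the standard library's difflib. The message text, function names, and the choice of difflib as the similarity measure are assumptions; the real tool may rank matches differently.

```python
import difflib

MAX_SUGGESTIONS = 10  # "first ten closest matches" from the commit message


def closest_commands(bad_cmd, known_cmds, limit=MAX_SUGGESTIONS):
    """Return at most `limit` known commands that resemble bad_cmd, best first."""
    return difflib.get_close_matches(bad_cmd, known_cmds, n=limit)


def error_report(bad_cmd, known_cmds, verbose=False):
    """Build the lines printed for an unrecognized command."""
    lines = ["no valid command found: " + bad_cmd]
    # Suggestions are only shown when the user asked for verbose output.
    if verbose:
        matches = closest_commands(bad_cmd, known_cmds)
        if matches:
            lines.append("did you mean?")
            lines.extend(matches)
    return lines
```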

Signed-off-by: Dan Mick <dan.mick@inktank.com>
12 years agolibrados: add missing #include
Sage Weil [Fri, 14 Jun 2013 00:38:02 +0000 (17:38 -0700)]
librados: add missing #include

librados/librados.cc: In function 'int rados_mon_command_target(void*, const char*, const char**, size_t, const char*, size_t, char**, size_t*, char**, size_t*)':
error: librados/librados.cc:1877: 'LONG_MAX' was not declared in this scope
error: librados/librados.cc:1877: 'LONG_MIN' was not declared in this scope

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agolibrados: wait for osdmap for commands that need it
Sage Weil [Thu, 13 Jun 2013 23:39:30 +0000 (16:39 -0700)]
librados: wait for osdmap for commands that need it

In commit 7e1cf87b5158c870e2a118ed6d316be8cb9818ce we stopped waiting for
the osdmap on start because the Objecter will normally wait, but for some
commands we assume the osdmap is recent(ish).

Signed-off-by: Sage Weil <sage@inktank.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
12 years agorules: Don't disable tcmalloc on ARM (and other non-intel)
Gary Lowell [Thu, 13 Jun 2013 23:38:26 +0000 (16:38 -0700)]
rules:  Don't disable tcmalloc on ARM (and other non-intel)

Fixes #5342

Signed-off-by: Gary Lowell <gary.lowell@inktank.com>
12 years agoMerge pull request #356 from ceph/wip-leaks
Sage Weil [Thu, 13 Jun 2013 23:21:21 +0000 (16:21 -0700)]
Merge pull request #356 from ceph/wip-leaks

Reviewed-by: Samuel Just <sam.just@inktank.com>
12 years agoMerge branch 'wip-objecter' into next
Sage Weil [Thu, 13 Jun 2013 23:15:44 +0000 (16:15 -0700)]
Merge branch 'wip-objecter' into next

Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
12 years agoosdc/Objecter: dump command ops
Sage Weil [Thu, 13 Jun 2013 23:01:31 +0000 (16:01 -0700)]
osdc/Objecter: dump command ops

Dump command_ops along with everything else.

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoosdc/Objecter: ping osds for which we have pending commands
Sage Weil [Thu, 13 Jun 2013 22:57:57 +0000 (15:57 -0700)]
osdc/Objecter: ping osds for which we have pending commands

As with ops and linger_ops, this ensures we detect connection resets.

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoceph.in: refuse 'ceph <type> tell' commands; suggest 'ceph tell <type>'
Dan Mick [Thu, 13 Jun 2013 22:48:32 +0000 (15:48 -0700)]
ceph.in: refuse 'ceph <type> tell' commands; suggest 'ceph tell <type>'

Signed-off-by: Dan Mick <dan.mick@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
12 years agoceph.in: argparsing cleanup: suppress --completion, add help
Dan Mick [Thu, 13 Jun 2013 22:30:38 +0000 (15:30 -0700)]
ceph.in: argparsing cleanup: suppress --completion, add help

Options -v, --verbose, --concise didn't have helpstrings
Option --completion doesn't quite work yet, and should be hidden anyway

Signed-off-by: Dan Mick <dan.mick@inktank.com>
12 years agoMerge remote-tracking branch 'gh/next'
Sage Weil [Thu, 13 Jun 2013 22:17:05 +0000 (15:17 -0700)]
Merge remote-tracking branch 'gh/next'

12 years agoosdc/Objecter: kick command ops on osd con resets
Sage Weil [Thu, 13 Jun 2013 22:13:47 +0000 (15:13 -0700)]
osdc/Objecter: kick command ops on osd con resets

Resend osd/pg commands on the OSDSession, just as we do with other request
types.

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoosdc/Objecter: add perfcounters for commands
Sage Weil [Thu, 13 Jun 2013 22:13:18 +0000 (15:13 -0700)]
osdc/Objecter: add perfcounters for commands

This matches the other counters we maintain for other kinds of ops.

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agomon: fix idempotency of 'osd crush add'
Sage Weil [Thu, 13 Jun 2013 21:01:01 +0000 (14:01 -0700)]
mon: fix idempotency of 'osd crush add'

If we add an item that already exists in a particular position, we should
update instead of inserting it; the CrushWrapper methods are not
idempotent.
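
The update-instead-of-insert idea can be sketched with a plain dictionary standing in for a CRUSH bucket. The function name and return strings are hypothetical; this is not the CrushWrapper API.

```python
def crush_add(bucket, item, weight):
    """Add item to bucket, updating in place when it is already there."""
    if item in bucket:
        if bucket[item] == weight:
            # Repeating the same request changes nothing.
            return "unchanged"
        bucket[item] = weight
        return "updated"
    bucket[item] = weight
    return "inserted"
```

Because a repeated call lands in the "unchanged" branch, a client that retries the command after a lost reply gets the same end state.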

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agolibrados: do not wait for osdmap on start
Sage Weil [Thu, 13 Jun 2013 21:42:03 +0000 (14:42 -0700)]
librados: do not wait for osdmap on start

If we abort while waiting, we clean up incorrectly (we switch the state value
incorrectly, and also fail to clean up the initialized objecter).

Instead, skip this wait; it's useless!

Signed-off-by: Sage Weil <sage@inktank.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
12 years agodoc: Updated with glossary terms.
John Wilkins [Thu, 13 Jun 2013 21:09:35 +0000 (14:09 -0700)]
doc: Updated with glossary terms.

Signed-off-by: John Wilkins <john.wilkins@inktank.com>
12 years agomon/MonmapMonitor: remove unused label
Sage Weil [Thu, 13 Jun 2013 18:27:49 +0000 (11:27 -0700)]
mon/MonmapMonitor: remove unused label

mon/MonmapMonitor.cc: In member function 'bool MonmapMonitor::preprocess_command(MMonCommand*)':
mon/MonmapMonitor.cc:273:2: warning: label 'out' defined but not used [-Wunused-label]

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agomon/MonCap: bootstrap-* need to subscribe to osdmap, monmap
Sage Weil [Thu, 13 Jun 2013 18:27:23 +0000 (11:27 -0700)]
mon/MonCap: bootstrap-* need to subscribe to osdmap, monmap

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agomon/MonClient: mark_down during get_monmap_privately() shutdown 356/head
Sage Weil [Thu, 13 Jun 2013 14:39:02 +0000 (07:39 -0700)]
mon/MonClient: mark_down during get_monmap_privately() shutdown

We explicitly mark_down() and clear cur_con when shutting down; do the same
for get_monmap_privately() to ensure that the reset event doesn't make us
do something silly (like, in this case, call _reopen_session() again).

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agomon/MonClient: mark_down connection on shutdown
Sage Weil [Thu, 13 Jun 2013 04:35:39 +0000 (21:35 -0700)]
mon/MonClient: mark_down connection on shutdown

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agomsgr: queue reset when marking down pipes on shutdown
Sage Weil [Thu, 13 Jun 2013 00:58:36 +0000 (17:58 -0700)]
msgr: queue reset when marking down pipes on shutdown

This lets the callbacks clean up ref cycles.

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agomsg/DispatchQueue: do not discard queued events on stop
Sage Weil [Wed, 12 Jun 2013 02:27:01 +0000 (19:27 -0700)]
msg/DispatchQueue: do not discard queued events on stop

When the shutdown/stop flag is set, continue to work through the queue.
Process events, but discard messages.  This avoids the loss of reset events
on shutdown that are necessary to clean up ref cycles.
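
A minimal model of the drain-on-stop behavior described here. The queue layout (pairs of kind and payload) and the callback names are assumptions for illustration, not the DispatchQueue implementation.

```python
from collections import deque


def drain(queue, stopping, on_event, on_message):
    """Work through the whole queue; when stopping, drop messages but keep events."""
    while queue:
        kind, payload = queue.popleft()
        if kind == "message":
            if stopping:
                continue  # messages are discarded during shutdown
            on_message(payload)
        else:
            # Events such as resets are always delivered so handlers can clean up.
            on_event(payload)
```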

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agomsgr: queue reset exactly once on any connection
Sage Weil [Tue, 11 Jun 2013 23:44:05 +0000 (16:44 -0700)]
msgr: queue reset exactly once on any connection

Use the atomic pipe link removal as a signal that we are the one failing
the con and use that to queue the reset event.

This fixes the case where we have an open, the session gets set up via the
handle_accept callback, and then race with another connection and go into
wait + close, or just close.  In that case, fault() needs to queue a reset
event to match the accept.
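
The "atomic removal as a signal" pattern can be illustrated with a toy registry. The class and method names are hypothetical and the lock stands in for whatever atomicity the messenger actually relies on.

```python
import threading


class PipeRegistry:
    """Toy stand-in for the messenger's connection-to-pipe links."""

    def __init__(self):
        self._lock = threading.Lock()
        self._links = {}
        self.reset_events = []

    def link(self, con, pipe):
        with self._lock:
            self._links[con] = pipe

    def unlink(self, con, pipe):
        """Remove the link atomically; True only for the caller that removed it."""
        with self._lock:
            if self._links.get(con) == pipe:
                del self._links[con]
                return True
            return False

    def fault(self, con, pipe):
        # Whoever wins the removal is the one failing the con, so only
        # that caller queues the reset event.
        if self.unlink(con, pipe):
            self.reset_events.append(con)
```

Calling `fault()` twice for the same connection queues a single reset, which is the exactly-once property the commit title asks for.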

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agomsg/Pipe: include con reef in debug prestring
Sage Weil [Tue, 11 Jun 2013 18:51:14 +0000 (11:51 -0700)]
msg/Pipe: include con reef in debug prestring

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agomsg/Pipe: reset replaced pipes
Sage Weil [Tue, 11 Jun 2013 18:38:44 +0000 (11:38 -0700)]
msg/Pipe: reset replaced pipes

This gives the ms_handle_reset call a chance to clean up (for example, by
breaking a con->priv <-> session reference cycle).

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agomsgr: use ConnectionRef throughout
Sage Weil [Mon, 10 Jun 2013 03:21:49 +0000 (20:21 -0700)]
msgr: use ConnectionRef throughout

Make RefCountedObject a private parent of Connection so that users are
forced to use ConnectionRef whenever references are taken.

Many methods can still take a raw Connection* when they are using the
caller's reference but not taking their own; this is cheaper than
twiddling the reference count, and the lifetime is still well defined.
Local variables generally use ConnectionRef, though.

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agomon/MonClient: tear down version requests on shutdown
Sage Weil [Mon, 10 Jun 2013 17:31:22 +0000 (10:31 -0700)]
mon/MonClient: tear down version requests on shutdown

Make sure all callers can handle ECANCELED.

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agomon/PaxosService: discard messages during shutdown
Sage Weil [Tue, 11 Jun 2013 00:34:24 +0000 (17:34 -0700)]
mon/PaxosService: discard messages during shutdown

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agomon: add is_shutdown() state helper/accessor
Sage Weil [Tue, 11 Jun 2013 00:34:12 +0000 (17:34 -0700)]
mon: add is_shutdown() state helper/accessor

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agomon: shut down Paxos on shutdown
Sage Weil [Tue, 11 Jun 2013 00:28:51 +0000 (17:28 -0700)]
mon: shut down Paxos on shutdown

This cleans up the completions for any paxos waiters.

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoosd: break con <-> session cycle on reset
Sage Weil [Tue, 11 Jun 2013 18:59:24 +0000 (11:59 -0700)]
osd: break con <-> session cycle on reset

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoosd: do not leak HeartbeatSession on shutdown
Sage Weil [Tue, 11 Jun 2013 18:51:05 +0000 (11:51 -0700)]
osd: do not leak HeartbeatSession on shutdown

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoosd: close classes on shutdown
Sage Weil [Mon, 10 Jun 2013 18:55:16 +0000 (11:55 -0700)]
osd: close classes on shutdown

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoosd: do not leak MOSDPings on shutdown
Sage Weil [Mon, 10 Jun 2013 18:51:37 +0000 (11:51 -0700)]
osd: do not leak MOSDPings on shutdown

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoosd/ReplicatedPG: don't leak Session refs in do_osd_op_effects()
Sage Weil [Sun, 9 Jun 2013 04:50:53 +0000 (21:50 -0700)]
osd/ReplicatedPG: don't leak Session refs in do_osd_op_effects()

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agomessages/MMonSync: initialize crc in ctor
Sage Weil [Tue, 11 Jun 2013 00:28:22 +0000 (17:28 -0700)]
messages/MMonSync: initialize crc in ctor

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agovstart.sh: put exports at top
Sage Weil [Thu, 13 Jun 2013 17:52:00 +0000 (10:52 -0700)]
vstart.sh: put exports at top

Where I can 'head vstart.sh' to find them quickly.

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoPendingReleaseNotes: notes on CLI changes
Sage Weil [Thu, 13 Jun 2013 17:46:45 +0000 (10:46 -0700)]
PendingReleaseNotes: notes on CLI changes

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoPendingReleaseNotes: cli changes, and ceph tell ...
Sage Weil [Thu, 13 Jun 2013 17:21:59 +0000 (10:21 -0700)]
PendingReleaseNotes: cli changes, and ceph tell ...

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agodoc/release-notes: add missed notes for 0.63 and 0.64
Sage Weil [Thu, 13 Jun 2013 17:19:39 +0000 (10:19 -0700)]
doc/release-notes: add missed notes for 0.63 and 0.64

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoMerge branch 'wip-tell' into next
Sage Weil [Thu, 13 Jun 2013 16:27:15 +0000 (09:27 -0700)]
Merge branch 'wip-tell' into next

Reviewed-by: Dan Mick <dan.mick@inktank.com>
12 years agomon: remove support for 'mon tell ...' and 'osd tell ...'
Sage Weil [Wed, 12 Jun 2013 23:56:45 +0000 (16:56 -0700)]
mon: remove support for 'mon tell ...' and 'osd tell ...'

It doesn't work.  The commands the ceph cli sends are vector<string>, and
the mon expects json.

Leave the MDS one in place since ceph-mds still takes strings.

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoceph: add support for 'tell mon.X ...'
Sage Weil [Wed, 12 Jun 2013 23:55:03 +0000 (16:55 -0700)]
ceph: add support for 'tell mon.X ...'

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agolibrados: new rados_mon_command_target to talk to a specific monitor
Sage Weil [Wed, 12 Jun 2013 23:36:39 +0000 (16:36 -0700)]
librados: new rados_mon_command_target to talk to a specific monitor

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoMerge pull request #360 from dachary/master
Sage Weil [Thu, 13 Jun 2013 15:23:00 +0000 (08:23 -0700)]
Merge pull request #360 from dachary/master

add apt-get update to installation instructions

12 years agoadd apt-get update to installation instructions 360/head
Loic Dachary [Thu, 13 Jun 2013 06:53:26 +0000 (08:53 +0200)]
add apt-get update to installation instructions

Without apt-get update the repository added to the sources.list is not taken into consideration and an older version of ceph-deploy is going to be installed.

Signed-off-by: Loic Dachary <loic@dachary.org>
12 years agoUpdate README dependency lists
Dan Mick [Thu, 13 Jun 2013 05:25:04 +0000 (22:25 -0700)]
Update README dependency lists

Signed-off-by: Dan Mick <dan.mick@inktank.com>
12 years agoceph-disk: extra dash in error message
Dan Mick [Thu, 13 Jun 2013 05:22:42 +0000 (22:22 -0700)]
ceph-disk: extra dash in error message

Signed-off-by: Dan Mick <dan.mick@inktank.com>
12 years agoClean up CrushWrapper methods that take string: no c_str() necessary
Dan Mick [Thu, 13 Jun 2013 03:59:49 +0000 (20:59 -0700)]
Clean up CrushWrapper methods that take string: no c_str() necessary

Signed-off-by: Dan Mick <dan.mick@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
12 years agoOSDMonitor: osd id when id already exists needs to come to stdout too
Dan Mick [Thu, 13 Jun 2013 03:59:08 +0000 (20:59 -0700)]
OSDMonitor: osd id when id already exists needs to come to stdout too

Found by qa/workunits/mon/osd.sh

Signed-off-by: Dan Mick <dan.mick@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
12 years agoceph, mon/OSDMonitor: fix up osd crush commands for <osd.N> or <N>
Dan Mick [Thu, 13 Jun 2013 01:08:17 +0000 (18:08 -0700)]
ceph, mon/OSDMonitor: fix up osd crush commands for <osd.N> or <N>

The new parsing code had been trying to allow flexibility for the
'old form' commands (where id could be different from N in osd.N),
but also accept 'new form' commands.  The new rule is that where
there's an OSD specified in the osd crush command, it is of type
CephOsdName, which can be an id *or* 'osd.<id>', but not both.

Pass CephOsdName as int64_t 'id' for convenience in mon code
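
A sketch of the accepted forms, assuming a helper that normalizes either spelling to an integer id. The function name and error text are hypothetical; the real CephOsdName validation lives in the CLI's argument parsing code.

```python
def parse_osd_name(arg):
    """Accept '<id>' or 'osd.<id>' and return the integer id."""
    text = arg[len("osd."):] if arg.startswith("osd.") else arg
    # Only plain non-negative decimal ids are accepted.
    if not (text.isascii() and text.isdigit()):
        raise ValueError("osd id " + repr(arg) + " not an integer or osd.<integer>")
    return int(text)
```

Both `parse_osd_name("3")` and `parse_osd_name("osd.3")` yield the same integer, which is what lets the monitor code handle a single 'id' value.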

Signed-off-by: Dan Mick <dan.mick@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
12 years agoconfig: fix run_dir typo
Sage Weil [Thu, 13 Jun 2013 04:47:09 +0000 (21:47 -0700)]
config: fix run_dir typo

From 654299108bfb11e7dce45f54946d1505f71d2de8.

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agomon/MonClient: send commands to a specific monitor
Sage Weil [Wed, 12 Jun 2013 23:36:21 +0000 (16:36 -0700)]
mon/MonClient: send commands to a specific monitor

This implementation is limited: we direct our command by reopening
a session with the specific monitor.  If there is more than one of these
queued we will fail to reach either.

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoceph: implement 'ceph tell osd.* ...'
Sage Weil [Wed, 12 Jun 2013 21:55:15 +0000 (14:55 -0700)]
ceph: implement 'ceph tell osd.* ...'

Send the command to each target.  Do this in series, for now.  Error out if
any one fails.

Later, we should do them in parallel.
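
The serial fan-out can be sketched as a simple loop. The `send` callback and its integer return convention are assumptions, and stopping at the first failure is one reading of "error out if any one fails".

```python
def tell_all(targets, send):
    """Send a command to each target in order; return the first nonzero result."""
    for target in targets:
        ret = send(target)
        if ret != 0:
            return ret  # stop at the first failure
    return 0
```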

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoMerge remote-tracking branch 'gh/next'
Sage Weil [Thu, 13 Jun 2013 04:26:17 +0000 (21:26 -0700)]
Merge remote-tracking branch 'gh/next'

12 years agoMerge pull request #351 from ceph/wip-var-run
Sage Weil [Thu, 13 Jun 2013 04:24:16 +0000 (21:24 -0700)]
Merge pull request #351 from ceph/wip-var-run

Reviewed-by: Dan Mick <dan.mick@inktank.com>
12 years agovstart.sh: set run_dir to out 351/head
Sage Weil [Thu, 13 Jun 2013 04:23:46 +0000 (21:23 -0700)]
vstart.sh: set run_dir to out

This avoids annoying errors about creating /var/run/ceph from
init-ceph.

Fixes: #4036
Signed-off-by: Sage Weil <sage@inktank.com>
12 years agorbd image_read.sh: wait for rbd sysfs files to appear
Josh Durgin [Thu, 13 Jun 2013 03:34:09 +0000 (20:34 -0700)]
rbd image_read.sh: wait for rbd sysfs files to appear

Poll until they are available for chmoding.

Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
12 years agoosdc/Objecter: fix handling for osd_command dne/down cases
Sage Weil [Thu, 13 Jun 2013 01:13:12 +0000 (18:13 -0700)]
osdc/Objecter: fix handling for osd_command dne/down cases

Generalize the map check machinery that the pool dne check uses to also
get the latest map for OSD down/dne checks.  This is better semantics, but
more importantly fixes the more immediate bug of returning the error code
to the caller from the osd_command -> _submit_command (that is ignored by
pretty much any caller) and then never triggering the callback.

Fixes: #5331
Signed-off-by: Sage Weil <sage@inktank.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
12 years agoinit-ceph: look to ceph.conf instead of hard-coding /var/run/ceph
Sage Weil [Sat, 8 Jun 2013 00:04:04 +0000 (17:04 -0700)]
init-ceph: look to ceph.conf instead of hard-coding /var/run/ceph

It could be elsewhere!

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoglobal: create /var/run/ceph on daemon startup
Sage Weil [Sat, 8 Jun 2013 00:03:41 +0000 (17:03 -0700)]
global: create /var/run/ceph on daemon startup

This handles cases where the daemon is started without the benefit of
sysvinit or upstart (as with teuthology or ceph-fuse).

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoceph: only use readline when in interactive mode
Sage Weil [Wed, 12 Jun 2013 23:22:45 +0000 (16:22 -0700)]
ceph: only use readline when in interactive mode

A mere

  import readline

line is dumping this to stdout on CentOS 6.3:

  00000000  1b 5b 3f 31 30 33 34 68  .[?1034h

That confuses non-terminals that read from stdout, so only import when we
are in the interactive mode.
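
A deferred import along these lines might look like the following. The helper name and the `interactive` flag are hypothetical; the byte string is simply the hexdump above decoded.

```python
# The eight bytes from the hexdump in the commit message: ESC [ ? 1 0 3 4 h
STRAY_OUTPUT = bytes.fromhex("1b5b3f3130333468")


def load_readline(interactive):
    """Import readline only when an interactive prompt is about to start."""
    if not interactive:
        return None
    # Deferred on purpose so non-interactive runs never trigger the escape
    # sequence; the module may be unavailable on some platforms.
    import readline
    return readline
```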

Signed-off-by: Sage Weil <sage@inktank.com>
Reviewed-by: Dan Mick <dan.mick@inktank.com>
12 years agomon: fix read of format_version out of leveldb
Sage Weil [Wed, 12 Jun 2013 18:23:23 +0000 (11:23 -0700)]
mon: fix read of format_version out of leveldb

The get_version(string, string) is the wrong method; it combines the two
args into a key that is nested inside prefix (so it's prefix/a/b), but we
want prefix/format_version.  Add a method to grab an int for this
particular combo and use that.
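
The key mix-up can be shown with a dictionary standing in for the store. The "/" separator follows the notation in this message, and the "paxos" prefix and helper names are hypothetical; the real key encoding may differ.

```python
SEP = "/"  # separator as written in the commit message; the real store may differ


def nested_key(prefix, a, b):
    """Key shape produced by the two-string lookup: prefix/a/b."""
    return SEP.join([prefix, a, b])


def direct_key(prefix, name):
    """Key shape that is actually wanted: prefix/name."""
    return SEP.join([prefix, name])


def read_format_version(store, prefix):
    """Look up the integer stored directly under prefix/format_version."""
    return store.get(direct_key(prefix, "format_version"))
```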

This fixes an infinite loop when we actually trigger this code.

Bug introduced by f43c974571beac0c8e54fa699bfa96a1befaf56c.

Signed-off-by: Sage Weil <sage@inktank.com>
Reviewed-by: Joao Eduardo Luis <joao.luis@inktank.com>
12 years agodoc/release-notes: v0.63 and v0.64 notes
Sage Weil [Wed, 12 Jun 2013 22:29:42 +0000 (15:29 -0700)]
doc/release-notes: v0.63 and v0.64 notes

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoMerge branch 'next'
Gary Lowell [Wed, 12 Jun 2013 22:00:05 +0000 (15:00 -0700)]
Merge branch 'next'

12 years agoceph: filter out empty lines from osdids()
Sage Weil [Wed, 12 Jun 2013 21:54:10 +0000 (14:54 -0700)]
ceph: filter out empty lines from osdids()

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoceph: accept osd.* as a valid name
Sage Weil [Wed, 12 Jun 2013 21:53:56 +0000 (14:53 -0700)]
ceph: accept osd.* as a valid name

This will be used for 'ceph tell osd.* ...'

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoceph: make life easier on developers by handling in-tree runs
Dan Mick [Wed, 12 Jun 2013 02:46:53 +0000 (19:46 -0700)]
ceph: make life easier on developers by handling in-tree runs

If <path-to-ceph> contains pybind and .libs:
- prepend <path-to-ceph>/pybind to PYTHONPATH
- append <path-to-ceph>/.libs to LD_LIBRARY_PATH if not already there
  and exec self so it takes effect
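
The two steps above can be sketched as a pure function that computes the adjusted environment and reports whether a re-exec is needed. The function name, the injectable `isdir` check, and the return shape are assumptions; the actual script may modify its import path differently.

```python
import os


def in_tree_env(ceph_dir, environ, isdir=os.path.isdir):
    """Return (new_env, need_reexec) for a possible in-tree run."""
    pybind = os.path.join(ceph_dir, "pybind")
    libs = os.path.join(ceph_dir, ".libs")
    env = dict(environ)
    # Only treat this as an in-tree run when both directories exist.
    if not (isdir(pybind) and isdir(libs)):
        return env, False
    old_py = env.get("PYTHONPATH", "")
    env["PYTHONPATH"] = pybind + (os.pathsep + old_py if old_py else "")
    ld_parts = [p for p in env.get("LD_LIBRARY_PATH", "").split(os.pathsep) if p]
    if libs in ld_parts:
        return env, False  # already set, no need to exec again
    env["LD_LIBRARY_PATH"] = os.pathsep.join(ld_parts + [libs])
    return env, True
```

The re-exec matters because the dynamic loader reads LD_LIBRARY_PATH at process start, so changing it in a running process has no effect on libraries loaded later.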

Signed-off-by: Dan Mick <dan.mick@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
12 years agoqa/workunits/cephtool/test.sh: look for 'ceph log' via -w, not in log file
Sage Weil [Wed, 12 Jun 2013 21:00:24 +0000 (14:00 -0700)]
qa/workunits/cephtool/test.sh: look for 'ceph log' via -w, not in log file

'ceph-conf ...' doesn't give you final/default values, only what is in the
conf file.  Use -w output to test this instead.

Fixes: #5327
Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoceph: flush stdout on watch print
Sage Weil [Wed, 12 Jun 2013 20:59:37 +0000 (13:59 -0700)]
ceph: flush stdout on watch print

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agoMerge pull request #357 from atwardowski/patch-1
Sage Weil [Wed, 12 Jun 2013 20:50:15 +0000 (13:50 -0700)]
Merge pull request #357 from atwardowski/patch-1

Usage log and ops log are disabled by default since 0.56

12 years agoUsage log and ops log are disabled by default since 0.56 357/head
atwardowski [Wed, 12 Jun 2013 20:48:44 +0000 (17:48 -0300)]
Usage log and ops log are disabled by default since 0.56

http://ceph.com/docs/next/release-notes/#v0-56-bobtail

12 years agomon: fix 'pg dump_stuck' stuckops type
Sage Weil [Wed, 12 Jun 2013 20:39:30 +0000 (13:39 -0700)]
mon: fix 'pg dump_stuck' stuckops type

It's a list.

Fixes: #5332
Signed-off-by: Sage Weil <sage@inktank.com>
Reviewed-by: Dan Mick <dan.mick@inktank.com>
12 years agoMerge remote-tracking branch 'gh/wip_5238'
Sage Weil [Wed, 12 Jun 2013 20:31:22 +0000 (13:31 -0700)]
Merge remote-tracking branch 'gh/wip_5238'

Reviewed-by: Sage Weil <sage@inktank.com>
12 years agoqa: multiple_rsync.sh: more output
Sage Weil [Wed, 12 Jun 2013 20:26:03 +0000 (13:26 -0700)]
qa: multiple_rsync.sh: more output

Trying to track down this failure:

2013-06-12T06:11:13.430 INFO:teuthology.task.workunit.client.0.err:+ rsync -auv --exclude local/ /usr/ usr.2
2013-06-12T06:11:13.430 INFO:teuthology.task.workunit.client.0.err:+ tee a
2013-06-12T06:11:13.527 INFO:teuthology.task.workunit.client.0.out:sending incremental file list
2013-06-12T06:11:46.206 INFO:teuthology.task.workunit.client.0.out:
2013-06-12T06:11:46.208 INFO:teuthology.task.workunit.client.0.out:sent 1689627 bytes  received 8302 bytes  50684.45 bytes/sec
2013-06-12T06:11:46.208 INFO:teuthology.task.workunit.client.0.out:total size is 3274130495  speedup is 1928.31
2013-06-12T06:11:46.209 INFO:teuthology.task.workunit.client.0.err:+ wc -l a
2013-06-12T06:11:46.209 INFO:teuthology.task.workunit.client.0.err:+ grep 4
2013-06-12T06:11:46.211 INFO:teuthology.task.workunit:Stopping misc on client.0...

...and am perplexed!

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agov0.64 v0.64
Gary Lowell [Wed, 12 Jun 2013 16:54:06 +0000 (09:54 -0700)]
v0.64

12 years agoceph-fuse: older libfuses don't support FUSE_IOCTL_COMPAT
Dan Mick [Wed, 12 Jun 2013 01:33:08 +0000 (18:33 -0700)]
ceph-fuse: older libfuses don't support FUSE_IOCTL_COMPAT

Signed-off-by: Dan Mick <dan.mick@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
12 years agoceph-create-keys: Make sure directories for admin and bootstrap keys exist
Peter Wienemann [Tue, 11 Jun 2013 19:38:51 +0000 (21:38 +0200)]
ceph-create-keys: Make sure directories for admin and bootstrap keys exist

Signed-off-by: Peter Wienemann <wienemann@physik.uni-bonn.de>
12 years agostore_test: create_collection prior to split
Samuel Just [Tue, 11 Jun 2013 18:24:54 +0000 (11:24 -0700)]
store_test: create_collection prior to split

Fixes: #5310
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: David Zafman <david.zafman@inktank.com>
12 years agomon: adjust trim defaults
Sage Weil [Tue, 11 Jun 2013 23:30:41 +0000 (16:30 -0700)]
mon: adjust trim defaults

User testing has shown that smaller values yield better results; see #4917.
Jim's testing has had good results with even more aggressive trimming, but I
would like to do more validation yet before changing defaults.

Signed-off-by: Sage Weil <sage@inktank.com>
12 years agodoc: Reworked the landing page.
John Wilkins [Tue, 11 Jun 2013 22:32:23 +0000 (15:32 -0700)]
doc: Reworked the landing page.

Signed-off-by: John Wilkins <john.wilkins@inktank.com>
12 years agodoc: Added a hostname resolution section for local host execution.
John Wilkins [Tue, 11 Jun 2013 21:46:35 +0000 (14:46 -0700)]
doc: Added a hostname resolution section for local host execution.

Signed-off-by: John Wilkins <john.wilkins@inktank.com>
12 years agodoc: Added some tips and re-organized to simplify the process.
John Wilkins [Tue, 11 Jun 2013 21:46:12 +0000 (14:46 -0700)]
doc: Added some tips and re-organized to simplify the process.

Signed-off-by: John Wilkins <john.wilkins@inktank.com>
12 years agoclient: set issue_seq (not seq) in cap release
Sage Weil [Sun, 9 Jun 2013 00:38:07 +0000 (17:38 -0700)]
client: set issue_seq (not seq) in cap release

We regularly have been observing a stall where the MDS is blocked waiting
for a cap revocation (Ls, in our case) and never gets a reply.  We finally
tracked down the sequence:

 - mds issues cap seq 1 to client
 - mds does revocation (seq 2)
 - client replies
 - much time goes by
 - client trims inode from cache, sends release with seq == 2
 - mds ignores release because its issue_seq is 1
 - mds later tries to revoke other caps
 - client discards message because it doesn't have the inode in cache

The problem is simply that we are using seq instead of issue_seq in the
cap release message.  Note that the other release call site in
encode_inode_release() is correct.  That one is much more commonly
triggered by short tests, as compared to this case where the inode needs to
get pushed out of the client cache.
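
The sequence above can be replayed with a toy model of the MDS side. The class and its fields are simplifications invented for illustration and do not mirror the real MDS or client data structures.

```python
class MdsCap:
    """Toy model of one capability as tracked by the MDS."""

    def __init__(self):
        self.seq = 0        # bumped on every message about this cap
        self.issue_seq = 0  # seq at the time the cap was last issued
        self.issued = False

    def issue(self):
        self.seq += 1
        self.issue_seq = self.seq
        self.issued = True
        return self.seq

    def revoke(self):
        # A revocation advances seq but leaves issue_seq untouched.
        self.seq += 1
        return self.seq

    def handle_release(self, release_seq):
        """Accept a release only if it names the current issue_seq."""
        if release_seq != self.issue_seq:
            return False  # ignored, which is the stall described above
        self.issued = False
        return True
```

After `issue()` and `revoke()`, a release carrying 2 (seq) is ignored while a release carrying 1 (issue_seq) is accepted, matching the bug and the fix.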

Signed-off-by: Sage Weil <sage@inktank.com>
Reviewed-by: Greg Farnum <greg@inktank.com>
12 years agodoc: Added some Java S3 API troubleshooting entries.
John Wilkins [Tue, 11 Jun 2013 19:12:46 +0000 (12:12 -0700)]
doc: Added some Java S3 API troubleshooting entries.

Signed-off-by: John Wilkins <john.wilkins@inktank.com>