git.apps.os.sepia.ceph.com Git

mon: immediately propose after 'osd setmap'

Any subsequent osdmap changes will be ignored anyway.

Note that this still throws out changes _prior_ to the setmap. In
theory, that shouldn't matter, since we're replacing the map
anyway.

Sage Weil [Mon, 15 Dec 2008 20:33:10 +0000 (12:33 -0800)]

update debian, spec files to reflect cmonctl->ceph rename

Sage Weil [Mon, 15 Dec 2008 20:00:17 +0000 (12:00 -0800)]

ceph: allow > and < to redirect command input/output

Sage Weil [Mon, 15 Dec 2008 19:45:10 +0000 (11:45 -0800)]

ceph: fold cobserver into ceph

Sage Weil [Mon, 15 Dec 2008 19:34:11 +0000 (11:34 -0800)]

todo

Sage Weil [Mon, 15 Dec 2008 19:32:32 +0000 (11:32 -0800)]

mds: mark new directories new in journal; add to new list on replay

This ensures the dir is written when the logseg is eventually
expired.

Sage Weil [Mon, 15 Dec 2008 19:17:36 +0000 (11:17 -0800)]

cobserver: usage

Sage Weil [Mon, 15 Dec 2008 19:15:55 +0000 (11:15 -0800)]

rename cmonctl -> ceph

Yehuda Sadeh [Mon, 15 Dec 2008 19:21:33 +0000 (11:21 -0800)]

cobserver: retry when no response on startup

Yehuda Sadeh [Mon, 15 Dec 2008 18:45:08 +0000 (10:45 -0800)]

vstart.sh: can specify mon address

Yehuda Sadeh [Mon, 15 Dec 2008 17:54:50 +0000 (09:54 -0800)]

osd: add missing declaration

Sage Weil [Mon, 15 Dec 2008 05:40:21 +0000 (21:40 -0800)]

vstart: debug pg reporting

Sage Weil [Mon, 15 Dec 2008 05:39:54 +0000 (21:39 -0800)]

osd: generate_backlog asynchronously in a work queue; simplify peering a bit

We do all backlog creation in a thread pool. Break it down into the
disk scan and log integration steps, and drop PG lock as much as possible.
We only worry about pg acting changes; backlogs are only generated when the
pg is inactive.

We also simplify the activation code a bit by observing that replicas only
generate backlogs when their logs are discontiguous with the primary; in
such cases, we pull the backlog during peering and no generate_backlog
(equivalent) is needed for activation.

Sage Weil [Fri, 12 Dec 2008 05:12:42 +0000 (21:12 -0800)]

osd: half-finished backlog_wq

Sage Weil [Fri, 12 Dec 2008 21:46:06 +0000 (13:46 -0800)]

crush: don't recurse to leaf unless item is a bucket

This avoids choking on 'chooseleaf indep 0 item device' (it's
equivalent to 'choose indep 0 item device').
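
As an illustration of the rule in this commit, here is a minimal Python sketch (not the CRUSH C code). It assumes the usual CRUSH convention that bucket ids are negative and device ids are non-negative; `descend_to_leaf`, the `buckets` dict, and the always-pick-first-child choice are hypothetical simplifications.

```python
def descend_to_leaf(item, buckets):
    """Return a device id reachable from item.

    buckets maps a (negative) bucket id to its list of child ids.
    A real mapper would choose a child pseudo-randomly; this sketch
    always takes the first child to stay deterministic.
    """
    # Only recurse while the item is a bucket; a device is already a leaf.
    while item < 0:
        item = buckets[item][0]
    return item
```

With this guard, asking for a leaf under an item that is already a device simply returns that device, which is why the two rule forms quoted above behave the same.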

Sage Weil [Sat, 13 Dec 2008 04:21:34 +0000 (20:21 -0800)]

osd: shift generate_backlog out of merge_log

...in preparation for shifting it off to a worker thread.

Sage Weil [Sat, 13 Dec 2008 04:01:14 +0000 (20:01 -0800)]

osd: for remaining peers, pull either log or backlog, but not both.

Pull as far back as peer's last_epoch_started (if they have that much).
This ensures we will pull any divergent entries, if there are any, so
that we can update our peer_missing map accordingly.

Sage Weil [Fri, 12 Dec 2008 23:12:42 +0000 (15:12 -0800)]

osd: comment clean up

Sage Weil [Fri, 12 Dec 2008 23:12:31 +0000 (15:12 -0800)]

dstart: 3x replication

Sage Weil [Fri, 12 Dec 2008 23:02:55 +0000 (15:02 -0800)]

osd: simplify peer code a bit

Combine the two loops.

Sage Weil [Fri, 12 Dec 2008 23:00:34 +0000 (15:00 -0800)]

osd: simplify master log recreation; fix up Log::copy_after

Pull log from a given point from peer with the largest last_update. Do
not worry about divergence on the peer; that is handled by the new
primary. Simplifies PG::Query struct.

Fix copy_after to set an accurate .bottom, and to behave if the split
point given is divergent (i.e. doesn't actually appear in the log).
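
The copy_after behaviour described here can be sketched roughly as follows. This is a Python illustration, not the PG::Log code: it assumes versions are comparable tuples, that the log is sorted oldest to newest, and that "bottom" means the version immediately preceding the first copied entry. The function signature is hypothetical.

```python
def copy_after(entries, src_bottom, v):
    """Copy the log entries newer than v.

    entries: list of version tuples, sorted ascending.
    src_bottom: the source log's own bottom, used if nothing is <= v.
    Returns (bottom, copied_entries).
    """
    copied = []
    bottom = src_bottom
    # Walk from newest to oldest and stop at the first entry that is not
    # newer than v. This works even when v itself never appears in the log.
    for version in reversed(entries):
        if version <= v:
            bottom = version
            break
        copied.insert(0, version)
    return bottom, copied
```

Because the loop compares with `<=` instead of searching for an exact match, a divergent split point just lands between two existing entries.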

Yehuda Sadeh [Fri, 12 Dec 2008 22:51:40 +0000 (14:51 -0800)]

vstart.sh/stop.sh can start and stop specific modules

Sage Weil [Fri, 12 Dec 2008 22:07:49 +0000 (14:07 -0800)]

dstop: kill crun too

Sage Weil [Fri, 12 Dec 2008 22:07:40 +0000 (14:07 -0800)]

osd: move max_rep back to 3x

Sage Weil [Fri, 12 Dec 2008 05:10:43 +0000 (21:10 -0800)]

osd: rewrite proc_replica_log

After we have the master log, our only real purpose with other peer/stray
logs is to update replica missing maps and to find any missing objects.
Rewrite the log handling to clearly do that, with some comments.

Sage Weil [Fri, 12 Dec 2008 05:11:38 +0000 (21:11 -0800)]

osd: fix merge_old_entry bug

We want to revise_need to the _new_ entry's version, not the old one
(which is what missing already refers to).

Sage Weil [Fri, 12 Dec 2008 00:06:16 +0000 (16:06 -0800)]

osd: small peer cleanup

Make sure we check peer_log_requested and peer_summary_requested
independently, depending on which we want. Move 'since'
calculation to where it is needed.

Sage Weil [Fri, 12 Dec 2008 05:13:02 +0000 (21:13 -0800)]

todos

Sage Weil [Thu, 11 Dec 2008 22:06:57 +0000 (14:06 -0800)]

cobserver: print all log entries in each state

Sage Weil [Thu, 11 Dec 2008 22:03:18 +0000 (14:03 -0800)]

osd: initialize all MOSDSubOp fields

Sage Weil [Thu, 11 Dec 2008 22:03:07 +0000 (14:03 -0800)]

mon: fix up MLog constructor

Initialize 'last'. More idiot proof.

Sage Weil [Thu, 11 Dec 2008 21:48:02 +0000 (13:48 -0800)]

filestore: fix buffer overruns, mismatched delete[], small buffer

Sage Weil [Thu, 11 Dec 2008 21:44:56 +0000 (13:44 -0800)]

osd: pad eversion_t and zero remainder

Sage Weil [Thu, 11 Dec 2008 21:26:37 +0000 (13:26 -0800)]

mon: mkfs log msg as error

Just because it'll then create log.error, log.warn, etc.

Sage Weil [Thu, 11 Dec 2008 19:14:46 +0000 (11:14 -0800)]

osd: clear out pg_stat_queue on shutdown

Sage Weil [Thu, 11 Dec 2008 18:26:57 +0000 (10:26 -0800)]

filestore: sort objects by ino

This will greatly increase the speed at which we can stat() them, since
btrfs sorts them by ino in the btree.
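
A minimal sketch of the idea, in Python rather than the filestore's C++: list the directory, sort the entries by inode number, then stat them in that order. The helper name `stat_sorted_by_ino` is hypothetical, and any speedup depends on the file system storing inodes in that order, as the commit says btrfs does.

```python
import os

def stat_sorted_by_ino(dirpath):
    """Stat every entry in dirpath, visiting entries in inode order."""
    # scandir exposes the inode number from the directory entry itself,
    # so sorting does not require an extra stat() per file.
    with os.scandir(dirpath) as it:
        entries = sorted(it, key=lambda e: e.inode())
    # Now issue the stat() calls in ascending inode order.
    return [(e.name, os.stat(e.path, follow_symlinks=False)) for e in entries]
```

The result is a list of `(name, stat_result)` pairs in ascending inode order.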

Sage Weil [Thu, 11 Dec 2008 18:05:55 +0000 (10:05 -0800)]

workqueue: include types.h

Yehuda Sadeh [Thu, 11 Dec 2008 01:11:25 +0000 (17:11 -0800)]

dstart.sh fix broken commit

Yehuda Sadeh [Thu, 11 Dec 2008 01:09:44 +0000 (17:09 -0800)]

dstart.sh uses crun instead of -d (for gprof)

Sage Weil [Wed, 10 Dec 2008 23:54:05 +0000 (15:54 -0800)]

osd: don't clear pg_stats_valid on send

Sage Weil [Wed, 10 Dec 2008 23:31:25 +0000 (15:31 -0800)]

todo

Sage Weil [Wed, 10 Dec 2008 23:07:57 +0000 (15:07 -0800)]

osd: small cleanup

Sage Weil [Wed, 10 Dec 2008 22:50:16 +0000 (14:50 -0800)]

osd: call peer() if we need up_thru to activate

Sage Weil [Wed, 10 Dec 2008 22:25:44 +0000 (14:25 -0800)]

osd: remove/fix waiting_for_head primary recovery logic

Pulling map has the info we need. Simplify.

Sage Weil [Wed, 10 Dec 2008 21:46:40 +0000 (13:46 -0800)]

mon: observer cleanup

Simplify observer struct, some other stuff.

Call update_observers() when cmon is a single monitor (no cluster), and
also immediately after registering a new observer.

Make message in terms of latest summary vs state (Paxos class has no real
notion of 'incremental', just states and 'latest').

Sage Weil [Wed, 10 Dec 2008 21:22:28 +0000 (13:22 -0800)]

cmonctl: fix compile error

Sage Weil [Wed, 10 Dec 2008 21:04:46 +0000 (13:04 -0800)]

workqueue: drain

Sage Weil [Wed, 10 Dec 2008 20:51:59 +0000 (12:51 -0800)]

workqueue: virtual destructor

Sage Weil [Wed, 10 Dec 2008 20:19:15 +0000 (12:19 -0800)]

makefile: missing headers

Sage Weil [Wed, 10 Dec 2008 20:16:46 +0000 (12:16 -0800)]

osd: cleanup

Sage Weil [Wed, 10 Dec 2008 20:15:00 +0000 (12:15 -0800)]

workqueue: non-inline worker, control methods; debugging

Sage Weil [Wed, 10 Dec 2008 19:54:59 +0000 (11:54 -0800)]

mon: fix use after free

Sage Weil [Wed, 10 Dec 2008 19:35:04 +0000 (11:35 -0800)]

osd: use new workqueue in osd for ops

Sage Weil [Wed, 10 Dec 2008 19:13:00 +0000 (11:13 -0800)]

osd: shared threadpool for multiple work queues

Sage Weil [Wed, 10 Dec 2008 19:12:44 +0000 (11:12 -0800)]

osd: fix uninit value in scrub message

Sage Weil [Wed, 10 Dec 2008 00:47:40 +0000 (16:47 -0800)]

todos

Sage Weil [Wed, 10 Dec 2008 00:34:54 +0000 (16:34 -0800)]

mon: mark unresponsive mds laggy instead of failed until we can replace it

This way we flag laggy mds's, but hold out until they come back
online or we have a standby cmds to replace them. Should make
things much more tolerable.
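
The policy in this commit can be sketched as a small decision function. This is an illustrative Python model only; the function name, the state strings, and the data structures are hypothetical and do not mirror the monitor's real types.

```python
def handle_unresponsive_mds(mds_states, standbys, rank):
    """Decide what to do with an MDS that has stopped responding.

    mds_states maps rank -> state string; standbys is a list of spare
    daemon names. Both are illustrative stand-ins.
    """
    if standbys:
        # A replacement exists: take over the rank with a standby.
        replacement = standbys.pop(0)
        mds_states[rank] = "replaced_by:" + replacement
    else:
        # No replacement yet: flag as laggy and wait for it to come back.
        mds_states[rank] = "laggy"
    return mds_states[rank]
```

The point of the design is that "laggy" is recoverable: the rank is only given up when there is actually something to replace it with.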

Sage Weil [Tue, 9 Dec 2008 23:06:48 +0000 (15:06 -0800)]

cobserver: simplify headers

Sage Weil [Wed, 10 Dec 2008 00:00:27 +0000 (16:00 -0800)]

osd: make sure hb peers get marked down

We mark_down on osdmap update when we see an osd has gone down, but the
heartbeats are sent in a different thread without map_lock using
heartbeat_inst.  So, make sure heartbeat_inst entries are removed.

Also, we add hb peers at peers' request.  When removing such entries in
update_heartbeat_peers, mark_down then, too.  (We may mark_down a failed
peer, and then receive the hb request late.  So we mark that down next
time we update the heartbeat maps.)

Sage Weil [Tue, 9 Dec 2008 23:06:54 +0000 (15:06 -0800)]

osd: update_stat during recover_replicas()

Sage Weil [Tue, 9 Dec 2008 22:57:43 +0000 (14:57 -0800)]

dstart: --nostop option

to avoid ./dstop.sh

Sage Weil [Tue, 9 Dec 2008 22:57:19 +0000 (14:57 -0800)]

osd: drive primary recovery via missing map, not log

Sage Weil [Tue, 9 Dec 2008 22:57:01 +0000 (14:57 -0800)]

mon: osdmon cleanup

Sage Weil [Tue, 9 Dec 2008 21:34:27 +0000 (13:34 -0800)]

dstart: keep old cosd binaries around for a bit

Sage Weil [Tue, 9 Dec 2008 21:33:33 +0000 (13:33 -0800)]

osd: 'pg repair <pgid>' to repair an inconsistent pg using replicas

Sage Weil [Tue, 9 Dec 2008 20:27:31 +0000 (12:27 -0800)]

osd: don't read file content during _scrub

Sage Weil [Tue, 9 Dec 2008 19:56:51 +0000 (11:56 -0800)]

msgr: be noisier about mark_down calls

Sage Weil [Tue, 9 Dec 2008 19:00:04 +0000 (11:00 -0800)]

osd: avoid needless calls to peer(), build_prior()

Introduces PEERING pg state. Also is smarter about when build_prior and
peer are actually called.

Sage Weil [Tue, 9 Dec 2008 18:40:01 +0000 (10:40 -0800)]

osd: make prior_set_affected() slightly smarter

Only return true if an osd goes down that we didn't already know was
down (prior_set may contain down osds if the PG is marked DOWN).
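
A rough Python sketch of the check described above. The signature is hypothetical: the real method works on OSD maps, whereas this version takes plain sets of OSD ids to keep the logic visible.

```python
def prior_set_affected(prior_set, known_down, newly_down):
    """Return True only if an OSD in prior_set went down that we did
    not already know was down.

    prior_set: ids of OSDs in the PG's prior set (may include down OSDs).
    known_down: ids we had already recorded as down.
    newly_down: ids marked down in the new map.
    """
    return any(
        osd in prior_set and osd not in known_down
        for osd in newly_down
    )
```

An OSD that was already known to be down does not trigger a fresh round of peering, which is the saving the commit is after.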

Sage Weil [Tue, 9 Dec 2008 22:57:48 +0000 (14:57 -0800)]

cobserver: cleanups

Sage Weil [Tue, 9 Dec 2008 22:55:00 +0000 (14:55 -0800)]

mon: use 'latest' for latest osd, mds maps

Mainly for benefit of PaxosObserver, but it also cleans things up
a bit.

Sage Weil [Tue, 9 Dec 2008 22:44:58 +0000 (14:44 -0800)]

cobserver: cleanup; print map summaries w/ each new state

Sage Weil [Tue, 9 Dec 2008 22:44:41 +0000 (14:44 -0800)]

mon: refactor map print_summary/operator<< methods

Yehuda Sadeh [Tue, 9 Dec 2008 22:23:38 +0000 (14:23 -0800)]

cobserver: accidentally removed a line

Yehuda Sadeh [Tue, 9 Dec 2008 22:20:44 +0000 (14:20 -0800)]

kclient: missing files

Yehuda Sadeh [Tue, 9 Dec 2008 22:19:27 +0000 (14:19 -0800)]

whitespaces

Yehuda Sadeh [Tue, 9 Dec 2008 22:11:09 +0000 (14:11 -0800)]

mon: factor ClientMap class out

Yehuda Sadeh [Tue, 9 Dec 2008 21:39:10 +0000 (13:39 -0800)]

cobserver: utility, observe changes in different maps

Sage Weil [Tue, 9 Dec 2008 19:47:06 +0000 (11:47 -0800)]

osd: use push() to push clone op

Also fixes missing updates to peer_missing[peer] and pushing
map.

Sage Weil [Tue, 9 Dec 2008 17:58:16 +0000 (09:58 -0800)]

mon: factor out osdmap print, print_summary

Sage Weil [Tue, 9 Dec 2008 17:58:08 +0000 (09:58 -0800)]

mon: factor out mds print, print_summary

Sage Weil [Mon, 8 Dec 2008 21:50:46 +0000 (13:50 -0800)]

mds: stay loner if client has B and no other reason to switch state

If the client has dirty data, and there is no other reason to
toggle the lock state, leave it as LONER. The client will write
out at its leisure, and we'll avoid an unstable lock state that
is waiting on a potentially slow writeout.

Sage Weil [Tue, 9 Dec 2008 17:50:26 +0000 (09:50 -0800)]

osd: missing last_mon_heartbeat declaration

Sage Weil [Tue, 9 Dec 2008 16:48:03 +0000 (08:48 -0800)]

msgr: make sure nonce matches too when connecting to peer

Otherwise the predictable port numbers cause problems.
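
To illustrate why the nonce matters, here is a small Python sketch. The `Addr` tuple and `same_peer` helper are hypothetical stand-ins for the messenger's address type, and the addresses and nonces are made-up example values.

```python
from collections import namedtuple

# Hypothetical peer address: ip and port plus a per-process nonce.
Addr = namedtuple("Addr", ["ip", "port", "nonce"])

def same_peer(expected, actual):
    """True only if ip, port, and nonce all match."""
    return expected == actual

old = Addr("10.0.0.5", 6800, 1234)
restarted = Addr("10.0.0.5", 6800, 5678)  # same ip:port, new process
```

With predictable port numbers, a restarted daemon can come back on the same ip:port, so `old` and `restarted` differ only in the nonce, and `same_peer(old, restarted)` is false.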

Sage Weil [Tue, 9 Dec 2008 16:43:38 +0000 (08:43 -0800)]

msgr: print error when message type is unrecognized

Sage Weil [Tue, 9 Dec 2008 16:42:28 +0000 (08:42 -0800)]

osd: ping mon less frequently when peerless

Every second is too much. Make it tunable.

Sage Weil [Mon, 8 Dec 2008 22:03:46 +0000 (14:03 -0800)]

mon: typo in pg dump output

Sage Weil [Mon, 8 Dec 2008 19:44:21 +0000 (11:44 -0800)]

ceph: new default mon port; try to bind to port in known range

New monitor port in unused region (according to nmap-services).

Try to bind to a port in a known range, so that tools can easily
identify the protocol in use.
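
Binding within a known range can be sketched like this in Python. The helper name `bind_in_range` is hypothetical, and callers supply their own range; the commit does not state the actual port numbers, so none are assumed here.

```python
import socket

def bind_in_range(host, first_port, last_port):
    """Bind a TCP socket to the first free port in [first_port, last_port].

    Returns (socket, port). Raises OSError if every port is taken.
    """
    for port in range(first_port, last_port + 1):
        sock = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
        try:
            sock.bind((host, port))
            return sock, port
        except OSError:
            # Port unavailable: release this socket and try the next one.
            sock.close()
    raise OSError("no free port in range %d-%d" % (first_port, last_port))
```

Keeping daemons inside one range means a tool watching the network can guess the protocol from the port alone.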

Remove some old .sh cruft.
