]> git.apps.os.sepia.ceph.com Git - ceph.git/log
ceph.git
11 years agoceph_test_rados: wait for commit, not ack 1448/head 1449/head
Sage Weil [Thu, 13 Mar 2014 21:49:30 +0000 (14:49 -0700)]
ceph_test_rados: wait for commit, not ack

First, this is what we wanted in the first place

Second, if we wait for ACK, we may look at a user_version value that is
not stable.

Fixes: #7705
Signed-off-by: Sage Weil <sage@inktank.com>
11 years agoMOSDOp: include reassert_version in print
Sage Weil [Thu, 13 Mar 2014 21:45:49 +0000 (14:45 -0700)]
MOSDOp: include reassert_version in print

Signed-off-by: Sage Weil <sage@inktank.com>
11 years agorbd-fuse: fix signed/unsigned warning
Sage Weil [Thu, 13 Mar 2014 18:22:34 +0000 (11:22 -0700)]
rbd-fuse: fix signed/unsigned warning

rbd_fuse/rbd-fuse.c: In function 'enumerate_images':
rbd_fuse/rbd-fuse.c:113:2: warning: comparison between signed and unsigned integer expressions [-Wsign-compare]

Signed-off-by: Sage Weil <sage@inktank.com>
11 years agoMerge pull request #1440 from ceph/wip-7649
Samuel Just [Thu, 13 Mar 2014 01:33:03 +0000 (18:33 -0700)]
Merge pull request #1440 from ceph/wip-7649

Wip 7649

Reviewed-by: Sage Weil <sage@inktank.com>
11 years agoMerge pull request #1441 from ceph/wip-7671
Sage Weil [Thu, 13 Mar 2014 00:09:31 +0000 (17:09 -0700)]
Merge pull request #1441 from ceph/wip-7671

Wip 7671

Reviewed-by: Sage Weil <sage@inktank.com>
11 years agotest/librados/watch_notify: create foo before watching 1441/head
Samuel Just [Tue, 11 Mar 2014 21:17:47 +0000 (14:17 -0700)]
test/librados/watch_notify: create foo before watching

Signed-off-by: Samuel Just <sam.just@inktank.com>
11 years agotest/system/st_rados_watch: expect ENOENT for watch on non-existent object
Samuel Just [Tue, 11 Mar 2014 18:25:47 +0000 (11:25 -0700)]
test/system/st_rados_watch: expect ENOENT for watch on non-existent object

Signed-off-by: Samuel Just <sam.just@inktank.com>
11 years agoMerge pull request #1439 from ceph/wip-7682
Sage Weil [Wed, 12 Mar 2014 22:45:35 +0000 (15:45 -0700)]
Merge pull request #1439 from ceph/wip-7682

ReplicatedPG::already_(complete|ack) should skip temp object ops

Reviewed-by: Sage Weil <sage@inktank.com>
11 years agoReplicatedPG::already_(complete|ack) should skip temp object ops 1439/head
Samuel Just [Wed, 12 Mar 2014 21:07:50 +0000 (14:07 -0700)]
ReplicatedPG::already_(complete|ack) should skip temp object ops

We clearly won't get dup ops on these repops, and they don't
have meaningful versions since they don't carry log
entries.

Fixes: #7682
Signed-off-by: Samuel Just <sam.just@inktank.com>
11 years agoMerge pull request #1434 from ceph/wip-7695
Sage Weil [Wed, 12 Mar 2014 18:57:46 +0000 (11:57 -0700)]
Merge pull request #1434 from ceph/wip-7695

build-doc: fix checks for required commands for non-debian

Reviewed-by: Sage Weil <sage@inktank.com>
11 years agoMerge pull request #1436 from ceph/wip-7681
Sage Weil [Wed, 12 Mar 2014 17:46:54 +0000 (10:46 -0700)]
Merge pull request #1436 from ceph/wip-7681

ECBackend: when removing the temp obj, use the right shard

Reviewed-by: Sage Weil <sage@inktank.com>
11 years agoMerge pull request #1437 from ceph/wip-7650
Sage Weil [Wed, 12 Mar 2014 17:44:50 +0000 (10:44 -0700)]
Merge pull request #1437 from ceph/wip-7650

tools/rados/rados.cc: use write_full for sync_write for ec pools

Reviewed-by: Sage Weil <sage@inktank.com>
11 years agoPG: do not wait for flushed before activation 1440/head
Samuel Just [Tue, 11 Mar 2014 21:23:10 +0000 (14:23 -0700)]
PG: do not wait for flushed before activation

This should reduce the sting of the previous commit somewhat.  We wait
for the activation transactions to clear prior to accepting IO anyway,
so we can go ahead and get that process started without waiting for the
flush.

Signed-off-by: Samuel Just <sam.just@inktank.com>
11 years agoPG: do not serve requests until replicas have activated
Samuel Just [Tue, 11 Mar 2014 17:31:55 +0000 (10:31 -0700)]
PG: do not serve requests until replicas have activated

There are two problems:
1) We choose the min last_update amoung peers with the max local-les
value as an upper bound on requests which could have been reported to
the client as committed.  We then, for ec pools, roll back to that point
to ensure that we don't inadvertently commit to an update which fewer
than K replicas actually saw.  If the primary sets local-les, accepts an
update from a client, and there is a new interval before any of the
replicas have been activated, we will end up being forced to use that
update which no other replica has seen as the new last_update.  This
will cause the object to become unfound.  We don't have this problem as
long as all active replicas agree on last_update before we accept IO.

2) Even for replicated pools, we would then immediately respond to the
request which created the primary-only update with a commit since it is
in the log and we have no outstanding repops.  If we then lose that
primary before any of the replicas in the new interval record the new
log, we will not only lose the object, but also the log entry recording
it, which will result in a lost write.

For these reasons, it seems like we need to wait for the replicas to
activate before we can process new requests essentially because whatever
update we select as last_update is essentially regarded as committed as
soon as we accept IO.

Fixes: #7649
Signed-off-by: Samuel Just <sam.just@inktank.com>
11 years agoECBackend: when removing the temp obj, use the right shard 1436/head
Samuel Just [Tue, 11 Mar 2014 21:41:05 +0000 (14:41 -0700)]
ECBackend: when removing the temp obj, use the right shard

Introduced in d0b1094ff7b98ef9262ecb45ee8324853003a77c
Fixes: #7681
Signed-off-by: Samuel Just <sam.just@inktank.com>
11 years agobuild-doc: fix checks for required commands for non-debian 1434/head
Danny Al-Gaaf [Wed, 12 Mar 2014 17:09:59 +0000 (18:09 +0100)]
build-doc: fix checks for required commands for non-debian

Fixes: 7695
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
11 years agoMerge pull request #1412 from ceph/wip-libxfs-flag
Yehuda Sadeh [Wed, 12 Mar 2014 16:50:58 +0000 (09:50 -0700)]
Merge pull request #1412 from ceph/wip-libxfs-flag

FileStore: support compiling without libxfs

Reviewed-by: Yehuda Sadeh <yehuda@inktank.com>
11 years agoMerge pull request #1362 from dachary/wip-7548
Sage Weil [Wed, 12 Mar 2014 04:54:02 +0000 (21:54 -0700)]
Merge pull request #1362 from dachary/wip-7548

doc: erasure coded pool developer and operations documentation

Reviewed-by: Sage Weil <sage@inktank.com>
11 years agoMerge pull request #1425 from ceph/wip-rbd-fuse-enumerate
Sage Weil [Wed, 12 Mar 2014 04:41:53 +0000 (21:41 -0700)]
Merge pull request #1425 from ceph/wip-rbd-fuse-enumerate

rbd-fuse: fix enumerate_images() image names buffer size issue

Reviewed-by: Sage Weil <sage@inktank.com>
11 years agoMerge pull request #1409 from enovance/wip-brag
Sage Weil [Wed, 12 Mar 2014 04:25:25 +0000 (21:25 -0700)]
Merge pull request #1409 from enovance/wip-brag

ceph-brag enhancements

Reviewed-by: Sage Weil <sage@inktank.com>
11 years agoMerge pull request #1427 from ceph/wip-6889
Sage Weil [Wed, 12 Mar 2014 02:23:27 +0000 (19:23 -0700)]
Merge pull request #1427 from ceph/wip-6889

rgw: don't log system requests in usage log

Reviewed-by: Sage Weil <sage@inktank.com>
11 years agoMerge pull request #1432 from ceph/wip-7687
Sage Weil [Wed, 12 Mar 2014 01:23:14 +0000 (18:23 -0700)]
Merge pull request #1432 from ceph/wip-7687

rgw: don't overwrite bucket entry data when syncing user stats

Reviewed-by: Sage Weil <sage@inktank.com>
11 years agorgw: don't overwrite bucket entry data when syncing user stats 1432/head
Yehuda Sadeh [Wed, 12 Mar 2014 01:19:44 +0000 (18:19 -0700)]
rgw: don't overwrite bucket entry data when syncing user stats

Fixes: #7687
When syncing user bucket stats we overwritten the entire entry with the
passed in entry. We should only look at the stats portion, and not
overwrite the rest (which contains bucket creation time).

Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
11 years agoMerge pull request #1400 from ceph/wip-hint-tests
Sage Weil [Wed, 12 Mar 2014 01:09:54 +0000 (18:09 -0700)]
Merge pull request #1400 from ceph/wip-hint-tests

wip-hint-tests

Reviewed-by: Sage Weil <sage@inktank.com>
11 years agoqa/workunits/cephtool/test.sh: fix thrash (more)
Sage Weil [Wed, 12 Mar 2014 00:03:23 +0000 (17:03 -0700)]
qa/workunits/cephtool/test.sh: fix thrash (more)

If I have to touch this again I will remove it.  Ugh.  This time,

ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2014-03-11_02:30:01-rados-firefly-distro-basic-plana/125922

hit NXIO a few lines down because one of the OSDs was still down.

Signed-off-by: Sage Weil <sage@inktank.com>
11 years agoMerge pull request #1417 from ceph/wip-7663
Sage Weil [Tue, 11 Mar 2014 23:38:28 +0000 (16:38 -0700)]
Merge pull request #1417 from ceph/wip-7663

Wip 7663

Reviewed-by: Sage Weil <sage@inktank.com>
11 years agoMerge pull request #1416 from ceph/wip-div
Sage Weil [Tue, 11 Mar 2014 23:05:12 +0000 (16:05 -0700)]
Merge pull request #1416 from ceph/wip-div

More log handling fixes

Reviewed-by: Sage Weil <sage@inktank.com>
11 years agoosd/ReplicatedPG: fix typo
Sage Weil [Tue, 11 Mar 2014 19:14:49 +0000 (12:14 -0700)]
osd/ReplicatedPG: fix typo

This is the object count, not the dirty object count.  Broken by
00bf3b56743830a4a9c5d6765946a4e68f530c57.

Reported-by: Greg Farnum <greg@inktank.com>
Signed-off-by: Sage Weil <sage@inktank.com>
11 years agoMerge pull request #1430 from ceph/wip-7674
Sage Weil [Tue, 11 Mar 2014 18:43:35 +0000 (11:43 -0700)]
Merge pull request #1430 from ceph/wip-7674

osd/ReplicatedPG: do not include hit_set objects in full calculation

Reviewed-by: Samuel Just <sam.just@inktank.com>
11 years agoReplicatedPG: CEPH_OSD_OP_WATCH return -ENOENT if !obs.exists
Samuel Just [Mon, 10 Mar 2014 20:01:36 +0000 (13:01 -0700)]
ReplicatedPG: CEPH_OSD_OP_WATCH return -ENOENT if !obs.exists

Fixes: #7671
Signed-off-by: Samuel Just <sam.just@inktank.com>
11 years agoosd/ReplicatedPG: do not include hit_set objects in full calculation 1430/head
Sage Weil [Tue, 11 Mar 2014 17:49:47 +0000 (10:49 -0700)]
osd/ReplicatedPG: do not include hit_set objects in full calculation

If we have a low target and there are hit_set objects (which cannot be
evicted), we can get stuck in a full state and never get out of it.

Fixes: #7674
Signed-off-by: Sage Weil <sage@inktank.com>
11 years agoMerge pull request #1418 from ceph/wip-7672
Sage Weil [Tue, 11 Mar 2014 17:23:04 +0000 (10:23 -0700)]
Merge pull request #1418 from ceph/wip-7672

PG::choose_acting: filter CRUSH_ITEM_NONE out of have

Reviewed-by: Sage Weil <sage@inktank.com>
11 years agoMerge pull request #1429 from ceph/wip-7592-final
Sage Weil [Tue, 11 Mar 2014 17:18:59 +0000 (10:18 -0700)]
Merge pull request #1429 from ceph/wip-7592-final

Wip 7592 final

Reviewed-by: Sage Weil <sage@inktank.com>
11 years agoMerge pull request #1410 from ceph/wip-flock
Sage Weil [Tue, 11 Mar 2014 17:00:48 +0000 (10:00 -0700)]
Merge pull request #1410 from ceph/wip-flock

mds: fix owner check of file lock

Reviewed-by: Sage Weil <sage@inktank.com>
11 years agoMerge pull request #1385 from ceph/wip-nfs-export
Sage Weil [Tue, 11 Mar 2014 16:59:33 +0000 (09:59 -0700)]
Merge pull request #1385 from ceph/wip-nfs-export

mds: introduce LOOKUPNAME MDS request

Reviewed-by: Sage Weil <sage@inktank.com>
11 years agoosd: hit_set_persist(): Verify all objects aren't degraded 1429/head
David Zafman [Tue, 11 Mar 2014 02:54:57 +0000 (19:54 -0700)]
osd: hit_set_persist(): Verify all objects aren't degraded

Fixes: #7592
Signed-off-by: David Zafman <david.zafman@inktank.com>
11 years agorgw: don't log system requests in usage log 1427/head
Yehuda Sadeh [Fri, 22 Nov 2013 23:41:49 +0000 (15:41 -0800)]
rgw: don't log system requests in usage log

Fixes: 6889
System requets should not be logged in the usage log.

Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
11 years agorbd-fuse: fix enumerate_images() image names buffer size issue 1425/head
Ilya Dryomov [Tue, 11 Mar 2014 14:00:37 +0000 (16:00 +0200)]
rbd-fuse: fix enumerate_images() image names buffer size issue

Image names buffer is fixed at 1024.  This turns out to be not enough:
there are at least two "rbd-fuse rbd_list: error %d Numerical result
out of range" reports on the ML.  Fix it by calling rbd_list() twice to
first get the expected buffer size.  Also, get rid of the memory leak
and tweak the error message while at it.

Signed-off-by: Ilya Dryomov <ilya.dryomov@inktank.com>
11 years agoUse pgrep radosgw to determine if rados gateway is running.
Warren Usui [Sat, 1 Mar 2014 05:43:31 +0000 (21:43 -0800)]
Use pgrep radosgw to determine if rados gateway is running.

Fixes: 7528
Reviewed-by: Yehuda Sadeh <yehuda@inktank.com>
Signed-off-by: Warren Usui <warren.usui@inktank.com>
(cherry picked from commit 5b88856cd25a13842fa8ad0699b84fbdfbc13694)

11 years agoFixed get_status() to find client.radosgw fields inside of ps output.
Warren Usui [Fri, 21 Feb 2014 05:07:53 +0000 (21:07 -0800)]
Fixed get_status() to find client.radosgw fields inside of ps output.

Fixes: 7375
Reviewed-by: Yehuda Sadeh <yehuda@inktank.com>
Signed-off-by: Warren Usui <warren.usui@inktank.com>
(cherry picked from commit 8020dcf7791a0f459bae5e8a77d70ff1dc9c60bc)

11 years agoFix get_status() to find client.rados text inside of ps command results.
Warren Usui [Fri, 21 Feb 2014 05:11:45 +0000 (21:11 -0800)]
Fix get_status() to find client.rados text inside of ps command results.

Added port (fixed value for right now in teuthology) to hostname.
Fixes: 7374
Reviewed-by: Yehuda Sadeh <yehuda@inktank.com>
Signed-off-by: Warren Usui <warren.usui@inktank.com>
(cherry picked from commit 8200b8a02511e367370d33cb74c3d45ef85fca31)

11 years agoosd: Remove unused checkpoint code
David Zafman [Mon, 10 Mar 2014 20:35:19 +0000 (13:35 -0700)]
osd: Remove unused checkpoint code

Signed-off-by: David Zafman <david.zafman@inktank.com>
11 years agomds: fix owner check of file lock 1410/head
Yan, Zheng [Sun, 9 Mar 2014 23:36:14 +0000 (07:36 +0800)]
mds: fix owner check of file lock

flock and posix lock do not use process ID as owner identifier.
The process ID of who holds the lock is just for F_GETLK fcntl(2).
For linux kernel, File lock's owner identifier is the file pointer
through which the lock is requested.

The fix is do not take the 'pid_namespace' into consideration when
checking conflict locks. Also rename the 'pid' fields of struct
ceph_mds_request_args and struct ceph_filelock to 'owner', rename
'pid_namespace' fields to 'pid'.

The kclient counterpart of this patch modifies the flock code to
assign the file pointer to the 'owner' field of lock message. It
also set the most significant bit of the 'owner' field. We can use
that bit to distinguish between old and new clients.

Signed-off-by: Yan, Zheng <zheng.z.yan@intel.com>
11 years agoReplicatedPG: adjust pending_attrs correctly in copy_from 1417/head
Samuel Just [Sun, 9 Mar 2014 18:43:57 +0000 (11:43 -0700)]
ReplicatedPG: adjust pending_attrs correctly in copy_from

Otherwise, subsequent reads might not get the correct cached attrs.

Fixes: #7663
Signed-off-by: Samuel Just <sam.just@inktank.com>
11 years agoReplicatedPG: _delete_head should adjust pending_attrs
Samuel Just [Sun, 9 Mar 2014 18:41:48 +0000 (11:41 -0700)]
ReplicatedPG: _delete_head should adjust pending_attrs

We need the old attr_cache in make_writeable for the clone,
so make the changes to pending_attrs.

Fixes: #7663
Signed-off-by: Samuel Just <sam.just@inktank.com>
11 years agoReplicatedPG: use pending_attrs in rollback
Samuel Just [Sun, 9 Mar 2014 18:43:00 +0000 (11:43 -0700)]
ReplicatedPG: use pending_attrs in rollback

Otherwise, we won't have the correct attr_cache in
make_writeable for the clone.

Fixes: #7663
Signed-off-by: Samuel Just <sam.just@inktank.com>
11 years agoPG::choose_acting: filter CRUSH_ITEM_NONE out of have 1418/head
Samuel Just [Mon, 10 Mar 2014 20:36:37 +0000 (13:36 -0700)]
PG::choose_acting: filter CRUSH_ITEM_NONE out of have

Fixes: #7672
Signed-off-by: Samuel Just <sam.just@inktank.com>
11 years agorbdmap: bugfix upstart script
Stephan Renatus [Mon, 10 Mar 2014 14:17:41 +0000 (15:17 +0100)]
rbdmap: bugfix upstart script

It seems like the upstart script is lacking a little behind [the initscript](https://github.com/ceph/ceph/blob/master/src/init-rbdmap#L44-L49); however, this bugfix makes it actually do what it should do.

Before, the bug made the job just ignore all parameters, with the following error in /var/log/upstart/rbdmap.log:

```
rbd map volumes/volume-one
rbd: add failed: (22) Invalid argument
```

Signed-off-by: Stephan Renatus <s.renatus@x-ion.de>
Signed-off-by: Sage Weil <sage@inktank.com>
11 years agoFileStore: support compiling without libxfs 1412/head
Ilya Dryomov [Mon, 10 Mar 2014 08:36:48 +0000 (10:36 +0200)]
FileStore: support compiling without libxfs

When configured with --without-libxfs, use GenericFileStoreBackend
instead of XfsFileStoreBackend for XFS.  At this point this would only
impact the allocation hint op.  The default is to compile with
--with-libxfs.  (Previously it was unconditionally enabled on linux and
disabled for non-linux arches.)

Signed-off-by: Ilya Dryomov <ilya.dryomov@inktank.com>
11 years agoMerge branch 'master' of https://github.com/enovance/ceph-brag into firefly 1409/head
Babu Shanmugam [Mon, 10 Mar 2014 06:12:58 +0000 (06:12 +0000)]
Merge branch 'master' of https://github.com/enovance/ceph-brag into firefly

11 years agoRemoved all regular expression parsing and used '-f json' instead
Babu Shanmugam [Mon, 10 Mar 2014 06:11:03 +0000 (06:11 +0000)]
Removed all regular expression parsing and used '-f json' instead

Signed-off-by: Babu Shanmugam <anbu@enovance.com>
11 years agoModified num_bytes attribute in components_count to be a raw integer
Babu Shanmugam [Mon, 10 Mar 2014 04:35:15 +0000 (04:35 +0000)]
Modified num_bytes attribute in components_count to be a raw integer

Signed-off-by: Babu Shanmugam <anbu@enovance.com>
11 years agoReplicatedPG::finish_ctx: clear object_info if !obs.exists
Samuel Just [Fri, 7 Mar 2014 23:54:23 +0000 (15:54 -0800)]
ReplicatedPG::finish_ctx: clear object_info if !obs.exists

Otherwise, we see a different object_info_t depending on whether the
transaction deleting the object clears before another op recreating it appears.
In particular, we use oi.version to set the prior_version on the log entries in
finish_ctx.  If the oi is allowed to stick around the recreation log event will
have a prior version of the deletion event when it should have a prior version
of eversion_t().

Fixes: #7655
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
11 years agoMerge pull request #1405 from ceph/wip-7575
Sage Weil [Sun, 9 Mar 2014 19:21:35 +0000 (12:21 -0700)]
Merge pull request #1405 from ceph/wip-7575

osd: Add hit_set_flushing to track current flushes and prevent races

Reviewed-by: Sage Weil <sage@inktank.com>
Reviewed-by: Samuel Just <sam.just@inktank.com>
11 years agoconfig.cc: add debug_ prefix to subsys logging levels
Danny Al-Gaaf [Thu, 6 Mar 2014 00:23:48 +0000 (01:23 +0100)]
config.cc: add debug_ prefix to subsys logging levels

Add debug_ prefix also for 'ceph --admin-daemon *.asok config show'
as already done e.g. by 'ceph-osd --show-config'.

Fixes: #7602
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
Reviewed-by: Sage Weil <sage@inktank.com>
11 years agoqa: add script for testing rados allocation hint 1400/head
Ilya Dryomov [Fri, 7 Mar 2014 15:58:55 +0000 (17:58 +0200)]
qa: add script for testing rados allocation hint

Signed-off-by: Ilya Dryomov <ilya.dryomov@inktank.com>
11 years agoMerge pull request #1408 from ceph/wip-da-fix-doc
Sage Weil [Sun, 9 Mar 2014 16:56:18 +0000 (09:56 -0700)]
Merge pull request #1408 from ceph/wip-da-fix-doc

Fixes and updates for doc

Reviewed-by: Sage Weil <sage@inktank.com>
11 years agoget-involved.rst: update information 1408/head
Danny Al-Gaaf [Sun, 9 Mar 2014 01:18:28 +0000 (02:18 +0100)]
get-involved.rst: update information

Added #ceph-devel IRC channel, more mailing lists, wiki and planet.ceph.com.

Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
11 years agoswift/containerops.rst: fix some typos
Danny Al-Gaaf [Sun, 9 Mar 2014 00:02:43 +0000 (01:02 +0100)]
swift/containerops.rst: fix some typos

Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
11 years agoradosgw/troubleshooting.rst: s/ceph-osd/OSD/
Danny Al-Gaaf [Sat, 8 Mar 2014 23:58:57 +0000 (00:58 +0100)]
radosgw/troubleshooting.rst: s/ceph-osd/OSD/

Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
11 years agoradosgw/config-ref.rst: fix typo
Danny Al-Gaaf [Sat, 8 Mar 2014 23:30:49 +0000 (00:30 +0100)]
radosgw/config-ref.rst: fix typo

Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
11 years agosession_authentication.rst: fix some typos
Danny Al-Gaaf [Sat, 8 Mar 2014 23:19:08 +0000 (00:19 +0100)]
session_authentication.rst: fix some typos

Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
11 years agorelease-process.rst: fix some typos
Danny Al-Gaaf [Sat, 8 Mar 2014 23:07:39 +0000 (00:07 +0100)]
release-process.rst: fix some typos

Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
11 years agodoc: s/osd/OSD/ if not part of a command
Danny Al-Gaaf [Sat, 8 Mar 2014 23:01:40 +0000 (00:01 +0100)]
doc: s/osd/OSD/ if not part of a command

First attempt to unify usage of OSD over rst files.

Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
11 years agodoc/dev/logs.rst; fix some typos
Danny Al-Gaaf [Sat, 8 Mar 2014 22:31:11 +0000 (23:31 +0100)]
doc/dev/logs.rst; fix some typos

Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
11 years agofilestore-filesystem-compat.rst: fix typo
Danny Al-Gaaf [Sat, 8 Mar 2014 22:25:53 +0000 (23:25 +0100)]
filestore-filesystem-compat.rst: fix typo

Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
11 years agocorpus.rst: fix typo
Danny Al-Gaaf [Sat, 8 Mar 2014 22:22:38 +0000 (23:22 +0100)]
corpus.rst: fix typo

Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
11 years agoconfig.rst: fix typo
Danny Al-Gaaf [Sat, 8 Mar 2014 22:16:24 +0000 (23:16 +0100)]
config.rst: fix typo

Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
11 years agocephx_protocol.rst: fix typo
Danny Al-Gaaf [Sat, 8 Mar 2014 22:11:25 +0000 (23:11 +0100)]
cephx_protocol.rst: fix typo

Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
11 years agoarchitecture.rst: fix typos
Danny Al-Gaaf [Sat, 8 Mar 2014 10:27:15 +0000 (11:27 +0100)]
architecture.rst: fix typos

Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
11 years agorados: add set-alloc-hint command
Ilya Dryomov [Fri, 7 Mar 2014 15:58:55 +0000 (17:58 +0200)]
rados: add set-alloc-hint command

Set allocation hint for an object.

Signed-off-by: Ilya Dryomov <ilya.dryomov@inktank.com>
11 years agorados/operations/control.rst: fix typo
Danny Al-Gaaf [Sat, 8 Mar 2014 10:06:50 +0000 (11:06 +0100)]
rados/operations/control.rst: fix typo

Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
11 years agoBroke down sysinfo's format into a histogram with a value and count
Babu Shanmugam [Sat, 8 Mar 2014 05:17:13 +0000 (05:17 +0000)]
Broke down sysinfo's format into a histogram with a value and count
so that we just see how many of each version/distro/kernel/os/arch/cpu/etc are running

Signed-off-by: Babu Shanmugam <anbu@enovance.com>
11 years agoPGLog::proc_replica_log: select divergent log entries correctly 1416/head
Samuel Just [Sat, 8 Mar 2014 00:51:36 +0000 (16:51 -0800)]
PGLog::proc_replica_log: select divergent log entries correctly

Looking for an entry in olog which matches one of ours might add
extra divergent entries.  Instead, do what merge_log does and
walk back through the auth log looking for an entry in olog.

Fixes: 7657
Signed-off-by: Samuel Just <sam.just@inktank.com>
11 years agoPGLog::_merge_object_divergent_entries: handle missing.have == prior_version
Samuel Just [Sat, 8 Mar 2014 00:50:40 +0000 (16:50 -0800)]
PGLog::_merge_object_divergent_entries: handle missing.have == prior_version

Fixes: #7658
Signed-off-by: Samuel Just <sam.just@inktank.com>
11 years agoTestPGLog: add a test case verifying case where we have the prior_version
Samuel Just [Sat, 8 Mar 2014 00:49:18 +0000 (16:49 -0800)]
TestPGLog: add a test case verifying case where we have the prior_version

Signed-off-by: Samuel Just <sam.just@inktank.com>
11 years agoTestPGLog: check on last_update in run_test_case
Samuel Just [Sat, 8 Mar 2014 00:48:39 +0000 (16:48 -0800)]
TestPGLog: check on last_update in run_test_case

Signed-off-by: Samuel Just <sam.just@inktank.com>
11 years agoTestPGLog::proc_replica_log: call doesn't adjust olog
Samuel Just [Sat, 8 Mar 2014 00:48:11 +0000 (16:48 -0800)]
TestPGLog::proc_replica_log: call doesn't adjust olog

We don't adjust head with proc_replica_log, don't assert that
it's been adjusted.

Signed-off-by: Samuel Just <sam.just@inktank.com>
11 years agoclient: fix Client::getcwd() 1385/head
Yan, Zheng [Thu, 6 Mar 2014 23:12:39 +0000 (07:12 +0800)]
client: fix Client::getcwd()

An recent commit made MDS not include dentry trace in LOOKUPPARENT
reply. It broke Client::getcwd. The fix is change getcwd() to use
LOOKUPNAME MDS request

Signed-off-by: Yan, Zheng <zheng.z.yan@intel.com>
11 years agoMerge remote-tracking branch 'gh/wip-7210' into firefly
Sage Weil [Fri, 7 Mar 2014 23:23:31 +0000 (15:23 -0800)]
Merge remote-tracking branch 'gh/wip-7210' into firefly

Reviewed-by: Sage Weil <sage@inktank.com>
11 years agomds: introduce LOOKUPNAME MDS request
Yan, Zheng [Thu, 6 Mar 2014 07:24:02 +0000 (15:24 +0800)]
mds: introduce LOOKUPNAME MDS request

The new MDS request is used for connecting a given inode to its
parent inode. It allows client to have efficient implementation of
get_rename() NFS export callback.

Signed-off-by: Yan, Zheng <zheng.z.yan@intel.com>
11 years agoqa/workunits/cephtool/test.sh: fix 'osd thrash' test
Sage Weil [Fri, 7 Mar 2014 23:21:30 +0000 (15:21 -0800)]
qa/workunits/cephtool/test.sh: fix 'osd thrash' test

- fix the wait check for osds to come back up
- make sure they get marked back in, too

Signed-off-by: Sage Weil <sage@inktank.com>
Reviewed-by: Dan Mick <dan.mick@inktank.com>
11 years agoMerge pull request #1403 from ceph/wip-7642
Sage Weil [Fri, 7 Mar 2014 23:05:30 +0000 (15:05 -0800)]
Merge pull request #1403 from ceph/wip-7642

mon: fix check for primary-affinity feature bit, and fix a race in similar checks

Reviewed-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Joao Eduardo Luis <joao.luis@inktank.com>
11 years agomon/OSDMonitor: feature feature check bit arithmetic 1403/head
Sage Weil [Fri, 7 Mar 2014 22:44:42 +0000 (14:44 -0800)]
mon/OSDMonitor: feature feature check bit arithmetic

Make sure all features are present (instead of just any of them).

Signed-off-by: Sage Weil <sage@inktank.com>
11 years agoMerge pull request #1404 from ceph/wip-7652
Sage Weil [Fri, 7 Mar 2014 22:19:58 +0000 (14:19 -0800)]
Merge pull request #1404 from ceph/wip-7652

mon: fix infininte pg create msgs for erasure pools

Reviewed-by: Samuel Just <sam.just@inktank.com>
11 years agomon/PGMap: send pg create messages to primary, not acting[0] 1404/head
Sage Weil [Fri, 7 Mar 2014 22:02:26 +0000 (14:02 -0800)]
mon/PGMap: send pg create messages to primary, not acting[0]

For erasure pools, these may not match.

In the case of #7652, this caused pg_create messages to be send
indefinitely.  register_pg() added it to the list for acting_primary, and
when we got the (non-creating) pg stat update we removed it from the list
for acting[0].

Fixes: #7652
Signed-off-by: Sage Weil <sage@inktank.com>
11 years agomon/PGMonitor: improve debugging on PGMap updates slightly
Sage Weil [Fri, 7 Mar 2014 21:56:31 +0000 (13:56 -0800)]
mon/PGMonitor: improve debugging on PGMap updates slightly

Chasing #7652
Signed-off-by: Sage Weil <sage@inktank.com>
11 years agomon/OSDMonitor: make osdmap feature checks non-racy
Sage Weil [Fri, 7 Mar 2014 21:29:03 +0000 (13:29 -0800)]
mon/OSDMonitor: make osdmap feature checks non-racy

The check for OSD features may race with the boot of an OSD that does not
have the necessary features.  Check the pending info too, and if there is
a missing feature, return -EAGAIN.  In the callers, wait on -EAGAIN.

Signed-off-by: Sage Weil <sage@inktank.com>
11 years agomon/OSDMonitor: prevent set primary-affinity unless all OSDs support it
Sage Weil [Fri, 7 Mar 2014 21:24:04 +0000 (13:24 -0800)]
mon/OSDMonitor: prevent set primary-affinity unless all OSDs support it

Make sure all running OSDs support the feature before we start using it
(even if the config option is on!).

Fixes: #7642
Signed-off-by: Sage Weil <sage@inktank.com>
11 years agotools/rados/rados.cc: use write_full for sync_write for ec pools 1437/head
Samuel Just [Fri, 7 Mar 2014 20:43:36 +0000 (12:43 -0800)]
tools/rados/rados.cc: use write_full for sync_write for ec pools

Fixes: #7650
Signed-off-by: Samuel Just <sam.just@inktank.com>
11 years agoqa: workunits/mon/rbd_snaps_ops.sh: ENOTSUP on snap rm from copied pool 1401/head
Joao Eduardo Luis [Fri, 7 Mar 2014 19:45:50 +0000 (19:45 +0000)]
qa: workunits/mon/rbd_snaps_ops.sh: ENOTSUP on snap rm from copied pool

'rados cppool' copies the contents but that doesn't make the destination
pool an unmanaged snaps pool.  Therefore, we must get an ENOTSUP when
we try to remove an unmanaged snap from a not-unmanaged pool.

Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
11 years agomon: OSDMonitor: don't remove unamanaged snaps from not-unmanaged pools
Joao Eduardo Luis [Fri, 7 Mar 2014 18:25:21 +0000 (18:25 +0000)]
mon: OSDMonitor: don't remove unamanaged snaps from not-unmanaged pools

Although we should allow creating unmanaged snaps on not-unamanaged pools,
as long as those pools don't have any managed snapshots in them, we cannot
allow removal -- because the pool will not have any unmanaged snapshots.

Fixes: 7210
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
11 years agoosd: Add hit_set_flushing to track current flushes and prevent races 1405/head
David Zafman [Fri, 7 Mar 2014 02:08:46 +0000 (18:08 -0800)]
osd: Add hit_set_flushing to track current flushes and prevent races

When flushing a HitSet track in hit_set_flushing map so that
agent_load_hit_sets() doesn't try to read it too soon.

Fixes: #7575
Signed-off-by: David Zafman <david.zafman@inktank.com>
11 years agoRemoved nw_info from sysinfo
Babu Shanmugam [Fri, 7 Mar 2014 12:49:20 +0000 (12:49 +0000)]
Removed nw_info from sysinfo

Signed-off-by: Babu Shanmugam <anbu@enovance.com>
11 years agoMade crush_types to be a map of type to count, so we can tell how many racks/rows...
Babu Shanmugam [Fri, 7 Mar 2014 06:47:06 +0000 (06:47 +0000)]
Made crush_types to be a map of type to count, so we can tell how many racks/rows/hosts/etc are there

Signed-off-by: Babu Shanmugam <anbu@enovance.com>
11 years agoMerge pull request #1394 from ceph/wip-7610
Sage Weil [Fri, 7 Mar 2014 05:11:25 +0000 (21:11 -0800)]
Merge pull request #1394 from ceph/wip-7610

obj_bencher: allocate contentsChars to object_size, not op_size

Reviewed-by: Sage Weil <sage@inktank.com>
11 years agoMerge pull request #1397 from ceph/wip-7638
Sage Weil [Fri, 7 Mar 2014 04:06:59 +0000 (20:06 -0800)]
Merge pull request #1397 from ceph/wip-7638

ReplicatedPG::trim_object: use old_snaps for rollback

Reviewed-by: Sage Weil <sage@inktank.com>
11 years agoMerge pull request #1398 from ceph/wip-7634
Sage Weil [Fri, 7 Mar 2014 04:05:19 +0000 (20:05 -0800)]
Merge pull request #1398 from ceph/wip-7634

ReplicatedPG: use hobject_t for snapset_contexts map

Reviewed-by: Sage Weil <sage@inktank.com>
11 years agoMerge pull request #1395 from ceph/wip-7637
Samuel Just [Fri, 7 Mar 2014 03:19:12 +0000 (19:19 -0800)]
Merge pull request #1395 from ceph/wip-7637

osd: fix agent thread shutdown

Reviewed-by: Samuel Just <sam.just@inktank.com>
11 years agoosd: fix agent thread shutdown 1395/head
Sage Weil [Fri, 7 Mar 2014 00:12:30 +0000 (16:12 -0800)]
osd: fix agent thread shutdown

We had an old invariant that agent_queue would have at least 1 entry in
it to simplify some other code paths, but it turns out that it is simpler
not to do that.

In particular, this was triggering a failed assertion on shutdown when we
assert that the queue is empty.

Dump offending items on shutdown if they are there, tho, to catch any
future bugs.

Fixes: #7637
Signed-off-by: Sage Weil <sage@inktank.com>