git.apps.os.sepia.ceph.com Git - ceph.git/log

Greg Farnum [Wed, 13 Jan 2016 21:17:53 +0000 (13:17 -0800)]

fsx: checkout old version until it compiles properly on miras

I sent a patch to xfstests upstream at
http://article.gmane.org/gmane.comp.file-systems.fstests/1665, but
until that's fixed we need a version that works in our test lab.

Signed-off-by: Greg Farnum <gfarnum@redhat.com>
(cherry picked from commit 7d52372ae74878ebd001036ff0a7aad525eb15b6)


Loic Dachary [Wed, 3 Feb 2016 05:41:56 +0000 (12:41 +0700)]

Merge pull request #7454 from dachary/wip-14584-hammer

hammer: fsstress.sh fails

Reviewed-by: Loic Dachary <ldachary@redhat.com>


Loic Dachary [Wed, 3 Feb 2016 04:38:57 +0000 (11:38 +0700)]

Merge pull request #6918 from asheplyakov/hammer-bug-12449

osd: check for full before changing the cached obc (hammer)

Reviewed-by: Loic Dachary <ldachary@redhat.com>


Yan, Zheng [Tue, 18 Aug 2015 07:22:55 +0000 (15:22 +0800)]

qa/fsstress.sh: fix 'cp not writing through dangling symlink'

On some test machines, /usr/lib/ltp/testcases/bin/fsstress is a
dangling symlink. 'cp -f' does not work in this case.

Fixes: #12710
Signed-off-by: Yan, Zheng <zyan@redhat.com>
(cherry picked from commit 479f2a760baf6af4714d3391a366a6b3acf1bcaf)
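
The failure mode described above can be sketched in Python. This is a hedged illustration only, not the change made in qa/fsstress.sh: the helper name `install_over_dangling_symlink` and the temporary paths are hypothetical. It removes a destination that is a dangling symlink before copying, so the copy creates a regular file instead of trying to write through the link.

```python
import os
import shutil
import tempfile


def install_over_dangling_symlink(src, dst):
    """Copy src to dst, replacing dst first if it is a dangling symlink.

    Hypothetical helper for illustration; not taken from qa/fsstress.sh.
    """
    # os.path.islink() is true for any symlink; os.path.exists() follows
    # the link, so it is false when the target is missing.
    if os.path.islink(dst) and not os.path.exists(dst):
        os.unlink(dst)
    shutil.copyfile(src, dst)


def demo():
    """Build a dangling symlink in a temp dir and install a file over it."""
    with tempfile.TemporaryDirectory() as tmp:
        src = os.path.join(tmp, "fsstress.new")
        dst = os.path.join(tmp, "fsstress")
        with open(src, "w") as f:
            f.write("binary")
        os.symlink(os.path.join(tmp, "missing-target"), dst)
        was_dangling = os.path.islink(dst) and not os.path.exists(dst)
        install_over_dangling_symlink(src, dst)
        with open(dst) as f:
            content = f.read()
        return was_dangling, os.path.islink(dst), content
```

Removing the link first is one possible approach; the commit itself may resolve the problem differently.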


Sage Weil [Sun, 31 Jan 2016 02:42:29 +0000 (21:42 -0500)]

Merge pull request #7236 from athanatos/wip-14376

config_opts: increase suicide timeout to 300 to match recovery

http://pulpito.ceph.com/sage-2016-01-30_09:58:32-rados-wip-sage-testing-hammer---basic-mira/

Reviewed-by: Sage Weil <sage@redhat.com>


Sage Weil [Sun, 31 Jan 2016 02:42:12 +0000 (21:42 -0500)]

Merge pull request #6450 from dachary/wip-13672-hammer

tests: testprofile must be removed before it is re-created

http://pulpito.ceph.com/sage-2016-01-30_09:58:32-rados-wip-sage-testing-hammer---basic-mira/

Reviewed-by: Sage Weil <sage@redhat.com>


Sage Weil [Sun, 31 Jan 2016 02:41:39 +0000 (21:41 -0500)]

Merge pull request #6680 from SUSE/wip-13859-hammer

hammer: ceph.spec.in License line does not reflect COPYING

http://pulpito.ceph.com/sage-2016-01-30_09:58:32-rados-wip-sage-testing-hammer---basic-mira/


Sage Weil [Sun, 31 Jan 2016 02:41:18 +0000 (21:41 -0500)]

Merge pull request #6791 from branch-predictor/bp-5812-backport

Race condition in rados bench

http://pulpito.ceph.com/sage-2016-01-30_09:58:32-rados-wip-sage-testing-hammer---basic-mira/


Sage Weil [Sun, 31 Jan 2016 02:40:38 +0000 (21:40 -0500)]

Merge pull request #6973 from dreamhost/wip-configure-hammer

configure.ac: no use to add "+" before ac_ext=c

http://pulpito.ceph.com/sage-2016-01-30_09:58:32-rados-wip-sage-testing-hammer---basic-mira/

Reviewed-by: Kefu Chai <kchai@redhat.com>


Sage Weil [Sun, 31 Jan 2016 02:40:13 +0000 (21:40 -0500)]

Merge pull request #7206 from dzafman/wip-14292

osd/PG.cc: 3837: FAILED assert(0 == "Running incompatible OSD")

http://pulpito.ceph.com/sage-2016-01-30_09:58:32-rados-wip-sage-testing-hammer---basic-mira/

Reviewed-by: David Zafman <dzafman@redhat.com>


Sage Weil [Sun, 31 Jan 2016 02:39:42 +0000 (21:39 -0500)]

Merge pull request #7207 from rldleblanc/recency_fix_for_hammer

hammer: osd/ReplicatedPG: Recency fix for Hammer

Reviewed-by: Sage Weil <sage@redhat.com>


Sage Weil [Sun, 31 Jan 2016 02:39:11 +0000 (21:39 -0500)]

Merge pull request #7347 from tchaikov/wip-hammer-10093

tools: ceph-monstore-tool must do out_store.close()

http://pulpito.ceph.com/sage-2016-01-30_09:58:32-rados-wip-sage-testing-hammer---basic-mira/

Reviewed-by: Sage Weil <sage@redhat.com>


Sage Weil [Sun, 31 Jan 2016 02:38:35 +0000 (21:38 -0500)]

Merge pull request #7411 from dachary/wip-14467-hammer

hammer: disable filestore_xfs_extsize by default

http://pulpito.ceph.com/sage-2016-01-30_09:58:32-rados-wip-sage-testing-hammer---basic-mira/

Reviewed-by: Sage Weil <sage@redhat.com>


Sage Weil [Sun, 31 Jan 2016 02:38:13 +0000 (21:38 -0500)]

Merge pull request #7412 from dachary/wip-14470-hammer

tools: tool for artificially inflating the leveldb of the mon store for testing purposes

Reviewed-by: Sage Weil <sage@redhat.com>


Sage Weil [Sun, 31 Jan 2016 02:37:46 +0000 (21:37 -0500)]

Merge pull request #7446 from liewegas/wip-14537-hammer

mon: compact full epochs also

http://pulpito.ceph.com/sage-2016-01-30_09:58:32-rados-wip-sage-testing-hammer---basic-mira/

Reviewed-by: Sage Weil <sage@redhat.com>


Josh Durgin [Sat, 30 Jan 2016 19:45:31 +0000 (11:45 -0800)]

Merge pull request #7182 from dachary/wip-14143-hammer

hammer: Verify self-managed snapshot functionality on image create

Reviewed-by: Josh Durgin <jdurgin@redhat.com>


Josh Durgin [Sat, 30 Jan 2016 19:45:20 +0000 (11:45 -0800)]

Merge pull request #7183 from dachary/wip-14283-hammer

hammer: rbd: fix bench-write

Reviewed-by: Josh Durgin <jdurgin@redhat.com>


Josh Durgin [Sat, 30 Jan 2016 19:45:05 +0000 (11:45 -0800)]

Merge pull request #7416 from dachary/wip-14466-hammer

hammer: rbd-replay does not check for EOF and goes to endless loop

Reviewed-by: Josh Durgin <jdurgin@redhat.com>


Josh Durgin [Sat, 30 Jan 2016 19:44:50 +0000 (11:44 -0800)]

Merge pull request #7417 from dachary/wip-14553-hammer

hammer: rbd: TaskFinisher::cancel should remove event from SafeTimer

Reviewed-by: Josh Durgin <jdurgin@redhat.com>


Josh Durgin [Sat, 30 Jan 2016 19:44:32 +0000 (11:44 -0800)]

Merge pull request #7407 from dillaman/wip-14543-hammer

librbd: ImageWatcher shouldn't block the notification thread

Reviewed-by: Josh Durgin <jdurgin@redhat.com>


Josh Durgin [Sat, 30 Jan 2016 19:44:12 +0000 (11:44 -0800)]

Merge pull request #6980 from dillaman/wip-14063-hammer

librbd: fix merge-diff for >2GB diff-files

Reviewed-by: Josh Durgin <jdurgin@redhat.com>


Kefu Chai [Thu, 28 Jan 2016 10:09:53 +0000 (02:09 -0800)]

mon: compact full epochs also

Compacting the range ${prefix}.${start}..${prefix}.${end} does not
necessarily compact the range ${prefix}."full_"${start}..
${prefix}."full_"${end}. So when more and more epochs get trimmed
without a full range compaction, the monitor store can grow
very large.

Fixes: #14537
Signed-off-by: Kefu Chai <kchai@redhat.com>
(cherry picked from commit 93d633a25ad8c36c972bb766c38187e2612041e1)
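
Why a range over the plain epoch keys misses the "full_" keys can be shown with a small Python sketch. The key layout here ("<prefix>.<epoch>" and "<prefix>.full_<epoch>") and the helper `keys_in_range` are assumptions made for illustration; the real monitor store may join prefix and key differently.

```python
def keys_in_range(keys, first, last):
    """Return the keys k with first <= k <= last in lexicographic order."""
    return [k for k in sorted(keys) if first <= k <= last]


# Hypothetical key layout: "<prefix>.<epoch>" and "<prefix>.full_<epoch>".
store = ["osdmap.1", "osdmap.2", "osdmap.3",
         "osdmap.full_1", "osdmap.full_2", "osdmap.full_3"]

# Range over the incremental epochs only.
incremental = keys_in_range(store, "osdmap.1", "osdmap.3")

# Separate range needed to reach the full epochs.
full = keys_in_range(store, "osdmap.full_1", "osdmap.full_3")
```

Because "f" sorts after every digit, each "full_" key falls beyond the end of the first range, so a second range is needed to cover them.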


Alexey Sheplyakov [Wed, 16 Dec 2015 12:31:52 +0000 (15:31 +0300)]

Check for full before changing the cached obc

ReplicatedPG::prepare_transaction(): check if the pool is full before
updating the cached ObjectContext to avoid the discrepancy between
the cached and the actual object size (and other metadata).
While at it, improve the check itself: consider the cluster full flag,
not just the pool full flag, and consider object count changes,
not just bytes.

Based on commit a1eb380c3d5254f9f1fe34b4629e51d77fe010c1

Fixes: #13335
Signed-off-by: Alexey Sheplyakov <asheplyakov@mirantis.com>


Loic Dachary [Fri, 29 Jan 2016 16:31:47 +0000 (23:31 +0700)]

Merge pull request #6353 from theanalyst/wip-13513-hammer

rgw: value of Swift API's X-Object-Manifest header is not url_decoded during segment look up

Reviewed-by: Yehuda Sadeh <ysadehwe@redhat.com>


Loic Dachary [Fri, 29 Jan 2016 16:31:16 +0000 (23:31 +0700)]

Merge pull request #6620 from SUSE/wip-13820-hammer

hammer: rgw: Setting ACL on Object removes ETag

Reviewed-by: Yehuda Sadeh <ysadehwe@redhat.com>


Loic Dachary [Fri, 29 Jan 2016 16:30:57 +0000 (23:30 +0700)]

Merge pull request #7186 from dachary/wip-13888-hammer

hammer: rgw: orphans finish segfaults

Reviewed-by: Yehuda Sadeh <ysadehwe@redhat.com>


Sage Weil [Wed, 25 Nov 2015 19:40:26 +0000 (14:40 -0500)]

osd: recency should look at newest (not oldest) hitsets

Reported-by: xinxin shu <xinxin.shu@intel.com>
Signed-off-by: Sage Weil <sage@redhat.com>
(cherry picked from commit 528eae92d010ea34ae8cf0e8b2290aaa5e058d24)

Conflicts:
src/osd/ReplicatedPG.cc
code section was moved to ReplicatedPG::maybe_promote
in master.
Signed-off-by: Robert LeBlanc <robert.leblanc@endurance.com>


Sage Weil [Wed, 25 Nov 2015 19:39:08 +0000 (14:39 -0500)]

osd/ReplicatedPG: fix promotion recency logic

Recency is defined as the number of the last N hitsets in which an
object must appear in order to be promoted.  The previous logic did
nothing of the sort... it checked for the object in any one of
the last N hitsets, which led to way too many promotions and killed
any chance of the cache performing properly.

While we are here, we can simplify the code to drop the max_in_*
fields (no longer necessary).

Note that we may still want a notion of 'temperature' that does
tolerate the object missing in one of the recent hitsets.. but
that would be different than recency, and should probably be
modeled after the eviction temperature model.

Backport: infernalis, hammer
Reported-by: Nick Fisk <nick@fisk.me.uk>
Signed-off-by: Sage Weil <sage@redhat.com>
(cherry picked from commit 180c8743addc5ae2f1db9c58cd2996ca6e7ac18b)

Conflicts:
src/osd/ReplicatedPG.cc
code section was moved to ReplicatedPG::maybe_promote
in master.
Signed-off-by: Robert LeBlanc <robert.leblanc@endurance.com>
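
The difference between the previous and the corrected recency check can be sketched as follows. This is an illustrative Python model, not the ReplicatedPG code: the function names are hypothetical, hitsets are modelled as plain sets ordered newest first, and requiring at least N hitsets to exist is an assumption of the sketch.

```python
def promote_any(obj, hitsets, recency):
    """Previous behaviour as described: a hit in any one of the last N hitsets."""
    return any(obj in hs for hs in hitsets[:recency])


def promote_all(obj, hitsets, recency):
    """Described fix: the object must appear in each of the newest N hitsets."""
    newest = hitsets[:recency]
    # Assumption: with fewer than N hitsets available, do not promote.
    return len(newest) == recency and all(obj in hs for hs in newest)


# Hitsets ordered newest first (an assumption of this sketch).
hitsets = [{"a", "b"}, {"a"}, {"a", "c"}]

any_c = promote_any("c", hitsets, 3)   # "c" appears once, in the oldest set
all_c = promote_all("c", hitsets, 3)   # "c" is missing from the newer sets
all_a = promote_all("a", hitsets, 3)   # "a" appears in every set
```

With the "any" check an object seen once in the oldest hitset is promoted; with the "all" check only an object present in every one of the newest N hitsets qualifies.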


Sage Weil [Fri, 29 Jan 2016 13:52:51 +0000 (08:52 -0500)]

Merge pull request #5789 from SUSE/wip-12928-hammer

rpm: libcephfs_jni1 has no %post and %postun

Reviewed-by: Sage Weil <sage@redhat.com>


Sage Weil [Fri, 29 Jan 2016 13:50:56 +0000 (08:50 -0500)]

Merge pull request #7434 from tchaikov/wip-14441-hammer

man: document listwatchers cmd in "rados" manpage

Reviewed-by: Sage Weil <sage@redhat.com>


Kefu Chai [Wed, 23 Dec 2015 03:23:38 +0000 (11:23 +0800)]

man: document listwatchers cmd in "rados" manpage

Signed-off-by: Kefu Chai <kchai@redhat.com>
(cherry picked from commit c2e391301efc43f0b431e89737246b2c43bf10a9)

man/rados.8: also added the rendered man page. We no longer
keep the generated man pages in master, but
they are still in hammer's source repo.


Douglas Fuller [Fri, 22 Jan 2016 19:18:40 +0000 (11:18 -0800)]

rbd: remove canceled tasks from timer thread

When canceling scheduled tasks using the timer thread, TaskFinisher::cancel
does not call SafeTimer::cancel_event, so events fire anyway. Add this call.

Fixes: #14476
Signed-off-by: Douglas Fuller <dfuller@redhat.com>
(cherry picked from commit 2aa0f318c862dbe3027d74d345671506605778eb)
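
The bug described above, where a cancelled task still fires because its timer event was never removed, can be modelled with a deterministic Python sketch. `FakeTimer` and `TaskQueue` are hypothetical stand-ins and do not mirror the actual SafeTimer or TaskFinisher interfaces.

```python
class FakeTimer:
    """Minimal stand-in for a timer thread; hypothetical, not SafeTimer's API."""

    def __init__(self):
        self.events = {}

    def add_event(self, task_id, callback):
        self.events[task_id] = callback

    def cancel_event(self, task_id):
        """Drop a scheduled event; return True if one was removed."""
        return self.events.pop(task_id, None) is not None

    def fire_all(self):
        """Run every event still scheduled, as the timer thread would."""
        pending, self.events = self.events, {}
        for callback in pending.values():
            callback()


class TaskQueue:
    """Tracks tasks and schedules them on the timer."""

    def __init__(self, timer):
        self.timer = timer
        self.tasks = set()

    def schedule(self, task_id, callback):
        self.tasks.add(task_id)
        self.timer.add_event(task_id, callback)

    def cancel(self, task_id):
        self.tasks.discard(task_id)
        # Without this call the event stays on the timer and fires anyway.
        self.timer.cancel_event(task_id)


fired = []
timer = FakeTimer()
queue = TaskQueue(timer)
queue.schedule("t1", lambda: fired.append("t1"))
queue.schedule("t2", lambda: fired.append("t2"))
queue.cancel("t1")
timer.fire_all()
```

Only "t2" runs here; dropping the `cancel_event` call from `TaskQueue.cancel` would let "t1" run as well, which is the behaviour the commit message describes.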


Mykola Golub [Thu, 21 Jan 2016 11:45:42 +0000 (13:45 +0200)]

rbd-replay: handle EOF gracefully

Fixes: #14452
Signed-off-by: Mykola Golub <mgolub@mirantis.com>
(cherry picked from commit c59b84c3e2c9bbda68219e4d2288a889dd9ca6cb)

Conflicts:
src/rbd_replay/BufferReader.cc: trivial resolution
caused by s/CEPH_PAGE_SIZE/CEPH_BUFFER_APPEND_SIZE/
in the context of one hunk.


Cilang Zhao [Tue, 5 Jan 2016 06:34:05 +0000 (14:34 +0800)]

tools: monstore: add 'show-versions' command.

Using this tool, the first/last committed version of maps will be shown.

Signed-off-by: Cilang Zhao <zhao.cilang@h3c.com>
(cherry picked from commit 21e6ba0c18428caff45733e6b43d197be38af8bb)


Kefu Chai [Wed, 16 Sep 2015 10:28:52 +0000 (18:28 +0800)]

tools: ceph_monstore_tool: add inflate-pgmap command

This command repeatedly adds the latest pgmap to the monstore in order
to inflate it. It helps with testing monstore-related
performance issues of the monitor.

Fixes: #14217
Signed-off-by: Kefu Chai <kchai@redhat.com>
(cherry picked from commit b0f6af814a58dd374ca95e84a4e381f8ef1368de)


Bo Cai [Tue, 20 Oct 2015 07:23:49 +0000 (15:23 +0800)]

tools: support printing the crushmap in a readable fashion.

Signed-off-by: Bo Cai <cai.bo@h3c.com>
(cherry picked from commit b7faf678ed0241abc7eb86b9baaa6db669a22d53)


Bo Cai [Mon, 14 Sep 2015 11:50:47 +0000 (19:50 +0800)]

tools: print the map information in human-readable format.

Signed-off-by: Bo Cai <cai.bo@h3c.com>
(cherry picked from commit 5ed8cdc19150382c946a373ec940d76f98e6ecb7)


Bo Cai [Mon, 14 Sep 2015 11:19:05 +0000 (19:19 +0800)]

tools: remove the local file when getting the map fails.

Signed-off-by: Bo Cai <cai.bo@h3c.com>
(cherry picked from commit 0b03b32d8ba76fe9f6f1158e68eb440e3670393a)


Joao Eduardo Luis [Mon, 13 Jul 2015 11:35:13 +0000 (12:35 +0100)]

tools: ceph_monstore_tool: describe behavior of rewrite command

Signed-off-by: Joao Eduardo Luis <joao@suse.de>
(cherry picked from commit a881f9385feb0f5a61fa22357984d6f291c08177)


Kefu Chai [Fri, 19 Jun 2015 14:57:57 +0000 (22:57 +0800)]

tools/ceph-monstore-tools: add rewrite command

"rewrite" command will
- add a new osdmap version to update the current osdmap held by OSDMonitor
- add a new paxos version; as a proposal it will
   * rewrite all osdmap epochs from the specified epoch to the last_committed
     one with the specified crush map.
   * add the new osdmap which was added just now
so the leader monitor can trigger a recovery process to apply the transaction
to all monitors in quorum, and hence bring them back to normal after being
injected with a faulty crushmap.

Fixes: #11815
Signed-off-by: Kefu Chai <kchai@redhat.com>
(cherry picked from commit 306373427836ca0c2418dbe6caab26d74d94d12e)


huangjun [Tue, 21 Apr 2015 06:04:40 +0000 (14:04 +0800)]

tools: ceph-monstore-tool must do out_store.close()

This change fixes the "store-copy" command.
See the bug reported in http://tracker.ceph.com/issues/10093.

Signed-off-by: huangjun <hjwsm1989@gmail.com>
(cherry picked from commit d85e0f8c50fce62be012506f7ffcb1cdeb0c819c)


Ken Dreyer [Mon, 18 Jan 2016 15:24:46 +0000 (08:24 -0700)]

osd: disable filestore_xfs_extsize by default

This option involves a tradeoff: When disabled, fragmentation is worse,
but large sequential writes are faster. When enabled, large sequential
writes are slower, but fragmentation is reduced.

Fixes: #14397
(cherry picked from commit aed85775bf53c273786ce4999320134822722af5)


Loic Dachary [Fri, 29 Jan 2016 03:36:05 +0000 (10:36 +0700)]

Merge pull request #7316 from ceph/wip-deb-lttng-hammer

deb: strip tracepoint libraries from Wheezy/Precise builds

All other "modern" Debian-based OSes have a functional LTTng-UST. Since only hammer needs to build on these older distros, this fix only affects the deb building process for those two releases (since autoconf detects that LTTng is broken).

Reviewed-by: Loic Dachary <ldachary@redhat.com>


Loic Dachary [Fri, 29 Jan 2016 03:21:43 +0000 (10:21 +0700)]

Merge pull request #7187 from dachary/wip-13831-hammer

hammer: init script reload doesn't work on EL7

Reviewed-by: Loic Dachary <ldachary@redhat.com>


Jason Dillaman [Thu, 28 Jan 2016 19:38:20 +0000 (14:38 -0500)]

librbd: ImageWatcher shouldn't block the notification thread

Blocking the notification thread will also result in librados async
callbacks becoming blocked (since they use the same thread).

Signed-off-by: Jason Dillaman <dillaman@redhat.com>
(cherry picked from commit 7e2019a72733dff43e55c9b22df12939d584f87d)

Conflicts:
src/librbd/ImageWatcher.[cc|h]: fewer RPC messages


Jason Dillaman [Thu, 28 Jan 2016 19:35:54 +0000 (14:35 -0500)]

librados_test_stub: watch/notify now behaves similarly to librados

Notifications are executed via the same librados AIO callback
thread, so it's now possible to catch deadlock.

Signed-off-by: Jason Dillaman <dillaman@redhat.com>
(cherry picked from commit 43722571838a2d78ca8583b5a1ea381cd988de0b)


Jason Dillaman [Thu, 28 Jan 2016 17:40:18 +0000 (12:40 -0500)]

tests: simulate writeback flush during snap create

Signed-off-by: Jason Dillaman <dillaman@redhat.com>
(cherry picked from commit e9570fcf4be9cd5c06937769d074fa52ffb32e4b)


Sage Weil [Tue, 26 Jan 2016 15:12:29 +0000 (10:12 -0500)]

Merge pull request #7365 from liewegas/wip-tcmalloc-hammer

osd: pass tcmalloc env through to ceph-osd

Reviewed-by: Ken Dreyer <kdreyer@redhat.com>


Sage Weil [Tue, 26 Jan 2016 13:43:15 +0000 (08:43 -0500)]

upstart/ceph-osd.conf: pass TCMALLOC_MAX_TOTAL_THREAD_CACHE_BYTES through

Signed-off-by: Sage Weil <sage@redhat.com>
(cherry picked from commit a95a3d34cccb9f9f0782a96ca05e5fe9c2b01772)


Sage Weil [Wed, 20 Jan 2016 23:36:08 +0000 (18:36 -0500)]

init-ceph: pass TCMALLOC_MAX_TOTAL_THREAD_CACHE_BYTES through

..when set in the sysconfig/default file.

Signed-off-by: Sage Weil <sage@redhat.com>
(cherry picked from commit 03e01ae263ef207046933890821ae75d5e60d9b8)

[source default/sysconfig file too]


huangjun [Tue, 21 Apr 2015 06:04:40 +0000 (14:04 +0800)]


Jason Dillaman [Thu, 21 Jan 2016 20:36:37 +0000 (15:36 -0500)]

deb: strip tracepoint libraries from Wheezy/Precise builds

These releases do not (by default) have a sane LTTng-UST environment,
which results in autoconf disabling support for tracing.

Signed-off-by: Jason Dillaman <dillaman@redhat.com>


Jason Dillaman [Tue, 19 Jan 2016 01:46:07 +0000 (20:46 -0500)]

Merge pull request #7252 from ceph/wip-13483-hammer

qa: remove legacy OS support from rbd/qemu-iotests

Reviewed-by: Jason Dillaman <dillaman@redhat.com>


Jason Dillaman [Mon, 18 Jan 2016 20:57:43 +0000 (15:57 -0500)]

qa: disable rbd/qemu-iotests test case 055 on RHEL/CentOS

Fixes: #14385
Signed-off-by: Jason Dillaman <dillaman@redhat.com>


Jason Dillaman [Thu, 15 Oct 2015 16:44:06 +0000 (12:44 -0400)]

qa: remove legacy OS support from rbd/qemu-iotests

RHEL7 derivatives were failing test 002 since they were using
legacy test cases for now-unsupported OSes.

Fixes: #13483
Signed-off-by: Vasu Kulkarni <vasu@redhat.com>


Jason Dillaman [Mon, 14 Dec 2015 22:41:49 +0000 (17:41 -0500)]

librbd: optionally validate RBD pool configuration (snapshot support)

Fixes: #13633
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
(cherry picked from commit 1fea4dadc60e13518e9ee55d136fbc4e9d3a621e)

Conflicts:
src/common/config_opts.h: trivial resolution


Samuel Just [Thu, 14 Jan 2016 16:35:23 +0000 (08:35 -0800)]

config_opts: increase suicide timeout to 300 to match recovery

Fixes: #14376
Backport: hammer, firefly
Signed-off-by: Samuel Just <sjust@redhat.com>


Sage Weil [Thu, 14 Jan 2016 14:21:50 +0000 (09:21 -0500)]

Merge pull request #7179 from dachary/wip-14287-hammer

hammer: ReplicatedPG: wrong result code checking logic during sparse_read

Reviewed-by: Sage Weil <sage@redhat.com>


Sage Weil [Thu, 14 Jan 2016 14:20:53 +0000 (09:20 -0500)]

Merge pull request #7178 from dachary/wip-14285-hammer

hammer: osd/OSD.cc: 2469: FAILED assert(pg_stat_queue.empty()) on shutdown

Reviewed-by: Sage Weil <sage@redhat.com>


Sage Weil [Thu, 14 Jan 2016 14:20:34 +0000 (09:20 -0500)]

Merge pull request #7177 from dachary/wip-14043-hammer

hammer: osd/PG.cc: 288: FAILED assert(info.last_epoch_started >= info.history.last_epoch_started)

Reviewed-by: Sage Weil <sage@redhat.com>


Sage Weil [Thu, 14 Jan 2016 14:20:10 +0000 (09:20 -0500)]

Merge pull request #7180 from dachary/wip-14288-hammer

hammer: ceph osd pool stats broken in hammer

Reviewed-by: Sage Weil <sage@redhat.com>


Sage Weil [Thu, 14 Jan 2016 14:19:18 +0000 (09:19 -0500)]

Merge pull request #6994 from badone/wip-13993-hammer

log: Log.cc: Assign LOG_DEBUG priority to syslog calls

Reviewed-by: Sage Weil <sage@redhat.com>


Sage Weil [Thu, 14 Jan 2016 14:18:55 +0000 (09:18 -0500)]

Merge pull request #6839 from SUSE/wip-13789-hammer

Objecter: potential null pointer access when do pool_snap_list.

Reviewed-by: Sage Weil <sage@redhat.com>


Sage Weil [Thu, 14 Jan 2016 14:18:23 +0000 (09:18 -0500)]

Merge pull request #6835 from SUSE/wip-13892-hammer

hammer: auth/cephx: large amounts of log are produced by osd

Reviewed-by: Sage Weil <sage@redhat.com>


Sage Weil [Thu, 14 Jan 2016 14:17:53 +0000 (09:17 -0500)]

Merge pull request #6834 from SUSE/wip-13930-hammer

hammer: Ceph Pools' MAX AVAIL is 0 if some OSDs' weight is 0

Reviewed-by: Sage Weil <sage@redhat.com>


Sage Weil [Thu, 14 Jan 2016 14:17:25 +0000 (09:17 -0500)]

Merge pull request #6832 from SUSE/wip-13936-hammer

hammer: Ceph daemon failed to start, because the service name was already used.

Reviewed-by: Sage Weil <sage@redhat.com>


Sage Weil [Thu, 14 Jan 2016 14:16:54 +0000 (09:16 -0500)]

Merge pull request #6755 from SUSE/wip-13870-hammer

hammer: OSD: race condition detected during send_failures

Reviewed-by: Sage Weil <sage@redhat.com>


Loic Dachary [Thu, 14 Jan 2016 07:22:35 +0000 (08:22 +0100)]

Merge pull request #6415 from dillaman/wip-13541-hammer

LTTng-UST tracing should be dynamically enabled

Reviewed-by: Loic Dachary <ldachary@redhat.com>


Loic Dachary [Thu, 14 Jan 2016 07:15:02 +0000 (08:15 +0100)]

Merge pull request #7176 from dachary/wip-13440-hammer

ceph-disk prepare fails if device is a symlink

Reviewed-by: Loic Dachary <ldachary@redhat.com>


Loic Dachary [Thu, 14 Jan 2016 07:14:31 +0000 (08:14 +0100)]

Merge pull request #7150 from jecluis/wip-14236

mon: OSDMonitor: do not assume a session exists in send_incremental()

Reviewed-by: Kefu Chai <kchai@redhat.com>


Josh Durgin [Thu, 14 Jan 2016 01:15:32 +0000 (17:15 -0800)]

Merge pull request #7226 from dillaman/wip-13810-hammer

tests: notification slave needs to wait for master

Reviewed-by: Josh Durgin <jdurgin@redhat.com>


Jason Dillaman [Wed, 13 Jan 2016 17:44:01 +0000 (12:44 -0500)]

tests: notification slave needs to wait for master

If the slave instance starts before the master, race
conditions are possible.

Fixes: #13810
Backport: infernalis, hammer
Signed-off-by: Jason Dillaman <dillaman@redhat.com>
(cherry picked from commit 3992d6fe67bbf82322cedc1582406caaf6d4de60)


David Zafman [Tue, 16 Jun 2015 00:55:41 +0000 (17:55 -0700)]

ceph_osd: Add required feature bits related to this branch to osd_required mask

Signed-off-by: David Zafman <dzafman@redhat.com>
(cherry picked from commit 0b2bab460c013ea5cece10ea028d26da3485eaf5)


David Zafman [Fri, 5 Jun 2015 01:47:42 +0000 (18:47 -0700)]

osd: CEPH_FEATURE_CHUNKY_SCRUB feature now required

Feature present since at least the Dumpling release.
A later commit will add it to the osd_required mask

Fixes: #11661
Signed-off-by: David Zafman <dzafman@redhat.com>
(cherry picked from commit 7a10a7e196efd0f59fa7adb87d0a120655b227d8)


Loic Dachary [Tue, 12 Jan 2016 13:28:26 +0000 (14:28 +0100)]

Merge pull request #6530 from SUSE/wip-13760-hammer

unknown argument --quiet in udevadm settle

Reviewed-by: Loic Dachary <ldachary@redhat.com>


Kefu Chai [Tue, 5 May 2015 07:07:33 +0000 (15:07 +0800)]

configure.ac: no use to add "+" before ac_ext=c

Signed-off-by: Kefu Chai <kchai@redhat.com>
(cherry picked from commit 899dd23766c5ae6fef39bf24ef0692127d17deb3)
Signed-off-by: Robin H. Johnson <robin.johnson@dreamhost.com>


Herve Rousseau [Fri, 6 Nov 2015 08:52:28 +0000 (09:52 +0100)]

rgw: fix reload on non Debian systems.

When using reload in non-debian systems, /bin/sh's kill is used to send the HUP signal to the radosgw process.
This kill version doesn't understand -SIGHUP as a valid signal, using -HUP does work.

Fix: #13709
Backport: hammer
Signed-off-by: Hervé Rousseau <hroussea@cern.ch>
(cherry picked from commit 1b000abac3a02d1e788bf25eead4b6873133f5d2)


Igor Fedotov [Thu, 19 Nov 2015 10:38:40 +0000 (13:38 +0300)]

Fixing NULL pointer dereference

Signed-off-by: Igor Fedotov <ifedotov@mirantis.com>
(cherry picked from commit 93d3dfe0441be50a6990d458ee0ee3289af39b20)


Sage Weil [Tue, 18 Aug 2015 20:05:29 +0000 (16:05 -0400)]

rbd: fix bench-write

Make each IO get a unique offset!

Signed-off-by: Sage Weil <sage@redhat.com>
(cherry picked from commit 333f3a01a9916c781f266078391c580efb81a0fc)


BJ Lougee [Mon, 13 Apr 2015 16:24:38 +0000 (11:24 -0500)]

Check that delta_sum.stats.sum.num_object_copies and delta_sum.stats.sum.num_object are greater than zero

This fixes division by zero.

Signed-off-by: BJ Lougee <almightybeeij@gmail.com>

(cherry picked from commit 27ed729c1088133400aa072eeca9e125942f2d94)


xiexingguo [Tue, 22 Dec 2015 09:05:06 +0000 (17:05 +0800)]

ReplicatedPG: fix sparse-read result code checking logic

Check the result code before we go on to verify the trailing hole; otherwise
the real result of the non-hole read may be overwritten and thus confuse the caller.

Fixes: #14151
Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>
(cherry picked from commit b3aa29e324cf0a96a1f2f5dcf1ba998219457bcd)

Conflicts:
src/osd/ReplicatedPG.cc: trivial resolution


Sage Weil [Fri, 1 Jan 2016 14:32:04 +0000 (09:32 -0500)]

osd: clear pg_stat_queue after stopping pgs

Fixes: #14212
Signed-off-by: Sage Weil <sage@redhat.com>
(cherry picked from commit c25ff9927febe77cda31b534971b62f75c196ca2)


David Zafman [Thu, 3 Dec 2015 22:52:24 +0000 (14:52 -0800)]

osd: Test osd_find_best_info_ignore_history_les config in another assert

Signed-off-by: David Zafman <dzafman@redhat.com>
(cherry picked from commit 02a9a41f151a3d968bf8066749658659dc6e3ac4)


Joe Julian [Fri, 9 Oct 2015 19:57:06 +0000 (12:57 -0700)]

Compare parted output with the dereferenced path

Compare parted output with the dereferenced path of the device as parted
prints that instead of the symlink we called it with.

http://tracker.ceph.com/issues/13438 Fixes: #13438

Signed-off-by: Joe Julian <jjulian@io.com>
(cherry picked from commit b3c7cb098195111b9c642e5a9b726b63717f2e0d)
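
Comparing against the dereferenced path, as described above, can be sketched in Python with `os.path.realpath`. This is an illustration rather than the ceph-disk code: `same_device` and the temporary file names are hypothetical, and a regular file stands in for a block device.

```python
import os
import tempfile


def same_device(path_a, path_b):
    """Compare two device paths after resolving symlinks in both."""
    return os.path.realpath(path_a) == os.path.realpath(path_b)


def demo():
    """Create a file and a symlink to it, then compare the two paths."""
    with tempfile.TemporaryDirectory() as tmp:
        device = os.path.join(tmp, "sdb")
        link = os.path.join(tmp, "by-id-link")
        open(device, "w").close()
        os.symlink(device, link)
        # The raw strings differ; the resolved paths match.
        return link == device, same_device(link, device)
```

A tool that prints the resolved path will never match the symlink string it was called with, so the comparison has to resolve the symlink first.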


Loic Dachary [Mon, 11 Jan 2016 08:01:12 +0000 (09:01 +0100)]

Merge pull request #6512 from SUSE/wip-13734-hammer

rgw: swift API returns more than real object count and bytes used when retrieving account metadata

Reviewed-by: Yehuda Sadeh <ysadehwe@redhat.com>


Joao Eduardo Luis [Thu, 7 Jan 2016 19:20:47 +0000 (19:20 +0000)]

mon: OSDMonitor: do not assume a session exists in send_incremental()

We may not have an open session for a given osd. If we blindly assume we
do, we may end up trying to send incrementals we do not have to the osd.

And then we will crash.

This fixes a regression introduced by

171fee1b82d2675e364da7f96dfb9dd286d9b6e6

which is meant as a backport of

de43a02e06650a552f048dc8acd17f255126fed9

but so happens to introduce a line that wasn't on the original patch. We
imagine it was meant to make the 's->osd_epoch' assignment work without
checking the session, as per the original patch, but the backporter must
have forgotten to also backport the assertion on the not-null session.
The unfortunate introduction of the check for a not-null session
triggered this regression.

The regression itself is due to enforcing that a session exists for the
osd we are sending the incrementals to. However, if we come via the
OSDMonitor::process_failures() path, that may very well not be the case,
as we are handling potentially-old MOSDFailure messages that may no
longer have an associated session. By enforcing the not-null session, we
don't check whether we have the requested versions (i.e., if
our_earliest_version <= requested_version), and thus we end up on the
path that assumes that we DO HAVE all the necessary versions -- when we
may not, thus finally asserting because we are reading blank
incremental versions.

Fixes: #14236
Signed-off-by: Joao Eduardo Luis <joao@suse.de>


Brad Hubbard [Mon, 7 Dec 2015 01:31:28 +0000 (11:31 +1000)]

log: Log.cc: Assign LOG_DEBUG priority to syslog calls

Fixes: #13993
Signed-off-by: Brad Hubbard <bhubbard@redhat.com>
(cherry picked from commit 8e93f3f45db681f82633ca695a7dc4e7bd030584)


Jason Dillaman [Fri, 18 Dec 2015 20:22:13 +0000 (15:22 -0500)]

librbd: fix merge-diff for >2GB diff-files

Fixes: #14063
Signed-off-by: Jason Dillaman <dillaman@redhat.com>


Sage Weil [Thu, 17 Dec 2015 15:16:29 +0000 (10:16 -0500)]

Merge pull request #6544 from liewegas/wip-smaller-object-info

osd: make encoded object_info_t smaller to fit inside the XFS inode

Reviewed-by: Samuel Just <sjust@redhat.com>


Sage Weil [Mon, 14 Dec 2015 18:13:33 +0000 (13:13 -0500)]

osd/osd_types: skip encoding newer object_info_t fields if they are unused

This reduces the size of the encoded object_info_t in most cases,
enough to get us under the 255 byte limit for a single inline
xattr in XFS.

Signed-off-by: Sage Weil <sage@redhat.com>


Sage Weil [Mon, 14 Dec 2015 18:00:27 +0000 (13:00 -0500)]

osd/ReplicatedPG: do not set local_mtime on non-tiered pool

If a pool isn't tiered, don't bother with setting local_mtime. The only
users are the tiering agent (which isn't needed if there is not tiering)
and scrub for deciding if an object should get its digest recorded (we can
use mtime instead).

Signed-off-by: Sage Weil <sage@redhat.com>


Sage Weil [Mon, 14 Dec 2015 17:59:29 +0000 (12:59 -0500)]

osd/PGBackend: use mtime for digest decision if local_mtime is empty

If we don't have a local_mtime value, use mtime instead, for the purposes
of deciding if we should record a digest after scrub.

Signed-off-by: Sage Weil <sage@redhat.com>


Sage Weil [Mon, 14 Dec 2015 16:35:46 +0000 (11:35 -0500)]

osd/ReplicatedPG: do not set omap digest if there is no omap

We want to avoid encoding it if we can. And if the FLAG_OMAP is not set
we don't need to *also* store an empty crc.

Signed-off-by: Sage Weil <sage@redhat.com>


xiexingguo [Mon, 2 Nov 2015 13:46:11 +0000 (21:46 +0800)]

Objecter: remove redundant result-check of _calc_target in _map_session.

Result-code check is currently redundant since _calc_target never returns a negative value.
Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>
(cherry picked from commit 5a6117e667024f51e65847f73f7589467b6cb762)


xiexingguo [Thu, 29 Oct 2015 09:32:50 +0000 (17:32 +0800)]

Objecter: potential null pointer access when do pool_snap_list.

Potential null pointer access when doing pool_snap_list; we should check that the pool exists first.
Fixes: #13639
Signed-off-by: xie xingguo <xie.xingguo@zte.com.cn>
(cherry picked from commit 865541605b6c32f03e188ec33d079b44be42fa4a)


qiankunzheng [Thu, 5 Nov 2015 12:29:49 +0000 (07:29 -0500)]

auth/cephx: large amounts of log are produced by osd

If the auth of an osd is deleted while the osd is running, the osd will produce large amounts of log.

Fixes: #13610
Signed-off-by: Qiankun Zheng <zheng.qiankun@h3c.com>
(cherry picked from commit 102f0b19326836e3b0754b4d32da89eb2bc0b03c)


Chengyuan Li [Fri, 20 Nov 2015 05:29:39 +0000 (22:29 -0700)]

mon/PGMonitor: MAX AVAIL is 0 if some OSDs' weight is 0

In get_rule_avail(), even when p->second is 0 it can be used
as a divisor; the quotient is then infinity, which becomes a
negative value when converted to an integer.
So we should check the value of p->second before the calculation.

It fixes BUG #13840.

Signed-off-by: Chengyuan Li <chengyli@ebay.com>
(cherry picked from commit 18713e60edd1fe16ab571f7c83e6de026db483ca)
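
The guard described above can be sketched in Python. This is a simplified model, not the PGMonitor code: `min_projected_avail`, its arguments, and the sample values are hypothetical. Note that Python raises ZeroDivisionError on float division by zero, whereas the C++ floating-point division described in the message yields infinity.

```python
def min_projected_avail(avail_kb, weights):
    """Smallest avail/weight over OSDs with a positive weight.

    avail_kb and weights map a hypothetical OSD id to its free space and
    to its share of the rule's weight. Zero-weight OSDs are skipped so
    they are never used as a divisor.
    """
    projected = [
        avail_kb[osd] / weight
        for osd, weight in weights.items()
        if weight > 0 and osd in avail_kb
    ]
    return int(min(projected)) if projected else 0


# OSD 2 has weight 0 and must not take part in the calculation.
result = min_projected_avail({0: 100, 1: 50, 2: 10},
                             {0: 0.5, 1: 0.5, 2: 0.0})
```

Skipping the zero-weight entry keeps the result finite and positive instead of letting an infinite quotient turn into a bogus integer.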


wangchaunhong [Tue, 20 Oct 2015 10:40:23 +0000 (18:40 +0800)]

init-ceph: fix systemd-run can't start ceph daemon sometimes

Fixes: #13474
Signed-off-by: Chuanhong Wang <wang.chuanhong@zte.com.cn>
(cherry picked from commit 2f36909e1e08bac993e77d1781a777b386335669)

Conflicts:
src/init-ceph.in
different content of cmd variable


Piotr Dałek [Mon, 6 Jul 2015 07:56:11 +0000 (09:56 +0200)]

tools: fix race condition in seq/rand bench

Under certain conditions (like bench seq/rand -b 1024 -t 128) it is
possible that aio_read reads data into destination buffers before or
during memcmp execution, resulting in "[..] is not correct!" errors
even if actual objects are perfectly fine.
Also, moved latency calculation around, so it is no longer affected
by memcmp.

Signed-off-by: Piotr Dałek <piotr.dalek@ts.fujitsu.com>
Conflicts:
src/common/obj_bencher.cc


Piotr Dałek [Wed, 20 May 2015 10:41:22 +0000 (12:41 +0200)]

tools: add --no-verify option to rados bench

When doing seq and rand read benchmarks using rados bench, a quite large
portion of cpu time is consumed by doing object verification. This patch
adds an option to disable this verification when it's not needed, in turn
giving better cluster utilization. rados -p storage bench 600 rand scores
without --no-verify:

Total time run:       600.228901
Total reads made:     144982
Read size:            4194304
Bandwidth (MB/sec):   966
Average IOPS:         241
Stddev IOPS:          38
Max IOPS:             909522486
Min IOPS:             0
Average Latency:      0.0662
Max latency:          1.51
Min latency:          0.004

real    10m1.173s
user    5m41.162s
sys     11m42.961s

Same command, but with --no-verify:

Total time run:       600.161379
Total reads made:     174142
Read size:            4194304
Bandwidth (MB/sec):   1.16e+03
Average IOPS:         290
Stddev IOPS:          20
Max IOPS:             909522486
Min IOPS:             0
Average Latency:      0.0551
Max latency:          1.12
Min latency:          0.00343

real    10m1.172s
user    4m13.792s
sys     13m38.556s

Note the decreased latencies, increased bandwidth and more reads performed.

Signed-off-by: Piotr Dałek <piotr.dalek@ts.fujitsu.com>
(cherry picked from commit ca6abca63de813c83a6960f83624be8e1a86a1f8)

Conflicts:
src/common/obj_bencher.cc
src/common/obj_bencher.h
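
The bandwidth and IOPS figures in the two runs above can be re-derived from the reported totals. The sketch below is a cross-check only, and it assumes that "MB" in the bench output means 2^20 bytes, which is the interpretation that reproduces the printed numbers; `summarize` is a hypothetical helper.

```python
MIB = 1024 * 1024


def summarize(total_time_s, reads, read_size_bytes):
    """Return (bandwidth in MB/sec, average IOPS) from the reported totals."""
    bandwidth = reads * read_size_bytes / MIB / total_time_s
    iops = reads / total_time_s
    return bandwidth, iops


# Run with verification (first listing).
verify_bw, verify_iops = summarize(600.228901, 144982, 4194304)

# Run with --no-verify (second listing).
noverify_bw, noverify_iops = summarize(600.161379, 174142, 4194304)

# Relative increase in reads completed over the same 600-second window.
extra_reads = 174142 / 144982 - 1
```

Under that assumption the totals work out to roughly 966 MB/sec and 241 IOPS with verification, and roughly 1161 MB/sec and 290 IOPS without it, which is about 20% more reads in the same time.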
