]> git.apps.os.sepia.ceph.com Git - ceph.git/log
ceph.git
6 years agomsg/async/Protocol*: fix mark_down vs accept race 25823/head
Sage Weil [Tue, 8 Jan 2019 13:13:13 +0000 (07:13 -0600)]
msg/async/Protocol*: fix mark_down vs accept race

If we mark_down the connection while we have the lock dropped the state
maybe CLOSED when we retake it.  Bail out if that happens instead of
continuing to process the connection and eventually running into an
assertion on can_write.

Fixes: http://tracker.ceph.com/issues/36497
Signed-off-by: Sage Weil <sage@redhat.com>
6 years agoMerge PR #25750 into master
Patrick Donnelly [Fri, 4 Jan 2019 22:27:34 +0000 (14:27 -0800)]
Merge PR #25750 into master

* refs/pull/25750/head:
common: expose internal StackStringStream via deref

Reviewed-by: Sage Weil <sage@redhat.com>
6 years agoMerge PR #25793 into master
Sage Weil [Fri, 4 Jan 2019 21:43:23 +0000 (15:43 -0600)]
Merge PR #25793 into master

* refs/pull/25793/head:
Merge remote-tracking branch 'private/wip-mon-kv-fix' into wip-mimic-4
Merge remote-tracking branch 'private/wip-rgw-max-keys' into wip-mimic-4
doc/releases/mimic: v13.2.4
rgw: fix issues with 'enforce bounds' patch
rgw: enforce bounds on max-keys/max-uploads/max-parts
mon/config-key: limit caps allowed to access the store

Reviewed-by: Gregory Farnum <gfarnum@redhat.com>
Reviewed-by: David Galloway <dgallowa@redhat.com>
Reviewed-by: Greg Farnum <gfarnum@redhat.com>
6 years agoMerge remote-tracking branch 'private/wip-mon-kv-fix' into wip-mimic-4 25793/head
Sage Weil [Fri, 4 Jan 2019 20:03:56 +0000 (14:03 -0600)]
Merge remote-tracking branch 'private/wip-mon-kv-fix' into wip-mimic-4

6 years agoMerge remote-tracking branch 'private/wip-rgw-max-keys' into wip-mimic-4
Sage Weil [Fri, 4 Jan 2019 20:03:38 +0000 (14:03 -0600)]
Merge remote-tracking branch 'private/wip-rgw-max-keys' into wip-mimic-4

6 years agodoc/releases/mimic: v13.2.4
Sage Weil [Fri, 4 Jan 2019 19:50:48 +0000 (13:50 -0600)]
doc/releases/mimic: v13.2.4

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agoMerge PR #25009 into master
Sage Weil [Fri, 4 Jan 2019 19:42:09 +0000 (13:42 -0600)]
Merge PR #25009 into master

* refs/pull/25009/head:
librbd: stringify locker name with get_legacy_str()
osdc/Objecter: fix list_watchers addr rendering to match legacy
test/crimson: disable unittest_seastar_messenger test
msg/msg_types: encode entity_addr_t TYPE_ANY as TYPE_LEGACY for pre-nautilus
client: make blacklist detection handle TYPE_ANY entries
mon/OSDMonitor: maintain compat output for 'blacklist ls'
client: maintain compat for {inst,addr}_str in status dump
qa/tasks/ceph_manager: compare osd flush seq #'s as ints
qa/suites/fs: make use of simple.yaml where appropriate
qa/msgr: move msgr factet into generic re-usable dir
crimson: fix monmap build for seastar
doc/start/ceph.conf: trim the sample ceph.conf file
doc/rados/operations: only describe --public-{addr,network} method for adding mons
PendingReleaseNotes: deprecate 'mon addr'
doc: fix some 'mon addr' references
doc/rados/configuration: fix some 'mon addr' references
doc/rados/configuration/network-config-ref: revise network docs somewhat
doc/rados/configuration/network-config-ref: remove totally obsolete section
qa/suites/rados: replace mon_seesaw.py task with a small bash script
qa/suites/fs/upgrade: don't bind to v2 addrs
qa/tasks/mon_thrash: avoid 'mon addr' in mon section
mon/MonClient: disable ms_bind_msgr2 if NAUTILUS feature not set
osd/OSDMap: maintain compat addr fields
msg/msg_types: add get_legacy_str()
mds/MDSMap.h: maintain compat addr field
mon/MgrMap: maintain compat active_addr field
mon/MonClient: reconnect to mon if it's addrvec appears to have changed
qa/tasks/ceph.conf.template: increase mon_mgr_mkfs_grace
msg/async/ProtocolV2: fill in IP for all peer_addrs
msg/async: print all addrs on debug lines
mon/MonMap: no noname- mon name prefix when for_mkfs
ceph-monstore-tool: print initial monmap
msg/async/ProtocolV2: advertise ourselves as a v2 addr when using v2 protocol
msg/async: assert existing protocol matches current protocol
msg/async: add missing modelines
mon/MonMap: add missing modeline
vstart.sh: put mon addrs in mon_host, not 'mon addr'
msg/async: better debug around conn map lookups and updates
mon/MonClient: dump initial monmap at debug level 10
qa/standalone/osd/osd-fast-mark-down: use v1 addr w/ simplemessenger
qa/tasks/ceph: set initial monmap features with using addrvec addrs
monmaptool: add --enable-all-features option
qa/tasks/ceph: only use monmaptool --addv if addr has [,:v]
qa/tasks/ceph_manager: make get_mon_status use mon addr
qa/tasks/ceph: keep mon addrs in ctx namespace
mon/OSDMonitor: log all osd addrs on boot
msg/simple: behave when v2 and v1 addrs are present at target
mon/MonClient: warn if global_id changes
msg/Connection: add warning/note on get_peer_global_id
mds/MDSDaemon: clean up handle_mds_map debug output a bit
qa/suites/rados/upgrade: debug mds
mds/MDSRank: improve is_stale_message to handle addrvecs
msg/async: make loopback detect when sending to one of our many addrs
qa/suites/rados/upgrade: no aggressive pg num changes
mon/OSDMonitor: require nautilus mons for require_osd_release=nautilus
mon/OSDMonitor: require mimic mons for require_osd_release=mimic
qa/suites/rados/thrash-old-clients: use legacy addr syntax in ceph.conf
msg/async: preserve peer features when replacing a connection
qa/tasks/ceph.py: move methods from teuthology.git into ceph.py directly; support mon bind * options
mon/MonMap: adjust build_initial behavior for mkfs vs probe
mon/MonMap: improve ambiguous addr behavior
qa/suites/rados/upgrade: spread mons a bit
qa/rados/thrash-old-clients: keep mons on separate hosts
qa/standalone/mon/misc.sh: tweak test to be more robust
qa/tasks/mon_seesaw: expect v1/v2 prefix in addr
osd/OSDMap: fix is_blacklisted() check to assume type ANY
mon/OSDMonitor: use ANY addr type for blacklisting
mon/msg_types: TYPE_V1ORV2 -> TYPE_ANY
qa/workunits/cephtool: fix blacklist test
qa/suites/upgrade: install old version with only v1 addrs
common/options: by default, bind to both msgr v1 and v2 addresses
vstart.sh: add --msgr1, --msgr2, --msgr21 options
msg/async/ProtocolV2: be flexible with server identity check
msg/msg_types: fix entity_addrvec_t::parse() with null end arg
qa/suites/rados/basic/msgr: no msgr2 addrs in initial monmaps
qa/tasks/ceph: add 'mon_bind_addrvec' and 'mon_bind_msgr2' options
monmaptool: add --addv argument to pass in addrvec directly
qa/suites/rados/basic/msgr: do not use msgr2 with simplemessenger
qa/suites/rados/basic/msgr: async is not experimental
messages/MOSDBoot: fix compat with pre-nautilus
mon/MonMap: allow v1 or v2 to be explicitly specified along with part
msg/msg_types: allow parsing of IPs without assuming v1 vs v2
msg/msg_types: default parse to v2 addrs
msg: standarize on v1: and v2: prefixes for *all* entity_addr_t's
vstart.sh: use msgr2 by default
mon/MonMap: remove get_addr() methods
ceph-mon: adjust startup/bind/join sequence to use addrs
mon: use MonMap::get_addrs() (instead of get_addr())
mon/MonClient: change pending_cons to addrvec-based map
mon/MonMap: fix set_addr() caller, kill wrapper
mon/MonMap: remove addr-based add()
monmaptool: fix --add to do either legacy or msgr2+legacy
monmaptool: clean up iterator use a bit
mon/MonMap: handle ambiguous mon addrs by trying both legacy and msgr
mon/MonMap: take addrvec for set_initial_members
mon/MonMap: use addrvecs for test instances
mon: pass addrvec via MMonJoin
mon/MonmapMonitor: fix 'mon add' to populate addrvec
mon/MonMap: addr -> addrvec
msg/async/ProtocolV2: only update socket_addr if we learned our addr
osd: go active even if mon only accepted our v1 addr
test/msgr: add test for msgr2 protocol
msg/async/ProtocolV2: share socket_addr and all addrs during handshake
msg/async: print socket_addr for the connection
msg/async: msgr2 protocol placeholder
msg/async: move ProtocolV1 class to its own source file
msg/async: keep listen addr in ServerSocket, pass to new connections
msg/async/AsyncMessenger: fix set_addr_unknowns

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
6 years agoMerge PR #25790 into master
Sage Weil [Fri, 4 Jan 2019 19:40:58 +0000 (13:40 -0600)]
Merge PR #25790 into master

* refs/pull/25790/head:
common/blkdev: Fix missing stub block_device_get_metrics() on other platforms

Reviewed-by: Sage Weil <sage@redhat.com>
6 years agolibrbd: stringify locker name with get_legacy_str() 25009/head
Sage Weil [Fri, 4 Jan 2019 19:00:11 +0000 (13:00 -0600)]
librbd: stringify locker name with get_legacy_str()

This avoids the new v1: v2: prefixes.

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agoosdc/Objecter: fix list_watchers addr rendering to match legacy
Sage Weil [Fri, 4 Jan 2019 18:46:27 +0000 (12:46 -0600)]
osdc/Objecter: fix list_watchers addr rendering to match legacy

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agoMerge pull request #25781 from neha-ojha/wip-fix-upgrade-rhel
Neha Ojha [Fri, 4 Jan 2019 16:01:17 +0000 (08:01 -0800)]
Merge pull request #25781 from neha-ojha/wip-fix-upgrade-rhel

qa/suites/upgrade/mimic-x: fix rhel runs

Reviewed-by: Yuri Weinstein <yweins@redhat.com>
6 years agoMerge pull request #25154 from rhcs-dashboard/dashboard-url
Ricardo Marques [Fri, 4 Jan 2019 15:00:32 +0000 (15:00 +0000)]
Merge pull request #25154 from rhcs-dashboard/dashboard-url

mgr/dashboard: check for existence of Grafana dashboard

Reviewed-by: Alfonso Martínez <almartin@redhat.com>
Reviewed-by: Patrick Nawracay <pnawracay@suse.com>
6 years agotest/crimson: disable unittest_seastar_messenger test
Sage Weil [Fri, 4 Jan 2019 14:54:25 +0000 (08:54 -0600)]
test/crimson: disable unittest_seastar_messenger test

This is currently failing, allegedly for reasons unrelated to this PR, so
I'm disabling for now!

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agocommon/blkdev: Fix missing stub block_device_get_metrics() on other platforms 25790/head
Willem Jan Withagen [Fri, 4 Jan 2019 14:44:28 +0000 (15:44 +0100)]
common/blkdev: Fix missing stub block_device_get_metrics() on other platforms

Signed-off-by: Willem Jan Withagen <wjw@digiware.nl>
6 years agoMerge PR #25637 into master
Sage Weil [Fri, 4 Jan 2019 14:25:25 +0000 (08:25 -0600)]
Merge PR #25637 into master

* refs/pull/25637/head:
doc: release notes for 13.2.3 mimic

Reviewed-by: Abhishek Lekshmanan <abhishek.lekshmanan@gmail.com>
6 years agoMerge PR #25455 into master
Sage Weil [Fri, 4 Jan 2019 14:17:38 +0000 (08:17 -0600)]
Merge PR #25455 into master

* refs/pull/25455/head:
mgr/diskprediction Document guide add predictor accuracy

Reviewed-by: Sage Weil <sage@redhat.com>
6 years agomgr/dashboard: check for existence of Grafana dashboard 25154/head
Kanika Murarka [Wed, 5 Dec 2018 10:33:53 +0000 (16:03 +0530)]
mgr/dashboard: check for existence of Grafana dashboard

Fixes: http://tracker.ceph.com/issues/36356
Signed-off-by: Kanika Murarka <kmurarka@redhat.com>
6 years agoMerge pull request #25210 from dillaman/wip-dashboard-rbd-mirroring
Ricardo Marques [Fri, 4 Jan 2019 10:39:09 +0000 (10:39 +0000)]
Merge pull request #25210 from dillaman/wip-dashboard-rbd-mirroring

dashboard: support configuring block mirroring pools and peers

6 years agoMerge pull request #25770 from votdev/fix_telemetry_issues
Kefu Chai [Fri, 4 Jan 2019 05:23:47 +0000 (13:23 +0800)]
Merge pull request #25770 from votdev/fix_telemetry_issues

mgr/telemetry: Fix various issues

Reviewed-by: Kefu Chai <kchai@redhat.com>
6 years agoMerge pull request #25783 from neha-ojha/wip-25672
Kefu Chai [Fri, 4 Jan 2019 05:18:35 +0000 (13:18 +0800)]
Merge pull request #25783 from neha-ojha/wip-25672

common/blkdev.h: use std::string

Reviewed-by: Kefu Chai <kchai@redhat.com>
Reviewed-by: Rick Chen <rick.chen@prophetstor.com>
6 years agoMerge pull request #25758 from xiexingguo/wip-async-prefetch
Kefu Chai [Fri, 4 Jan 2019 03:16:22 +0000 (11:16 +0800)]
Merge pull request #25758 from xiexingguo/wip-async-prefetch

msg/async: improve read-prefetch logic

Reviewed-by: Kefu Chai <kchai@redhat.com>
6 years agoMerge pull request #25760 from cxytt/bug-fix
Kefu Chai [Fri, 4 Jan 2019 03:13:28 +0000 (11:13 +0800)]
Merge pull request #25760 from cxytt/bug-fix

mon: shutdown messenger early to avoid accessing deleted logger

Reviewed-by: Sage Weil <sage@redhat.com>
Reviewed-by: Kefu Chai <kchai@redhat.com>
6 years agocommon/blkdev.h: use std::string 25783/head
Neha Ojha [Fri, 4 Jan 2019 02:57:28 +0000 (18:57 -0800)]
common/blkdev.h: use std::string

Signed-off-by: Neha Ojha <nojha@redhat.com>
6 years agoMerge PR #24159 into master
Sage Weil [Fri, 4 Jan 2019 02:38:30 +0000 (20:38 -0600)]
Merge PR #24159 into master

* refs/pull/24159/head:
mon/PGMap: show pg state age instead of stamp

Reviewed-by: John Spray <john.spray@redhat.com>
Reviewed-by: Josh Durgin <jdurgin@redhat.com>
6 years agoMerge PR #25623 into master
Sage Weil [Fri, 4 Jan 2019 02:38:14 +0000 (20:38 -0600)]
Merge PR #25623 into master

* refs/pull/25623/head:
common/ceph_time: 'mo' for month
common/options: use new parse_timespan
common/ceph_time: add parse_timespan
common/config_proxy: pass err_ss through on set_val
common/ceph_time: add exact_timespan_str

Reviewed-by: Kefu Chai <kchai@redhat.com>
6 years agoMerge PR #25756 into master
Sage Weil [Fri, 4 Jan 2019 02:37:54 +0000 (20:37 -0600)]
Merge PR #25756 into master

* refs/pull/25756/head:
vstart.sh: force setting dashboard options before first mgr starts
mon/ConfigMonitor: add --force flag to 'config set'

Reviewed-by: Kefu Chai <kchai@redhat.com>
6 years agoMerge PR #25763 into master
Sage Weil [Fri, 4 Jan 2019 02:37:38 +0000 (20:37 -0600)]
Merge PR #25763 into master

* refs/pull/25763/head:
ceph_argparse: make command ceph acceph SIGINT.

Reviewed-by: Sage Weil <sage@redhat.com>
6 years agoMerge PR #25672 into master
Sage Weil [Fri, 4 Jan 2019 02:11:11 +0000 (20:11 -0600)]
Merge PR #25672 into master

* refs/pull/25672/head:
osd: OSD device smart data include additional nvme data
common/blkdev: add missing get_device_id impl
os/bluestore,filestore: use get_raw_devices
osd: update metadata and smart code to report get_device_id errors
mon: update metadata and smart commands to use get_raw_devices
common/blkdev: add get_raw_devices helper
common/blkdev: fix BlkDev::get_devid when we got a devname, not fd
common/blkdev: return optional error string from get_device_id
common/blkdev: refactor to add block_device_get_metrics returning json

Reviewed-by: Neha Ojha <nojha@redhat.com>
6 years agoMerge PR #25593 into master
Patrick Donnelly [Fri, 4 Jan 2019 01:40:53 +0000 (17:40 -0800)]
Merge PR #25593 into master

* refs/pull/25593/head:
mds: create heartbeat grace config option

Reviewed-by: Zheng Yan <zyan@redhat.com>
6 years agoosd: OSD device smart data include additional nvme data 25672/head
hsiang41 [Fri, 28 Dec 2018 09:07:32 +0000 (17:07 +0800)]
osd: OSD device smart data include additional nvme data

Add nvme addition data into the deveh health data. That use nvme tool
and command syntax "nvme <vendor> smart-log-add <dev> -json". The nvme
json output append in the dev smart "nvme_smart_health_information_add_log".

- made run_smartctl static/private
- changed get_metrics to take a const string, not c str

Signed-off-by: Rick Chen <rick.chen@prophetstor.com>
Signed-off-by: Sage Weil <sage@redhat.com>
6 years agocommon/blkdev: add missing get_device_id impl
Sage Weil [Tue, 1 Jan 2019 16:58:56 +0000 (10:58 -0600)]
common/blkdev: add missing get_device_id impl

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agoos/bluestore,filestore: use get_raw_devices
Sage Weil [Fri, 21 Dec 2018 17:21:32 +0000 (11:21 -0600)]
os/bluestore,filestore: use get_raw_devices

There is a minor change here: before the devices list would include
e.g. 'sda2' and 'sda', and now it will only include 'sda'.

Fixes: http://tracker.ceph.com/issues/37542
Signed-off-by: Sage Weil <sage@redhat.com>
6 years agoosd: update metadata and smart code to report get_device_id errors
Sage Weil [Fri, 21 Dec 2018 17:22:17 +0000 (11:22 -0600)]
osd: update metadata and smart code to report get_device_id errors

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomon: update metadata and smart commands to use get_raw_devices
Sage Weil [Fri, 21 Dec 2018 17:21:57 +0000 (11:21 -0600)]
mon: update metadata and smart commands to use get_raw_devices

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agocommon/blkdev: add get_raw_devices helper
Sage Weil [Fri, 21 Dec 2018 17:20:34 +0000 (11:20 -0600)]
common/blkdev: add get_raw_devices helper

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agoqa/suites/upgrade/mimic-x: fix rhel runs 25781/head
Neha Ojha [Thu, 3 Jan 2019 21:39:30 +0000 (13:39 -0800)]
qa/suites/upgrade/mimic-x: fix rhel runs

The following fragment was required for rhel on ovh
overrides:
    ansible.cephlab:
      skip_tags: entitlements,packages,repos

Since, this suite runs on smithi in our nightlies, we should not need
this.

Signed-off-by: Neha Ojha <nojha@redhat.com>
6 years agoMerge pull request #25674 from alfredodeza/wip-rm37442
Alfredo Deza [Thu, 3 Jan 2019 18:44:06 +0000 (13:44 -0500)]
Merge pull request #25674 from alfredodeza/wip-rm37442

ceph-volume normalize comma to dot for string to int conversions

Reviewed-by: Andrew Schoen <aschoen@redhat.com>
6 years agoMerge PR #24635 into master
Patrick Donnelly [Thu, 3 Jan 2019 18:15:31 +0000 (10:15 -0800)]
Merge PR #24635 into master

* refs/pull/24635/head:
test: add scrub control command tests
mds: scrub abort/pause/resume/status control commands
test: switch using "scrub start" tell interface to initiate scrub
mds: introduce "scrub start" tell interface to initiate scrub
mds: dump scrub formatted output when context completion
mds: generate random scrub tag when empty
mds: introduce C_ExecAndReply context completion class
mds: use CInode::scrub_is_in_progress() wherever required

Reviewed-by: Patrick Donnelly <pdonnell@redhat.com>
Reviewed-by: Zheng Yan <zyan@redhat.com>
6 years agomsg/msg_types: encode entity_addr_t TYPE_ANY as TYPE_LEGACY for pre-nautilus
Sage Weil [Wed, 2 Jan 2019 23:07:45 +0000 (17:07 -0600)]
msg/msg_types: encode entity_addr_t TYPE_ANY as TYPE_LEGACY for pre-nautilus

This is primarily for OSDMap's blacklist, which, starting with nautilus,
is populated by ANY addrs instead of LEGACY addrs.

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agoclient: make blacklist detection handle TYPE_ANY entries
Sage Weil [Wed, 2 Jan 2019 17:55:19 +0000 (11:55 -0600)]
client: make blacklist detection handle TYPE_ANY entries

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomon/OSDMonitor: maintain compat output for 'blacklist ls'
Sage Weil [Tue, 1 Jan 2019 20:10:35 +0000 (14:10 -0600)]
mon/OSDMonitor: maintain compat output for 'blacklist ls'

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agoclient: maintain compat for {inst,addr}_str in status dump
Sage Weil [Tue, 1 Jan 2019 20:08:59 +0000 (14:08 -0600)]
client: maintain compat for {inst,addr}_str in status dump

Add 'addr' field too.

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agoqa/tasks/ceph_manager: compare osd flush seq #'s as ints
Sage Weil [Tue, 1 Jan 2019 15:03:24 +0000 (09:03 -0600)]
qa/tasks/ceph_manager: compare osd flush seq #'s as ints

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agoqa/suites/fs: make use of simple.yaml where appropriate
Sage Weil [Mon, 31 Dec 2018 21:45:01 +0000 (15:45 -0600)]
qa/suites/fs: make use of simple.yaml where appropriate

There's more needed than just ms_type=simple now.

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agoqa/msgr: move msgr factet into generic re-usable dir
Sage Weil [Mon, 31 Dec 2018 21:39:15 +0000 (15:39 -0600)]
qa/msgr: move msgr factet into generic re-usable dir

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agocrimson: fix monmap build for seastar
Sage Weil [Fri, 21 Dec 2018 21:55:56 +0000 (15:55 -0600)]
crimson: fix monmap build for seastar

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agodoc/start/ceph.conf: trim the sample ceph.conf file
Sage Weil [Sat, 22 Dec 2018 00:29:40 +0000 (18:29 -0600)]
doc/start/ceph.conf: trim the sample ceph.conf file

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agodoc/rados/operations: only describe --public-{addr,network} method for adding mons
Sage Weil [Sat, 22 Dec 2018 00:28:15 +0000 (18:28 -0600)]
doc/rados/operations: only describe --public-{addr,network} method for adding mons

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agoPendingReleaseNotes: deprecate 'mon addr'
Sage Weil [Sat, 22 Dec 2018 00:27:37 +0000 (18:27 -0600)]
PendingReleaseNotes: deprecate 'mon addr'

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agodoc: fix some 'mon addr' references
Sage Weil [Sat, 22 Dec 2018 00:25:55 +0000 (18:25 -0600)]
doc: fix some 'mon addr' references

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agodoc/rados/configuration: fix some 'mon addr' references
Sage Weil [Sat, 22 Dec 2018 00:25:12 +0000 (18:25 -0600)]
doc/rados/configuration: fix some 'mon addr' references

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agodoc/rados/configuration/network-config-ref: revise network docs somewhat
Sage Weil [Sat, 22 Dec 2018 00:24:52 +0000 (18:24 -0600)]
doc/rados/configuration/network-config-ref: revise network docs somewhat

These still could use a larger rewrite, but this is a bit better.

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agodoc/rados/configuration/network-config-ref: remove totally obsolete section
Sage Weil [Sat, 22 Dec 2018 00:24:23 +0000 (18:24 -0600)]
doc/rados/configuration/network-config-ref: remove totally obsolete section

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agoqa/suites/rados: replace mon_seesaw.py task with a small bash script
Sage Weil [Fri, 21 Dec 2018 18:26:29 +0000 (12:26 -0600)]
qa/suites/rados: replace mon_seesaw.py task with a small bash script

The teuthology test did not like the change to remove 'mon addr' from
ceph.conf.  The standalone script is easier to test.

Note that it avoids mon names 'a', 'b', 'c' since the MonMap::build_initial
uses those.

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agoqa/suites/fs/upgrade: don't bind to v2 addrs
Sage Weil [Fri, 21 Dec 2018 19:21:57 +0000 (13:21 -0600)]
qa/suites/fs/upgrade: don't bind to v2 addrs

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agoqa/tasks/mon_thrash: avoid 'mon addr' in mon section
Sage Weil [Fri, 21 Dec 2018 18:20:57 +0000 (12:20 -0600)]
qa/tasks/mon_thrash: avoid 'mon addr' in mon section

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomon/MonClient: disable ms_bind_msgr2 if NAUTILUS feature not set
Sage Weil [Fri, 21 Dec 2018 14:57:22 +0000 (08:57 -0600)]
mon/MonClient: disable ms_bind_msgr2 if NAUTILUS feature not set

Do not try to bind to v2 addresses until all of the mons will know what
we are doing and will be able to advertise those addresses.

This avoids the possibility of corner cases where we bind to one thing
but advertise something different via the various cluster maps.

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agoosd/OSDMap: maintain compat addr fields
Sage Weil [Fri, 21 Dec 2018 14:38:00 +0000 (08:38 -0600)]
osd/OSDMap: maintain compat addr fields

Fixes b47d9135d5345816037899cd1d12a5a20b54e6c1 and
9fb1e521c7c75c124b0dbf193e8b65ff1b5f461e.

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomsg/msg_types: add get_legacy_str()
Sage Weil [Fri, 21 Dec 2018 14:37:25 +0000 (08:37 -0600)]
msg/msg_types: add get_legacy_str()

Render a pre-nautilus entity_addr_t string.

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomds/MDSMap.h: maintain compat addr field
Sage Weil [Fri, 21 Dec 2018 14:32:05 +0000 (08:32 -0600)]
mds/MDSMap.h: maintain compat addr field

This avoids breaking anyone looking at a pre-nautilus dump.
Fixes ea1481d08d561cdf0229e440b70b88fe336d3f89.

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomon/MgrMap: maintain compat active_addr field
Sage Weil [Fri, 21 Dec 2018 14:30:03 +0000 (08:30 -0600)]
mon/MgrMap: maintain compat active_addr field

This avoids breaking anyone looking at a pre-nautilus dump.  Fixes
7f787704cdcd55df18d204c5251cdd6a3cedd604.

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomon/MonClient: reconnect to mon if it's addrvec appears to have changed
Sage Weil [Fri, 21 Dec 2018 13:17:34 +0000 (07:17 -0600)]
mon/MonClient: reconnect to mon if it's addrvec appears to have changed

This primarily kicks in if we connect to a mon's v1 address during the
initial probe and then discover that it has v2+v1.  It's a catch-all,
though, so that we'll reconnect to the (er, a) mon in any case where we
see it's addresses change.

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agoqa/tasks/ceph.conf.template: increase mon_mgr_mkfs_grace
Sage Weil [Thu, 20 Dec 2018 22:55:45 +0000 (16:55 -0600)]
qa/tasks/ceph.conf.template: increase mon_mgr_mkfs_grace

The grace starts with the monmap creation stamp, and ceph.py does a lot
of work between creating that map and actually starting daemons (e.g.,
preparing all of the osd devices), leading to occasional MGR_DOWN errors.
Double the grace period.

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomsg/async/ProtocolV2: fill in IP for all peer_addrs
Sage Weil [Thu, 20 Dec 2018 19:23:59 +0000 (13:23 -0600)]
msg/async/ProtocolV2: fill in IP for all peer_addrs

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomsg/async: print all addrs on debug lines
Sage Weil [Thu, 20 Dec 2018 19:21:49 +0000 (13:21 -0600)]
msg/async: print all addrs on debug lines

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomon/MonMap: no noname- mon name prefix when for_mkfs
Sage Weil [Thu, 20 Dec 2018 16:57:44 +0000 (10:57 -0600)]
mon/MonMap: no noname- mon name prefix when for_mkfs

Teuthology no longer puts mon addr in ceph.conf, and instead sets the
mon_host option globally.  That means a different path for
ceph-monstore-tool rebuild to regenerate the monmap, and the generated
map's names need to match teuthologies.

This change fixes the teuthology rebuild test because that tests's mon
names happen to also be 'a', 'b', 'c'.  It's fragile, but it works.

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agoceph-monstore-tool: print initial monmap
Sage Weil [Thu, 20 Dec 2018 16:54:24 +0000 (10:54 -0600)]
ceph-monstore-tool: print initial monmap

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomsg/async/ProtocolV2: advertise ourselves as a v2 addr when using v2 protocol
Sage Weil [Wed, 19 Dec 2018 22:38:01 +0000 (16:38 -0600)]
msg/async/ProtocolV2: advertise ourselves as a v2 addr when using v2 protocol

We may have learned our address from a v1 connection, so myaddrs() is
a v1 addr like [v1:1.2.3.4:123/4392].  When we connect to someone using
msgr2, we should advertise ourselves as a v2 address, or else we risk
confusing everyone because we are a "v1" endpoint using the v2 protocol.

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomsg/async: assert existing protocol matches current protocol
Sage Weil [Wed, 19 Dec 2018 20:53:43 +0000 (14:53 -0600)]
msg/async: assert existing protocol matches current protocol

If we are (potentially) replacing a connection, assert that the protocol
version matches.  If it doesn't, something very weird is going on!

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomsg/async: add missing modelines
Sage Weil [Wed, 19 Dec 2018 20:52:18 +0000 (14:52 -0600)]
msg/async: add missing modelines

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomon/MonMap: add missing modeline
Sage Weil [Wed, 19 Dec 2018 20:51:59 +0000 (14:51 -0600)]
mon/MonMap: add missing modeline

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agovstart.sh: put mon addrs in mon_host, not 'mon addr'
Sage Weil [Wed, 19 Dec 2018 16:42:58 +0000 (10:42 -0600)]
vstart.sh: put mon addrs in mon_host, not 'mon addr'

Notably, mon addr won't take an addrvec, while everything else will.

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomsg/async: better debug around conn map lookups and updates
Sage Weil [Wed, 19 Dec 2018 16:39:55 +0000 (10:39 -0600)]
msg/async: better debug around conn map lookups and updates

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomon/MonClient: dump initial monmap at debug level 10
Sage Weil [Wed, 19 Dec 2018 16:37:15 +0000 (10:37 -0600)]
mon/MonClient: dump initial monmap at debug level 10

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agoqa/standalone/osd/osd-fast-mark-down: use v1 addr w/ simplemessenger
Sage Weil [Wed, 19 Dec 2018 16:03:48 +0000 (10:03 -0600)]
qa/standalone/osd/osd-fast-mark-down: use v1 addr w/ simplemessenger

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agoqa/tasks/ceph: set initial monmap features with using addrvec addrs
Sage Weil [Wed, 19 Dec 2018 03:35:28 +0000 (21:35 -0600)]
qa/tasks/ceph: set initial monmap features with using addrvec addrs

The --add option will only infer a bare IP to include a v2 addr if the
NAUTILUS feature is there, and that isn't normally present on a freshly
generate monmap.  Add it if we are doing addrvecs!

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomonmaptool: add --enable-all-features option
Sage Weil [Wed, 19 Dec 2018 03:34:30 +0000 (21:34 -0600)]
monmaptool: add --enable-all-features option

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agoqa/tasks/ceph: only use monmaptool --addv if addr has [,:v]
Sage Weil [Wed, 19 Dec 2018 03:28:49 +0000 (21:28 -0600)]
qa/tasks/ceph: only use monmaptool --addv if addr has [,:v]

Otherwise, we want the --add path, which has the logic to infer ports,
v2+v1, etc.

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agoqa/tasks/ceph_manager: make get_mon_status use mon addr
Sage Weil [Wed, 19 Dec 2018 03:18:57 +0000 (21:18 -0600)]
qa/tasks/ceph_manager: make get_mon_status use mon addr

We don't have the 'mon addr' config property any more.

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agoqa/tasks/ceph: keep mon addrs in ctx namespace
Sage Weil [Wed, 19 Dec 2018 03:18:31 +0000 (21:18 -0600)]
qa/tasks/ceph: keep mon addrs in ctx namespace

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomon/OSDMonitor: log all osd addrs on boot
Sage Weil [Tue, 18 Dec 2018 18:52:24 +0000 (12:52 -0600)]
mon/OSDMonitor: log all osd addrs on boot

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomsg/simple: behave when v2 and v1 addrs are present at target
Sage Weil [Tue, 18 Dec 2018 13:17:12 +0000 (07:17 -0600)]
msg/simple: behave when v2 and v1 addrs are present at target

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomon/MonClient: warn if global_id changes
Sage Weil [Mon, 17 Dec 2018 19:18:29 +0000 (13:18 -0600)]
mon/MonClient: warn if global_id changes

Shouldn't happen!

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomsg/Connection: add warning/note on get_peer_global_id
Sage Weil [Mon, 17 Dec 2018 19:15:00 +0000 (13:15 -0600)]
msg/Connection: add warning/note on get_peer_global_id

This field isn't populated for loopback connections because the msgr
doesn't have any insight into what global_id its user has.

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomds/MDSDaemon: clean up handle_mds_map debug output a bit
Sage Weil [Mon, 17 Dec 2018 13:33:48 +0000 (07:33 -0600)]
mds/MDSDaemon: clean up handle_mds_map debug output a bit

The old wording was misleading!

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agoqa/suites/rados/upgrade: debug mds
Sage Weil [Mon, 17 Dec 2018 04:09:04 +0000 (22:09 -0600)]
qa/suites/rados/upgrade: debug mds

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomds/MDSRank: improve is_stale_message to handle addrvecs
Sage Weil [Fri, 14 Dec 2018 21:53:30 +0000 (15:53 -0600)]
mds/MDSRank: improve is_stale_message to handle addrvecs

If we get a connection on a loopback from ourselves, get_source_addrs()
will have everything we bound to, but the mdsmap may only have the v1
address.  Avoid the addrvec comparison by instead comparing the
ConnectionRefs.

NOTE: this implementation is a stopgap.  We should really maintain a map
of ConnectionRefs for the current up set and compare the ConnectionRefs
directly instead of comparing addr(vecs).

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomsg/async: make loopback detect when sending to one of our many addrs
Sage Weil [Fri, 14 Dec 2018 21:10:50 +0000 (15:10 -0600)]
msg/async: make loopback detect when sending to one of our many addrs

Drop the assert just because it's inefficient and not necessary.

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agoqa/suites/rados/upgrade: no aggressive pg num changes
Sage Weil [Thu, 13 Dec 2018 22:04:19 +0000 (16:04 -0600)]
qa/suites/rados/upgrade: no aggressive pg num changes

We now run with mixed mons and old mgrs, so this won't work.

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomon/OSDMonitor: require nautilus mons for require_osd_release=nautilus
Sage Weil [Thu, 13 Dec 2018 15:15:17 +0000 (09:15 -0600)]
mon/OSDMonitor: require nautilus mons for require_osd_release=nautilus

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomon/OSDMonitor: require mimic mons for require_osd_release=mimic
Sage Weil [Thu, 13 Dec 2018 15:14:03 +0000 (09:14 -0600)]
mon/OSDMonitor: require mimic mons for require_osd_release=mimic

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agoqa/suites/rados/thrash-old-clients: use legacy addr syntax in ceph.conf
Sage Weil [Thu, 13 Dec 2018 14:46:47 +0000 (08:46 -0600)]
qa/suites/rados/thrash-old-clients: use legacy addr syntax in ceph.conf

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomsg/async: preserve peer features when replacing a connection
Sage Weil [Sun, 9 Dec 2018 17:44:36 +0000 (11:44 -0600)]
msg/async: preserve peer features when replacing a connection

The features are now stored in the protocol implementation.  When we replace
an existing connection, copy those features so that our connect_msg_reply
calculates the correct features for the session.

This fixes an issue where a 3-mon cluster, after upgrading the two followers
but not the leader, was unable to include the (luminous) leader in the
quorum because it was seeing missing features in the connect reply, because
the new mons were replacing an old instance of the connection and weren't
copying the features, and that old instance had connect_msg.features == 0.

Add some debug lines that helped (finally) identify the problem.

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agoqa/tasks/ceph.py: move methods from teuthology.git into ceph.py directly; support...
Sage Weil [Fri, 7 Dec 2018 19:16:31 +0000 (13:16 -0600)]
qa/tasks/ceph.py: move methods from teuthology.git into ceph.py directly; support mon bind * options

Having these live in teuthology.git is silly, since they are only consumed
by the ceph task, and it is hard to revise the behavior.

Revise the behavior by adding mon_bind_* options.

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomon/MonMap: adjust build_initial behavior for mkfs vs probe
Sage Weil [Tue, 11 Dec 2018 03:32:39 +0000 (21:32 -0600)]
mon/MonMap: adjust build_initial behavior for mkfs vs probe

For the mkfs case, interpret an ambiguous port as a v2 address.  For probe,
try both.

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agomon/MonMap: improve ambiguous addr behavior
Sage Weil [Fri, 7 Dec 2018 16:23:16 +0000 (10:23 -0600)]
mon/MonMap: improve ambiguous addr behavior

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agoqa/suites/rados/upgrade: spread mons a bit
Sage Weil [Fri, 7 Dec 2018 18:48:09 +0000 (12:48 -0600)]
qa/suites/rados/upgrade: spread mons a bit

This will mean 2/3 mons have default ports.

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agoqa/rados/thrash-old-clients: keep mons on separate hosts
Sage Weil [Fri, 7 Dec 2018 14:51:32 +0000 (08:51 -0600)]
qa/rados/thrash-old-clients: keep mons on separate hosts

This ensures the mons can use default ports, ceph.conf won't have v1: or
v2: prefixes, and old clients will be happy.

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agoqa/standalone/mon/misc.sh: tweak test to be more robust
Sage Weil [Thu, 6 Dec 2018 20:16:48 +0000 (14:16 -0600)]
qa/standalone/mon/misc.sh: tweak test to be more robust

Signed-off-by: Sage Weil <sage@redhat.com>
6 years agoqa/tasks/mon_seesaw: expect v1/v2 prefix in addr
Sage Weil [Thu, 6 Dec 2018 17:25:48 +0000 (11:25 -0600)]
qa/tasks/mon_seesaw: expect v1/v2 prefix in addr

Signed-off-by: Sage Weil <sage@redhat.com>