]> git.apps.os.sepia.ceph.com Git - ceph.git/log
ceph.git
2 years agodoc/rados: edit ops/monitoring.rst (2 of 3) 50834/head
Zac Dover [Sun, 2 Apr 2023 22:03:29 +0000 (08:03 +1000)]
doc/rados: edit ops/monitoring.rst (2 of 3)

Line-edit the second third of doc/rados/operations/monitoring.rst.

Follows https://github.com/ceph/ceph/pull/50670.

https://tracker.ceph.com/issues/58485

Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
2 years agoMerge pull request #50818 from anthonyeleven/anthonyeleven/tweak-fs-volumes.rst
Anthony D'Atri [Sun, 2 Apr 2023 05:47:36 +0000 (01:47 -0400)]
Merge pull request #50818 from anthonyeleven/anthonyeleven/tweak-fs-volumes.rst

doc/cephfs: Improve fs-volumes.rst

2 years agodoc/cephfs: Improve fs-volumes.rst 50818/head
Anthony D'Atri [Fri, 31 Mar 2023 13:37:52 +0000 (09:37 -0400)]
doc/cephfs: Improve fs-volumes.rst

Signed-off-by: Anthony D'Atri <anthonyeleven@users.noreply.github.com>
2 years agoMerge pull request #50670 from zdover23/wip-doc-2023-03-26-rados-operations-monitorin...
zdover23 [Fri, 31 Mar 2023 19:48:39 +0000 (05:48 +1000)]
Merge pull request #50670 from zdover23/wip-doc-2023-03-26-rados-operations-monitoring-1-of-3

doc/rados: edit ops/monitoring.rst (1 of 3)

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
2 years agodoc/rados: edit ops/monitoring.rst (1 of 3) 50670/head
Zac Dover [Sat, 25 Mar 2023 18:51:50 +0000 (04:51 +1000)]
doc/rados: edit ops/monitoring.rst (1 of 3)

Line-edit the first third of doc/rados/operations/monitoring.rst.

https://tracker.ceph.com/issues/58485

Signed-off-by: Zac Dover <zac.dover@proton.me>
2 years agoMerge pull request #46905 from lxbsz/wip-quota
Rishabh Dave [Fri, 31 Mar 2023 18:42:19 +0000 (00:12 +0530)]
Merge pull request #46905 from lxbsz/wip-quota

mds: align quota.max_bytes to 4MB or 4KB

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Milind Changire <mchangir@redhat.com>
Reviewed-by: Kotresh HR <khiremat@redhat.com>
Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
Reviewed-by: Rishabh Dave <ridave@redhat.com>
2 years agoMerge pull request #49882 from zhsgao/dump_resolve_status
Rishabh Dave [Fri, 31 Mar 2023 18:40:05 +0000 (00:10 +0530)]
Merge pull request #49882 from zhsgao/dump_resolve_status

mds: fix bug of dump_resolve_status

Reviewed-by: Kotresh HR <khiremat@redhat.com>
Reviewed-by: Dhairya Parmar <dparmar@redhat.com>
Reviewed-by: Rishabh Dave <ridave@redhat.com>
2 years agoMerge pull request #50024 from kotreshhr/mds-unused-function-removal
Rishabh Dave [Fri, 31 Mar 2023 18:36:49 +0000 (00:06 +0530)]
Merge pull request #50024 from kotreshhr/mds-unused-function-removal

mds: Remove unused declaration of the function

Reviewed-by: Dhairya Parmar <dparmar@redhat.com>
Reviewed-by: Rishabh Dave <ridave@redhat.com>
2 years agoMerge pull request #50746 from cbodley/wip-rgw-bucket-full-sync-compat
Casey Bodley [Fri, 31 Mar 2023 18:34:42 +0000 (14:34 -0400)]
Merge pull request #50746 from cbodley/wip-rgw-bucket-full-sync-compat

rgw: set init_check_compat when bucket sync status doesn't exist

Reviewed-by: Shilpa Jagannath <smanjara@redhat.com>
Reviewed-by: Mark Kogan <mkogan@redhat.com>
2 years agoMerge pull request #50644 from cbodley/wip-59135
Ali Maredia [Fri, 31 Mar 2023 18:27:26 +0000 (14:27 -0400)]
Merge pull request #50644 from cbodley/wip-59135

qa/rgw: upgrade tests restrict supported distros

2 years agoMerge pull request #50690 from cbodley/wip-59174
Casey Bodley [Fri, 31 Mar 2023 17:48:52 +0000 (13:48 -0400)]
Merge pull request #50690 from cbodley/wip-59174

rgw/admin: 'data sync status' formats binary error repo entries

Reviewed-by: Matt Benjamin <mbenjamin@redhat.com>
2 years agoMerge pull request #50462 from matutter/main
Casey Bodley [Fri, 31 Mar 2023 17:47:48 +0000 (13:47 -0400)]
Merge pull request #50462 from matutter/main

sts: Fixes get_cert_url improper url path concatenation

Reviewed-by: Pritha Srivastava <prsrivas@redhat.com>
2 years agoMerge pull request #50623 from VVoidV/fix-rgw_admin
Casey Bodley [Fri, 31 Mar 2023 17:47:18 +0000 (13:47 -0400)]
Merge pull request #50623 from VVoidV/fix-rgw_admin

[RGW]fix minor bug in rgw_admin

Reviewed-by: Casey Bodley <cbodley@redhat.com>
Reviewed-by: J. Eric Ivancich <ivancich@redhat.com>
2 years agoMerge pull request #50648 from cbodley/wip-58035
Casey Bodley [Fri, 31 Mar 2023 17:43:01 +0000 (13:43 -0400)]
Merge pull request #50648 from cbodley/wip-58035

rgw: fix CopyObj crash after admin override

Reviewed-by: Matt Benjamin <mbenjamin@redhat.com>
Reviewed-by: Daniel Gryniewicz <dang@redhat.com>
Reviewed-by: Mark Kogan <mkogan@redhat.com>
2 years agoMerge pull request #50817 from bluikko/patch-10
Anthony D'Atri [Fri, 31 Mar 2023 13:18:15 +0000 (09:18 -0400)]
Merge pull request #50817 from bluikko/patch-10

doc/cephfs: refer to placement of MDS and not NFS in fs-volumes.rst

2 years agodoc/cephfs: refer to MDS and not NFS in fs-volumes.rst 50817/head
Ville Ojamo [Fri, 31 Mar 2023 12:01:39 +0000 (19:01 +0700)]
doc/cephfs: refer to MDS and not NFS in fs-volumes.rst

Placement should talk about MDS placement and not NFS Ganesha placement.

Signed-off-by: Ville Ojamo <14869000+bluikko@users.noreply.github.com>
2 years agoMerge pull request #50793 from petrutlucian94/allperms 49838/head
Ilya Dryomov [Fri, 31 Mar 2023 08:54:45 +0000 (10:54 +0200)]
Merge pull request #50793 from petrutlucian94/allperms

include: move ALLPERMS definition to compat.h

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
2 years agoinclude: move ALLPERMS definition to compat.h 50793/head
Lucian Petrut [Thu, 30 Mar 2023 12:14:10 +0000 (12:14 +0000)]
include: move ALLPERMS definition to compat.h

The Windows CI job started to fail as some libcephfs tests that use
ALLPERMS have been moved [1] to a separate file which doesn't have
the ALLPERMS definition.

  /ceph/src/test/libcephfs/suidsgid.cc:240:36: error: ‘ALLPERMS’ was
  not declared in this scope
  240 |   ASSERT_EQ(stx.stx_mode & (mode_t)ALLPERMS, before_mode);

We'll move this definition to compat.h so that we won't have to
redefine it in each file that uses it.

Note that we're moving the Windows "fs_compat.h" include up,
ensuring that the constants used by ALLPERMS are defined.

[1] https://github.com/ceph/ceph/commit/254eeb2730881ee48b4743e1422749d5b643f52a

Signed-off-by: Lucian Petrut <lpetrut@cloudbasesolutions.com>
2 years agoMerge pull request #49586 from mohan7427/mohan/batch_args-issue
Guillaume Abrioux [Fri, 31 Mar 2023 07:07:43 +0000 (09:07 +0200)]
Merge pull request #49586 from mohan7427/mohan/batch_args-issue

ceph-volume: fix drive-group issue that expects the batch_args to be a string

2 years agoMerge pull request #50554 from zdover23/wip-doc-2023-03-16-rados-operations-pg-repair
zdover23 [Fri, 31 Mar 2023 06:45:06 +0000 (16:45 +1000)]
Merge pull request #50554 from zdover23/wip-doc-2023-03-16-rados-operations-pg-repair

doc/rados: line-edit pg-repair.rst

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
2 years agoMerge pull request #45354 from dvanders/dualdoc
zdover23 [Fri, 31 Mar 2023 06:25:31 +0000 (16:25 +1000)]
Merge pull request #45354 from dvanders/dualdoc

doc/msgr2: update dual stack status

Reviewed-by: Zac Dover <zac.dover@proton.me>
2 years agoMerge pull request #50426 from rhcs-dashboard/role-create
Nizamudeen A [Fri, 31 Mar 2023 05:12:09 +0000 (10:42 +0530)]
Merge pull request #50426 from rhcs-dashboard/role-create

mgr/dashboard: creation of rgw roles

Reviewed-by: Pegonzal <NOT@FOUND>
Reviewed-by: Ernesto Puerta <epuertat@redhat.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>
2 years agoMerge PR #49460 into main
Venky Shankar [Fri, 31 Mar 2023 04:02:37 +0000 (09:32 +0530)]
Merge PR #49460 into main

* refs/pull/49460/head:
qa: fix issue with fn unable to fetch port and ip
qa: fix helper function _check_nfs_cluster_status()
qa: fix testcase 'test_cluster_set_user_config_with_non_existing_clusterid'
qa: fix cluster creation failure in test_nfs.py
qa: test export creation at filepath and symlink
qa: added test case test_nfs_export_with_invalid_path
mgr/nfs: disallow non-existent paths when creating export
mgr/nfs/tests: mock check_cephfs_path
mgr/nfs/utils: add helper func to check cephfs path

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Rishabh Dave <ridave@redhat.com>
2 years agoMerge pull request #50674 from zdover23/wip-doc-2023-03-26-rados-operations-health...
zdover23 [Thu, 30 Mar 2023 23:38:03 +0000 (09:38 +1000)]
Merge pull request #50674 from zdover23/wip-doc-2023-03-26-rados-operations-health-checks-1-of-5

doc/rados/ops: edit health-checks.rst (1 of 5)

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
2 years agoMerge pull request #50794 from ljflores/wip-crontab
Laura Flores [Thu, 30 Mar 2023 20:42:28 +0000 (15:42 -0500)]
Merge pull request #50794 from ljflores/wip-crontab

qa/crontab: check older builds on teuthology/nop when necessary

2 years agoMerge pull request #49824 from adk3798/iscsi-etc-hosts
Adam King [Thu, 30 Mar 2023 16:24:32 +0000 (12:24 -0400)]
Merge pull request #49824 from adk3798/iscsi-etc-hosts

cephadm: mount host /etc/hosts for daemon containers in podman deployments

Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
Reviewed-by: Michael Fritch <mfritch@suse.com>
Reviewed-by: Nizamudeen A <nia@redhat.com>
2 years agoMerge pull request #50328 from adk3798/maintenance-mode-force
Adam King [Thu, 30 Mar 2023 16:20:58 +0000 (12:20 -0400)]
Merge pull request #50328 from adk3798/maintenance-mode-force

mgr/cephadm: add more aggressive force flag for host maintenance enter

Reviewed-by: John Mulligan <jmulligan@redhat.com>
2 years agoMerge pull request #50547 from adk3798/ssh-denied-error-handling
Adam King [Thu, 30 Mar 2023 16:18:53 +0000 (12:18 -0400)]
Merge pull request #50547 from adk3798/ssh-denied-error-handling

mgr/cephadm: handle HostConnectionError when checking for valid addr

Reviewed-by: John Mulligan <jmulligan@redhat.com>
Reviewed-by: Redouane Kachach <rkachach@redhat.com>
2 years agoMerge pull request #50607 from adk3798/keepalive-2-2-4
Adam King [Thu, 30 Mar 2023 16:16:29 +0000 (12:16 -0400)]
Merge pull request #50607 from adk3798/keepalive-2-2-4

cephadm: bump default keepalived image to 2.2.4

Reviewed-by: Michael Fritch <mfritch@suse.com>
Reviewed-by: Redouane Kachach <rkachach@redhat.com>
2 years agoMerge pull request #50655 from rkachach/fix_issue_58158
Adam King [Thu, 30 Mar 2023 16:14:33 +0000 (12:14 -0400)]
Merge pull request #50655 from rkachach/fix_issue_58158

mgr/cephadm: increasing container stop timeout for OSDs

Reviewed-by: Adam King <adking@redhat.com>
2 years agoMerge pull request #50685 from rkachach/fix_issue_59170
Adam King [Thu, 30 Mar 2023 15:54:52 +0000 (11:54 -0400)]
Merge pull request #50685 from rkachach/fix_issue_59170

mgr/cephadm: fixing ceph-exporter prometheus's job section

Reviewed-by: avanthakkar <avanjohn@gmail.com>
2 years agoqa/crontab: check older builds on teuthology/nop when necessary 50794/head
Laura Flores [Thu, 30 Mar 2023 15:38:14 +0000 (10:38 -0500)]
qa/crontab: check older builds on teuthology/nop when necessary

Today's scheduled run failed since the newest build of main
had failed. If we add `-n 10` to the command, this will
make it so we start at the newest build and backtrack
up to 10 older builds if necessary.

A higher number than that is not necessary, as the suite
failing to run will signal to us that more than the last
10 main builds are broken in Shaman.

Signed-off-by: Laura Flores <lflores@redhat.com>
2 years agotest/rgw: ceph_test_librgw_file target also builds associated tests 50648/head
Casey Bodley [Thu, 30 Mar 2023 15:13:37 +0000 (11:13 -0400)]
test/rgw: ceph_test_librgw_file target also builds associated tests

this just makes it easier to build all of the tests needed by the
test_librgw_file.sh workunit

Signed-off-by: Casey Bodley <cbodley@redhat.com>
2 years agoMerge pull request #50684 from yuvalif/wip-yuval-fix-59136
Matt Benjamin [Thu, 30 Mar 2023 14:25:16 +0000 (10:25 -0400)]
Merge pull request #50684 from yuvalif/wip-yuval-fix-59136

rgw/notifications: support bucket notification with bucket policy

2 years agoqa: fix issue with fn unable to fetch port and ip 49460/head
dparmar18 [Fri, 24 Feb 2023 21:29:24 +0000 (02:59 +0530)]
qa: fix issue with fn unable to fetch port and ip

_get_port_ip_info() fails to fetch port and ip due to empty 'backend' key:

2023-02-24T20:49:09.084 DEBUG:teuthology.orchestra.run.smithi042:> sudo adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 120 ceph --cluster ceph nfs cluster info test
2023-02-24T20:49:09.471 INFO:teuthology.orchestra.run.smithi042.stdout:{
2023-02-24T20:49:09.472 INFO:teuthology.orchestra.run.smithi042.stdout:  "test": {
2023-02-24T20:49:09.472 INFO:teuthology.orchestra.run.smithi042.stdout:    "backend": [],
2023-02-24T20:49:09.472 INFO:teuthology.orchestra.run.smithi042.stdout:    "virtual_ip": null
2023-02-24T20:49:09.472 INFO:teuthology.orchestra.run.smithi042.stdout:  }
2023-02-24T20:49:09.472 INFO:teuthology.orchestra.run.smithi042.stdout:}

it then raises:

2023-02-24T20:49:10.323 INFO:tasks.cephfs_test_runner:    info_output = json.loads(self._nfs_cmd('cluster', 'info', self.cluster_id))['test']['backend'][0]
2023-02-24T20:49:10.323 INFO:tasks.cephfs_test_runner:IndexError: list index out of range

Signed-off-by: Dhairya Parmar <dparmar@redhat.com>
2 years agoqa: fix helper function _check_nfs_cluster_status()
dparmar18 [Tue, 21 Feb 2023 18:08:42 +0000 (23:38 +0530)]
qa: fix helper function _check_nfs_cluster_status()

Comment in the code says to wait for two minutes as cluster
creation takes time but actually it's waiting for thirteen
minutes, it's not required to wait this long, i think a minute
here is more than enough, also switched to using safe_while().

Signed-off-by: Dhairya Parmar <dparmar@redhat.com>
2 years agoqa: fix testcase 'test_cluster_set_user_config_with_non_existing_clusterid'
dparmar18 [Fri, 17 Feb 2023 18:35:19 +0000 (00:05 +0530)]
qa: fix testcase 'test_cluster_set_user_config_with_non_existing_clusterid'

Fixes: https://tracker.ceph.com/issues/58758
Signed-off-by: Dhairya Parmar <dparmar@redhat.com>
2 years agoqa: fix cluster creation failure in test_nfs.py
dparmar18 [Mon, 13 Feb 2023 14:32:06 +0000 (20:02 +0530)]
qa: fix cluster creation failure in test_nfs.py

Also adds a function _nfs_complete_cmd() that returns process obj so that stdout/stderr
can be used for evaluation(_nfs_cmd() uses raw_cluster_cmd() that returns just stdout
and it became difficult to time cluster creation errors in _test_create_cluster()).

It takes sometime to update the cluster data, therefore running the command set
(check nfs server status -> nfs cluster create test -> check cluster status) in
a loop (max six iteration with sleep of 5 secs at each iteration) fixes the issue.

Fixes: https://tracker.ceph.com/issues/58744
Signed-off-by: Dhairya Parmar <dparmar@redhat.com>
2 years agoqa: test export creation at filepath and symlink
dparmar18 [Tue, 7 Feb 2023 11:52:56 +0000 (17:22 +0530)]
qa: test export creation at filepath and symlink

Fixes: https://tracker.ceph.com/issues/58228
Signed-off-by: Dhairya Parmar <dparmar@redhat.com>
2 years agoqa: added test case test_nfs_export_with_invalid_path
dparmar18 [Thu, 15 Dec 2022 12:00:07 +0000 (17:30 +0530)]
qa: added test case test_nfs_export_with_invalid_path

Fixes: https://tracker.ceph.com/issues/58228
Signed-off-by: Dhairya Parmar <dparmar@redhat.com>
2 years agomgr/nfs: disallow non-existent paths when creating export
dparmar18 [Thu, 15 Dec 2022 11:24:27 +0000 (16:54 +0530)]
mgr/nfs: disallow non-existent paths when creating export

Fixes: https://tracker.ceph.com/issues/58228
Signed-off-by: Dhairya Parmar <dparmar@redhat.com>
2 years agomgr/nfs/tests: mock check_cephfs_path
Dhairya Parmar [Wed, 29 Mar 2023 17:58:59 +0000 (23:28 +0530)]
mgr/nfs/tests: mock check_cephfs_path

Signed-off-by: Dhairya Parmar <dparmar@redhat.com>
2 years agoMerge pull request #45693 from cbodley/wip-qa-rgw-scrub
Casey Bodley [Thu, 30 Mar 2023 12:50:30 +0000 (08:50 -0400)]
Merge pull request #45693 from cbodley/wip-qa-rgw-scrub

qa/rgw: reenable wait-for-scrub

Reviewed-by: Josh Durgin <jdurgin@redhat.com>
2 years agoMerge PR #47649 into main
Venky Shankar [Thu, 30 Mar 2023 10:43:48 +0000 (16:13 +0530)]
Merge PR #47649 into main

* refs/pull/47649/head:
mds: adjust MDSRank::command_tag_path invocation of enqueue_scrub()
doc/scrub: documented stray evaluation using recursive scrub
qa: added testcases
mds: make `scrub status` print flag `scrub_mdsdir`
mds: add scrub_mdsdir to ScrubHeader
mds: do not dump multiple JSON obj
mds: evaluate strays while performing scrub on root path
mds: remove inode from scrub_stack if being purged
mds: do not scrub inode if it is purging

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2 years agoMerge PR #50053 into main
Venky Shankar [Thu, 30 Mar 2023 09:18:26 +0000 (14:48 +0530)]
Merge PR #50053 into main

* refs/pull/50053/head:
libcephfs: move ClearSetuid to suidsgid.cc
libcephfs: add test cases for dropping the suid/sgid in write/truncate
libcephfs: add test cases for dropping the suid/sgid in fallocate
libcephfs: fix ClearSetuid incorrectly using SETATTR_MODE mask
client: switch to clear_suid_sgid for ftruncate
client: switch to clear_suid_sgid for _write()
mds/client: clear the suid/sgid in fallocate path
client: allow unprivileged users to clear suid/sgid

Reviewed-by: Venky Shankar <vshankar@redhat.com>
Reviewed-by: Milind Changire <mchangir@redhat.com>
2 years agoMerge pull request #50687 from rishabh-d-dave/fs-qa-kernel-untar-tar
Rishabh Dave [Thu, 30 Mar 2023 09:13:49 +0000 (14:43 +0530)]
Merge pull request #50687 from rishabh-d-dave/fs-qa-kernel-untar-tar

qa/workunit/fs: print commands for making debugging easier

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2 years agoMerge pull request #50764 from zdover23/wip-doc-2023-03-230-start-documenting-ceph...
Anthony D'Atri [Thu, 30 Mar 2023 04:59:11 +0000 (00:59 -0400)]
Merge pull request #50764 from zdover23/wip-doc-2023-03-230-start-documenting-ceph-squash-formatting

doc/start: format procedure in documenting-ceph

2 years agoMerge PR #50692 into main
Patrick Donnelly [Thu, 30 Mar 2023 03:01:33 +0000 (23:01 -0400)]
Merge PR #50692 into main

* refs/pull/50692/head:
tools/cephfs: include lost+found in scan_links

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2 years agodoc/rados/ops: edit health-checks.rst (1 of x) 50674/head
Zac Dover [Sat, 25 Mar 2023 20:02:19 +0000 (06:02 +1000)]
doc/rados/ops: edit health-checks.rst (1 of x)

Edit docs/rados/operations/health-checks.rst (1 of x).

https://tracker.ceph.com/issues/58485

Signed-off-by: Zac Dover <zac.dover@proton.me>
2 years agoMerge pull request #50742 from zdover23/wip-doc-2023-03-29-start-documenting-ceph...
zdover23 [Thu, 30 Mar 2023 01:01:29 +0000 (11:01 +1000)]
Merge pull request #50742 from zdover23/wip-doc-2023-03-29-start-documenting-ceph-notify-us

doc/start: update "notify us" section

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
2 years agodoc/start: format procedure in documenting-ceph 50764/head
Zac Dover [Thu, 30 Mar 2023 00:54:52 +0000 (10:54 +1000)]
doc/start: format procedure in documenting-ceph

Organize three steps in a procedure into one step with three sub-steps.

Signed-off-by: Zac Dover <zac.dover@proton.me>
2 years agoMerge pull request #50744 from zdover23/wip-doc-2023-03-29-rados-ops-health-checks
Anthony D'Atri [Wed, 29 Mar 2023 21:53:29 +0000 (17:53 -0400)]
Merge pull request #50744 from zdover23/wip-doc-2023-03-29-rados-ops-health-checks

doc/rados: add link to ops/health-checks.rst

2 years agoMerge PR #49773 into main
Patrick Donnelly [Wed, 29 Mar 2023 20:15:47 +0000 (16:15 -0400)]
Merge PR #49773 into main

* refs/pull/49773/head:
mds: add config to decide whether to mark dentry bad
qa: add missing scan_links step for data scan recovery
qa/tasks/cephfs: test damage to dentry's first is caught
qa/tasks/cephfs: use rank_asok and allow specifying rank
qa/tasks: allow specifying timeout command prefix to ceph
mds: provide test configs for creating first corruption
mds: catch damage to dentry's first field
mds: add debugging for pre_cow_old_inode
mds: cleanup code

Reviewed-by: Kotresh Hiremath Ravishankar <khiremat@redhat.com>
2 years agomgr/nfs/utils: add helper func to check cephfs path
Dhairya Parmar [Wed, 29 Mar 2023 17:50:50 +0000 (23:20 +0530)]
mgr/nfs/utils: add helper func to check cephfs path

this helper instantiates CephfsClient, however this was
initially planned in ExportMgr class in export.py but
due to make check failure where main python thread
experienced a dead lock which after several efforts
pointed at instantiation of CephfsClient in ExportMgr
was problematic, it was decided in order to achieve
singleton behavior, func has been added inside this
helper func that restricts instantiation using functool's
lru_cache.

Signed-off-by: Dhairya Parmar <dparmar@redhat.com>
2 years agoMerge pull request #50750 from Sainithin-bit/main
zdover23 [Wed, 29 Mar 2023 19:09:15 +0000 (05:09 +1000)]
Merge pull request #50750 from Sainithin-bit/main

 doc/rados/operations: Fix typo in erasure-code.rst

Reviewed-by: Zac Dover <zac.dover@proton.me>
2 years agodoc/rados/operations: Fix typo in erasure-code.rst 50750/head
SAINITHIN.ARTHAM [Wed, 29 Mar 2023 18:16:32 +0000 (23:46 +0530)]
doc/rados/operations: Fix typo in erasure-code.rst

Signed-off-by: Sainithin Artham <sai.artham.19cse@bmu.edu.in>
2 years agoMerge pull request #50633 from cbodley/wip-54102
Casey Bodley [Wed, 29 Mar 2023 18:16:15 +0000 (14:16 -0400)]
Merge pull request #50633 from cbodley/wip-54102

qa/rgw: unpin centos for verify suite

Reviewed-by: Daniel Gryniewicz <dang@redhat.com>
2 years agomds: add config to decide whether to mark dentry bad 49773/head
Patrick Donnelly [Wed, 29 Mar 2023 13:05:25 +0000 (09:05 -0400)]
mds: add config to decide whether to mark dentry bad

So admin can restore access to files if necessary.

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2 years agoqa: add missing scan_links step for data scan recovery
Patrick Donnelly [Fri, 24 Mar 2023 21:54:35 +0000 (17:54 -0400)]
qa: add missing scan_links step for data scan recovery

Without, the first field remains corrupt (HEAD).

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2 years agoqa/tasks/cephfs: test damage to dentry's first is caught
Patrick Donnelly [Wed, 18 Jan 2023 02:22:10 +0000 (21:22 -0500)]
qa/tasks/cephfs: test damage to dentry's first is caught

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2 years agoqa/tasks/cephfs: use rank_asok and allow specifying rank
Patrick Donnelly [Fri, 20 Jan 2023 23:20:15 +0000 (18:20 -0500)]
qa/tasks/cephfs: use rank_asok and allow specifying rank

This will use the more efficient:

    ceph tell mds.<fsname>:<rank> ...

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2 years agoqa/tasks: allow specifying timeout command prefix to ceph
Patrick Donnelly [Wed, 18 Jan 2023 02:21:08 +0000 (21:21 -0500)]
qa/tasks: allow specifying timeout command prefix to ceph

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2 years agomds: provide test configs for creating first corruption
Patrick Donnelly [Wed, 18 Jan 2023 02:30:04 +0000 (21:30 -0500)]
mds: provide test configs for creating first corruption

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2 years agomds: catch damage to dentry's first field
Patrick Donnelly [Wed, 18 Jan 2023 02:29:39 +0000 (21:29 -0500)]
mds: catch damage to dentry's first field

When possible. Abort the MDS before it can be written to the
journal/directory.

This is part of a series to address corruption first observed in [1].
How the corruption is introduced is yet unknown.

[1] https://tracker.ceph.com/issues/38452#note-10

Fixes: http://tracker.ceph.com/issues/58482
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2 years agomds: add debugging for pre_cow_old_inode
Patrick Donnelly [Wed, 15 Mar 2023 18:06:11 +0000 (14:06 -0400)]
mds: add debugging for pre_cow_old_inode

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2 years agomds: cleanup code
Patrick Donnelly [Tue, 10 Jan 2023 20:35:04 +0000 (15:35 -0500)]
mds: cleanup code

Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2 years agoMerge pull request #50553 from ljflores/wip-reef-crontab 50659/head
Laura Flores [Wed, 29 Mar 2023 16:59:21 +0000 (11:59 -0500)]
Merge pull request #50553 from ljflores/wip-reef-crontab

qa/crontab: add reef upgrade tests and teuthology/nop

2 years agoMerge pull request #50686 from rishabh-d-dave/fs-qa-client-upgrade
Rishabh Dave [Wed, 29 Mar 2023 16:49:35 +0000 (22:19 +0530)]
Merge pull request #50686 from rishabh-d-dave/fs-qa-client-upgrade

qa/suites/fs: check if cluster is healthy after upgrade

Reviewed-by: Venky Shankar <vshankar@redhat.com>
2 years agoMerge pull request #50307 from dparmar18/fix_pendingreleasenotes
Rishabh Dave [Wed, 29 Mar 2023 16:12:14 +0000 (21:42 +0530)]
Merge pull request #50307 from dparmar18/fix_pendingreleasenotes

PendingReleaseNotes: remove duplicate note and unwanted tracker link

Reviewed-by: Rishabh Dave <ridave@redhat.com>
Reviewed-by: Venky Shankar <vshankar@redhat.com>
2 years agoMerge pull request #50735 from yangdongsheng/rbd_quiesce
Ilya Dryomov [Wed, 29 Mar 2023 15:19:02 +0000 (17:19 +0200)]
Merge pull request #50735 from yangdongsheng/rbd_quiesce

librbd: fix wrong attribute for rbd_quiesce_complete api

Reviewed-by: Mykola Golub <mgolub@suse.com>
Reviewed-by: Ilya Dryomov <idryomov@gmail.com>
2 years agorgw: set init_check_compat when bucket sync status doesn't exist 50746/head
Casey Bodley [Wed, 29 Mar 2023 14:57:04 +0000 (10:57 -0400)]
rgw: set init_check_compat when bucket sync status doesn't exist

Signed-off-by: Casey Bodley <cbodley@redhat.com>
2 years agoqa/suites/fs: check if cluster is healthy after upgrade 50686/head
Rishabh Dave [Mon, 27 Mar 2023 12:36:28 +0000 (18:06 +0530)]
qa/suites/fs: check if cluster is healthy after upgrade

Signed-off-by: Rishabh Dave <ridave@redhat.com>
2 years agodoc/rados: add link to ops/health-checks.rst 50744/head
Zac Dover [Wed, 29 Mar 2023 14:25:23 +0000 (00:25 +1000)]
doc/rados: add link to ops/health-checks.rst

Add an internal navigation link to
doc/rados/operations/health-checks.rst.

Signed-off-by: Zac Dover <zac.dover@proton.me>
(cherry picked from commit f2f968f65bd1f1a6975fd340c66703cea335dcf6)

2 years agodoc/start: update "notify us" section 50742/head
Zac Dover [Wed, 29 Mar 2023 13:47:38 +0000 (23:47 +1000)]
doc/start: update "notify us" section

Rewrite the "Notify Us" section in doc/start/documenting-ceph.rst so
that it is written in English sentences.

Signed-off-by: Zac Dover <zac.dover@proton.me>
2 years agoMerge pull request #50723 from zdover23/wip-doc-2023-03-28-documenting-ceph-squashing
zdover23 [Wed, 29 Mar 2023 12:54:58 +0000 (22:54 +1000)]
Merge pull request #50723 from zdover23/wip-doc-2023-03-28-documenting-ceph-squashing

doc/start: documenting-ceph - add squash procedure

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
2 years agoqa/workunit/fs: print commands for making debugging easier 50687/head
Rishabh Dave [Mon, 27 Mar 2023 12:41:51 +0000 (18:11 +0530)]
qa/workunit/fs: print commands for making debugging easier

Print the commands and their arguments as they are being executed for
kernel_untar_tar.sh so that it's easier to debug when a teuthology
failure occurs due to it.

Signed-off-by: Rishabh Dave <ridave@redhat.com>
2 years agolibrbd: fix wrong attribute for rbd_quiesce_complete api 50735/head
Dongsheng Yang [Wed, 15 Mar 2023 06:54:39 +0000 (06:54 +0000)]
librbd: fix wrong attribute for rbd_quiesce_complete api

When we use rbd_quiesce_complete api, we got an error:
/usr/bin/ld: undefined reference to `rbd_quiesce_complete'

Then we found the problem is the symbol of rbd_quiesce_complete
in librbd.so is LOCAL. After some investigation, we found
the attribute of rbd_quiesce_complete api is CEPH_RADOS_API
rather than expected CEPH_RBD_API.

Fixes: https://tracker.ceph.com/issues/59208
Signed-off-by: Dongsheng Yang <dongsheng.yang.linux@gmail.com>
2 years agoMerge pull request #50718 from rhcs-dashboard/use-only-counter-dump-schema
Avan [Wed, 29 Mar 2023 08:52:51 +0000 (14:22 +0530)]
Merge pull request #50718 from rhcs-dashboard/use-only-counter-dump-schema

exporter: user only counter dump/schema commands for extacting counters

Reviewed-by: Pere Diaz Bou <pdiazbou@redhat.com>
2 years agoceph-volume: fix drive-group issue 49586/head
Mohan Sharma [Tue, 27 Dec 2022 06:01:04 +0000 (11:31 +0530)]
ceph-volume: fix drive-group issue

The drive-group expects the batch_args to be a string,
however in the current version it is passed as a list
of one element, thus calling the first item of the list solves the issue.

Fixes: https://tracker.ceph.com/issues/59203
Signed-off-by: Mohan Sharma <mohan7427@gmail.com>
2 years agoMerge pull request #48183 from lxbsz/wip-57591
Xiubo Li [Wed, 29 Mar 2023 04:44:07 +0000 (12:44 +0800)]
Merge pull request #48183 from lxbsz/wip-57591

qa: enable kclient test for newop test

2 years agoqa: enable kclient test for newop test 48183/head
Xiubo Li [Tue, 21 Mar 2023 01:51:49 +0000 (09:51 +0800)]
qa: enable kclient test for newop test

The kclient have already fix this. This will only enable the upstream
kclient with the testing branch, the downstream ones may not include
the fixing yet, so skip them for now.

The nautilus will only support the syntax v1. And for kclient there
is not need to do the upgrade.

Fixes: https://tracker.ceph.com/issues/57591
Signed-off-by: Xiubo Li <xiubli@redhat.com>
2 years agoqa: add one func to distiguish the mount syntax version
Xiubo Li [Tue, 21 Mar 2023 08:26:45 +0000 (16:26 +0800)]
qa: add one func to distiguish the mount syntax version

Fixes: https://tracker.ceph.com/issues/57591
Signed-off-by: Xiubo Li <xiubli@redhat.com>
2 years agoqa: remove the FuseMount/KernelMount check in test_newops.py
Xiubo Li [Tue, 21 Mar 2023 01:47:42 +0000 (09:47 +0800)]
qa: remove the FuseMount/KernelMount check in test_newops.py

Will use the postmerge fragment to check this.

Fixes: https://tracker.ceph.com/issues/57591
Signed-off-by: Xiubo Li <xiubli@redhat.com>
2 years agoqa: introduce postmerge for fuse/kclient mounts
Xiubo Li [Tue, 21 Mar 2023 01:37:58 +0000 (09:37 +0800)]
qa: introduce postmerge for fuse/kclient mounts

Suggested by Patrick and this will check mounter's type.

Fixes: https://tracker.ceph.com/issues/57591
Signed-off-by: Xiubo Li <xiubli@redhat.com>
2 years agoqa: remove the '0-' prefix
Xiubo Li [Wed, 22 Mar 2023 00:56:46 +0000 (08:56 +0800)]
qa: remove the '0-' prefix

Both the nautilus and pacific will be run in parallel.

Fixes: https://tracker.ceph.com/issues/57591
Signed-off-by: Xiubo Li <xiubli@redhat.com>
2 years agodoc/start: documenting-ceph - add squash procedure 50723/head
Zac Dover [Tue, 28 Mar 2023 09:29:40 +0000 (19:29 +1000)]
doc/start: documenting-ceph - add squash procedure

Add a procedure to doc/start/documenting-ceph.rst that explains how to
perform an interactive rebase to squash commits.

Signed-off-by: Zac Dover <zac.dover@proton.me>
2 years agoexporter: user only counter dump/schema commands for extacting counters 50718/head
Avan Thakkar [Tue, 28 Mar 2023 13:32:47 +0000 (19:02 +0530)]
exporter: user only counter dump/schema commands for extacting counters

Fixes: https://tracker.ceph.com/issues/59191
Signed-off-by: Avan Thakkar <athakkar@redhat.com>
Ceph exporter no more required the output of perf dump/schema, as the ``counter dump`` command
returns both labeled and unlabeled perf counters which exporter can fetch and export.
Removed the ``exporter_get_labeled_counters`` confiug option as exporter will now export
all the counters, labeled or unlabeled.
Also the fix includes the support for renaming the metrics name of rgw multi-site and
adding labels to it, similar to what is there in prometheus module.

2 years agoMerge pull request #50713 from zdover23/wip-doc-2023-03-28-glossary-cephx
Anthony D'Atri [Tue, 28 Mar 2023 12:28:49 +0000 (08:28 -0400)]
Merge pull request #50713 from zdover23/wip-doc-2023-03-28-glossary-cephx

doc/glossary: improve "CephX" entry

2 years agoMerge pull request #50529 from rhcs-dashboard/dashboard-edit-rgw-multisite
Aashish Sharma [Tue, 28 Mar 2023 11:38:17 +0000 (17:08 +0530)]
Merge pull request #50529 from rhcs-dashboard/dashboard-edit-rgw-multisite

mgr/dashboard: edit realm in rgw-multisite

Reviewed-by: Pere Diaz Bou <pdiazbou@redhat.com>
2 years agomgr/dashboard: rgw role creation form 50426/head
Pere Diaz Bou [Thu, 2 Mar 2023 12:17:25 +0000 (13:17 +0100)]
mgr/dashboard: rgw role creation form

Fixes: https://tracker.ceph.com/issues/59187
Signed-off-by: Pere Diaz Bou <pdiazbou@redhat.com>
Signed-off-by: Nizamudeen A <nia@redhat.com>
2 years agodoc/glossary: improve "CephX" entry 50713/head
Zac Dover [Tue, 28 Mar 2023 08:42:11 +0000 (18:42 +1000)]
doc/glossary: improve "CephX" entry

Improve the glossary entry for "CephX".

Signed-off-by: Zac Dover <zac.dover@proton.me>
2 years agomgr/dashboard: Add unit test for realm 50529/head
Aashish Sharma [Mon, 27 Mar 2023 13:36:21 +0000 (19:06 +0530)]
mgr/dashboard: Add unit test for realm

Fixes: https://tracker.ceph.com/issues/59171
Signed-off-by: Aashish Sharma <aasharma@redhat.com>
2 years agoMerge pull request #50653 from xxhdx1985126/wip-exist-clean 49732/head
Yingxin [Tue, 28 Mar 2023 06:07:43 +0000 (14:07 +0800)]
Merge pull request #50653 from xxhdx1985126/wip-exist-clean

crimson/os/seastore/cache: consider EXIST_CLEAN extents as pending ones

Reviewed-by: Yingxin Cheng <yingxin.cheng@intel.com>
2 years agocrimson/os/seastore/cache: use CachedExtent::is_mutable() where appropriate 50653/head
Yingxin Cheng [Mon, 27 Mar 2023 09:38:17 +0000 (17:38 +0800)]
crimson/os/seastore/cache: use CachedExtent::is_mutable() where appropriate

CachedExtent::is_mutable() should only be used to check whether need to
call duplicate_for_write(extent).

Signed-off-by: Yingxin Cheng <yingxin.cheng@intel.com>
2 years agoMerge pull request #50697 from zdover23/wip-doc-2023-03-28-glossary-scrubbing
zdover23 [Tue, 28 Mar 2023 00:39:16 +0000 (10:39 +1000)]
Merge pull request #50697 from zdover23/wip-doc-2023-03-28-glossary-scrubbing

doc/glossary: add "Scrubbing"

Reviewed-by: Anthony D'Atri <anthony.datri@gmail.com>
2 years agodoc/glossary: add "Scrubbing" 50697/head
Zac Dover [Mon, 27 Mar 2023 20:48:32 +0000 (06:48 +1000)]
doc/glossary: add "Scrubbing"

Add "Scrubbing" to the glossary.

Co-authored-by: Anthony D'Atri <anthony.datri@gmail.com>
Signed-off-by: Zac Dover <zac.dover@proton.me>
2 years agotools/cephfs: include lost+found in scan_links 50692/head
Patrick Donnelly [Mon, 27 Mar 2023 18:47:29 +0000 (14:47 -0400)]
tools/cephfs: include lost+found in scan_links

Otherwise, any injected dentries have incorrect first snapids.

Fixes: https://tracker.ceph.com/issues/59183
Signed-off-by: Patrick Donnelly <pdonnell@redhat.com>
2 years agoqa/cephadm: add check that iscsi daemon /etc/hosts matches host /etc/hosts 49824/head
Adam King [Mon, 30 Jan 2023 16:27:09 +0000 (11:27 -0500)]
qa/cephadm: add check that iscsi daemon /etc/hosts matches host /etc/hosts

To make sure we aren't being affected by any podman introduced
changes to the /etc/hosts file and test that we're properly
mounting /etc/hosts in our daemon containers

Signed-off-by: Adam King <adking@redhat.com>
2 years agocephadm: mount host /etc/hosts for containers in podman deployments
Adam King [Sat, 21 Jan 2023 23:44:22 +0000 (18:44 -0500)]
cephadm: mount host /etc/hosts for containers in podman deployments

Podman messes with the /etc/hosts file in certain version. There
was already a past issue with it placing the container name
there fixed by https://github.com/ceph/ceph/pull/42242. This time
it is adding an entry for "host.containers.internal" (seems to be
podman 4.1 onward currently). Iscsi figures out the FQDN for a
host by running

python3 -c 'import socket; print(socket.getfqdn())

which is resolving to "host.containers.internal" when run in
the container with the podman modified /etc/hosts.

There is also an issue with grafana dashboard with
this entry present

Passing --no-hosts resolves this, but I think in the past
we avoided that due to not wanting to break deployments
where host name resolution was handled using /etc/hosts.
That's why we had that workaround previously linked. This
time I'm not sure such a workaround exists. The try here
is to mount a copy of the host's version of /etc/hosts
into the iscsi container. That copy won't have the extra
entry podman adds in but will have any user created entries in
case they were actually using it for host name resolution.
If /etc/hosts file isn't present for whatever reason, we're
assuming that this user isn't using /etc/hosts for hostname
resolution, and just going back to passing --no-hosts.

Fixes: https://tracker.ceph.com/issues/58532
Fixes: https://tracker.ceph.com/issues/57018
Signed-off-by: Adam King <adking@redhat.com>
2 years agoMerge pull request #48937 from adk3798/device-ls-size
Adam King [Mon, 27 Mar 2023 17:55:31 +0000 (13:55 -0400)]
Merge pull request #48937 from adk3798/device-ls-size

mgr/orchestrator: fix device size in `orch device ls` output

Reviewed-by: Redouane Kachach <rkachach@redhat.com>