]> git.apps.os.sepia.ceph.com Git - ceph.git/commit
os/bluestore: avoid premature onode release. 43770/head
authorIgor Fedotov <ifed@suse.com>
Tue, 2 Nov 2021 12:03:39 +0000 (15:03 +0300)
committerIgor Fedotov <ifedotov@suse.com>
Tue, 14 Dec 2021 14:54:23 +0000 (17:54 +0300)
commit96f0efe6d5307a55bea32f7216ef9511da0c5a47
tree369961724b22a384e4af96ea8ba2bb1312e4fdb8
parent44d706c353865abc44e284bd684f5328031579c9
os/bluestore: avoid premature onode release.

This was observed when onode's removal is followed by reading
and the latter causes object release before the removal is finalized.
The root cause is an improper 'pinned' state assessment in Onode::get

More detailed overview is:
At some point Onode::get() might face the case when nref == 2 and pinned = true
which means parallel incomplete put is running on the onode - ref count is
decremented but pinned state is still unmodified (and even lock hasn't been
acquired yet).
This might finally result in two puts racing over the same onode with nref == 2
which finally results in a premature onode release:
  // nref =3, pinned = 1
  // Thread 1                   Thread 2
  //   o->put()                   o->get()
  //   --nref(n = 2, pinned=1)
  //                              nref++ (n=3, pinned = 1)
  //                              return
  //                              ...
  //                              o->put()
  //                              --nref(n = 2)
  //                              pinned = 0,
  //                              --nref(n = 1)
  //                              ocs->_unpin_and_rm(o) -> o->put()
  //                                ...
  //                                --nref(n = 0)
  //                                release o
  //  o->c->get_onode_cache()
  //  FAULT!
  //
The suggested fix is to introduce additional atomic counter tracking
running put() functions. And permit onode release when both regular
nref and put_nref are both equal to zero.

Fixes: https://tracker.ceph.com/issues/53002
Signed-off-by: Igor Fedotov <igor.fedotov@croit.io>
src/os/bluestore/BlueStore.cc
src/os/bluestore/BlueStore.h