Journal articles

Single-shot 3D multi-person pose estimation in complex images

Abstract: In this paper, we propose a new single-shot method for multi-person 3D human pose estimation in complex images. The model jointly learns to locate the human joints in the image, to estimate their 3D coordinates, and to group these predictions into full human skeletons. The proposed method handles a variable number of people and does not need bounding boxes to estimate the 3D poses. It leverages and extends the Stacked Hourglass Network and its multi-scale feature learning to manage multi-person situations. We exploit a robust 3D human pose formulation to fully describe several 3D human poses, even in cases of strong occlusion or cropping. Joint grouping and human pose estimation for an arbitrary number of people are then performed using the associative embedding method. Our approach significantly outperforms the state of the art on the challenging CMU Panoptic dataset. Furthermore, it leads to good results on the complex, synthetic images of the newly proposed JTA Dataset.
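The grouping step named in the abstract can be illustrated with a small sketch. It is not the authors' implementation: it assumes each detected joint carries a scalar embedding ("tag"), and it uses a simple greedy assignment with a hypothetical distance threshold. The names `Detection`, `Skeleton`, `group_by_tag` and `tag_threshold` are invented for this illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Detection:
    joint_type: int   # index of the joint type (hypothetical layout, e.g. 0 = head)
    position: tuple   # (x, y, z) estimate for this joint
    tag: float        # scalar embedding predicted for this detection (assumption)


@dataclass
class Skeleton:
    joints: dict = field(default_factory=dict)  # joint_type -> Detection

    def mean_tag(self) -> float:
        # Average tag of the joints already assigned to this skeleton.
        return sum(d.tag for d in self.joints.values()) / len(self.joints)


def group_by_tag(detections, tag_threshold=0.5):
    """Greedily group joint detections whose tags are close to each other.

    A detection joins the skeleton with the nearest mean tag, provided that
    skeleton has no joint of the same type yet and the tag distance is below
    tag_threshold. Otherwise it starts a new skeleton.
    """
    skeletons = []
    for det in sorted(detections, key=lambda d: d.joint_type):
        best, best_dist = None, tag_threshold
        for sk in skeletons:
            if det.joint_type in sk.joints:
                continue  # each skeleton holds at most one joint per type
            dist = abs(det.tag - sk.mean_tag())
            if dist < best_dist:
                best, best_dist = sk, dist
        if best is None:
            best = Skeleton()
            skeletons.append(best)
        best.joints[det.joint_type] = det
    return skeletons


# Two people with tags near 0 and near 2; the second has one extra joint.
detections = [
    Detection(0, (0.0, 0.0, 0.0), 0.05),
    Detection(0, (5.0, 0.0, 0.0), 2.10),
    Detection(1, (0.0, 1.0, 0.0), -0.02),
    Detection(1, (5.0, 1.0, 0.0), 1.95),
    Detection(2, (5.0, 2.0, 0.0), 2.00),
]
skeletons = group_by_tag(detections)
```

Because skeletons are created on demand, the number of people does not have to be known in advance, which matches the abstract's claim that the method works without bounding boxes for a variable number of people.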

Cited literature: 46 references
Contributor: Catherine Achard
Submitted on: Tuesday, September 1, 2020 - 10:52:42 AM
Last modification on: Wednesday, January 26, 2022 - 3:14:22 AM
Long-term archiving on: Wednesday, December 2, 2020 - 1:20:09 PM





Abdallah Benzine, Bertrand Luvison, Quoc-Cuong Pham, Catherine Achard. Single-shot 3D multi-person pose estimation in complex images. Pattern Recognition, Elsevier, In press, ⟨10.1016/j.patcog.2020.107534⟩. ⟨hal-02926239⟩


