Deep, Robust and Single Shot 3D Multi-Person Human Pose Estimation from Monocular Images - Sorbonne Université
Communication Dans Un Congrès Année : 2019

Deep, Robust and Single Shot 3D Multi-Person Human Pose Estimation from Monocular Images

Résumé

In this paper, we propose a new single shot method for multi-person 3D pose estimation, from monocular RGB images. Our model jointly learns to locate the human joints in the image, to estimate their 3D coordinates and to group these predictions into full human skeletons. Our approach leverages and extends the Stacked Hourglass Network and its multi-scale feature learning to manage multi-person situations. Thus, we exploit the Occlusions Robust Pose Maps (ORPM) to fully describe several 3D human poses even in case of strong occlusions or cropping. Then, joint grouping and human pose estimation for an arbitrary number of people are performed using associative embedding. We evaluate our method on the challenging CMU Panoptic dataset, and demonstrate that it achieves better results than the state of the art.
Fichier principal
Vignette du fichier
ICIP_Benzine.pdf (5.1 Mo) Télécharger le fichier
Origine Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-02459886 , version 1 (15-05-2024)

Identifiants

Citer

Abdallah Benzine, Bertrand Luvison, Quoc Cuong Pham, Catherine Achard. Deep, Robust and Single Shot 3D Multi-Person Human Pose Estimation from Monocular Images. 2019 IEEE International Conference on Image Processing (ICIP), The Institute of Electrical and Electronics Engineers Signal Processing Society, Sep 2019, Taipei, Taiwan. pp.584-588, ⟨10.1109/ICIP.2019.8803833⟩. ⟨hal-02459886⟩
142 Consultations
37 Téléchargements

Altmetric

Partager

More