Deep, Robust and Single Shot 3D Multi-Person Human Pose Estimation from Monocular Images
Abstract
In this paper, we propose a new single-shot method for multi-person 3D pose estimation from monocular RGB images. Our model jointly learns to locate the human joints in the image, to estimate their 3D coordinates, and to group these predictions into full human skeletons. Our approach leverages and extends the Stacked Hourglass Network and its multi-scale feature learning to handle multi-person scenes. In particular, we exploit Occlusion-Robust Pose Maps (ORPM) to fully describe several 3D human poses even under strong occlusions or cropping. Joint grouping and human pose estimation for an arbitrary number of people are then performed using associative embedding. We evaluate our method on the challenging CMU Panoptic dataset and demonstrate that it outperforms the state of the art.