Single-shot 3D multi-person pose estimation in complex images

Abdallah Benzine; Bertrand Luvison; Quoc-Cuong Pham; Catherine Achard

doi:10.1016/j.patcog.2020.107534

Journal article, Pattern Recognition, Year: 2021


Abdallah Benzine

Role: Author

Institut des Systèmes Intelligents et de Robotique

Bertrand Luvison

Role: Author

Laboratoire d'Intégration des Systèmes et des Technologies

Quoc-Cuong Pham

Role: Author
PersonId: 855568
IdHAL: quoc-cuong-pham

Laboratoire d'Intégration des Systèmes et des Technologies

Catherine Achard

Role: Author
PersonId: 182097
IdHAL: catherine-achard
ORCID: 0000-0002-5790-0830
IdRef: 13796658X

Institut des Systèmes Intelligents et de Robotique

Sorbonne Université

Perception, Interaction, Robotique sociales

Abstract

In this paper, we propose a new single-shot method for multi-person 3D human pose estimation in complex images. The model jointly learns to locate the human joints in the image, to estimate their 3D coordinates, and to group these predictions into full human skeletons. The proposed method handles a variable number of people and does not need bounding boxes to estimate the 3D poses. It leverages and extends the Stacked Hourglass Network and its multi-scale feature learning to manage multi-person situations. We exploit a robust 3D human pose formulation to fully describe several 3D human poses, even under strong occlusions or crops. Joint grouping and human pose estimation for an arbitrary number of people are then performed using the associative embedding method. Our approach significantly outperforms the state of the art on the challenging CMU Panoptic dataset. Furthermore, it leads to good results on the complex, synthetic images of the newly proposed JTA Dataset.
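The grouping step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes that each detected joint carries a scalar embedding tag, that joints of the same person have similar tags, and that a simple greedy assignment is acceptable. The function name `group_joints_by_tag`, the `(x, y, z, tag)` tuple layout, and the `tag_threshold` value are all hypothetical choices made for illustration.

```python
from statistics import mean


def group_joints_by_tag(detections, tag_threshold=1.0):
    """Greedily group joint detections into skeletons by tag proximity.

    detections: dict mapping a joint name to a list of (x, y, z, tag) tuples.
    Returns a list of people, each a dict mapping joint name to (x, y, z).
    Illustrative only; the tag format and threshold are assumptions.
    """
    people = []  # each entry: {"joints": {...}, "tags": [...]}
    for joint_name, candidates in detections.items():
        for x, y, z, tag in candidates:
            # Look for the closest existing person that still lacks this joint.
            best, best_dist = None, tag_threshold
            for person in people:
                if joint_name in person["joints"]:
                    continue
                dist = abs(mean(person["tags"]) - tag)
                if dist < best_dist:
                    best, best_dist = person, dist
            # No person is close enough in tag space: start a new skeleton.
            if best is None:
                best = {"joints": {}, "tags": []}
                people.append(best)
            best["joints"][joint_name] = (x, y, z)
            best["tags"].append(tag)
    return [person["joints"] for person in people]


# Two people, two joint types; the tags (last value) separate the people.
example = {
    "head": [(10, 20, 1.0, 0.1), (50, 22, 2.0, 5.0)],
    "neck": [(51, 30, 2.1, 5.2), (11, 31, 1.1, 0.0)],
}
skeletons = group_joints_by_tag(example)
print(len(skeletons))
```

Because detections are grouped purely by tag similarity, the number of people does not have to be known in advance and no bounding boxes are involved, which matches the properties the abstract claims. A more careful version would likely use an optimal assignment per joint type rather than this greedy pass.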

Keywords

multi-person; 3D human pose; deep learning

Domains

Computer Vision and Pattern Recognition [cs.CV]; Artificial Intelligence [cs.AI]

Main file

Abd1.pdf (2.67 MB)

Origin: Files produced by the author(s)


https://hal.sorbonne-universite.fr/hal-02926239

Submitted on: Tuesday, September 1, 2020, 10:52:42

Last modified on: Wednesday, October 30, 2024, 13:28:36

Long-term archiving on: Wednesday, December 2, 2020, 13:20:09

Dates and versions

hal-02926239, version 1 (01-09-2020)

Identifiers

HAL Id: hal-02926239, version 1
DOI: 10.1016/j.patcog.2020.107534

Cite

Abdallah Benzine, Bertrand Luvison, Quoc-Cuong Pham, Catherine Achard. Single-shot 3D multi-person pose estimation in complex images. Pattern Recognition, 2021, ⟨10.1016/j.patcog.2020.107534⟩. ⟨hal-02926239⟩


Collections

CEA CNRS ISIR DRT LIST SORBONNE-UNIVERSITE SU-SCIENCES ISIR_PIROS GS-ENGINEERING GS-COMPUTER-SCIENCE GS-SPORT-HUMAN-MOVEMENT

131 views

192 downloads
