PandaNet: Anchor-Based Single-Shot Multi-Person 3D Pose Estimation - Sorbonne Université Accéder directement au contenu
Communication Dans Un Congrès Année : 2020

PandaNet: Anchor-Based Single-Shot Multi-Person 3D Pose Estimation

Résumé

Recently, several deep learning models have been pro-posed for 3D human pose estimation. Nevertheless, mostof these approaches only focus on the single-person caseor estimate 3D pose of a few people at high resolution.Furthermore, many applications such as autonomous driv-ing or crowd analysis require pose estimation of a largenumber of people possibly at low-resolution. In this work,we present PandaNet (Pose estimAtioN and DectectionAnchor-based Network), a new single-shot, anchor-basedand multi-person 3D pose estimation approach. The pro-posed model performs bounding box detection and, for eachdetected person, 2D and 3D pose regression into a singleforward pass. It does not need any post-processing to re-group joints since the network predicts a full 3D pose foreach bounding box and allows the pose estimation of a pos-sibly large number of people at low resolution. To managepeople overlapping, we introduce a Pose-Aware Anchor Se-lection strategy. Moreover, as imbalance exists between dif-ferent people sizes in the image, and joints coordinates have ifferent uncertainties depending on these sizes, we pro-pose a method to automatically optimize weights associatedto different people scales and joints for efficient training.PandaNet surpasses previous single-shot methods on sev-eral challenging datasets: a multi-person urban virtual butvery realistic dataset (JTA Dataset), and two real world 3Dmulti-person datasets (CMU Panoptic and MuPoTS-3D).
Fichier principal
Vignette du fichier
Benzine_PandaNet_Anchor-Based_Single-Shot_Multi-Person_3D_Pose_Estimation_CVPR_2020_paper.pdf (971.49 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-02926220 , version 1 (31-08-2020)

Identifiants

Citer

Abdallah Benzine, Florian Chabot, Bertrand Luvison, Quoc Cuong Pham, Catherine Achard. PandaNet: Anchor-Based Single-Shot Multi-Person 3D Pose Estimation. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Jun 2020, Seattle, United States. pp.6855-6864, ⟨10.1109/CVPR42600.2020.00689⟩. ⟨hal-02926220⟩
192 Consultations
285 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More