Z. Cao, T. Simon, S. Wei, and Y. Sheikh, Realtime multi-person 2d pose estimation using part affinity fields, 2017.

A. Newell, Z. Huang, and J. Deng, Associative embedding: End-to-end learning for joint detection and grouping, 2017.

G. Rogez, P. Weinzaepfel, and C. Schmid, Lcr-net: Localization-classification-regression for human pose, 2017.

A. Zanfir, E. Marinoiu, M. Zanfir, A. Popa, and C. Sminchisescu, Deep network for the integrated 3d sensing of multiple people in natural images, NIPS, 2018.

A. Zanfir, E. Marinoiu, and C. Sminchisescu, Monocular 3d pose and shape estimation of multiple people in natural scenes - the importance of multiple scene constraints, CVPR, 2018.

D. Mehta, O. Sotnychenko, F. Mueller, W. Xu, S. Sridhar et al., Single-shot multi-person 3d body pose estimation from monocular rgb input, 2017.

A. Newell, K. Yang, and J. Deng, Stacked hourglass networks for human pose estimation, ECCV, 2016.

D. Mehta, H. Rhodin, D. Casas, P. Fua, O. Sotnychenko et al., Monocular 3d human pose estimation in the wild using improved cnn supervision, 2017.

H. Joo, T. Simon, X. Li, H. Liu, L. Tan et al., Panoptic studio: A massively multiview system for social interaction capture, 2019.

M. Fabbri, F. Lanzi, S. Calderara, A. Palazzi, R. Vezzani et al., Learning to detect and track visible and occluded body joints in a virtual world, ECCV, 2018.

S. Wei, V. Ramakrishna, T. Kanade, and Y. Sheikh, Convolutional pose machines, CVPR, 2016.

G. Papandreou, T. Zhu, N. Kanazawa, A. Toshev, J. Tompson et al., Towards accurate multi-person pose estimation in the wild, 2017.

K. He, G. Gkioxari, P. Dollár, and R. Girshick, Mask r-cnn, ICCV, 2017.

J. Martinez, R. Hossain, J. Romero, and J. J. Little, A simple yet effective baseline for 3d human pose estimation, 2017.

H. Fang, Y. Xu, W. Wang, X. Liu, and S. Zhu, Learning pose grammar to encode human body configuration for 3d pose estimation, AAAI, 2018.

F. Bogo, A. Kanazawa, C. Lassner, P. Gehler, J. Romero et al., Keep it smpl: Automatic estimation of 3d human pose and shape from a single image, ECCV, 2016.

E. Simo-Serra, A. Ramisa, G. Alenyà, C. Torras, and F. Moreno-Noguer, Single image 3d human pose estimation from noisy observations, CVPR, 2012.

C. Wang, Y. Wang, Z. Lin, A. L. Yuille, and W. Gao, Robust estimation of 3d human poses from a single image, CVPR, 2014.

V. Ramakrishna, T. Kanade, and Y. Sheikh, Reconstructing 3d human pose from 2d image landmarks, ECCV, 2012.

C. Chen and D. Ramanan, , 2017.

F. Moreno-Noguer, 3d human pose estimation from a single image via distance matrix regression, 2017.

B. X. Nie, P. Wei, and S. Zhu, Monocular 3d human pose estimation by predicting depth on joints, 2017.

A. Agarwal and B. Triggs, 3d human pose from silhouettes by relevance vector regression, CVPR, 2004.

G. Rogez, J. Rihan, S. Ramalingam, C. Orrite, and P. H. Torr, Randomized trees for human pose detection, CVPR, 2008.

C. Sminchisescu and A. Jepson, Generative modeling for continuous nonlinearly embedded visual inference, ICML, 2004.

L. Bo, C. Sminchisescu, A. Kanaujia, and D. Metaxas, Fast algorithms for large scale conditional 3d prediction, CVPR, 2008.

G. Shakhnarovich, P. Viola, and T. Darrell, Fast pose estimation with parameter sensitive hashing, ICCV, 2003.

G. Pavlakos, X. Zhou, K. G. Derpanis, and K. Daniilidis, Coarse-to-fine volumetric prediction for single-image 3d human pose, 2017.

D. Mehta, S. Sridhar, O. Sotnychenko, H. Rhodin, M. Shafiei et al., Vnect: Real-time 3d human pose estimation with a single rgb camera, 2017.

A. Popa, M. Zanfir, and C. Sminchisescu, Deep multitask architecture for integrated 2d and 3d human sensing, 2017.

W. Chen, H. Wang, Y. Li, H. Su, Z. Wang et al., Synthesizing training images for boosting human 3d pose estimation, 2016.

G. Rogez and C. Schmid, Mocap-guided data augmentation for 3d pose estimation in the wild, NIPS, 2016.
URL: https://hal.archives-ouvertes.fr/hal-01389486

S. Li and A. B. Chan, 3d human pose estimation from monocular images with deep convolutional neural network, ACCV, 2014.

B. Tekin, A. Rozantsev, V. Lepetit, and P. Fua, Direct prediction of 3d body poses from motion compensated sequences, CVPR, 2016.

X. Zhou, M. Zhu, S. Leonardos, K. G. Derpanis, and K. Daniilidis, Sparseness meets deepness: 3d human pose estimation from monocular video, CVPR, 2016.

G. Varol, J. Romero, X. Martin, N. Mahmood, M. J. Black et al., Learning from synthetic humans, 2017.
URL: https://hal.archives-ouvertes.fr/hal-01505711

X. Sun, J. Shang, S. Liang, and Y. Wei, Compositional human pose regression, 2017.

E. Simo-Serra, A. Quattoni, C. Torras, and F. Moreno-Noguer, A joint model for 2d and 3d pose estimation from a single image, CVPR, 2013.

F. Zhou and F. De la Torre, Spatio-temporal matching for human detection in video, ECCV, 2014.

B. Tekin, P. Márquez-Neila, M. Salzmann, and P. Fua, Learning to fuse 2d and 3d image cues for monocular body pose estimation, 2017.

X. Zhou, Q. Huang, X. Sun, X. Xue, and Y. Wei, Towards 3d human pose estimation in the wild: a weakly-supervised approach, 2017.

G. Pavlakos, X. Zhou, and K. Daniilidis, Ordinal depth supervision for 3d human pose estimation, CVPR, 2018.

Y. Chen, C. Shen, X. Wei, L. Liu, and J. Yang, Adversarial learning of structure-aware fully convolutional networks for landmark localization, 2017.

W. Yang, W. Ouyang, X. Wang, J. Ren, H. Li et al., CVPR, 2018.

D. P. Kingma and J. Ba, Adam: A method for stochastic optimization, 2014.

C. Ionescu, D. Papava, V. Olaru, and C. Sminchisescu, Human3.6m: Large scale datasets and predictive methods for 3d human sensing in natural environments, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.36, pp.1325-1339, 2014.