Improved Strategies for HPE Employing Learning-by-Synthesis Approaches


Conference Paper

© 2017 IEEE. The first contribution of this paper is the presentation of a synthetic video database where the groundtruth of 2D facial landmarks and 3D head poses is available to be used for training and evaluating Head Pose Estimation (HPE) methods. The database is publicly available and contains videos of users performing guided and natural movements. The second and main contribution is the submission of a hybrid method for HPE based on Pose from Ortography and Scaling by Iterations (POSIT). The 2D landmark detection is performed using Random Cascaded-Regression Copse (R-CR-C). For the training stage we use, state of the art labeled databases. Learning-by-synthesis approach has been also used to augment the size of the database employing the synthetic database. HPE accuracy is tested by using two literature 3D head models. The tracking method proposed has been compared with state of the art methods using Supervised Descent Regressors (SDR) in terms of accuracy, achieving an improvement of 60%.

Full Text

Duke Authors

Cited Authors

  • Larumbe, A; Ariz, M; Bengoechea, JJ; Segura, R; Cabeza, R; Villanueva, A

Published Date

  • January 19, 2018

Published In

  • Proceedings 2017 Ieee International Conference on Computer Vision Workshops, Iccvw 2017

Volume / Issue

  • 2018-January /

Start / End Page

  • 1545 - 1554

International Standard Book Number 13 (ISBN-13)

  • 9781538610343

Digital Object Identifier (DOI)

  • 10.1109/ICCVW.2017.182

Citation Source

  • Scopus