Single-Frame Indexing for 3D Hand Pose Estimation


Conference Paper

© 2015 IEEE. Hand pose estimation from 3D sensor data matches a point cloud to a hand model, and has broad applications from gestural interfaces to scene understanding. We propose a novel scheme to index into a database of precomputed hand poses to initialize the match. Our index describes 2D hand silhouettes, which can be computed from either depth maps or standard video, in the form of simple yet expressive signatures. We compare signatures to each other through a new variant of the Earth Mover's Distance that makes small distances in feature space correlate highly with those in pose space. We present a new technique that uses a depth sensor and a sensor glove to create databases of real images and ground-truth poses for both training and testing. We report quantitative results showing state-of-the-art accuracy and speed for both gesture classification and joint-pose regression, even when our 2D single-frame method is compared with methods that employ RGB-D features or multi-sensor inputs.
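
The indexing idea in the abstract (describe each silhouette by a signature, then retrieve the closest precomputed pose) can be illustrated with a minimal sketch. This is not the authors' method: it uses the standard 1D Earth Mover's Distance between histograms, not the new variant the paper proposes, and the signature layout, the function names `emd_1d` and `nearest_pose`, and the toy database are all hypothetical.

```python
from itertools import accumulate


def emd_1d(p, q):
    """Standard 1D Earth Mover's Distance between two equal-length histograms.

    Both histograms are normalized to unit mass; the distance is the sum of
    absolute differences between their cumulative sums (unit bin spacing).
    This is the textbook formulation, not the paper's variant.
    """
    if len(p) != len(q):
        raise ValueError("histograms must have the same number of bins")
    total_p, total_q = sum(p), sum(q)
    if total_p <= 0 or total_q <= 0:
        raise ValueError("histograms must have positive total mass")
    cdf_p = accumulate(x / total_p for x in p)
    cdf_q = accumulate(x / total_q for x in q)
    return sum(abs(a - b) for a, b in zip(cdf_p, cdf_q))


def nearest_pose(query, database):
    """Return (label, distance) for the database signature closest to query."""
    label, signature = min(database, key=lambda entry: emd_1d(query, entry[1]))
    return label, emd_1d(query, signature)


# Hypothetical toy database: (pose label, silhouette signature histogram).
database = [
    ("open_hand", [1, 1, 1, 1]),
    ("fist", [4, 0, 0, 0]),
    ("point", [0, 0, 0, 4]),
]

label, distance = nearest_pose([3, 1, 0, 0], database)
print(label, distance)
```

A linear scan is used here only for brevity; the retrieved pose would serve as an initialization for the model-to-point-cloud match, as the abstract describes.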

Cited Authors

  • Carley, C; Tomasi, C

Published Date

  • February 11, 2015

Volume / Issue

  • 2015-February /

Start / End Page

  • 493 - 501

International Standard Serial Number (ISSN)

  • 1550-5499

International Standard Book Number 13 (ISBN-13)

  • 9781467383905

Digital Object Identifier (DOI)

  • 10.1109/ICCVW.2015.71

Citation Source

  • Scopus