Sparse modeling of human actions from motion imagery

An efficient sparse modeling pipeline for the classification of human actions from video is here developed. Spatio-temporal features that characterize local changes in the image are first extracted. This is followed by the learning of a class-structured dictionary encoding the individual actions of interest. Classification is then based on reconstruction, where the label assigned to each video comes from the optimal sparse linear combination of the learned basis vectors (action primitives) representing the actions. A low computational cost deep-layer model learning the inter-class correlations of the data is added for increasing discriminative power. In spite of its simplicity and low computational cost, the method outperforms previously reported results for virtually all standard datasets. © 2012 Springer Science+Business Media, LLC (outside the USA).

Full Text

Duke Authors

Cited Authors

  • Castrodad, A; Sapiro, G

Published Date

  • 2012

Published In

Volume / Issue

  • 100 / 1

Start / End Page

  • 1 - 15

International Standard Serial Number (ISSN)

  • 0920-5691

Digital Object Identifier (DOI)

  • 10.1007/s11263-012-0534-7