Translated poisson mixture model for stratification learning

A framework for the regularized and robust estimation of non-uniform dimensionality and density in high dimensional noisy data is introduced in this work. This leads to learning stratifications, that is, mixture of manifolds representing different characteristics and complexities in the data set. The basic idea relies on modeling the high dimensional sample points as a process of translated Poisson mixtures, with regularizing restrictions, leading to a model which includes the presence of noise. The translated Poisson distribution is useful to model a noisy counting process, and it is derived from the noise-induced translation of a regular Poisson distribution. By maximizing the log-likelihood of the process counting the points falling into a local ball, we estimate the local dimension and density. We show that the sequence of all possible local countings in a point cloud formed by samples of a stratification can be modeled by a mixture of different translated Poisson distributions, thus allowing the presence of mixed dimensionality and densities in the same data set. With this statistical model, the parameters which best describe the data, estimated via expectation maximization, divide the points in different classes according to both dimensionality and density, together with an estimation of these quantities for each class. Theoretical asymptotic results for the model are presented as well. The presentation of the theoretical framework is complemented with artificial and real examples showing the importance of regularized stratification learning in high dimensional data analysis in general and computer vision and image analysis in particular. © 2008 Springer Science+Business Media, LLC.

Full Text

Duke Authors

Cited Authors

  • Haro, G; Randall, G; Sapiro, G

Published Date

  • 2008

Published In

Volume / Issue

  • 80 / 3

Start / End Page

  • 358 - 374

International Standard Serial Number (ISSN)

  • 0920-5691

Digital Object Identifier (DOI)

  • 10.1007/s11263-008-0144-6

Citation Source

  • SciVal