A geodesic framework for fast interactive image and video segmentation and matting

An interactive framework for soft segmentation and matting of natural images and videos is presented in this paper. The proposed technique is based on the optimal, linear time, computation of weighted geodesic distances to the user-provided scribbles, from which the whole data is automatically segmented. The weights are based on spatial and/or temporal gradients, without explicit optical flow or any advanced and often computationally expensive feature detectors. These could be naturally added to the proposed framework as well if desired, in the form of weights in the geodesic distances. A localized refinement step follows this fast segmentation in order to accurately compute the corresponding matte function. Additional constraints into the distance definition permit to efficiently handle occlusions such as people or objects crossing each other in a video sequence. The presentation of the framework is complemented with numerous and diverse examples, including extraction of moving foreground from dynamic background, and comparisons with the recent literature. ©2007 IEEE.

Full Text

Duke Authors

Cited Authors

  • Bai, X; Sapiro, G

Published Date

  • 2007

Published In

  • Proceedings of the IEEE International Conference on Computer Vision

Digital Object Identifier (DOI)

  • 10.1109/ICCV.2007.4408931