Skip to main content
Journal cover image

eRPCA: Robust Principal Component Analysis for Exponential Family Distributions

Publication ,  Journal Article
Zheng, X; Mak, S; Xie, L; Xie, Y
Published in: Statistical Analysis and Data Mining
April 1, 2024

Robust principal component analysis (RPCA) is a widely used method for recovering low-rank structure from data matrices corrupted by significant and sparse outliers. These corruptions may arise from occlusions, malicious tampering, or other causes for anomalies, and the joint identification of such corruptions with low-rank background is critical for process monitoring and diagnosis. However, existing RPCA methods and their extensions largely do not account for the underlying probabilistic distribution for the data matrices, which in many applications are known and can be highly non-Gaussian. We thus propose a new method called RPCA for exponential family distributions ((Formula presented.)), which can perform the desired decomposition into low-rank and sparse matrices when such a distribution falls within the exponential family. We present a novel alternating direction method of multiplier optimization algorithm for efficient (Formula presented.) decomposition, under either its natural or canonical parametrization. The effectiveness of (Formula presented.) is then demonstrated in two applications: the first for steel sheet defect detection and the second for crime activity monitoring in the Atlanta metropolitan area.

Duke Scholars

Published In

Statistical Analysis and Data Mining

DOI

EISSN

1932-1872

ISSN

1932-1864

Publication Date

April 1, 2024

Volume

17

Issue

2

Related Subject Headings

  • 4905 Statistics
  • 4605 Data management and data science
  • 0104 Statistics
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Zheng, X., Mak, S., Xie, L., & Xie, Y. (2024). eRPCA: Robust Principal Component Analysis for Exponential Family Distributions. Statistical Analysis and Data Mining, 17(2). https://doi.org/10.1002/sam.11670
Zheng, X., S. Mak, L. Xie, and Y. Xie. “eRPCA: Robust Principal Component Analysis for Exponential Family Distributions.” Statistical Analysis and Data Mining 17, no. 2 (April 1, 2024). https://doi.org/10.1002/sam.11670.
Zheng X, Mak S, Xie L, Xie Y. eRPCA: Robust Principal Component Analysis for Exponential Family Distributions. Statistical Analysis and Data Mining. 2024 Apr 1;17(2).
Zheng, X., et al. “eRPCA: Robust Principal Component Analysis for Exponential Family Distributions.” Statistical Analysis and Data Mining, vol. 17, no. 2, Apr. 2024. Scopus, doi:10.1002/sam.11670.
Zheng X, Mak S, Xie L, Xie Y. eRPCA: Robust Principal Component Analysis for Exponential Family Distributions. Statistical Analysis and Data Mining. 2024 Apr 1;17(2).
Journal cover image

Published In

Statistical Analysis and Data Mining

DOI

EISSN

1932-1872

ISSN

1932-1864

Publication Date

April 1, 2024

Volume

17

Issue

2

Related Subject Headings

  • 4905 Statistics
  • 4605 Data management and data science
  • 0104 Statistics