Skip to main content

Sparse Dual of the Density Peaks Algorithm for Cluster Analysis of High-dimensional Data

Publication ,  Conference
Floros, D; Liu, T; Pitsianis, N; Sun, X
Published in: 2018 IEEE High Performance Extreme Computing Conference, HPEC 2018
November 26, 2018

The density peaks (DP) algorithm for cluster analysis, introduced by Rodriguez and Laio in 2014, has proven empirically competitive or superior in multiple aspects to other contemporary clustering algorithms. Yet, it suffers from certain drawbacks and limitations when used for clustering high-dimensional data. We introduce SD-DP, the sparse dual version of DP. While following the DP principle and maintaining its appealing properties, we find and use a sparse descriptor of local density as a robust representation. By analyzing and exploiting the consequential properties, we are able to use sparse graph-matrix expressions and operations throughout the clustering process. As a result, SD-DP has provably linear-scaling computation complexity under practical conditions. We show with experimental results on several real-world high-dimensional datasets, that SD-DP outperforms DP in robustness, accuracy, self-governess, and efficiency.

Duke Scholars

Published In

2018 IEEE High Performance Extreme Computing Conference, HPEC 2018

DOI

ISBN

9781538659892

Publication Date

November 26, 2018
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Floros, D., Liu, T., Pitsianis, N., & Sun, X. (2018). Sparse Dual of the Density Peaks Algorithm for Cluster Analysis of High-dimensional Data. In 2018 IEEE High Performance Extreme Computing Conference, HPEC 2018. https://doi.org/10.1109/HPEC.2018.8547519
Floros, D., T. Liu, N. Pitsianis, and X. Sun. “Sparse Dual of the Density Peaks Algorithm for Cluster Analysis of High-dimensional Data.” In 2018 IEEE High Performance Extreme Computing Conference, HPEC 2018, 2018. https://doi.org/10.1109/HPEC.2018.8547519.
Floros D, Liu T, Pitsianis N, Sun X. Sparse Dual of the Density Peaks Algorithm for Cluster Analysis of High-dimensional Data. In: 2018 IEEE High Performance Extreme Computing Conference, HPEC 2018. 2018.
Floros, D., et al. “Sparse Dual of the Density Peaks Algorithm for Cluster Analysis of High-dimensional Data.” 2018 IEEE High Performance Extreme Computing Conference, HPEC 2018, 2018. Scopus, doi:10.1109/HPEC.2018.8547519.
Floros D, Liu T, Pitsianis N, Sun X. Sparse Dual of the Density Peaks Algorithm for Cluster Analysis of High-dimensional Data. 2018 IEEE High Performance Extreme Computing Conference, HPEC 2018. 2018.

Published In

2018 IEEE High Performance Extreme Computing Conference, HPEC 2018

DOI

ISBN

9781538659892

Publication Date

November 26, 2018