Skip to main content
Journal cover image

Clustering in applications with multiple data sources-A mutual subspace clustering approach

Publication ,  Journal Article
Hua, M; Pei, J
Published in: Neurocomputing
September 1, 2012

In many applications, such as bioinformatics and cross-market customer relationship management, there are data from multiple sources jointly describing the same set of objects. An important data mining task is to find interesting groups of objects that form clusters in subspaces of the data sources jointly supported by those data sources.In this paper, we study a novel problem of mining mutual subspace clusters from multiple sources. We develop two interesting models and the corresponding methods for mutual subspace clustering. The density-based model identifies dense regions in subspaces as clusters. The bottom-up method searches for density-based mutual subspace clusters systematically from low-dimensional subspaces to high-dimensional ones. The partitioning model divides points in a data set into . k exclusive clusters and a signature subspace is found for each cluster, where . k is the number of clusters desired by a user. The top-down method interleaves the well-known . k-means clustering procedures in multiple sources. We use experimental results on synthetic data sets and real data sets to report the effectiveness and the efficiency of the methods. © 2012 Elsevier B.V..

Duke Scholars

Published In

Neurocomputing

DOI

EISSN

1872-8286

ISSN

0925-2312

Publication Date

September 1, 2012

Volume

92

Start / End Page

133 / 144

Related Subject Headings

  • Artificial Intelligence & Image Processing
  • 52 Psychology
  • 46 Information and computing sciences
  • 40 Engineering
  • 17 Psychology and Cognitive Sciences
  • 09 Engineering
  • 08 Information and Computing Sciences
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Hua, M., & Pei, J. (2012). Clustering in applications with multiple data sources-A mutual subspace clustering approach. Neurocomputing, 92, 133–144. https://doi.org/10.1016/j.neucom.2011.08.032
Hua, M., and J. Pei. “Clustering in applications with multiple data sources-A mutual subspace clustering approach.” Neurocomputing 92 (September 1, 2012): 133–44. https://doi.org/10.1016/j.neucom.2011.08.032.
Hua, M., and J. Pei. “Clustering in applications with multiple data sources-A mutual subspace clustering approach.” Neurocomputing, vol. 92, Sept. 2012, pp. 133–44. Scopus, doi:10.1016/j.neucom.2011.08.032.
Journal cover image

Published In

Neurocomputing

DOI

EISSN

1872-8286

ISSN

0925-2312

Publication Date

September 1, 2012

Volume

92

Start / End Page

133 / 144

Related Subject Headings

  • Artificial Intelligence & Image Processing
  • 52 Psychology
  • 46 Information and computing sciences
  • 40 Engineering
  • 17 Psychology and Cognitive Sciences
  • 09 Engineering
  • 08 Information and Computing Sciences