Skip to main content
Journal cover image

Mining outlying aspects on numeric data

Publication ,  Journal Article
Duan, L; Tang, G; Pei, J; Bailey, J; Campbell, A; Tang, C
Published in: Data Mining and Knowledge Discovery
September 22, 2015

When we are investigating an object in a data set, which itself may or may not be an outlier, can we identify unusual (i.e., outlying) aspects of the object? In this paper, we identify the novel problem of mining outlying aspects on numeric data. Given a query object o in a multidimensional numeric data set O, in which subspace is o most outlying? Technically, we use the rank of the probability density of an object in a subspace to measure the outlyingness of the object in the subspace. A minimal subspace where the query object is ranked the best is an outlying aspect. Computing the outlying aspects of a query object is far from trivial. A naïve method has to calculate the probability densities of all objects and rank them in every subspace, which is very costly when the dimensionality is high. We systematically develop a heuristic method that is capable of searching data sets with tens of dimensions efficiently. Our empirical study using both real data and synthetic data demonstrates that our method is effective and efficient.

Duke Scholars

Published In

Data Mining and Knowledge Discovery

DOI

ISSN

1384-5810

Publication Date

September 22, 2015

Volume

29

Issue

5

Start / End Page

1116 / 1151

Related Subject Headings

  • Artificial Intelligence & Image Processing
  • 46 Information and computing sciences
  • 0806 Information Systems
  • 0804 Data Format
  • 0801 Artificial Intelligence and Image Processing
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Duan, L., Tang, G., Pei, J., Bailey, J., Campbell, A., & Tang, C. (2015). Mining outlying aspects on numeric data. Data Mining and Knowledge Discovery, 29(5), 1116–1151. https://doi.org/10.1007/s10618-014-0398-2
Duan, L., G. Tang, J. Pei, J. Bailey, A. Campbell, and C. Tang. “Mining outlying aspects on numeric data.” Data Mining and Knowledge Discovery 29, no. 5 (September 22, 2015): 1116–51. https://doi.org/10.1007/s10618-014-0398-2.
Duan L, Tang G, Pei J, Bailey J, Campbell A, Tang C. Mining outlying aspects on numeric data. Data Mining and Knowledge Discovery. 2015 Sep 22;29(5):1116–51.
Duan, L., et al. “Mining outlying aspects on numeric data.” Data Mining and Knowledge Discovery, vol. 29, no. 5, Sept. 2015, pp. 1116–51. Scopus, doi:10.1007/s10618-014-0398-2.
Duan L, Tang G, Pei J, Bailey J, Campbell A, Tang C. Mining outlying aspects on numeric data. Data Mining and Knowledge Discovery. 2015 Sep 22;29(5):1116–1151.
Journal cover image

Published In

Data Mining and Knowledge Discovery

DOI

ISSN

1384-5810

Publication Date

September 22, 2015

Volume

29

Issue

5

Start / End Page

1116 / 1151

Related Subject Headings

  • Artificial Intelligence & Image Processing
  • 46 Information and computing sciences
  • 0806 Information Systems
  • 0804 Data Format
  • 0801 Artificial Intelligence and Image Processing