Skip to main content

PrefixSpan: Mining sequential patterns efficiently by prefix-projected pattern growth

Publication ,  Conference
Pei, J; Han, J; Mortazavi-Asl, B; Pinto, H; Chen, Q; Dayal, U; Hsu, MC
Published in: Proceedings - International Conference on Data Engineering
January 1, 2001

Sequential pattern mining is an important data mining problem with broad applications. It is challenging since one may need to examine a combinatorially explosive number of possible subsequence patterns. Most of the previously developed sequential pattern mining methods follow the methodology of Apriori which may substantially reduce the number of combinations to be examined. However Apriori still encounters problems when a sequence database is large and/or when sequential patterns to be mined are numerous and/or long. In this paper, we propose a novel sequential pattern mining method, called PrefixSpan (i.e., Prefix-projected Sequential pattern mining), which explores prefix-projection in sequential pattern mining. PrefixSpan mines the complete set of patterns but greatly reduces the efforts of candidate subsequence generation. Moreover, prefix-projection substantially reduces the size of projected databases and leads to efficient processing. Our performance study shows that PrefixSpan outperforms both the Apriori-based GSP algorithm and another recently proposed method, FreeSpan, in mining large sequence databases.

Duke Scholars

Published In

Proceedings - International Conference on Data Engineering

Publication Date

January 1, 2001

Start / End Page

215 / 224
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Pei, J., Han, J., Mortazavi-Asl, B., Pinto, H., Chen, Q., Dayal, U., & Hsu, M. C. (2001). PrefixSpan: Mining sequential patterns efficiently by prefix-projected pattern growth. In Proceedings - International Conference on Data Engineering (pp. 215–224).
Pei, J., J. Han, B. Mortazavi-Asl, H. Pinto, Q. Chen, U. Dayal, and M. C. Hsu. “PrefixSpan: Mining sequential patterns efficiently by prefix-projected pattern growth.” In Proceedings - International Conference on Data Engineering, 215–24, 2001.
Pei J, Han J, Mortazavi-Asl B, Pinto H, Chen Q, Dayal U, et al. PrefixSpan: Mining sequential patterns efficiently by prefix-projected pattern growth. In: Proceedings - International Conference on Data Engineering. 2001. p. 215–24.
Pei, J., et al. “PrefixSpan: Mining sequential patterns efficiently by prefix-projected pattern growth.” Proceedings - International Conference on Data Engineering, 2001, pp. 215–24.
Pei J, Han J, Mortazavi-Asl B, Pinto H, Chen Q, Dayal U, Hsu MC. PrefixSpan: Mining sequential patterns efficiently by prefix-projected pattern growth. Proceedings - International Conference on Data Engineering. 2001. p. 215–224.

Published In

Proceedings - International Conference on Data Engineering

Publication Date

January 1, 2001

Start / End Page

215 / 224