
From sequential pattern mining to structured pattern mining: A pattern-growth approach

Publication: Journal Article
Han, JW; Pei, J; Yan, XF
Published in: Journal of Computer Science and Technology
January 1, 2004

Sequential pattern mining is an important data mining problem with broad applications. However, it is also a challenging problem since the mining may have to generate or examine a combinatorially explosive number of intermediate subsequences. Recent studies have developed two major classes of sequential pattern mining methods: (1) a candidate generation-and-test approach, represented by (i) GSP, a horizontal format-based sequential pattern mining method, and (ii) SPADE, a vertical format-based method; and (2) a pattern-growth method, represented by PrefixSpan and its further extensions, such as gSpan for mining structured patterns. In this study, we perform a systematic introduction and presentation of the pattern-growth methodology and study its principles and extensions. We first introduce two interesting pattern-growth algorithms, FreeSpan and PrefixSpan, for efficient sequential pattern mining. Then we introduce gSpan for mining structured patterns using the same methodology. Their relative performance in large databases is presented and analyzed. Several extensions of these methods are also discussed in the paper, including mining multi-level, multi-dimensional patterns and mining constraint-based patterns.
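
To make the pattern-growth idea in the abstract concrete, here is a minimal sketch in the spirit of PrefixSpan. It is not the authors' implementation. It assumes each sequence is a list of single items (the paper's setting allows itemsets per element), represents a projected database as (sequence index, start offset) pairs, and uses a hypothetical function name `prefixspan`. Each frequent prefix is grown by counting items only in the suffixes that follow its first occurrence, so no candidate set is generated and tested.

```python
from collections import defaultdict


def prefixspan(sequences, min_support):
    """Return {pattern (tuple): support} for all frequent sequential patterns.

    Simplifying assumption: every sequence element is a single item.
    """
    results = {}

    def grow(prefix, projected):
        # projected holds (sequence index, start offset) pairs: the suffixes
        # of the sequences that contain `prefix`.
        support = defaultdict(set)   # item -> ids of sequences containing it
        first_pos = {}               # (item, sequence id) -> first position
        for seq_id, start in projected:
            seq = sequences[seq_id]
            for pos in range(start, len(seq)):
                item = seq[pos]
                if seq_id not in support[item]:
                    support[item].add(seq_id)
                    first_pos[(item, seq_id)] = pos

        for item in sorted(support):
            ids = support[item]
            if len(ids) < min_support:
                continue  # infrequent items cannot extend the prefix
            pattern = prefix + (item,)
            results[pattern] = len(ids)
            # Project on the extended prefix: keep what follows the first match.
            grow(pattern, [(sid, first_pos[(item, sid)] + 1) for sid in sorted(ids)])

    grow((), [(i, 0) for i in range(len(sequences))])
    return results


if __name__ == "__main__":
    db = [list("abc"), list("acb"), list("ab"), list("bc")]
    for pattern, count in sorted(prefixspan(db, min_support=2).items()):
        print("".join(pattern), count)
```

Storing offsets instead of copying suffixes is one way to keep projection cheap; whether it matches the optimizations described in the paper is not something this record states.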

Published In

Journal of Computer Science and Technology

DOI

10.1007/BF02944897

ISSN

1000-9000

Publication Date

January 1, 2004

Volume

19

Issue

3

Start / End Page

257 / 279

Related Subject Headings

  • Software Engineering
  • 46 Information and computing sciences
  • 08 Information and Computing Sciences
 

Citation

APA: Han, J. W., Pei, J., & Yan, X. F. (2004). From sequential pattern mining to structured pattern mining: A pattern-growth approach. Journal of Computer Science and Technology, 19(3), 257–279. https://doi.org/10.1007/BF02944897
Chicago: Han, J. W., J. Pei, and X. F. Yan. “From sequential pattern mining to structured pattern mining: A pattern-growth approach.” Journal of Computer Science and Technology 19, no. 3 (January 1, 2004): 257–79. https://doi.org/10.1007/BF02944897.
ICMJE: Han JW, Pei J, Yan XF. From sequential pattern mining to structured pattern mining: A pattern-growth approach. Journal of Computer Science and Technology. 2004 Jan 1;19(3):257–79.
MLA: Han, J. W., et al. “From sequential pattern mining to structured pattern mining: A pattern-growth approach.” Journal of Computer Science and Technology, vol. 19, no. 3, Jan. 2004, pp. 257–79. Scopus, doi:10.1007/BF02944897.
NLM: Han JW, Pei J, Yan XF. From sequential pattern mining to structured pattern mining: A pattern-growth approach. Journal of Computer Science and Technology. 2004 Jan 1;19(3):257–279.