Skip to main content
Journal cover image

A binary decision diagram based approach for mining frequent subsequences

Publication ,  Journal Article
Loekito, E; Bailey, J; Pei, J
Published in: Knowledge and Information Systems
January 1, 2010

Sequential pattern mining is an important problem in data mining. State of the art techniques for mining sequential patterns, such as frequent subsequences, are often based on the pattern-growth approach, which recursively projects conditional databases. Explicitly creating database projections is thought to be a major computational bottleneck, but we will show in this paper that it can be beneficial when the appropriate data structure is used. Our technique uses a canonical directed acyclic graph as the sequence database representation, which can be represented as a binary decision diagram (BDD). In this paper, we introduce a new type of BDD, namely a sequence BDD (SeqBDD), and show how it can be used for efficiently mining frequent subsequences. A novel feature of the SeqBDD is its ability to share results between similar intermediate computations and avoid redundant computation. We perform an experimental study to compare the SeqBDD technique with existing pattern growth techniques, that are based on other data structures such as prefix trees. Our results show that a SeqBDD can be half as large as a prefix tree, especially when many similar sequences exist. In terms of mining time, it can be substantially more efficient when the support is low, the number of patterns is large, or the input sequences are long and highly similar. © 2009 Springer-Verlag London Limited.

Duke Scholars

Published In

Knowledge and Information Systems

DOI

EISSN

0219-3116

ISSN

0219-1377

Publication Date

January 1, 2010

Volume

24

Issue

2

Start / End Page

235 / 268

Related Subject Headings

  • Information Systems
  • 46 Information and computing sciences
  • 0806 Information Systems
  • 0801 Artificial Intelligence and Image Processing
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Loekito, E., Bailey, J., & Pei, J. (2010). A binary decision diagram based approach for mining frequent subsequences. Knowledge and Information Systems, 24(2), 235–268. https://doi.org/10.1007/s10115-009-0252-9
Loekito, E., J. Bailey, and J. Pei. “A binary decision diagram based approach for mining frequent subsequences.” Knowledge and Information Systems 24, no. 2 (January 1, 2010): 235–68. https://doi.org/10.1007/s10115-009-0252-9.
Loekito E, Bailey J, Pei J. A binary decision diagram based approach for mining frequent subsequences. Knowledge and Information Systems. 2010 Jan 1;24(2):235–68.
Loekito, E., et al. “A binary decision diagram based approach for mining frequent subsequences.” Knowledge and Information Systems, vol. 24, no. 2, Jan. 2010, pp. 235–68. Scopus, doi:10.1007/s10115-009-0252-9.
Loekito E, Bailey J, Pei J. A binary decision diagram based approach for mining frequent subsequences. Knowledge and Information Systems. 2010 Jan 1;24(2):235–268.
Journal cover image

Published In

Knowledge and Information Systems

DOI

EISSN

0219-3116

ISSN

0219-1377

Publication Date

January 1, 2010

Volume

24

Issue

2

Start / End Page

235 / 268

Related Subject Headings

  • Information Systems
  • 46 Information and computing sciences
  • 0806 Information Systems
  • 0801 Artificial Intelligence and Image Processing