Skip to main content

CLOSET+: Searching for the best strategies for mining frequent closed itemsets

Publication ,  Conference
Wang, J; Han, J; Pei, J
Published in: Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
December 1, 2003

Mining frequent closed itemsets provides complete and non-redundant results for frequent pattern analysis. Extensive studies have proposed various strategies for efficient frequent closed itemset mining, such as depth-first search vs. breadthfirst search, vertical formats vs. horizontal formats, tree-structure vs. other data structures, top-down vs. bottom-up traversal, pseudo projection vs. physical projection of conditional database, etc. It is the right time to ask "what are the pros and cons of the strategies?" and "what and how can we pick and integrate the best strategies to achieve higher performance in general cases?"In this study, we answer the above questions by a systematic study of the search strategies and develop a winning algorithm CLOSET+. CLOSET+ integrates the advantages of the previously proposed effective strategies as well as some ones newly developed here. A thorough performance study on synthetic and real data sets has shown the advantages of the strategies and the improvement of CLOSET+ over existing mining algorithms, including CLOSET, CHARM and OP, in terms of runtime, memory usage and scalability. Copyright 2003 ACM.

Duke Scholars

Published In

Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

DOI

Publication Date

December 1, 2003

Start / End Page

236 / 245
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Wang, J., Han, J., & Pei, J. (2003). CLOSET+: Searching for the best strategies for mining frequent closed itemsets. In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 236–245). https://doi.org/10.1145/956750.956779
Wang, J., J. Han, and J. Pei. “CLOSET+: Searching for the best strategies for mining frequent closed itemsets.” In Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 236–45, 2003. https://doi.org/10.1145/956750.956779.
Wang J, Han J, Pei J. CLOSET+: Searching for the best strategies for mining frequent closed itemsets. In: Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2003. p. 236–45.
Wang, J., et al. “CLOSET+: Searching for the best strategies for mining frequent closed itemsets.” Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2003, pp. 236–45. Scopus, doi:10.1145/956750.956779.
Wang J, Han J, Pei J. CLOSET+: Searching for the best strategies for mining frequent closed itemsets. Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2003. p. 236–245.

Published In

Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

DOI

Publication Date

December 1, 2003

Start / End Page

236 / 245