Minimum description length principle: Generators are preferable to closed patterns

Publication, Conference
Li, J; Li, H; Wong, L; Pei, J; Dong, G
Published in: Proceedings of the National Conference on Artificial Intelligence
November 13, 2006

The generators and the unique closed pattern of an equivalence class of itemsets share a common set of transactions. The generators are the minimal itemsets in the class, while the closed pattern is the maximal one. Because a generator usually has lower cardinality than the closed pattern, the Minimum Description Length Principle favors the generator over the closed pattern in inductive inference and classification. To efficiently discover frequent generators from a large dataset, we develop a depth-first algorithm called Gr-growth. The idea is novel in contrast to traditional breadth-first, bottom-up generator-mining algorithms. Our extensive performance study shows that Gr-growth is significantly faster than existing generator-mining algorithms (by one or even two orders of magnitude when the support thresholds are low). It can also be faster than state-of-the-art frequent closed itemset mining algorithms such as FPclose and CLOSET+. Copyright © 2006, American Association for Artificial Intelligence (www.aaai.org). All rights reserved.
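The definitions in the abstract can be illustrated with a small brute-force sketch. This is not Gr-growth, whose details are not given on this page. It simply enumerates every itemset, groups itemsets by the set of transactions containing them, and reads off the minimal members (generators) and the maximal member (closed pattern) of one class. The function names, the toy transactions, and the `min_support` parameter are assumptions made for illustration, and the enumeration is exponential in the number of items.

```python
from itertools import combinations


def equivalence_classes(transactions, min_support=1):
    """Group itemsets by the set of transaction ids that contain them.

    Brute-force enumeration for illustration only; not the paper's algorithm.
    """
    items = sorted(set().union(*transactions))
    classes = {}
    for size in range(1, len(items) + 1):
        for candidate in combinations(items, size):
            itemset = frozenset(candidate)
            tids = frozenset(
                i for i, t in enumerate(transactions) if itemset <= t
            )
            if len(tids) >= min_support:
                classes.setdefault(tids, []).append(itemset)
    return classes


def generators_and_closed(members):
    """Return (generators, closed pattern) for one equivalence class."""
    # Generators: members with no proper subset in the same class.
    generators = [x for x in members if not any(y < x for y in members)]
    # Closed pattern: the union of all members, which is the maximal member.
    closed = frozenset().union(*members)
    return generators, closed


# Hypothetical toy dataset: four transactions over items a, b, c, d.
transactions = [{"a", "b", "c"}, {"a", "b", "c"}, {"a", "d"}, {"b", "d"}]
classes = equivalence_classes(transactions, min_support=2)

# The class of itemsets contained in exactly transactions 0 and 1.
gens, closed = generators_and_closed(classes[frozenset({0, 1})])
print(sorted(map(sorted, gens)), sorted(closed))
```

On this toy data the class supported by transactions 0 and 1 has two generators, {c} and {a, b}, and the closed pattern {a, b, c}. Both generators are shorter than the closed pattern while describing the same transactions, which is the observation behind the Minimum Description Length argument.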

Published In

Proceedings of the National Conference on Artificial Intelligence

Publication Date

November 13, 2006

Volume

1

Start / End Page

409 / 414
 

Citation

APA: Li, J., Li, H., Wong, L., Pei, J., & Dong, G. (2006). Minimum description length principle: Generators are preferable to closed patterns. In Proceedings of the National Conference on Artificial Intelligence (Vol. 1, pp. 409–414).
Chicago: Li, J., H. Li, L. Wong, J. Pei, and G. Dong. “Minimum description length principle: Generators are preferable to closed patterns.” In Proceedings of the National Conference on Artificial Intelligence, 1:409–14, 2006.
ICMJE: Li J, Li H, Wong L, Pei J, Dong G. Minimum description length principle: Generators are preferable to closed patterns. In: Proceedings of the National Conference on Artificial Intelligence. 2006. p. 409–14.
MLA: Li, J., et al. “Minimum description length principle: Generators are preferable to closed patterns.” Proceedings of the National Conference on Artificial Intelligence, vol. 1, 2006, pp. 409–14.
NLM: Li J, Li H, Wong L, Pei J, Dong G. Minimum description length principle: Generators are preferable to closed patterns. Proceedings of the National Conference on Artificial Intelligence. 2006. p. 409–414.