Scholars@Duke publication: Mining frequent co-occurrence patterns across multiple data streams

Mining frequent co-occurrence patterns across multiple data streams

Publication , Conference

Yu, Z; Yu, X; Liu, Y; Li, W; Pei, J

Published in: EDBT 2015 - 18th International Conference on Extending Database Technology, Proceedings

January 1, 2015

This paper studies the problem of mining frequent co-occurrence patterns across multiple data streams, which has not been addressed by existing works. Co-occurrence pattern in this context refers to the case that the same group of objects appear consecutively in multiple streams over a short time span, signaling tight correlations between these objects. The need for mining such patterns in real-time arises in a variety of applications ranging from crime prevention to location-based services to event discovery in social media. Since the data streams are usually fast, continuous, and unbounded, existing methods on mining frequent patterns requiring more than one pass over the data cannot be directly applied. Therefore, we propose DIMine and CooMine, two algorithms to discover frequent co-occurrence patterns across multiple data streams. DIMine is an Apriori-style algorithm based on an inverted index, while CooMine uses an in-memory data structure called the Seg-tree to compactly index the data that are already seen but have not expired yet. CooMine employs a one-pass algorithm that uses the filter-and-refine strategy to obtain the co-occurrence patterns from the Seg-tree as updates to the streams arrive. Extensive experiments on two real datasets demonstrate the superiority of the proposed approaches over a baseline method, and show their respective applicability in different senarios.

Duke Scholars

Author Jian Pei Computer Science

Published In

EDBT 2015 - 18th International Conference on Extending Database Technology, Proceedings

DOI

10.5441/002/edbt.2015.08

Publication Date

January 1, 2015

Start / End Page

73 / 84

Citation

APA

Chicago

ICMJE

MLA

NLM

Yu, Z., Yu, X., Liu, Y., Li, W., & Pei, J. (2015). Mining frequent co-occurrence patterns across multiple data streams. In EDBT 2015 - 18th International Conference on Extending Database Technology, Proceedings (pp. 73–84). https://doi.org/10.5441/002/edbt.2015.08

Yu, Z., X. Yu, Y. Liu, W. Li, and J. Pei. “Mining frequent co-occurrence patterns across multiple data streams.” In EDBT 2015 - 18th International Conference on Extending Database Technology, Proceedings, 73–84, 2015. https://doi.org/10.5441/002/edbt.2015.08.

Yu Z, Yu X, Liu Y, Li W, Pei J. Mining frequent co-occurrence patterns across multiple data streams. In: EDBT 2015 - 18th International Conference on Extending Database Technology, Proceedings. 2015. p. 73–84.

Yu, Z., et al. “Mining frequent co-occurrence patterns across multiple data streams.” EDBT 2015 - 18th International Conference on Extending Database Technology, Proceedings, 2015, pp. 73–84. Scopus, doi:10.5441/002/edbt.2015.08.

Yu Z, Yu X, Liu Y, Li W, Pei J. Mining frequent co-occurrence patterns across multiple data streams. EDBT 2015 - 18th International Conference on Extending Database Technology, Proceedings. 2015. p. 73–84.

Published In

EDBT 2015 - 18th International Conference on Extending Database Technology, Proceedings

DOI

10.5441/002/edbt.2015.08

Publication Date

January 1, 2015

Start / End Page

73 / 84