Scholars@Duke publication: Cross table cubing: Mining iceberg cubes from data warehouses

Cross table cubing: Mining iceberg cubes from data warehouses

Publication , Conference

Cho, M; Pei, J; Cheung, DW

Published in: Proceedings of the 2005 SIAM International Conference on Data Mining Sdm 2005

January 1, 2005

All of the existing (iceberg) cube computation algorithms assume that the data is stored in a single base table, however, in practice, a data warehouse is often organized in a schema of multiple tables, such as star schema and snowflake schema. In terms of both computation time and space, materializing a universal base table by joining multiple tables is often very expensive or even unaffordable in real data warehouses. In this paper, we investigate the problem of computing iceberg cubes from data warehouses. Surprisingly, our study shows that computing iceberg cube from multiple tables directly can be even more efficient in both space and runtime than computing from a materialized universal base table. We develop an efficient algorithm, CTC (for Cross Table Cubing) to tackle the problem. An extensive performance study on synthetic data sets demonstrates that our new approach is efficient and scalable for large data warehouses. Copyright © by SIAM.

Duke Scholars

Author Jian Pei Computer Science

Published In

Proceedings of the 2005 SIAM International Conference on Data Mining Sdm 2005

DOI

10.1137/1.9781611972757.41

Publication Date

January 1, 2005

Start / End Page

461 / 465

Citation

APA

Chicago

ICMJE

MLA

NLM

Cho, M., Pei, J., & Cheung, D. W. (2005). Cross table cubing: Mining iceberg cubes from data warehouses. In Proceedings of the 2005 SIAM International Conference on Data Mining Sdm 2005 (pp. 461–465). https://doi.org/10.1137/1.9781611972757.41

Cho, M., J. Pei, and D. W. Cheung. “Cross table cubing: Mining iceberg cubes from data warehouses.” In Proceedings of the 2005 SIAM International Conference on Data Mining Sdm 2005, 461–65, 2005. https://doi.org/10.1137/1.9781611972757.41.

Cho M, Pei J, Cheung DW. Cross table cubing: Mining iceberg cubes from data warehouses. In: Proceedings of the 2005 SIAM International Conference on Data Mining Sdm 2005. 2005. p. 461–5.

Cho, M., et al. “Cross table cubing: Mining iceberg cubes from data warehouses.” Proceedings of the 2005 SIAM International Conference on Data Mining Sdm 2005, 2005, pp. 461–65. Scopus, doi:10.1137/1.9781611972757.41.

Cho M, Pei J, Cheung DW. Cross table cubing: Mining iceberg cubes from data warehouses. Proceedings of the 2005 SIAM International Conference on Data Mining Sdm 2005. 2005. p. 461–465.

Published In

Proceedings of the 2005 SIAM International Conference on Data Mining Sdm 2005

DOI

10.1137/1.9781611972757.41

Publication Date

January 1, 2005

Start / End Page

461 / 465