Skip to main content

Cross table cubing: Mining iceberg cubes from data warehouses

Publication ,  Conference
Cho, M; Pei, J; Cheung, DW
Published in: Proceedings of the 2005 SIAM International Conference on Data Mining, SDM 2005
January 1, 2005

All of the existing (iceberg) cube computation algorithms assume that the data is stored in a single base table, however, in practice, a data warehouse is often organized in a schema of multiple tables, such as star schema and snowflake schema. In terms of both computation time and space, materializing a universal base table by joining multiple tables is often very expensive or even unaffordable in real data warehouses. In this paper, we investigate the problem of computing iceberg cubes from data warehouses. Surprisingly, our study shows that computing iceberg cube from multiple tables directly can be even more efficient in both space and runtime than computing from a materialized universal base table. We develop an efficient algorithm, CTC (for Cross Table Cubing) to tackle the problem. An extensive performance study on synthetic data sets demonstrates that our new approach is efficient and scalable for large data warehouses. Copyright © by SIAM.

Duke Scholars

Published In

Proceedings of the 2005 SIAM International Conference on Data Mining, SDM 2005

DOI

Publication Date

January 1, 2005

Start / End Page

461 / 465
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Cho, M., Pei, J., & Cheung, D. W. (2005). Cross table cubing: Mining iceberg cubes from data warehouses. In Proceedings of the 2005 SIAM International Conference on Data Mining, SDM 2005 (pp. 461–465). https://doi.org/10.1137/1.9781611972757.41
Cho, M., J. Pei, and D. W. Cheung. “Cross table cubing: Mining iceberg cubes from data warehouses.” In Proceedings of the 2005 SIAM International Conference on Data Mining, SDM 2005, 461–65, 2005. https://doi.org/10.1137/1.9781611972757.41.
Cho M, Pei J, Cheung DW. Cross table cubing: Mining iceberg cubes from data warehouses. In: Proceedings of the 2005 SIAM International Conference on Data Mining, SDM 2005. 2005. p. 461–5.
Cho, M., et al. “Cross table cubing: Mining iceberg cubes from data warehouses.” Proceedings of the 2005 SIAM International Conference on Data Mining, SDM 2005, 2005, pp. 461–65. Scopus, doi:10.1137/1.9781611972757.41.
Cho M, Pei J, Cheung DW. Cross table cubing: Mining iceberg cubes from data warehouses. Proceedings of the 2005 SIAM International Conference on Data Mining, SDM 2005. 2005. p. 461–465.

Published In

Proceedings of the 2005 SIAM International Conference on Data Mining, SDM 2005

DOI

Publication Date

January 1, 2005

Start / End Page

461 / 465