Skip to main content
Journal cover image

Massively parallel databases and MapReduce systems

Publication ,  Journal Article
Babu, S; Herodotou, H
Published in: Foundations and Trends in Databases
December 1, 2012

Timely and cost-effective analytics over "big data" has emerged as a key ingredient for success in many businesses, scientific and engineering disciplines, and government endeavors. Web clicks, social media, scientific experiments, and datacenter monitoring are among data sources that generate vast amounts of raw data every day. The need to convert this raw data into useful information has spawned considerable innovation in systems for large-scale data analytics, especially over the last decade. This monograph covers the design principles and core features of systems for analyzing very large datasets using massively-parallel computation and storage techniques on large clusters of nodes. We first discuss how the requirements of data analytics have evolved since the early work on parallel database systems. We then describe some of the major technological innovations that have each spawned a distinct category of systems for data analytics. Each unique system category is described along a number of dimensions including data model and query interface, storage layer, execution engine, query optimization, scheduling, resource management, and fault tolerance. We conclude with a summary of present trends in large-scale data analytics. © 2013 S. Babu and H. Herodotou.

Duke Scholars

Altmetric Attention Stats
Dimensions Citation Stats

Published In

Foundations and Trends in Databases

DOI

EISSN

1931-7891

ISSN

1931-7883

Publication Date

December 1, 2012

Volume

5

Issue

1

Start / End Page

1 / 104
 

Citation

APA
Chicago
ICMJE
MLA
NLM
Babu, S., & Herodotou, H. (2012). Massively parallel databases and MapReduce systems. Foundations and Trends in Databases, 5(1), 1–104. https://doi.org/10.1561/1900000036
Babu, S., and H. Herodotou. “Massively parallel databases and MapReduce systems.” Foundations and Trends in Databases 5, no. 1 (December 1, 2012): 1–104. https://doi.org/10.1561/1900000036.
Babu S, Herodotou H. Massively parallel databases and MapReduce systems. Foundations and Trends in Databases. 2012 Dec 1;5(1):1–104.
Babu, S., and H. Herodotou. “Massively parallel databases and MapReduce systems.” Foundations and Trends in Databases, vol. 5, no. 1, Dec. 2012, pp. 1–104. Scopus, doi:10.1561/1900000036.
Babu S, Herodotou H. Massively parallel databases and MapReduce systems. Foundations and Trends in Databases. 2012 Dec 1;5(1):1–104.
Journal cover image

Published In

Foundations and Trends in Databases

DOI

EISSN

1931-7891

ISSN

1931-7883

Publication Date

December 1, 2012

Volume

5

Issue

1

Start / End Page

1 / 104