MapReduce: Simplified data processing on large clusters

11.9kCitations
Citations of this article
22.1kReaders
Mendeley users who have this article in their library.
Get full text

Abstract

MapReduce is a programming model and an associated implementation for processing and generating large datasets that is amenable to a broad variety of real-world tasks. Users specify the computation in terms of a map and a reduce function, and the underlying runtime system automatically parallelizes the computation across large-scale clusters of machines, handles machine failures, and schedules inter-machine communication to make efficient use of the network and disks. Programmers find the system easy to use: more than ten thousand distinct MapReduce programs have been implemented internally at Google over the past four years, and an average of one hundred thousand MapReduce jobs are executed on Google's clusters every day, processing a total of more than twenty petabytes of data per day.

References Powered by Scopus

The google file system

4508Citations
N/AReaders
Get full text

Efficient Dispersal of Information for Security, Load Balancing, and Fault Tolerance

998Citations
N/AReaders
Get full text

Parallel Prefix Computation

961Citations
N/AReaders
Get full text

Cited by Powered by Scopus

The genome analysis toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data

19337Citations
N/AReaders
Get full text

Distributed optimization and statistical learning via the alternating direction method of multipliers

16018Citations
N/AReaders
Get full text

Edge Computing: Vision and Challenges

6070Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Dean, J., & Ghemawat, S. (2008). MapReduce: Simplified data processing on large clusters. Communications of the ACM, 51(1), 107–113. https://doi.org/10.1145/1327452.1327492

Readers over time

‘08‘09‘10‘11‘12‘13‘14‘15‘16‘17‘18‘19‘20‘21‘22‘23‘24‘2502000400060008000

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 11755

76%

Researcher 2031

13%

Professor / Associate Prof. 1115

7%

Lecturer / Post doc 521

3%

Readers' Discipline

Tooltip

Computer Science 18146

89%

Engineering 1488

7%

Environmental Science 324

2%

Agricultural and Biological Sciences 317

2%

Article Metrics

Tooltip
Mentions
Blog Mentions: 1
News Mentions: 6
References: 14

Save time finding and organizing research with Mendeley

Sign up for free
0