Implementing and Optimizing Multiple Group by Query in a MapReduce Approach

MapReduce model is a new parallel programming model initially developed for large-scale web content processing. Data analysis meets the issue of how to do calculation over extremely large dataset. The arrival of MapReduce provides a chance to utilize commodity hardware for massively parallel data an...

Full description

Bibliographic Details
Main Authors: Jie Pan, Frédéric Magoulès, Yann Le Biannic
Format: Article
Language:English
Published: SAGE Publishing 2010-06-01
Series:Journal of Algorithms & Computational Technology
Online Access:https://doi.org/10.1260/1748-3018.4.2.183