Performance Evaluation and Estimation Model Using Regression Method for Hadoop WordCount

Given the rapid growth in cloud computing, it is important to analyze the performance of different Hadoop MapReduce applications and to understand the performance bottleneck in a cloud cluster that contributes to higher or lower performance. It is also important to analyze the underlying hardware in...

Full description

Bibliographic Details
Main Author: Joseph A. Issa
Format: Article
Language:English
Published: IEEE 2015-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/7360871/
id doaj-dc8966e7518040deb15dcdcc4e230ba3
record_format Article
spelling doaj-dc8966e7518040deb15dcdcc4e230ba32021-03-29T19:34:51ZengIEEEIEEE Access2169-35362015-01-0132784279310.1109/ACCESS.2015.25095987360871Performance Evaluation and Estimation Model Using Regression Method for Hadoop WordCountJoseph A. Issa0Department of Electrical and Computer Engineering, Notre Dame University at Louaize, Zouk Mosbeh, LebanonGiven the rapid growth in cloud computing, it is important to analyze the performance of different Hadoop MapReduce applications and to understand the performance bottleneck in a cloud cluster that contributes to higher or lower performance. It is also important to analyze the underlying hardware in cloud cluster servers to enable the optimization of software and hardware to achieve the maximum performance possible. Hadoop is based on MapReduce, which is one of the most popular programming models for big data analysis in a parallel computing environment. In this paper, we present a detailed performance analysis, characterization, and evaluation of Hadoop MapReduce WordCount application. We also propose an estimation model based on Amdahl's law regression method to estimate performance and total processing time versus different input sizes for a given processor architecture. The estimation regression model is verified to estimate performance and run time with an error margin of <;5%.https://ieeexplore.ieee.org/document/7360871/performance analysiscloud computingdoop WordCount
collection DOAJ
language English
format Article
sources DOAJ
author Joseph A. Issa
spellingShingle Joseph A. Issa
Performance Evaluation and Estimation Model Using Regression Method for Hadoop WordCount
IEEE Access
performance analysis
cloud computing
doop WordCount
author_facet Joseph A. Issa
author_sort Joseph A. Issa
title Performance Evaluation and Estimation Model Using Regression Method for Hadoop WordCount
title_short Performance Evaluation and Estimation Model Using Regression Method for Hadoop WordCount
title_full Performance Evaluation and Estimation Model Using Regression Method for Hadoop WordCount
title_fullStr Performance Evaluation and Estimation Model Using Regression Method for Hadoop WordCount
title_full_unstemmed Performance Evaluation and Estimation Model Using Regression Method for Hadoop WordCount
title_sort performance evaluation and estimation model using regression method for hadoop wordcount
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2015-01-01
description Given the rapid growth in cloud computing, it is important to analyze the performance of different Hadoop MapReduce applications and to understand the performance bottleneck in a cloud cluster that contributes to higher or lower performance. It is also important to analyze the underlying hardware in cloud cluster servers to enable the optimization of software and hardware to achieve the maximum performance possible. Hadoop is based on MapReduce, which is one of the most popular programming models for big data analysis in a parallel computing environment. In this paper, we present a detailed performance analysis, characterization, and evaluation of Hadoop MapReduce WordCount application. We also propose an estimation model based on Amdahl's law regression method to estimate performance and total processing time versus different input sizes for a given processor architecture. The estimation regression model is verified to estimate performance and run time with an error margin of <;5%.
topic performance analysis
cloud computing
doop WordCount
url https://ieeexplore.ieee.org/document/7360871/
work_keys_str_mv AT josephaissa performanceevaluationandestimationmodelusingregressionmethodforhadoopwordcount
_version_ 1724195981744930816