Performance Evaluation and Estimation Model Using Regression Method for Hadoop WordCount
Given the rapid growth in cloud computing, it is important to analyze the performance of different Hadoop MapReduce applications and to understand the performance bottleneck in a cloud cluster that contributes to higher or lower performance. It is also important to analyze the underlying hardware in...
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2015-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/7360871/ |
id |
doaj-dc8966e7518040deb15dcdcc4e230ba3 |
---|---|
record_format |
Article |
spelling |
doaj-dc8966e7518040deb15dcdcc4e230ba32021-03-29T19:34:51ZengIEEEIEEE Access2169-35362015-01-0132784279310.1109/ACCESS.2015.25095987360871Performance Evaluation and Estimation Model Using Regression Method for Hadoop WordCountJoseph A. Issa0Department of Electrical and Computer Engineering, Notre Dame University at Louaize, Zouk Mosbeh, LebanonGiven the rapid growth in cloud computing, it is important to analyze the performance of different Hadoop MapReduce applications and to understand the performance bottleneck in a cloud cluster that contributes to higher or lower performance. It is also important to analyze the underlying hardware in cloud cluster servers to enable the optimization of software and hardware to achieve the maximum performance possible. Hadoop is based on MapReduce, which is one of the most popular programming models for big data analysis in a parallel computing environment. In this paper, we present a detailed performance analysis, characterization, and evaluation of Hadoop MapReduce WordCount application. We also propose an estimation model based on Amdahl's law regression method to estimate performance and total processing time versus different input sizes for a given processor architecture. The estimation regression model is verified to estimate performance and run time with an error margin of <;5%.https://ieeexplore.ieee.org/document/7360871/performance analysiscloud computingdoop WordCount |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Joseph A. Issa |
spellingShingle |
Joseph A. Issa Performance Evaluation and Estimation Model Using Regression Method for Hadoop WordCount IEEE Access performance analysis cloud computing doop WordCount |
author_facet |
Joseph A. Issa |
author_sort |
Joseph A. Issa |
title |
Performance Evaluation and Estimation Model Using Regression Method for Hadoop WordCount |
title_short |
Performance Evaluation and Estimation Model Using Regression Method for Hadoop WordCount |
title_full |
Performance Evaluation and Estimation Model Using Regression Method for Hadoop WordCount |
title_fullStr |
Performance Evaluation and Estimation Model Using Regression Method for Hadoop WordCount |
title_full_unstemmed |
Performance Evaluation and Estimation Model Using Regression Method for Hadoop WordCount |
title_sort |
performance evaluation and estimation model using regression method for hadoop wordcount |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2015-01-01 |
description |
Given the rapid growth in cloud computing, it is important to analyze the performance of different Hadoop MapReduce applications and to understand the performance bottleneck in a cloud cluster that contributes to higher or lower performance. It is also important to analyze the underlying hardware in cloud cluster servers to enable the optimization of software and hardware to achieve the maximum performance possible. Hadoop is based on MapReduce, which is one of the most popular programming models for big data analysis in a parallel computing environment. In this paper, we present a detailed performance analysis, characterization, and evaluation of Hadoop MapReduce WordCount application. We also propose an estimation model based on Amdahl's law regression method to estimate performance and total processing time versus different input sizes for a given processor architecture. The estimation regression model is verified to estimate performance and run time with an error margin of <;5%. |
topic |
performance analysis cloud computing doop WordCount |
url |
https://ieeexplore.ieee.org/document/7360871/ |
work_keys_str_mv |
AT josephaissa performanceevaluationandestimationmodelusingregressionmethodforhadoopwordcount |
_version_ |
1724195981744930816 |