A large data processing algorithm for energy efficiency in a heterogeneous cluster

It is reportedi that the electricity cost to operate a cluster may well exceed its acquisition cost, and the processing of big data requires large scale cluster and long period. Therefore, energy efficient processing of big data is essential for the data owners and users. In this paper, we propose a...

Full description

Bibliographic Details
Main Authors: Wang Lei, Ge Weichun, Li Zhao, Lei Zhenjiang, Chen Shuo
Format: Article
Language:English
Published: EDP Sciences 2018-01-01
Series:ITM Web of Conferences
Online Access:https://doi.org/10.1051/itmconf/20181703023
id doaj-a01368c47f5d45a8b3bc7f2b6c66f59a
record_format Article
spelling doaj-a01368c47f5d45a8b3bc7f2b6c66f59a2021-02-02T08:43:58ZengEDP SciencesITM Web of Conferences2271-20972018-01-01170302310.1051/itmconf/20181703023itmconf_wcsn2018_03023A large data processing algorithm for energy efficiency in a heterogeneous clusterWang LeiGe WeichunLi ZhaoLei ZhenjiangChen ShuoIt is reportedi that the electricity cost to operate a cluster may well exceed its acquisition cost, and the processing of big data requires large scale cluster and long period. Therefore, energy efficient processing of big data is essential for the data owners and users. In this paper, we propose a novel algorithm MinBalance to processing I/O intensive big data tasks energy efficiently in heterogeneous cluster. In the former step, four greedy policies are used to select the proper nodes considering heterogeneity of the cluster. While in the latter step, the workloads of the selected nodes will be well balanced to avoid the energy wastes caused by waiting. MinBalance is a universal algorithm and cannot be affected by the data storage strategies. Experimental results indicate that MinBalance can achieve over 60% energy reduction for large sets over the traditional methods of powering down partial nodes.https://doi.org/10.1051/itmconf/20181703023
collection DOAJ
language English
format Article
sources DOAJ
author Wang Lei
Ge Weichun
Li Zhao
Lei Zhenjiang
Chen Shuo
spellingShingle Wang Lei
Ge Weichun
Li Zhao
Lei Zhenjiang
Chen Shuo
A large data processing algorithm for energy efficiency in a heterogeneous cluster
ITM Web of Conferences
author_facet Wang Lei
Ge Weichun
Li Zhao
Lei Zhenjiang
Chen Shuo
author_sort Wang Lei
title A large data processing algorithm for energy efficiency in a heterogeneous cluster
title_short A large data processing algorithm for energy efficiency in a heterogeneous cluster
title_full A large data processing algorithm for energy efficiency in a heterogeneous cluster
title_fullStr A large data processing algorithm for energy efficiency in a heterogeneous cluster
title_full_unstemmed A large data processing algorithm for energy efficiency in a heterogeneous cluster
title_sort large data processing algorithm for energy efficiency in a heterogeneous cluster
publisher EDP Sciences
series ITM Web of Conferences
issn 2271-2097
publishDate 2018-01-01
description It is reportedi that the electricity cost to operate a cluster may well exceed its acquisition cost, and the processing of big data requires large scale cluster and long period. Therefore, energy efficient processing of big data is essential for the data owners and users. In this paper, we propose a novel algorithm MinBalance to processing I/O intensive big data tasks energy efficiently in heterogeneous cluster. In the former step, four greedy policies are used to select the proper nodes considering heterogeneity of the cluster. While in the latter step, the workloads of the selected nodes will be well balanced to avoid the energy wastes caused by waiting. MinBalance is a universal algorithm and cannot be affected by the data storage strategies. Experimental results indicate that MinBalance can achieve over 60% energy reduction for large sets over the traditional methods of powering down partial nodes.
url https://doi.org/10.1051/itmconf/20181703023
work_keys_str_mv AT wanglei alargedataprocessingalgorithmforenergyefficiencyinaheterogeneouscluster
AT geweichun alargedataprocessingalgorithmforenergyefficiencyinaheterogeneouscluster
AT lizhao alargedataprocessingalgorithmforenergyefficiencyinaheterogeneouscluster
AT leizhenjiang alargedataprocessingalgorithmforenergyefficiencyinaheterogeneouscluster
AT chenshuo alargedataprocessingalgorithmforenergyefficiencyinaheterogeneouscluster
AT wanglei largedataprocessingalgorithmforenergyefficiencyinaheterogeneouscluster
AT geweichun largedataprocessingalgorithmforenergyefficiencyinaheterogeneouscluster
AT lizhao largedataprocessingalgorithmforenergyefficiencyinaheterogeneouscluster
AT leizhenjiang largedataprocessingalgorithmforenergyefficiencyinaheterogeneouscluster
AT chenshuo largedataprocessingalgorithmforenergyefficiencyinaheterogeneouscluster
_version_ 1724296591030878208