A Machine Learning Solution for Data Center Thermal Characteristics Analysis

The energy efficiency of Data Center (DC) operations heavily relies on a DC ambient temperature as well as its IT and cooling systems performance. A reliable and efficient cooling system is necessary to produce a persistent flow of cold air to cool servers that are subjected to constantly increasing...

Full description

Bibliographic Details
Main Authors: Anastasiia Grishina, Marta Chinnici, Ah-Lian Kor, Eric Rondeau, Jean-Philippe Georges
Format: Article
Language:English
Published: MDPI AG 2020-08-01
Series:Energies
Subjects:
Online Access:https://www.mdpi.com/1996-1073/13/17/4378
id doaj-27a8f37ce1434517b95a5c7145a7f972
record_format Article
spelling doaj-27a8f37ce1434517b95a5c7145a7f9722020-11-25T03:48:13ZengMDPI AGEnergies1996-10732020-08-01134378437810.3390/en13174378A Machine Learning Solution for Data Center Thermal Characteristics AnalysisAnastasiia Grishina0Marta Chinnici1Ah-Lian Kor2Eric Rondeau3Jean-Philippe Georges4Department of Mathematics and Computer Science, Eindhoven University of Technology, 5612 AZ Eindhoven, The NetherlandsDepartment of Energy Technologies and Renewable Sources, ICT Division, ENEA Casaccia Research Center, 00123 Rome, ItalySchool of Built Environment, Engineering and Computing, Leeds Beckett University, Leeds LS1 3HE, UKUniversité de Lorraine, Centre National de la Recherche Scientifique (CNRS), Centre de Recherche en Automatique de Nancy (CRAN), F-54000 Nancy, FranceUniversité de Lorraine, Centre National de la Recherche Scientifique (CNRS), Centre de Recherche en Automatique de Nancy (CRAN), F-54000 Nancy, FranceThe energy efficiency of Data Center (DC) operations heavily relies on a DC ambient temperature as well as its IT and cooling systems performance. A reliable and efficient cooling system is necessary to produce a persistent flow of cold air to cool servers that are subjected to constantly increasing computational load due to the advent of smart cloud-based applications. Consequently, the increased demand for computing power will inadvertently increase server waste heat creation in data centers. To improve a DC thermal profile which could undeniably influence energy efficiency and reliability of IT equipment, it is imperative to explore the thermal characteristics analysis of an IT room. This work encompasses the employment of an unsupervised machine learning technique for uncovering weaknesses of a DC cooling system based on real DC monitoring thermal data. The findings of the analysis result in the identification of areas for thermal management and cooling improvement that further feeds into DC recommendations. With the aim to identify overheated zones in a DC IT room and corresponding servers, we applied analyzed thermal characteristics of the IT room. Experimental dataset includes measurements of ambient air temperature in the hot aisle of the IT room in ENEA Portici research center hosting the CRESCO6 computing cluster. We use machine learning clustering techniques to identify overheated locations and categorize computing nodes based on surrounding air temperature ranges abstracted from the data. This work employs the principles and approaches replicable for the analysis of thermal characteristics of any DC, thereby fostering transferability. This paper demonstrates how best practices and guidelines could be applied for thermal analysis and profiling of a commercial DC based on real thermal monitoring data.https://www.mdpi.com/1996-1073/13/17/4378data centerthermal characteristics analysismachine learningenergy efficiencyclusteringunsupervised learning
collection DOAJ
language English
format Article
sources DOAJ
author Anastasiia Grishina
Marta Chinnici
Ah-Lian Kor
Eric Rondeau
Jean-Philippe Georges
spellingShingle Anastasiia Grishina
Marta Chinnici
Ah-Lian Kor
Eric Rondeau
Jean-Philippe Georges
A Machine Learning Solution for Data Center Thermal Characteristics Analysis
Energies
data center
thermal characteristics analysis
machine learning
energy efficiency
clustering
unsupervised learning
author_facet Anastasiia Grishina
Marta Chinnici
Ah-Lian Kor
Eric Rondeau
Jean-Philippe Georges
author_sort Anastasiia Grishina
title A Machine Learning Solution for Data Center Thermal Characteristics Analysis
title_short A Machine Learning Solution for Data Center Thermal Characteristics Analysis
title_full A Machine Learning Solution for Data Center Thermal Characteristics Analysis
title_fullStr A Machine Learning Solution for Data Center Thermal Characteristics Analysis
title_full_unstemmed A Machine Learning Solution for Data Center Thermal Characteristics Analysis
title_sort machine learning solution for data center thermal characteristics analysis
publisher MDPI AG
series Energies
issn 1996-1073
publishDate 2020-08-01
description The energy efficiency of Data Center (DC) operations heavily relies on a DC ambient temperature as well as its IT and cooling systems performance. A reliable and efficient cooling system is necessary to produce a persistent flow of cold air to cool servers that are subjected to constantly increasing computational load due to the advent of smart cloud-based applications. Consequently, the increased demand for computing power will inadvertently increase server waste heat creation in data centers. To improve a DC thermal profile which could undeniably influence energy efficiency and reliability of IT equipment, it is imperative to explore the thermal characteristics analysis of an IT room. This work encompasses the employment of an unsupervised machine learning technique for uncovering weaknesses of a DC cooling system based on real DC monitoring thermal data. The findings of the analysis result in the identification of areas for thermal management and cooling improvement that further feeds into DC recommendations. With the aim to identify overheated zones in a DC IT room and corresponding servers, we applied analyzed thermal characteristics of the IT room. Experimental dataset includes measurements of ambient air temperature in the hot aisle of the IT room in ENEA Portici research center hosting the CRESCO6 computing cluster. We use machine learning clustering techniques to identify overheated locations and categorize computing nodes based on surrounding air temperature ranges abstracted from the data. This work employs the principles and approaches replicable for the analysis of thermal characteristics of any DC, thereby fostering transferability. This paper demonstrates how best practices and guidelines could be applied for thermal analysis and profiling of a commercial DC based on real thermal monitoring data.
topic data center
thermal characteristics analysis
machine learning
energy efficiency
clustering
unsupervised learning
url https://www.mdpi.com/1996-1073/13/17/4378
work_keys_str_mv AT anastasiiagrishina amachinelearningsolutionfordatacenterthermalcharacteristicsanalysis
AT martachinnici amachinelearningsolutionfordatacenterthermalcharacteristicsanalysis
AT ahliankor amachinelearningsolutionfordatacenterthermalcharacteristicsanalysis
AT ericrondeau amachinelearningsolutionfordatacenterthermalcharacteristicsanalysis
AT jeanphilippegeorges amachinelearningsolutionfordatacenterthermalcharacteristicsanalysis
AT anastasiiagrishina machinelearningsolutionfordatacenterthermalcharacteristicsanalysis
AT martachinnici machinelearningsolutionfordatacenterthermalcharacteristicsanalysis
AT ahliankor machinelearningsolutionfordatacenterthermalcharacteristicsanalysis
AT ericrondeau machinelearningsolutionfordatacenterthermalcharacteristicsanalysis
AT jeanphilippegeorges machinelearningsolutionfordatacenterthermalcharacteristicsanalysis
_version_ 1724499535316647936