Automated pipeline framework for processing of large-scale building energy time series data.

Commercial buildings account for one third of the total electricity consumption in the United States and a significant amount of this energy is wasted. Therefore, there is a need for "virtual" energy audits, to identify energy inefficiencies and their associated savings opportunities using...

Full description

Bibliographic Details
Main Authors:	Arash Khalilnejad, Ahmad M Karimi, Shreyas Kamath, Rojiar Haddadian, Roger H French, Alexis R Abramson
Format:	Article
Language:	English
Published:	Public Library of Science (PLoS) 2020-01-01
Series:	PLoS ONE
Online Access:	https://doi.org/10.1371/journal.pone.0240461

id	doaj-5dd83d46c3df49a3b018ebe28cd40fd2
record_format	Article
spelling	doaj-5dd83d46c3df49a3b018ebe28cd40fd22021-03-04T12:50:00ZengPublic Library of Science (PLoS)PLoS ONE1932-62032020-01-011512e024046110.1371/journal.pone.0240461Automated pipeline framework for processing of large-scale building energy time series data.Arash KhalilnejadAhmad M KarimiShreyas KamathRojiar HaddadianRoger H FrenchAlexis R AbramsonCommercial buildings account for one third of the total electricity consumption in the United States and a significant amount of this energy is wasted. Therefore, there is a need for "virtual" energy audits, to identify energy inefficiencies and their associated savings opportunities using methods that can be non-intrusive and automated for application to large populations of buildings. Here we demonstrate virtual energy audits applied to large populations of buildings' time-series smart-meter data using a systematic approach and a fully automated Building Energy Analytics (BEA) Pipeline that unifies, cleans, stores and analyzes building energy datasets in a non-relational data warehouse for efficient insights and results. This BEA pipeline is based on a custom compute job scheduler for a high performance computing cluster to enable parallel processing of Slurm jobs. Within the analytics pipeline, we introduced a data qualification tool that enhances data quality by fixing common errors, while also detecting abnormalities in a building's daily operation using hierarchical clustering. We analyze the HVAC scheduling of a population of 816 buildings, using this analytics pipeline, as part of a cross-sectional study. With our approach, this sample of 816 buildings is improved in data quality and is efficiently analyzed in 34 minutes, which is 85 times faster than the time taken by a sequential processing. The analytical results for the HVAC operational hours of these buildings show that among 10 building use types, food sales buildings with 17.75 hours of daily HVAC cooling operation are decent targets for HVAC savings. Overall, this analytics pipeline enables the identification of statistically significant results from population based studies of large numbers of building energy time-series datasets with robust results. These types of BEA studies can explore numerous factors impacting building energy efficiency and virtual building energy audits. This approach enables a new generation of data-driven buildings energy analysis at scale.https://doi.org/10.1371/journal.pone.0240461
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Arash Khalilnejad Ahmad M Karimi Shreyas Kamath Rojiar Haddadian Roger H French Alexis R Abramson
spellingShingle	Arash Khalilnejad Ahmad M Karimi Shreyas Kamath Rojiar Haddadian Roger H French Alexis R Abramson Automated pipeline framework for processing of large-scale building energy time series data. PLoS ONE
author_facet	Arash Khalilnejad Ahmad M Karimi Shreyas Kamath Rojiar Haddadian Roger H French Alexis R Abramson
author_sort	Arash Khalilnejad
title	Automated pipeline framework for processing of large-scale building energy time series data.
title_short	Automated pipeline framework for processing of large-scale building energy time series data.
title_full	Automated pipeline framework for processing of large-scale building energy time series data.
title_fullStr	Automated pipeline framework for processing of large-scale building energy time series data.
title_full_unstemmed	Automated pipeline framework for processing of large-scale building energy time series data.
title_sort	automated pipeline framework for processing of large-scale building energy time series data.
publisher	Public Library of Science (PLoS)
series	PLoS ONE
issn	1932-6203
publishDate	2020-01-01
description	Commercial buildings account for one third of the total electricity consumption in the United States and a significant amount of this energy is wasted. Therefore, there is a need for "virtual" energy audits, to identify energy inefficiencies and their associated savings opportunities using methods that can be non-intrusive and automated for application to large populations of buildings. Here we demonstrate virtual energy audits applied to large populations of buildings' time-series smart-meter data using a systematic approach and a fully automated Building Energy Analytics (BEA) Pipeline that unifies, cleans, stores and analyzes building energy datasets in a non-relational data warehouse for efficient insights and results. This BEA pipeline is based on a custom compute job scheduler for a high performance computing cluster to enable parallel processing of Slurm jobs. Within the analytics pipeline, we introduced a data qualification tool that enhances data quality by fixing common errors, while also detecting abnormalities in a building's daily operation using hierarchical clustering. We analyze the HVAC scheduling of a population of 816 buildings, using this analytics pipeline, as part of a cross-sectional study. With our approach, this sample of 816 buildings is improved in data quality and is efficiently analyzed in 34 minutes, which is 85 times faster than the time taken by a sequential processing. The analytical results for the HVAC operational hours of these buildings show that among 10 building use types, food sales buildings with 17.75 hours of daily HVAC cooling operation are decent targets for HVAC savings. Overall, this analytics pipeline enables the identification of statistically significant results from population based studies of large numbers of building energy time-series datasets with robust results. These types of BEA studies can explore numerous factors impacting building energy efficiency and virtual building energy audits. This approach enables a new generation of data-driven buildings energy analysis at scale.
url	https://doi.org/10.1371/journal.pone.0240461
work_keys_str_mv	AT arashkhalilnejad automatedpipelineframeworkforprocessingoflargescalebuildingenergytimeseriesdata AT ahmadmkarimi automatedpipelineframeworkforprocessingoflargescalebuildingenergytimeseriesdata AT shreyaskamath automatedpipelineframeworkforprocessingoflargescalebuildingenergytimeseriesdata AT rojiarhaddadian automatedpipelineframeworkforprocessingoflargescalebuildingenergytimeseriesdata AT rogerhfrench automatedpipelineframeworkforprocessingoflargescalebuildingenergytimeseriesdata AT alexisrabramson automatedpipelineframeworkforprocessingoflargescalebuildingenergytimeseriesdata
_version_	1714801316657627136

Automated pipeline framework for processing of large-scale building energy time series data.

Similar Items