Automated pipeline framework for processing of large-scale building energy time series data.

Commercial buildings account for one third of the total electricity consumption in the United States and a significant amount of this energy is wasted. Therefore, there is a need for "virtual" energy audits, to identify energy inefficiencies and their associated savings opportunities using...

Full description

Bibliographic Details
Main Authors: Arash Khalilnejad, Ahmad M Karimi, Shreyas Kamath, Rojiar Haddadian, Roger H French, Alexis R Abramson
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2020-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0240461
id doaj-5dd83d46c3df49a3b018ebe28cd40fd2
record_format Article
spelling doaj-5dd83d46c3df49a3b018ebe28cd40fd22021-03-04T12:50:00ZengPublic Library of Science (PLoS)PLoS ONE1932-62032020-01-011512e024046110.1371/journal.pone.0240461Automated pipeline framework for processing of large-scale building energy time series data.Arash KhalilnejadAhmad M KarimiShreyas KamathRojiar HaddadianRoger H FrenchAlexis R AbramsonCommercial buildings account for one third of the total electricity consumption in the United States and a significant amount of this energy is wasted. Therefore, there is a need for "virtual" energy audits, to identify energy inefficiencies and their associated savings opportunities using methods that can be non-intrusive and automated for application to large populations of buildings. Here we demonstrate virtual energy audits applied to large populations of buildings' time-series smart-meter data using a systematic approach and a fully automated Building Energy Analytics (BEA) Pipeline that unifies, cleans, stores and analyzes building energy datasets in a non-relational data warehouse for efficient insights and results. This BEA pipeline is based on a custom compute job scheduler for a high performance computing cluster to enable parallel processing of Slurm jobs. Within the analytics pipeline, we introduced a data qualification tool that enhances data quality by fixing common errors, while also detecting abnormalities in a building's daily operation using hierarchical clustering. We analyze the HVAC scheduling of a population of 816 buildings, using this analytics pipeline, as part of a cross-sectional study. With our approach, this sample of 816 buildings is improved in data quality and is efficiently analyzed in 34 minutes, which is 85 times faster than the time taken by a sequential processing. The analytical results for the HVAC operational hours of these buildings show that among 10 building use types, food sales buildings with 17.75 hours of daily HVAC cooling operation are decent targets for HVAC savings. Overall, this analytics pipeline enables the identification of statistically significant results from population based studies of large numbers of building energy time-series datasets with robust results. These types of BEA studies can explore numerous factors impacting building energy efficiency and virtual building energy audits. This approach enables a new generation of data-driven buildings energy analysis at scale.https://doi.org/10.1371/journal.pone.0240461
collection DOAJ
language English
format Article
sources DOAJ
author Arash Khalilnejad
Ahmad M Karimi
Shreyas Kamath
Rojiar Haddadian
Roger H French
Alexis R Abramson
spellingShingle Arash Khalilnejad
Ahmad M Karimi
Shreyas Kamath
Rojiar Haddadian
Roger H French
Alexis R Abramson
Automated pipeline framework for processing of large-scale building energy time series data.
PLoS ONE
author_facet Arash Khalilnejad
Ahmad M Karimi
Shreyas Kamath
Rojiar Haddadian
Roger H French
Alexis R Abramson
author_sort Arash Khalilnejad
title Automated pipeline framework for processing of large-scale building energy time series data.
title_short Automated pipeline framework for processing of large-scale building energy time series data.
title_full Automated pipeline framework for processing of large-scale building energy time series data.
title_fullStr Automated pipeline framework for processing of large-scale building energy time series data.
title_full_unstemmed Automated pipeline framework for processing of large-scale building energy time series data.
title_sort automated pipeline framework for processing of large-scale building energy time series data.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2020-01-01
description Commercial buildings account for one third of the total electricity consumption in the United States and a significant amount of this energy is wasted. Therefore, there is a need for "virtual" energy audits, to identify energy inefficiencies and their associated savings opportunities using methods that can be non-intrusive and automated for application to large populations of buildings. Here we demonstrate virtual energy audits applied to large populations of buildings' time-series smart-meter data using a systematic approach and a fully automated Building Energy Analytics (BEA) Pipeline that unifies, cleans, stores and analyzes building energy datasets in a non-relational data warehouse for efficient insights and results. This BEA pipeline is based on a custom compute job scheduler for a high performance computing cluster to enable parallel processing of Slurm jobs. Within the analytics pipeline, we introduced a data qualification tool that enhances data quality by fixing common errors, while also detecting abnormalities in a building's daily operation using hierarchical clustering. We analyze the HVAC scheduling of a population of 816 buildings, using this analytics pipeline, as part of a cross-sectional study. With our approach, this sample of 816 buildings is improved in data quality and is efficiently analyzed in 34 minutes, which is 85 times faster than the time taken by a sequential processing. The analytical results for the HVAC operational hours of these buildings show that among 10 building use types, food sales buildings with 17.75 hours of daily HVAC cooling operation are decent targets for HVAC savings. Overall, this analytics pipeline enables the identification of statistically significant results from population based studies of large numbers of building energy time-series datasets with robust results. These types of BEA studies can explore numerous factors impacting building energy efficiency and virtual building energy audits. This approach enables a new generation of data-driven buildings energy analysis at scale.
url https://doi.org/10.1371/journal.pone.0240461
work_keys_str_mv AT arashkhalilnejad automatedpipelineframeworkforprocessingoflargescalebuildingenergytimeseriesdata
AT ahmadmkarimi automatedpipelineframeworkforprocessingoflargescalebuildingenergytimeseriesdata
AT shreyaskamath automatedpipelineframeworkforprocessingoflargescalebuildingenergytimeseriesdata
AT rojiarhaddadian automatedpipelineframeworkforprocessingoflargescalebuildingenergytimeseriesdata
AT rogerhfrench automatedpipelineframeworkforprocessingoflargescalebuildingenergytimeseriesdata
AT alexisrabramson automatedpipelineframeworkforprocessingoflargescalebuildingenergytimeseriesdata
_version_ 1714801316657627136