Automated pipeline framework for processing of large-scale building energy time series data.
Commercial buildings account for one third of the total electricity consumption in the United States and a significant amount of this energy is wasted. Therefore, there is a need for "virtual" energy audits, to identify energy inefficiencies and their associated savings opportunities using...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Public Library of Science (PLoS)
2020-01-01
|
Series: | PLoS ONE |
Online Access: | https://doi.org/10.1371/journal.pone.0240461 |
id |
doaj-5dd83d46c3df49a3b018ebe28cd40fd2 |
---|---|
record_format |
Article |
spelling |
doaj-5dd83d46c3df49a3b018ebe28cd40fd22021-03-04T12:50:00ZengPublic Library of Science (PLoS)PLoS ONE1932-62032020-01-011512e024046110.1371/journal.pone.0240461Automated pipeline framework for processing of large-scale building energy time series data.Arash KhalilnejadAhmad M KarimiShreyas KamathRojiar HaddadianRoger H FrenchAlexis R AbramsonCommercial buildings account for one third of the total electricity consumption in the United States and a significant amount of this energy is wasted. Therefore, there is a need for "virtual" energy audits, to identify energy inefficiencies and their associated savings opportunities using methods that can be non-intrusive and automated for application to large populations of buildings. Here we demonstrate virtual energy audits applied to large populations of buildings' time-series smart-meter data using a systematic approach and a fully automated Building Energy Analytics (BEA) Pipeline that unifies, cleans, stores and analyzes building energy datasets in a non-relational data warehouse for efficient insights and results. This BEA pipeline is based on a custom compute job scheduler for a high performance computing cluster to enable parallel processing of Slurm jobs. Within the analytics pipeline, we introduced a data qualification tool that enhances data quality by fixing common errors, while also detecting abnormalities in a building's daily operation using hierarchical clustering. We analyze the HVAC scheduling of a population of 816 buildings, using this analytics pipeline, as part of a cross-sectional study. With our approach, this sample of 816 buildings is improved in data quality and is efficiently analyzed in 34 minutes, which is 85 times faster than the time taken by a sequential processing. The analytical results for the HVAC operational hours of these buildings show that among 10 building use types, food sales buildings with 17.75 hours of daily HVAC cooling operation are decent targets for HVAC savings. Overall, this analytics pipeline enables the identification of statistically significant results from population based studies of large numbers of building energy time-series datasets with robust results. These types of BEA studies can explore numerous factors impacting building energy efficiency and virtual building energy audits. This approach enables a new generation of data-driven buildings energy analysis at scale.https://doi.org/10.1371/journal.pone.0240461 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Arash Khalilnejad Ahmad M Karimi Shreyas Kamath Rojiar Haddadian Roger H French Alexis R Abramson |
spellingShingle |
Arash Khalilnejad Ahmad M Karimi Shreyas Kamath Rojiar Haddadian Roger H French Alexis R Abramson Automated pipeline framework for processing of large-scale building energy time series data. PLoS ONE |
author_facet |
Arash Khalilnejad Ahmad M Karimi Shreyas Kamath Rojiar Haddadian Roger H French Alexis R Abramson |
author_sort |
Arash Khalilnejad |
title |
Automated pipeline framework for processing of large-scale building energy time series data. |
title_short |
Automated pipeline framework for processing of large-scale building energy time series data. |
title_full |
Automated pipeline framework for processing of large-scale building energy time series data. |
title_fullStr |
Automated pipeline framework for processing of large-scale building energy time series data. |
title_full_unstemmed |
Automated pipeline framework for processing of large-scale building energy time series data. |
title_sort |
automated pipeline framework for processing of large-scale building energy time series data. |
publisher |
Public Library of Science (PLoS) |
series |
PLoS ONE |
issn |
1932-6203 |
publishDate |
2020-01-01 |
description |
Commercial buildings account for one third of the total electricity consumption in the United States and a significant amount of this energy is wasted. Therefore, there is a need for "virtual" energy audits, to identify energy inefficiencies and their associated savings opportunities using methods that can be non-intrusive and automated for application to large populations of buildings. Here we demonstrate virtual energy audits applied to large populations of buildings' time-series smart-meter data using a systematic approach and a fully automated Building Energy Analytics (BEA) Pipeline that unifies, cleans, stores and analyzes building energy datasets in a non-relational data warehouse for efficient insights and results. This BEA pipeline is based on a custom compute job scheduler for a high performance computing cluster to enable parallel processing of Slurm jobs. Within the analytics pipeline, we introduced a data qualification tool that enhances data quality by fixing common errors, while also detecting abnormalities in a building's daily operation using hierarchical clustering. We analyze the HVAC scheduling of a population of 816 buildings, using this analytics pipeline, as part of a cross-sectional study. With our approach, this sample of 816 buildings is improved in data quality and is efficiently analyzed in 34 minutes, which is 85 times faster than the time taken by a sequential processing. The analytical results for the HVAC operational hours of these buildings show that among 10 building use types, food sales buildings with 17.75 hours of daily HVAC cooling operation are decent targets for HVAC savings. Overall, this analytics pipeline enables the identification of statistically significant results from population based studies of large numbers of building energy time-series datasets with robust results. These types of BEA studies can explore numerous factors impacting building energy efficiency and virtual building energy audits. This approach enables a new generation of data-driven buildings energy analysis at scale. |
url |
https://doi.org/10.1371/journal.pone.0240461 |
work_keys_str_mv |
AT arashkhalilnejad automatedpipelineframeworkforprocessingoflargescalebuildingenergytimeseriesdata AT ahmadmkarimi automatedpipelineframeworkforprocessingoflargescalebuildingenergytimeseriesdata AT shreyaskamath automatedpipelineframeworkforprocessingoflargescalebuildingenergytimeseriesdata AT rojiarhaddadian automatedpipelineframeworkforprocessingoflargescalebuildingenergytimeseriesdata AT rogerhfrench automatedpipelineframeworkforprocessingoflargescalebuildingenergytimeseriesdata AT alexisrabramson automatedpipelineframeworkforprocessingoflargescalebuildingenergytimeseriesdata |
_version_ |
1714801316657627136 |