A High-Performance Data Accessing and Processing System for Campus Real-time Power Usage

With the flourishing of Internet of Things (IoT) technology, ubiquitous power data can be linked to the Internet and be analyzed for real-time monitoring requirements. Numerous power data would be accumulated to even Tera-byte level as the time goes. To approach a real-time power monitoring platform...

Full description

Bibliographic Details
Main Authors: Sheng-Cang Chou, Chao-Tung Yang
Format: Article
Language:English
Published: Bright Publisher 2020-12-01
Series:IJIIS: International Journal of Informatics and Information Systems
Subjects:
Online Access:http://ijiis.org/index.php/IJIIS/article/view/98
id doaj-cce25125e2e0428ea1a6b214900cc447
record_format Article
spelling doaj-cce25125e2e0428ea1a6b214900cc4472021-07-03T00:32:10ZengBright PublisherIJIIS: International Journal of Informatics and Information Systems2579-70692020-12-013312813510.47738/ijiis.v3i3.9851A High-Performance Data Accessing and Processing System for Campus Real-time Power UsageSheng-Cang Chou0Chao-Tung Yang1Dept. of Computer Science, Tunghai University,Taichung City, TaiwanDept. of Computer Science, Tunghai University,Taichung City, TaiwanWith the flourishing of Internet of Things (IoT) technology, ubiquitous power data can be linked to the Internet and be analyzed for real-time monitoring requirements. Numerous power data would be accumulated to even Tera-byte level as the time goes. To approach a real-time power monitoring platform on them, an efficient and novel implementation techniques has been developed and formed to be the kernel material of this thesis. Based on the integration of multiple software subsystems in a layered manner, the proposed power-monitoring platform has been established and is composed of Ubuntu (as operating system), Hadoop (as storage subsystem), Hive (as data warehouse), and the Spark MLlib (as data analytics) from bottom to top. The generic power-data source is provided by the so-called smart meters equipped inside factories located in an enterprise practically. The data collection and storage are handled by the Hadoop subsystem and the data ingestion to Hive data warehouse is conducted by the Spark unit. On the aspect of system verification, under single-record query, these software modules: HiveQL and Impala SQL had been tested in terms of query-response efficiency. And for the performance exploration on the full-table query function. The relevant experiments have been conducted on the same software modules as well. The kernel contributions of this research work can be highlighted by two parts: the details of building an efficient real-time power-monitoring platform, and the relevant query-response efficiency for reference.http://ijiis.org/index.php/IJIIS/article/view/98internet of thingsbig data warehousereal-time processingsparkhiveimpala
collection DOAJ
language English
format Article
sources DOAJ
author Sheng-Cang Chou
Chao-Tung Yang
spellingShingle Sheng-Cang Chou
Chao-Tung Yang
A High-Performance Data Accessing and Processing System for Campus Real-time Power Usage
IJIIS: International Journal of Informatics and Information Systems
internet of things
big data warehouse
real-time processing
spark
hive
impala
author_facet Sheng-Cang Chou
Chao-Tung Yang
author_sort Sheng-Cang Chou
title A High-Performance Data Accessing and Processing System for Campus Real-time Power Usage
title_short A High-Performance Data Accessing and Processing System for Campus Real-time Power Usage
title_full A High-Performance Data Accessing and Processing System for Campus Real-time Power Usage
title_fullStr A High-Performance Data Accessing and Processing System for Campus Real-time Power Usage
title_full_unstemmed A High-Performance Data Accessing and Processing System for Campus Real-time Power Usage
title_sort high-performance data accessing and processing system for campus real-time power usage
publisher Bright Publisher
series IJIIS: International Journal of Informatics and Information Systems
issn 2579-7069
publishDate 2020-12-01
description With the flourishing of Internet of Things (IoT) technology, ubiquitous power data can be linked to the Internet and be analyzed for real-time monitoring requirements. Numerous power data would be accumulated to even Tera-byte level as the time goes. To approach a real-time power monitoring platform on them, an efficient and novel implementation techniques has been developed and formed to be the kernel material of this thesis. Based on the integration of multiple software subsystems in a layered manner, the proposed power-monitoring platform has been established and is composed of Ubuntu (as operating system), Hadoop (as storage subsystem), Hive (as data warehouse), and the Spark MLlib (as data analytics) from bottom to top. The generic power-data source is provided by the so-called smart meters equipped inside factories located in an enterprise practically. The data collection and storage are handled by the Hadoop subsystem and the data ingestion to Hive data warehouse is conducted by the Spark unit. On the aspect of system verification, under single-record query, these software modules: HiveQL and Impala SQL had been tested in terms of query-response efficiency. And for the performance exploration on the full-table query function. The relevant experiments have been conducted on the same software modules as well. The kernel contributions of this research work can be highlighted by two parts: the details of building an efficient real-time power-monitoring platform, and the relevant query-response efficiency for reference.
topic internet of things
big data warehouse
real-time processing
spark
hive
impala
url http://ijiis.org/index.php/IJIIS/article/view/98
work_keys_str_mv AT shengcangchou ahighperformancedataaccessingandprocessingsystemforcampusrealtimepowerusage
AT chaotungyang ahighperformancedataaccessingandprocessingsystemforcampusrealtimepowerusage
AT shengcangchou highperformancedataaccessingandprocessingsystemforcampusrealtimepowerusage
AT chaotungyang highperformancedataaccessingandprocessingsystemforcampusrealtimepowerusage
_version_ 1721321331143737344