A High-Performance Data Accessing and Processing System for Campus Real-time Power Usage
With the flourishing of Internet of Things (IoT) technology, ubiquitous power data can be linked to the Internet and be analyzed for real-time monitoring requirements. Numerous power data would be accumulated to even Tera-byte level as the time goes. To approach a real-time power monitoring platform...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Bright Publisher
2020-12-01
|
Series: | IJIIS: International Journal of Informatics and Information Systems |
Subjects: | |
Online Access: | http://ijiis.org/index.php/IJIIS/article/view/98 |
id |
doaj-cce25125e2e0428ea1a6b214900cc447 |
---|---|
record_format |
Article |
spelling |
doaj-cce25125e2e0428ea1a6b214900cc4472021-07-03T00:32:10ZengBright PublisherIJIIS: International Journal of Informatics and Information Systems2579-70692020-12-013312813510.47738/ijiis.v3i3.9851A High-Performance Data Accessing and Processing System for Campus Real-time Power UsageSheng-Cang Chou0Chao-Tung Yang1Dept. of Computer Science, Tunghai University,Taichung City, TaiwanDept. of Computer Science, Tunghai University,Taichung City, TaiwanWith the flourishing of Internet of Things (IoT) technology, ubiquitous power data can be linked to the Internet and be analyzed for real-time monitoring requirements. Numerous power data would be accumulated to even Tera-byte level as the time goes. To approach a real-time power monitoring platform on them, an efficient and novel implementation techniques has been developed and formed to be the kernel material of this thesis. Based on the integration of multiple software subsystems in a layered manner, the proposed power-monitoring platform has been established and is composed of Ubuntu (as operating system), Hadoop (as storage subsystem), Hive (as data warehouse), and the Spark MLlib (as data analytics) from bottom to top. The generic power-data source is provided by the so-called smart meters equipped inside factories located in an enterprise practically. The data collection and storage are handled by the Hadoop subsystem and the data ingestion to Hive data warehouse is conducted by the Spark unit. On the aspect of system verification, under single-record query, these software modules: HiveQL and Impala SQL had been tested in terms of query-response efficiency. And for the performance exploration on the full-table query function. The relevant experiments have been conducted on the same software modules as well. The kernel contributions of this research work can be highlighted by two parts: the details of building an efficient real-time power-monitoring platform, and the relevant query-response efficiency for reference.http://ijiis.org/index.php/IJIIS/article/view/98internet of thingsbig data warehousereal-time processingsparkhiveimpala |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Sheng-Cang Chou Chao-Tung Yang |
spellingShingle |
Sheng-Cang Chou Chao-Tung Yang A High-Performance Data Accessing and Processing System for Campus Real-time Power Usage IJIIS: International Journal of Informatics and Information Systems internet of things big data warehouse real-time processing spark hive impala |
author_facet |
Sheng-Cang Chou Chao-Tung Yang |
author_sort |
Sheng-Cang Chou |
title |
A High-Performance Data Accessing and Processing System for Campus Real-time Power Usage |
title_short |
A High-Performance Data Accessing and Processing System for Campus Real-time Power Usage |
title_full |
A High-Performance Data Accessing and Processing System for Campus Real-time Power Usage |
title_fullStr |
A High-Performance Data Accessing and Processing System for Campus Real-time Power Usage |
title_full_unstemmed |
A High-Performance Data Accessing and Processing System for Campus Real-time Power Usage |
title_sort |
high-performance data accessing and processing system for campus real-time power usage |
publisher |
Bright Publisher |
series |
IJIIS: International Journal of Informatics and Information Systems |
issn |
2579-7069 |
publishDate |
2020-12-01 |
description |
With the flourishing of Internet of Things (IoT) technology, ubiquitous power data can be linked to the Internet and be analyzed for real-time monitoring requirements. Numerous power data would be accumulated to even Tera-byte level as the time goes. To approach a real-time power monitoring platform on them, an efficient and novel implementation techniques has been developed and formed to be the kernel material of this thesis. Based on the integration of multiple software subsystems in a layered manner, the proposed power-monitoring platform has been established and is composed of Ubuntu (as operating system), Hadoop (as storage subsystem), Hive (as data warehouse), and the Spark MLlib (as data analytics) from bottom to top. The generic power-data source is provided by the so-called smart meters equipped inside factories located in an enterprise practically. The data collection and storage are handled by the Hadoop subsystem and the data ingestion to Hive data warehouse is conducted by the Spark unit. On the aspect of system verification, under single-record query, these software modules: HiveQL and Impala SQL had been tested in terms of query-response efficiency. And for the performance exploration on the full-table query function. The relevant experiments have been conducted on the same software modules as well. The kernel contributions of this research work can be highlighted by two parts: the details of building an efficient real-time power-monitoring platform, and the relevant query-response efficiency for reference. |
topic |
internet of things big data warehouse real-time processing spark hive impala |
url |
http://ijiis.org/index.php/IJIIS/article/view/98 |
work_keys_str_mv |
AT shengcangchou ahighperformancedataaccessingandprocessingsystemforcampusrealtimepowerusage AT chaotungyang ahighperformancedataaccessingandprocessingsystemforcampusrealtimepowerusage AT shengcangchou highperformancedataaccessingandprocessingsystemforcampusrealtimepowerusage AT chaotungyang highperformancedataaccessingandprocessingsystemforcampusrealtimepowerusage |
_version_ |
1721321331143737344 |