Inferring Missing Sensor Values and Recommending Sensor Deployment Locations Based on Urban Big Data: A Case Study on Air Quality
博士 === 國立臺灣大學 === 資訊網路與多媒體研究所 === 103 === With the gradual maturity of the persuasive techniques for networking, sensors, and Internet, the paradigm of information and communication technology has shifted from fundamental facilities and theories to real-world ubiquitous applications, such as urban c...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2015
|
Online Access: | http://ndltd.ncl.edu.tw/handle/qw94dv |
id |
ndltd-TW-103NTU05641025 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-103NTU056410252019-05-15T22:17:25Z http://ndltd.ncl.edu.tw/handle/qw94dv Inferring Missing Sensor Values and Recommending Sensor Deployment Locations Based on Urban Big Data: A Case Study on Air Quality 基於巨量城市資料之監測數值推估與感測器地點配置:以空氣質量爲例 Hsun-Ping Hsieh 解巽評 博士 國立臺灣大學 資訊網路與多媒體研究所 103 With the gradual maturity of the persuasive techniques for networking, sensors, and Internet, the paradigm of information and communication technology has shifted from fundamental facilities and theories to real-world ubiquitous applications, such as urban computing, sensor networks, and smart cities. Along with the power of big data techniques, such advances also bring new research opportunities and practical problems, especially for Internet of Things (IoT). This thesis aims to answer two questions. First, how to infer real-time sensor value of any arbitrary location given the environmental data and the sensor data collected from monitoring stations that are extremely sparse in urban areas. Second, if the government agency needs to deploy a number of new sensors to improve the quality of sensor value inference, how to automatically learn and determine the best locations to fulfill such purpose? These two problems are very essential for urban computing and big sensor data analysis because sensors will be mounted in urban areas in the future era of Internet of Things, and are considerably challenging since for most of the locations (more than 99%) we do not have any sensor data to train a model from. In this thesis, by considering air quality indices as sensor values and treating air-quality monitoring stations as sensors, we aim to solve the two research problems in an effective, efficient, and scalable manner. We develop a general-purpose semi-supervised inference model, which is capable of not only intelligently learning to correlation between air quality values and heterogeneous spatial and temporal features of city dynamics, including meteorology, human mobility, structure of road networks, and point of interests (POIs), but also accurately predicting the air quality values of arbitrary locations without monitoring stations placed ever. In addition, to facilitate the cost-effective deployment of new sensors, we devise an entropy-minimization model to efficiently recommend the geographical locations such that new sensors established there can lead to the maximum improvement of air quality inference accuracy. We evaluate the proposed models using a huge-scale air quality data in Beijing city. Experimental results exhibit a set of clear advantages over a series of state-of-the-art and commonly used methods for both tasks, and suggest the models have superior potential for applications for current big sensor data analysis and the upcoming era of Internet of Things. Shou-De Lin 林守德 2015 學位論文 ; thesis 54 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
博士 === 國立臺灣大學 === 資訊網路與多媒體研究所 === 103 === With the gradual maturity of the persuasive techniques for networking, sensors, and Internet, the paradigm of information and communication technology has shifted from fundamental facilities and theories to real-world ubiquitous applications, such as urban computing, sensor networks, and smart cities. Along with the power of big data techniques, such advances also bring new research opportunities and practical problems, especially for Internet of Things (IoT). This thesis aims to answer two questions. First, how to infer real-time sensor value of any arbitrary location given the environmental data and the sensor data collected from monitoring stations that are extremely sparse in urban areas. Second, if the government agency needs to deploy a number of new sensors to improve the quality of sensor value inference, how to automatically learn and determine the best locations to fulfill such purpose? These two problems are very essential for urban computing and big sensor data analysis because sensors will be mounted in urban areas in the future era of Internet of Things, and are considerably challenging since for most of the locations (more than 99%) we do not have any sensor data to train a model from. In this thesis, by considering air quality indices as sensor values and treating air-quality monitoring stations as sensors, we aim to solve the two research problems in an effective, efficient, and scalable manner. We develop a general-purpose semi-supervised inference model, which is capable of not only intelligently learning to correlation between air quality values and heterogeneous spatial and temporal features of city dynamics, including meteorology, human mobility, structure of road networks, and point of interests (POIs), but also accurately predicting the air quality values of arbitrary locations without monitoring stations placed ever. In addition, to facilitate the cost-effective deployment of new sensors, we devise an entropy-minimization model to efficiently recommend the geographical locations such that new sensors established there can lead to the maximum improvement of air quality inference accuracy. We evaluate the proposed models using a huge-scale air quality data in Beijing city. Experimental results exhibit a set of clear advantages over a series of state-of-the-art and commonly used methods for both tasks, and suggest the models have superior potential for applications for current big sensor data analysis and the upcoming era of Internet of Things.
|
author2 |
Shou-De Lin |
author_facet |
Shou-De Lin Hsun-Ping Hsieh 解巽評 |
author |
Hsun-Ping Hsieh 解巽評 |
spellingShingle |
Hsun-Ping Hsieh 解巽評 Inferring Missing Sensor Values and Recommending Sensor Deployment Locations Based on Urban Big Data: A Case Study on Air Quality |
author_sort |
Hsun-Ping Hsieh |
title |
Inferring Missing Sensor Values and Recommending Sensor Deployment Locations Based on Urban Big Data: A Case Study on Air Quality |
title_short |
Inferring Missing Sensor Values and Recommending Sensor Deployment Locations Based on Urban Big Data: A Case Study on Air Quality |
title_full |
Inferring Missing Sensor Values and Recommending Sensor Deployment Locations Based on Urban Big Data: A Case Study on Air Quality |
title_fullStr |
Inferring Missing Sensor Values and Recommending Sensor Deployment Locations Based on Urban Big Data: A Case Study on Air Quality |
title_full_unstemmed |
Inferring Missing Sensor Values and Recommending Sensor Deployment Locations Based on Urban Big Data: A Case Study on Air Quality |
title_sort |
inferring missing sensor values and recommending sensor deployment locations based on urban big data: a case study on air quality |
publishDate |
2015 |
url |
http://ndltd.ncl.edu.tw/handle/qw94dv |
work_keys_str_mv |
AT hsunpinghsieh inferringmissingsensorvaluesandrecommendingsensordeploymentlocationsbasedonurbanbigdataacasestudyonairquality AT jiěxùnpíng inferringmissingsensorvaluesandrecommendingsensordeploymentlocationsbasedonurbanbigdataacasestudyonairquality AT hsunpinghsieh jīyújùliàngchéngshìzīliàozhījiāncèshùzhítuīgūyǔgǎncèqìdediǎnpèizhìyǐkōngqìzhìliàngwèilì AT jiěxùnpíng jīyújùliàngchéngshìzīliàozhījiāncèshùzhítuīgūyǔgǎncèqìdediǎnpèizhìyǐkōngqìzhìliàngwèilì |
_version_ |
1719127456463454208 |