Analyzing Open Data by R Language and Hadoop - Using Data of Weather and Agricultural Transactions

Master's thesis === National Formosa University === Graduate Institute of Information Management === 103 === In recent years, "open data" has become a popular topic in information technology because of its large potential value. In addition, Western countries and international organizations such as the United Nations have endeavored to prom...

Bibliographic Details
Main Authors: Hao-Ren Wu, 吳豪仁
Other Authors: Nian-Ze Hu
Format: Others
Language: zh-TW
Published: 2015
Online Access: http://ndltd.ncl.edu.tw/handle/7dd75x
id ndltd-TW-103NYPI5396045
record_format oai_dc
spelling ndltd-TW-103NYPI5396045 2019-09-22T03:41:17Z http://ndltd.ncl.edu.tw/handle/7dd75x Analyzing Open Data by R Language and Hadoop - Using Data of Weather and Agricultural Transactions 運用R語言與Hadoop分析開放資料-以天氣與農產品資料為例 Hao-Ren Wu 吳豪仁 Master's thesis National Formosa University Graduate Institute of Information Management 103 Nian-Ze Hu 胡念祖 2015 thesis 59 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description Master's thesis === National Formosa University === Graduate Institute of Information Management === 103 === In recent years, "open data" has become a popular topic in information technology because of its large potential value. Western countries and international organizations such as the United Nations have also worked to promote open government data. However, data obtained from different sources usually do not share a uniform format, which makes them inconvenient to integrate and analyze. It is therefore important to develop a mechanism that can collect and integrate heterogeneous open datasets seamlessly and help analysts retrieve potentially useful information efficiently. This study adopts the Hadoop platform and the R language to implement a prototype system that automatically captures and consolidates open data. After processing finishes, all results, including summarized data, analytical models, decision tree rules, and discovered key factors, are stored in a relational database and in HDFS. We use these procedures to collect agricultural transaction data and historical climate records. In addition, the system applies the proposed looping decision tree mechanism to generate the key factors common to the various crops that belong to a specified category.
author2 Nian-Ze Hu
author Hao-Ren Wu
吳豪仁
title Analyzing Open Data by R Language and Hadoop - Using Data of Weather and Agricultural Transactions
publishDate 2015
url http://ndltd.ncl.edu.tw/handle/7dd75x