Based on Hadoop Platform-Accelerating Query to Process Big Data Combined with Heterogeneous Databases and Hive

碩士 === 國立東華大學 === 資訊工程學系 === 101 === Cloud Computing has been a topic issue in the field of research in recent years. Nowadays, the environment of Internet has become more popular and powerful. Information can be gotten on any mobile devices (e.g., smartphone, laptop and tablet) through the web...

Full description

Bibliographic Details
Main Authors:	Jia-Sheng Liang, 梁嘉勝
Other Authors:	Ruay-Shiung Chang
Format:	Others
Published:	2013
Online Access:	http://ndltd.ncl.edu.tw/handle/222554

id	ndltd-TW-101NDHU5392009
record_format	oai_dc
spelling	ndltd-TW-101NDHU53920092018-04-10T17:22:13Z http://ndltd.ncl.edu.tw/handle/222554 Based on Hadoop Platform-Accelerating Query to Process Big Data Combined with Heterogeneous Databases and Hive 以Hadoop為平台-結合異質資料庫與Hive之加速查詢應用 Jia-Sheng Liang 梁嘉勝碩士國立東華大學資訊工程學系 101 Cloud Computing has been a topic issue in the field of research in recent years. Nowadays, the environment of Internet has become more popular and powerful. Information can be gotten on any mobile devices (e.g., smartphone, laptop and tablet) through the web services easily. Parallelization in computing and data storage increases the performance of computing and capacity of fault-tolerant. These significant features speed up the development of Cloud Computing service. Besides, more and more computing power and storage provided by cloud platforms are required to process big data. Information system needs to be reconsidered to deal with the parallelization of Big Data analytics services. In the traditional framework, the Information system combines with front-end application and database. The system will cost significant time and resources while the RDBMS (Relational Database Management System) is used to analyze Big Data. In order to solve this problem of Big Data analysis, we propose the WSG (Web Service GUI) that takes the advantages of Hadoop and RDBMS to overcome the obstacle. In this thesis, WSG installed on the Hadoop platform acts as a Web Service combined with Hive and Sqoop to integrate Big Data between Heterogeneous Databases. Therefore, the technician can utilize Hive QL to analyze Big Data, and WSG can convey data from HDFS to RDBMS by using Sqoop. As a result, our proposed method will reduce the burden of reconstruction information system to accomplish the standard of Cloud Computing, and also be benefit to acceleration in querying data of small size. Ruay-Shiung Chang 張瑞雄 2013 學位論文 ; thesis 80
collection	NDLTD
format	Others
sources	NDLTD
description	碩士 === 國立東華大學 === 資訊工程學系 === 101 === Cloud Computing has been a topic issue in the field of research in recent years. Nowadays, the environment of Internet has become more popular and powerful. Information can be gotten on any mobile devices (e.g., smartphone, laptop and tablet) through the web services easily. Parallelization in computing and data storage increases the performance of computing and capacity of fault-tolerant. These significant features speed up the development of Cloud Computing service. Besides, more and more computing power and storage provided by cloud platforms are required to process big data. Information system needs to be reconsidered to deal with the parallelization of Big Data analytics services. In the traditional framework, the Information system combines with front-end application and database. The system will cost significant time and resources while the RDBMS (Relational Database Management System) is used to analyze Big Data. In order to solve this problem of Big Data analysis, we propose the WSG (Web Service GUI) that takes the advantages of Hadoop and RDBMS to overcome the obstacle. In this thesis, WSG installed on the Hadoop platform acts as a Web Service combined with Hive and Sqoop to integrate Big Data between Heterogeneous Databases. Therefore, the technician can utilize Hive QL to analyze Big Data, and WSG can convey data from HDFS to RDBMS by using Sqoop. As a result, our proposed method will reduce the burden of reconstruction information system to accomplish the standard of Cloud Computing, and also be benefit to acceleration in querying data of small size.
author2	Ruay-Shiung Chang
author_facet	Ruay-Shiung Chang Jia-Sheng Liang 梁嘉勝
author	Jia-Sheng Liang 梁嘉勝
spellingShingle	Jia-Sheng Liang 梁嘉勝 Based on Hadoop Platform-Accelerating Query to Process Big Data Combined with Heterogeneous Databases and Hive
author_sort	Jia-Sheng Liang
title	Based on Hadoop Platform-Accelerating Query to Process Big Data Combined with Heterogeneous Databases and Hive
title_short	Based on Hadoop Platform-Accelerating Query to Process Big Data Combined with Heterogeneous Databases and Hive
title_full	Based on Hadoop Platform-Accelerating Query to Process Big Data Combined with Heterogeneous Databases and Hive
title_fullStr	Based on Hadoop Platform-Accelerating Query to Process Big Data Combined with Heterogeneous Databases and Hive
title_full_unstemmed	Based on Hadoop Platform-Accelerating Query to Process Big Data Combined with Heterogeneous Databases and Hive
title_sort	based on hadoop platform-accelerating query to process big data combined with heterogeneous databases and hive
publishDate	2013
url	http://ndltd.ncl.edu.tw/handle/222554
work_keys_str_mv	AT jiashengliang basedonhadoopplatformacceleratingquerytoprocessbigdatacombinedwithheterogeneousdatabasesandhive AT liángjiāshèng basedonhadoopplatformacceleratingquerytoprocessbigdatacombinedwithheterogeneousdatabasesandhive AT jiashengliang yǐhadoopwèipíngtáijiéhéyìzhìzīliàokùyǔhivezhījiāsùcháxúnyīngyòng AT liángjiāshèng yǐhadoopwèipíngtáijiéhéyìzhìzīliàokùyǔhivezhījiāsùcháxúnyīngyòng
_version_	1718627329292369920

Based on Hadoop Platform-Accelerating Query to Process Big Data Combined with Heterogeneous Databases and Hive

Similar Items