Two Level Data Extraction Scheme for Geographical Ocean Data

Master's === National Taipei University === Graduate Institute of Communication Engineering === 98 === Large scientific data archives manage and store huge quantities of data with the help of metadata, handle that data throughout its life cycle, and focus on particular scientific domains. An effective technique for finding the desired data becomes incr...

Bibliographic Details
Main Authors: Hsuan Jen Lai, 賴宣任
Other Authors: Yue Shan Chang
Format: Others
Language: zh-TW
Published: 2010
Online Access: http://ndltd.ncl.edu.tw/handle/82449432820973820042
id ndltd-TW-098NTPU0650010
record_format oai_dc
spelling ndltd-TW-098NTPU0650010 2015-10-13T18:21:30Z http://ndltd.ncl.edu.tw/handle/82449432820973820042 Two Level Data Extraction Scheme for Geographical Ocean Data 兩層海洋地理資料之擷取方法 Hsuan Jen Lai 賴宣任 Master's, National Taipei University, Graduate Institute of Communication Engineering, 98 Yue Shan Chang 張玉山 2010 thesis 66 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description Master's === National Taipei University === Graduate Institute of Communication Engineering === 98 === Large scientific data archives manage and store huge quantities of data with the help of metadata, handle that data throughout its life cycle, and focus on particular scientific domains. An effective technique for finding the desired data is therefore increasingly important. We propose a two-level data extraction scheme. It is well known that metadata can assist information retrieval, and using metadata to represent the file system also reduces the processing required to handle operations. However, because the number of metadata files grows daily along with the number of scientific data files, using metadata files to access a daily-growing data set becomes increasingly difficult. In this thesis, we first propose a Metadata Classification (MC) approach that classifies the records in the metadata files into month-level metadata and constructs a two-dimensional array to store the classified records, so that a user program can quickly look up the target files that contain the desired data. To further improve performance, we modify the MC approach and present a Modified Metadata Classification (MMC) approach. In MMC, the Metadata Classifier not only reclassifies the day-level metadata into year-level metadata but also adjusts the granularity of the GridMap. In addition, at the data level we propose a data distribution scheme, which is important for making lookups efficient. We conduct experiments to evaluate the performance of MC and MMC and compare them with a traditional approach (named the Raw approach) and an existing system. The results show that MMC performs better than MC and the Raw approach as the granularity of the GridMap increases. We also conduct experiments comparing the performance of our data extraction scheme with the Raw approach and SQL; the results show that our approach performs better.
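The abstract does not give implementation details, so the following is only a rough sketch of the general idea of grouping metadata records by month and storing them in a two-dimensional array for lookup. Everything concrete here is an assumption made for illustration and is not taken from the thesis: the `MetaRecord` fields, the latitude/longitude bounding boxes, the `cell_deg` granularity parameter, and the names `GridMap` and `classify_by_month`.

```python
import math
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class MetaRecord:
    """Hypothetical metadata record: one data file and the area it covers."""
    filename: str
    day: date
    lat_min: float
    lat_max: float
    lon_min: float
    lon_max: float


class GridMap:
    """Two-dimensional array of cells; each cell lists the files touching it.

    The latitude/longitude layout and cell size are assumptions of this sketch.
    """

    def __init__(self, cell_deg: float):
        self.cell_deg = cell_deg
        self.rows = math.ceil(180 / cell_deg)
        self.cols = math.ceil(360 / cell_deg)
        self.cells = [[[] for _ in range(self.cols)] for _ in range(self.rows)]

    def _index(self, lat: float, lon: float):
        # Shift to non-negative coordinates, then clamp to the last cell.
        row = min(int((lat + 90) / self.cell_deg), self.rows - 1)
        col = min(int((lon + 180) / self.cell_deg), self.cols - 1)
        return row, col

    def add(self, rec: MetaRecord) -> None:
        # Register the file in every cell its bounding box overlaps.
        r0, c0 = self._index(rec.lat_min, rec.lon_min)
        r1, c1 = self._index(rec.lat_max, rec.lon_max)
        for r in range(r0, r1 + 1):
            for c in range(c0, c1 + 1):
                self.cells[r][c].append(rec.filename)

    def lookup(self, lat: float, lon: float):
        r, c = self._index(lat, lon)
        return list(self.cells[r][c])


def classify_by_month(records, cell_deg: float = 10.0):
    """Group day-level records into one GridMap per (year, month)."""
    months = {}
    for rec in records:
        key = (rec.day.year, rec.day.month)
        if key not in months:
            months[key] = GridMap(cell_deg)
        months[key].add(rec)
    return months
```

Under these assumptions, a query first picks the `GridMap` for the requested month and then reads a single cell, so it never scans the day-level records. A smaller `cell_deg` gives finer cells with fewer candidate files each, at the cost of a larger array.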