Using Offline Wikipedia Database to Reduce Time Costing of NGD

碩士 === 國立中央大學 === 資訊管理研究所 === 100 === With the rapid development of Internet, many kinds of information website continued a steady increase; the user can easily obtain a great deal of information from a variety of search engines and portals such as Google and Yahoo! However, Jansen, et al. pointed o...

Full description

Bibliographic Details
Main Authors: Yi-chun Cheng, 鄭奕駿
Other Authors: Shi-jen Lin
Format: Others
Language:zh-TW
Published: 2012
Online Access:http://ndltd.ncl.edu.tw/handle/81007608444715028955
id ndltd-TW-100NCU05396062
record_format oai_dc
spelling ndltd-TW-100NCU053960622015-10-13T21:22:38Z http://ndltd.ncl.edu.tw/handle/81007608444715028955 Using Offline Wikipedia Database to Reduce Time Costing of NGD 離線搜尋Wikipedia以縮減NGD運算時間之研究 Yi-chun Cheng 鄭奕駿 碩士 國立中央大學 資訊管理研究所 100 With the rapid development of Internet, many kinds of information website continued a steady increase; the user can easily obtain a great deal of information from a variety of search engines and portals such as Google and Yahoo! However, Jansen, et al. pointed out that under normal circumstances, most users enter only 2.35 keywords, and mostly unclear or incomplete keyword results in returning a lot of websites so that lead to information overload. The research literature in the past, often using the categories of information, or filtering to help reduce the cost of user access to information, but these methods have to be built under the premise of a large number of training data can have good results. Recent studies have proposed NGD provided by Google''s search engine, key in the keywords to get the number of results to calculate the abstract distance between the two words, and then draw a conclusion of two words where the file is similar. However NGD rely on Google''s online search function, with the high-frequency query, Google will refused user to use the search service. In order to solve this problem, this study advances a method that use Wikipedia to establish the offline search engine, because Wikipedia has a structured concepts and high purity content. And with the experimental proofs, when user uses the offline Wikipedia database, the method proposed in this study still provides the user has a stable filtration performance, and saves the user a plenty of time costs. Shi-jen Lin 林熙禎 2012 學位論文 ; thesis 58 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立中央大學 === 資訊管理研究所 === 100 === With the rapid development of Internet, many kinds of information website continued a steady increase; the user can easily obtain a great deal of information from a variety of search engines and portals such as Google and Yahoo! However, Jansen, et al. pointed out that under normal circumstances, most users enter only 2.35 keywords, and mostly unclear or incomplete keyword results in returning a lot of websites so that lead to information overload. The research literature in the past, often using the categories of information, or filtering to help reduce the cost of user access to information, but these methods have to be built under the premise of a large number of training data can have good results. Recent studies have proposed NGD provided by Google''s search engine, key in the keywords to get the number of results to calculate the abstract distance between the two words, and then draw a conclusion of two words where the file is similar. However NGD rely on Google''s online search function, with the high-frequency query, Google will refused user to use the search service. In order to solve this problem, this study advances a method that use Wikipedia to establish the offline search engine, because Wikipedia has a structured concepts and high purity content. And with the experimental proofs, when user uses the offline Wikipedia database, the method proposed in this study still provides the user has a stable filtration performance, and saves the user a plenty of time costs.
author2 Shi-jen Lin
author_facet Shi-jen Lin
Yi-chun Cheng
鄭奕駿
author Yi-chun Cheng
鄭奕駿
spellingShingle Yi-chun Cheng
鄭奕駿
Using Offline Wikipedia Database to Reduce Time Costing of NGD
author_sort Yi-chun Cheng
title Using Offline Wikipedia Database to Reduce Time Costing of NGD
title_short Using Offline Wikipedia Database to Reduce Time Costing of NGD
title_full Using Offline Wikipedia Database to Reduce Time Costing of NGD
title_fullStr Using Offline Wikipedia Database to Reduce Time Costing of NGD
title_full_unstemmed Using Offline Wikipedia Database to Reduce Time Costing of NGD
title_sort using offline wikipedia database to reduce time costing of ngd
publishDate 2012
url http://ndltd.ncl.edu.tw/handle/81007608444715028955
work_keys_str_mv AT yichuncheng usingofflinewikipediadatabasetoreducetimecostingofngd
AT zhèngyìjùn usingofflinewikipediadatabasetoreducetimecostingofngd
AT yichuncheng líxiànsōuxúnwikipediayǐsuōjiǎnngdyùnsuànshíjiānzhīyánjiū
AT zhèngyìjùn líxiànsōuxúnwikipediayǐsuōjiǎnngdyùnsuànshíjiānzhīyánjiū
_version_ 1718061611840700416