Using Offline Wikipedia Database to Reduce Time Costing of NGD
碩士 === 國立中央大學 === 資訊管理研究所 === 100 === With the rapid development of Internet, many kinds of information website continued a steady increase; the user can easily obtain a great deal of information from a variety of search engines and portals such as Google and Yahoo! However, Jansen, et al. pointed o...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2012
|
Online Access: | http://ndltd.ncl.edu.tw/handle/81007608444715028955 |
id |
ndltd-TW-100NCU05396062 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-100NCU053960622015-10-13T21:22:38Z http://ndltd.ncl.edu.tw/handle/81007608444715028955 Using Offline Wikipedia Database to Reduce Time Costing of NGD 離線搜尋Wikipedia以縮減NGD運算時間之研究 Yi-chun Cheng 鄭奕駿 碩士 國立中央大學 資訊管理研究所 100 With the rapid development of Internet, many kinds of information website continued a steady increase; the user can easily obtain a great deal of information from a variety of search engines and portals such as Google and Yahoo! However, Jansen, et al. pointed out that under normal circumstances, most users enter only 2.35 keywords, and mostly unclear or incomplete keyword results in returning a lot of websites so that lead to information overload. The research literature in the past, often using the categories of information, or filtering to help reduce the cost of user access to information, but these methods have to be built under the premise of a large number of training data can have good results. Recent studies have proposed NGD provided by Google''s search engine, key in the keywords to get the number of results to calculate the abstract distance between the two words, and then draw a conclusion of two words where the file is similar. However NGD rely on Google''s online search function, with the high-frequency query, Google will refused user to use the search service. In order to solve this problem, this study advances a method that use Wikipedia to establish the offline search engine, because Wikipedia has a structured concepts and high purity content. And with the experimental proofs, when user uses the offline Wikipedia database, the method proposed in this study still provides the user has a stable filtration performance, and saves the user a plenty of time costs. Shi-jen Lin 林熙禎 2012 學位論文 ; thesis 58 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立中央大學 === 資訊管理研究所 === 100 === With the rapid development of Internet, many kinds of information website continued a steady increase; the user can easily obtain a great deal of information from a variety of search engines and portals such as Google and Yahoo! However, Jansen, et al. pointed out that under normal circumstances, most users enter only 2.35 keywords, and mostly unclear or incomplete keyword results in returning a lot of websites so that lead to information overload. The research literature in the past, often using the categories of information, or filtering to help reduce the cost of user access to information, but these methods have to be built under the premise of a large number of training data can have good results. Recent studies have proposed NGD provided by Google''s search engine, key in the keywords to get the number of results to calculate the abstract distance between the two words, and then draw a conclusion of two words where the file is similar. However NGD rely on Google''s online search function, with the high-frequency query, Google will refused user to use the search service. In order to solve this problem, this study advances a method that use Wikipedia to establish the offline search engine, because Wikipedia has a structured concepts and high purity content. And with the experimental proofs, when user uses the offline Wikipedia database, the method proposed in this study still provides the user has a stable filtration performance, and saves the user a plenty of time costs.
|
author2 |
Shi-jen Lin |
author_facet |
Shi-jen Lin Yi-chun Cheng 鄭奕駿 |
author |
Yi-chun Cheng 鄭奕駿 |
spellingShingle |
Yi-chun Cheng 鄭奕駿 Using Offline Wikipedia Database to Reduce Time Costing of NGD |
author_sort |
Yi-chun Cheng |
title |
Using Offline Wikipedia Database to Reduce Time Costing of NGD |
title_short |
Using Offline Wikipedia Database to Reduce Time Costing of NGD |
title_full |
Using Offline Wikipedia Database to Reduce Time Costing of NGD |
title_fullStr |
Using Offline Wikipedia Database to Reduce Time Costing of NGD |
title_full_unstemmed |
Using Offline Wikipedia Database to Reduce Time Costing of NGD |
title_sort |
using offline wikipedia database to reduce time costing of ngd |
publishDate |
2012 |
url |
http://ndltd.ncl.edu.tw/handle/81007608444715028955 |
work_keys_str_mv |
AT yichuncheng usingofflinewikipediadatabasetoreducetimecostingofngd AT zhèngyìjùn usingofflinewikipediadatabasetoreducetimecostingofngd AT yichuncheng líxiànsōuxúnwikipediayǐsuōjiǎnngdyùnsuànshíjiānzhīyánjiū AT zhèngyìjùn líxiànsōuxúnwikipediayǐsuōjiǎnngdyùnsuànshíjiānzhīyánjiū |
_version_ |
1718061611840700416 |