A Research Based on Weights of Words and Intentions Analysis to Enhance the Accuracy of Search Engines
碩士 === 南台科技大學 === 資訊管理系 === 101 === Due to the vigorous development of the Internet, the amount of data around the world is rising every year. Although we use the search engines to search for information on the Internet, the search results may not be useful to users. People like to use keywords to s...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
102
|
Online Access: | http://ndltd.ncl.edu.tw/handle/45992842858957414748 |
id |
ndltd-TW-101STUT8396013 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-101STUT83960132015-10-13T23:10:33Z http://ndltd.ncl.edu.tw/handle/45992842858957414748 A Research Based on Weights of Words and Intentions Analysis to Enhance the Accuracy of Search Engines 運用詞彙權重與意圖分析技術提升搜尋引擎精準度之研究 You-Hong SYU 許佑鴻 碩士 南台科技大學 資訊管理系 101 Due to the vigorous development of the Internet, the amount of data around the world is rising every year. Although we use the search engines to search for information on the Internet, the search results may not be useful to users. People like to use keywords to search in traditional search engines, but the search results do not meet the needs of users. Therefore, in this thesis we propose a search engine platform to distinguish the intentions of the users in order to help users to find the information they need. We mainly develop a search engine platform which integrates with intentions analysis to improve accuracy of search results by using a web mining and a Chinese word segmentation technique. In this thesis a system with Chinese word segmentation is employed CKIP (Chinese Knowledge Information processing) system for conducting system design and research. In order to improve the accuracy of searching, we use the 5W1H to analyze searched strings to understand the intentions of users and use the TF-IDF weights values of keywords and similar questions as the two-stage weight ranking for each web page. The experimental results indicate that the proposed approach outperform a significant improvement on the accuracy of search engines. Jen-peng Huang 黃仁鵬 102 學位論文 ; thesis 121 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 南台科技大學 === 資訊管理系 === 101 === Due to the vigorous development of the Internet, the amount of data around the world is rising every year. Although we use the search engines to search for information on the Internet, the search results may not be useful to users. People like to use keywords to search in traditional search engines, but the search results do not meet the needs of users. Therefore, in this thesis we propose a search engine platform to distinguish the intentions of the users in order to help users to find the information they need.
We mainly develop a search engine platform which integrates with intentions analysis to improve accuracy of search results by using a web mining and a Chinese word segmentation technique. In this thesis a system with Chinese word segmentation is employed CKIP (Chinese Knowledge Information processing) system for conducting system design and research. In order to improve the accuracy of searching, we use the 5W1H to analyze searched strings to understand the intentions of users and use the TF-IDF weights values of keywords and similar questions as the two-stage weight ranking for each web page. The experimental results indicate that the proposed approach outperform a significant improvement on the accuracy of search engines.
|
author2 |
Jen-peng Huang |
author_facet |
Jen-peng Huang You-Hong SYU 許佑鴻 |
author |
You-Hong SYU 許佑鴻 |
spellingShingle |
You-Hong SYU 許佑鴻 A Research Based on Weights of Words and Intentions Analysis to Enhance the Accuracy of Search Engines |
author_sort |
You-Hong SYU |
title |
A Research Based on Weights of Words and Intentions Analysis to Enhance the Accuracy of Search Engines |
title_short |
A Research Based on Weights of Words and Intentions Analysis to Enhance the Accuracy of Search Engines |
title_full |
A Research Based on Weights of Words and Intentions Analysis to Enhance the Accuracy of Search Engines |
title_fullStr |
A Research Based on Weights of Words and Intentions Analysis to Enhance the Accuracy of Search Engines |
title_full_unstemmed |
A Research Based on Weights of Words and Intentions Analysis to Enhance the Accuracy of Search Engines |
title_sort |
research based on weights of words and intentions analysis to enhance the accuracy of search engines |
publishDate |
102 |
url |
http://ndltd.ncl.edu.tw/handle/45992842858957414748 |
work_keys_str_mv |
AT youhongsyu aresearchbasedonweightsofwordsandintentionsanalysistoenhancetheaccuracyofsearchengines AT xǔyòuhóng aresearchbasedonweightsofwordsandintentionsanalysistoenhancetheaccuracyofsearchengines AT youhongsyu yùnyòngcíhuìquánzhòngyǔyìtúfēnxījìshùtíshēngsōuxúnyǐnqíngjīngzhǔndùzhīyánjiū AT xǔyòuhóng yùnyòngcíhuìquánzhòngyǔyìtúfēnxījìshùtíshēngsōuxúnyǐnqíngjīngzhǔndùzhīyánjiū AT youhongsyu researchbasedonweightsofwordsandintentionsanalysistoenhancetheaccuracyofsearchengines AT xǔyòuhóng researchbasedonweightsofwordsandintentionsanalysistoenhancetheaccuracyofsearchengines |
_version_ |
1718084696654479360 |