Chinese phrase segmentation method of Information Retrieval and Search Engine
碩士 === 樹德科技大學 === 資訊工程學系 === 96 === For most people, the techniques of search engine are both familiar and strange. It is familiar because people keep using it in the network activity. The well-known technology of search engine lets many people research to improve it. But only few people knew how to...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2008
|
Online Access: | http://ndltd.ncl.edu.tw/handle/20227612775838533643 |
id |
ndltd-TW-096STU00392019 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-096STU003920192016-05-16T04:10:39Z http://ndltd.ncl.edu.tw/handle/20227612775838533643 Chinese phrase segmentation method of Information Retrieval and Search Engine 搜尋引擎與資訊索引中文斷詞方法 Dewei Yen 焉德葳 碩士 樹德科技大學 資訊工程學系 96 For most people, the techniques of search engine are both familiar and strange. It is familiar because people keep using it in the network activity. The well-known technology of search engine lets many people research to improve it. But only few people knew how to establish a search engine. This paper tries to explain the technology of search engine by graphs and examples. These researches present the details of each part by actually creating an open source search engine “Ozearch” as example. This paper also presents an algorithm for segmenting Chinese phrases. It utilizes both the N-gram algorithm and the word-based algorithm to improve precision and recall of the search engine. In this paper, we also find few defect of segmenting Chinese phrases for now and presents workable method to improve it. Chao-Kuei Hung Yu-Chang Chen 洪朝貴 陳毓璋 2008 學位論文 ; thesis 72 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 樹德科技大學 === 資訊工程學系 === 96 === For most people, the techniques of search engine are both familiar and strange. It is familiar because people keep using it in the network activity. The well-known technology of search engine lets many people research to improve it. But only few people knew how to establish a search engine. This paper tries to explain the technology of search engine by graphs and examples. These researches present the details of each part by actually creating an open source search engine “Ozearch” as example.
This paper also presents an algorithm for segmenting Chinese phrases. It utilizes both the N-gram algorithm and the word-based algorithm to improve precision and recall of the search engine. In this paper, we also find few defect of segmenting Chinese phrases for now and presents workable method to improve it.
|
author2 |
Chao-Kuei Hung |
author_facet |
Chao-Kuei Hung Dewei Yen 焉德葳 |
author |
Dewei Yen 焉德葳 |
spellingShingle |
Dewei Yen 焉德葳 Chinese phrase segmentation method of Information Retrieval and Search Engine |
author_sort |
Dewei Yen |
title |
Chinese phrase segmentation method of Information Retrieval and Search Engine |
title_short |
Chinese phrase segmentation method of Information Retrieval and Search Engine |
title_full |
Chinese phrase segmentation method of Information Retrieval and Search Engine |
title_fullStr |
Chinese phrase segmentation method of Information Retrieval and Search Engine |
title_full_unstemmed |
Chinese phrase segmentation method of Information Retrieval and Search Engine |
title_sort |
chinese phrase segmentation method of information retrieval and search engine |
publishDate |
2008 |
url |
http://ndltd.ncl.edu.tw/handle/20227612775838533643 |
work_keys_str_mv |
AT deweiyen chinesephrasesegmentationmethodofinformationretrievalandsearchengine AT yāndéwēi chinesephrasesegmentationmethodofinformationretrievalandsearchengine AT deweiyen sōuxúnyǐnqíngyǔzīxùnsuǒyǐnzhōngwénduàncífāngfǎ AT yāndéwēi sōuxúnyǐnqíngyǔzīxùnsuǒyǐnzhōngwénduàncífāngfǎ |
_version_ |
1718269818761641984 |