Chinese phrase segmentation method of Information Retrieval and Search Engine

碩士 === 樹德科技大學 === 資訊工程學系 === 96 === For most people, the techniques of search engine are both familiar and strange. It is familiar because people keep using it in the network activity. The well-known technology of search engine lets many people research to improve it. But only few people knew how to...

Full description

Bibliographic Details
Main Authors: Dewei Yen, 焉德葳
Other Authors: Chao-Kuei Hung
Format: Others
Language:zh-TW
Published: 2008
Online Access:http://ndltd.ncl.edu.tw/handle/20227612775838533643
id ndltd-TW-096STU00392019
record_format oai_dc
spelling ndltd-TW-096STU003920192016-05-16T04:10:39Z http://ndltd.ncl.edu.tw/handle/20227612775838533643 Chinese phrase segmentation method of Information Retrieval and Search Engine 搜尋引擎與資訊索引中文斷詞方法 Dewei Yen 焉德葳 碩士 樹德科技大學 資訊工程學系 96 For most people, the techniques of search engine are both familiar and strange. It is familiar because people keep using it in the network activity. The well-known technology of search engine lets many people research to improve it. But only few people knew how to establish a search engine. This paper tries to explain the technology of search engine by graphs and examples. These researches present the details of each part by actually creating an open source search engine “Ozearch” as example. This paper also presents an algorithm for segmenting Chinese phrases. It utilizes both the N-gram algorithm and the word-based algorithm to improve precision and recall of the search engine. In this paper, we also find few defect of segmenting Chinese phrases for now and presents workable method to improve it. Chao-Kuei Hung Yu-Chang Chen 洪朝貴 陳毓璋 2008 學位論文 ; thesis 72 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 樹德科技大學 === 資訊工程學系 === 96 === For most people, the techniques of search engine are both familiar and strange. It is familiar because people keep using it in the network activity. The well-known technology of search engine lets many people research to improve it. But only few people knew how to establish a search engine. This paper tries to explain the technology of search engine by graphs and examples. These researches present the details of each part by actually creating an open source search engine “Ozearch” as example. This paper also presents an algorithm for segmenting Chinese phrases. It utilizes both the N-gram algorithm and the word-based algorithm to improve precision and recall of the search engine. In this paper, we also find few defect of segmenting Chinese phrases for now and presents workable method to improve it.
author2 Chao-Kuei Hung
author_facet Chao-Kuei Hung
Dewei Yen
焉德葳
author Dewei Yen
焉德葳
spellingShingle Dewei Yen
焉德葳
Chinese phrase segmentation method of Information Retrieval and Search Engine
author_sort Dewei Yen
title Chinese phrase segmentation method of Information Retrieval and Search Engine
title_short Chinese phrase segmentation method of Information Retrieval and Search Engine
title_full Chinese phrase segmentation method of Information Retrieval and Search Engine
title_fullStr Chinese phrase segmentation method of Information Retrieval and Search Engine
title_full_unstemmed Chinese phrase segmentation method of Information Retrieval and Search Engine
title_sort chinese phrase segmentation method of information retrieval and search engine
publishDate 2008
url http://ndltd.ncl.edu.tw/handle/20227612775838533643
work_keys_str_mv AT deweiyen chinesephrasesegmentationmethodofinformationretrievalandsearchengine
AT yāndéwēi chinesephrasesegmentationmethodofinformationretrievalandsearchengine
AT deweiyen sōuxúnyǐnqíngyǔzīxùnsuǒyǐnzhōngwénduàncífāngfǎ
AT yāndéwēi sōuxúnyǐnqíngyǔzīxùnsuǒyǐnzhōngwénduàncífāngfǎ
_version_ 1718269818761641984