Chinese Word Segmentation Algorithm Base on Hybrid Statistical and Dictionary Methods
Master's === Chang Gung University === Department of Computer Science and Information Engineering === 103 === With China's rapid development, the amount of information on the Chinese-language Internet grows every day, and search engines have become an important means of accessing knowledge. People look for information through search engines, especially now that...
Main Authors: | Yu Tang Chang 張郁堂 |
---|---|
Other Authors: | H. T. Chang 張賢宗 |
Format: | Others |
Published: | 2015 |
Online Access: | http://ndltd.ncl.edu.tw/handle/27751759058705696752 |
id |
ndltd-TW-103CGU05392009 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-103CGU05392009 2016-07-31T04:22:27Z http://ndltd.ncl.edu.tw/handle/27751759058705696752 Chinese Word Segmentation Algorithm Base on Hybrid Statistical and Dictionary Methods 基於混合統計法與詞庫法之中文斷詞演算法 Yu Tang Chang 張郁堂 Master's Chang Gung University Department of Computer Science and Information Engineering 103 With China's rapid development, the amount of information on the Chinese-language Internet grows every day, and search engines have become an important means of accessing knowledge. People look for information through search engines, especially now that most people use the Internet to read the news and follow entertainment. To find relevant information, a user enters a query string in the search field. That string must first go through word segmentation, which cuts it into semantically reasonable units, so that the search engine can easily return the information the user wants. English word segmentation technology is well developed. Chinese, however, differs greatly from English in writing and usage: its grammatical structure is complex, and ambiguous words and unknown words make a complete Chinese word segmentation algorithm difficult to build. We use a dynamic programming algorithm as the core and design a statistical method combined with a lexicon to improve accuracy, obtaining efficient and accurate word segmentation. H. T. Chang 張賢宗 2015 degree thesis ; thesis 54 |
collection |
NDLTD |
format |
Others |
sources |
NDLTD |
description |
Master's === Chang Gung University === Department of Computer Science and Information Engineering === 103 === With China's rapid development, the amount of information on the Chinese-language Internet grows every day, and search engines have become an important means of accessing knowledge. People look for information through search engines, especially now that most people use the Internet to read the news and follow entertainment. To find relevant information, a user enters a query string in the search field. That string must first go through word segmentation, which cuts it into semantically reasonable units, so that the search engine can easily return the information the user wants.
English word segmentation technology is well developed. Chinese, however, differs greatly from English in writing and usage: its grammatical structure is complex, and ambiguous words and unknown words make a complete Chinese word segmentation algorithm difficult to build.
We use a dynamic programming algorithm as the core and design a statistical method combined with a lexicon to improve accuracy, obtaining efficient and accurate word segmentation.
|
author2 |
H. T. Chang |
author_facet |
H. T. Chang Yu Tang Chang 張郁堂 |
author |
Yu Tang Chang 張郁堂 |
spellingShingle |
Yu Tang Chang 張郁堂 Chinese Word Segmentation Algorithm Base on Hybrid Statistical and Dictionary Methods |
author_sort |
Yu Tang Chang |
title |
Chinese Word Segmentation Algorithm Base on Hybrid Statistical and Dictionary Methods |
title_short |
Chinese Word Segmentation Algorithm Base on Hybrid Statistical and Dictionary Methods |
title_full |
Chinese Word Segmentation Algorithm Base on Hybrid Statistical and Dictionary Methods |
title_fullStr |
Chinese Word Segmentation Algorithm Base on Hybrid Statistical and Dictionary Methods |
title_full_unstemmed |
Chinese Word Segmentation Algorithm Base on Hybrid Statistical and Dictionary Methods |
title_sort |
chinese word segmentation algorithm base on hybrid statistical and dictionary methods |
publishDate |
2015 |
url |
http://ndltd.ncl.edu.tw/handle/27751759058705696752 |
work_keys_str_mv |
AT yutangchang chinesewordsegmentationalgorithmbaseonhybridstatisticalanddictionarymethods AT zhāngyùtáng chinesewordsegmentationalgorithmbaseonhybridstatisticalanddictionarymethods AT yutangchang jīyúhùnhétǒngjìfǎyǔcíkùfǎzhīzhōngwénduàncíyǎnsuànfǎ AT zhāngyùtáng jīyúhùnhétǒngjìfǎyǔcíkùfǎzhīzhōngwénduàncíyǎnsuànfǎ |
_version_ |
1718366752368230400 |
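The description in this record says the thesis combines a dynamic programming core with a statistical method and a lexicon, but it does not give the algorithm itself. The following is only a minimal sketch of one common way such a hybrid can work, not the author's method: a lexicon with word counts supplies unigram probabilities, and dynamic programming picks the segmentation with the highest total log-probability. The function name `segment`, the parameters `max_word_len` and `unknown_prob`, and the toy lexicon are all assumptions made for illustration.

```python
import math


def segment(text, word_freq, max_word_len=4, unknown_prob=1e-8):
    """Split text into words by maximizing the sum of unigram log-probabilities.

    word_freq is a hypothetical lexicon mapping each known word to a count.
    A single character missing from the lexicon is allowed as a fallback
    with the small probability unknown_prob, so every input can be segmented.
    """
    total = sum(word_freq.values())
    n = len(text)
    # best[i] is the best score for text[:i]; back[i] is where its last word starts.
    best = [float("-inf")] * (n + 1)
    back = [0] * (n + 1)
    best[0] = 0.0

    for end in range(1, n + 1):
        for start in range(max(0, end - max_word_len), end):
            word = text[start:end]
            if word in word_freq:
                score = math.log(word_freq[word] / total)
            elif end - start == 1:
                score = math.log(unknown_prob)  # unknown single character
            else:
                continue  # multi-character strings must come from the lexicon
            if best[start] + score > best[end]:
                best[end] = best[start] + score
                back[end] = start

    # Walk the back-pointers from the end of the text to recover the words.
    words = []
    pos = n
    while pos > 0:
        words.append(text[back[pos]:pos])
        pos = back[pos]
    return words[::-1]


# Toy lexicon with made-up counts, used only to show an ambiguous case.
toy_lexicon = {"研究": 20, "研究生": 5, "生命": 15, "命": 1, "起源": 10}
print(segment("研究生命起源", toy_lexicon))
```

With this toy lexicon the string 研究生命起源 can be read as 研究生 / 命 / 起源 or as 研究 / 生命 / 起源; the sketch prefers the second because its words have higher counts. Characters outside the lexicon fall back to single-character words, which is a simple stand-in for real unknown-word handling.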