Chinese Word Segmentation Algorithm Base on Hybrid Statistical and Dictionary Methods

碩士 === 長庚大學 === 資訊工程學系 === 103 === Due to the rapid development of China, Chinese Internet daily amount of information and increase with age, the search engine has become a very important part of the for Access to knowledge, people want to find information through search engines, especially now that...

Full description

Bibliographic Details
Main Authors: Yu Tang Chang, 張郁堂
Other Authors: H. T. Chang
Format: Others
Published: 2015
Online Access:http://ndltd.ncl.edu.tw/handle/27751759058705696752
id ndltd-TW-103CGU05392009
record_format oai_dc
spelling ndltd-TW-103CGU053920092016-07-31T04:22:27Z http://ndltd.ncl.edu.tw/handle/27751759058705696752 Chinese Word Segmentation Algorithm Base on Hybrid Statistical and Dictionary Methods 基於混合統計法與詞庫法之中文斷詞演算法 Yu Tang Chang 張郁堂 碩士 長庚大學 資訊工程學系 103 Due to the rapid development of China, Chinese Internet daily amount of information and increase with age, the search engine has become a very important part of the for Access to knowledge, people want to find information through search engines, especially now that most people start use the Internet network to watch the news and get entertainment news, just to get relevant information in the search field, enter the string you want to query in search engines, in fact, these strings must through the word segmentation processing, cut the entered strings according to semantic reasonable combination, can easily get the information they want from the search engines. Currently English word segmentation technology has well-developed, however, the English differ very much on the writing and usage, resulting in complex grammatical structure of Chinese, there are ambiguous words and unknown words, leading to difficult to complete an Chinese word segmentation algorithms. We use dynamic programming algorithm as the core design a statistical method with lexicon to improve accuracy, finally, we got efficient and accurate of word segmentation. H. T. Chang 張賢宗 2015 學位論文 ; thesis 54
collection NDLTD
format Others
sources NDLTD
description 碩士 === 長庚大學 === 資訊工程學系 === 103 === Due to the rapid development of China, Chinese Internet daily amount of information and increase with age, the search engine has become a very important part of the for Access to knowledge, people want to find information through search engines, especially now that most people start use the Internet network to watch the news and get entertainment news, just to get relevant information in the search field, enter the string you want to query in search engines, in fact, these strings must through the word segmentation processing, cut the entered strings according to semantic reasonable combination, can easily get the information they want from the search engines. Currently English word segmentation technology has well-developed, however, the English differ very much on the writing and usage, resulting in complex grammatical structure of Chinese, there are ambiguous words and unknown words, leading to difficult to complete an Chinese word segmentation algorithms. We use dynamic programming algorithm as the core design a statistical method with lexicon to improve accuracy, finally, we got efficient and accurate of word segmentation.
author2 H. T. Chang
author_facet H. T. Chang
Yu Tang Chang
張郁堂
author Yu Tang Chang
張郁堂
spellingShingle Yu Tang Chang
張郁堂
Chinese Word Segmentation Algorithm Base on Hybrid Statistical and Dictionary Methods
author_sort Yu Tang Chang
title Chinese Word Segmentation Algorithm Base on Hybrid Statistical and Dictionary Methods
title_short Chinese Word Segmentation Algorithm Base on Hybrid Statistical and Dictionary Methods
title_full Chinese Word Segmentation Algorithm Base on Hybrid Statistical and Dictionary Methods
title_fullStr Chinese Word Segmentation Algorithm Base on Hybrid Statistical and Dictionary Methods
title_full_unstemmed Chinese Word Segmentation Algorithm Base on Hybrid Statistical and Dictionary Methods
title_sort chinese word segmentation algorithm base on hybrid statistical and dictionary methods
publishDate 2015
url http://ndltd.ncl.edu.tw/handle/27751759058705696752
work_keys_str_mv AT yutangchang chinesewordsegmentationalgorithmbaseonhybridstatisticalanddictionarymethods
AT zhāngyùtáng chinesewordsegmentationalgorithmbaseonhybridstatisticalanddictionarymethods
AT yutangchang jīyúhùnhétǒngjìfǎyǔcíkùfǎzhīzhōngwénduàncíyǎnsuànfǎ
AT zhāngyùtáng jīyúhùnhétǒngjìfǎyǔcíkùfǎzhīzhōngwénduàncíyǎnsuànfǎ
_version_ 1718366752368230400