SeqWORDS: an unsupervised Chinese segmentation method using relationship of two consecutive words.
碩士 === 國立政治大學 === 統計學系 === 107 === Unlike alphabet-based language, there exists no space between words in Chinese corpus. The first step in Chinese text mining is to segment words in a sentence. Many existing segmentation methods are supervised in terms of requiring an adequate dictionary. However,...
Main Authors: | Wu, Guan-Hui, 吳冠輝 |
---|---|
Other Authors: | Hsueh, Huey-Miin |
Format: | Others |
Language: | zh-TW |
Published: |
2019
|
Online Access: | http://ndltd.ncl.edu.tw/handle/5asc88 |
Similar Items
-
The SeqWord Genome Browser: an online tool for the identification and visualization of atypical regions of bacterial genomes through oligonucleotide usage
by: Tümmler Burkhard, et al.
Published: (2008-08-01) -
A Lexicon-Corpus-based Unsupervised Chinese Word Segmentation Approach
by: Lu Pengyu, et al.
Published: (2014-03-01) -
Proper Noun Identification Using Machine Learning – Elevate the Performance of Chinese Word Segmentation
by: Wu, Guan-Yi, et al.
Published: (2018) -
Unsupervised Word Sense Disambiguation Using Word Embeddings
by: Behzad Moradi, et al.
Published: (2019-11-01) -
Unsupervised pattern discovery in speech : applications to word acquisition and speaker segmentation
by: Park, Alex S. (Alex Seungryong), 1979-
Published: (2007)