Segmenting Chinese Texts into Words for Semantic Network Analysis

Unlike most languages, written Chinese has no spaces between words. Word segmentation must be performed before semantic network analysis can be conducted. This paper describes how to perform Chinese word segmentation using the Stanford Natural Language Processing group’s Stanford Word Segmenter v. 3...

Full description

Bibliographic Details
Main Author: James A. Danowski
Format: Article
Language:English
Published: World Association for Triple Helix and Future Strategy Studies 2017-12-01
Series:Journal of Contemporary Eastern Asia
Subjects:
Online Access:http://koreascience.or.kr/article/JAKO201706163144730.pdf
Description
Summary:Unlike most languages, written Chinese has no spaces between words. Word segmentation must be performed before semantic network analysis can be conducted. This paper describes how to perform Chinese word segmentation using the Stanford Natural Language Processing group’s Stanford Word Segmenter v. 3.8.0, released in June 2017.
ISSN:2383-9449