Chinese Word Segmentation Based on Self‐Learning Model and Geological Knowledge for the Geoscience Domain

Abstract Chinese word segmentation (CWS) is the foundational work of geological report text mining and has an important influence on various tasks, such as named entity recognition and relation extraction. In recent years, the accuracy of the domain‐general CWS model has been limited by the domain a...

Full description

Bibliographic Details
Main Authors: Wenjia Li, Kai Ma, Qinjun Qiu, Liang Wu, Zhong Xie, Sanfeng Li, Siqiong Chen
Format: Article
Language:English
Published: American Geophysical Union (AGU) 2021-06-01
Series:Earth and Space Science
Subjects:
Online Access:https://doi.org/10.1029/2021EA001673