A Set Space Model to Capture Structural Information of a Sentence

The context of a sentence is composed of a limited number of words. This leads to the feature sparsity problem whereby the sentence's meaning is easily influenced by language phenomena such as polysemy, ambiguity and puns. To resolve these problems, the set space model (SSM) uses language chara...

Full description

Bibliographic Details
Main Authors: Yanping Chen, Guorong Wang, Qinghua Zheng, Yongbin Qin, Ruizhang Huang, Ping Chen
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8853305/
id doaj-af1a96cb626c4da18d73db45e0b3eb6f
record_format Article
spelling doaj-af1a96cb626c4da18d73db45e0b3eb6f2021-03-29T23:54:58ZengIEEEIEEE Access2169-35362019-01-01714251514253010.1109/ACCESS.2019.29445598853305A Set Space Model to Capture Structural Information of a SentenceYanping Chen0Guorong Wang1https://orcid.org/0000-0002-3195-7775Qinghua Zheng2Yongbin Qin3Ruizhang Huang4Ping Chen5College of Computer Science and Technology, Guizhou University, Guizhou, ChinaCollege of Computer Science and Technology, Guizhou University, Guizhou, ChinaDepartment of Computer Science and Technology, Xi’an Jiaotong University, Xi’an, ChinaCollege of Computer Science and Technology, Guizhou University, Guizhou, ChinaCollege of Computer Science and Technology, Guizhou University, Guizhou, ChinaDepartment of Computer Science, University of Massachusetts Boston, Boston, MA, USAThe context of a sentence is composed of a limited number of words. This leads to the feature sparsity problem whereby the sentence's meaning is easily influenced by language phenomena such as polysemy, ambiguity and puns. To resolve these problems, the set space model (SSM) uses language characteristics to group features of a sentence into different sets. Afterwards, the proposed feature calculus is used to capture the structural information of the sentence. Experiments have shown that this approach to the relation recognition task is effective. However, at least three weaknesses remain. First, due to the lack of a probabilistic explanation, several aspects of SSM (e.g., filter selection) have not yet been covered. Second, the existing studies have only provided an outline of SSM, and many issues remain unclear. To understand this approach, it is necessary to discuss a suitable example in detail. Third, SSM has been applied only to the task of relation recognition. Case studies of more typical topics (e.g., named entity recognition) will help illustrate the use of SSM's methodology to manipulate features. This paper develops SSM to cover these problems. It describes a systematic and novel approach to manipulating features of a sentence. In the experimental part, two typical information extraction tasks are performed to demonstrate SSM's capabilities. Two case studies are considered, and favorable improvements are observed. All of the obtained results surpass those of compared approaches. The experiments also show the influence of sentence structural information on information extraction.https://ieeexplore.ieee.org/document/8853305/Set space modelinformation extraction
collection DOAJ
language English
format Article
sources DOAJ
author Yanping Chen
Guorong Wang
Qinghua Zheng
Yongbin Qin
Ruizhang Huang
Ping Chen
spellingShingle Yanping Chen
Guorong Wang
Qinghua Zheng
Yongbin Qin
Ruizhang Huang
Ping Chen
A Set Space Model to Capture Structural Information of a Sentence
IEEE Access
Set space model
information extraction
author_facet Yanping Chen
Guorong Wang
Qinghua Zheng
Yongbin Qin
Ruizhang Huang
Ping Chen
author_sort Yanping Chen
title A Set Space Model to Capture Structural Information of a Sentence
title_short A Set Space Model to Capture Structural Information of a Sentence
title_full A Set Space Model to Capture Structural Information of a Sentence
title_fullStr A Set Space Model to Capture Structural Information of a Sentence
title_full_unstemmed A Set Space Model to Capture Structural Information of a Sentence
title_sort set space model to capture structural information of a sentence
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2019-01-01
description The context of a sentence is composed of a limited number of words. This leads to the feature sparsity problem whereby the sentence's meaning is easily influenced by language phenomena such as polysemy, ambiguity and puns. To resolve these problems, the set space model (SSM) uses language characteristics to group features of a sentence into different sets. Afterwards, the proposed feature calculus is used to capture the structural information of the sentence. Experiments have shown that this approach to the relation recognition task is effective. However, at least three weaknesses remain. First, due to the lack of a probabilistic explanation, several aspects of SSM (e.g., filter selection) have not yet been covered. Second, the existing studies have only provided an outline of SSM, and many issues remain unclear. To understand this approach, it is necessary to discuss a suitable example in detail. Third, SSM has been applied only to the task of relation recognition. Case studies of more typical topics (e.g., named entity recognition) will help illustrate the use of SSM's methodology to manipulate features. This paper develops SSM to cover these problems. It describes a systematic and novel approach to manipulating features of a sentence. In the experimental part, two typical information extraction tasks are performed to demonstrate SSM's capabilities. Two case studies are considered, and favorable improvements are observed. All of the obtained results surpass those of compared approaches. The experiments also show the influence of sentence structural information on information extraction.
topic Set space model
information extraction
url https://ieeexplore.ieee.org/document/8853305/
work_keys_str_mv AT yanpingchen asetspacemodeltocapturestructuralinformationofasentence
AT guorongwang asetspacemodeltocapturestructuralinformationofasentence
AT qinghuazheng asetspacemodeltocapturestructuralinformationofasentence
AT yongbinqin asetspacemodeltocapturestructuralinformationofasentence
AT ruizhanghuang asetspacemodeltocapturestructuralinformationofasentence
AT pingchen asetspacemodeltocapturestructuralinformationofasentence
AT yanpingchen setspacemodeltocapturestructuralinformationofasentence
AT guorongwang setspacemodeltocapturestructuralinformationofasentence
AT qinghuazheng setspacemodeltocapturestructuralinformationofasentence
AT yongbinqin setspacemodeltocapturestructuralinformationofasentence
AT ruizhanghuang setspacemodeltocapturestructuralinformationofasentence
AT pingchen setspacemodeltocapturestructuralinformationofasentence
_version_ 1724188907328765952