A Set Space Model to Capture Structural Information of a Sentence
The context of a sentence is composed of a limited number of words. This leads to the feature sparsity problem whereby the sentence's meaning is easily influenced by language phenomena such as polysemy, ambiguity and puns. To resolve these problems, the set space model (SSM) uses language chara...
Main Authors: | , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2019-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/8853305/ |
id |
doaj-af1a96cb626c4da18d73db45e0b3eb6f |
---|---|
record_format |
Article |
spelling |
doaj-af1a96cb626c4da18d73db45e0b3eb6f2021-03-29T23:54:58ZengIEEEIEEE Access2169-35362019-01-01714251514253010.1109/ACCESS.2019.29445598853305A Set Space Model to Capture Structural Information of a SentenceYanping Chen0Guorong Wang1https://orcid.org/0000-0002-3195-7775Qinghua Zheng2Yongbin Qin3Ruizhang Huang4Ping Chen5College of Computer Science and Technology, Guizhou University, Guizhou, ChinaCollege of Computer Science and Technology, Guizhou University, Guizhou, ChinaDepartment of Computer Science and Technology, Xi’an Jiaotong University, Xi’an, ChinaCollege of Computer Science and Technology, Guizhou University, Guizhou, ChinaCollege of Computer Science and Technology, Guizhou University, Guizhou, ChinaDepartment of Computer Science, University of Massachusetts Boston, Boston, MA, USAThe context of a sentence is composed of a limited number of words. This leads to the feature sparsity problem whereby the sentence's meaning is easily influenced by language phenomena such as polysemy, ambiguity and puns. To resolve these problems, the set space model (SSM) uses language characteristics to group features of a sentence into different sets. Afterwards, the proposed feature calculus is used to capture the structural information of the sentence. Experiments have shown that this approach to the relation recognition task is effective. However, at least three weaknesses remain. First, due to the lack of a probabilistic explanation, several aspects of SSM (e.g., filter selection) have not yet been covered. Second, the existing studies have only provided an outline of SSM, and many issues remain unclear. To understand this approach, it is necessary to discuss a suitable example in detail. Third, SSM has been applied only to the task of relation recognition. Case studies of more typical topics (e.g., named entity recognition) will help illustrate the use of SSM's methodology to manipulate features. This paper develops SSM to cover these problems. It describes a systematic and novel approach to manipulating features of a sentence. In the experimental part, two typical information extraction tasks are performed to demonstrate SSM's capabilities. Two case studies are considered, and favorable improvements are observed. All of the obtained results surpass those of compared approaches. The experiments also show the influence of sentence structural information on information extraction.https://ieeexplore.ieee.org/document/8853305/Set space modelinformation extraction |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Yanping Chen Guorong Wang Qinghua Zheng Yongbin Qin Ruizhang Huang Ping Chen |
spellingShingle |
Yanping Chen Guorong Wang Qinghua Zheng Yongbin Qin Ruizhang Huang Ping Chen A Set Space Model to Capture Structural Information of a Sentence IEEE Access Set space model information extraction |
author_facet |
Yanping Chen Guorong Wang Qinghua Zheng Yongbin Qin Ruizhang Huang Ping Chen |
author_sort |
Yanping Chen |
title |
A Set Space Model to Capture Structural Information of a Sentence |
title_short |
A Set Space Model to Capture Structural Information of a Sentence |
title_full |
A Set Space Model to Capture Structural Information of a Sentence |
title_fullStr |
A Set Space Model to Capture Structural Information of a Sentence |
title_full_unstemmed |
A Set Space Model to Capture Structural Information of a Sentence |
title_sort |
set space model to capture structural information of a sentence |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2019-01-01 |
description |
The context of a sentence is composed of a limited number of words. This leads to the feature sparsity problem whereby the sentence's meaning is easily influenced by language phenomena such as polysemy, ambiguity and puns. To resolve these problems, the set space model (SSM) uses language characteristics to group features of a sentence into different sets. Afterwards, the proposed feature calculus is used to capture the structural information of the sentence. Experiments have shown that this approach to the relation recognition task is effective. However, at least three weaknesses remain. First, due to the lack of a probabilistic explanation, several aspects of SSM (e.g., filter selection) have not yet been covered. Second, the existing studies have only provided an outline of SSM, and many issues remain unclear. To understand this approach, it is necessary to discuss a suitable example in detail. Third, SSM has been applied only to the task of relation recognition. Case studies of more typical topics (e.g., named entity recognition) will help illustrate the use of SSM's methodology to manipulate features. This paper develops SSM to cover these problems. It describes a systematic and novel approach to manipulating features of a sentence. In the experimental part, two typical information extraction tasks are performed to demonstrate SSM's capabilities. Two case studies are considered, and favorable improvements are observed. All of the obtained results surpass those of compared approaches. The experiments also show the influence of sentence structural information on information extraction. |
topic |
Set space model information extraction |
url |
https://ieeexplore.ieee.org/document/8853305/ |
work_keys_str_mv |
AT yanpingchen asetspacemodeltocapturestructuralinformationofasentence AT guorongwang asetspacemodeltocapturestructuralinformationofasentence AT qinghuazheng asetspacemodeltocapturestructuralinformationofasentence AT yongbinqin asetspacemodeltocapturestructuralinformationofasentence AT ruizhanghuang asetspacemodeltocapturestructuralinformationofasentence AT pingchen asetspacemodeltocapturestructuralinformationofasentence AT yanpingchen setspacemodeltocapturestructuralinformationofasentence AT guorongwang setspacemodeltocapturestructuralinformationofasentence AT qinghuazheng setspacemodeltocapturestructuralinformationofasentence AT yongbinqin setspacemodeltocapturestructuralinformationofasentence AT ruizhanghuang setspacemodeltocapturestructuralinformationofasentence AT pingchen setspacemodeltocapturestructuralinformationofasentence |
_version_ |
1724188907328765952 |