Automatic Construction and Global Optimization of a Multisentiment Lexicon
Manual annotation of sentiment lexicons costs too much labor and time, and it is also difficult to get accurate quantification of emotional intensity. Besides, the excessive emphasis on one specific field has greatly limited the applicability of domain sentiment lexicons (Wang et al., 2010). This pa...
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Hindawi Limited
2016-01-01
|
Series: | Computational Intelligence and Neuroscience |
Online Access: | http://dx.doi.org/10.1155/2016/2093406 |
id |
doaj-0d05a4398f974973bcedb83219e1ce47 |
---|---|
record_format |
Article |
spelling |
doaj-0d05a4398f974973bcedb83219e1ce472020-11-25T01:06:47ZengHindawi LimitedComputational Intelligence and Neuroscience1687-52651687-52732016-01-01201610.1155/2016/20934062093406Automatic Construction and Global Optimization of a Multisentiment LexiconXiaoping Yang0Zhongxia Zhang1Zhongqiu Zhang2Yuting Mo3Lianbei Li4Li Yu5Peican Zhu6School of Information, Renmin University of China, Beijing 100872, ChinaSchool of Information, Renmin University of China, Beijing 100872, ChinaSchool of Computer Science, Northeastern University, Shenyang 110819, ChinaSchool of Information, Renmin University of China, Beijing 100872, ChinaSchool of Information, Renmin University of China, Beijing 100872, ChinaSchool of Information, Renmin University of China, Beijing 100872, ChinaSchool of Computer Science and Technology, Northwestern Polytechnical University, Xi’an 710129, ChinaManual annotation of sentiment lexicons costs too much labor and time, and it is also difficult to get accurate quantification of emotional intensity. Besides, the excessive emphasis on one specific field has greatly limited the applicability of domain sentiment lexicons (Wang et al., 2010). This paper implements statistical training for large-scale Chinese corpus through neural network language model and proposes an automatic method of constructing a multidimensional sentiment lexicon based on constraints of coordinate offset. In order to distinguish the sentiment polarities of those words which may express either positive or negative meanings in different contexts, we further present a sentiment disambiguation algorithm to increase the flexibility of our lexicon. Lastly, we present a global optimization framework that provides a unified way to combine several human-annotated resources for learning our 10-dimensional sentiment lexicon SentiRuc. Experiments show the superior performance of SentiRuc lexicon in category labeling test, intensity labeling test, and sentiment classification tasks. It is worth mentioning that, in intensity label test, SentiRuc outperforms the second place by 21 percent.http://dx.doi.org/10.1155/2016/2093406 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Xiaoping Yang Zhongxia Zhang Zhongqiu Zhang Yuting Mo Lianbei Li Li Yu Peican Zhu |
spellingShingle |
Xiaoping Yang Zhongxia Zhang Zhongqiu Zhang Yuting Mo Lianbei Li Li Yu Peican Zhu Automatic Construction and Global Optimization of a Multisentiment Lexicon Computational Intelligence and Neuroscience |
author_facet |
Xiaoping Yang Zhongxia Zhang Zhongqiu Zhang Yuting Mo Lianbei Li Li Yu Peican Zhu |
author_sort |
Xiaoping Yang |
title |
Automatic Construction and Global Optimization of a Multisentiment Lexicon |
title_short |
Automatic Construction and Global Optimization of a Multisentiment Lexicon |
title_full |
Automatic Construction and Global Optimization of a Multisentiment Lexicon |
title_fullStr |
Automatic Construction and Global Optimization of a Multisentiment Lexicon |
title_full_unstemmed |
Automatic Construction and Global Optimization of a Multisentiment Lexicon |
title_sort |
automatic construction and global optimization of a multisentiment lexicon |
publisher |
Hindawi Limited |
series |
Computational Intelligence and Neuroscience |
issn |
1687-5265 1687-5273 |
publishDate |
2016-01-01 |
description |
Manual annotation of sentiment lexicons costs too much labor and time, and it is also difficult to get accurate quantification of emotional intensity. Besides, the excessive emphasis on one specific field has greatly limited the applicability of domain sentiment lexicons (Wang et al., 2010). This paper implements statistical training for large-scale Chinese corpus through neural network language model and proposes an automatic method of constructing a multidimensional sentiment lexicon based on constraints of coordinate offset. In order to distinguish the sentiment polarities of those words which may express either positive or negative meanings in different contexts, we further present a sentiment disambiguation algorithm to increase the flexibility of our lexicon. Lastly, we present a global optimization framework that provides a unified way to combine several human-annotated resources for learning our 10-dimensional sentiment lexicon SentiRuc. Experiments show the superior performance of SentiRuc lexicon in category labeling test, intensity labeling test, and sentiment classification tasks. It is worth mentioning that, in intensity label test, SentiRuc outperforms the second place by 21 percent. |
url |
http://dx.doi.org/10.1155/2016/2093406 |
work_keys_str_mv |
AT xiaopingyang automaticconstructionandglobaloptimizationofamultisentimentlexicon AT zhongxiazhang automaticconstructionandglobaloptimizationofamultisentimentlexicon AT zhongqiuzhang automaticconstructionandglobaloptimizationofamultisentimentlexicon AT yutingmo automaticconstructionandglobaloptimizationofamultisentimentlexicon AT lianbeili automaticconstructionandglobaloptimizationofamultisentimentlexicon AT liyu automaticconstructionandglobaloptimizationofamultisentimentlexicon AT peicanzhu automaticconstructionandglobaloptimizationofamultisentimentlexicon |
_version_ |
1725188216606162944 |