Automatic Construction and Global Optimization of a Multisentiment Lexicon

Manual annotation of sentiment lexicons costs too much labor and time, and it is also difficult to get accurate quantification of emotional intensity. Besides, the excessive emphasis on one specific field has greatly limited the applicability of domain sentiment lexicons (Wang et al., 2010). This pa...

Full description

Bibliographic Details
Main Authors: Xiaoping Yang, Zhongxia Zhang, Zhongqiu Zhang, Yuting Mo, Lianbei Li, Li Yu, Peican Zhu
Format: Article
Language:English
Published: Hindawi Limited 2016-01-01
Series:Computational Intelligence and Neuroscience
Online Access:http://dx.doi.org/10.1155/2016/2093406
id doaj-0d05a4398f974973bcedb83219e1ce47
record_format Article
spelling doaj-0d05a4398f974973bcedb83219e1ce472020-11-25T01:06:47ZengHindawi LimitedComputational Intelligence and Neuroscience1687-52651687-52732016-01-01201610.1155/2016/20934062093406Automatic Construction and Global Optimization of a Multisentiment LexiconXiaoping Yang0Zhongxia Zhang1Zhongqiu Zhang2Yuting Mo3Lianbei Li4Li Yu5Peican Zhu6School of Information, Renmin University of China, Beijing 100872, ChinaSchool of Information, Renmin University of China, Beijing 100872, ChinaSchool of Computer Science, Northeastern University, Shenyang 110819, ChinaSchool of Information, Renmin University of China, Beijing 100872, ChinaSchool of Information, Renmin University of China, Beijing 100872, ChinaSchool of Information, Renmin University of China, Beijing 100872, ChinaSchool of Computer Science and Technology, Northwestern Polytechnical University, Xi’an 710129, ChinaManual annotation of sentiment lexicons costs too much labor and time, and it is also difficult to get accurate quantification of emotional intensity. Besides, the excessive emphasis on one specific field has greatly limited the applicability of domain sentiment lexicons (Wang et al., 2010). This paper implements statistical training for large-scale Chinese corpus through neural network language model and proposes an automatic method of constructing a multidimensional sentiment lexicon based on constraints of coordinate offset. In order to distinguish the sentiment polarities of those words which may express either positive or negative meanings in different contexts, we further present a sentiment disambiguation algorithm to increase the flexibility of our lexicon. Lastly, we present a global optimization framework that provides a unified way to combine several human-annotated resources for learning our 10-dimensional sentiment lexicon SentiRuc. Experiments show the superior performance of SentiRuc lexicon in category labeling test, intensity labeling test, and sentiment classification tasks. It is worth mentioning that, in intensity label test, SentiRuc outperforms the second place by 21 percent.http://dx.doi.org/10.1155/2016/2093406
collection DOAJ
language English
format Article
sources DOAJ
author Xiaoping Yang
Zhongxia Zhang
Zhongqiu Zhang
Yuting Mo
Lianbei Li
Li Yu
Peican Zhu
spellingShingle Xiaoping Yang
Zhongxia Zhang
Zhongqiu Zhang
Yuting Mo
Lianbei Li
Li Yu
Peican Zhu
Automatic Construction and Global Optimization of a Multisentiment Lexicon
Computational Intelligence and Neuroscience
author_facet Xiaoping Yang
Zhongxia Zhang
Zhongqiu Zhang
Yuting Mo
Lianbei Li
Li Yu
Peican Zhu
author_sort Xiaoping Yang
title Automatic Construction and Global Optimization of a Multisentiment Lexicon
title_short Automatic Construction and Global Optimization of a Multisentiment Lexicon
title_full Automatic Construction and Global Optimization of a Multisentiment Lexicon
title_fullStr Automatic Construction and Global Optimization of a Multisentiment Lexicon
title_full_unstemmed Automatic Construction and Global Optimization of a Multisentiment Lexicon
title_sort automatic construction and global optimization of a multisentiment lexicon
publisher Hindawi Limited
series Computational Intelligence and Neuroscience
issn 1687-5265
1687-5273
publishDate 2016-01-01
description Manual annotation of sentiment lexicons costs too much labor and time, and it is also difficult to get accurate quantification of emotional intensity. Besides, the excessive emphasis on one specific field has greatly limited the applicability of domain sentiment lexicons (Wang et al., 2010). This paper implements statistical training for large-scale Chinese corpus through neural network language model and proposes an automatic method of constructing a multidimensional sentiment lexicon based on constraints of coordinate offset. In order to distinguish the sentiment polarities of those words which may express either positive or negative meanings in different contexts, we further present a sentiment disambiguation algorithm to increase the flexibility of our lexicon. Lastly, we present a global optimization framework that provides a unified way to combine several human-annotated resources for learning our 10-dimensional sentiment lexicon SentiRuc. Experiments show the superior performance of SentiRuc lexicon in category labeling test, intensity labeling test, and sentiment classification tasks. It is worth mentioning that, in intensity label test, SentiRuc outperforms the second place by 21 percent.
url http://dx.doi.org/10.1155/2016/2093406
work_keys_str_mv AT xiaopingyang automaticconstructionandglobaloptimizationofamultisentimentlexicon
AT zhongxiazhang automaticconstructionandglobaloptimizationofamultisentimentlexicon
AT zhongqiuzhang automaticconstructionandglobaloptimizationofamultisentimentlexicon
AT yutingmo automaticconstructionandglobaloptimizationofamultisentimentlexicon
AT lianbeili automaticconstructionandglobaloptimizationofamultisentimentlexicon
AT liyu automaticconstructionandglobaloptimizationofamultisentimentlexicon
AT peicanzhu automaticconstructionandglobaloptimizationofamultisentimentlexicon
_version_ 1725188216606162944