A Self-Adaptive Hidden Markov Model for Emotion Classification in Chinese Microblogs
Microblogging is increasingly becoming one of the most popular online social media for people to express ideas and emotions. The amount of socially generated content from this medium is enormous. Text mining techniques have been intensively applied to discover the hidden knowledge and emotions from...
Main Authors: | Li Liu, Dashi Luo, Ming Liu, Jun Zhong, Ye Wei, Letian Sun |
---|---|
Format: | Article |
Language: | English |
Published: | Hindawi Limited, 2015-01-01 |
Series: | Mathematical Problems in Engineering |
Online Access: | http://dx.doi.org/10.1155/2015/987189 |
id |
doaj-691ae2bcdad34739994f320aee40e148 |
---|---|
record_format |
Article |
spelling |
doaj-691ae2bcdad34739994f320aee40e148; 2020-11-24T22:26:54Z; eng; Hindawi Limited; Mathematical Problems in Engineering; 1024-123X; 1563-5147; 2015-01-01; 2015; 10.1155/2015/987189; 987189; A Self-Adaptive Hidden Markov Model for Emotion Classification in Chinese Microblogs; Li Liu (0); Dashi Luo (1); Ming Liu (2); Jun Zhong (3); Ye Wei (4); Letian Sun (5); School of Software Engineering, Chongqing University, Chongqing 401331, China; School of Information Science and Engineering, Lanzhou University, Lanzhou 730000, China; Faculty of Computer and Information Science, Southwest University, Chongqing 400715, China; School of Information Science and Engineering, Lanzhou University, Lanzhou 730000, China; School of Information Science and Engineering, Lanzhou University, Lanzhou 730000, China; School of Information Science and Engineering, Lanzhou University, Lanzhou 730000, China; Microblogging is increasingly becoming one of the most popular online social media for people to express ideas and emotions. The amount of socially generated content from this medium is enormous. Text mining techniques have been intensively applied to discover the hidden knowledge and emotions from this huge dataset. In this paper, we propose a modified version of hidden Markov model (HMM) classifier, called self-adaptive HMM, whose parameters are optimized by Particle Swarm Optimization algorithms. Since manually labeling large-scale dataset is difficult, we also employ the entropy to decide whether a new unlabeled tweet shall be contained in the training dataset after being assigned an emotion using our HMM-based approach. In the experiment, we collected about 200,000 Chinese tweets from Sina Weibo. The results show that the F-score of our approach gets 76% on happiness and fear and 65% on anger, surprise, and sadness. In addition, the self-adaptive HMM classifier outperforms Naive Bayes and Support Vector Machine on recognition of happiness, anger, and sadness.; http://dx.doi.org/10.1155/2015/987189 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Li Liu, Dashi Luo, Ming Liu, Jun Zhong, Ye Wei, Letian Sun |
author_sort |
Li Liu |
title |
A Self-Adaptive Hidden Markov Model for Emotion Classification in Chinese Microblogs |
title_sort |
self-adaptive hidden markov model for emotion classification in chinese microblogs |
publisher |
Hindawi Limited |
series |
Mathematical Problems in Engineering |
issn |
1024-123X 1563-5147 |
publishDate |
2015-01-01 |
description |
Microblogging is increasingly becoming one of the most popular online social media for people to express ideas and emotions. The amount of socially generated content from this medium is enormous. Text mining techniques have been intensively applied to discover the hidden knowledge and emotions from this huge dataset. In this paper, we propose a modified version of hidden Markov model (HMM) classifier, called self-adaptive HMM, whose parameters are optimized by Particle Swarm Optimization algorithms. Since manually labeling large-scale dataset is difficult, we also employ the entropy to decide whether a new unlabeled tweet shall be contained in the training dataset after being assigned an emotion using our HMM-based approach. In the experiment, we collected about 200,000 Chinese tweets from Sina Weibo. The results show that the F-score of our approach gets 76% on happiness and fear and 65% on anger, surprise, and sadness. In addition, the self-adaptive HMM classifier outperforms Naive Bayes and Support Vector Machine on recognition of happiness, anger, and sadness. |
url |
http://dx.doi.org/10.1155/2015/987189 |
_version_ |
1725751240361508864 |
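
The description field above mentions using entropy to decide whether a newly classified tweet is added to the training dataset. A minimal sketch of one such rule follows, assuming the classifier yields a probability distribution over the five emotions named in the abstract. The function names, the 1.0-bit threshold, and the example probabilities are hypothetical illustrations, not values or code from the paper.

```python
import math

# Emotion classes named in the abstract.
EMOTIONS = ["happiness", "fear", "anger", "surprise", "sadness"]


def shannon_entropy(probs):
    """Shannon entropy, in bits, of a discrete probability distribution."""
    return -sum(p * math.log2(p) for p in probs if p > 0.0)


def accept_for_training(probs, threshold=1.0):
    """Return True when the predicted distribution has low entropy.

    Assumption: low entropy is read as a confident prediction. The
    threshold of 1.0 bit is illustrative, not reported by the authors.
    """
    return shannon_entropy(probs) < threshold


# Hypothetical classifier outputs, one probability per entry in EMOTIONS.
confident = [0.90, 0.04, 0.03, 0.02, 0.01]
uncertain = [0.20, 0.20, 0.20, 0.20, 0.20]

for name, probs in [("confident", confident), ("uncertain", uncertain)]:
    decision = "add to training set" if accept_for_training(probs) else "discard"
    print(f"{name}: entropy={shannon_entropy(probs):.3f} bits -> {decision}")
```

Under this rule a tweet whose predicted distribution is peaked on one emotion is kept with its assigned label, while a tweet with a near-uniform distribution is left out, which limits how many uncertain labels enter the training data.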