A Self-Adaptive Hidden Markov Model for Emotion Classification in Chinese Microblogs

Microblogging is increasingly becoming one of the most popular online social media for people to express ideas and emotions. The amount of socially generated content from this medium is enormous. Text mining techniques have been intensively applied to discover the hidden knowledge and emotions from...

Full description

Bibliographic Details
Main Authors: Li Liu, Dashi Luo, Ming Liu, Jun Zhong, Ye Wei, Letian Sun
Format: Article
Language:English
Published: Hindawi Limited 2015-01-01
Series:Mathematical Problems in Engineering
Online Access:http://dx.doi.org/10.1155/2015/987189
id doaj-691ae2bcdad34739994f320aee40e148
record_format Article
spelling doaj-691ae2bcdad34739994f320aee40e1482020-11-24T22:26:54ZengHindawi LimitedMathematical Problems in Engineering1024-123X1563-51472015-01-01201510.1155/2015/987189987189A Self-Adaptive Hidden Markov Model for Emotion Classification in Chinese MicroblogsLi Liu0Dashi Luo1Ming Liu2Jun Zhong3Ye Wei4Letian Sun5School of Software Engineering, Chongqing University, Chongqing 401331, ChinaSchool of Information Science and Engineering, Lanzhou University, Lanzhou 730000, ChinaFaculty of Computer and Information Science, Southwest University, Chongqing 400715, ChinaSchool of Information Science and Engineering, Lanzhou University, Lanzhou 730000, ChinaSchool of Information Science and Engineering, Lanzhou University, Lanzhou 730000, ChinaSchool of Information Science and Engineering, Lanzhou University, Lanzhou 730000, ChinaMicroblogging is increasingly becoming one of the most popular online social media for people to express ideas and emotions. The amount of socially generated content from this medium is enormous. Text mining techniques have been intensively applied to discover the hidden knowledge and emotions from this huge dataset. In this paper, we propose a modified version of hidden Markov model (HMM) classifier, called self-adaptive HMM, whose parameters are optimized by Particle Swarm Optimization algorithms. Since manually labeling large-scale dataset is difficult, we also employ the entropy to decide whether a new unlabeled tweet shall be contained in the training dataset after being assigned an emotion using our HMM-based approach. In the experiment, we collected about 200,000 Chinese tweets from Sina Weibo. The results show that the F-score of our approach gets 76% on happiness and fear and 65% on anger, surprise, and sadness. In addition, the self-adaptive HMM classifier outperforms Naive Bayes and Support Vector Machine on recognition of happiness, anger, and sadness.http://dx.doi.org/10.1155/2015/987189
collection DOAJ
language English
format Article
sources DOAJ
author Li Liu
Dashi Luo
Ming Liu
Jun Zhong
Ye Wei
Letian Sun
spellingShingle Li Liu
Dashi Luo
Ming Liu
Jun Zhong
Ye Wei
Letian Sun
A Self-Adaptive Hidden Markov Model for Emotion Classification in Chinese Microblogs
Mathematical Problems in Engineering
author_facet Li Liu
Dashi Luo
Ming Liu
Jun Zhong
Ye Wei
Letian Sun
author_sort Li Liu
title A Self-Adaptive Hidden Markov Model for Emotion Classification in Chinese Microblogs
title_short A Self-Adaptive Hidden Markov Model for Emotion Classification in Chinese Microblogs
title_full A Self-Adaptive Hidden Markov Model for Emotion Classification in Chinese Microblogs
title_fullStr A Self-Adaptive Hidden Markov Model for Emotion Classification in Chinese Microblogs
title_full_unstemmed A Self-Adaptive Hidden Markov Model for Emotion Classification in Chinese Microblogs
title_sort self-adaptive hidden markov model for emotion classification in chinese microblogs
publisher Hindawi Limited
series Mathematical Problems in Engineering
issn 1024-123X
1563-5147
publishDate 2015-01-01
description Microblogging is increasingly becoming one of the most popular online social media for people to express ideas and emotions. The amount of socially generated content from this medium is enormous. Text mining techniques have been intensively applied to discover the hidden knowledge and emotions from this huge dataset. In this paper, we propose a modified version of hidden Markov model (HMM) classifier, called self-adaptive HMM, whose parameters are optimized by Particle Swarm Optimization algorithms. Since manually labeling large-scale dataset is difficult, we also employ the entropy to decide whether a new unlabeled tweet shall be contained in the training dataset after being assigned an emotion using our HMM-based approach. In the experiment, we collected about 200,000 Chinese tweets from Sina Weibo. The results show that the F-score of our approach gets 76% on happiness and fear and 65% on anger, surprise, and sadness. In addition, the self-adaptive HMM classifier outperforms Naive Bayes and Support Vector Machine on recognition of happiness, anger, and sadness.
url http://dx.doi.org/10.1155/2015/987189
work_keys_str_mv AT liliu aselfadaptivehiddenmarkovmodelforemotionclassificationinchinesemicroblogs
AT dashiluo aselfadaptivehiddenmarkovmodelforemotionclassificationinchinesemicroblogs
AT mingliu aselfadaptivehiddenmarkovmodelforemotionclassificationinchinesemicroblogs
AT junzhong aselfadaptivehiddenmarkovmodelforemotionclassificationinchinesemicroblogs
AT yewei aselfadaptivehiddenmarkovmodelforemotionclassificationinchinesemicroblogs
AT letiansun aselfadaptivehiddenmarkovmodelforemotionclassificationinchinesemicroblogs
AT liliu selfadaptivehiddenmarkovmodelforemotionclassificationinchinesemicroblogs
AT dashiluo selfadaptivehiddenmarkovmodelforemotionclassificationinchinesemicroblogs
AT mingliu selfadaptivehiddenmarkovmodelforemotionclassificationinchinesemicroblogs
AT junzhong selfadaptivehiddenmarkovmodelforemotionclassificationinchinesemicroblogs
AT yewei selfadaptivehiddenmarkovmodelforemotionclassificationinchinesemicroblogs
AT letiansun selfadaptivehiddenmarkovmodelforemotionclassificationinchinesemicroblogs
_version_ 1725751240361508864