HILATSA: A hybrid Incremental learning approach for Arabic tweets sentiment analysis
A huge amount of data is generated since the evolution in technology and the tremendous growth of social networks. In spite of the availability of data, there is a lack of tools and resources for analysis. Though Arabic is a popular language, there are too few dialectal Arabic analysis tools. This i...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2019-11-01
|
Series: | Egyptian Informatics Journal |
Online Access: | http://www.sciencedirect.com/science/article/pii/S1110866518302123 |
id |
doaj-510e5062650944b6ae958e29b2e1d141 |
---|---|
record_format |
Article |
spelling |
doaj-510e5062650944b6ae958e29b2e1d1412021-07-02T07:08:39ZengElsevierEgyptian Informatics Journal1110-86652019-11-01203163171HILATSA: A hybrid Incremental learning approach for Arabic tweets sentiment analysisKariman Elshakankery0Mona F. Ahmed1Corresponding author.; Faculty of Engineering, Cairo University, EgyptFaculty of Engineering, Cairo University, EgyptA huge amount of data is generated since the evolution in technology and the tremendous growth of social networks. In spite of the availability of data, there is a lack of tools and resources for analysis. Though Arabic is a popular language, there are too few dialectal Arabic analysis tools. This is because of the many challenges in Arabic due to its morphological complexity and dynamic nature. Sentiment Analysis (SA) is used by different organizations for many reasons as developing product quality, adjusting market strategy and improving customer services. This paper introduces a semi- automatic learning system for sentiment analysis that is capable of updating the lexicon in order to be up to date with language changes. HILATSA is a hybrid approach which combines both lexicon based and machine learning approaches in order to identify the tweets sentiments polarities. The proposed approach has been tested using different datasets. It achieved an accuracy of 73.67% for 3-class classification problem and 83.73% for 2-class classification problem. The semi-automatic learning component proved to be effective as it improved the accuracy by 17.55%. Keywords: Sentiment analysis, Opinion mining, Hybrid approach, Arabic Tweets Sentiment Analysis, Sentiment classification, Self-learninghttp://www.sciencedirect.com/science/article/pii/S1110866518302123 |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Kariman Elshakankery Mona F. Ahmed |
spellingShingle |
Kariman Elshakankery Mona F. Ahmed HILATSA: A hybrid Incremental learning approach for Arabic tweets sentiment analysis Egyptian Informatics Journal |
author_facet |
Kariman Elshakankery Mona F. Ahmed |
author_sort |
Kariman Elshakankery |
title |
HILATSA: A hybrid Incremental learning approach for Arabic tweets sentiment analysis |
title_short |
HILATSA: A hybrid Incremental learning approach for Arabic tweets sentiment analysis |
title_full |
HILATSA: A hybrid Incremental learning approach for Arabic tweets sentiment analysis |
title_fullStr |
HILATSA: A hybrid Incremental learning approach for Arabic tweets sentiment analysis |
title_full_unstemmed |
HILATSA: A hybrid Incremental learning approach for Arabic tweets sentiment analysis |
title_sort |
hilatsa: a hybrid incremental learning approach for arabic tweets sentiment analysis |
publisher |
Elsevier |
series |
Egyptian Informatics Journal |
issn |
1110-8665 |
publishDate |
2019-11-01 |
description |
A huge amount of data is generated since the evolution in technology and the tremendous growth of social networks. In spite of the availability of data, there is a lack of tools and resources for analysis. Though Arabic is a popular language, there are too few dialectal Arabic analysis tools. This is because of the many challenges in Arabic due to its morphological complexity and dynamic nature. Sentiment Analysis (SA) is used by different organizations for many reasons as developing product quality, adjusting market strategy and improving customer services. This paper introduces a semi- automatic learning system for sentiment analysis that is capable of updating the lexicon in order to be up to date with language changes. HILATSA is a hybrid approach which combines both lexicon based and machine learning approaches in order to identify the tweets sentiments polarities. The proposed approach has been tested using different datasets. It achieved an accuracy of 73.67% for 3-class classification problem and 83.73% for 2-class classification problem. The semi-automatic learning component proved to be effective as it improved the accuracy by 17.55%. Keywords: Sentiment analysis, Opinion mining, Hybrid approach, Arabic Tweets Sentiment Analysis, Sentiment classification, Self-learning |
url |
http://www.sciencedirect.com/science/article/pii/S1110866518302123 |
work_keys_str_mv |
AT karimanelshakankery hilatsaahybridincrementallearningapproachforarabictweetssentimentanalysis AT monafahmed hilatsaahybridincrementallearningapproachforarabictweetssentimentanalysis |
_version_ |
1721336362536271872 |