HILATSA: A hybrid Incremental learning approach for Arabic tweets sentiment analysis

A huge amount of data is generated since the evolution in technology and the tremendous growth of social networks. In spite of the availability of data, there is a lack of tools and resources for analysis. Though Arabic is a popular language, there are too few dialectal Arabic analysis tools. This i...

Full description

Bibliographic Details
Main Authors: Kariman Elshakankery, Mona F. Ahmed
Format: Article
Language:English
Published: Elsevier 2019-11-01
Series:Egyptian Informatics Journal
Online Access:http://www.sciencedirect.com/science/article/pii/S1110866518302123
id doaj-510e5062650944b6ae958e29b2e1d141
record_format Article
spelling doaj-510e5062650944b6ae958e29b2e1d1412021-07-02T07:08:39ZengElsevierEgyptian Informatics Journal1110-86652019-11-01203163171HILATSA: A hybrid Incremental learning approach for Arabic tweets sentiment analysisKariman Elshakankery0Mona F. Ahmed1Corresponding author.; Faculty of Engineering, Cairo University, EgyptFaculty of Engineering, Cairo University, EgyptA huge amount of data is generated since the evolution in technology and the tremendous growth of social networks. In spite of the availability of data, there is a lack of tools and resources for analysis. Though Arabic is a popular language, there are too few dialectal Arabic analysis tools. This is because of the many challenges in Arabic due to its morphological complexity and dynamic nature. Sentiment Analysis (SA) is used by different organizations for many reasons as developing product quality, adjusting market strategy and improving customer services. This paper introduces a semi- automatic learning system for sentiment analysis that is capable of updating the lexicon in order to be up to date with language changes. HILATSA is a hybrid approach which combines both lexicon based and machine learning approaches in order to identify the tweets sentiments polarities. The proposed approach has been tested using different datasets. It achieved an accuracy of 73.67% for 3-class classification problem and 83.73% for 2-class classification problem. The semi-automatic learning component proved to be effective as it improved the accuracy by 17.55%. Keywords: Sentiment analysis, Opinion mining, Hybrid approach, Arabic Tweets Sentiment Analysis, Sentiment classification, Self-learninghttp://www.sciencedirect.com/science/article/pii/S1110866518302123
collection DOAJ
language English
format Article
sources DOAJ
author Kariman Elshakankery
Mona F. Ahmed
spellingShingle Kariman Elshakankery
Mona F. Ahmed
HILATSA: A hybrid Incremental learning approach for Arabic tweets sentiment analysis
Egyptian Informatics Journal
author_facet Kariman Elshakankery
Mona F. Ahmed
author_sort Kariman Elshakankery
title HILATSA: A hybrid Incremental learning approach for Arabic tweets sentiment analysis
title_short HILATSA: A hybrid Incremental learning approach for Arabic tweets sentiment analysis
title_full HILATSA: A hybrid Incremental learning approach for Arabic tweets sentiment analysis
title_fullStr HILATSA: A hybrid Incremental learning approach for Arabic tweets sentiment analysis
title_full_unstemmed HILATSA: A hybrid Incremental learning approach for Arabic tweets sentiment analysis
title_sort hilatsa: a hybrid incremental learning approach for arabic tweets sentiment analysis
publisher Elsevier
series Egyptian Informatics Journal
issn 1110-8665
publishDate 2019-11-01
description A huge amount of data is generated since the evolution in technology and the tremendous growth of social networks. In spite of the availability of data, there is a lack of tools and resources for analysis. Though Arabic is a popular language, there are too few dialectal Arabic analysis tools. This is because of the many challenges in Arabic due to its morphological complexity and dynamic nature. Sentiment Analysis (SA) is used by different organizations for many reasons as developing product quality, adjusting market strategy and improving customer services. This paper introduces a semi- automatic learning system for sentiment analysis that is capable of updating the lexicon in order to be up to date with language changes. HILATSA is a hybrid approach which combines both lexicon based and machine learning approaches in order to identify the tweets sentiments polarities. The proposed approach has been tested using different datasets. It achieved an accuracy of 73.67% for 3-class classification problem and 83.73% for 2-class classification problem. The semi-automatic learning component proved to be effective as it improved the accuracy by 17.55%. Keywords: Sentiment analysis, Opinion mining, Hybrid approach, Arabic Tweets Sentiment Analysis, Sentiment classification, Self-learning
url http://www.sciencedirect.com/science/article/pii/S1110866518302123
work_keys_str_mv AT karimanelshakankery hilatsaahybridincrementallearningapproachforarabictweetssentimentanalysis
AT monafahmed hilatsaahybridincrementallearningapproachforarabictweetssentimentanalysis
_version_ 1721336362536271872