Investigating the impact of pre-processing techniques and pre-trained word embeddings in detecting Arabic health information on social media

Abstract This paper presents a comprehensive evaluation of data pre-processing and word embedding techniques in the context of Arabic document classification in the domain of health-related communication on social media. We evaluate 26 text pre-processing techniques applied to Arabic tweets within the process...

Bibliographic Details
Main Authors: Yahya Albalawi, Jim Buckley, Nikola S. Nikolov
Format: Article
Language: English
Published: SpringerOpen 2021-07-01
Series: Journal of Big Data
Online Access: https://doi.org/10.1186/s40537-021-00488-w