ArCAR: A Novel Deep Learning Computer-Aided Recognition for Character-Level Arabic Text Representation and Recognition

Arabic text classification is a process to simultaneously categorize the different contextual Arabic contents into a proper category. In this paper, a novel deep learning Arabic text computer-aided recognition (ArCAR) is proposed to represent and recognize Arabic text at the character level. The inp...

Full description

Bibliographic Details
Main Authors: Abdullah Y. Muaad, Hanumanthappa Jayappa, Mugahed A. Al-antari, Sungyoung Lee
Format: Article
Language:English
Published: MDPI AG 2021-07-01
Series:Algorithms
Subjects:
Online Access:https://www.mdpi.com/1999-4893/14/7/216
id doaj-6320a4c7ef494555940410baaed12bce
record_format Article
spelling doaj-6320a4c7ef494555940410baaed12bce2021-07-23T13:26:52ZengMDPI AGAlgorithms1999-48932021-07-011421621610.3390/a14070216ArCAR: A Novel Deep Learning Computer-Aided Recognition for Character-Level Arabic Text Representation and RecognitionAbdullah Y. Muaad0Hanumanthappa Jayappa1Mugahed A. Al-antari2Sungyoung Lee3Department of Studies in Computer Science, Mysore University, Manasagangothri, Mysore 570006, IndiaDepartment of Studies in Computer Science, Mysore University, Manasagangothri, Mysore 570006, IndiaSana’a Community College, Sana’a 5695, YemenDepartment of Computer Science and Engineering, College of Software, Kyung Hee University, Suwon-si 17104, Gyeonggi-do, KoreaArabic text classification is a process to simultaneously categorize the different contextual Arabic contents into a proper category. In this paper, a novel deep learning Arabic text computer-aided recognition (ArCAR) is proposed to represent and recognize Arabic text at the character level. The input Arabic text is quantized in the form of 1D vectors for each Arabic character to represent a 2D array for the ArCAR system. The ArCAR system is validated over 5-fold cross-validation tests for two applications: Arabic text document classification and Arabic sentiment analysis. For document classification, the ArCAR system achieves the best performance using the Alarabiya-balance dataset in terms of overall accuracy, recall, precision, and F1-score by 97.76%, 94.08%, 94.16%, and 94.09%, respectively. Meanwhile, the ArCAR performs well for Arabic sentiment analysis, achieving the best performance using the hotel Arabic reviews dataset (HARD) balance dataset in terms of overall accuracy and F1-score by 93.58% and 93.23%, respectively. The proposed ArCAR seems to provide a practical solution for accurate Arabic text representation, understanding, and classification.https://www.mdpi.com/1999-4893/14/7/216natural language processing (NLP)deep convolutional neural networkArabic text recognitionArabic sentiment analysisArabic text computer-aided recognition (ArCAR)
collection DOAJ
language English
format Article
sources DOAJ
author Abdullah Y. Muaad
Hanumanthappa Jayappa
Mugahed A. Al-antari
Sungyoung Lee
spellingShingle Abdullah Y. Muaad
Hanumanthappa Jayappa
Mugahed A. Al-antari
Sungyoung Lee
ArCAR: A Novel Deep Learning Computer-Aided Recognition for Character-Level Arabic Text Representation and Recognition
Algorithms
natural language processing (NLP)
deep convolutional neural network
Arabic text recognition
Arabic sentiment analysis
Arabic text computer-aided recognition (ArCAR)
author_facet Abdullah Y. Muaad
Hanumanthappa Jayappa
Mugahed A. Al-antari
Sungyoung Lee
author_sort Abdullah Y. Muaad
title ArCAR: A Novel Deep Learning Computer-Aided Recognition for Character-Level Arabic Text Representation and Recognition
title_short ArCAR: A Novel Deep Learning Computer-Aided Recognition for Character-Level Arabic Text Representation and Recognition
title_full ArCAR: A Novel Deep Learning Computer-Aided Recognition for Character-Level Arabic Text Representation and Recognition
title_fullStr ArCAR: A Novel Deep Learning Computer-Aided Recognition for Character-Level Arabic Text Representation and Recognition
title_full_unstemmed ArCAR: A Novel Deep Learning Computer-Aided Recognition for Character-Level Arabic Text Representation and Recognition
title_sort arcar: a novel deep learning computer-aided recognition for character-level arabic text representation and recognition
publisher MDPI AG
series Algorithms
issn 1999-4893
publishDate 2021-07-01
description Arabic text classification is a process to simultaneously categorize the different contextual Arabic contents into a proper category. In this paper, a novel deep learning Arabic text computer-aided recognition (ArCAR) is proposed to represent and recognize Arabic text at the character level. The input Arabic text is quantized in the form of 1D vectors for each Arabic character to represent a 2D array for the ArCAR system. The ArCAR system is validated over 5-fold cross-validation tests for two applications: Arabic text document classification and Arabic sentiment analysis. For document classification, the ArCAR system achieves the best performance using the Alarabiya-balance dataset in terms of overall accuracy, recall, precision, and F1-score by 97.76%, 94.08%, 94.16%, and 94.09%, respectively. Meanwhile, the ArCAR performs well for Arabic sentiment analysis, achieving the best performance using the hotel Arabic reviews dataset (HARD) balance dataset in terms of overall accuracy and F1-score by 93.58% and 93.23%, respectively. The proposed ArCAR seems to provide a practical solution for accurate Arabic text representation, understanding, and classification.
topic natural language processing (NLP)
deep convolutional neural network
Arabic text recognition
Arabic sentiment analysis
Arabic text computer-aided recognition (ArCAR)
url https://www.mdpi.com/1999-4893/14/7/216
work_keys_str_mv AT abdullahymuaad arcaranoveldeeplearningcomputeraidedrecognitionforcharacterlevelarabictextrepresentationandrecognition
AT hanumanthappajayappa arcaranoveldeeplearningcomputeraidedrecognitionforcharacterlevelarabictextrepresentationandrecognition
AT mugahedaalantari arcaranoveldeeplearningcomputeraidedrecognitionforcharacterlevelarabictextrepresentationandrecognition
AT sungyounglee arcaranoveldeeplearningcomputeraidedrecognitionforcharacterlevelarabictextrepresentationandrecognition
_version_ 1721289903265808384