Automated Amharic News Categorization Using Deep Learning Models

For decades, machine learning techniques have been used to process Amharic texts. The potential application of deep learning on Amharic document classification has not been exploited due to a lack of language resources. In this paper, we present a deep learning model for Amharic news document classi...

Full description

Bibliographic Details
Main Authors: Demeke Endalie, Getamesay Haile
Format: Article
Language:English
Published: Hindawi Limited 2021-01-01
Series:Computational Intelligence and Neuroscience
Online Access:http://dx.doi.org/10.1155/2021/3774607
id doaj-258d52f6832c4937bc92f3b956fd7416
record_format Article
spelling doaj-258d52f6832c4937bc92f3b956fd74162021-08-09T00:01:39ZengHindawi LimitedComputational Intelligence and Neuroscience1687-52732021-01-01202110.1155/2021/3774607Automated Amharic News Categorization Using Deep Learning ModelsDemeke Endalie0Getamesay Haile1Faculty of Computing and InformaticsFaculty of Computing and InformaticsFor decades, machine learning techniques have been used to process Amharic texts. The potential application of deep learning on Amharic document classification has not been exploited due to a lack of language resources. In this paper, we present a deep learning model for Amharic news document classification. The proposed model uses fastText to generate text vectors to represent semantic meaning of texts and solve the problem of traditional methods. The text vectors matrix is then fed into the embedding layer of a convolutional neural network (CNN), which automatically extracts features. We conduct experiments on a data set with six news categories, and our approach produced a classification accuracy of 93.79%. We compared our method to well-known machine learning algorithms such as support vector machine (SVM), multilayer perceptron (MLP), decision tree (DT), XGBoost (XGB), and random forest (RF) and achieved good results.http://dx.doi.org/10.1155/2021/3774607
collection DOAJ
language English
format Article
sources DOAJ
author Demeke Endalie
Getamesay Haile
spellingShingle Demeke Endalie
Getamesay Haile
Automated Amharic News Categorization Using Deep Learning Models
Computational Intelligence and Neuroscience
author_facet Demeke Endalie
Getamesay Haile
author_sort Demeke Endalie
title Automated Amharic News Categorization Using Deep Learning Models
title_short Automated Amharic News Categorization Using Deep Learning Models
title_full Automated Amharic News Categorization Using Deep Learning Models
title_fullStr Automated Amharic News Categorization Using Deep Learning Models
title_full_unstemmed Automated Amharic News Categorization Using Deep Learning Models
title_sort automated amharic news categorization using deep learning models
publisher Hindawi Limited
series Computational Intelligence and Neuroscience
issn 1687-5273
publishDate 2021-01-01
description For decades, machine learning techniques have been used to process Amharic texts. The potential application of deep learning on Amharic document classification has not been exploited due to a lack of language resources. In this paper, we present a deep learning model for Amharic news document classification. The proposed model uses fastText to generate text vectors to represent semantic meaning of texts and solve the problem of traditional methods. The text vectors matrix is then fed into the embedding layer of a convolutional neural network (CNN), which automatically extracts features. We conduct experiments on a data set with six news categories, and our approach produced a classification accuracy of 93.79%. We compared our method to well-known machine learning algorithms such as support vector machine (SVM), multilayer perceptron (MLP), decision tree (DT), XGBoost (XGB), and random forest (RF) and achieved good results.
url http://dx.doi.org/10.1155/2021/3774607
work_keys_str_mv AT demekeendalie automatedamharicnewscategorizationusingdeeplearningmodels
AT getamesayhaile automatedamharicnewscategorizationusingdeeplearningmodels
_version_ 1721215284888469504