Naive Bayes classifier optimization for text classification on e-government using particle swarm optimization

One of the public e-government services is a web-based online complaints portal. Text of complaint needs to be classified so that it can be forwarded to the responsible office quickly and accurately. The standard classification approach commonly used is the Naive Bayes Classifier (NBC) and k-Nearest...

Full description

Bibliographic Details
Main Authors: Kuncahyo Setyo Nugroho, Istiadi Istiadi, Fitri Marisa
Format: Article
Language:English
Published: Diponegoro University 2020-01-01
Series:Jurnal Teknologi dan Sistem Komputer
Subjects:
Online Access:https://jtsiskom.undip.ac.id/index.php/jtsiskom/article/view/13362
id doaj-8126fa2317264ea78724fd88656abb07
record_format Article
spelling doaj-8126fa2317264ea78724fd88656abb072021-10-02T15:10:49ZengDiponegoro UniversityJurnal Teknologi dan Sistem Komputer2338-04032020-01-0181212610.14710/jtsiskom.8.1.2020.21-2612799Naive Bayes classifier optimization for text classification on e-government using particle swarm optimizationKuncahyo Setyo Nugroho0Istiadi Istiadi1Fitri Marisa2https://orcid.org/0000-0002-5152-3544Program Studi Teknik Informatika, Universitas Widyagama Malang, IndonesiaProgram Studi Teknik Informatika, Universitas Widyagama Malang, IndonesiaProgram Studi Teknik Informatika, Universitas Widyagama Malang, IndonesiaOne of the public e-government services is a web-based online complaints portal. Text of complaint needs to be classified so that it can be forwarded to the responsible office quickly and accurately. The standard classification approach commonly used is the Naive Bayes Classifier (NBC) and k-Nearest Neighbor (k-NN), which still classifies one label and needs to be optimized. This research aims to classify the complaint text of more than one label at the same time with NBC, which is optimized using Particle Swarm Optimization (PSO). The data source comes from the Sambat Online portal and is divided into 70 % as training data and 30 % as testing data to be classified into seven labels. NBC and k-NN algorithms are used as a comparison method to find out the performance of PSO optimization. The 10-fold cross-validation shows that NBC optimization using PSO achieves an accuracy of 87.44 % better than k-NN of 75 % and NBC of 64.38 %. The optimization model can be used to increase the effectiveness of services to e-government in society.https://jtsiskom.undip.ac.id/index.php/jtsiskom/article/view/13362layanan publik onlineweb miningoptimasi klasifikasi teks pengaduan
collection DOAJ
language English
format Article
sources DOAJ
author Kuncahyo Setyo Nugroho
Istiadi Istiadi
Fitri Marisa
spellingShingle Kuncahyo Setyo Nugroho
Istiadi Istiadi
Fitri Marisa
Naive Bayes classifier optimization for text classification on e-government using particle swarm optimization
Jurnal Teknologi dan Sistem Komputer
layanan publik online
web mining
optimasi klasifikasi teks pengaduan
author_facet Kuncahyo Setyo Nugroho
Istiadi Istiadi
Fitri Marisa
author_sort Kuncahyo Setyo Nugroho
title Naive Bayes classifier optimization for text classification on e-government using particle swarm optimization
title_short Naive Bayes classifier optimization for text classification on e-government using particle swarm optimization
title_full Naive Bayes classifier optimization for text classification on e-government using particle swarm optimization
title_fullStr Naive Bayes classifier optimization for text classification on e-government using particle swarm optimization
title_full_unstemmed Naive Bayes classifier optimization for text classification on e-government using particle swarm optimization
title_sort naive bayes classifier optimization for text classification on e-government using particle swarm optimization
publisher Diponegoro University
series Jurnal Teknologi dan Sistem Komputer
issn 2338-0403
publishDate 2020-01-01
description One of the public e-government services is a web-based online complaints portal. Text of complaint needs to be classified so that it can be forwarded to the responsible office quickly and accurately. The standard classification approach commonly used is the Naive Bayes Classifier (NBC) and k-Nearest Neighbor (k-NN), which still classifies one label and needs to be optimized. This research aims to classify the complaint text of more than one label at the same time with NBC, which is optimized using Particle Swarm Optimization (PSO). The data source comes from the Sambat Online portal and is divided into 70 % as training data and 30 % as testing data to be classified into seven labels. NBC and k-NN algorithms are used as a comparison method to find out the performance of PSO optimization. The 10-fold cross-validation shows that NBC optimization using PSO achieves an accuracy of 87.44 % better than k-NN of 75 % and NBC of 64.38 %. The optimization model can be used to increase the effectiveness of services to e-government in society.
topic layanan publik online
web mining
optimasi klasifikasi teks pengaduan
url https://jtsiskom.undip.ac.id/index.php/jtsiskom/article/view/13362
work_keys_str_mv AT kuncahyosetyonugroho naivebayesclassifieroptimizationfortextclassificationonegovernmentusingparticleswarmoptimization
AT istiadiistiadi naivebayesclassifieroptimizationfortextclassificationonegovernmentusingparticleswarmoptimization
AT fitrimarisa naivebayesclassifieroptimizationfortextclassificationonegovernmentusingparticleswarmoptimization
_version_ 1716854350965374976