A Comparison of Text Classification Methods k-NN, Naïve Bayes, and Support Vector Machine for News Classification

<p>In this era, a rapid thriving Internet occasionally complicates users to retrieve news category furthermore if there are plentiful of news to be categorized. News categorization is a technique can be used to retrieve a category of news which gives easiness for users. Internet has vast amoun...

Full description

Bibliographic Details
Main Authors: Fanny Fanny, Yohan Muliono, Fidelson Tanzil
Format: Article
Language:English
Published: Politeknik Harapan Bersama Tegal 2018-05-01
Series:Jurnal Informatika: Jurnal Pengembangan IT
Online Access:http://ejournal.poltektegal.ac.id/index.php/informatika/article/view/828
id doaj-7dbfe1cca6334eac8f55ca853d77cb3e
record_format Article
spelling doaj-7dbfe1cca6334eac8f55ca853d77cb3e2020-11-25T01:18:03ZengPoliteknik Harapan Bersama TegalJurnal Informatika: Jurnal Pengembangan IT2477-51262548-93562018-05-013215716010.30591/jpit.v3i2.828649A Comparison of Text Classification Methods k-NN, Naïve Bayes, and Support Vector Machine for News ClassificationFanny Fanny0Yohan Muliono1Fidelson Tanzil2Bina Nusantara UniversityBina Nusantara UniversityBina Nusantara University<p>In this era, a rapid thriving Internet occasionally complicates users to retrieve news category furthermore if there are plentiful of news to be categorized. News categorization is a technique can be used to retrieve a category of news which gives easiness for users. Internet has vast amounts of information especially at news. Therefore, accurate and speedy access is becoming ever more difficult. This paper compares a news categorization using <em>k</em>-Nearest Neighbor, Naive Bayes and Support Vector Machine. Using vary of variables and through a several steps of preprocessing which proving k-Nearest Neighbor is producing a capable accuracy competes with Support Vector Machine whereas Naive Bayes producing just an average result, not as good as <em>k</em>-Nearest Neighbor and Support Vector Machine yet as bad as <em>k</em>-Nearest Neighbor and Support Vector Machine ever reach. As the results, <em>k</em>-Nearest Neighbor using correlation measurement type produces the best result of this experiment.  <br /><em></em></p>http://ejournal.poltektegal.ac.id/index.php/informatika/article/view/828
collection DOAJ
language English
format Article
sources DOAJ
author Fanny Fanny
Yohan Muliono
Fidelson Tanzil
spellingShingle Fanny Fanny
Yohan Muliono
Fidelson Tanzil
A Comparison of Text Classification Methods k-NN, Naïve Bayes, and Support Vector Machine for News Classification
Jurnal Informatika: Jurnal Pengembangan IT
author_facet Fanny Fanny
Yohan Muliono
Fidelson Tanzil
author_sort Fanny Fanny
title A Comparison of Text Classification Methods k-NN, Naïve Bayes, and Support Vector Machine for News Classification
title_short A Comparison of Text Classification Methods k-NN, Naïve Bayes, and Support Vector Machine for News Classification
title_full A Comparison of Text Classification Methods k-NN, Naïve Bayes, and Support Vector Machine for News Classification
title_fullStr A Comparison of Text Classification Methods k-NN, Naïve Bayes, and Support Vector Machine for News Classification
title_full_unstemmed A Comparison of Text Classification Methods k-NN, Naïve Bayes, and Support Vector Machine for News Classification
title_sort comparison of text classification methods k-nn, naïve bayes, and support vector machine for news classification
publisher Politeknik Harapan Bersama Tegal
series Jurnal Informatika: Jurnal Pengembangan IT
issn 2477-5126
2548-9356
publishDate 2018-05-01
description <p>In this era, a rapid thriving Internet occasionally complicates users to retrieve news category furthermore if there are plentiful of news to be categorized. News categorization is a technique can be used to retrieve a category of news which gives easiness for users. Internet has vast amounts of information especially at news. Therefore, accurate and speedy access is becoming ever more difficult. This paper compares a news categorization using <em>k</em>-Nearest Neighbor, Naive Bayes and Support Vector Machine. Using vary of variables and through a several steps of preprocessing which proving k-Nearest Neighbor is producing a capable accuracy competes with Support Vector Machine whereas Naive Bayes producing just an average result, not as good as <em>k</em>-Nearest Neighbor and Support Vector Machine yet as bad as <em>k</em>-Nearest Neighbor and Support Vector Machine ever reach. As the results, <em>k</em>-Nearest Neighbor using correlation measurement type produces the best result of this experiment.  <br /><em></em></p>
url http://ejournal.poltektegal.ac.id/index.php/informatika/article/view/828
work_keys_str_mv AT fannyfanny acomparisonoftextclassificationmethodsknnnaivebayesandsupportvectormachinefornewsclassification
AT yohanmuliono acomparisonoftextclassificationmethodsknnnaivebayesandsupportvectormachinefornewsclassification
AT fidelsontanzil acomparisonoftextclassificationmethodsknnnaivebayesandsupportvectormachinefornewsclassification
AT fannyfanny comparisonoftextclassificationmethodsknnnaivebayesandsupportvectormachinefornewsclassification
AT yohanmuliono comparisonoftextclassificationmethodsknnnaivebayesandsupportvectormachinefornewsclassification
AT fidelsontanzil comparisonoftextclassificationmethodsknnnaivebayesandsupportvectormachinefornewsclassification
_version_ 1725144040279638016