Detecting Controversial Articles on Citizen Journalism

Someone's understanding and stance on a particular controversial topic can be influenced by daily news or articles he consume everyday. Unfortunately, readers usually do not realize that they are reading controversial articles. In this paper, we address the problem of automatically detecting co...

Full description

Bibliographic Details
Main Authors: Alfan Farizki Wicaksono, Sharon Raissa Herdiyana, Mirna Adriani
Format: Article
Language:English
Published: Universitas Indonesia 2018-02-01
Series:Jurnal Ilmu Komputer dan Informasi
Subjects:
Online Access:http://jiki.cs.ui.ac.id/index.php/jiki/article/view/478
id doaj-c044820d5dca4dc7942dc95909677cb2
record_format Article
spelling doaj-c044820d5dca4dc7942dc95909677cb22020-11-25T00:55:05ZengUniversitas IndonesiaJurnal Ilmu Komputer dan Informasi2088-70512502-92742018-02-01111344110.21609/jiki.v11i1.478238Detecting Controversial Articles on Citizen JournalismAlfan Farizki Wicaksono0Sharon Raissa Herdiyana1Mirna Adriani2Information Retrieval Lab. Faculty of Computer Science Universitas IndonesiaInformation Retrieval Lab. Faculty of Computer Science Universitas IndonesiaInformation Retrieval Lab. Faculty of Computer Science Universitas IndonesiaSomeone's understanding and stance on a particular controversial topic can be influenced by daily news or articles he consume everyday. Unfortunately, readers usually do not realize that they are reading controversial articles. In this paper, we address the problem of automatically detecting controversial article from citizen journalism media. To solve the problem, we employ a supervised machine learning approach with several hand-crafted features that exploits linguistic information, meta-data of an article, structural information in the commentary section, and sentiment expressed inside the body of an article. The experimental results shows that our proposed method manages to perform the addressed task effectively. The best performance so far is achieved when we use all proposed feature with Logistic Regression as our model (82.89\% in terms of accuracy). Moreover, we found that information from commentary section (structural features) contributes most to the classification task.http://jiki.cs.ui.ac.id/index.php/jiki/article/view/478controversy detection, text classification, supervised learning
collection DOAJ
language English
format Article
sources DOAJ
author Alfan Farizki Wicaksono
Sharon Raissa Herdiyana
Mirna Adriani
spellingShingle Alfan Farizki Wicaksono
Sharon Raissa Herdiyana
Mirna Adriani
Detecting Controversial Articles on Citizen Journalism
Jurnal Ilmu Komputer dan Informasi
controversy detection, text classification, supervised learning
author_facet Alfan Farizki Wicaksono
Sharon Raissa Herdiyana
Mirna Adriani
author_sort Alfan Farizki Wicaksono
title Detecting Controversial Articles on Citizen Journalism
title_short Detecting Controversial Articles on Citizen Journalism
title_full Detecting Controversial Articles on Citizen Journalism
title_fullStr Detecting Controversial Articles on Citizen Journalism
title_full_unstemmed Detecting Controversial Articles on Citizen Journalism
title_sort detecting controversial articles on citizen journalism
publisher Universitas Indonesia
series Jurnal Ilmu Komputer dan Informasi
issn 2088-7051
2502-9274
publishDate 2018-02-01
description Someone's understanding and stance on a particular controversial topic can be influenced by daily news or articles he consume everyday. Unfortunately, readers usually do not realize that they are reading controversial articles. In this paper, we address the problem of automatically detecting controversial article from citizen journalism media. To solve the problem, we employ a supervised machine learning approach with several hand-crafted features that exploits linguistic information, meta-data of an article, structural information in the commentary section, and sentiment expressed inside the body of an article. The experimental results shows that our proposed method manages to perform the addressed task effectively. The best performance so far is achieved when we use all proposed feature with Logistic Regression as our model (82.89\% in terms of accuracy). Moreover, we found that information from commentary section (structural features) contributes most to the classification task.
topic controversy detection, text classification, supervised learning
url http://jiki.cs.ui.ac.id/index.php/jiki/article/view/478
work_keys_str_mv AT alfanfarizkiwicaksono detectingcontroversialarticlesoncitizenjournalism
AT sharonraissaherdiyana detectingcontroversialarticlesoncitizenjournalism
AT mirnaadriani detectingcontroversialarticlesoncitizenjournalism
_version_ 1725232169135112192