Detecting Controversial Articles on Citizen Journalism

Someone's understanding and stance on a particular controversial topic can be influenced by daily news or articles he consume everyday. Unfortunately, readers usually do not realize that they are reading controversial articles. In this paper, we address the problem of automatically detecting co...

Full description

Bibliographic Details
Main Authors:	Alfan Farizki Wicaksono, Sharon Raissa Herdiyana, Mirna Adriani
Format:	Article
Language:	English
Published:	Universitas Indonesia 2018-02-01
Series:	Jurnal Ilmu Komputer dan Informasi
Subjects:	controversy detection, text classification, supervised learning
Online Access:	http://jiki.cs.ui.ac.id/index.php/jiki/article/view/478

id	doaj-c044820d5dca4dc7942dc95909677cb2
record_format	Article
spelling	doaj-c044820d5dca4dc7942dc95909677cb22020-11-25T00:55:05ZengUniversitas IndonesiaJurnal Ilmu Komputer dan Informasi2088-70512502-92742018-02-01111344110.21609/jiki.v11i1.478238Detecting Controversial Articles on Citizen JournalismAlfan Farizki Wicaksono0Sharon Raissa Herdiyana1Mirna Adriani2Information Retrieval Lab. Faculty of Computer Science Universitas IndonesiaInformation Retrieval Lab. Faculty of Computer Science Universitas IndonesiaInformation Retrieval Lab. Faculty of Computer Science Universitas IndonesiaSomeone's understanding and stance on a particular controversial topic can be influenced by daily news or articles he consume everyday. Unfortunately, readers usually do not realize that they are reading controversial articles. In this paper, we address the problem of automatically detecting controversial article from citizen journalism media. To solve the problem, we employ a supervised machine learning approach with several hand-crafted features that exploits linguistic information, meta-data of an article, structural information in the commentary section, and sentiment expressed inside the body of an article. The experimental results shows that our proposed method manages to perform the addressed task effectively. The best performance so far is achieved when we use all proposed feature with Logistic Regression as our model (82.89\% in terms of accuracy). Moreover, we found that information from commentary section (structural features) contributes most to the classification task.http://jiki.cs.ui.ac.id/index.php/jiki/article/view/478controversy detection, text classification, supervised learning
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Alfan Farizki Wicaksono Sharon Raissa Herdiyana Mirna Adriani
spellingShingle	Alfan Farizki Wicaksono Sharon Raissa Herdiyana Mirna Adriani Detecting Controversial Articles on Citizen Journalism Jurnal Ilmu Komputer dan Informasi controversy detection, text classification, supervised learning
author_facet	Alfan Farizki Wicaksono Sharon Raissa Herdiyana Mirna Adriani
author_sort	Alfan Farizki Wicaksono
title	Detecting Controversial Articles on Citizen Journalism
title_short	Detecting Controversial Articles on Citizen Journalism
title_full	Detecting Controversial Articles on Citizen Journalism
title_fullStr	Detecting Controversial Articles on Citizen Journalism
title_full_unstemmed	Detecting Controversial Articles on Citizen Journalism
title_sort	detecting controversial articles on citizen journalism
publisher	Universitas Indonesia
series	Jurnal Ilmu Komputer dan Informasi
issn	2088-7051 2502-9274
publishDate	2018-02-01
description	Someone's understanding and stance on a particular controversial topic can be influenced by daily news or articles he consume everyday. Unfortunately, readers usually do not realize that they are reading controversial articles. In this paper, we address the problem of automatically detecting controversial article from citizen journalism media. To solve the problem, we employ a supervised machine learning approach with several hand-crafted features that exploits linguistic information, meta-data of an article, structural information in the commentary section, and sentiment expressed inside the body of an article. The experimental results shows that our proposed method manages to perform the addressed task effectively. The best performance so far is achieved when we use all proposed feature with Logistic Regression as our model (82.89\% in terms of accuracy). Moreover, we found that information from commentary section (structural features) contributes most to the classification task.
topic	controversy detection, text classification, supervised learning
url	http://jiki.cs.ui.ac.id/index.php/jiki/article/view/478
work_keys_str_mv	AT alfanfarizkiwicaksono detectingcontroversialarticlesoncitizenjournalism AT sharonraissaherdiyana detectingcontroversialarticlesoncitizenjournalism AT mirnaadriani detectingcontroversialarticlesoncitizenjournalism
_version_	1725232169135112192

Detecting Controversial Articles on Citizen Journalism

Similar Items