Detecting Controversial Articles on Citizen Journalism
Someone's understanding and stance on a particular controversial topic can be influenced by daily news or articles he consume everyday. Unfortunately, readers usually do not realize that they are reading controversial articles. In this paper, we address the problem of automatically detecting co...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Universitas Indonesia
2018-02-01
|
Series: | Jurnal Ilmu Komputer dan Informasi |
Subjects: | |
Online Access: | http://jiki.cs.ui.ac.id/index.php/jiki/article/view/478 |
id |
doaj-c044820d5dca4dc7942dc95909677cb2 |
---|---|
record_format |
Article |
spelling |
doaj-c044820d5dca4dc7942dc95909677cb22020-11-25T00:55:05ZengUniversitas IndonesiaJurnal Ilmu Komputer dan Informasi2088-70512502-92742018-02-01111344110.21609/jiki.v11i1.478238Detecting Controversial Articles on Citizen JournalismAlfan Farizki Wicaksono0Sharon Raissa Herdiyana1Mirna Adriani2Information Retrieval Lab. Faculty of Computer Science Universitas IndonesiaInformation Retrieval Lab. Faculty of Computer Science Universitas IndonesiaInformation Retrieval Lab. Faculty of Computer Science Universitas IndonesiaSomeone's understanding and stance on a particular controversial topic can be influenced by daily news or articles he consume everyday. Unfortunately, readers usually do not realize that they are reading controversial articles. In this paper, we address the problem of automatically detecting controversial article from citizen journalism media. To solve the problem, we employ a supervised machine learning approach with several hand-crafted features that exploits linguistic information, meta-data of an article, structural information in the commentary section, and sentiment expressed inside the body of an article. The experimental results shows that our proposed method manages to perform the addressed task effectively. The best performance so far is achieved when we use all proposed feature with Logistic Regression as our model (82.89\% in terms of accuracy). Moreover, we found that information from commentary section (structural features) contributes most to the classification task.http://jiki.cs.ui.ac.id/index.php/jiki/article/view/478controversy detection, text classification, supervised learning |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Alfan Farizki Wicaksono Sharon Raissa Herdiyana Mirna Adriani |
spellingShingle |
Alfan Farizki Wicaksono Sharon Raissa Herdiyana Mirna Adriani Detecting Controversial Articles on Citizen Journalism Jurnal Ilmu Komputer dan Informasi controversy detection, text classification, supervised learning |
author_facet |
Alfan Farizki Wicaksono Sharon Raissa Herdiyana Mirna Adriani |
author_sort |
Alfan Farizki Wicaksono |
title |
Detecting Controversial Articles on Citizen Journalism |
title_short |
Detecting Controversial Articles on Citizen Journalism |
title_full |
Detecting Controversial Articles on Citizen Journalism |
title_fullStr |
Detecting Controversial Articles on Citizen Journalism |
title_full_unstemmed |
Detecting Controversial Articles on Citizen Journalism |
title_sort |
detecting controversial articles on citizen journalism |
publisher |
Universitas Indonesia |
series |
Jurnal Ilmu Komputer dan Informasi |
issn |
2088-7051 2502-9274 |
publishDate |
2018-02-01 |
description |
Someone's understanding and stance on a particular controversial topic can be influenced by daily news or articles he consume everyday. Unfortunately, readers usually do not realize that they are reading controversial articles. In this paper, we address the problem of automatically detecting controversial article from citizen journalism media. To solve the problem, we employ a supervised machine learning approach with several hand-crafted features that exploits linguistic information, meta-data of an article, structural information in the commentary section, and sentiment expressed inside the body of an article. The experimental results shows that our proposed method manages to perform the addressed task effectively. The best performance so far is achieved when we use all proposed feature with Logistic Regression as our model (82.89\% in terms of accuracy). Moreover, we found that information from commentary section (structural features) contributes most to the classification task. |
topic |
controversy detection, text classification, supervised learning |
url |
http://jiki.cs.ui.ac.id/index.php/jiki/article/view/478 |
work_keys_str_mv |
AT alfanfarizkiwicaksono detectingcontroversialarticlesoncitizenjournalism AT sharonraissaherdiyana detectingcontroversialarticlesoncitizenjournalism AT mirnaadriani detectingcontroversialarticlesoncitizenjournalism |
_version_ |
1725232169135112192 |