Machine learning methods for toxic comment classification: a systematic review

Nowadays users leave numerous comments on different social networks, news portals, and forums. Some of the comments are toxic or abusive. Due to numbers of comments, it is unfeasible to manually moderate them, so most of the systems use some kind of automatic discovery of toxicity using machine lear...

Full description

Bibliographic Details
Main Author: Andročec Darko
Format: Article
Language:English
Published: Sciendo 2020-12-01
Series:Acta Universitatis Sapientiae: Informatica
Subjects:
Online Access:https://doi.org/10.2478/ausi-2020-0012
id doaj-33783a5f35474d1e9bbf6410e249cf9e
record_format Article
spelling doaj-33783a5f35474d1e9bbf6410e249cf9e2021-09-06T19:41:25ZengSciendoActa Universitatis Sapientiae: Informatica2066-77602020-12-0112220521610.2478/ausi-2020-0012Machine learning methods for toxic comment classification: a systematic reviewAndročec Darko0Faculty of Organization and Informatics, University of Zagreb, Pavlinska 2, 42000 Varaždin, CroatiaNowadays users leave numerous comments on different social networks, news portals, and forums. Some of the comments are toxic or abusive. Due to numbers of comments, it is unfeasible to manually moderate them, so most of the systems use some kind of automatic discovery of toxicity using machine learning models. In this work, we performed a systematic review of the state-of-the-art in toxic comment classification using machine learning methods. We extracted data from 31 selected primary relevant studies. First, we have investigated when and where the papers were published and their maturity level. In our analysis of every primary study we investigated: data set used, evaluation metric, used machine learning methods, classes of toxicity, and comment language. We finish our work with comprehensive list of gaps in current research and suggestions for future research themes related to online toxic comment classification problem.https://doi.org/10.2478/ausi-2020-0012machine learningtoxic commentdeep learningsystematic review68t50
collection DOAJ
language English
format Article
sources DOAJ
author Andročec Darko
spellingShingle Andročec Darko
Machine learning methods for toxic comment classification: a systematic review
Acta Universitatis Sapientiae: Informatica
machine learning
toxic comment
deep learning
systematic review
68t50
author_facet Andročec Darko
author_sort Andročec Darko
title Machine learning methods for toxic comment classification: a systematic review
title_short Machine learning methods for toxic comment classification: a systematic review
title_full Machine learning methods for toxic comment classification: a systematic review
title_fullStr Machine learning methods for toxic comment classification: a systematic review
title_full_unstemmed Machine learning methods for toxic comment classification: a systematic review
title_sort machine learning methods for toxic comment classification: a systematic review
publisher Sciendo
series Acta Universitatis Sapientiae: Informatica
issn 2066-7760
publishDate 2020-12-01
description Nowadays users leave numerous comments on different social networks, news portals, and forums. Some of the comments are toxic or abusive. Due to numbers of comments, it is unfeasible to manually moderate them, so most of the systems use some kind of automatic discovery of toxicity using machine learning models. In this work, we performed a systematic review of the state-of-the-art in toxic comment classification using machine learning methods. We extracted data from 31 selected primary relevant studies. First, we have investigated when and where the papers were published and their maturity level. In our analysis of every primary study we investigated: data set used, evaluation metric, used machine learning methods, classes of toxicity, and comment language. We finish our work with comprehensive list of gaps in current research and suggestions for future research themes related to online toxic comment classification problem.
topic machine learning
toxic comment
deep learning
systematic review
68t50
url https://doi.org/10.2478/ausi-2020-0012
work_keys_str_mv AT androcecdarko machinelearningmethodsfortoxiccommentclassificationasystematicreview
_version_ 1717766308924227584