Towards generalisable hate speech detection: a review on obstacles and solutions

Hate speech is one type of harmful online content which directly attacks or promotes hate towards a group or an individual member based on their actual or perceived aspects of identity, such as ethnicity, religion, and sexual orientation. With online hate speech on the rise, its automatic detection...

Full description

Bibliographic Details
Main Authors: Wenjie Yin, Arkaitz Zubiaga
Format: Article
Language:English
Published: PeerJ Inc. 2021-06-01
Series:PeerJ Computer Science
Subjects:
Online Access:https://peerj.com/articles/cs-598.pdf
id doaj-f47377a99dd34e0d8afaef07caf6f2ed
record_format Article
spelling doaj-f47377a99dd34e0d8afaef07caf6f2ed2021-06-19T15:05:19ZengPeerJ Inc.PeerJ Computer Science2376-59922021-06-017e59810.7717/peerj-cs.598Towards generalisable hate speech detection: a review on obstacles and solutionsWenjie Yin0Arkaitz Zubiaga1School of Electronic Engineering and Computer Science, Queen Mary University of London, London, United KingdomSchool of Electronic Engineering and Computer Science, Queen Mary University of London, London, United KingdomHate speech is one type of harmful online content which directly attacks or promotes hate towards a group or an individual member based on their actual or perceived aspects of identity, such as ethnicity, religion, and sexual orientation. With online hate speech on the rise, its automatic detection as a natural language processing task is gaining increasing interest. However, it is only recently that it has been shown that existing models generalise poorly to unseen data. This survey paper attempts to summarise how generalisable existing hate speech detection models are and the reasons why hate speech models struggle to generalise, sums up existing attempts at addressing the main obstacles, and then proposes directions of future research to improve generalisation in hate speech detection.https://peerj.com/articles/cs-598.pdfHate speechText classificationAbusive languageSocial mediaLiterature reviewGeneralisation
collection DOAJ
language English
format Article
sources DOAJ
author Wenjie Yin
Arkaitz Zubiaga
spellingShingle Wenjie Yin
Arkaitz Zubiaga
Towards generalisable hate speech detection: a review on obstacles and solutions
PeerJ Computer Science
Hate speech
Text classification
Abusive language
Social media
Literature review
Generalisation
author_facet Wenjie Yin
Arkaitz Zubiaga
author_sort Wenjie Yin
title Towards generalisable hate speech detection: a review on obstacles and solutions
title_short Towards generalisable hate speech detection: a review on obstacles and solutions
title_full Towards generalisable hate speech detection: a review on obstacles and solutions
title_fullStr Towards generalisable hate speech detection: a review on obstacles and solutions
title_full_unstemmed Towards generalisable hate speech detection: a review on obstacles and solutions
title_sort towards generalisable hate speech detection: a review on obstacles and solutions
publisher PeerJ Inc.
series PeerJ Computer Science
issn 2376-5992
publishDate 2021-06-01
description Hate speech is one type of harmful online content which directly attacks or promotes hate towards a group or an individual member based on their actual or perceived aspects of identity, such as ethnicity, religion, and sexual orientation. With online hate speech on the rise, its automatic detection as a natural language processing task is gaining increasing interest. However, it is only recently that it has been shown that existing models generalise poorly to unseen data. This survey paper attempts to summarise how generalisable existing hate speech detection models are and the reasons why hate speech models struggle to generalise, sums up existing attempts at addressing the main obstacles, and then proposes directions of future research to improve generalisation in hate speech detection.
topic Hate speech
Text classification
Abusive language
Social media
Literature review
Generalisation
url https://peerj.com/articles/cs-598.pdf
work_keys_str_mv AT wenjieyin towardsgeneralisablehatespeechdetectionareviewonobstaclesandsolutions
AT arkaitzzubiaga towardsgeneralisablehatespeechdetectionareviewonobstaclesandsolutions
_version_ 1721370975528812544