Towards generalisable hate speech detection: a review on obstacles and solutions
Hate speech is one type of harmful online content which directly attacks or promotes hate towards a group or an individual member based on their actual or perceived aspects of identity, such as ethnicity, religion, and sexual orientation. With online hate speech on the rise, its automatic detection...
Main Authors: | Wenjie Yin, Arkaitz Zubiaga |
---|---|
Format: | Article |
Language: | English |
Published: | PeerJ Inc. 2021-06-01 |
Series: | PeerJ Computer Science |
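Subjects: | Hate speech, Text classification, Abusive language, Social media, Literature review, Generalisation |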
Online Access: | https://peerj.com/articles/cs-598.pdf |
id |
doaj-f47377a99dd34e0d8afaef07caf6f2ed |
---|---|
record_format |
Article |
spelling |
doaj-f47377a99dd34e0d8afaef07caf6f2ed; 2021-06-19T15:05:19Z; eng; PeerJ Inc.; PeerJ Computer Science; 2376-5992; 2021-06-01; 7; e598; 10.7717/peerj-cs.598; Towards generalisable hate speech detection: a review on obstacles and solutions; Wenjie Yin (0); Arkaitz Zubiaga (1); (0) School of Electronic Engineering and Computer Science, Queen Mary University of London, London, United Kingdom; (1) School of Electronic Engineering and Computer Science, Queen Mary University of London, London, United Kingdom; Hate speech is one type of harmful online content which directly attacks or promotes hate towards a group or an individual member based on their actual or perceived aspects of identity, such as ethnicity, religion, and sexual orientation. With online hate speech on the rise, its automatic detection as a natural language processing task is gaining increasing interest. However, it is only recently that it has been shown that existing models generalise poorly to unseen data. This survey paper attempts to summarise how generalisable existing hate speech detection models are and the reasons why hate speech models struggle to generalise, sums up existing attempts at addressing the main obstacles, and then proposes directions of future research to improve generalisation in hate speech detection.; https://peerj.com/articles/cs-598.pdf; Hate speech; Text classification; Abusive language; Social media; Literature review; Generalisation |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Wenjie Yin Arkaitz Zubiaga |
author_sort |
Wenjie Yin |
title |
Towards generalisable hate speech detection: a review on obstacles and solutions |
publisher |
PeerJ Inc. |
series |
PeerJ Computer Science |
issn |
2376-5992 |
publishDate |
2021-06-01 |
description |
Hate speech is one type of harmful online content which directly attacks or promotes hate towards a group or an individual member based on their actual or perceived aspects of identity, such as ethnicity, religion, and sexual orientation. With online hate speech on the rise, its automatic detection as a natural language processing task is gaining increasing interest. However, it is only recently that it has been shown that existing models generalise poorly to unseen data. This survey paper attempts to summarise how generalisable existing hate speech detection models are and the reasons why hate speech models struggle to generalise, sums up existing attempts at addressing the main obstacles, and then proposes directions of future research to improve generalisation in hate speech detection. |
topic |
Hate speech; Text classification; Abusive language; Social media; Literature review; Generalisation |
url |
https://peerj.com/articles/cs-598.pdf |