Bilingual Cyber-aggression Detection on Social Media using LSTM Autoencoder

Yes === Cyber-aggression is an offensive behaviour attacking people based on race, ethnicity, religion, gender, sexual orientation, and other traits. It has become a major issue plaguing the online social media. In this research, we have developed a deep learning-based model to identify different...

Full description

Bibliographic Details
Main Authors: Kumari, K., Singh, J.P., Dwivedi, Y.K., Rana, Nripendra P.
Language:en
Published: 2021
Subjects:
Online Access:http://hdl.handle.net/10454/18439
Description
Summary:Yes === Cyber-aggression is an offensive behaviour attacking people based on race, ethnicity, religion, gender, sexual orientation, and other traits. It has become a major issue plaguing the online social media. In this research, we have developed a deep learning-based model to identify different levels of aggression (direct, indirect and no aggression) in a social media post in a bilingual scenario. The model is an autoencoder built using the LSTM network and trained with non-aggressive comments only. Any aggressive comment (direct or indirect) will be regarded as an anomaly to the system and will be marked as Overtly (direct) or Covertly (indirect) aggressive comment depending on the reconstruction loss by the autoencoder. The validation results on the dataset from two popular social media sites: Facebook and Twitter with bilingual (English and Hindi) data outperformed the current state-of-the-art models with improvements of more than 11% on the test sets of the English dataset and more than 6% on the test sets of the Hindi dataset. === The full-text of this article will be released for public view at the end of the publisher embargo on 24 Apr 2022.