From rumor to genetic mutation detection with explanations: a GAN approach

Abstract Social media have emerged as increasingly popular means and environments for information gathering and propagation. This vigorous growth of social media contributed not only to a pandemic (fast-spreading and far-reaching) of rumors and misinformation, but also to an urgent need for text-bas...

Full description

Bibliographic Details
Main Authors: Mingxi Cheng, Yizhi Li, Shahin Nazarian, Paul Bogdan
Format: Article
Language:English
Published: Nature Publishing Group 2021-03-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-021-84993-1
id doaj-8ccb602da5234c52b030422cbbd11a40
record_format Article
spelling doaj-8ccb602da5234c52b030422cbbd11a402021-03-14T12:16:14ZengNature Publishing GroupScientific Reports2045-23222021-03-0111111410.1038/s41598-021-84993-1From rumor to genetic mutation detection with explanations: a GAN approachMingxi Cheng0Yizhi Li1Shahin Nazarian2Paul Bogdan3Ming Hsieh Department of Electrical and Computer Engineering, University of Southern CaliforniaSchool of Computer Science, Beijing University of Posts and TelecommunicationsMing Hsieh Department of Electrical and Computer Engineering, University of Southern CaliforniaMing Hsieh Department of Electrical and Computer Engineering, University of Southern CaliforniaAbstract Social media have emerged as increasingly popular means and environments for information gathering and propagation. This vigorous growth of social media contributed not only to a pandemic (fast-spreading and far-reaching) of rumors and misinformation, but also to an urgent need for text-based rumor detection strategies. To speed up the detection of misinformation, traditional rumor detection methods based on hand-crafted feature selection need to be replaced by automatic artificial intelligence (AI) approaches. AI decision making systems require to provide explanations in order to assure users of their trustworthiness. Inspired by the thriving development of generative adversarial networks (GANs) on text applications, we propose a GAN-based layered model for rumor detection with explanations. To demonstrate the universality of the proposed approach, we demonstrate its benefits on a gene classification with mutation detection case study. Similarly to the rumor detection, the gene classification can also be formulated as a text-based classification problem. Unlike fake news detection that needs a previously collected verified news database, our model provides explanations in rumor detection based on tweet-level texts only without referring to a verified news database. The layered structure of both generative and discriminative models contributes to the outstanding performance. The layered generators produce rumors by intelligently inserting controversial information in non-rumors, and force the layered discriminators to detect detailed glitches and deduce exactly which parts in the sentence are problematic. On average, in the rumor detection task, our proposed model outperforms state-of-the-art baselines on PHEME dataset by $$26.85\%$$ 26.85 % in terms of macro-f1. The excellent performance of our model for textural sequences is also demonstrated by the gene mutation case study on which it achieves $$72.69\%$$ 72.69 % macro-f1 score.https://doi.org/10.1038/s41598-021-84993-1
collection DOAJ
language English
format Article
sources DOAJ
author Mingxi Cheng
Yizhi Li
Shahin Nazarian
Paul Bogdan
spellingShingle Mingxi Cheng
Yizhi Li
Shahin Nazarian
Paul Bogdan
From rumor to genetic mutation detection with explanations: a GAN approach
Scientific Reports
author_facet Mingxi Cheng
Yizhi Li
Shahin Nazarian
Paul Bogdan
author_sort Mingxi Cheng
title From rumor to genetic mutation detection with explanations: a GAN approach
title_short From rumor to genetic mutation detection with explanations: a GAN approach
title_full From rumor to genetic mutation detection with explanations: a GAN approach
title_fullStr From rumor to genetic mutation detection with explanations: a GAN approach
title_full_unstemmed From rumor to genetic mutation detection with explanations: a GAN approach
title_sort from rumor to genetic mutation detection with explanations: a gan approach
publisher Nature Publishing Group
series Scientific Reports
issn 2045-2322
publishDate 2021-03-01
description Abstract Social media have emerged as increasingly popular means and environments for information gathering and propagation. This vigorous growth of social media contributed not only to a pandemic (fast-spreading and far-reaching) of rumors and misinformation, but also to an urgent need for text-based rumor detection strategies. To speed up the detection of misinformation, traditional rumor detection methods based on hand-crafted feature selection need to be replaced by automatic artificial intelligence (AI) approaches. AI decision making systems require to provide explanations in order to assure users of their trustworthiness. Inspired by the thriving development of generative adversarial networks (GANs) on text applications, we propose a GAN-based layered model for rumor detection with explanations. To demonstrate the universality of the proposed approach, we demonstrate its benefits on a gene classification with mutation detection case study. Similarly to the rumor detection, the gene classification can also be formulated as a text-based classification problem. Unlike fake news detection that needs a previously collected verified news database, our model provides explanations in rumor detection based on tweet-level texts only without referring to a verified news database. The layered structure of both generative and discriminative models contributes to the outstanding performance. The layered generators produce rumors by intelligently inserting controversial information in non-rumors, and force the layered discriminators to detect detailed glitches and deduce exactly which parts in the sentence are problematic. On average, in the rumor detection task, our proposed model outperforms state-of-the-art baselines on PHEME dataset by $$26.85\%$$ 26.85 % in terms of macro-f1. The excellent performance of our model for textural sequences is also demonstrated by the gene mutation case study on which it achieves $$72.69\%$$ 72.69 % macro-f1 score.
url https://doi.org/10.1038/s41598-021-84993-1
work_keys_str_mv AT mingxicheng fromrumortogeneticmutationdetectionwithexplanationsaganapproach
AT yizhili fromrumortogeneticmutationdetectionwithexplanationsaganapproach
AT shahinnazarian fromrumortogeneticmutationdetectionwithexplanationsaganapproach
AT paulbogdan fromrumortogeneticmutationdetectionwithexplanationsaganapproach
_version_ 1724221446721372160