Transfer Correlation Between Textual Content to Images for Sentiment Analysis

In social media, images and texts are used to convey individuals' attitudes and feelings; thus, social media has become an indispensable part of people's lives. To understand social behavior and provide better recommendations, sentiment analysis on social media is helpful. One sentiment an...

Full description

Bibliographic Details
Main Authors: Ke Zhang, Yunwen Zhu, Wenjun Zhang, Weilin Zhang, Yonghua Zhu
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9003301/
id doaj-ccd46afef02f4846bfe350de909523c3
record_format Article
spelling doaj-ccd46afef02f4846bfe350de909523c32021-03-30T02:01:28ZengIEEEIEEE Access2169-35362020-01-018352763528910.1109/ACCESS.2020.29750369003301Transfer Correlation Between Textual Content to Images for Sentiment AnalysisKe Zhang0https://orcid.org/0000-0002-8851-668XYunwen Zhu1https://orcid.org/0000-0001-6745-7468Wenjun Zhang2https://orcid.org/0000-0002-0449-0946Weilin Zhang3https://orcid.org/0000-0002-2774-0944Yonghua Zhu4https://orcid.org/0000-0002-6419-9018Shanghai Film Academy, Shanghai University, Shanghai, ChinaShanghai Film Academy, Shanghai University, Shanghai, ChinaShanghai Film Academy, Shanghai University, Shanghai, ChinaSchool of Computer Engineering and Science, Shanghai University, Shanghai, ChinaShanghai Film Academy, Shanghai University, Shanghai, ChinaIn social media, images and texts are used to convey individuals' attitudes and feelings; thus, social media has become an indispensable part of people's lives. To understand social behavior and provide better recommendations, sentiment analysis on social media is helpful. One sentiment analysis task is polarity prediction. Although current research on visual or textual sentiment analysis has achieved quite good progress, multimodal and cross-modal analysis combining visual and textual correlation is still in the exploration stage. To capture a semantic connection between images and captions, this paper proposes a cross-modal approach that considers both images and captions in classifying image sentiment polarity. This method transfers the correlation between textual content to images. First, the image and its corresponding caption are sent into an inner-class mapping model, where they are transformed into vectors in Hilbert space to obtain their labels by calculating the inner-class maximum mean discrepancy (MMD). Then, a class-aware sentence representation (CASR) model assigns the distributed representation to the labels with a class-aware attention-based gated recurrent unit (GRU). Finally, an inner-class dependency LSTM (IDLSTM) classifies the sentiment polarity. Experiments carried out on the Getty Images dataset and Twitter 1269 dataset demonstrate the effectiveness of our approach. Moreover, extensive experimental results show that our model outperforms baseline solutions.https://ieeexplore.ieee.org/document/9003301/Correlationcross-modaltransfersentiment analysis
collection DOAJ
language English
format Article
sources DOAJ
author Ke Zhang
Yunwen Zhu
Wenjun Zhang
Weilin Zhang
Yonghua Zhu
spellingShingle Ke Zhang
Yunwen Zhu
Wenjun Zhang
Weilin Zhang
Yonghua Zhu
Transfer Correlation Between Textual Content to Images for Sentiment Analysis
IEEE Access
Correlation
cross-modal
transfer
sentiment analysis
author_facet Ke Zhang
Yunwen Zhu
Wenjun Zhang
Weilin Zhang
Yonghua Zhu
author_sort Ke Zhang
title Transfer Correlation Between Textual Content to Images for Sentiment Analysis
title_short Transfer Correlation Between Textual Content to Images for Sentiment Analysis
title_full Transfer Correlation Between Textual Content to Images for Sentiment Analysis
title_fullStr Transfer Correlation Between Textual Content to Images for Sentiment Analysis
title_full_unstemmed Transfer Correlation Between Textual Content to Images for Sentiment Analysis
title_sort transfer correlation between textual content to images for sentiment analysis
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2020-01-01
description In social media, images and texts are used to convey individuals' attitudes and feelings; thus, social media has become an indispensable part of people's lives. To understand social behavior and provide better recommendations, sentiment analysis on social media is helpful. One sentiment analysis task is polarity prediction. Although current research on visual or textual sentiment analysis has achieved quite good progress, multimodal and cross-modal analysis combining visual and textual correlation is still in the exploration stage. To capture a semantic connection between images and captions, this paper proposes a cross-modal approach that considers both images and captions in classifying image sentiment polarity. This method transfers the correlation between textual content to images. First, the image and its corresponding caption are sent into an inner-class mapping model, where they are transformed into vectors in Hilbert space to obtain their labels by calculating the inner-class maximum mean discrepancy (MMD). Then, a class-aware sentence representation (CASR) model assigns the distributed representation to the labels with a class-aware attention-based gated recurrent unit (GRU). Finally, an inner-class dependency LSTM (IDLSTM) classifies the sentiment polarity. Experiments carried out on the Getty Images dataset and Twitter 1269 dataset demonstrate the effectiveness of our approach. Moreover, extensive experimental results show that our model outperforms baseline solutions.
topic Correlation
cross-modal
transfer
sentiment analysis
url https://ieeexplore.ieee.org/document/9003301/
work_keys_str_mv AT kezhang transfercorrelationbetweentextualcontenttoimagesforsentimentanalysis
AT yunwenzhu transfercorrelationbetweentextualcontenttoimagesforsentimentanalysis
AT wenjunzhang transfercorrelationbetweentextualcontenttoimagesforsentimentanalysis
AT weilinzhang transfercorrelationbetweentextualcontenttoimagesforsentimentanalysis
AT yonghuazhu transfercorrelationbetweentextualcontenttoimagesforsentimentanalysis
_version_ 1724186039087529984