Understanding Multimodal Popularity Prediction of Social Media Videos With Self-Attention

Predicting popularity of social media videos before they are published is a challenging task, mainly due to the complexity of content distribution network as well as the number of factors that play a part in this process. As solving this task provides tremendous help for media content creators, many...

Full description

Bibliographic Details
Main Authors: Adam Bielski, Tomasz Trzcinski
Format: Article
Language:English
Published: IEEE 2018-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8558491/
id doaj-40e10ece0e18499486a14d2bde6a1aac
record_format Article
spelling doaj-40e10ece0e18499486a14d2bde6a1aac2021-03-29T21:39:49ZengIEEEIEEE Access2169-35362018-01-016742777428710.1109/ACCESS.2018.28848318558491Understanding Multimodal Popularity Prediction of Social Media Videos With Self-AttentionAdam Bielski0https://orcid.org/0000-0001-9626-3566Tomasz Trzcinski1Tooploox, Warsaw, PolandTooploox, Warsaw, PolandPredicting popularity of social media videos before they are published is a challenging task, mainly due to the complexity of content distribution network as well as the number of factors that play a part in this process. As solving this task provides tremendous help for media content creators, many successful methods were proposed to solve this problem with machine learning. In this work, we change the viewpoint and postulate that it is not only the predicted popularity that matters but also, maybe even more importantly, understanding of how individual parts influence the final popularity score. To that end, we propose to combine the Grad-CAM visualization method that allows to visualize spatial relevance to popularity with a soft self-attention mechanism to weight the relative importance of frames in time domain. Our preliminary results show that this approach allows for more intuitive interpretation of the content impact on video popularity while achieving competitive results in terms of prediction accuracy.https://ieeexplore.ieee.org/document/8558491/Computer visionpopularity predictionsocial mediamachine learning
collection DOAJ
language English
format Article
sources DOAJ
author Adam Bielski
Tomasz Trzcinski
spellingShingle Adam Bielski
Tomasz Trzcinski
Understanding Multimodal Popularity Prediction of Social Media Videos With Self-Attention
IEEE Access
Computer vision
popularity prediction
social media
machine learning
author_facet Adam Bielski
Tomasz Trzcinski
author_sort Adam Bielski
title Understanding Multimodal Popularity Prediction of Social Media Videos With Self-Attention
title_short Understanding Multimodal Popularity Prediction of Social Media Videos With Self-Attention
title_full Understanding Multimodal Popularity Prediction of Social Media Videos With Self-Attention
title_fullStr Understanding Multimodal Popularity Prediction of Social Media Videos With Self-Attention
title_full_unstemmed Understanding Multimodal Popularity Prediction of Social Media Videos With Self-Attention
title_sort understanding multimodal popularity prediction of social media videos with self-attention
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2018-01-01
description Predicting popularity of social media videos before they are published is a challenging task, mainly due to the complexity of content distribution network as well as the number of factors that play a part in this process. As solving this task provides tremendous help for media content creators, many successful methods were proposed to solve this problem with machine learning. In this work, we change the viewpoint and postulate that it is not only the predicted popularity that matters but also, maybe even more importantly, understanding of how individual parts influence the final popularity score. To that end, we propose to combine the Grad-CAM visualization method that allows to visualize spatial relevance to popularity with a soft self-attention mechanism to weight the relative importance of frames in time domain. Our preliminary results show that this approach allows for more intuitive interpretation of the content impact on video popularity while achieving competitive results in terms of prediction accuracy.
topic Computer vision
popularity prediction
social media
machine learning
url https://ieeexplore.ieee.org/document/8558491/
work_keys_str_mv AT adambielski understandingmultimodalpopularitypredictionofsocialmediavideoswithselfattention
AT tomasztrzcinski understandingmultimodalpopularitypredictionofsocialmediavideoswithselfattention
_version_ 1724192491129798656