Understanding Multimodal Popularity Prediction of Social Media Videos With Self-Attention
Predicting popularity of social media videos before they are published is a challenging task, mainly due to the complexity of content distribution network as well as the number of factors that play a part in this process. As solving this task provides tremendous help for media content creators, many...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2018-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/8558491/ |
id |
doaj-40e10ece0e18499486a14d2bde6a1aac |
---|---|
record_format |
Article |
spelling |
doaj-40e10ece0e18499486a14d2bde6a1aac2021-03-29T21:39:49ZengIEEEIEEE Access2169-35362018-01-016742777428710.1109/ACCESS.2018.28848318558491Understanding Multimodal Popularity Prediction of Social Media Videos With Self-AttentionAdam Bielski0https://orcid.org/0000-0001-9626-3566Tomasz Trzcinski1Tooploox, Warsaw, PolandTooploox, Warsaw, PolandPredicting popularity of social media videos before they are published is a challenging task, mainly due to the complexity of content distribution network as well as the number of factors that play a part in this process. As solving this task provides tremendous help for media content creators, many successful methods were proposed to solve this problem with machine learning. In this work, we change the viewpoint and postulate that it is not only the predicted popularity that matters but also, maybe even more importantly, understanding of how individual parts influence the final popularity score. To that end, we propose to combine the Grad-CAM visualization method that allows to visualize spatial relevance to popularity with a soft self-attention mechanism to weight the relative importance of frames in time domain. Our preliminary results show that this approach allows for more intuitive interpretation of the content impact on video popularity while achieving competitive results in terms of prediction accuracy.https://ieeexplore.ieee.org/document/8558491/Computer visionpopularity predictionsocial mediamachine learning |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Adam Bielski Tomasz Trzcinski |
spellingShingle |
Adam Bielski Tomasz Trzcinski Understanding Multimodal Popularity Prediction of Social Media Videos With Self-Attention IEEE Access Computer vision popularity prediction social media machine learning |
author_facet |
Adam Bielski Tomasz Trzcinski |
author_sort |
Adam Bielski |
title |
Understanding Multimodal Popularity Prediction of Social Media Videos With Self-Attention |
title_short |
Understanding Multimodal Popularity Prediction of Social Media Videos With Self-Attention |
title_full |
Understanding Multimodal Popularity Prediction of Social Media Videos With Self-Attention |
title_fullStr |
Understanding Multimodal Popularity Prediction of Social Media Videos With Self-Attention |
title_full_unstemmed |
Understanding Multimodal Popularity Prediction of Social Media Videos With Self-Attention |
title_sort |
understanding multimodal popularity prediction of social media videos with self-attention |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2018-01-01 |
description |
Predicting popularity of social media videos before they are published is a challenging task, mainly due to the complexity of content distribution network as well as the number of factors that play a part in this process. As solving this task provides tremendous help for media content creators, many successful methods were proposed to solve this problem with machine learning. In this work, we change the viewpoint and postulate that it is not only the predicted popularity that matters but also, maybe even more importantly, understanding of how individual parts influence the final popularity score. To that end, we propose to combine the Grad-CAM visualization method that allows to visualize spatial relevance to popularity with a soft self-attention mechanism to weight the relative importance of frames in time domain. Our preliminary results show that this approach allows for more intuitive interpretation of the content impact on video popularity while achieving competitive results in terms of prediction accuracy. |
topic |
Computer vision popularity prediction social media machine learning |
url |
https://ieeexplore.ieee.org/document/8558491/ |
work_keys_str_mv |
AT adambielski understandingmultimodalpopularitypredictionofsocialmediavideoswithselfattention AT tomasztrzcinski understandingmultimodalpopularitypredictionofsocialmediavideoswithselfattention |
_version_ |
1724192491129798656 |