Characterizing communities of hashtag usage on twitter during the 2020 COVID-19 pandemic by multi-view clustering

Abstract The COVID-19 pandemic has produced a flurry of online activity on social media sites. As such, analysis of social media data during the COVID-19 pandemic can produce unique insights into discussion topics and how those topics evolve over the course of the pandemic. In this study, we propose...

Full description

Bibliographic Details
Main Authors: Iain J. Cruickshank, Kathleen M. Carley
Format: Article
Language:English
Published: SpringerOpen 2020-09-01
Series:Applied Network Science
Subjects:
Online Access:http://link.springer.com/article/10.1007/s41109-020-00317-8
id doaj-c1618346e9db442e9cae9eb85c96746e
record_format Article
spelling doaj-c1618346e9db442e9cae9eb85c96746e2020-11-25T03:55:01ZengSpringerOpenApplied Network Science2364-82282020-09-015114010.1007/s41109-020-00317-8Characterizing communities of hashtag usage on twitter during the 2020 COVID-19 pandemic by multi-view clusteringIain J. Cruickshank0Kathleen M. Carley1CASOS, Carnegie Mellon UniversityCASOS, Carnegie Mellon UniversityAbstract The COVID-19 pandemic has produced a flurry of online activity on social media sites. As such, analysis of social media data during the COVID-19 pandemic can produce unique insights into discussion topics and how those topics evolve over the course of the pandemic. In this study, we propose analyzing discussion topics on Twitter by clustering hashtags. In order to obtain high-quality clusters of the Twitter hashtags, we also propose a novel multi-view clustering technique that incorporates multiple different data types that can be used to describe how users interact with hashtags. The results of our multi-view clustering show that there are distinct temporal and topical trends present within COVID-19 twitter discussion. In particular, we find that some topical clusters of hashtags shift over the course of the pandemic, while others are persistent throughout, and that there are distinct temporal trends in hashtag usage. This study is the first to use multi-view clustering to analyze hashtags and the first analysis of the greater trends of discussion occurring online during the COVID-19 pandemic.http://link.springer.com/article/10.1007/s41109-020-00317-8Social mediaClusteringMulti-view dataCOVID-19
collection DOAJ
language English
format Article
sources DOAJ
author Iain J. Cruickshank
Kathleen M. Carley
spellingShingle Iain J. Cruickshank
Kathleen M. Carley
Characterizing communities of hashtag usage on twitter during the 2020 COVID-19 pandemic by multi-view clustering
Applied Network Science
Social media
Clustering
Multi-view data
COVID-19
author_facet Iain J. Cruickshank
Kathleen M. Carley
author_sort Iain J. Cruickshank
title Characterizing communities of hashtag usage on twitter during the 2020 COVID-19 pandemic by multi-view clustering
title_short Characterizing communities of hashtag usage on twitter during the 2020 COVID-19 pandemic by multi-view clustering
title_full Characterizing communities of hashtag usage on twitter during the 2020 COVID-19 pandemic by multi-view clustering
title_fullStr Characterizing communities of hashtag usage on twitter during the 2020 COVID-19 pandemic by multi-view clustering
title_full_unstemmed Characterizing communities of hashtag usage on twitter during the 2020 COVID-19 pandemic by multi-view clustering
title_sort characterizing communities of hashtag usage on twitter during the 2020 covid-19 pandemic by multi-view clustering
publisher SpringerOpen
series Applied Network Science
issn 2364-8228
publishDate 2020-09-01
description Abstract The COVID-19 pandemic has produced a flurry of online activity on social media sites. As such, analysis of social media data during the COVID-19 pandemic can produce unique insights into discussion topics and how those topics evolve over the course of the pandemic. In this study, we propose analyzing discussion topics on Twitter by clustering hashtags. In order to obtain high-quality clusters of the Twitter hashtags, we also propose a novel multi-view clustering technique that incorporates multiple different data types that can be used to describe how users interact with hashtags. The results of our multi-view clustering show that there are distinct temporal and topical trends present within COVID-19 twitter discussion. In particular, we find that some topical clusters of hashtags shift over the course of the pandemic, while others are persistent throughout, and that there are distinct temporal trends in hashtag usage. This study is the first to use multi-view clustering to analyze hashtags and the first analysis of the greater trends of discussion occurring online during the COVID-19 pandemic.
topic Social media
Clustering
Multi-view data
COVID-19
url http://link.springer.com/article/10.1007/s41109-020-00317-8
work_keys_str_mv AT iainjcruickshank characterizingcommunitiesofhashtagusageontwitterduringthe2020covid19pandemicbymultiviewclustering
AT kathleenmcarley characterizingcommunitiesofhashtagusageontwitterduringthe2020covid19pandemicbymultiviewclustering
_version_ 1724471132756639744