Automated Seeded Latent Dirichlet Allocation for Social Media Based Event Detection and Mapping

In the event of a natural disaster, geo-tagged Tweets are an immediate source of information for locating casualties and damages, and for supporting disaster management. Topic modeling can help in detecting disaster-related Tweets in the noisy Twitter stream in an unsupervised manner. However, the r...

Full description

Bibliographic Details
Main Authors: Cornelia Ferner, Clemens Havas, Elisabeth Birnbacher, Stefan  Wegenkittl, Bernd Resch
Format: Article
Language:English
Published: MDPI AG 2020-07-01
Series:Information
Subjects:
Online Access:https://www.mdpi.com/2078-2489/11/8/376
id doaj-92cd8e11135d419daedd62dccc474a4b
record_format Article
spelling doaj-92cd8e11135d419daedd62dccc474a4b2020-11-25T03:38:37ZengMDPI AGInformation2078-24892020-07-011137637610.3390/info11080376Automated Seeded Latent Dirichlet Allocation for Social Media Based Event Detection and MappingCornelia Ferner0Clemens Havas1Elisabeth Birnbacher2Stefan  Wegenkittl 3Bernd Resch4Salzburg University of Applied Sciences, Urstein Sued 1, 5412 Puch/Hallein, AustriaUniversity of Salzburg, Schillerstrasse 30, 5020 Salzburg, AustriaSalzburg University of Applied Sciences, Urstein Sued 1, 5412 Puch/Hallein, AustriaSalzburg University of Applied Sciences, Urstein Sued 1, 5412 Puch/Hallein, AustriaUniversity of Salzburg, Schillerstrasse 30, 5020 Salzburg, AustriaIn the event of a natural disaster, geo-tagged Tweets are an immediate source of information for locating casualties and damages, and for supporting disaster management. Topic modeling can help in detecting disaster-related Tweets in the noisy Twitter stream in an unsupervised manner. However, the results of topic models are difficult to interpret and require manual identification of one or more “disaster topics”. Immediate disaster response would benefit from a fully automated process for interpreting the modeled topics and extracting disaster relevant information. Initializing the topic model with a set of seed words already allows to directly identify the corresponding disaster topic. In order to enable an automated end-to-end process, we automatically generate seed words using older Tweets from the same geographic area. The results of two past events (Napa Valley earthquake 2014 and hurricane Harvey 2017) show that the geospatial distribution of Tweets identified as disaster related conforms with the officially released disaster footprints. The suggested approach is applicable when there is a single topic of interest and comparative data available.https://www.mdpi.com/2078-2489/11/8/376topic modelingsocial mediageospatial analysisdisaster management
collection DOAJ
language English
format Article
sources DOAJ
author Cornelia Ferner
Clemens Havas
Elisabeth Birnbacher
Stefan  Wegenkittl
Bernd Resch
spellingShingle Cornelia Ferner
Clemens Havas
Elisabeth Birnbacher
Stefan  Wegenkittl
Bernd Resch
Automated Seeded Latent Dirichlet Allocation for Social Media Based Event Detection and Mapping
Information
topic modeling
social media
geospatial analysis
disaster management
author_facet Cornelia Ferner
Clemens Havas
Elisabeth Birnbacher
Stefan  Wegenkittl
Bernd Resch
author_sort Cornelia Ferner
title Automated Seeded Latent Dirichlet Allocation for Social Media Based Event Detection and Mapping
title_short Automated Seeded Latent Dirichlet Allocation for Social Media Based Event Detection and Mapping
title_full Automated Seeded Latent Dirichlet Allocation for Social Media Based Event Detection and Mapping
title_fullStr Automated Seeded Latent Dirichlet Allocation for Social Media Based Event Detection and Mapping
title_full_unstemmed Automated Seeded Latent Dirichlet Allocation for Social Media Based Event Detection and Mapping
title_sort automated seeded latent dirichlet allocation for social media based event detection and mapping
publisher MDPI AG
series Information
issn 2078-2489
publishDate 2020-07-01
description In the event of a natural disaster, geo-tagged Tweets are an immediate source of information for locating casualties and damages, and for supporting disaster management. Topic modeling can help in detecting disaster-related Tweets in the noisy Twitter stream in an unsupervised manner. However, the results of topic models are difficult to interpret and require manual identification of one or more “disaster topics”. Immediate disaster response would benefit from a fully automated process for interpreting the modeled topics and extracting disaster relevant information. Initializing the topic model with a set of seed words already allows to directly identify the corresponding disaster topic. In order to enable an automated end-to-end process, we automatically generate seed words using older Tweets from the same geographic area. The results of two past events (Napa Valley earthquake 2014 and hurricane Harvey 2017) show that the geospatial distribution of Tweets identified as disaster related conforms with the officially released disaster footprints. The suggested approach is applicable when there is a single topic of interest and comparative data available.
topic topic modeling
social media
geospatial analysis
disaster management
url https://www.mdpi.com/2078-2489/11/8/376
work_keys_str_mv AT corneliaferner automatedseededlatentdirichletallocationforsocialmediabasedeventdetectionandmapping
AT clemenshavas automatedseededlatentdirichletallocationforsocialmediabasedeventdetectionandmapping
AT elisabethbirnbacher automatedseededlatentdirichletallocationforsocialmediabasedeventdetectionandmapping
AT stefanwegenkittl automatedseededlatentdirichletallocationforsocialmediabasedeventdetectionandmapping
AT berndresch automatedseededlatentdirichletallocationforsocialmediabasedeventdetectionandmapping
_version_ 1724541500843360256