Automated Seeded Latent Dirichlet Allocation for Social Media Based Event Detection and Mapping
In the event of a natural disaster, geo-tagged Tweets are an immediate source of information for locating casualties and damages, and for supporting disaster management. Topic modeling can help in detecting disaster-related Tweets in the noisy Twitter stream in an unsupervised manner. However, the r...
Main Authors: | , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2020-07-01
|
Series: | Information |
Subjects: | |
Online Access: | https://www.mdpi.com/2078-2489/11/8/376 |
id |
doaj-92cd8e11135d419daedd62dccc474a4b |
---|---|
record_format |
Article |
spelling |
doaj-92cd8e11135d419daedd62dccc474a4b2020-11-25T03:38:37ZengMDPI AGInformation2078-24892020-07-011137637610.3390/info11080376Automated Seeded Latent Dirichlet Allocation for Social Media Based Event Detection and MappingCornelia Ferner0Clemens Havas1Elisabeth Birnbacher2Stefan Wegenkittl 3Bernd Resch4Salzburg University of Applied Sciences, Urstein Sued 1, 5412 Puch/Hallein, AustriaUniversity of Salzburg, Schillerstrasse 30, 5020 Salzburg, AustriaSalzburg University of Applied Sciences, Urstein Sued 1, 5412 Puch/Hallein, AustriaSalzburg University of Applied Sciences, Urstein Sued 1, 5412 Puch/Hallein, AustriaUniversity of Salzburg, Schillerstrasse 30, 5020 Salzburg, AustriaIn the event of a natural disaster, geo-tagged Tweets are an immediate source of information for locating casualties and damages, and for supporting disaster management. Topic modeling can help in detecting disaster-related Tweets in the noisy Twitter stream in an unsupervised manner. However, the results of topic models are difficult to interpret and require manual identification of one or more “disaster topics”. Immediate disaster response would benefit from a fully automated process for interpreting the modeled topics and extracting disaster relevant information. Initializing the topic model with a set of seed words already allows to directly identify the corresponding disaster topic. In order to enable an automated end-to-end process, we automatically generate seed words using older Tweets from the same geographic area. The results of two past events (Napa Valley earthquake 2014 and hurricane Harvey 2017) show that the geospatial distribution of Tweets identified as disaster related conforms with the officially released disaster footprints. The suggested approach is applicable when there is a single topic of interest and comparative data available.https://www.mdpi.com/2078-2489/11/8/376topic modelingsocial mediageospatial analysisdisaster management |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Cornelia Ferner Clemens Havas Elisabeth Birnbacher Stefan Wegenkittl Bernd Resch |
spellingShingle |
Cornelia Ferner Clemens Havas Elisabeth Birnbacher Stefan Wegenkittl Bernd Resch Automated Seeded Latent Dirichlet Allocation for Social Media Based Event Detection and Mapping Information topic modeling social media geospatial analysis disaster management |
author_facet |
Cornelia Ferner Clemens Havas Elisabeth Birnbacher Stefan Wegenkittl Bernd Resch |
author_sort |
Cornelia Ferner |
title |
Automated Seeded Latent Dirichlet Allocation for Social Media Based Event Detection and Mapping |
title_short |
Automated Seeded Latent Dirichlet Allocation for Social Media Based Event Detection and Mapping |
title_full |
Automated Seeded Latent Dirichlet Allocation for Social Media Based Event Detection and Mapping |
title_fullStr |
Automated Seeded Latent Dirichlet Allocation for Social Media Based Event Detection and Mapping |
title_full_unstemmed |
Automated Seeded Latent Dirichlet Allocation for Social Media Based Event Detection and Mapping |
title_sort |
automated seeded latent dirichlet allocation for social media based event detection and mapping |
publisher |
MDPI AG |
series |
Information |
issn |
2078-2489 |
publishDate |
2020-07-01 |
description |
In the event of a natural disaster, geo-tagged Tweets are an immediate source of information for locating casualties and damages, and for supporting disaster management. Topic modeling can help in detecting disaster-related Tweets in the noisy Twitter stream in an unsupervised manner. However, the results of topic models are difficult to interpret and require manual identification of one or more “disaster topics”. Immediate disaster response would benefit from a fully automated process for interpreting the modeled topics and extracting disaster relevant information. Initializing the topic model with a set of seed words already allows to directly identify the corresponding disaster topic. In order to enable an automated end-to-end process, we automatically generate seed words using older Tweets from the same geographic area. The results of two past events (Napa Valley earthquake 2014 and hurricane Harvey 2017) show that the geospatial distribution of Tweets identified as disaster related conforms with the officially released disaster footprints. The suggested approach is applicable when there is a single topic of interest and comparative data available. |
topic |
topic modeling social media geospatial analysis disaster management |
url |
https://www.mdpi.com/2078-2489/11/8/376 |
work_keys_str_mv |
AT corneliaferner automatedseededlatentdirichletallocationforsocialmediabasedeventdetectionandmapping AT clemenshavas automatedseededlatentdirichletallocationforsocialmediabasedeventdetectionandmapping AT elisabethbirnbacher automatedseededlatentdirichletallocationforsocialmediabasedeventdetectionandmapping AT stefanwegenkittl automatedseededlatentdirichletallocationforsocialmediabasedeventdetectionandmapping AT berndresch automatedseededlatentdirichletallocationforsocialmediabasedeventdetectionandmapping |
_version_ |
1724541500843360256 |