Crisis social media data labeled for storm-related information and toponym usage
Social media provides citizens and officials with important sources of information during times of crisis. This data article makes available labeled, storm-related social media data collected over a six-hour period during a severe storm and F1 tornado that struck Central Pennsylvania on May 1st, 201...
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2020-06-01
|
Series: | Data in Brief |
Subjects: | |
Online Access: | http://www.sciencedirect.com/science/article/pii/S2352340920304893 |
id |
doaj-61058c1061ef45e78db0bb01fad738ce |
---|---|
record_format |
Article |
spelling |
doaj-61058c1061ef45e78db0bb01fad738ce2020-11-25T02:27:26ZengElsevierData in Brief2352-34092020-06-0130105595Crisis social media data labeled for storm-related information and toponym usageRob Grace0Texas Tech UniversitySocial media provides citizens and officials with important sources of information during times of crisis. This data article makes available labeled, storm-related social media data collected over a six-hour period during a severe storm and F1 tornado that struck Central Pennsylvania on May 1st, 2017. Three datasets were collected from Twitter using location, keyword, and network filtering techniques, respectively. Only 2% of the 22,706 total tweets overlap among the datasets, providing researchers with a broader scope of information than normally available when collecting tweets using location (i.e., geotag-based) and keyword filtering alone or in combination during a crisis. Each data collection technique is described in detail, including network filtering which collects data from networks of social media users associated with a geographic area.The datasets are manually labeled for information content and toponym usage. The 22,706 tweet IDs, dehydrated for privacy, are labeled for relevance (storm-related and off-topic) and 19 types of storm-related information organized into six categories: infrastructure damage, service disruption, personal experience, weather updates, weather forecasts, and weather warnings. Data are also labeled for toponym usage (with or without toponyms), location (local, remote, and generic toponyms), and granularity (hyperlocal, municipal, and regional toponyms). The comprehensively labeled datasets provide researchers with opportunities to analyze crisis-related information behaviors and volunteered location information behaviors during a hyperlocal crisis event, as well as develop and evaluate automated filtering, geolocation, and event detection techniques that can aid citizens and crisis responders.http://www.sciencedirect.com/science/article/pii/S2352340920304893Crisis informaticsvolunteered geographic informationTwitterInformation behaviorEmergency managementRisk communication |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Rob Grace |
spellingShingle |
Rob Grace Crisis social media data labeled for storm-related information and toponym usage Data in Brief Crisis informatics volunteered geographic information Information behavior Emergency management Risk communication |
author_facet |
Rob Grace |
author_sort |
Rob Grace |
title |
Crisis social media data labeled for storm-related information and toponym usage |
title_short |
Crisis social media data labeled for storm-related information and toponym usage |
title_full |
Crisis social media data labeled for storm-related information and toponym usage |
title_fullStr |
Crisis social media data labeled for storm-related information and toponym usage |
title_full_unstemmed |
Crisis social media data labeled for storm-related information and toponym usage |
title_sort |
crisis social media data labeled for storm-related information and toponym usage |
publisher |
Elsevier |
series |
Data in Brief |
issn |
2352-3409 |
publishDate |
2020-06-01 |
description |
Social media provides citizens and officials with important sources of information during times of crisis. This data article makes available labeled, storm-related social media data collected over a six-hour period during a severe storm and F1 tornado that struck Central Pennsylvania on May 1st, 2017. Three datasets were collected from Twitter using location, keyword, and network filtering techniques, respectively. Only 2% of the 22,706 total tweets overlap among the datasets, providing researchers with a broader scope of information than normally available when collecting tweets using location (i.e., geotag-based) and keyword filtering alone or in combination during a crisis. Each data collection technique is described in detail, including network filtering which collects data from networks of social media users associated with a geographic area.The datasets are manually labeled for information content and toponym usage. The 22,706 tweet IDs, dehydrated for privacy, are labeled for relevance (storm-related and off-topic) and 19 types of storm-related information organized into six categories: infrastructure damage, service disruption, personal experience, weather updates, weather forecasts, and weather warnings. Data are also labeled for toponym usage (with or without toponyms), location (local, remote, and generic toponyms), and granularity (hyperlocal, municipal, and regional toponyms). The comprehensively labeled datasets provide researchers with opportunities to analyze crisis-related information behaviors and volunteered location information behaviors during a hyperlocal crisis event, as well as develop and evaluate automated filtering, geolocation, and event detection techniques that can aid citizens and crisis responders. |
topic |
Crisis informatics volunteered geographic information Information behavior Emergency management Risk communication |
url |
http://www.sciencedirect.com/science/article/pii/S2352340920304893 |
work_keys_str_mv |
AT robgrace crisissocialmediadatalabeledforstormrelatedinformationandtoponymusage |
_version_ |
1724843178433970176 |