Crisis social media data labeled for storm-related information and toponym usage

Social media provides citizens and officials with important sources of information during times of crisis. This data article makes available labeled, storm-related social media data collected over a six-hour period during a severe storm and F1 tornado that struck Central Pennsylvania on May 1st, 201...

Full description

Bibliographic Details
Main Author: Rob Grace
Format: Article
Language:English
Published: Elsevier 2020-06-01
Series:Data in Brief
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2352340920304893
id doaj-61058c1061ef45e78db0bb01fad738ce
record_format Article
spelling doaj-61058c1061ef45e78db0bb01fad738ce2020-11-25T02:27:26ZengElsevierData in Brief2352-34092020-06-0130105595Crisis social media data labeled for storm-related information and toponym usageRob Grace0Texas Tech UniversitySocial media provides citizens and officials with important sources of information during times of crisis. This data article makes available labeled, storm-related social media data collected over a six-hour period during a severe storm and F1 tornado that struck Central Pennsylvania on May 1st, 2017. Three datasets were collected from Twitter using location, keyword, and network filtering techniques, respectively. Only 2% of the 22,706 total tweets overlap among the datasets, providing researchers with a broader scope of information than normally available when collecting tweets using location (i.e., geotag-based) and keyword filtering alone or in combination during a crisis. Each data collection technique is described in detail, including network filtering which collects data from networks of social media users associated with a geographic area.The datasets are manually labeled for information content and toponym usage. The 22,706 tweet IDs, dehydrated for privacy, are labeled for relevance (storm-related and off-topic) and 19 types of storm-related information organized into six categories: infrastructure damage, service disruption, personal experience, weather updates, weather forecasts, and weather warnings. Data are also labeled for toponym usage (with or without toponyms), location (local, remote, and generic toponyms), and granularity (hyperlocal, municipal, and regional toponyms). The comprehensively labeled datasets provide researchers with opportunities to analyze crisis-related information behaviors and volunteered location information behaviors during a hyperlocal crisis event, as well as develop and evaluate automated filtering, geolocation, and event detection techniques that can aid citizens and crisis responders.http://www.sciencedirect.com/science/article/pii/S2352340920304893Crisis informaticsvolunteered geographic informationTwitterInformation behaviorEmergency managementRisk communication
collection DOAJ
language English
format Article
sources DOAJ
author Rob Grace
spellingShingle Rob Grace
Crisis social media data labeled for storm-related information and toponym usage
Data in Brief
Crisis informatics
volunteered geographic information
Twitter
Information behavior
Emergency management
Risk communication
author_facet Rob Grace
author_sort Rob Grace
title Crisis social media data labeled for storm-related information and toponym usage
title_short Crisis social media data labeled for storm-related information and toponym usage
title_full Crisis social media data labeled for storm-related information and toponym usage
title_fullStr Crisis social media data labeled for storm-related information and toponym usage
title_full_unstemmed Crisis social media data labeled for storm-related information and toponym usage
title_sort crisis social media data labeled for storm-related information and toponym usage
publisher Elsevier
series Data in Brief
issn 2352-3409
publishDate 2020-06-01
description Social media provides citizens and officials with important sources of information during times of crisis. This data article makes available labeled, storm-related social media data collected over a six-hour period during a severe storm and F1 tornado that struck Central Pennsylvania on May 1st, 2017. Three datasets were collected from Twitter using location, keyword, and network filtering techniques, respectively. Only 2% of the 22,706 total tweets overlap among the datasets, providing researchers with a broader scope of information than normally available when collecting tweets using location (i.e., geotag-based) and keyword filtering alone or in combination during a crisis. Each data collection technique is described in detail, including network filtering which collects data from networks of social media users associated with a geographic area.The datasets are manually labeled for information content and toponym usage. The 22,706 tweet IDs, dehydrated for privacy, are labeled for relevance (storm-related and off-topic) and 19 types of storm-related information organized into six categories: infrastructure damage, service disruption, personal experience, weather updates, weather forecasts, and weather warnings. Data are also labeled for toponym usage (with or without toponyms), location (local, remote, and generic toponyms), and granularity (hyperlocal, municipal, and regional toponyms). The comprehensively labeled datasets provide researchers with opportunities to analyze crisis-related information behaviors and volunteered location information behaviors during a hyperlocal crisis event, as well as develop and evaluate automated filtering, geolocation, and event detection techniques that can aid citizens and crisis responders.
topic Crisis informatics
volunteered geographic information
Twitter
Information behavior
Emergency management
Risk communication
url http://www.sciencedirect.com/science/article/pii/S2352340920304893
work_keys_str_mv AT robgrace crisissocialmediadatalabeledforstormrelatedinformationandtoponymusage
_version_ 1724843178433970176