Spatial Reliability Assessment of Social Media Mining Techniques with Regard to Disaster Domain-Based Filtering

The data generated by social media such as Twitter are classified as big data and the usability of those data can provide a wide range of resources to various study areas including disaster management, tourism, political science, and health. However, apart from the acquisition of the data, the relia...


Bibliographic Details
Main Authors: Ayse Giz Gulnerman, Himmet Karaman
Format: Article
Language: English
Published: MDPI AG 2020-04-01
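Series: ISPRS International Journal of Geo-Information
Subjects: volunteered geographic information; spatial assessment; spatial similarity index; sentiment analysis
Online Access: https://www.mdpi.com/2220-9964/9/4/245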
id doaj-527fbc3ba8074393a9c11b6708636d34
record_format Article
spelling doaj-527fbc3ba8074393a9c11b6708636d34
2020-11-25T02:21:36Z
eng
MDPI AG
ISPRS International Journal of Geo-Information
2220-9964
2020-04-01
vol. 9, no. 4, article 245
10.3390/ijgi9040245
Spatial Reliability Assessment of Social Media Mining Techniques with Regard to Disaster Domain-Based Filtering
Ayse Giz Gulnerman, Himmet Karaman
Geomatics Engineering Department, Faculty of Civil Engineering, Istanbul Technical University, Sariyer, 34469 Istanbul, Turkey (both authors)
https://www.mdpi.com/2220-9964/9/4/245
volunteered geographic information
spatial assessment
spatial similarity index
sentiment analysis
collection DOAJ
language English
format Article
sources DOAJ
author Ayse Giz Gulnerman
Himmet Karaman
author_facet Ayse Giz Gulnerman
Himmet Karaman
author_sort Ayse Giz Gulnerman
title Spatial Reliability Assessment of Social Media Mining Techniques with Regard to Disaster Domain-Based Filtering
publisher MDPI AG
series ISPRS International Journal of Geo-Information
issn 2220-9964
publishDate 2020-04-01
description The data generated by social media such as Twitter are classified as big data, and their usability can provide a wide range of resources to various study areas including disaster management, tourism, political science, and health. However, apart from the acquisition of the data, their reliability and accuracy concern scientists, in terms of whether the use of social media data (SMD) can lead to incorrect and unreliable inferences. There have been many studies analyzing SMD in order to investigate their reliability, accuracy, or credibility, but these have not dealt with the filtering techniques applied to the data before creating the results or after their acquisition. This study provides a methodology for detecting the accuracy and reliability of the filtering techniques for SMD, and then a spatial similarity index that analyzes spatial intersections, proximity, and size, and compares them. Finally, we offer a comparison that shows the best combination of filtering techniques and similarity indices to create event maps of SMD by using the Getis-Ord Gi* technique. The steps of this study can be summarized as follows: an investigation of domain-based text filtering techniques for dealing with sentiment lexicons, machine learning-based sentiment analyses on reliability, and developing intermediate codes specific to domain-based studies; then, by using various similarity indices, the determination of the spatial reliability and accuracy of maps of the filtered social media data. The study offers the best combination of filtering, mapping, and spatial accuracy investigation methods for social media data, especially in the case of emergencies, where urgent spatial information is required. As a result, a new similarity index based on the spatial intersection, spatial size, and proximity relationships is introduced to determine the spatial accuracy of the fine-filtered SMD. The motivation for this research is to develop the ability to create an incidence map shortly after a disaster event such as a bombing. However, the proposed methodology can also be used for various domains such as concerts, elections, natural disasters, marketing, etc.
topic volunteered geographic information
spatial assessment
spatial similarity index
sentiment analysis
url https://www.mdpi.com/2220-9964/9/4/245
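
The description names the Getis-Ord Gi* technique for creating event maps but does not spell out the computation. The following is a minimal sketch of the standard Gi* z-score, not the authors' implementation. It assumes binary fixed-distance weights in which each location counts as its own neighbour; the function name `getis_ord_gi_star`, the `bandwidth` parameter, and the toy grid of counts are hypothetical and chosen only for illustration.

```python
import numpy as np


def getis_ord_gi_star(values, coords, bandwidth):
    """Return the Getis-Ord Gi* z-score for each location.

    Assumption: binary weights, w_ij = 1 when the distance between
    locations i and j is <= bandwidth (so w_ii = 1), otherwise 0.
    """
    x = np.asarray(values, dtype=float)
    pts = np.asarray(coords, dtype=float)
    n = x.size

    # Pairwise Euclidean distances between all locations.
    diff = pts[:, None, :] - pts[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=2))
    w = (dist <= bandwidth).astype(float)

    mean = x.mean()
    s = np.sqrt((x ** 2).mean() - mean ** 2)  # global standard deviation

    w_sum = w.sum(axis=1)
    w_sq_sum = (w ** 2).sum(axis=1)

    numer = w @ x - mean * w_sum
    # Clip guards against tiny negative values from floating-point rounding.
    spread = np.clip((n * w_sq_sum - w_sum ** 2) / (n - 1), 0.0, None)
    denom = s * np.sqrt(spread)

    # Locations with a zero denominator (for example, when every point is a
    # neighbour of every other point) are reported as NaN.
    z = np.full(n, np.nan)
    np.divide(numer, denom, out=z, where=denom > 0)
    return z


# Toy data (made up for illustration): a 5 x 5 grid of filtered-post counts
# with a dense 2 x 2 block in one corner.
coords = [(i, j) for i in range(5) for j in range(5)]
counts = [10 if (i < 2 and j < 2) else 1 for i, j in coords]
z = getis_ord_gi_star(counts, coords, bandwidth=1.0)

# Cells with z above roughly 1.96 are candidate hot spots at the 5% level.
hot_spots = [coords[k] for k in range(len(coords)) if z[k] > 1.96]
```

In practice the weights, the bandwidth, and the spatial units (points, grid cells, or districts) all affect the resulting map, so they would need to follow whatever the article itself specifies.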