Spatial Reliability Assessment of Social Media Mining Techniques with Regard to Disaster Domain-Based Filtering
The data generated by social media such as Twitter are classified as big data and the usability of those data can provide a wide range of resources to various study areas including disaster management, tourism, political science, and health. However, apart from the acquisition of the data, the relia...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2020-04-01
|
Series: | ISPRS International Journal of Geo-Information |
Subjects: | |
Online Access: | https://www.mdpi.com/2220-9964/9/4/245 |
id |
doaj-527fbc3ba8074393a9c11b6708636d34 |
---|---|
record_format |
Article |
spelling |
doaj-527fbc3ba8074393a9c11b6708636d342020-11-25T02:21:36ZengMDPI AGISPRS International Journal of Geo-Information2220-99642020-04-01924524510.3390/ijgi9040245Spatial Reliability Assessment of Social Media Mining Techniques with Regard to Disaster Domain-Based FilteringAyse Giz Gulnerman0Himmet Karaman1Geomatics Engineering Department, Faculty of Civil Engineering, Istanbul Technical University, Sariyer, 34469 Istanbul, TurkeyGeomatics Engineering Department, Faculty of Civil Engineering, Istanbul Technical University, Sariyer, 34469 Istanbul, TurkeyThe data generated by social media such as Twitter are classified as big data and the usability of those data can provide a wide range of resources to various study areas including disaster management, tourism, political science, and health. However, apart from the acquisition of the data, the reliability and accuracy when it comes to using it concern scientists in terms of whether or not the use of social media data (SMD) can lead to incorrect and unreliable inferences. There have been many studies on the analyses of SMD in order to investigate their reliability, accuracy, or credibility, but that have not dealt with the filtering techniques applied to with the data before creating the results or after their acquisition. This study provides a methodology for detecting the accuracy and reliability of the filtering techniques for SMD and then a spatial similarity index that analyzes spatial intersections, proximity, and size, and compares them. Finally, we offer a comparison that shows the best combination of filtering techniques and similarity indices to create event maps of SMD by using the Getis-Ord Gi* technique. The steps of this study can be summarized as follows: an investigation of domain-based text filtering techniques for dealing with sentiment lexicons, machine learning-based sentiment analyses on reliability, and developing intermediate codes specific to domain-based studies; then, by using various similarity indices, the determination of the spatial reliability and accuracy of maps of the filtered social media data. The study offers the best combination of filtering, mapping, and spatial accuracy investigation methods for social media data, especially in the case of emergencies, where urgent spatial information is required. As a result, a new similarity index based on the spatial intersection, spatial size, and proximity relationships is introduced to determine the spatial accuracy of the fine-filtered SMD. The motivation for this research is to develop the ability to create an incidence map shortly after a disaster event such as a bombing. However, the proposed methodology can also be used for various domains such as concerts, elections, natural disasters, marketing, etc.https://www.mdpi.com/2220-9964/9/4/245volunteered geographic informationspatial assessmentspatial similarity indexsentiment analysis |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Ayse Giz Gulnerman Himmet Karaman |
spellingShingle |
Ayse Giz Gulnerman Himmet Karaman Spatial Reliability Assessment of Social Media Mining Techniques with Regard to Disaster Domain-Based Filtering ISPRS International Journal of Geo-Information volunteered geographic information spatial assessment spatial similarity index sentiment analysis |
author_facet |
Ayse Giz Gulnerman Himmet Karaman |
author_sort |
Ayse Giz Gulnerman |
title |
Spatial Reliability Assessment of Social Media Mining Techniques with Regard to Disaster Domain-Based Filtering |
title_short |
Spatial Reliability Assessment of Social Media Mining Techniques with Regard to Disaster Domain-Based Filtering |
title_full |
Spatial Reliability Assessment of Social Media Mining Techniques with Regard to Disaster Domain-Based Filtering |
title_fullStr |
Spatial Reliability Assessment of Social Media Mining Techniques with Regard to Disaster Domain-Based Filtering |
title_full_unstemmed |
Spatial Reliability Assessment of Social Media Mining Techniques with Regard to Disaster Domain-Based Filtering |
title_sort |
spatial reliability assessment of social media mining techniques with regard to disaster domain-based filtering |
publisher |
MDPI AG |
series |
ISPRS International Journal of Geo-Information |
issn |
2220-9964 |
publishDate |
2020-04-01 |
description |
The data generated by social media such as Twitter are classified as big data and the usability of those data can provide a wide range of resources to various study areas including disaster management, tourism, political science, and health. However, apart from the acquisition of the data, the reliability and accuracy when it comes to using it concern scientists in terms of whether or not the use of social media data (SMD) can lead to incorrect and unreliable inferences. There have been many studies on the analyses of SMD in order to investigate their reliability, accuracy, or credibility, but that have not dealt with the filtering techniques applied to with the data before creating the results or after their acquisition. This study provides a methodology for detecting the accuracy and reliability of the filtering techniques for SMD and then a spatial similarity index that analyzes spatial intersections, proximity, and size, and compares them. Finally, we offer a comparison that shows the best combination of filtering techniques and similarity indices to create event maps of SMD by using the Getis-Ord Gi* technique. The steps of this study can be summarized as follows: an investigation of domain-based text filtering techniques for dealing with sentiment lexicons, machine learning-based sentiment analyses on reliability, and developing intermediate codes specific to domain-based studies; then, by using various similarity indices, the determination of the spatial reliability and accuracy of maps of the filtered social media data. The study offers the best combination of filtering, mapping, and spatial accuracy investigation methods for social media data, especially in the case of emergencies, where urgent spatial information is required. As a result, a new similarity index based on the spatial intersection, spatial size, and proximity relationships is introduced to determine the spatial accuracy of the fine-filtered SMD. The motivation for this research is to develop the ability to create an incidence map shortly after a disaster event such as a bombing. However, the proposed methodology can also be used for various domains such as concerts, elections, natural disasters, marketing, etc. |
topic |
volunteered geographic information spatial assessment spatial similarity index sentiment analysis |
url |
https://www.mdpi.com/2220-9964/9/4/245 |
work_keys_str_mv |
AT aysegizgulnerman spatialreliabilityassessmentofsocialmediaminingtechniqueswithregardtodisasterdomainbasedfiltering AT himmetkaraman spatialreliabilityassessmentofsocialmediaminingtechniqueswithregardtodisasterdomainbasedfiltering |
_version_ |
1724865291795562496 |