A Framework for Spatial Database Explanations

abstract: In the last few years, there has been a tremendous increase in the use of big data. Most of this data is hard to understand because of its size and dimensions. The importance of this problem can be emphasized by the fact that Big Data Research and Development Initiative was announced by th...

Full description

Bibliographic Details
Other Authors: Tahir, Anique (Author)
Format: Dissertation
Language:English
Published: 2018
Subjects:
Online Access:http://hdl.handle.net/2286/R.I.50465
id ndltd-asu.edu-item-50465
record_format oai_dc
spelling ndltd-asu.edu-item-504652018-10-02T03:01:04Z A Framework for Spatial Database Explanations abstract: In the last few years, there has been a tremendous increase in the use of big data. Most of this data is hard to understand because of its size and dimensions. The importance of this problem can be emphasized by the fact that Big Data Research and Development Initiative was announced by the United States administration in 2012 to address problems faced by the government. Various states and cities in the US gather spatial data about incidents like police calls for service. When we query large amounts of data, it may lead to a lot of questions. For example, when we look at arithmetic relationships between queries in heterogeneous data, there are a lot of differences. How can we explain what factors account for these differences? If we define the observation as an arithmetic relationship between queries, this kind of problem can be solved by aggravation or intervention. Aggravation views the value of our observation for different set of tuples while intervention looks at the value of the observation after removing sets of tuples. We call the predicates which represent these tuples, explanations. Observations by themselves have limited importance. For example, if we observe a large number of taxi trips in a specific area, we might ask the question: Why are there so many trips here? Explanations attempt to answer these kinds of questions. While aggravation and intervention are designed for non spatial data, we propose a new approach for explaining spatially heterogeneous data. Our approach expands on aggravation and intervention while using spatial partitioning/clustering to improve explanations for spatial data. Our proposed approach was evaluated against a real-world taxi dataset as well as a synthetic disease outbreak datasets. The approach was found to outperform aggravation in precision and recall while outperforming intervention in precision. Dissertation/Thesis Tahir, Anique (Author) Elsayed, Mohamed (Advisor) Hsiao, Ihan (Committee member) Maciejewski, Ross (Committee member) Arizona State University (Publisher) Computer science eng 87 pages Masters Thesis Computer Science 2018 Masters Thesis http://hdl.handle.net/2286/R.I.50465 http://rightsstatements.org/vocab/InC/1.0/ 2018
collection NDLTD
language English
format Dissertation
sources NDLTD
topic Computer science
spellingShingle Computer science
A Framework for Spatial Database Explanations
description abstract: In the last few years, there has been a tremendous increase in the use of big data. Most of this data is hard to understand because of its size and dimensions. The importance of this problem can be emphasized by the fact that Big Data Research and Development Initiative was announced by the United States administration in 2012 to address problems faced by the government. Various states and cities in the US gather spatial data about incidents like police calls for service. When we query large amounts of data, it may lead to a lot of questions. For example, when we look at arithmetic relationships between queries in heterogeneous data, there are a lot of differences. How can we explain what factors account for these differences? If we define the observation as an arithmetic relationship between queries, this kind of problem can be solved by aggravation or intervention. Aggravation views the value of our observation for different set of tuples while intervention looks at the value of the observation after removing sets of tuples. We call the predicates which represent these tuples, explanations. Observations by themselves have limited importance. For example, if we observe a large number of taxi trips in a specific area, we might ask the question: Why are there so many trips here? Explanations attempt to answer these kinds of questions. While aggravation and intervention are designed for non spatial data, we propose a new approach for explaining spatially heterogeneous data. Our approach expands on aggravation and intervention while using spatial partitioning/clustering to improve explanations for spatial data. Our proposed approach was evaluated against a real-world taxi dataset as well as a synthetic disease outbreak datasets. The approach was found to outperform aggravation in precision and recall while outperforming intervention in precision. === Dissertation/Thesis === Masters Thesis Computer Science 2018
author2 Tahir, Anique (Author)
author_facet Tahir, Anique (Author)
title A Framework for Spatial Database Explanations
title_short A Framework for Spatial Database Explanations
title_full A Framework for Spatial Database Explanations
title_fullStr A Framework for Spatial Database Explanations
title_full_unstemmed A Framework for Spatial Database Explanations
title_sort framework for spatial database explanations
publishDate 2018
url http://hdl.handle.net/2286/R.I.50465
_version_ 1718756996968087552