A DISTRIBUTED POLYGON RETRIEVAL ALGORITHM USING MAPREDUCE

The burst of large-scale spatial terrain data due to the proliferation of data acquisition devices like 3D laser scanners poses challenges to spatial data analysis and computation. Among many spatial analyses and computations, polygon retrieval is a fundamental operation which is often performed und...

Full description

Bibliographic Details
Main Authors: Q. Guo, B. Palanisamy, H. A. Karimi
Format: Article
Language:English
Published: Copernicus Publications 2015-07-01
Series:ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences
Online Access:http://www.isprs-ann-photogramm-remote-sens-spatial-inf-sci.net/II-4-W2/51/2015/isprsannals-II-4-W2-51-2015.pdf
id doaj-fd7068a8eb9a41949039808957cbd212
record_format Article
spelling doaj-fd7068a8eb9a41949039808957cbd2122020-11-25T00:42:44ZengCopernicus PublicationsISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences2194-90422194-90502015-07-01II-4/W2515310.5194/isprsannals-II-4-W2-51-2015A DISTRIBUTED POLYGON RETRIEVAL ALGORITHM USING MAPREDUCEQ. Guo0B. Palanisamy1H. A. Karimi2Geoinformatics Laboratory, School of Information Sciences, University of Pittsburgh, USAGeoinformatics Laboratory, School of Information Sciences, University of Pittsburgh, USAGeoinformatics Laboratory, School of Information Sciences, University of Pittsburgh, USAThe burst of large-scale spatial terrain data due to the proliferation of data acquisition devices like 3D laser scanners poses challenges to spatial data analysis and computation. Among many spatial analyses and computations, polygon retrieval is a fundamental operation which is often performed under real-time constraints. However, existing sequential algorithms fail to meet this demand for larger sizes of terrain data. Motivated by the MapReduce programming model, a well-adopted large-scale parallel data processing technique, we present a MapReduce-based polygon retrieval algorithm designed with the objective of reducing the IO and CPU loads of spatial data processing. By indexing the data based on a quad-tree approach, a significant amount of unneeded data is filtered in the filtering stage and it reduces the IO overhead. The indexed data also facilitates querying the relationship between the terrain data and query area in shorter time. The results of the experiments performed in our Hadoop cluster demonstrate that our algorithm performs significantly better than the existing distributed algorithms.http://www.isprs-ann-photogramm-remote-sens-spatial-inf-sci.net/II-4-W2/51/2015/isprsannals-II-4-W2-51-2015.pdf
collection DOAJ
language English
format Article
sources DOAJ
author Q. Guo
B. Palanisamy
H. A. Karimi
spellingShingle Q. Guo
B. Palanisamy
H. A. Karimi
A DISTRIBUTED POLYGON RETRIEVAL ALGORITHM USING MAPREDUCE
ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences
author_facet Q. Guo
B. Palanisamy
H. A. Karimi
author_sort Q. Guo
title A DISTRIBUTED POLYGON RETRIEVAL ALGORITHM USING MAPREDUCE
title_short A DISTRIBUTED POLYGON RETRIEVAL ALGORITHM USING MAPREDUCE
title_full A DISTRIBUTED POLYGON RETRIEVAL ALGORITHM USING MAPREDUCE
title_fullStr A DISTRIBUTED POLYGON RETRIEVAL ALGORITHM USING MAPREDUCE
title_full_unstemmed A DISTRIBUTED POLYGON RETRIEVAL ALGORITHM USING MAPREDUCE
title_sort distributed polygon retrieval algorithm using mapreduce
publisher Copernicus Publications
series ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences
issn 2194-9042
2194-9050
publishDate 2015-07-01
description The burst of large-scale spatial terrain data due to the proliferation of data acquisition devices like 3D laser scanners poses challenges to spatial data analysis and computation. Among many spatial analyses and computations, polygon retrieval is a fundamental operation which is often performed under real-time constraints. However, existing sequential algorithms fail to meet this demand for larger sizes of terrain data. Motivated by the MapReduce programming model, a well-adopted large-scale parallel data processing technique, we present a MapReduce-based polygon retrieval algorithm designed with the objective of reducing the IO and CPU loads of spatial data processing. By indexing the data based on a quad-tree approach, a significant amount of unneeded data is filtered in the filtering stage and it reduces the IO overhead. The indexed data also facilitates querying the relationship between the terrain data and query area in shorter time. The results of the experiments performed in our Hadoop cluster demonstrate that our algorithm performs significantly better than the existing distributed algorithms.
url http://www.isprs-ann-photogramm-remote-sens-spatial-inf-sci.net/II-4-W2/51/2015/isprsannals-II-4-W2-51-2015.pdf
work_keys_str_mv AT qguo adistributedpolygonretrievalalgorithmusingmapreduce
AT bpalanisamy adistributedpolygonretrievalalgorithmusingmapreduce
AT hakarimi adistributedpolygonretrievalalgorithmusingmapreduce
AT qguo distributedpolygonretrievalalgorithmusingmapreduce
AT bpalanisamy distributedpolygonretrievalalgorithmusingmapreduce
AT hakarimi distributedpolygonretrievalalgorithmusingmapreduce
_version_ 1725280566747594752