A DISTRIBUTED POLYGON RETRIEVAL ALGORITHM USING MAPREDUCE
The burst of large-scale spatial terrain data due to the proliferation of data acquisition devices like 3D laser scanners poses challenges to spatial data analysis and computation. Among many spatial analyses and computations, polygon retrieval is a fundamental operation which is often performed und...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Copernicus Publications
2015-07-01
|
Series: | ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences |
Online Access: | http://www.isprs-ann-photogramm-remote-sens-spatial-inf-sci.net/II-4-W2/51/2015/isprsannals-II-4-W2-51-2015.pdf |
id |
doaj-fd7068a8eb9a41949039808957cbd212 |
---|---|
record_format |
Article |
spelling |
doaj-fd7068a8eb9a41949039808957cbd2122020-11-25T00:42:44ZengCopernicus PublicationsISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences2194-90422194-90502015-07-01II-4/W2515310.5194/isprsannals-II-4-W2-51-2015A DISTRIBUTED POLYGON RETRIEVAL ALGORITHM USING MAPREDUCEQ. Guo0B. Palanisamy1H. A. Karimi2Geoinformatics Laboratory, School of Information Sciences, University of Pittsburgh, USAGeoinformatics Laboratory, School of Information Sciences, University of Pittsburgh, USAGeoinformatics Laboratory, School of Information Sciences, University of Pittsburgh, USAThe burst of large-scale spatial terrain data due to the proliferation of data acquisition devices like 3D laser scanners poses challenges to spatial data analysis and computation. Among many spatial analyses and computations, polygon retrieval is a fundamental operation which is often performed under real-time constraints. However, existing sequential algorithms fail to meet this demand for larger sizes of terrain data. Motivated by the MapReduce programming model, a well-adopted large-scale parallel data processing technique, we present a MapReduce-based polygon retrieval algorithm designed with the objective of reducing the IO and CPU loads of spatial data processing. By indexing the data based on a quad-tree approach, a significant amount of unneeded data is filtered in the filtering stage and it reduces the IO overhead. The indexed data also facilitates querying the relationship between the terrain data and query area in shorter time. The results of the experiments performed in our Hadoop cluster demonstrate that our algorithm performs significantly better than the existing distributed algorithms.http://www.isprs-ann-photogramm-remote-sens-spatial-inf-sci.net/II-4-W2/51/2015/isprsannals-II-4-W2-51-2015.pdf |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Q. Guo B. Palanisamy H. A. Karimi |
spellingShingle |
Q. Guo B. Palanisamy H. A. Karimi A DISTRIBUTED POLYGON RETRIEVAL ALGORITHM USING MAPREDUCE ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences |
author_facet |
Q. Guo B. Palanisamy H. A. Karimi |
author_sort |
Q. Guo |
title |
A DISTRIBUTED POLYGON RETRIEVAL ALGORITHM USING MAPREDUCE |
title_short |
A DISTRIBUTED POLYGON RETRIEVAL ALGORITHM USING MAPREDUCE |
title_full |
A DISTRIBUTED POLYGON RETRIEVAL ALGORITHM USING MAPREDUCE |
title_fullStr |
A DISTRIBUTED POLYGON RETRIEVAL ALGORITHM USING MAPREDUCE |
title_full_unstemmed |
A DISTRIBUTED POLYGON RETRIEVAL ALGORITHM USING MAPREDUCE |
title_sort |
distributed polygon retrieval algorithm using mapreduce |
publisher |
Copernicus Publications |
series |
ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences |
issn |
2194-9042 2194-9050 |
publishDate |
2015-07-01 |
description |
The burst of large-scale spatial terrain data due to the proliferation of data acquisition devices like 3D laser scanners poses challenges to spatial data analysis and computation. Among many spatial analyses and computations, polygon retrieval is a fundamental operation which is often performed under real-time constraints. However, existing sequential algorithms fail to meet this demand for larger sizes of terrain data. Motivated by the MapReduce programming model, a well-adopted large-scale parallel data processing technique, we present a MapReduce-based polygon retrieval algorithm designed with the objective of reducing the IO and CPU loads of spatial data processing. By indexing the data based on a quad-tree approach, a significant amount of unneeded data is filtered in the filtering stage and it reduces the IO overhead. The indexed data also facilitates querying the relationship between the terrain data and query area in shorter time. The results of the experiments performed in our Hadoop cluster demonstrate that our algorithm performs significantly better than the existing distributed algorithms. |
url |
http://www.isprs-ann-photogramm-remote-sens-spatial-inf-sci.net/II-4-W2/51/2015/isprsannals-II-4-W2-51-2015.pdf |
work_keys_str_mv |
AT qguo adistributedpolygonretrievalalgorithmusingmapreduce AT bpalanisamy adistributedpolygonretrievalalgorithmusingmapreduce AT hakarimi adistributedpolygonretrievalalgorithmusingmapreduce AT qguo distributedpolygonretrievalalgorithmusingmapreduce AT bpalanisamy distributedpolygonretrievalalgorithmusingmapreduce AT hakarimi distributedpolygonretrievalalgorithmusingmapreduce |
_version_ |
1725280566747594752 |