Horizontally scalable probabilistic generalized suffix tree (PGST) based route prediction using map data and GPS traces
Abstract Route prediction is an essential requirement for many intelligent transport systems (ITS) services like VANETS, traffic congestion estimation, resource prediction in grid computing etc. This work focuses on building an end-to-end horizontally scalable route prediction application based on s...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
SpringerOpen
2017-07-01
|
Series: | Journal of Big Data |
Subjects: | |
Online Access: | http://link.springer.com/article/10.1186/s40537-017-0085-4 |
id |
doaj-cf346e4c13864a6a956722eaa4ca7843 |
---|---|
record_format |
Article |
spelling |
doaj-cf346e4c13864a6a956722eaa4ca78432020-11-25T00:26:08ZengSpringerOpenJournal of Big Data2196-11152017-07-014112310.1186/s40537-017-0085-4Horizontally scalable probabilistic generalized suffix tree (PGST) based route prediction using map data and GPS tracesVishnu Shankar Tiwari0Arti Arya1Department of MCA, PES Institute of Technology, Bangalore South CampusDepartment of MCA, PES Institute of Technology, Bangalore South CampusAbstract Route prediction is an essential requirement for many intelligent transport systems (ITS) services like VANETS, traffic congestion estimation, resource prediction in grid computing etc. This work focuses on building an end-to-end horizontally scalable route prediction application based on statistical modeling of user travel data. Probabilistic suffix tree (PST) is one of widely used sequence indexing technique which serves a model for prediction. The probabilistic generalized suffix tree (PGST) is a variant of PST and is essentially a suffix tree built from a huge number of smaller sequences. We construct generalized suffix tree model from a large number of trips completed by the users. User trip raw GPS traces is mapped to the digitized road network by parallelizing map matching technique leveraging map reduce framework. PGST construction from the huge volume of data by processing sequentially is a bottleneck in the practical realization. Most of the existing works focused on time-space tradeoffs on a single machine. Proposed technique solves this problem by a two-step process which is intuitive to execute in the map-reduce framework. In the first step, computes all the suffixes along with their frequency of occurrences and in the second step, builds probabilistic generalized suffix tree. The probabilistic aspect of the tree is also taken care so that it can be used as a model for prediction application. Dataset used are road network spatial data and GPS traces of users. Experiments carried out on real datasets available in public domain.http://link.springer.com/article/10.1186/s40537-017-0085-4Route predictionSuffix treeBig DataMap reduceHDFS |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Vishnu Shankar Tiwari Arti Arya |
spellingShingle |
Vishnu Shankar Tiwari Arti Arya Horizontally scalable probabilistic generalized suffix tree (PGST) based route prediction using map data and GPS traces Journal of Big Data Route prediction Suffix tree Big Data Map reduce HDFS |
author_facet |
Vishnu Shankar Tiwari Arti Arya |
author_sort |
Vishnu Shankar Tiwari |
title |
Horizontally scalable probabilistic generalized suffix tree (PGST) based route prediction using map data and GPS traces |
title_short |
Horizontally scalable probabilistic generalized suffix tree (PGST) based route prediction using map data and GPS traces |
title_full |
Horizontally scalable probabilistic generalized suffix tree (PGST) based route prediction using map data and GPS traces |
title_fullStr |
Horizontally scalable probabilistic generalized suffix tree (PGST) based route prediction using map data and GPS traces |
title_full_unstemmed |
Horizontally scalable probabilistic generalized suffix tree (PGST) based route prediction using map data and GPS traces |
title_sort |
horizontally scalable probabilistic generalized suffix tree (pgst) based route prediction using map data and gps traces |
publisher |
SpringerOpen |
series |
Journal of Big Data |
issn |
2196-1115 |
publishDate |
2017-07-01 |
description |
Abstract Route prediction is an essential requirement for many intelligent transport systems (ITS) services like VANETS, traffic congestion estimation, resource prediction in grid computing etc. This work focuses on building an end-to-end horizontally scalable route prediction application based on statistical modeling of user travel data. Probabilistic suffix tree (PST) is one of widely used sequence indexing technique which serves a model for prediction. The probabilistic generalized suffix tree (PGST) is a variant of PST and is essentially a suffix tree built from a huge number of smaller sequences. We construct generalized suffix tree model from a large number of trips completed by the users. User trip raw GPS traces is mapped to the digitized road network by parallelizing map matching technique leveraging map reduce framework. PGST construction from the huge volume of data by processing sequentially is a bottleneck in the practical realization. Most of the existing works focused on time-space tradeoffs on a single machine. Proposed technique solves this problem by a two-step process which is intuitive to execute in the map-reduce framework. In the first step, computes all the suffixes along with their frequency of occurrences and in the second step, builds probabilistic generalized suffix tree. The probabilistic aspect of the tree is also taken care so that it can be used as a model for prediction application. Dataset used are road network spatial data and GPS traces of users. Experiments carried out on real datasets available in public domain. |
topic |
Route prediction Suffix tree Big Data Map reduce HDFS |
url |
http://link.springer.com/article/10.1186/s40537-017-0085-4 |
work_keys_str_mv |
AT vishnushankartiwari horizontallyscalableprobabilisticgeneralizedsuffixtreepgstbasedroutepredictionusingmapdataandgpstraces AT artiarya horizontallyscalableprobabilisticgeneralizedsuffixtreepgstbasedroutepredictionusingmapdataandgpstraces |
_version_ |
1725345735583465472 |