Horizontally scalable probabilistic generalized suffix tree (PGST) based route prediction using map data and GPS traces

Abstract Route prediction is an essential requirement for many intelligent transport systems (ITS) services like VANETS, traffic congestion estimation, resource prediction in grid computing etc. This work focuses on building an end-to-end horizontally scalable route prediction application based on s...

Full description

Bibliographic Details
Main Authors: Vishnu Shankar Tiwari, Arti Arya
Format: Article
Language:English
Published: SpringerOpen 2017-07-01
Series:Journal of Big Data
Subjects:
Online Access:http://link.springer.com/article/10.1186/s40537-017-0085-4
id doaj-cf346e4c13864a6a956722eaa4ca7843
record_format Article
spelling doaj-cf346e4c13864a6a956722eaa4ca78432020-11-25T00:26:08ZengSpringerOpenJournal of Big Data2196-11152017-07-014112310.1186/s40537-017-0085-4Horizontally scalable probabilistic generalized suffix tree (PGST) based route prediction using map data and GPS tracesVishnu Shankar Tiwari0Arti Arya1Department of MCA, PES Institute of Technology, Bangalore South CampusDepartment of MCA, PES Institute of Technology, Bangalore South CampusAbstract Route prediction is an essential requirement for many intelligent transport systems (ITS) services like VANETS, traffic congestion estimation, resource prediction in grid computing etc. This work focuses on building an end-to-end horizontally scalable route prediction application based on statistical modeling of user travel data. Probabilistic suffix tree (PST) is one of widely used sequence indexing technique which serves a model for prediction. The probabilistic generalized suffix tree (PGST) is a variant of PST and is essentially a suffix tree built from a huge number of smaller sequences. We construct generalized suffix tree model from a large number of trips completed by the users. User trip raw GPS traces is mapped to the digitized road network by parallelizing map matching technique leveraging map reduce framework. PGST construction from the huge volume of data by processing sequentially is a bottleneck in the practical realization. Most of the existing works focused on time-space tradeoffs on a single machine. Proposed technique solves this problem by a two-step process which is intuitive to execute in the map-reduce framework. In the first step, computes all the suffixes along with their frequency of occurrences and in the second step, builds probabilistic generalized suffix tree. The probabilistic aspect of the tree is also taken care so that it can be used as a model for prediction application. Dataset used are road network spatial data and GPS traces of users. Experiments carried out on real datasets available in public domain.http://link.springer.com/article/10.1186/s40537-017-0085-4Route predictionSuffix treeBig DataMap reduceHDFS
collection DOAJ
language English
format Article
sources DOAJ
author Vishnu Shankar Tiwari
Arti Arya
spellingShingle Vishnu Shankar Tiwari
Arti Arya
Horizontally scalable probabilistic generalized suffix tree (PGST) based route prediction using map data and GPS traces
Journal of Big Data
Route prediction
Suffix tree
Big Data
Map reduce
HDFS
author_facet Vishnu Shankar Tiwari
Arti Arya
author_sort Vishnu Shankar Tiwari
title Horizontally scalable probabilistic generalized suffix tree (PGST) based route prediction using map data and GPS traces
title_short Horizontally scalable probabilistic generalized suffix tree (PGST) based route prediction using map data and GPS traces
title_full Horizontally scalable probabilistic generalized suffix tree (PGST) based route prediction using map data and GPS traces
title_fullStr Horizontally scalable probabilistic generalized suffix tree (PGST) based route prediction using map data and GPS traces
title_full_unstemmed Horizontally scalable probabilistic generalized suffix tree (PGST) based route prediction using map data and GPS traces
title_sort horizontally scalable probabilistic generalized suffix tree (pgst) based route prediction using map data and gps traces
publisher SpringerOpen
series Journal of Big Data
issn 2196-1115
publishDate 2017-07-01
description Abstract Route prediction is an essential requirement for many intelligent transport systems (ITS) services like VANETS, traffic congestion estimation, resource prediction in grid computing etc. This work focuses on building an end-to-end horizontally scalable route prediction application based on statistical modeling of user travel data. Probabilistic suffix tree (PST) is one of widely used sequence indexing technique which serves a model for prediction. The probabilistic generalized suffix tree (PGST) is a variant of PST and is essentially a suffix tree built from a huge number of smaller sequences. We construct generalized suffix tree model from a large number of trips completed by the users. User trip raw GPS traces is mapped to the digitized road network by parallelizing map matching technique leveraging map reduce framework. PGST construction from the huge volume of data by processing sequentially is a bottleneck in the practical realization. Most of the existing works focused on time-space tradeoffs on a single machine. Proposed technique solves this problem by a two-step process which is intuitive to execute in the map-reduce framework. In the first step, computes all the suffixes along with their frequency of occurrences and in the second step, builds probabilistic generalized suffix tree. The probabilistic aspect of the tree is also taken care so that it can be used as a model for prediction application. Dataset used are road network spatial data and GPS traces of users. Experiments carried out on real datasets available in public domain.
topic Route prediction
Suffix tree
Big Data
Map reduce
HDFS
url http://link.springer.com/article/10.1186/s40537-017-0085-4
work_keys_str_mv AT vishnushankartiwari horizontallyscalableprobabilisticgeneralizedsuffixtreepgstbasedroutepredictionusingmapdataandgpstraces
AT artiarya horizontallyscalableprobabilisticgeneralizedsuffixtreepgstbasedroutepredictionusingmapdataandgpstraces
_version_ 1725345735583465472