The Estimation of Road Passenger Travel Time by Decision Tree

碩士 === 中原大學 === 資訊工程研究所 === 101 === The travel times of public buses are variant due to different traffic situations, as a result, the schedules listed on the bus stop are seldom accurate and can not fulfill the need of the passengers. In this research, we build a travel time estimation model based...

Full description

Bibliographic Details
Main Authors: Chih-Wei Sung, 宋志偉
Other Authors: Jia-Sheng Heh
Format: Others
Language:zh-TW
Published: 2013
Online Access:http://ndltd.ncl.edu.tw/handle/37469107567616003301
id ndltd-TW-101CYCU5392011
record_format oai_dc
spelling ndltd-TW-101CYCU53920112015-10-13T22:40:29Z http://ndltd.ncl.edu.tw/handle/37469107567616003301 The Estimation of Road Passenger Travel Time by Decision Tree 利用資料探勘方法進行公路客運旅行時間之推估 Chih-Wei Sung 宋志偉 碩士 中原大學 資訊工程研究所 101 The travel times of public buses are variant due to different traffic situations, as a result, the schedules listed on the bus stop are seldom accurate and can not fulfill the need of the passengers. In this research, we build a travel time estimation model based on hours, car types, driving behaviors, distances between bus stops, to improve the accuracy of travel time estimation. The data used in this research are collected by GPS probing buses which send their traveling records back to the server. The data are collected from March to April, 2012 in the area of Hsin-Chu County, and the route is from Zhu-Dong to Hsin-Chu. The total number of records is 383,473 with 192,757 records in March and 190,716 records in April. We used the records in March as the training data set and clustered the travel time and waiting time into several clusters. We used the clustering results, together with car types, drivers, weekdays, hours, arrival time, depart time, to build decision tree models, according to four different cluster numbers: 4 clusters (fast, medium fast, medium slow, slow), 3 clusters (fast, medium, slow), 2 clusters (fast, slow), and 1 cluster (medium, i.e. no clustering is performed). We used the records in April as the testing data set to test the models, and applied the standard deviation of percentage error method to evaluate the accuracies of the models. By applying the above mentioned methods, the SDPE of the 4 models of each stops and each hours are calculated and we choose the best models and calculate their percentage. The experiment results indicate that, after comparing the percentage of the estimations which is under 20% SDPE of each models, the percentages of best models of 1-cluster to 4-cluster are 23.97%, 43.83%, 16.99%, 15.21% respectively. It shows that 2-cluster model is the best one, mostly. In conclusion, we choose these best models as the recommended travel time estimation method. Jia-Sheng Heh 賀嘉生 2013 學位論文 ; thesis 68 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 中原大學 === 資訊工程研究所 === 101 === The travel times of public buses are variant due to different traffic situations, as a result, the schedules listed on the bus stop are seldom accurate and can not fulfill the need of the passengers. In this research, we build a travel time estimation model based on hours, car types, driving behaviors, distances between bus stops, to improve the accuracy of travel time estimation. The data used in this research are collected by GPS probing buses which send their traveling records back to the server. The data are collected from March to April, 2012 in the area of Hsin-Chu County, and the route is from Zhu-Dong to Hsin-Chu. The total number of records is 383,473 with 192,757 records in March and 190,716 records in April. We used the records in March as the training data set and clustered the travel time and waiting time into several clusters. We used the clustering results, together with car types, drivers, weekdays, hours, arrival time, depart time, to build decision tree models, according to four different cluster numbers: 4 clusters (fast, medium fast, medium slow, slow), 3 clusters (fast, medium, slow), 2 clusters (fast, slow), and 1 cluster (medium, i.e. no clustering is performed). We used the records in April as the testing data set to test the models, and applied the standard deviation of percentage error method to evaluate the accuracies of the models. By applying the above mentioned methods, the SDPE of the 4 models of each stops and each hours are calculated and we choose the best models and calculate their percentage. The experiment results indicate that, after comparing the percentage of the estimations which is under 20% SDPE of each models, the percentages of best models of 1-cluster to 4-cluster are 23.97%, 43.83%, 16.99%, 15.21% respectively. It shows that 2-cluster model is the best one, mostly. In conclusion, we choose these best models as the recommended travel time estimation method.
author2 Jia-Sheng Heh
author_facet Jia-Sheng Heh
Chih-Wei Sung
宋志偉
author Chih-Wei Sung
宋志偉
spellingShingle Chih-Wei Sung
宋志偉
The Estimation of Road Passenger Travel Time by Decision Tree
author_sort Chih-Wei Sung
title The Estimation of Road Passenger Travel Time by Decision Tree
title_short The Estimation of Road Passenger Travel Time by Decision Tree
title_full The Estimation of Road Passenger Travel Time by Decision Tree
title_fullStr The Estimation of Road Passenger Travel Time by Decision Tree
title_full_unstemmed The Estimation of Road Passenger Travel Time by Decision Tree
title_sort estimation of road passenger travel time by decision tree
publishDate 2013
url http://ndltd.ncl.edu.tw/handle/37469107567616003301
work_keys_str_mv AT chihweisung theestimationofroadpassengertraveltimebydecisiontree
AT sòngzhìwěi theestimationofroadpassengertraveltimebydecisiontree
AT chihweisung lìyòngzīliàotànkānfāngfǎjìnxínggōnglùkèyùnlǚxíngshíjiānzhītuīgū
AT sòngzhìwěi lìyòngzīliàotànkānfāngfǎjìnxínggōnglùkèyùnlǚxíngshíjiānzhītuīgū
AT chihweisung estimationofroadpassengertraveltimebydecisiontree
AT sòngzhìwěi estimationofroadpassengertraveltimebydecisiontree
_version_ 1718078550584590336