The Estimation of Road Passenger Travel Time by Decision Tree
碩士 === 中原大學 === 資訊工程研究所 === 101 === The travel times of public buses are variant due to different traffic situations, as a result, the schedules listed on the bus stop are seldom accurate and can not fulfill the need of the passengers. In this research, we build a travel time estimation model based...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2013
|
Online Access: | http://ndltd.ncl.edu.tw/handle/37469107567616003301 |
id |
ndltd-TW-101CYCU5392011 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-101CYCU53920112015-10-13T22:40:29Z http://ndltd.ncl.edu.tw/handle/37469107567616003301 The Estimation of Road Passenger Travel Time by Decision Tree 利用資料探勘方法進行公路客運旅行時間之推估 Chih-Wei Sung 宋志偉 碩士 中原大學 資訊工程研究所 101 The travel times of public buses are variant due to different traffic situations, as a result, the schedules listed on the bus stop are seldom accurate and can not fulfill the need of the passengers. In this research, we build a travel time estimation model based on hours, car types, driving behaviors, distances between bus stops, to improve the accuracy of travel time estimation. The data used in this research are collected by GPS probing buses which send their traveling records back to the server. The data are collected from March to April, 2012 in the area of Hsin-Chu County, and the route is from Zhu-Dong to Hsin-Chu. The total number of records is 383,473 with 192,757 records in March and 190,716 records in April. We used the records in March as the training data set and clustered the travel time and waiting time into several clusters. We used the clustering results, together with car types, drivers, weekdays, hours, arrival time, depart time, to build decision tree models, according to four different cluster numbers: 4 clusters (fast, medium fast, medium slow, slow), 3 clusters (fast, medium, slow), 2 clusters (fast, slow), and 1 cluster (medium, i.e. no clustering is performed). We used the records in April as the testing data set to test the models, and applied the standard deviation of percentage error method to evaluate the accuracies of the models. By applying the above mentioned methods, the SDPE of the 4 models of each stops and each hours are calculated and we choose the best models and calculate their percentage. The experiment results indicate that, after comparing the percentage of the estimations which is under 20% SDPE of each models, the percentages of best models of 1-cluster to 4-cluster are 23.97%, 43.83%, 16.99%, 15.21% respectively. It shows that 2-cluster model is the best one, mostly. In conclusion, we choose these best models as the recommended travel time estimation method. Jia-Sheng Heh 賀嘉生 2013 學位論文 ; thesis 68 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 中原大學 === 資訊工程研究所 === 101 === The travel times of public buses are variant due to different traffic situations, as a result, the schedules listed on the bus stop are seldom accurate and can not fulfill the need of the passengers. In this research, we build a travel time estimation model based on hours, car types, driving behaviors, distances between bus stops, to improve the accuracy of travel time estimation.
The data used in this research are collected by GPS probing buses which send their traveling records back to the server. The data are collected from March to April, 2012 in the area of Hsin-Chu County, and the route is from Zhu-Dong to Hsin-Chu. The total number of records is 383,473 with 192,757 records in March and 190,716 records in April.
We used the records in March as the training data set and clustered the travel time and waiting time into several clusters. We used the clustering results, together with car types, drivers, weekdays, hours, arrival time, depart time, to build decision tree models, according to four different cluster numbers: 4 clusters (fast, medium fast, medium slow, slow), 3 clusters (fast, medium, slow), 2 clusters (fast, slow), and 1 cluster (medium, i.e. no clustering is performed). We used the records in April as the testing data set to test the models, and applied the standard deviation of percentage error method to evaluate the accuracies of the models.
By applying the above mentioned methods, the SDPE of the 4 models of each stops and each hours are calculated and we choose the best models and calculate their percentage. The experiment results indicate that, after comparing the percentage of the estimations which is under 20% SDPE of each models, the percentages of best models of 1-cluster to 4-cluster are 23.97%, 43.83%, 16.99%, 15.21% respectively. It shows that 2-cluster model is the best one, mostly. In conclusion, we choose these best models as the recommended travel time estimation method.
|
author2 |
Jia-Sheng Heh |
author_facet |
Jia-Sheng Heh Chih-Wei Sung 宋志偉 |
author |
Chih-Wei Sung 宋志偉 |
spellingShingle |
Chih-Wei Sung 宋志偉 The Estimation of Road Passenger Travel Time by Decision Tree |
author_sort |
Chih-Wei Sung |
title |
The Estimation of Road Passenger Travel Time by Decision Tree |
title_short |
The Estimation of Road Passenger Travel Time by Decision Tree |
title_full |
The Estimation of Road Passenger Travel Time by Decision Tree |
title_fullStr |
The Estimation of Road Passenger Travel Time by Decision Tree |
title_full_unstemmed |
The Estimation of Road Passenger Travel Time by Decision Tree |
title_sort |
estimation of road passenger travel time by decision tree |
publishDate |
2013 |
url |
http://ndltd.ncl.edu.tw/handle/37469107567616003301 |
work_keys_str_mv |
AT chihweisung theestimationofroadpassengertraveltimebydecisiontree AT sòngzhìwěi theestimationofroadpassengertraveltimebydecisiontree AT chihweisung lìyòngzīliàotànkānfāngfǎjìnxínggōnglùkèyùnlǚxíngshíjiānzhītuīgū AT sòngzhìwěi lìyòngzīliàotànkānfāngfǎjìnxínggōnglùkèyùnlǚxíngshíjiānzhītuīgū AT chihweisung estimationofroadpassengertraveltimebydecisiontree AT sòngzhìwěi estimationofroadpassengertraveltimebydecisiontree |
_version_ |
1718078550584590336 |