Evaluation of Tree-Based Machine Learning Algorithms for Accident Risk Mapping Caused by Driver Lack of Alertness at a National Scale
Drivers’ lack of alertness is one of the main reasons for fatal road traffic accidents (RTA) in Iran. Accident-risk mapping with machine learning algorithms in the geographic information system (GIS) platform is a suitable approach for investigating the occurrence risk of these accidents by analyzin...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2021-09-01
|
Series: | Sustainability |
Subjects: | |
Online Access: | https://www.mdpi.com/2071-1050/13/18/10239 |
id |
doaj-5af6e2425d22440ea8902bb040120203 |
---|---|
record_format |
Article |
spelling |
doaj-5af6e2425d22440ea8902bb0401202032021-09-26T01:28:58ZengMDPI AGSustainability2071-10502021-09-0113102391023910.3390/su131810239Evaluation of Tree-Based Machine Learning Algorithms for Accident Risk Mapping Caused by Driver Lack of Alertness at a National ScaleFarbod Farhangi0Abolghasem Sadeghi-Niaraki1Seyed Vahid Razavi-Termeh2Soo-Mi Choi3Geoinformation Tech. Center of Excellence, Faculty of Geodesy and Geomatics Engineering, K. N. Toosi University of Technology, Tehran 19697, IranGeoinformation Tech. Center of Excellence, Faculty of Geodesy and Geomatics Engineering, K. N. Toosi University of Technology, Tehran 19697, IranGeoinformation Tech. Center of Excellence, Faculty of Geodesy and Geomatics Engineering, K. N. Toosi University of Technology, Tehran 19697, IranDepartment of Computer Science and Engineering, and Convergence Engineering for Intelligent Drone, Sejong University, Seoul 143-747, KoreaDrivers’ lack of alertness is one of the main reasons for fatal road traffic accidents (RTA) in Iran. Accident-risk mapping with machine learning algorithms in the geographic information system (GIS) platform is a suitable approach for investigating the occurrence risk of these accidents by analyzing the role of effective factors. This approach helps to identify the high-risk areas even in unnoticed and remote places and prioritizes accident-prone locations. This paper aimed to evaluate tuned machine learning algorithms of bagged decision trees (BDTs), extra trees (ETs), and random forest (RF) in accident-risk mapping caused by drivers’ lack of alertness (due to drowsiness, fatigue, and reduced attention) at a national scale of Iran roads. Accident points and eight effective criteria, namely distance to the city, distance to the gas station, land use/cover, road structure, road type, time of day, traffic direction, and slope, were applied in modeling, using GIS. The time factor was utilized to represent drivers’ varied alertness levels. The accident dataset included 4399 RTA records from March 2017 to March 2019. The performance of all models was cross-validated with five-folds and tree metrics of mean absolute error, mean squared error, and area under the curve of the receiver operating characteristic (ROC-AUC). The results of cross-validation showed that BDT and RF performance with an AUC of 0.846 were slightly more accurate than ET with an AUC of 0.827. The importance of modeling features was assessed by using the Gini index, and the results revealed that the road type, distance to the city, distance to the gas station, slope, and time of day were the most important, while land use/cover, traffic direction, and road structure were the least important. The proposed approach can be improved by applying the traffic volume in modeling and helps decision-makers take necessary actions by identifying important factors on road safety.https://www.mdpi.com/2071-1050/13/18/10239driver alertnessgeographic information system (GIS)machine learning algorithmsspatial modeling |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Farbod Farhangi Abolghasem Sadeghi-Niaraki Seyed Vahid Razavi-Termeh Soo-Mi Choi |
spellingShingle |
Farbod Farhangi Abolghasem Sadeghi-Niaraki Seyed Vahid Razavi-Termeh Soo-Mi Choi Evaluation of Tree-Based Machine Learning Algorithms for Accident Risk Mapping Caused by Driver Lack of Alertness at a National Scale Sustainability driver alertness geographic information system (GIS) machine learning algorithms spatial modeling |
author_facet |
Farbod Farhangi Abolghasem Sadeghi-Niaraki Seyed Vahid Razavi-Termeh Soo-Mi Choi |
author_sort |
Farbod Farhangi |
title |
Evaluation of Tree-Based Machine Learning Algorithms for Accident Risk Mapping Caused by Driver Lack of Alertness at a National Scale |
title_short |
Evaluation of Tree-Based Machine Learning Algorithms for Accident Risk Mapping Caused by Driver Lack of Alertness at a National Scale |
title_full |
Evaluation of Tree-Based Machine Learning Algorithms for Accident Risk Mapping Caused by Driver Lack of Alertness at a National Scale |
title_fullStr |
Evaluation of Tree-Based Machine Learning Algorithms for Accident Risk Mapping Caused by Driver Lack of Alertness at a National Scale |
title_full_unstemmed |
Evaluation of Tree-Based Machine Learning Algorithms for Accident Risk Mapping Caused by Driver Lack of Alertness at a National Scale |
title_sort |
evaluation of tree-based machine learning algorithms for accident risk mapping caused by driver lack of alertness at a national scale |
publisher |
MDPI AG |
series |
Sustainability |
issn |
2071-1050 |
publishDate |
2021-09-01 |
description |
Drivers’ lack of alertness is one of the main reasons for fatal road traffic accidents (RTA) in Iran. Accident-risk mapping with machine learning algorithms in the geographic information system (GIS) platform is a suitable approach for investigating the occurrence risk of these accidents by analyzing the role of effective factors. This approach helps to identify the high-risk areas even in unnoticed and remote places and prioritizes accident-prone locations. This paper aimed to evaluate tuned machine learning algorithms of bagged decision trees (BDTs), extra trees (ETs), and random forest (RF) in accident-risk mapping caused by drivers’ lack of alertness (due to drowsiness, fatigue, and reduced attention) at a national scale of Iran roads. Accident points and eight effective criteria, namely distance to the city, distance to the gas station, land use/cover, road structure, road type, time of day, traffic direction, and slope, were applied in modeling, using GIS. The time factor was utilized to represent drivers’ varied alertness levels. The accident dataset included 4399 RTA records from March 2017 to March 2019. The performance of all models was cross-validated with five-folds and tree metrics of mean absolute error, mean squared error, and area under the curve of the receiver operating characteristic (ROC-AUC). The results of cross-validation showed that BDT and RF performance with an AUC of 0.846 were slightly more accurate than ET with an AUC of 0.827. The importance of modeling features was assessed by using the Gini index, and the results revealed that the road type, distance to the city, distance to the gas station, slope, and time of day were the most important, while land use/cover, traffic direction, and road structure were the least important. The proposed approach can be improved by applying the traffic volume in modeling and helps decision-makers take necessary actions by identifying important factors on road safety. |
topic |
driver alertness geographic information system (GIS) machine learning algorithms spatial modeling |
url |
https://www.mdpi.com/2071-1050/13/18/10239 |
work_keys_str_mv |
AT farbodfarhangi evaluationoftreebasedmachinelearningalgorithmsforaccidentriskmappingcausedbydriverlackofalertnessatanationalscale AT abolghasemsadeghiniaraki evaluationoftreebasedmachinelearningalgorithmsforaccidentriskmappingcausedbydriverlackofalertnessatanationalscale AT seyedvahidrazavitermeh evaluationoftreebasedmachinelearningalgorithmsforaccidentriskmappingcausedbydriverlackofalertnessatanationalscale AT soomichoi evaluationoftreebasedmachinelearningalgorithmsforaccidentriskmappingcausedbydriverlackofalertnessatanationalscale |
_version_ |
1716868910486126592 |