Adopting Machine Learning and Spatial Analysis Techniques for Driver Risk Assessment: Insights from a Case Study

Traffic violations, usually caused by aggressive driving behavior, are often seen as a primary contributor to traffic crashes. Violations result from either unintentional or deliberate acts by drivers that endanger fellow drivers, pedestrians, and property. This study aims to in...

Bibliographic Details
Main Authors: Muhammad Zahid, Yangzhou Chen, Arshad Jamal, Khalaf A. Al-Ofi, Hassan M. Al-Ahmadi
Format: Article
Language: English
Published: MDPI AG 2020-07-01
Series: International Journal of Environmental Research and Public Health
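Subjects: aggressive driving; traffic violations; inverse distance weighted (IDW) interpolation; geographic information system (GIS); machine learning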
Online Access: https://www.mdpi.com/1660-4601/17/14/5193
id doaj-f1ac4724980d4800a4798ed6db8a720b
record_format Article
spelling doaj-f1ac4724980d4800a4798ed6db8a720b
2020-11-25T03:02:15Z
eng
MDPI AG
International Journal of Environmental Research and Public Health
1661-7827
1660-4601
2020-07-01
17
5193
5193
10.3390/ijerph17145193
Adopting Machine Learning and Spatial Analysis Techniques for Driver Risk Assessment: Insights from a Case Study
Muhammad Zahid (0)
Yangzhou Chen (1)
Arshad Jamal (2)
Khalaf A. Al-Ofi (3)
Hassan M. Al-Ahmadi (4)
(0) College of Metropolitan Transportation, Beijing University of Technology, Beijing 100124, China
(1) College of Artificial Intelligence and Automation, Beijing University of Technology, Beijing 100124, China
(2) Department of Civil and Environmental Engineering, King Fahd University of Petroleum & Minerals KFUPM BOX 5055, Dhahran 31261, Saudi Arabia
(3) Department of Civil and Environmental Engineering, King Fahd University of Petroleum & Minerals KFUPM BOX 5055, Dhahran 31261, Saudi Arabia
(4) Department of Civil and Environmental Engineering, King Fahd University of Petroleum & Minerals KFUPM BOX 5055, Dhahran 31261, Saudi Arabia
Traffic violations, usually caused by aggressive driving behavior, are often seen as a primary contributor to traffic crashes. Violations result from either unintentional or deliberate acts by drivers that endanger fellow drivers, pedestrians, and property. This study aims to investigate different traffic violations (overspeeding, wrong-way driving, illegal parking, non-compliance with traffic control devices, etc.) using spatial analysis and several machine learning methods. Georeferenced violation data along two expressways (S308 and S219) for the year 2016 were obtained from the traffic police department in the city of Luzhou, China. Detailed descriptive analysis of the data showed that wrong-way driving was the most common violation type observed. Inverse Distance Weighted (IDW) interpolation in the ArcMap Geographic Information System (GIS) was used to develop violation hotspot zones to guide the efficient use of limited resources during the treatment of high-risk sites. Lastly, a systematic Machine Learning (ML) framework comprising K Nearest Neighbors (KNN) models (using k = 3, 5, 7, 10, and 12), a support vector machine (SVM), and the CN2 Rule Inducer was used to classify and predict each violation type as a function of several explanatory variables. The predictive performance of the proposed ML models was examined using several evaluation metrics, such as Area Under the Curve (AUC), F-score, precision, recall, specificity, and run time. The results showed that the KNN model with k = 7 and the Manhattan distance metric had an accuracy of 99% and outperformed the SVM and CN2 Rule Inducer. The outcome of this study could provide practitioners and decision-makers with essential insights into appropriate engineering and traffic control measures to improve the safety of road users.
https://www.mdpi.com/1660-4601/17/14/5193
aggressive driving
traffic violations
inverse distance weighted (IDW) interpolation
geographic information system (GIS)
machine learning
collection DOAJ
language English
format Article
sources DOAJ
author Muhammad Zahid
Yangzhou Chen
Arshad Jamal
Khalaf A. Al-Ofi
Hassan M. Al-Ahmadi
title Adopting Machine Learning and Spatial Analysis Techniques for Driver Risk Assessment: Insights from a Case Study
publisher MDPI AG
series International Journal of Environmental Research and Public Health
issn 1661-7827
1660-4601
publishDate 2020-07-01
description Traffic violations, usually caused by aggressive driving behavior, are often seen as a primary contributor to traffic crashes. Violations result from either unintentional or deliberate acts by drivers that endanger fellow drivers, pedestrians, and property. This study aims to investigate different traffic violations (overspeeding, wrong-way driving, illegal parking, non-compliance with traffic control devices, etc.) using spatial analysis and several machine learning methods. Georeferenced violation data along two expressways (S308 and S219) for the year 2016 were obtained from the traffic police department in the city of Luzhou, China. Detailed descriptive analysis of the data showed that wrong-way driving was the most common violation type observed. Inverse Distance Weighted (IDW) interpolation in the ArcMap Geographic Information System (GIS) was used to develop violation hotspot zones to guide the efficient use of limited resources during the treatment of high-risk sites. Lastly, a systematic Machine Learning (ML) framework comprising K Nearest Neighbors (KNN) models (using k = 3, 5, 7, 10, and 12), a support vector machine (SVM), and the CN2 Rule Inducer was used to classify and predict each violation type as a function of several explanatory variables. The predictive performance of the proposed ML models was examined using several evaluation metrics, such as Area Under the Curve (AUC), F-score, precision, recall, specificity, and run time. The results showed that the KNN model with k = 7 and the Manhattan distance metric had an accuracy of 99% and outperformed the SVM and CN2 Rule Inducer. The outcome of this study could provide practitioners and decision-makers with essential insights into appropriate engineering and traffic control measures to improve the safety of road users.
topic aggressive driving
traffic violations
inverse distance weighted (IDW) interpolation
geographic information system (GIS)
machine learning
url https://www.mdpi.com/1660-4601/17/14/5193
_version_ 1724690670785921024
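
The abstract in this record reports that a KNN classifier with k = 7 and a "manhattan" setting performed best. The sketch below illustrates that kind of classifier in plain Python, reading "manhattan" as the Manhattan (L1) distance metric, which is an assumption. The feature vectors, the function names `manhattan` and `knn_predict`, and the toy data are all hypothetical; the record does not list the study's explanatory variables, and this is not the authors' implementation.

```python
from collections import Counter


def manhattan(a, b):
    """Manhattan (L1) distance between two equal-length numeric vectors."""
    return sum(abs(x - y) for x, y in zip(a, b))


def knn_predict(train_X, train_y, query, k=7):
    """Return the majority label among the k training points nearest to query.

    Ties in the vote go to the label seen first among the nearest neighbors.
    """
    # Pair each training point with its label and sort by distance to the query.
    ranked = sorted(zip(train_X, train_y), key=lambda p: manhattan(p[0], query))
    # Count the labels of the k closest points and take the most common one.
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


# Hypothetical two-feature toy data (NOT from the study): two clusters,
# each labeled with a violation type named in the abstract.
train_X = [(0, 0), (0, 1), (1, 0), (1, 1), (2, 0),
           (10, 10), (10, 11), (11, 10), (11, 11), (9, 10)]
train_y = ["overspeeding"] * 5 + ["illegal parking"] * 5

print(knn_predict(train_X, train_y, (1, 2), k=7))
print(knn_predict(train_X, train_y, (10, 9), k=7))
```

With k = 7 and only five points per cluster, each query draws two votes from the far cluster, so the example also shows why an odd k larger than the smallest class can still produce a clear majority.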