A Deep Adaptive Traffic Signal Controller With Long-Term Planning Horizon and Spatial-Temporal State Definition Under Dynamic Traffic Fluctuations

This study proposes a new adaptive traffic signal control scheme to effectively manage dynamically fluctuating traffic flows through intersections. A spatial-temporal representation of the traffic state at an intersection has been designed to efficiently identify traffic patterns from complex inters...

Full description

Bibliographic Details
Main Authors: Shurong Li, Chong Wei, Xuedong Yan, Lu Ma, Deqi Chen, Ying Wang
Format: Article
Language:English
Published: IEEE 2020-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9001092/
id doaj-b4e95a6de53e4c758780ed4c14322455
record_format Article
spelling doaj-b4e95a6de53e4c758780ed4c143224552021-03-30T02:29:12ZengIEEEIEEE Access2169-35362020-01-018370873710410.1109/ACCESS.2020.29748859001092A Deep Adaptive Traffic Signal Controller With Long-Term Planning Horizon and Spatial-Temporal State Definition Under Dynamic Traffic FluctuationsShurong Li0https://orcid.org/0000-0002-6719-8969Chong Wei1https://orcid.org/0000-0002-6703-3419Xuedong Yan2https://orcid.org/0000-0003-0120-9183Lu Ma3https://orcid.org/0000-0002-4492-6636Deqi Chen4Ying Wang5https://orcid.org/0000-0002-8596-5308MOT Key Laboratory of Transport Industry of Big Data Application Technologies for Comprehensive Transport, School of Traffic and Transportation, Beijing Jiaotong University, Beijing, ChinaMOT Key Laboratory of Transport Industry of Big Data Application Technologies for Comprehensive Transport, School of Traffic and Transportation, Beijing Jiaotong University, Beijing, ChinaMOT Key Laboratory of Transport Industry of Big Data Application Technologies for Comprehensive Transport, School of Traffic and Transportation, Beijing Jiaotong University, Beijing, ChinaMOT Key Laboratory of Transport Industry of Big Data Application Technologies for Comprehensive Transport, School of Traffic and Transportation, Beijing Jiaotong University, Beijing, ChinaMOT Key Laboratory of Transport Industry of Big Data Application Technologies for Comprehensive Transport, School of Traffic and Transportation, Beijing Jiaotong University, Beijing, ChinaMOT Key Laboratory of Transport Industry of Big Data Application Technologies for Comprehensive Transport, School of Traffic and Transportation, Beijing Jiaotong University, Beijing, ChinaThis study proposes a new adaptive traffic signal control scheme to effectively manage dynamically fluctuating traffic flows through intersections. A spatial-temporal representation of the traffic state at an intersection has been designed to efficiently identify traffic patterns from complex intersection environments, and a deep neural network (long short-term memory network, LSTM) is used to determine look-ahead signal control decisions based on the estimated long-term feedback from a given traffic state. The actor-critic algorithm, one of the reinforcement learning-based algorithms, is adopted to obtain the essential parameters of the LSTM deep neural network through multiple interactions between a simulated environment and the corresponding adaptive traffic signal controller. A realistic model environment comprising a 24-hour time-varying traffic demand including rush hour and non-rush hour situations served as the basis for traffic generation in the numerical experiments to confirm the effectiveness of the proposed scheme. The results of these experiments show that, compared to an optimized fixed time plan (Synchro), the proposed scheme can reduce waiting times at intersections by an astounding 50% with consequential benefits of reducing fuel consumptions, emissions, queue lengths, and vehicle delays whilst increasing mean speeds.https://ieeexplore.ieee.org/document/9001092/Adaptive traffic signal controlreinforcement learningspatial-temporal traffic state representationactor-criticLSTM24-hour dynamic traffic fluctuations
collection DOAJ
language English
format Article
sources DOAJ
author Shurong Li
Chong Wei
Xuedong Yan
Lu Ma
Deqi Chen
Ying Wang
spellingShingle Shurong Li
Chong Wei
Xuedong Yan
Lu Ma
Deqi Chen
Ying Wang
A Deep Adaptive Traffic Signal Controller With Long-Term Planning Horizon and Spatial-Temporal State Definition Under Dynamic Traffic Fluctuations
IEEE Access
Adaptive traffic signal control
reinforcement learning
spatial-temporal traffic state representation
actor-critic
LSTM
24-hour dynamic traffic fluctuations
author_facet Shurong Li
Chong Wei
Xuedong Yan
Lu Ma
Deqi Chen
Ying Wang
author_sort Shurong Li
title A Deep Adaptive Traffic Signal Controller With Long-Term Planning Horizon and Spatial-Temporal State Definition Under Dynamic Traffic Fluctuations
title_short A Deep Adaptive Traffic Signal Controller With Long-Term Planning Horizon and Spatial-Temporal State Definition Under Dynamic Traffic Fluctuations
title_full A Deep Adaptive Traffic Signal Controller With Long-Term Planning Horizon and Spatial-Temporal State Definition Under Dynamic Traffic Fluctuations
title_fullStr A Deep Adaptive Traffic Signal Controller With Long-Term Planning Horizon and Spatial-Temporal State Definition Under Dynamic Traffic Fluctuations
title_full_unstemmed A Deep Adaptive Traffic Signal Controller With Long-Term Planning Horizon and Spatial-Temporal State Definition Under Dynamic Traffic Fluctuations
title_sort deep adaptive traffic signal controller with long-term planning horizon and spatial-temporal state definition under dynamic traffic fluctuations
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2020-01-01
description This study proposes a new adaptive traffic signal control scheme to effectively manage dynamically fluctuating traffic flows through intersections. A spatial-temporal representation of the traffic state at an intersection has been designed to efficiently identify traffic patterns from complex intersection environments, and a deep neural network (long short-term memory network, LSTM) is used to determine look-ahead signal control decisions based on the estimated long-term feedback from a given traffic state. The actor-critic algorithm, one of the reinforcement learning-based algorithms, is adopted to obtain the essential parameters of the LSTM deep neural network through multiple interactions between a simulated environment and the corresponding adaptive traffic signal controller. A realistic model environment comprising a 24-hour time-varying traffic demand including rush hour and non-rush hour situations served as the basis for traffic generation in the numerical experiments to confirm the effectiveness of the proposed scheme. The results of these experiments show that, compared to an optimized fixed time plan (Synchro), the proposed scheme can reduce waiting times at intersections by an astounding 50% with consequential benefits of reducing fuel consumptions, emissions, queue lengths, and vehicle delays whilst increasing mean speeds.
topic Adaptive traffic signal control
reinforcement learning
spatial-temporal traffic state representation
actor-critic
LSTM
24-hour dynamic traffic fluctuations
url https://ieeexplore.ieee.org/document/9001092/
work_keys_str_mv AT shurongli adeepadaptivetrafficsignalcontrollerwithlongtermplanninghorizonandspatialtemporalstatedefinitionunderdynamictrafficfluctuations
AT chongwei adeepadaptivetrafficsignalcontrollerwithlongtermplanninghorizonandspatialtemporalstatedefinitionunderdynamictrafficfluctuations
AT xuedongyan adeepadaptivetrafficsignalcontrollerwithlongtermplanninghorizonandspatialtemporalstatedefinitionunderdynamictrafficfluctuations
AT luma adeepadaptivetrafficsignalcontrollerwithlongtermplanninghorizonandspatialtemporalstatedefinitionunderdynamictrafficfluctuations
AT deqichen adeepadaptivetrafficsignalcontrollerwithlongtermplanninghorizonandspatialtemporalstatedefinitionunderdynamictrafficfluctuations
AT yingwang adeepadaptivetrafficsignalcontrollerwithlongtermplanninghorizonandspatialtemporalstatedefinitionunderdynamictrafficfluctuations
AT shurongli deepadaptivetrafficsignalcontrollerwithlongtermplanninghorizonandspatialtemporalstatedefinitionunderdynamictrafficfluctuations
AT chongwei deepadaptivetrafficsignalcontrollerwithlongtermplanninghorizonandspatialtemporalstatedefinitionunderdynamictrafficfluctuations
AT xuedongyan deepadaptivetrafficsignalcontrollerwithlongtermplanninghorizonandspatialtemporalstatedefinitionunderdynamictrafficfluctuations
AT luma deepadaptivetrafficsignalcontrollerwithlongtermplanninghorizonandspatialtemporalstatedefinitionunderdynamictrafficfluctuations
AT deqichen deepadaptivetrafficsignalcontrollerwithlongtermplanninghorizonandspatialtemporalstatedefinitionunderdynamictrafficfluctuations
AT yingwang deepadaptivetrafficsignalcontrollerwithlongtermplanninghorizonandspatialtemporalstatedefinitionunderdynamictrafficfluctuations
_version_ 1724185110206480384