Cooperative Multiagent Deep Deterministic Policy Gradient (CoMADDPG) for Intelligent Connected Transportation with Unsignalized Intersection

Unsignalized intersection control is one of the most critical issues in intelligent transportation systems, which requires connected and automated vehicles to support more frequent information interaction and on-board computing. It is very promising to introduce reinforcement learning in the unsigna...

Full description

Bibliographic Details
Main Authors:	Tianhao Wu, Mingzhi Jiang, Lin Zhang
Format:	Article
Language:	English
Published:	Hindawi Limited 2020-01-01
Series:	Mathematical Problems in Engineering
Online Access:	http://dx.doi.org/10.1155/2020/1820527

id	doaj-f34617dd1ada4aa29d66e2ac91206f61
record_format	Article
spelling	doaj-f34617dd1ada4aa29d66e2ac91206f612020-11-25T02:53:11ZengHindawi LimitedMathematical Problems in Engineering1024-123X1563-51472020-01-01202010.1155/2020/18205271820527Cooperative Multiagent Deep Deterministic Policy Gradient (CoMADDPG) for Intelligent Connected Transportation with Unsignalized IntersectionTianhao Wu0Mingzhi Jiang1Lin Zhang2School of Information & Communication Engineering, Beijing University of Posts and Telecommunications, Beijing 100876, ChinaSchool of Information & Communication Engineering, Beijing University of Posts and Telecommunications, Beijing 100876, ChinaSchool of Information & Communication Engineering, Beijing University of Posts and Telecommunications, Beijing 100876, ChinaUnsignalized intersection control is one of the most critical issues in intelligent transportation systems, which requires connected and automated vehicles to support more frequent information interaction and on-board computing. It is very promising to introduce reinforcement learning in the unsignalized intersection control. However, the existing multiagent reinforcement learning algorithms, such as multiagent deep deterministic policy gradient (MADDPG), hardly handle a dynamic number of vehicles, which cannot meet the need of the real road condition. Thus, this paper proposes a Cooperative MADDPG (CoMADDPG) for connected vehicles at unsignalized intersection to solve this problem. Firstly, the scenario of multiple vehicles passing through an unsignalized intersection is formulated as a multiagent reinforcement learning (RL) problem. Secondly, MADDPG is redefined to adapt to the dynamic quantity agents, where each vehicle selects reference vehicles to construct a partial stationary environment, which is necessary for RL. Thirdly, this paper incorporates a novel vehicle selection method, which projects the reference vehicles on a virtual lane and selects the largest impact vehicles to construct the environment. At last, an intersection simulation platform is developed to evaluate the proposed method. According to the simulation result, CoMADDPG can reduce average travel time by 39.28% compared with the other optimization-based methods, which indicates that CoMADDPG has an excellent prospect in dealing with the scenario of unsignalized intersection control.http://dx.doi.org/10.1155/2020/1820527
collection	DOAJ
language	English
format	Article
sources	DOAJ
author	Tianhao Wu Mingzhi Jiang Lin Zhang
spellingShingle	Tianhao Wu Mingzhi Jiang Lin Zhang Cooperative Multiagent Deep Deterministic Policy Gradient (CoMADDPG) for Intelligent Connected Transportation with Unsignalized Intersection Mathematical Problems in Engineering
author_facet	Tianhao Wu Mingzhi Jiang Lin Zhang
author_sort	Tianhao Wu
title	Cooperative Multiagent Deep Deterministic Policy Gradient (CoMADDPG) for Intelligent Connected Transportation with Unsignalized Intersection
title_short	Cooperative Multiagent Deep Deterministic Policy Gradient (CoMADDPG) for Intelligent Connected Transportation with Unsignalized Intersection
title_full	Cooperative Multiagent Deep Deterministic Policy Gradient (CoMADDPG) for Intelligent Connected Transportation with Unsignalized Intersection
title_fullStr	Cooperative Multiagent Deep Deterministic Policy Gradient (CoMADDPG) for Intelligent Connected Transportation with Unsignalized Intersection
title_full_unstemmed	Cooperative Multiagent Deep Deterministic Policy Gradient (CoMADDPG) for Intelligent Connected Transportation with Unsignalized Intersection
title_sort	cooperative multiagent deep deterministic policy gradient (comaddpg) for intelligent connected transportation with unsignalized intersection
publisher	Hindawi Limited
series	Mathematical Problems in Engineering
issn	1024-123X 1563-5147
publishDate	2020-01-01
description	Unsignalized intersection control is one of the most critical issues in intelligent transportation systems, which requires connected and automated vehicles to support more frequent information interaction and on-board computing. It is very promising to introduce reinforcement learning in the unsignalized intersection control. However, the existing multiagent reinforcement learning algorithms, such as multiagent deep deterministic policy gradient (MADDPG), hardly handle a dynamic number of vehicles, which cannot meet the need of the real road condition. Thus, this paper proposes a Cooperative MADDPG (CoMADDPG) for connected vehicles at unsignalized intersection to solve this problem. Firstly, the scenario of multiple vehicles passing through an unsignalized intersection is formulated as a multiagent reinforcement learning (RL) problem. Secondly, MADDPG is redefined to adapt to the dynamic quantity agents, where each vehicle selects reference vehicles to construct a partial stationary environment, which is necessary for RL. Thirdly, this paper incorporates a novel vehicle selection method, which projects the reference vehicles on a virtual lane and selects the largest impact vehicles to construct the environment. At last, an intersection simulation platform is developed to evaluate the proposed method. According to the simulation result, CoMADDPG can reduce average travel time by 39.28% compared with the other optimization-based methods, which indicates that CoMADDPG has an excellent prospect in dealing with the scenario of unsignalized intersection control.
url	http://dx.doi.org/10.1155/2020/1820527
work_keys_str_mv	AT tianhaowu cooperativemultiagentdeepdeterministicpolicygradientcomaddpgforintelligentconnectedtransportationwithunsignalizedintersection AT mingzhijiang cooperativemultiagentdeepdeterministicpolicygradientcomaddpgforintelligentconnectedtransportationwithunsignalizedintersection AT linzhang cooperativemultiagentdeepdeterministicpolicygradientcomaddpgforintelligentconnectedtransportationwithunsignalizedintersection
_version_	1715359870167810048

Cooperative Multiagent Deep Deterministic Policy Gradient (CoMADDPG) for Intelligent Connected Transportation with Unsignalized Intersection

Similar Items