Traffic Signal Timing For Regional Boundary Intersections Using Deep Reinforcement Learning

Posted on:2022-12-29

Degree:Master

Type:Thesis

Country:China

Candidate:Y Y Wang

Full Text:PDF

GTID:2492306779498074

Subject:Computer Software and Application of Computer

Abstract/Summary:

PDF Full Text Request

With the rapid development of urbanization,the problem of urban traffic congestion has expanded from nodes and arteries to regions.As a result,regional traffic control is significantly necessary for alleviating urban traffic congestion.The macroscopic fundamental diagram（MFD）is a graph reflecting the relationship among regional vehicle flows,densities and speeds,which can be utilized for evaluating the network performance.Using MFD to manipulate the traffic flow at the perimeter of a congested region（i.e.,perimeter control）,so as to alleviate the congestion in such a protected region,is a research hotspot in the field of macroscopic traffic control.From a microscopic perspective,signal control at intersections is an essential method to implement regional perimeter control.Most existing signal control methods adjust signal timings based on historical data,which has limitations of responsing to dynamic traffic states.However,signal control based on deep reinforcement learning has the characteristics of selflearning and adapting to random traffic environment,which is more advantageous than traditional signal control.As a result,this thesis adopts MFD for traffic flow control among regions,while deep reinforcement learning algorithms are applied to signal timing optimization at boundary intersections.The effectiveness of the proposed macro-micro perimeter control strategy is verified by a traffic simulation.The research contents of this thesis are given as follows:First,the macroscopic perimeter control parameters are solved based on a model predictive control framework.For a two-region traffic system,a regional perimeter control model is established,in which the boundary vehicle release ratio is the control parameter and the maximization of regional completion flow is the objective.The regional perimeter control model is used to predict the states of the traffic system in the future,while the control parameters are optimized by a genetic algorithm.Then,the optimal control parameters are fed into a microscopic traffic simulation model to improve the overall traffic efficiency of the road network through a hierarchical control strategy.Also,the current state of the microscopic traffic simulation model is sent back into the regional boundary control model.Second,the microscopic signal timings at intersections are obtained based on deep reinforcement learning.To describe the vehicle dynamics at boundary intersections,a boundary intersection signal control model is developed to build an interactive environment for deep reinforcement learning agents.The number of waiting vehicles in the entrance lanes at the boundary intersections and the number of expected transfer vehicles（i.e.,the product of macroscopic control parameters and the transfer flow）are constructed as the state space.The given signal control strategy is used as the action space,and the reward function is designed according to the actual number of transfer vehicles and the expected one at the boundary intersections.The deep reinforcement learning algorithm achieves the actual number of vehicles transferred at the boundary intersection as close as possible to the expected value by maximizing the cumulative reward function.Subsequently,the deep Q-network and the proximal policy optimization algorithm are compared in multiple scenarios.The experiment results show that the proximal policy optimization algorithm performs better in terms of convergence effect and convergence speed.Finally,a virtual traffic network is built by the traffic simulation software SUMO.Its Tra CI interface is used to obtain MFD of the simulated road network as an evaluation tool.Subsequently,a signal control simulation model at boundary intersections is constructed as an interactive environment for the deep reinforcement learning agents.The agent switches signal phases dynamically by observing the vehicle states in the entrance lanes at the intersections,so as to control the traffic inflow and outflow at the boundary to ensure that the number of vehicles in the road network is as close to the optimal value as possible.Experiment results show that the regional congestion is alleviated to some extent.

Keywords/Search Tags:

perimeter signal control, macroscopic fundamental diagram, deep reinforcement learning, SUMO simulation

PDF Full Text Request

Related items

1	Regional Perimeter Traffic Signal Control Based On Explicit Model Predictive Control Method
2	Premeter Control And Simulation Optimization Of Critical Area Based On MFD
3	Estimation Of Macroscopic Fundamental Diagram And Perimeter Flow Dynamic Perimeter Flow Control Strategy In The Multi-region Network
4	Research On Regional Road Network Perimeter Control And Collaboration Method Based On Macroscopic Fundamental Diagram
5	Research On Iterative Learning Control Method Of Region Perimeter Control Based On MFD
6	Regional Signal Cooperative Control Algorithem Based On Deep Reinforcement Learning
7	Research On Improved Perimeter Feedback Control Strategy Of Urban Traffic Sub-region
8	Iterative Learning Perimeter Control For Urban Traffic Region
9	Research On Signal Control Of Urban Road Network Based On MFD
10	Study On The Impact Factors Of Macroscopic Fundamental Diagram For Urban Traffic System Based On Cellular Automata Model