Research On Multi-Agent Path Planning Based On Deep Reinforcement Learning

Posted on:2024-03-09

Degree:Master

Type:Thesis

Country:China

Candidate:G H Dan

Full Text:PDF

GTID:2568307079476334

Subject:Electronic information

Abstract/Summary:

PDF Full Text Request

In recent years,intelligent warehouse systems have received increasing attention,and various types of warehouse systems have emerged,such as Amazon’s Kiva system and sorting centers.These systems essentially involve path planning for hundreds or thousands of intelligent agents,ensuring that they do not collide while quickly reaching their destination.With the development of deep reinforcement learning,it is now possible to solve more complex decision-making tasks,and using deep reinforcement learning for multi-agent path planning is a new research field in the field of artificial intelligence.Currently,the most advanced multi-agent path planning algorithms still rely on centralized planning,which is not suitable for real-world deployment.A decentralized framework based on reinforcement learning can learn the optimal planning strategy while mitigating real-time problems.However,it may lead to more vertex conflicts,thus reducing the planning success rate or prolonging the planning time.To address these issues,this thesis studies methods to improve the success rate of multi-agent path planning in uncertain environments using deep reinforcement learning and how to reduce collisions between multiple agents.The main research contents are as follows:(1)A multi-agent path planning method based on an improved A3C(Asynchronous Advantage Actor-Critic)algorithm is proposed to address the problem of online replanning in noisy and uncertain environments.This method combines reinforcement learning and imitation learning to learn a fully decentralized policy,enabling agents to perform real-time reactive path planning in environments with only partial observable information,while demonstrating implicit coordination.(2)A priority-based communication learning method is proposed to effectively avoid collisions in multi-agent path planning.This method combines an implicit priority learning module with traditional coupled planners,allowing multiple agents to dynamically determine communication topology while working in coordination.Information transmission and decision-making are performed based on the determined topology,effectively avoiding collisions.This thesis conducts research and exploration on multi-agent path planning,focusing on the issues of replanning paths in noisy and uncertain environments and effectively avoiding collisions.Two effective solutions are proposed using deep reinforcement learning.Finally,the effectiveness of the methods is validated in a grid environment based on the Asprilo benchmark.

Keywords/Search Tags:

reinforcement learning, imitation learning, multi-agent path planning, collision avoidance

PDF Full Text Request

Related items

1	Research And Application Of Agents Obstacle Avoidance And Path Planning Based On Deep Reinforcement Learning
2	Research On Intelligent Path Planning And Collision Avoidance Algorithm Of Six-Dof Robot Arm
3	Design Of Optimal Algorithm For Collision Avoidance Path Of Multi-mobile Robots
4	Research On AGV Path Planning Based On Cooperative Multi-agent Reinforcement Learnin
5	Research On Collision Avoidance And Navigation Strategy Of Mobile Robot Based On Deep Reinforcement Learning
6	Research On Virtual Crowd Path Planning Based On Deep Reinforcement Learning
7	Research And Application Of Multi-robot Obstacle Avoidance Navigation Based On Deep Reinforcement Learning
8	Research On AGV Storage Path Planning Based On Reinforcement Learning
9	Supervised Reinforcement Learning:methods And Applications
10	Research On Intelligent Path Planning Of Manipulator Based On Reinforcement Learning