Research On Obstacle Avoidance In Multi-agent Fault-tolerant Formation

Posted on:2024-09-03

Degree:Master

Type:Thesis

Country:China

Candidate:X W Chen

Full Text:PDF

GTID:2568307103469854

Subject:Electronic information

Abstract/Summary:

PDF Full Text Request

The traditional agent formation approaches have strong dependence on agent dynamics model.Hence,it is difficult to be applied to complex working scenarios.In this thesis,based on the deep reinforcement learning algorithm,a multi-agent localization method and control algorithm are designed.Multi Agent Deterministic Policy Gradient Algorithm(MADDPG)is used to guide agents to make decision on movement.The localization,formation organization,obstacle avoidance and fault tolerance control of multiple agents are realized.The main research work and innovation of this thesis are as follows:(1)A localization method combining a least square method and a multi-layer perceptron algorithm is proposed.The localization method uses the three-dimensional positioning data from four observation stations to determine the position of agents in three-dimensional space by the iterative least square method,which solves the indoor positioning problem of agents.By considering possible measurement errors(for example,agents block each other or obstacles block the view of the observation station),this thesis uses the multi-layer perceptron to correct the errors.The simulation results show that the positioning accuracy is maintained at millimeter level.(2)A multi-agent formation method based on MADDPG algorithm is designed.This method uses a new formation reward function,which only restricts the topology between agents,but does not specify the specific position of agents in the formation.At the same time,in order to solve the problems such as agent off-line in practical work,the compression method based on knowledge distillation technology is adopted.This method uses the teacherstudent model to integrate various formation strategies into the same model,and realizes the fault-tolerant control.Simulation results show that the multi-agent formation method can realize the tasks of tracking,formation,obstacle avoidance and fault tolerance control.It is superior to the traditional artificial potential field method in terms of convergence speed,path optimization and operation efficiency.The convergence issues of independent reinforcement learning algorithm is solved.The balance problem of obstacle avoidance and formation maintenance is dealt with.(3)An intelligent robot as an agent is designed in this thesis.The control system and control algorithm are used.The data required by the algorithm are generated by positioning equipment.The results show that the proposed algorithm is suitable for the actual system,and can guide the robot to make decision and realize the formation tasks.

Keywords/Search Tags:

Indoor positioning technology, multi-agent formation, reinforcement learning, knowledge distillation, fault-tolerant control

PDF Full Text Request

Related items

1	Research On Key Technologies Of Reinforcement Learning For Cooperative Multi-Agent System
2	Research On Indoor Positioning Method Based On Agent Interaction Reinforcement Learning
3	Distributed Fault Estimation And Formation Control Of Multi-agent Systems And Its Application
4	Research On Multi-agent Collaboration And Formation Control Based On Deep Reinforcement Learning
5	Fault Detection And Estimation And Fault Tolerant Control For Multi-Agent Systems
6	Multi Agent Path Planning And Formation Based On Hierarchical Reinforcement Learning
7	Research On Knowledge Transfer Method For Multi-agent Collaboration
8	Fault Tolerant Consensus Control For Multi-Agent Systems With Actuator Faults
9	Research On Fault-tolerant Control Strategy Of Multi-degree-of-freedom Manipulators In Nuclear Environment
10	Research On Fault-tolerant Control For Formation Tracking Of High-order Multi-agent Systems In Switching Topologies