Research On Multi-agent Communication Based On Reinforcement Learning

Posted on:2024-09-14

Degree:Master

Type:Thesis

Country:China

Candidate:Y X Meng

Full Text:PDF

GTID:2568307115458004

Subject:Communication engineering

Abstract/Summary:

PDF Full Text Request

With the increasing application and popularization of 5G communication technology,people’s demand for communication continue to rise.It is not only necessary to achieve high speed and high reliability,but also more intelligent and humanized interaction methods,so as to better meet the communication requirements in different demand scenarios.In view of the current situation of channel capacity approaching Shannon limit in traditional communication and the huge energy consumption of communication system,this paper investigates the communication between multi-agent based on reinforcement learning,aiming to improve the transmission efficiency of communication and further improve the communication speed between agents,so as to change the behavior of agents to improve the execution efficiency of cooperative tasks.The main research work is as follows:1.A framework of pragmatic communication system is designed,in which the difference between semantics and pragmatics is investigated from the perspective of linguistics.And integrates the communication subject into the communication system,the content of the communication is the language formed by training,with target and environment construction as input,task completion as output,and action performance as an evaluation criterion define the pragmatic communication,which is used to better accomplish specific tasks in specific environments.This paper also summarizes the characteristics of pragmatic communication,especially its two advantages of lightness and confidentiality.2.Based on the framework and process of the designed pragmatic communication system,the glue neural layer is designed.As the intermediary between two agents,Glue Neural Layer connects two agents to form a deeper neural network for more efficient communication training.At the same time,a new activation function is designed,which used Softmax function instead of Logistic function to enlarge the glue neural layer,and used Gumbel to distribute random re-parameters to transfer parameters more accurately at the initial stage of training,thus faster convergence.Finally,temperature parameter is introduced,which is more conducive to lightweight communication.3.Based on the game model of the Prisoner’s Dilemma,a simulation experimental scene model,which was set as four prisoners and an interrogation room.By changing the dimension of glue neural layer and activation function to study the performance of pragmatic communication and the influence of glue neural layer.The experiment proved that Gumbel-Softmax is more effective in spreading and back-spreading messages,superior to the existing Logistic activation function,and can significantly improve the speed of communication between agents,so as to improve the execution efficiency of cooperative tasks.

Keywords/Search Tags:

multi-agent communication, pragmatic communication, glue neural layer, activation function

PDF Full Text Request

Related items

1	Comprehensive-Information-Based Formal Pragmatic Approach For Agent Communication
2	The Research Of Cellular Neural Networks With Trapezoidal And Impulsive Activation Function
3	Design And Implementation Of Multi Agent Communication System Based On Cognitive Consistency
4	Multistability Analysis Of Switched Neural Networks With State-Dependent Switching Rules
5	Research On Consensus And Optimal Control Method Of Multi-Agent System With Limited Energy
6	Activation Function Awareness Of RNN Algorithm Optimization
7	The Study Of Blind Detection Algorithm Of Underwater Discrete Multi-level Hopfield Neural Network
8	Multi-agent Collaboration And Communication Technologies
9	Study On Different Types Of Noise Benefit Of Convolutional Neural Networks Under ReLU Activation Function
10	Research And Application On Convolutional Neural Network Algorithm Based On Improved Activation Function