Multi-Agent Reinforcement Learning Through Weighted Experience Sharing

Posted on:2013-04-08

Degree:Master

Type:Thesis

Country:China

Candidate:B D L A

Full Text:PDF

GTID:2248330374989682

Subject:Computer Science and Technology

Abstract/Summary:

PDF Full Text Request

Reinforcement learning allows to program agents by reward and punishment without specifying how to achieve the task. Multi-agent reinforcement learning is an extension of reinforcement learning concept to multi-agent environments. From an individual agentâ€™s perspective, multi-agent systems differ from single-agent systems most significantly in that the environmentâ€™s dynamics can be determined by other agents. When multiple agents learn and, hence, adapt their behavior in parallel, each individual agent faces the difficulty of learning in a non-stationary environment. In addition to the uncertainty that may be inherent in the domain, other agents intentionally affect the environment in unpredictable ways. Thus, all multi-agent systems can be viewed as having dynamic environments. Consequently, convergence guarantees, such as the convergence of the Bellman-style single-agent reinforcement techniques, no longer hold.Research in the area of multi-agent systems is concerned with the effective coordination of autonomous agents to perform tasks so as to achieve high quality overall system performance. Multi-agent coordination challenges include the lack of single point of control, local views of each agent that provide only incomplete information, private goals and solution procedures of the agents, communication asynchrony, dynamic environments and uncertainty. Coordination regimes include teamwork, where the agents cooperate and coordinate to achieve a global team goal, or regimes where the agents are self-interested and try to achieve individual goals. An extreme case is where agents are adversaries and try to fulfill their individual goals even while inflicting costs/harm on one another. This thesis proposes a new concept of multi-agent reinforcement learning based on weighted experience sharing. Under the new concept, each agent benefits from the experience of others, and adds his own experience to the knowledge base. By this means our method guarantees the achievement of the point of convergence in so called dynamic multi-agent environments. We also seek to proof that the use of multi-agents allows the rapid convergence of the learning process. Thus, the more number of agents, the quicker the point of convergence arrives. The extension from single-agent system to a multi-agent system is done by means of the Q-learning algorithm as a combination of two methods; Independent Learners and Cooperative Learning.

Keywords/Search Tags:

Reinforcement Learning, Weighted Experience Sharing, Multi-Agent Learning

PDF Full Text Request

Related items

1	Research On Multi-agent Coordination Policy Optimization Based On Deep Reinforcement Learning
2	Research On Deep Reinforcement Learning Technology For Multi-agent Collaboration
3	Research On Multi-agent System Decision Algorithm Based On Deep Reinforcement Learning
4	A Study Of Multi-agent Reinforcement Learning Based On Weighted Q-value Decomposition
5	Research On Multi-agent Attack And Defense Countermeasures Based On Deep Reinforcement Learning
6	Multi-agent Confrontation Algorithm Based On Reinforcement Learning
7	Research On Key Technologies Of Multi-agent Cooperation Problems Based On Reinforcement Learning
8	Research On Knowledge Sharing And Exploration Mechanism In Multi-agent Reinforcement Learning
9	Research On Reinforcement Learning Methods Based On Weighted Double Mechanisms
10	Experience Replay In Multi-Agent Deep Reinforcement Learning