Reinforcement learning in stochastic games against bounded memory opponents

Posted on:2007-05-02

Degree:M.Sc

Type:Thesis

University:McGill University (Canada)

Candidate:Vrljicak, Tomislav

Full Text:PDF

GTID:2458390005990691

Subject:Computer Science

Abstract/Summary:

Learning to play in the presence of independent and self-motivated opponents is a difficult task, because the dynamics of the environment appear to be non-stationary. In recent years there has been considerable amount of research in the field of Multi-Agent Learning, and some of this work has been in the context of Reinforcement Learning. One commonly used approach has been to restrict the opponent to a class of computationally bounded players, creating a parametrized model of it, and then search the model that can best explain the observed opponent behavior. In this thesis we study the problem of Reinforcement Learning in Stochastic Games, and propose two models for the opponent and two search algorithms, one based on Tests of Significance and the other on Maximum a Posteriori probabilities. We integrate the modeled opponent into a Markovian environment, and present an algorithm for solving the resulting MDP. Finally, we perform experiments on the effectiveness of the search algorithms.

Keywords/Search Tags:

Opponent, Reinforcement learning

Related items

1	Research On Mean-Field Multi-Agent Reinforcement Learning In Large Scale Scenarios
2	Research On Multi-Issue Automated Negotiation Based On Agent Reinforcement Learning
3	Supervised Reinforcement Learning:methods And Applications
4	The Research Of RoboCup Simulation Soccer
5	Research On Reinforcement Learning Based Control Method Of Magnetic Navigation AGV
6	Reinforcement Learning Based On Spectral Graph Theory
7	Study Of Multi-agent Learning Problem Based On Reinforcement Learning
8	Research On Sample-efficient Reinforcement Learning Methods
9	Research And Implementation Of Reinforcement Learning Method About Transport Strategy Between Carrier-based Aircraft Station
10	Research On Reinforcement Learning Based On Hidden Space Modeling