Monte Carlo Sampling and Regret Minimization for Equilibrium Computation and Decision-Making in Large Extensive Form Games

Posted on:2014-07-19

Degree:Ph.D

Type:Thesis

University:University of Alberta (Canada)

Candidate:Lanctot, Marc

Full Text:PDF

GTID:2450390005492893

Subject:Computer Science

Abstract/Summary:

In this thesis, we investigate the problem of decision-making in large two-player zero-sum games using Monte Carlo sampling and regret minimization methods. We demonstrate four major contributions. The first is Monte Carlo Counterfactual Regret Minimization (MC-CFR): a generic family of sample-based algorithms that compute near-optimal equilibrium strategies. Secondly, we develop a theory for applying counterfactual regret minimization to a generic subset of imperfect recall games as well as a lossy abstraction mechanism for reducing the size of very large games. Thirdly, we describe Monte Carlo Minimax Search (MCMS): an adversarial search algorithm based on *-Minimax that uses sparse sampling. We then present variance reduction techniques that can be used in these settings, with a focused application to Monte Carlo Tree Search (MCTS). We thoroughly evaluate our algorithms in practice using several different domains and sampling strategies.

Keywords/Search Tags:

Monte carlo, Regret minimization

Related items

1	Quantum Monte Carlo Methods And Their Applications In The Condensed Matter Physics
2	The Improvement Of The Least Squares Monte Carlo Method And The Application Research Of Option Pricing
3	The Acceleration And Electrical Field Representation Of Monte Carlo Simulation And Its Application In Optical Bio-imaging
4	Research On Value At Risk Of Open-End Fund Based On Monte Carlo Simulation
5	Quantum Monte Carlo Simulations Of Fermion-boson Lattice Systems
6	Improvement And Application Of MCMC Method For Paramter Estimation Of SV Models
7	Efficient Monte Carlo Method For Parabolic Stochastic Partial Differential Equations
8	Monte Carlo and quasi-Monte Carlo methods and their applications
9	Advanced Monte Carlo methods for analysis of very high temperature reactors: On-the-fly Doppler broadening and deterministic/Monte Carlo methods
10	Markov chain Monte Carlo estimation of multi-factor affine term-structure models