Monte Carlo Tree Search For "Dou Di Zhu"

Posted on:2021-01-10

Degree:Master

Type:Thesis

Country:China

Candidate:Q W Peng

Full Text:PDF

GTID:2428330623984369

Subject:Computer Science and Technology

Abstract/Summary:

PDF Full Text Request

The popular three-player game "Dou dizhu" in china is a typical incomplete information game.The game has not only confrontation between two farmers and the landlord,but also cooperation between two farmers.With the important progress of artificial intelligence technology in large-scale chess and card games,for example,Alpha Go and Libratus have achieved the ability to surpass current human professional players in Go and one-to-one non-limit Texas Hold'em games.The game "Dou dizhu" has also attracted widespread interest among artificial intelligence researchers.Monte Carlo Tree Search is an important method to solve the approximate solution of largescale search problems through the sampling method,it is an important part of Alpha Go Zero.In this paper,Monte Carlo Tree Search is combined with the characteristics of "Dou dizhu" and the convolutional neural network technology to explore and research the game strategy of "Dou dizhu".The specific work is as follows:1.Propose the Monte Carlo Tree Search Algorithm(MCTSHS)based on hand splitting.Statistics show that according to the rules of the game,when a hand is split into a card set that does not exceed the minimum number of splits plus 3,the card types chosen by human players in the actual game are included in this deck with a probability of over 99%.Therefore,on this basis,a hand-split algorithm is proposed to prune the game tree to improve its search efficiency.Experimental results show that the MCTSHS algorithm can make better game strategies without relying on human data.2.Propose the Monte Carlo Tree Search Algorithm(MCM)combined with convolutional neural network.Firstly,for MCTSHS,there is a problem that the decision-making time is too long and the learned strategy cannot be fully utilized,it is proposed to use the convolutional neural network to learn the historical decision data of MCTSHS,to realize the mapping of the state and possible decisions to the decision benefits,then choose the most profitable decision as the actual decision,and then greatly reduce the decision time;Secondly,in view of the errors in the learning of convolutional neural networks and the problem of MCTSHS's poor decision results due to experimental preset conditions,it is proposed to use Monte Carlo Tree Search Algorithm to improve the output of convolutional neural networks as the final decision.The experimental results show that the MCM algorithm is significantly better than the current mainstream "Dou dizhu" strategy.

Keywords/Search Tags:

Computer game, incomplete information game, Monte Carlo tree search, convolutional neural network, Dou dizhu

PDF Full Text Request

Related items

1	Incomplete Information Machine Game Based On CNN And MCTS
2	Research On Multi-Player Imperfect Information Computer Game Based On Residual Network And SDMCTS
3	Research And Application Of Bridge Game Algorithm With Incomplete Information
4	Computer Go Game Research Based On Monte Carlo Tree Search
5	Research On Multi-player Imperfect Information Computer Game Based On NFSP And ISMCTS
6	Research And Application Of Computer Game With Imperfect Information
7	Research On The Computer Game Key Technology And System Design Based On Checkers Complete Information
8	Computer Poker Based On Monte-Carlo Tree Search
9	Research On The Game Platform Of Gomoku Man-machine Based On DOBOT Manipulator
10	Research On The Game Of Go Based On Deep Learning And Mente Carlo Tree Search