Research On Game Strategy Of Winning Methods And Waiting Prediction In Mahjong Computer Game

Posted on:2024-01-25

Degree:Master

Type:Thesis

Country:China

Candidate:L Liu

Full Text:PDF

GTID:2530307181450744

Subject:Computer application technology

Abstract/Summary:

PDF Full Text Request

Computer game is an important branch of artificial intelligence field,which aims to“teach” computers to play chess,cards,mahjong and other games.This thesis takes mahjong as the research carrier,refers to the popular mahjong rules,and carries out corresponding research work for Winning and Waiting,two key sub-processes of mahjong game.As an imperfect information game,mahjong has the following research challenges:(1)Due to the fact that only one’s own hand are visible,the incompleteness of game information and the randomness of discard lead to a sharp increase in decision-making difficulty.(2)The three mahjong operations of Chow,Pung and Kong cause uncertainty in the game order,making it difficult to establish a traditional game tree.(3)The complex and diverse scoring rules for Winning make it difficult to construct a game state evaluation function.The main work of this thesis includes the following three aspects.(1)A segmented game algorithm that integrates knowledge and Monte Carlo tree search is designed.According to the statistical law of mahjong process,artificially divide the whole game process into two periods: the early and later stages.In the early stage of the game,by calculating number of shanten and effective tiles,quantifying the distance from Winning and advancing number of waiting tiles,quickly approaching Winning.In the later stage of the game,by constructing a simulation environment for mahjong games,adaptive improvements are made for each stage of selection,expansion,simulation and backtracking in traditional Monte Carlo tree search respectively.The best path for Winning with maximum expected return is selected based on simulation statistics results.The final experiment shows that compared with single empirical knowledge-based game algorithm,after integrating Monte Carlo tree search,both single-game maximum score,average number of Winning have been significantly improved.(2)A Waiting prediction method based on multi-scale feature extraction and attention mechanism is proposed.In mahjong games,temporal information on field has great influence on predicting Waiting.This thesis proposes solving Waiting prediction problem based on classification idea,first using multi-channel matrix representing “visible”features,“experience” features,“foresight” features in situation.Secondly,convolutional neural network and LSTM encoder are used extract above features,mine temporal information among them.Finally LSTM decoder Attention component calculate influence weight sequence information final prediction,so as pay attention important features when predicting Waiting.Based on the data from the National Computer Game Competition in2020-2021,the Waiting prediction model was trained,and its accuracy reached 87.3% on the validation set.At the same time,the predicted legal rate “Legal Rate” and the predicted fault tolerance rate “Accuracy＿N” both performed well.(3)Developed popular mahjong games intelligent agent.Based JJWorld network technology company’s popular mahjong competition platform,the four-layer architecture of the intelligent agent was built in a top-down manner.Using Python and Pytorch,we implemented a fast near Winning policy algorithm,optimal Winning path selection,Mahjong game environment simulation,and predictive Waiting model construction,combined with Mahjong game rules to form a complete intelligent agent decision-making process,participated in the 2022 China Computer Game Championship and won the runner-up,participated in the Mahjong competition of the 2022 IEEE World Game Conference and won the third place,proving that the research results of this article are feasible and have certain advanced nature.

Keywords/Search Tags:

Mahjong computer games, Prior knowledge, Monte Carlo tree search, Convolutional neural network, LSTM encoder-decoder, Attention Mechanism

PDF Full Text Request

Related items

1	Research On Knowledge Graph Completion Model Combining Temporal Convolutional Network And Monte Carlo Tree Search
2	Research On Medical Image Segmentation Methods Based On Encoder And Decoder
3	Research On Inplementation Of Artifacial Intelligence Algorithm In Texas Holdem Based On Monte Carlo Tree Search
4	Research On Emotion Classification Of Physiological Signals And Prior Knowledge In Online Learning Scenarios
5	Research On Medical Image Semantic Segmentation Method Based On Improved U-Net Model
6	Research On Pivotal Algorithms Of Dou Dizhu Computer Game Bid And Peasant Role Game
7	Research On Medical Image Segmentation Algorithm Based On Encoder-decoder Structure And Attention Mechanism
8	Research On Offline Handwritten Mathematical Expression Recognition Algorithm Based On Encoder-Decoder
9	Modeling CGF Tactical Decision Making Through Monte Carlo Tree Search
10	Study On Improved Convolutional Neural Network Based Classification For P300 Brain-Computer Interface