Font Size: a A A

Applying Temporal Difference Learning In Chinese-chess Computer Game

Posted on:2010-04-10Degree:MasterType:Thesis
Country:ChinaCandidate:Y L HeFull Text:PDF
GTID:2198360302961820Subject:Computer application technology
Abstract/Summary:PDF Full Text Request
Compared with International-chess, Chinese-chess is a more complex board game. Chinese-chess is played on a square grid containing 10×9 intersections with 32 pieces.In Chinese-chess computer game (CCCG), the most time-consuming aspect of building a high performance game playing program is the design, implementation and tuning of the position evaluation function.In this thesis, a three-layer fully-connected feed forward neural network is designed as a position evaluation function. Temporal difference learning (TDL) is a reinforcement learning algorithm, which uses the difference of a pair of successive position-values to incrementally update the weights. Based on the three-layer neural network with single output, we derive a new weight updating rule for applying TD(λ) in CCCG. Starting with random initial weights between-0.5 and 0.5,the neural network is trained through the new rule on the professional game records.In the training process, each professional game record is learned iteratively by the neural network until the evaluation value of the position in professional game record becomes stable ultimately. In the experiments, we validate that our learned evaluation function is feasible and effective.Furthermore, we obtain three main relationships between(1)the number of nodes in hidden layer of neural network, (2) the parameterλ,(3) the learning rate a,and the performance of the evaluation function in the process of training different neural networks through temporal difference learning.
Keywords/Search Tags:Chinese-Chess Computer Game, Temporal difference learning, Neural network, Professional game records, Evaluation function
PDF Full Text Request
Related items