Application of temporal difference learning to the game of Snake

Posted on:2011-12-17

Degree:M.Eng

Type:Thesis

University:University of Louisville

Candidate:Lockhart, Christopher

Full Text:PDF

GTID:2448390002953531

Subject:Engineering

Abstract/Summary:

The game of Snake has been selected to provide a unique application of the TD(lambda) algorithm as proposed by Sutton. A reinforcement learning technique for producing computer controlled players is documented. Using value function approximation with multilayer artificial neural networks and the actor-critic architecture, computer players capable of playing the game of Snake can be created. The adaptation to the standard neural network backpropagation procedure will be documented. Not only does the proposed technique provide reasonable player performance, its application is unique; this approach to Snake has never been documented. By performing sets of trials, the performance of the players are evaluated and compared against an existing machine learning technique. Learning curves provide visualization for the results. Though the snake players are shown to be capable of achieving lower scores than with the existing method, the technique is able to produce agents that accumulate scores, much more efficiently.

Keywords/Search Tags:

Snake, Application, Game, Technique

Related items

1	An Improved Snake Model And Its Application In Multiple Plane Targets Detection
2	Snake Model Based On Optimization Of Parameter And The Opplications In MEI Image Segmetation
3	Research On Movement Mechanism And Structure Design Of A Sea Snake-like Robot
4	Application Of Improved GVF Snake Algorithm In Lung Cancer Detection Tectnology
5	The Research And Implementation Of Key Technique Of Mobile Game Based On J2ME
6	Cell Image Segmentation Based On Snake Model
7	Analysis And Modeling Of Snake-like Robot Locomotion And Its Application In Bridge Cable Inspection
8	Research And Application Of Image Segmentation Algorithm Based On Snake Model
9	The Application Of Snake Model In Contour Extraction
10	Wavelet Interpolation And The Snake Model Application In The Magnified Image Clearer