The Optimization Of Skills And Cooperation With Machine Learning In RoboCup3D

Posted on:2018-04-14

Degree:Master

Type:Thesis

Country:China

Candidate:H H Feng

Full Text:PDF

GTID:2348330536979961

Subject:Control engineering

Abstract/Summary:

PDF Full Text Request

In this paper,we propose two key components,a machine learning based infrastructure for optimizing the behavior of humanoid robots using the high-throughput HTCondor computer cluster system and a cooperation scheme based on formation and role assignment,that are used by our Apollo3 D team at Nanjing University of Posts and Telecommunications in the RoboCup 3D soccer simulation competitions.Both of the two components are designed based on in-depth research on the individual skills and coordination mechanism of the robot team.On the behavior optimization problem,we use the Covariance Matrix Adaptation Evolution Strategy(CMA-ES)to optimize the multidimensional parameters of 5 different types of robots' motions in continuous space.Implementing this algorithm in a reinforcement learning infrastructure,we successfully optimized the kicking of the soccer robots.Facing the difficulties of over fitting in single training task when optimizing the walking skill,we designed a layered learning strategy using multiple subtasks.By using this strategy,enhanced 5 types of robots' walking,turning and dribbling behaviors of the biped robots are finally achieved.On the optimization problem of Multi-agent cooperation,we mainly focous on two points,a strategy positioning machenism based on Simulation Based Strategy Positioning(SBSP)for the optimization of the team formation which is on top of Delanuay triangulation subdicision of the football field and a role assignment machenism based on Markov Decision Process(MDP)and using the Sarsa(?)algorithm based linear function approximation to study the value of the action function in the MDP model,which effectively solves the role assignment problem caused by multiple factors such as different robot type,orientation,speed and distance.Large amounts of experiments have proved that the proposed methods in this paper greatly improve the individual and overall performances of Apollo3 D,especially in the team formation and role assignment.

Keywords/Search Tags:

HTCondor, CMA-ES, Delaunay triangulation, MDP, Role assignment

PDF Full Text Request

Related items

1	Research On Delaunay Triangulation Algorithm
2	Delaunay Triangulation Generation Algorithm And Its Applied Research
3	Delaunay Triangulation And Application Study Of Polygonal Domains
4	Electronic Image Stabilization System Based On Delaunay Triangulation For Implementing Motion Estimation
5	Research On The 3d Real-time Interactive Computer Aided Design System Of Highway And Its Kernel Arithmetic
6	An Improved Delaunay Triangulation Algorithm
7	Surface Approximation Based On Delaunay Triangulation
8	Research On The Isoline Generation Algorithm Based On Delaunay Triangulation
9	Formation Analysis Of RoboCup Simulation 2D Based On Delaunay Triangulation
10	Research On Delaunay Triangulation Algorithm Based On Flip