Research And Application On Ant Colony Clustering Based On Reinforcement Learning

Posted on:2012-05-08

Degree:Master

Type:Thesis

Country:China

Candidate:H Chen

Full Text:PDF

GTID:2218330368992254

Subject:Computer software and theory

Abstract/Summary:

PDF Full Text Request

Ant colony algorithm is a typical example which uses swarm intelligence to solve practical problems, it was first proposed by the Italian scholar Dorigo 90 years in the 20th century. As the global optimization and random searching method, it has many advantages such as simplicity, robustness, distribution and parallel. Since it was proposed, it has been applied to TSP problems, Quadratic assignment problems, machine learning and other aspects. Despite all this, there are many problems should be perfect urgently in theory and application methods, such as easy to fall into stagnation and slow convergence.Some improved methods are proposed to avoid falling into stagnation and low speed of convergence, and the main research results are concluded as follows:i. Proposing a mechanism of dynamic evaporation rate to balance the contradict between solution efficiency and solution quality. Setting evaporation rate for a big value at the beginning, thus the colony will do as much as possible of the search, increasing the possibility of finding the optimum solution; while at the end of the algorithm, setting small value for evaporation rate, for the algorithm should not be in too much to explore, but to speed up the convergence around the possible optimal solutions. The experiments show that the algorithm can quickly converge to the optimal solution; that is it will not only increase the global search capability, but also to a certain extent, accelerate the speed of convergence.ii. Introducing the thought of reinforcement mechanism from reinforcement learning. Reinforcement learning thought is introducd into the updating mechanism of pheromone of ant colony algorithm, when updating pheromone after each iteration, reward the best current path with an additional random amount of reward, and also punish the worst path with giving a random amount of punishment. iii. Redefining the heuristic information to guide the algorithm converge fast. Adjusting the definition of heuristic information to make more efficient use of heuristic information to speed up the convergence in the algorithm later. The heuristic information would not be constant, but continuously been adjusted with the iterative process moving forward, ensuring the algorithm converges to the optimal solution quickly.iv. Introducing the boundary symmetric mutation strategy to get variation of Iteration results symmetrically. Paths are almost complex overlapping in "the center" area, so the difference of the tour path is more determined by "boundary" paths. When applying the variation strategy, we mainly aimed at "boundary"paths, such as mutation can improve the efficiency of mutation, also can obtain better quality solutions.

Keywords/Search Tags:

Ant Colony Optimization, Reinforcement Learning, Clustering Analysis, Dynamic evaporation, Mutation Strategy

PDF Full Text Request

Related items

1	Research On Optimization Technology Of Test Case Generation For Fuzz Testing
2	Research On Improved Methods And Convergent Theory Of Artificial Bee Colony Algorithm
3	Research On Clustering Algorithm And Its Application Based On Reinforcement Learning
4	Research Of Reinforcement Learning Based On Clustering Analysis
5	Research On Mutation Execution Strategy For Mutation Based Fault Localization
6	Reinforcement Learning Of Motion Control Strategy In The Environment Of Multiple Mobile Robots
7	Ant Colony Algorithm For Improvement And Its Application In The Cluster Analysis
8	Study On The Improved Artificial Bee Colony Algorithm Based Fuzzy Clustering
9	Research On Clustering Algorithm Baesd On Group Search Optimization
10	Multi-Agent Dynamic Hierarchical Reinforcement Learning Based On Hybrid Abstraction