A CUOA-Based Parallel Algorithm Design And Implementation Of Symbolic Regression

Posted on:2013-07-02

Degree:Master

Type:Thesis

Country:China

Candidate:S Shao

Full Text:PDF

GTID:2248330395455314

Subject:Computer software and theory

Abstract/Summary:

PDF Full Text Request

With the rapidly growth of information data, people expect so badly to find a method to extractuseful information and knowledge from these magnanimous amount of data. This experience can bewidely used in business management, production control, market analysis, engineering design andscientific exploration. Symbolic Regression is a promising research aspect in the era of scientificcomputing for data mining. In short, the general purpose of Symbolic Regression is to find anoptimal symbol expression that best fits a target sample set, which has much experimental data.Currently, the mainstream methods to solve the problem of Symbolic Regression are almostbased on genetic programming (GP). For the reason that individuals in GP are extremely difficult toreproduce with modification, we adopt a popular evolutionary algorithm, gene expressionprogramming (GEP), in this paper.The GEP individuals are encoded as fixed length linearchromosomes (genotype), which are then translated into different sized and shaped non-linearentities named expression trees (ETs)(phenotype). GEP inherits the manipulative and expressiveadvantages of GAs and GP. In this paper, we will introduce a hybrid variant of GEP namedMIN-GEP, which improves this traditional GEP algorithm in performance and correctness. In theface of large scale datasets, we select the GPU as parallel computing platform and a newgeneral-purpose parallel programming model, namely Compute Unified Device Architecture(CUDA).We have allocated the task of fitness calculation on the GPU, which is one of majorbottlenecks in terms of execution time for GEP. It makes full use of GPU’s computing performanceand obtains a satisfactory speedup. Meanwhile, in order to improve the individual’s fitness value, wehave embedded Method of Least Square (MLS) into GEP, in which MLS focuses on finding theoptimal constant coefficients locally on the fixed function structure.In the part of experiments, we have selected thirty problems as test cases, which appeared in thepublished literature on constant creation issues in GP or GEP. In the performance experiment, theCUDA-based GEP has almost250speeds up against the serial version.Then we compare MIN-GEPwith other existing well-known constant creation methods in terms of performance and accuracy.TheMLS is far less time consuming than other local optimization methods, and provides a significantimprovement in performance.

Keywords/Search Tags:

GEP, Symbolic Regression, GPU, CUDA, MLS

PDF Full Text Request

Related items

1	Implementation And GPU Acceleration Of Symbolic Regress-ion Based On Gene Expression Programming
2	Research And Applications Of GEP In Symbolic Regression
3	Study On Over Fitting In Symbolic Regression
4	Theory Of Genetic Programming And Its Application In Symbolic Regression
5	Research On Symbolic Regression And Evolution Algorithm Based On Machine Learning
6	Research In Genetic Programming And Its Application In Symbolic Regression
7	Prioritized Grammar Enumeration: A novel method for symbolic regression
8	Research And Implementation Of Divergences Analysis Based On Dynamic Symbolic Execution
9	Research And Implementation For Method Of Component Regression Testing
10	The Research And Practice On Symbol Regression Based Space Division