Parallel Computing Acceleration Research And Implementation Of Several Classification And Clustering Algorithms

Posted on:2022-02-23

Degree:Master

Type:Thesis

Country:China

Candidate:Y H Yang

Full Text:PDF

GTID:2518306740996979

Subject:Signal and Information Processing

Abstract/Summary:

In the era of big data, extracting valuable information quickly places heavy demands on computing speed. With the rapid development of computer hardware and software, high performance computing has been widely used in data mining. CUDA-based GPU heterogeneous computing and Spark distributed computing are two popular directions in this field. Based on CUDA and Spark, this thesis studies the parallel acceleration of classical clustering and classification algorithms. Three algorithms, Kmeans, K nearest neighbor and DBSCAN, are selected, and the performance bottleneck of each in serial computation is analyzed. Parallel acceleration schemes and optimization strategies are then designed and implemented according to the hardware and software characteristics of CUDA and Spark. The main work is as follows:

1. Parallelization of Kmeans: On the CUDA platform, the cluster centers are stored in the GPU’s constant memory to speed up access. The distance calculation module is parallelized. For the centroid update module, the atomic add operation provided by CUDA is used to perform the update in parallel on the GPU. Experimental results show that the CUDA-based parallel Kmeans significantly improves computing efficiency, and the larger the value of K, the greater the benefit of parallelization. The speedup over the serial CPU implementation reaches up to 414 times. On the Spark platform, broadcasting the centroid variables to the worker nodes reduces data requests, and caching data in memory reduces disk I/O, which speeds up the iteration process. The results show that the running time of the algorithm decreases and then tends to stabilize, which is consistent with Amdahl’s law. Compared with the kmeans algorithm in Spark MLlib, the parallel design in this thesis is more efficient for smaller values of k. The cuBLAS library is used to accelerate matrix multiplication.

2. Parallelization of K nearest neighbor: On the CUDA platform, the distance calculation module is split to reduce memory consumption and redundant calculation. The insertion sort algorithm is optimized: an ordered array of size k is maintained to reduce unnecessary branching. Experiments show that distance calculation based on cuBLAS is faster than the version based on global memory. The speedup over serial CPU computation reaches up to 237 times, and the algorithm maintains a good speedup across a range of parameter settings. For the k-nearest neighbor algorithm on Spark, the training set is broadcast to each node, and various efficient operators are designed to accelerate prediction.

3. Parallelization of DBSCAN: On the CUDA platform, an adjacency list index is designed for the neighborhood construction stage to reduce the memory consumed by the adjacency list and to improve memory access efficiency. Breadth-first search is used to parallelize cluster identification. Although this increases data transfer between the GPU and the CPU, it effectively solves the poor parallelism of DBSCAN, so the parallelism of the algorithm is improved and overall performance is accelerated. Experimental results show that the speedup of DBSCAN on the GPU over the CPU reaches up to 222 times, which is significant on large data sets.
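
The ordered array of size k mentioned for the K nearest neighbor work can be illustrated with a small serial sketch. This is not the thesis code: the function names (`insert_bounded`, `squared_distance`, `knn_predict`) and the majority-vote step are assumptions made for illustration, and the thesis runs this kind of selection on the GPU rather than in Python. The idea shown is only that a candidate farther than the current k-th neighbor is rejected with a single comparison, so most candidates never enter the sorting loop.

```python
# Illustrative serial sketch; all names here are hypothetical, not from the thesis.

def insert_bounded(best, candidate, k):
    """Insert (distance, label) into the ascending list `best`, keeping at most k items."""
    if len(best) == k and candidate[0] >= best[-1][0]:
        return  # not closer than the current k-th neighbor: one comparison, no sorting
    if len(best) < k:
        best.append(candidate)
    else:
        best[-1] = candidate  # overwrite the current farthest neighbor
    # Shift the new entry left until the list is in ascending order again.
    i = len(best) - 1
    while i > 0 and best[i - 1][0] > best[i][0]:
        best[i - 1], best[i] = best[i], best[i - 1]
        i -= 1


def squared_distance(a, b):
    """Squared Euclidean distance; the square root is not needed for ranking."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def knn_predict(train_points, train_labels, query, k):
    """Return the majority label among the k training points nearest to `query`."""
    best = []
    for point, label in zip(train_points, train_labels):
        insert_bounded(best, (squared_distance(point, query), label), k)
    votes = {}
    for _, label in best:
        votes[label] = votes.get(label, 0) + 1
    return max(votes, key=votes.get)
```

With k much smaller than the training set, the early return handles the common case, which is one plausible reading of how a bounded ordered array reduces branching compared with sorting all distances.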

Keywords/Search Tags:

Clustering, Classification, High Performance Computing, Spark, GPU

Related items

1	Research On Classification Using Land-Use Image Based On High Performance Computing
2	A High-Performance Chinese Distributed Computing System (CH-Spark)
3	A System For Distributed MD Data Analysis Based On Spark
4	Research On Large-scale Traffic Classification Technology Based On Spark Performance Optimization
5	High Performance Financial Computing Algorithms And Platform Implementation In Heterogeneous Framework
6	The Research And Implementation Of Parallel Algorithm For Bayesian Text Classification Based Spark Computing Environment
7	The Research And Application Of Large-Scale Image Classification And Robust Subspace Clustering Algorithm For Big Data
8	Research And Implementation Of Performance Modeling And Optimization Technology Of Spark Computing Framework
9	Improvement Of Spark-based Multi-density Clustering Algorithm And Its Application In Text Mining
10	Research And Application Of Clustering Method For Big Visual Data