Research On Parallel Solving Of Large Scale Sparse Linear Systems

Posted on:2020-06-07

Degree:Master

Type:Thesis

Country:China

Candidate:L C Song

Full Text:PDF

GTID:2428330578966717

Subject:Information and Communication Engineering

Abstract/Summary:

PDF Full Text Request

Currently,the solution of large-scale sparse linear systems is an important part of many scientific computing and engineering techniques.In some sparse linear system solving tasks based on direct method,the calculation of linear trigonometric system is the core link for solving large-scale sparse linear systems.Therefore,the rapid solution of the sparse linear triangulation system becomes the key to solving the scientific calculation problem.In recent years,as the scale and complexity of scientific computing problems have increased,the size and complexity of sparse linear triangulation systems are also growing,resulting in a sudden increase in the amount of data that needs to be processed.However,the existing methods are limited by the traditional view of linear trigonometric system solving,that is,the solution of a certain variable must wait until all its precursor variables have been solved.This approach not only limits the degree of parallelism that can be achieved when solving,and does not take advantage of the rich parallel hardware resources of many-core processors.Moreover,frequent data transfer between threads makes synchronization overhead larger,even offsets the advantages brought by parallel computing technology.In order to solve the problems in the existing methods,this thesis proposes a parallel algorithm based on partial value addition.The method first calculates the partial values of the variables in parallel,and then adds all the partial values of the variables to get the final values.Since the variables are calculated without waiting for all the predecessor variables to complete the calculation,the parallelism and calculation speed are greatly improved.In the work,the parallel solving algorithm is implemented based on the CUDA computing platform.The algorithm decomposes the association graph representing the order of the variables into multiple subgraphs.Each thread calculates a layer of subgraphs,making full use of the GPU's rich parallel computing resources.Second,in order to reduce the impact of memory access on the performance of the algorithm,make full use of the large global memory capacity and low shared memory latency,the parallel algorithm proposed in this thesis is optimized.The experimental results show that compared with the calculation time of the cuSPARES library and the non-synchronous parallel algorithm,the calculation speed of the parallel algorithm is increased by 80% on average,the maximum increase is 99%.Under the premise of ensuring the calculation accuracy,the solution speed of the sparse linear triangulation system is greatly improved.

Keywords/Search Tags:

GPU multi-core processor, Parallel computing, Large-scale sparse linear triangular system, Add the partial values of a variable

PDF Full Text Request

Related items

1	Research On Parallel Program Performance Tuning On Multi-Core Computing Platform
2	The Parallel Drawing Of Large-scale Terrain Models Based On Multi-core Platform
3	Research On Multi-Core Parallel Computing In Streaming Media Service System
4	The Study Of Parallel Algorithms For The Solution Of Special Structured Large Linear System Of Equations Under Distributed Memory Environment
5	Research Of Parallel Evolutionary Algorithm Based On Sunway Manycore Architecture
6	Verifying Parallel Low-Level Programs For Multi-core Processor
7	Performance Optimization Of Sparse Lower Triangular Solver On Sunway Architecture
8	Parallel Rendering Of Large-scale DEM Based On Multi-core CPU
9	Research On Key Technologies Of Parallel Optimization For Multi-computing Platforms For Large-scale Applications
10	Parallel Technology Based On K-SVD Image Denoising Algorithm In General Computing Platform