High-performance direct solution of finite element problems on multi-core processors

Posted on:2011-06-17

Degree:Ph.D

Type:Dissertation

University:Georgia Institute of Technology

Candidate:Guney, Murat Efe

Full Text:PDF

GTID:1448390002462814

Subject:Applied mechanics

Abstract/Summary:

PDF Full Text Request

The solution of linear system of equations is at the core of finite element (FE) analysis software. While engineers have been increasing the size and complexity of their models, the growth in the speed of a single computer processor has slowed. Today, computer manufacturers have increased overall processor performance by increasing the number of processing units in a computer using so-called multi-core processors. A FE analysis solver is needed which takes full advantage of these multi-core processors.In this study, a direct solution procedure is proposed and developed which exploits the parallelism that exists in current symmetric multiprocessing (SMP) multi-core processors. Several algorithms are proposed and developed to improve the performance of the direct solution of FE problems. A high-performance sparse direct solver is developed which allows experimentation with the newly developed and existing algorithms. The performance of the algorithms is investigated using a large set of FE problems. Furthermore, operation count estimations are developed to further assess various algorithms.A multifrontal method is adopted for the parallel factorization and triangular solution on SMP multi-core processors. A triangular solution algorithm that is especially efficient for the solution with multiple loading conditions is developed. Furthermore, a new mapping algorithm is designed to find independent factorization tasks that are assigned to the CPU cores in order to minimize the parallel factorization time. As the factorization and triangular solution times are reduced by the use of parallel algorithms, other components of FE analysis such as assembly of the stiffness matrix become a bottleneck for improving the overall performance. An assembled stiffness matrix is not required by the developed solver. Instead, element stiffness matrices and element connectivity information are the inputs. The developed solver never assembles the entire structural stiffness matrix but assembles frontal matrices on each core. This reduces not only the execution time but also the memory requirement for the assembly.Finally, an out-of-core version of the solver is developed to reduce the memory requirements for the solution. I/O is performed asynchronously without blocking the thread that makes the I/O request. Asynchronous I/O allows overlapping factorization and triangular solution computations with I/O. The performance of the developed solver is demonstrated on a large number of test problems. A problem with nearly 10 million degree of freedoms is solved on a low price desktop computer using the out-of-core version of the direct solver. Furthermore, the developed solver usually outperforms a commonly used shared memory solver.

Keywords/Search Tags:

Solution, Direct, Multi-core processors, Element, Developed, Solver, Performance, I/O

PDF Full Text Request

Related items

1	Dynamic thermal management of multi core processors using core hopping
2	A New Algorithm For Analysis Solver Of Stamping And Engineering Applications
3	Research And Implementation Of An All-Solution SAT Solver
4	Research And Design Of Embedded Operating System Based On Multi-Core Processors
5	Cross-layer customization for low power and high performance embedded multi-core processors
6	Research On Inter-communication For Multi-core Processors
7	Scalable Compiler Optimizations for Improving the Memory System Performance in Multi- and Many-core Processors
8	The Task Assignment And Algorithm Research On A Heterogeneous Multi-Core Processor
9	High-performance Video Codecs Based On Multi-core DSP Processors Technical Research
10	Performance Analysis And Enhancement Of Virtualization On Multi-core Processors