Research On Compiler Method For Dynamic Boundary Loop For Coarse Grained Reconfigurable Processor

Posted on:2020-07-24

Degree:Master

Type:Thesis

Country:China

Candidate:S Xie

Full Text:PDF

GTID:2428330620958900

Subject:Integrated circuit engineering

Abstract/Summary:

PDF Full Text Request

Coarse-grained Reconfigurable Architecture(CGRA)have been identified as a desirable platform for computationally intensive applications.However,existing mapping techniques only focus on the single-level loop and the innermost loop body.It limits the application areas for CGRA since the lack of efficient mapping skills for the dynamic boundary loop.This paper address the problems for single-level dynamic boundary loop and mixed boundary loop.CGRA is consist of regular processing element array(PEA).On the one hand,the functionality of PE is simple to acquire high performance.On the other hand,it cannot process complex control flow due to the lack of program counter(PC)and other hardware support.Firstly,we analyze the main structure of CGRA and the characteristics of existing mapping algorithms.Then,a low-cost extended CGRA is provided to support the mapping of the dynamic boundary loop.The Dynamic Boundary Static Schedule(DBSS)is proposed to map the dynamic boundary loop based on the extended CGRA.Compared to traditional mapping algorithms,DBSS maps loop body and loop itself about control-related operators at the same time.DBSS issues loop body at runtime according to loop condition.DBSS can map not only nested branches but also the single-level dynamic boundary loop.To address the mapping of hybrid static boundary loop and dynamic boundary loop,we proposed Mixed Boundary Static Schedule(MBSS).DBSS will involve lots of control-related operators about the static boundary loop and the dynamic boundary loop.MBSS adapts conventional loop unrolling to remove the layer of the static boundary loop firstly.Then,MBSS process the rest of layers of the dynamic boundary loop.MBSS only map control-related operators about the dynamic boundary loop,which improves the performance and simplifies the data flow graph.Finally,the extended CGRA is realized by Verilog and compiler skills is inserted as a pass into LLVM.Compared to the latest mapping algorithms,DBSS and MBSS achieve 2.2 � speedup on average and take performance improvement of 24% and 38% respectively.What's more,DBSS and MBSS save energy and the extra hardware overhead less than 2%.The proposed method owns better scalability and flexibility.

Keywords/Search Tags:

CGRA, PE, dynamic boundary loop

PDF Full Text Request

Related items

1	Research On Loop Optimization In Compiling System Of Coarse-Grained Reconfigurable Architectures
2	Study On The Optimum Design Method For Boundary Cable Nets Of Loop Antennas
3	Research And Implementation Of Reconfigurable Computing For Communication Baseband Signal Processing
4	Improving CGRA Utilization by Enabling Multi-threading for Power-efficient Embedded Systems
5	Research On In-Loop Filter Algorithm For H.264/AVC
6	The Study Of Boundary Detecting Algorithm Based On A2-MST And Ensemble Boundary
7	Dynamic Characterization of the IKK:IkappaBalpha:NF-kappaB Negative Feedback Loop Using Real-Time Bioluminescence Imaging
8	The Second Order Theoretical Modeling And Experimental Research Of Flexible Manipulator Based On The Moving Boundary
9	Tuning Pipeline Granularity Based On Feedback Directed Framework
10	Performance analysis and evaluation of dynamic loop scheduling techniques in a competitive runtime environment for distributed memory architectures