Design And Implementation Of The Software Pipelined Loop Buffer For The Innovative X DSP

Posted on:2014-11-20

Degree:Master

Type:Thesis

Country:China

Candidate:J X Chen

Full Text:PDF

GTID:2298330422474206

Subject:Software engineering

Abstract/Summary:

PDF Full Text Request

Plenty of loop opearations exist in the DSP algorithm. It is an important way to improve theperformance of microprocessor by exploring instruction level parallelism between loop bodies.The scheduling technology of loop bodies includes loop unrolling and software pipelining. Thetechnique of software pipelining is studied to improve the executing efficiency of loop programsin the innovative X DSP.The software pipelined loop buffer is designed and implemented.In this dissertation, the techniques of loop unrolling and software pipelining are describedand compared detailedly. Based on the requirements and characteristics of the X DSP, a novelloop buffer is designed based on the modulo scheduling algorithm. This unit locates in thedispatch phase of the instruction pipeline, used to store and dispatch instructions of loop bodies.It reduces memory access latency and frequency to decrease the performance disadvantage ofDSPs. The main work in this dissertation are followed.1) Based on the analysis of executing characters of for loops and while loops, the wholearchitecture of the loop buffer is pictured. The control module and store-dispatch module of theloop buffer are also designed.2) A tracking and comparing mechanism of loop instructions is designed to serve for theloading, draining and reloading of instructions, Realizing the exact storage and distribution ofloop instructions.3) Two mechods of counter comparing and interrupt draining are designed to gain theaccurate interrupt of loop programs.4) Verification methods are studied. The verification platform is builded to serve thesystem-level verification of the loop buffer.5) The performance of the loop buffer is evaluated, using a matrix multiply-accumulateprogram and three typical image algorithm programs. Its utilization ratio for the above fourprograms is95.34%、90.61%、88.85%and89.94%, respectively. It decreases the frequency ofprogram memory access and reduces the dynamic power consumption of the program memory.6) Based on a45nm technology, logic synthetize is conducted for the loop buffer. Its workfrequency is up to1GHz and its area is76778.69um2. Its dynamic power is28.99mW and thecell leakage one is1.83mW.There are up to11232-bit instructions that stored in the loop buffer. On the control ofspecial loop instructions, the loop body instructions are stored and dispatched, greatlyimproving the execution efficiency of loop programs.

Keywords/Search Tags:

DSP, Loop Unrolling, Software Pipelining, Modulo Scheduling, LoopBuffer

PDF Full Text Request

Related items

1	Research On Software Pipelining Techniques For EPIC Architectures
2	Research On Optimization Of Software Pipelining For IA-64 Architecture
3	Research On Software Pipelining Based On GCC
4	Loop Realization And Optimization Based On X Stream Processor
5	Research And Implementation On Software-Pipelined Eliminating Technology In IA-64 Binary Translation
6	Research On Inner Loop Unrolling Method For Vector DSP
7	Exploiting Some Key Techniques On The Loop Unrolling In GCC Compiler
8	Iterative Loop Unrolling Optimizing For Embedded Systems
9	Research Of Automatic Compiler Tuning Base On Machine Learning
10	Loop unrolling along wavefronts and wavefront based techniques for exploiting instruction and thread level parellelism