Floating-point unit design using Taylor-series expansion algorithms

Posted on:2010-12-12

Degree:Ph.D

Type:Dissertation

University:University of Southern California

Candidate:Kwon, Taek-Jun

Full Text:PDF

GTID:1448390002983594

Subject:Engineering

Abstract/Summary:

Due to the constant advances in VLSI technology and the prevalence of many applications that require floating-point operations, hardware support for floating-point arithmetic is an essential feature in high-performance computer systems, embedded systems as well as mobile applications. Over the past years, while addition and multiplication implementations have become increasingly efficient, support for division and other elementary functions such as square root has remained uneven. Although division and square root are relatively infrequent operations in traditional general-purpose applications, they are indispensable and becoming increasingly important, particularly in many modern applications. Furthermore, as the latency gap between addition/multiplication and division/square root grows, the latter operations increasingly become performance bottlenecks. Therefore, poor implementations of floating-point division and square root can result in severe performance degradation.;This dissertation presents various techniques for designing an area-efficient yet high-performance floating-point arithmetic unit using a high-order Taylor-series expansion algorithm with truncated powering units. First, we propose a floating-point divider unit based on a 3rd-order Taylor-series expansion algorithm with truncated powering units. This algorithm achieves fast computation by using truncated powering units, which compute the higher-order terms in the Taylor-series polynomial significantly faster than traditional multipliers with a relatively small hardware overhead. Through careful pipeline design, all multiply operations required by the division algorithm and floating-point multiply operations are executed by one multiplier to maximize area efficiency while achieving high performance. Second, we expand the algorithm and present a generalized floating-point divider design procedure using high-order Taylor-series expansion algorithms by exploring the trade-off space of design constraints for a given precision, which is necessary for an efficient implementation of a divider. Third, we extend the proposed floating-point divider to incorporate square root. Since Taylor's theorem enables us to compute approximations for many well-known functions, we can extend the proposed divider unit to incorporate a square root function. And due to the similarity between the Taylor-series approximations of these functions extending the existing divider to incorporate square root can be achieved with little area and latency overhead. Finally, this dissertation provides insight into trade-offs involving overall FPU organization alternatives using the proposed floating-point divider. Several design considerations and trade-off factors in floating-point unit implementation are evaluated for two types of FPU architectures optimized under different design goals.;The proposed arithmetic unit exhibits area efficiency as well as high performance required by many modern floating-point intensive applications such as scientific computing, CAD tools and 3D graphics rendering, which have a high percentage of division and square root operations.

Keywords/Search Tags:

Floating-point, Square root, Taylor-series expansion, Operations, Unit, Algorithm, Using, Applications

Related items

1	Verification Of Processor Floating-point Division/Square Root Unit Based On UVM
2	The Design And Implementation Of Floating Point Unit Based On ARMv7 Floating Point Instruction Set
3	Research On High Performance Floating Point Unit
4	Realization Of Adaptive Floating-point Multiplication, Division And Square Root Unit For Single, Double And Extended Precision
5	Research And Implement Of Floating-point Division And Square Root Unit Based On Unified Structure
6	Design And Implementation Of High-performance Floating-point Division And Square Root
7	Research And Design Of High Precision And High-performance Floating-point Division And Square Root Unit
8	A novel floating point arithmetic coprocessor
9	Design And Implementation Of FPU In X Microprocessor
10	Design Of A Floating-point Processor For High Precision Transcendental Function Operations