FPGA acceleration of sequence analysis tools in bioinformatics

Posted on:2014-03-29

Degree:Ph.D

Type:Dissertation

University:Boston University

Candidate:Mahram, Atabak

GTID:1458390005996319

Subject:Engineering

Abstract/Summary:

With advances in biotechnology and computing power, biological data are being produced at an exceptional rate. The purpose of this study is to analyze the application of FPGAs to accelerating high-impact production biosequence analysis tools. Compared with the alternatives, FPGAs offer high compute power, lower power consumption, and reasonable flexibility.

BLAST has become the de facto standard in bioinformatic approximate string matching, and so its acceleration is of fundamental importance. It is a complex, highly optimized system consisting of tens of thousands of lines of code and a large number of heuristics. Our idea is to emulate the main phases of its algorithm on the FPGA. Using our FPGA engine, we quickly reduce the database to a small fraction of its original size and then use the original code to process the query against what remains. Using a standard FPGA-based system, we achieved a 12x speedup over a highly optimized multithreaded reference code.

Multiple Sequence Alignment (MSA), the extension of pairwise sequence alignment to multiple sequences, is critical to solving many biological problems. Previous attempts to accelerate Clustal-W, the most commonly used MSA code, have directly mapped a portion of the code to the FPGA. We use a new approach: we apply prefiltering of the kind commonly used in BLAST to perform the initial all-pairs alignments. This results in a speedup of 80x to 190x over the CPU code (8 cores). The quality is comparable to the original according to a commonly used benchmark suite evaluated with respect to multiple distance metrics.

The challenge in FPGA-based acceleration is finding a suitable application mapping. Unfortunately, many software heuristics do not map well to FPGAs, and so other methods must be applied. One is restructuring: an entirely new algorithm is applied. Another is to analyze application utilization and develop accuracy/performance tradeoffs. Using our prefiltering approach and novel FPGA programming models, we have achieved significant speedup over reference programs. We have applied approximation, seeding, and filtering to this end. The bulk of this study presents the advantages and drawbacks of these acceleration models for biosequence analysis tools.
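
The prefiltering idea in the abstract (shrink the database quickly, then hand the survivors to the unmodified tool) can be illustrated with a minimal software sketch. This is not the dissertation's FPGA design: it assumes exact k-mer seed matching, and the names `kmers`, `count_seed_hits`, `prefilter`, the word length `k`, and the threshold `min_hits` are all hypothetical choices made for illustration.

```python
from typing import Dict, List, Set


def kmers(seq: str, k: int) -> Set[str]:
    """Return the set of all length-k substrings (words) of seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def count_seed_hits(query_words: Set[str], subject: str, k: int) -> int:
    """Count positions in subject whose k-mer also occurs in the query."""
    return sum(
        1
        for i in range(len(subject) - k + 1)
        if subject[i:i + k] in query_words
    )


def prefilter(query: str, database: Dict[str, str],
              k: int = 4, min_hits: int = 2) -> List[str]:
    """Keep only database entries with at least min_hits seed hits.

    Assumption: exact word matches stand in for the seeding stage;
    a real tool would use its own scoring and extension heuristics.
    """
    query_words = kmers(query, k)
    return [
        name
        for name, seq in database.items()
        if count_seed_hits(query_words, seq, k) >= min_hits
    ]


# Toy database (made-up sequences, for illustration only).
database = {
    "seq_a": "ACGTACGTGGA",
    "seq_b": "TTTTTTTTTTT",
    "seq_c": "GGACGTACCTT",
}

# Only the survivors would be passed on to the original, unmodified code.
survivors = prefilter("ACGTACGT", database, k=4, min_hits=2)
print(survivors)
```

In this sketch the filter trades sensitivity for speed through `min_hits`: raising it discards more of the database but risks dropping weak true matches, which is the kind of accuracy/performance tradeoff the abstract refers to.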

Keywords/Search Tags:

Analysis tools, Acceleration, FPGA, Sequence
