Research On Scalable Accelerator Design For Face Detection And Recognition Application

Posted on:2020-02-09

Degree:Master

Type:Thesis

Country:China

Candidate:Q Fu

Full Text:PDF

GTID:2518306548991119

Subject:Master of Engineering

Abstract/Summary:

PDF Full Text Request

Face detection and face recognition technology have been widely used.With the development of deep learning,the face detection and face recognition technology based on deep learning has surpassed the human eye recognition level,but it brings a sharp increase in the amount of calculation.Faced with many face detection and face recognition application scenarios,how to accelerate their inference performance has become an urgent problem to be solved.Based on FPGA platform,this paper studies the deep learning based face detection and recognition forward inference parallelization technologyThis paper first studies the process and characteristics of face key point detection algorithm and face recognition algorithm based on deep learning.A face recognition example is taken for specific research,and a fast algorithm suitable for hardware implementation is designed for face alignment.Then we study the quantization of the face detection algorithm,we use a low-bitwidth global quantization method,which reduces the bandwidth occupation by 50% with little affecting on the accuracy.In this paper,the general matrix multiplier accelerator is selected to accelerate the face-related application.This paper improves accelerator structure and models the performance of this hardware structure.The accelerator parallel search algorithm is designed to adjust the accelerator structure according to hardware resource conditions and different convolutional neural network structures to optimize the theoretical performance of the accelerator.Finally,the accelerator design was implemented on the FPGA of the Zynq7020 chip.The experimental results show that the accelerator can achieve a throughput of 35 GOPS on the platform.Compared with the CPU platform and GPU platform,the performance-topower consumption ratio is 15 and 5 times that of the former.

Keywords/Search Tags:

Deep learning, Face detection, Face recognition, FPGA platform, Performance model, Accelerator

PDF Full Text Request

Related items

1	Research On Face Recognition Based On Embedded Platform
2	Research On Face Detection And Recognition Method Based On Deep Learning
3	Algorithm Of Face Recognition Based On Deep Learning With Its Implementation On Embedded Platform
4	Face Recognition Algorithm Design And FPGA Verification Based On Deep Seperable Convolution
5	Research On Face Recognition Based On Deep Learning
6	Pose Variable Face Recognition Based On Deep Learning
7	Face Recognition Algorithm And Circuit Design Based On Embedding Feature Of Convolutional Network
8	Research And System Design Of Deep Face Recognition
9	Video Face Recognition System Based On Depth Network Design And Implementation
10	Face Detection And Recognition Based On Deep Learning And Its Application In Android Mobile Terminal