
Research and Design of a Convolutional Neural Network Accelerator Based on Multi-FPGA Co-Acceleration

Posted on: 2022-02-26
Degree: Master
Type: Thesis
Country: China
Candidate: Y Wu
Full Text: PDF
GTID: 2518306575972289
Subject: Computer technology
Abstract/Summary:
Convolutional Neural Networks (CNNs), one of the most important algorithm families in Deep Learning (DL), adopt a weight-sharing network structure that reduces network complexity and are widely used in computer vision and other fields. Traditional convolutional neural networks are implemented on the central processing unit (CPU) or the graphics processing unit (GPU). At present, executing a convolutional neural network on a CPU is relatively inefficient and struggles to meet real-time computing requirements, while the high power consumption of the GPU makes it unsuitable for mobile platforms. The Field Programmable Gate Array (FPGA), with its rich computing resources, supports highly parallel computation and can therefore effectively accelerate convolutional neural networks. Targeting the parallel computing characteristics of convolutional neural networks and starting from the cost of hardware implementation, an efficient FPGA-based convolutional neural network accelerator is designed. The accelerator is realized on a software-hardware cooperative computing platform: the hardware part accelerates the network by unrolling the convolution computation, and the software part handles the control and display of image data. On this basis, a system accelerator architecture based on multiple FPGAs is proposed, in which the computation tasks are distributed across multiple FPGAs interconnected through high-speed interfaces; this architecture further improves system efficiency compared with the single-FPGA architecture. The hardware accelerator designed and implemented achieves the same recognition rate as existing work on the classification of the CIFAR-10 data set. In terms of computing performance, the single-FPGA accelerator reaches 17.9 GOPS (billion operations per second) and the multi-FPGA co-accelerator reaches 58.3 GOPS, a substantial improvement over CPU-based software recognition.
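The abstract states that the hardware side accelerates the network by unrolling the convolution computation. As a rough illustration of what such unrolling operates on, the minimal C sketch below shows a plain convolution layer whose innermost kernel and input-channel loops are the natural candidates for unrolling on an FPGA. The layer dimensions (K, CIN, COUT, H, W), the function name conv_layer, and the comments about which loops would be unrolled are illustrative assumptions, not details taken from the thesis.

/*
 * Illustrative sketch only: a plain-C convolution layer whose inner
 * loops are the natural candidates for unrolling on an FPGA.  The
 * dimensions and names below are assumptions for illustration and
 * are not taken from the thesis.
 */
#define K    3   /* assumed kernel size          */
#define CIN  16  /* assumed input channels       */
#define COUT 32  /* assumed output channels      */
#define H    32  /* assumed input feature height */
#define W    32  /* assumed input feature width  */

void conv_layer(const float in[CIN][H][W],
                const float weight[COUT][CIN][K][K],
                float out[COUT][H - K + 1][W - K + 1])
{
    for (int oc = 0; oc < COUT; oc++)               /* output channels */
        for (int r = 0; r < H - K + 1; r++)         /* output rows     */
            for (int c = 0; c < W - K + 1; c++) {   /* output columns  */
                float acc = 0.0f;
                /* The three loops below perform the K*K*CIN
                 * multiply-accumulates for one output pixel; these are
                 * the loops an FPGA design would typically unroll so
                 * that the products are computed in parallel. */
                for (int ic = 0; ic < CIN; ic++)
                    for (int kr = 0; kr < K; kr++)
                        for (int kc = 0; kc < K; kc++)
                            acc += in[ic][r + kr][c + kc]
                                 * weight[oc][ic][kr][kc];
                out[oc][r][c] = acc;
            }
}

In a typical high-level-synthesis flow, the inner kernel and channel loops would be fully unrolled and the outer pixel loops pipelined; how the thesis actually unrolls the computation and how the layers are partitioned across the interconnected FPGAs is not detailed in the abstract.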
Keywords/Search Tags: Convolutional Neural Networks, Hardware Acceleration, Field Programmable Gate Array, Parallel Computing