An Acceleration Structure Of Convolutional Neural Network

Posted on:2021-05-07

Degree:Master

Type:Thesis

Country:China

Candidate:C Jiang

Full Text:PDF

GTID:2428330620464293

Subject:Engineering

Abstract/Summary:

PDF Full Text Request

Artificial intelligence technology has gradually become mature and practical with the help of the Internet,big data,high-performance chips and other technologies.With the continuous improvement of basic technologies such as DNN,artificial intelligence will enter various industries and radically change people's production and lifestyle.CNN is the representative algorithm of DNN,and cause extensive attention of highly efficient classification recognition in recent years.Its special structure with local weights sharing has unique advantages to reduce the complexity of the networks,and its layout is closer to real biological neural network;especially multidimensional networks input vector image can directly input this feature to avoid the extraction and classification.CNN algorithm has been widely used in machine learning,speech recognition,,document analysis,language detection and image recognition of artificial intelligence.The cloud needs to process massive data with high accuracy and needs certain universality.Due to the limitations of security,real-time and network bandwidth resources,CNN sinks to the terminal.It is of great research significance to study the CNN acceleration structure,whether to accelerate the cloud training speed,improve the calculation power and power ratio,or to meet the needs of terminal to improve the calculation power and reduce the cost and power consumption.Studies show that more than 90% of CNN's computation and time consumption are concentrated in the convolution layer,involving a large number of multiplication and addition operations.The systolic array represents a PE network that rhythmically computing and transmitting data through the system,regularly pumping in and out to maintain a regular flow of data.Making the data flow in computing units,it reduces the memory access,and makes the structure more orderly,more uniform wiring,and higher frequency.Systolic array is a special design with simple structure and low implementation cost.The CNN convolution operation is an ideal application to show the characteristics of systolic array.Therefore,this thesis proposes an efficient array architecture for CNN convolutional layers based on systolic arrays,which accelerates the convolution operation by changing the data flow and computing structure.And further improved on it and proposed an acceleration architecture for CNN convolutional layer based on broadcast architecture,which has greatly improved its throughput and power efficiency.

Keywords/Search Tags:

Deep learning, convolutional neural network, covolution accelerator strucuture, systolic array, broadcast architecture

PDF Full Text Request

Related items

1	VLSI Architecture Design For Binary Convolutional Neural Network Accelerator
2	Design Of Neural Network Accelerator Based On RISC-V Processor
3	A Convolutional Neural Network Accelerator Architecture With Fine-Granular Mixed Precision
4	Design And Optimization Of Tiny YOLO Convolutional Neural Network Accelerator
5	Design Of General-purpose Convolutional Neural Network Accelerator Based On FPGA
6	An Efficient General Accelerator For Convolutional Neural Network
7	The Study Of Many-core Deep Learning Accelerator Based On BWDSP
8	A High-reliability Deep Neural Network Accelerator With Hybrid Architecture
9	Design And Implementation Of A High-performance Accelerator Dedicated For Convolutional Neural Networks
10	Design And Implementation Of Lightweight Convolutional Neural Network Accelerator On SoPC