Research On Technology Of Model Compression For Convolutional Neural Networks

Posted on:2020-08-21

Degree:Master

Type:Thesis

Country:China

Candidate:L C Qian

Full Text:PDF

GTID:2428330620958897

Subject:Integrated circuit engineering

Abstract/Summary:

PDF Full Text Request

Convolutional neural networks are everywhere,their applications cover computer vision,speech recognition and natural language processing.Convolutional neural networks require a lot of computation,take up much memory and are difficult to be deployed on embedded systems which greatly limit their applications.This paper takes the object detection algorithm as an example,proposes a model compression method which can help the compressed model detects objects in real time on the embedded side.Since the main compression strategy is for the convolutional layers,this compression method can be extended to other computer vision algorithms which use a large number of convolutional layers.Firstly,this paper replaces convolutional layers which contain a lot of parameters and computations with depthwise separable convolutions.This method can greatly reduce the parameters and computations while ensuring the performance of the model.Secondly,this paper proposes a dynamic pruning method which is adaptive to step size.This method can dynamically determine the pruning step size and order which can greatly improve pruning efficiency according to the sensitivity of each convolutional layer.And if the model is retrained with the pruned structure,the model will perform better than before.Finally,this paper simulates the operation time of each layer in the heterogeneous computation platform ZCU102 and proposes a model compression strategy for different platforms.According to the experimental results,the object detection algorithm using the compression method proposed in this paper performs well on the PASCAL VOC dataset.The model size has decreased from 33.26 M to 9.7M,the final mAP has increased from 54 to 58.In the TX2 embedded system,the FPS increases from 7.23 to 9.56,which basically meets the requirements of real-time detection.Therefore,the model compression method proposed in this paper can effectively solve the problem of deploying real-time object detection algorithms on the embedded side.And the method can be extended to other Computer vision algorithms which contain a large number of convolutional layers.

Keywords/Search Tags:

Convolutional Neural Network, Computer Vision, Model Compression, Object Detection, Pruning

PDF Full Text Request

Related items

1	Research On Convolutional Neural Network Compression Method Based On Dynamic Pruning And Weight Resetting
2	Deep Model Compression In Computer Vision
3	Research On Compression Method For Convolutional Neural Network Based On Pruning
4	Research For Convolutional Neural Network Model Compression Algorithm Based On Object Detection
5	Pruning-Based Compression Method For Convolutional Neural Network
6	Research On Model Pruning Algorithms For Deep Convolutional Neural Networks
7	Research And Application Of Real-time Object Detection Based On Lightweight Convolutional Neural Network
8	Research On Embedded Object Detection Based On Deep Neural Network Compression
9	Research On High-efficiency Compression Technology For Convolutional Neural Networks
10	Research On Convolutional Neural Network Acceleration And Compression Methods