With the development of convolutional neural network (CNN) technology in deep learning, CNN-based model architectures are now widely used. To improve performance, network models tend to grow more complex and larger, and neural networks themselves contain computational redundancy, so edge devices often cannot meet the computational requirements of complex models. Model compression for deep convolutional neural networks has therefore been widely studied: effective compression algorithms reduce redundancy and turn complex models into lightweight ones suited to a broader range of application scenarios. This paper studies lightweight convolutional neural networks and structured model pruning.

First, we manually design lightweight convolutional modules to construct the neural network RDPNet. The core idea is an RDP module built on depthwise separable convolution and structural reparameterization: a multi-branch structure is used for training and is reparameterized into a single-branch structure for inference. On this basis, the overall network is constructed with appropriate adjustments to depth and width. Experimental results show that RDPNet outperforms comparable lightweight neural networks, achieving a good balance between model performance and inference speed.

Second, we propose an improved Global Adaptive Pruning method, a structured dynamic pruning approach. Sparse training yields a model with a sparse solution; channel importance is judged from the training mask, and redundant channels are then pruned to compress the model. The advantage of this approach is that it explores the model's implicit architecture and judges channel importance dynamically during training, achieving better compression results.

Finally, the two proposed compression methods are applied in an edge-device environment. On top of them, INT8 quantization further compresses the model and accelerates computation. Running inference with the compressed models on edge devices verifies the effectiveness of the compression methods proposed in this paper.
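To make the reparameterization idea concrete, the following is a minimal sketch of a multi-branch depthwise block that is fused into a single branch for inference. The class name RDPBlockSketch, the specific branch choice (parallel 3x3 and 1x1 depthwise convolutions), and the fusion details are illustrative assumptions, not the paper's actual RDP module.

```python
import torch
import torch.nn as nn

class RDPBlockSketch(nn.Module):
    """Hypothetical sketch: parallel 3x3 and 1x1 depthwise branches
    during training, fused into one 3x3 depthwise conv for inference."""
    def __init__(self, channels):
        super().__init__()
        self.dw3 = nn.Conv2d(channels, channels, 3, padding=1,
                             groups=channels, bias=True)
        self.dw1 = nn.Conv2d(channels, channels, 1,
                             groups=channels, bias=True)
        self.fused = None  # set by reparameterize()

    def forward(self, x):
        if self.fused is not None:           # inference: single branch
            return self.fused(x)
        return self.dw3(x) + self.dw1(x)     # training: multi-branch

    @torch.no_grad()
    def reparameterize(self):
        # Convolution is linear, so summing two branches equals one
        # conv whose kernel is the 3x3 kernel plus the zero-padded
        # 1x1 kernel, with biases added.
        k = self.dw3.weight.clone()
        k[:, :, 1:2, 1:2] += self.dw1.weight
        b = self.dw3.bias + self.dw1.bias
        c = k.shape[0]
        self.fused = nn.Conv2d(c, c, 3, padding=1, groups=c, bias=True)
        self.fused.weight.copy_(k)
        self.fused.bias.copy_(b)
```

After training, calling `reparameterize()` collapses the block so that inference pays for only one convolution, which is the source of the speed benefit claimed for the training/inference structural split.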
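The structured pruning step can likewise be sketched in simplified form. The paper's Global Adaptive Pruning judges channel importance dynamically from a training mask; the stand-in below instead ranks channels statically by BatchNorm scale magnitudes learned under a sparsity penalty (network-slimming style), so the function name, the threshold rule, and the use of BN scales are all assumptions for illustration.

```python
import torch
import torch.nn as nn

def channel_masks_sketch(model, keep_ratio=0.5):
    """Hedged sketch: rank channels by |gamma| of BatchNorm layers
    trained with an L1 sparsity penalty; keep the top keep_ratio
    fraction globally and mark the rest as redundant."""
    gammas = torch.cat([m.weight.abs().flatten()
                        for m in model.modules()
                        if isinstance(m, nn.BatchNorm2d)])
    threshold = torch.quantile(gammas, 1.0 - keep_ratio)
    # Boolean keep-mask per BN layer; actually removing channels then
    # requires rewiring the adjacent conv layers accordingly.
    return {name: (m.weight.abs() >= threshold)
            for name, m in model.named_modules()
            if isinstance(m, nn.BatchNorm2d)}
```

Because the threshold is computed over all layers at once, the kept channels are allocated globally rather than per layer, which mirrors the "global" aspect of the method described above.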
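Finally, as a rough illustration of the INT8 deployment step, here is a sketch using PyTorch's eager-mode post-training static quantization. The calibration loader, the "fbgemm" backend choice, and the absence of layer fusion are assumptions; an actual edge deployment would adapt these to the target hardware and runtime.

```python
import torch
import torch.quantization as tq

def quantize_int8_sketch(model_fp32, calib_loader):
    """Hedged sketch: post-training static INT8 quantization.
    Observers collect activation ranges on calibration data,
    then weights and activations are converted to INT8."""
    model_fp32.eval()
    model_fp32.qconfig = tq.get_default_qconfig("fbgemm")
    prepared = tq.prepare(model_fp32)        # insert observers
    with torch.no_grad():
        for images, _ in calib_loader:       # calibrate ranges
            prepared(images)
    return tq.convert(prepared)              # INT8 model
```

Applied after pruning, this step shrinks weights from 32-bit floats to 8-bit integers (roughly a 4x size reduction) and enables integer arithmetic, which is what yields the additional compression and acceleration on edge devices.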