Research On Compression Method For Convolutional Neural Network Based On Pruning

Posted on:2021-03-18

Degree:Master

Type:Thesis

Country:China

Candidate:Y N Xu

Full Text:PDF

GTID:2428330614459251

Subject:Software engineering

Abstract/Summary:

PDF Full Text Request

Nowadays,the Convolutional Neural Networks(CNNs)have achieved impressive performance on many computer vision related tasks such as object detection and image recognition.However,the remarkable performance of CNNs has been accompanied by a significant increase of computation and memory costs,which prohibits their usage on resource-limited environments such as mobile or embedded devices.To this end,the CNN compression and acceleration has recently become an emerging area where researchers are devoted to reducing the consumption of computation and memory.Pruning is one of the predominant approaches for CNN compression,which can effectively find out and remove the redundant part of CNNs with minimal loss.In this thesis,two structured pruning methods and are proposed to compress and accelerate modern CNNs.The contributions are summarized as follows:1.Existing unstructured pruning methods lack the practical implementations to achieve the realistic acceleration.To address the above-mentioned problem,this thesis proposes a structured Saliency-based Global Filter Pruning(SGFP)approach.The SGFP calculates the gradient information of filters generated by multi-batch samples during training process.Specifically,the normalized magnitude of the gradient value is defined as the saliency criterion,which measures the effect of removing filters on network performance.The approach can construct a global ranking by setting a global pruning rate.With global ranking,one-shot filter pruning can be done effectively.In addition,SGFP adopts the group pruning to solve the misalignment problem when pruning the filters of Res Net.Empirical results demonstrate that the proposed approach can better trade-off accuracy for FLOPs.2.To eliminate the need for the expensive prune-retain cycles,this thesis proposes a filter pruning approach by combining the momentum with SGFP.The approach contains four steps.Firstly,it takes the mean of the momentum magnitude that belongs to all nonzero filters for each layer.The resulting proportion is momentum magnitude contribution for each layer.Then,it uses saliency criterion in SGFP to evaluate the importance of each filter and uses a pruning rate to select unimportant filters for each layer.Next,it sets the value of selected filters to zero,which can temporarily eliminate their contribution to the network output.Particularly,the pruning rate is decayed.Finally,it regrows filters proportionately by enabling the zero-valued(missing)filters which have the largest momentum magnitude.The extensive experiments show that the proposed approach can improve the pruning accuracy,especially for large CNNs such as Res Net56.

Keywords/Search Tags:

computer vision, network compression, network acceleration, structured pruning, filter pruning

PDF Full Text Request

Related items

1	Research On Deep Neural Network Model Compression Method Based On Parameter Pruning
2	Pruning-Based Compression Method For Convolutional Neural Network
3	Deep Model Compression In Computer Vision
4	Research On Convolutional Neural Network Compression Method Based On Dynamic Pruning And Weight Resetting
5	Research On Compression And Acceleration Of Deep Neural Network Based On Model Pruning
6	Research On Evolutionary Based Automated Neural Network Compression
7	Research And Application Of Structured Model Compression Algorithm In Deep Neural Network
8	Research On Deep Network Compression Method Based On Model Gradient Information
9	Similarity-Based Approach To Neural Network Pruning
10	Research Of The Model Compression Algorithm For Deep Neural Network