Research On Compression Algorithm And Application Of Deep Convolution Neural Networks Based On Network Pruning

Posted on:2024-04-17

Degree:Master

Type:Thesis

Country:China

Candidate:K Song

Full Text:PDF

GTID:2568306941464124

Subject:Computer technology

Abstract/Summary:

PDF Full Text Request

Deep convolutional neural networks have made a big success in recent years and become one of the most popular research directions.Its success has been accompanied by the number of parameters increasing exponentially,resulting in expensive computational and storage costs.To address this problem,model compression has emerged.This thesis mainly studies model compression methods based on network pruning.The main work is divided into three parts as follows:(1)This thesis proposes a filter pruning algorithm based on similarity clustering.Unlike previous filter importance-based pruning methods,this algorithm focuses on the similarity of filters in the same layer and removes the filters with high similarity.In this algorithm,the similarity is measured using Euclidean distance,and the smaller the distance is,the more similar the filters are.The similar filter pair with the smallest distance between filters is selected,and the k-nearest neighbor distance sum is calculated for each filter in the filter pair,then the filter with the smallest distance sum is pruned.The removed filter can be replaced using its nearest neighbor filter.In this thesis,we conduct experiments on several datasets to verify the effectiveness of this algorithm.On the CIFAR10 dataset,our method reduces more than 70%of FL OPs and parameters on GoogLeNet,and the accuracy is even improved by 0.09%over the benchmark model.(2)This thesis proposes a filter pruning algorithm based on layer redundancy.Most of the previous works use global uniform pruning rate.We believe that the global uniform pruning rate cannot obtain the optimal pruning effect,so we propose a filter pruning algorithm based on layer redundancy.The algorithm uses the Euclidean distance to measure the similarity between filters and the Taylor expansion loss function to approximate the pruning sensitivity of filters,and then combines them to measure the redundancy of each layer in the convolutional neural network.Depending on the redundancy,different pruning rates are set.Through experimental validation,this method performs well on most different network models.Among them,after removing 55.1%of FLOPs from ResNet-56 on the CIFAR10 dataset,the accuracy reaches 93.27%,which is 0.13%higher than the accuracy of the filter pruning method using similarity-based clustering only.(3)This thesis designs and implements an application system based on network pruning.The system is developed by using PyQt5 framework with the pruning algorithm proposed in this thesis.The system includes the functional modules of model training,model pruning and image recognition.Users can choose models for training and pruning.By using this system to compare the changes of the model before and after pruning,we can demonstrate the effectiveness and practicality of the network pruning algorithm proposed in this thesis.

Keywords/Search Tags:

Filter Pruning, Model Compression, CNN, Network Pruning

PDF Full Text Request

Related items

1	Research On Compression Algorithm And Application Of Deep Convolution Neural Networks Based On Network Pruning
2	Pruning-Based Compression Method For Convolutional Neural Network
3	Research On Compression Method For Convolutional Neural Network Based On Pruning
4	Research On Adaptive Soft Pruning Algorithm Based On Sensitivity Feedback
5	Research On The Simplification Method Of Neural Network Mode
6	Research On Filter Prunning Method Of Deep Convolution Neural Network
7	Research On Pruning Algorithm Of Classification Model Based On Neural Network
8	Structured Pruning Of Convolutional Neural Networks With Enhanced Linear Representation Redundanc
9	Research On Convolutional Neural Network Model Compression Based On Pruning
10	Convolutional Neural Network Compression By Fusing Weight And Filter Pruning