Research On The Architecture Of Lightweight Convolutional Neural Networks

Posted on:2020-11-30

Degree:Master

Type:Thesis

Country:China

Candidate:C Lv

Full Text:PDF

GTID:2428330575480273

Subject:Computer application technology

Abstract/Summary:

PDF Full Text Request

With the recovery of artificial intelligence,computer vision has been developing at high speed in recent years.As one of the fundamental models in computer vision,convolutional neural networks are the cornerstone of numerous researches and applications in this field.The performance of convolutional neural networks has a direct impact on the upper bound of many other computer vision tasks such as object detection,semantic segmentation,face recognition,and visual question answer.The improvement of convolutional neural networks could lead to the progress of all related computer vision systems.Convolutional neural networks are one of the research highlights in recent years.After years of development,the research of convolutional neural networks has converted from large models with high accuracy to lightweight models which are more suitable for practical applications.The aim of lightweight convolutional neural networks is to keep comparable accuracy with large models,and at the same time reduce the size of models,decrease training and inference time,and even make the models run with embedded devices.With lightweight convolutional neural networks,more researches in computer vision could apply to industrial products and services.This paper focuses on the construction of lightweight convolutional neural networks from the perspective of decomposing convolutions.We argue that decomposing convolutions is the main idea in the progress of lightweight convolutional neural networks.We make a new interpretation of some important lightweight convolutional neural networks from the perspective of decoupling convolutions.Depthwise convolution is one of the primary modules of lightweight convolutional neural networks.Depth-wise convolution decouples the study of spatial correlations from cross-channel correlations.But depth-wise convolution is now the bottleneck of lightweight convolutional neural networks.Shift module and active shift module are efficient alternatives to depth-wise convolution.To improve the limited expressive abilities of shift module and active shift module,we propose a new component of lightweight convolutional neural networks called multi-active-shift module.The main work of this paper could be summarized in the following three aspects:First,we make a new interpretation of the progress of lightweight convolutional neural networks from the perspective of decoupling spatial correlations and crosschannel correlations.Second,we introduce two alternatives to depth-wise convolution,shift module,and active shift module.We prove that shift module is equivalent with standard convolution with sparse kernels and interpret shift module from the perspective of decomposing and re-integrating convolution.Third,we propose a new component of lightweight convolutional neural networks multi-active-shift module to improve the expressive abilities of shift module and active shift module.We use multi-active-shift module to construct a new light-weight convolutional neural network called MASNet,and valid its superiority in speed and accuracy on CIFAR10/100 and ImageNet 2012 datasets.

Keywords/Search Tags:

Lightweight Convolutional Neural Networks, Network Architecture, Decomposing Convolutions, Shift Operator, Active Shift Layer, Multi-active-shift Layer

PDF Full Text Request

Related items

1	Software And Hardware Acceleration Design Of Shift Convolutional Neural Networks
2	Research Of The Implementation Of Multi-layer Hybrid Architecture And Key Technologies For Air Cargo Terminals Logistics Monitoring System
3	Active Vision System Based On Mean Shift And Fuzzy Control
4	Design Of Sparse Convolutional Neural Network Accelerator Based On Shift Unit
5	Research On Brillouin Frequency Shift Extract Technology Based On Convolutional Neural Network
6	Hybrid Evolutionary Algorithms Based On Mean Shift
7	Application Of The Active Face Tracking Based On The Mean Shift Algorithm In Video Surveillance System
8	Research On Diffusion Layer Based On Cyclic Shift And XOR Operation
9	Active Deceptive Jamming Against SAR Based On Convolutional Modulation
10	The F <sub> 4 </ Sub> On The ��-linear Feedback Shift Register