Research On Vehicle Detection Application Based On Ascend NPU And Lightweight Convolutional Neural Network

Posted on:2022-10-01

Degree:Master

Type:Thesis

Country:China

Candidate:H Guo

Full Text:PDF

GTID:2492306572997749

Subject:Computer technology

Abstract/Summary:

PDF Full Text Request

The accuracy of the general target detection convolutional neural network algorithm continues to improve,and with it comes the explosive growth of the amount of calculation and the amount of parameters.In contrast,the emerging fields of intelligent transportation,autonomous driving,and intelligent security all use embedded AI devices,which poses new challenges for convolutional neural network algorithms.In-depth compression of highperformance target detection networks can greatly reduce the amount of network parameters.This is essential to deploy target detection convolutional neural networks on embedded AI devices that lack computing resources.The Ascend NPU uses a customized Da Vinci architecture.How to realize the fast convolution calculation process and give full play to the advantages of the Ascend NPU is the key to vehicle detection applications.On the basis of common target detection,a lightweight feature extraction network is used as the backbone network to design and train a lightweight vehicle detection network with high detection accuracy and small parameters.Aiming at the problem of uneven feature distribution before and after low-precision quantization,Kullback-Leibler divergence is used to measure the difference in information distribution before and after quantization,and the quantization mapping interval is calibrated.Aiming at the problem of wide distribution of scale parameters in the batch normalization layer during the pruning process,the L1 regularization is used to constrain the scale parameters,and the network is sparsely trained to improve the efficiency of pruning.Aiming at the byte alignment problem of data transmission in Ascend NPU,a five-dimensional data transmission format is designed,and the convolution data is tiled to realize fast convolution calculation.For the ordinary convolution calculation process,the use of multi-stage pipeline and multi-core parallel technology to further increase the calculation speed.The experimental results show that the vehicle detection of the lightweight backbone network achieves a compression rate of 93% on Center Net and only a loss of accuracy of4.89%.The 8-bit quantization based on KL divergence calibration reduces the accuracy of the YOLO v4 network model by only 0.87%,and the model size is reduced by four times.The sparse pruning method further compresses the model by 65% under the lightweight Center Net model.The final vehicle detection model is only 2.64 MB,and the detection accuracy after pruning is almost unchanged.After Center Net accelerated by multi-stage pipeline and multi-core in parallel,the inference time on the Ascend NPU reaches 4.547 ms per frame,which far exceeds the speed requirement of real-time detection.

Keywords/Search Tags:

Ascend NPU, lightweight model, low-precision quantification, model pruning, parallel computing

PDF Full Text Request

Related items

1	Front Vehicle And Pedestrian Detection Algorithm Based On Lightweight Neural Network
2	Parallel Computing, Intelligent Optimization And Hybrid Model Research For Traffic Simulation System
3	Coarse-grained Parallel Computing Of The Scheduling Decision In Large Scale Power System
4	Research On Lightweight Technology Of BIM Model On Web For Rail Transit
5	Research On Road Damage Detection Model Based On Lightweight Network
6	Lightweight Design Optimization Of Cylindrical Shells Under External Hydrostatic
7	Parallel Computing Of Flow Field Of A Model Afterburner
8	Research Of The Grid Computing And Parallel Technology Based Dynamic Security Assessment Of Power Systems
9	Optimization Design Of Solid Rocket Motor And Intercept Algorithm Analysis For Direct-Ascend Anti-Satellite Missile
10	Research On New Computing Model Of Electric Power System Based On Network Computing