Deep Pedestrian Detection With Variable Input Size

Posted on:2019-10-14

Degree:Master

Type:Thesis

Country:China

Candidate:S Chen

Full Text:PDF

GTID:2518306473953969

Subject:Computer Science and Technology

Abstract/Summary:

PDF Full Text Request

In recent years,the object detection based on deep learning shows very good performance.With the input images,the neural networks learn the method to extract the feature maps from the original images end-to-end.However,the detection algorithms based on the deep learning are mainly dependent on the convolution neural network which requires a fixed size input because of the final full connection layer.So the previous deep learning algorithms need to crop/warp the original image to a fixed size,these operation greatly limit the performance on large and small objects.Based on the size problem in the object detection algorithm,this paper proposes the variable input scale object detection algorithm.The details are summarized as follows:(1)Based on the problem of fixed size dependency in convolution neural networks,this paper proposes region proposal framework using multiple scale inputs for training and testing.The framework works well for the large and small objects in the pictures.The proposed algorithm generates the regions based on the size in different levels.And it achieves good performance via the adaptive strategy.With our method,the performance on small and large objects has a significant advance.Based on the further research in the first problem,the problem of hard examples mining is studied.An associated work network,which contains a metric coding net(MC-net)and a weighted association CNN(WA-CNN),is introduced to mine the hard negatives.MC-net is based on the metric learning theory for the comparability determination,to strengthen the difference of intra-class.WA-CNN can be regarded as a network to reinforce the distance of inter-class and associates the MC-net to accomplish the detection task by a weighted strategy.Finally,because the pedestrian detection needs to process bounding boxes to carry out the final target box,Non-maximum Suppression(NMS)is used usually.But in reality,pedestrians have a high overlap in the picture,and the NMS is not work well in these scenes.This paper proposes the segment non-maximum suppression strategy for these scenes.(2)Based on the analysis in the one-stage detection and two-stage detection,this paper synthesizes the advantage of the binary classification and multi-classification in the detection and designs a one-stage detection algorithm with variable input size.Future,an Non-maximum Suppression strategy jointing multi-class score with the binary classification score,is introduced in the paper.

Keywords/Search Tags:

deep learning, pedestrian detection, one-stage dectection framework, two-stage detection framework, variable input size

PDF Full Text Request

Related items

1	Research On Pedestrian Detection Algorithms Based On Deep Learning
2	Research On Small Objects Detection Method Based On Deep Learning
3	Research On Multi-stage Attack Detection Method Based On Deep Learning
4	Two-stage Detector For Automatic Classification And Recognition Of Leukocyte
5	The Applied Research Of One Stage Object Detection Algorithm Based On Deep Learning
6	Research On 3D Ground Object Detection And Recognition Based On Lidar
7	Research On Single-Stage Object Detection Algorithm Based On Deep Convolutional Neural Network
8	Occlusion Pedestrian Detection Based On Deep Learning Algorithm
9	Research On Pedestrian Detection Algorithm Based On Improved End-to-End Deep Learning Framework
10	Research And Implementation Of Improved Method For Far-infrared Pedestrian Detection On Board