Research Of Object Detection Based On Multi-modal Images

Posted on:2020-10-30

Degree:Master

Type:Thesis

Country:China

Candidate:F Yang

Full Text:PDF

GTID:2428330575958419

Subject:Circuits and Systems

Abstract/Summary:

Object detection is one of the most popular topics in computer vision. Its results have important application prospects in many fields, such as the military, agriculture, medicine, and security. In recent years, as deep learning has been studied in depth in computer vision, object detection has made great progress and detection accuracy has improved steadily. However, its application in practice still faces great challenges. In areas such as the military and security, traditional single-modal RGB images have severe limitations, which seriously restrict the improvement of detection accuracy in these scenarios. In the past few years, more and more researchers have found that introducing multi-modal data helps to obtain high-performance detectors, and object detection based on multi-modal data is becoming increasingly popular. However, current research on multi-modal tasks does not discuss the characteristics of multi-modal data itself. This thesis focuses on two problems found in multi-modal data: the mismatch between image pairs and missing information between modalities. The main work of this thesis is as follows:

1) The cause of the mismatch problem in multi-modal data is analyzed, showing that this problem is widespread and difficult to avoid in multi-modal data. Experimental results verify that the mismatch problem is an important factor affecting the fusion phase of multi-modal data.

2) The cause of missing information between modalities in multi-modal data is analyzed. This problem is also shown to be widespread and difficult to avoid in multi-modal data, and its influence on the detection network is discussed.

3) Based on the discussion of the above two problems, the structure of a multi-modal object detection network is designed and a step-wise training method is proposed, which achieves good detection results.

4) An RGB and infrared dual-modal dataset is constructed. The image pairs in this dataset have higher resolution, and the dataset contains more image pairs taken in different kinds of scenes.

Keywords/Search Tags:

Object Detection, Multi-modality, Deep Learning, Mismatch, Modal information missing

Related items

1	Research On Few-shot Object Detection Based On Deep Learning
2	Researches On RGB-D Visual Salient Object Detection Algorithms Based On Feature Fusion
3	The Study Of Object Detection Methods Via Deep Learning In Urban Transportation
4	The Object Detection Algorithm Application Based On Deep-Learning
5	Research On Multi-modal Learning For Imbalanced Modal Data
6	Research On 3D Object Detection Algorithm Based On Deep Learning
7	Application For Homologous And Heterogeneous Multimodal Data Based On Multiple Deep Learning Blocks
8	Research On Prediction And Decision-making Methods Based On Multi-source Information Fusion
9	Research Of Object Detection Based On Deep Learning
10	Research On RGB-D Object Recognition Based Deep Learning Algorithms