The development of deep learning has brought new possibilities to robot visual inspection systems. Object detection converts sensor data from robotic devices into structured, semantically meaningful information, enabling the classification and localization of targets in images. In the field of robot vision, conventional RGB (Red-Green-Blue) image data inherently lacks physical distance information between the mobile platform and the perceived target, so other data sources are needed to remedy this shortcoming. Depth data naturally contains distance information, but research on detection networks built on depth data remains limited. To address these issues, this paper presents a new depth pedestrian dataset to compensate for the lack of datasets in this field. It also evaluates the performance of current mainstream visual object detection networks on multimodal data, selects the best-performing detection algorithm for improvement, and raises its AP (Average Precision) on the depth dataset from 0.956 to 0.978. In addition, because the data captured by mobile robots are often sequential, this paper further proposes a new sequence detection network, TDS-DETR, which introduces innovations in model structure, positional encoding, and sequence matching to detect serialized data. Compared with a detection model using two-dimensional positional encoding, TDS-DETR improves AP on the depth pedestrian dataset by 11.4% and reaches 92.9% on short-sequence depth data.