Research On Application Of Multimodal Information Fusion In Robot Target Positioning And Grasping

Posted on:2019-03-09

Degree:Master

Type:Thesis

Country:China

Candidate:Y F Wei

Full Text:PDF

GTID:2428330590976095

Subject:Mechanical and electrical engineering

Abstract/Summary:

Computer vision-based object recognition and localization technology has been widely used in industrial production. Motivated by the needs of industrial production, this paper proposes a pose estimation algorithm that identifies and locates objects by extracting multimodal information from two-dimensional images and three-dimensional images; its reliability and robustness are verified through grasping experiments. To increase the reliability of recognition, deep learning is used to train a classifier on different objects, and experiments verify that the resulting classification model recognizes objects well. The content of this paper is as follows:

(1) Calibration experiments are performed on the Kinect2 depth camera, the USB camera, and the robot. The coordinate systems of the three are converted to the world coordinate system, providing a theoretical basis for later grasping.

(2) The 2D image of the target is acquired with the USB camera. The object's contour is recognized through contour detection and matching. SIFT features are then extracted from the image for location tracking, which yields the position of the object.

(3) A point cloud image is acquired with the Kinect2 camera. The best-matching model is selected through preprocessing, Euclidean cluster segmentation, VFH feature computation, and KD-tree search, which identifies the object in the point cloud image. The orientation is then obtained by registering the point clouds.

(4) A pose estimation algorithm is proposed that combines the above two methods to identify and position objects. The method is verified by robotic grasping experiments. The results show that the multimodal information of the two-dimensional image and the three-dimensional point cloud image can be used to identify and locate different target objects. Compared with processing that uses only two-dimensional or only point cloud image information, the positioning error can be reduced by 50%, and the robustness and accuracy are better.

(5) A convolutional neural network (CNN) is used to classify objects. First, images containing only the objects are collected, and the “DCGAN” neural network is used to augment the number of object images. A CNN is then built and trained to classify the different objects, yielding a classification and recognition model. Experiments show that the model can accurately identify different objects.
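
Steps (1) and (4) can be illustrated with a minimal sketch of how a position from the 2D pipeline and an orientation from point cloud registration might be brought into one world frame. This is not the thesis's implementation. The calibration values, the function names, and the restriction to rotation about the z axis are all assumptions made for brevity, and the 2D result is assumed to have already been back-projected to a 3D point in the USB camera frame.

```python
import math

def make_transform(yaw_rad, tx, ty, tz):
    """Build a 4x4 homogeneous transform: rotation about z, then translation."""
    c, s = math.cos(yaw_rad), math.sin(yaw_rad)
    return [
        [c, -s, 0.0, tx],
        [s, c, 0.0, ty],
        [0.0, 0.0, 1.0, tz],
        [0.0, 0.0, 0.0, 1.0],
    ]

def matmul(a, b):
    """Multiply two 4x4 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]

def apply(t, point):
    """Apply a homogeneous transform to a 3D point."""
    v = (point[0], point[1], point[2], 1.0)
    return tuple(sum(t[i][k] * v[k] for k in range(4)) for i in range(3))

def yaw_of(t):
    """Recover the rotation angle about z from a transform."""
    return math.atan2(t[1][0], t[0][0])

# Hypothetical calibration results (assumed values, not from the thesis).
WORLD_FROM_USB = make_transform(math.pi / 2, 0.5, 0.0, 1.0)
WORLD_FROM_KINECT = make_transform(0.0, 0.2, -0.1, 1.2)

def fuse_pose(position_in_usb, object_in_kinect):
    """Take position from the 2D pipeline and orientation from registration.

    position_in_usb: 3D point in the USB camera frame (assumed back-projected).
    object_in_kinect: 4x4 transform produced by point cloud registration.
    Returns (position in world frame, yaw in world frame).
    """
    position_world = apply(WORLD_FROM_USB, position_in_usb)
    world_from_object = matmul(WORLD_FROM_KINECT, object_in_kinect)
    return position_world, yaw_of(world_from_object)
```

Calling `fuse_pose((1.0, 0.0, 0.0), make_transform(math.pi / 6, 0.0, 0.0, 0.0))` returns a world-frame position and yaw that a grasp planner could consume. A full implementation would use general 3D rotations and the actual hand-eye calibration results.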

Keywords/Search Tags:

2D image, Point cloud, Recognition and positioning, Robot, Neural network

Related items

1	3D Object Recognition Of Scattered Parts Based On RealSense
2	Research On 3D Point Cloud Recognition Method Based On Hierarchical Graph Convolutional Neural Network
3	Research On Point Cloud Classification And Semantic Segmentation Technology Based On Deep Neural Network
4	Research On Grasp Positioning Technology Of Industrial Robot Based On 3D Point Cloud
5	Research On Laser Point Cloud Semantic Segmentation Algorithm Based On Deep Neural Network
6	Design And Implementation Of Finger Vein Recognition System Based On Binocular Vision
7	Design Of Industrial Robot Positioning System Based On Improved Algorithm Of Point Cloud Registration
8	Research On Online Measurement Technology Of Robot Assembly And Welding Process Based On Point Cloud Data
9	Research On Recognition And Segmentation Method For 3D Point Clouds Based On Graph Neural Network
10	Research On Visual Inspection Method Of Safe Operation System Of Industrial Robot