Application Research Of Start To Target Autonomous Control Of Virtual Camera Based On Reinforcement Learning

Posted on:2024-01-05

Degree:Master

Type:Thesis

Country:China

Candidate:Q Shao

Full Text:PDF

GTID:2568307127968469

Subject:Software engineering

Abstract/Summary:

PDF Full Text Request

In the virtual museum scene created using the Unity3 D development engine,to realize the virtual camera’s self-display of the exhibits in the virtual scene,the existing method is to manually set the track,make the virtual camera move along the track and display the exhibits along the way,which requires a lot of manpower to adjust the key points of the track in detail.Acommon path planning method is to calculate the optimal path through algorithms given a well-defined map and starting andending points.In the virtual scene,the display of exhibits needs to consider the display angle of the virtual camera,which cannot be solved by the path planning method.In terms of the above problems,this thesis proposes a new autonomous control method to reduce human work and realize the display of exhibits.On the basis of not obtaining the position information of the exhibits in the scene,the target detection module provides the status information of the detected objects,passes it to the reinforcement learning module for training,and finally the reinforcement learning module realizes the autonomous control of the virtual camera from the start to the target to complete the display of the exhibits in the scene.And research from the following points.(1)Astate space suitable for this research topic is designed,which provides state information support forthedesignofrewardfunctions.(2)An action space suitable for this research topic is designed,and the degree of freedom of the researchobjectisdetermined,sothattheagentcanmoveinthesceneanddisplaytheperspectivescreen.(3)A reward function that can effectively motivate the virtual camera to quickly realize autonomous control is designed,which improves the decision-making efficiency of the virtual camera,and is used to connecttheobjectdetectionmoduleandthereinforcementlearningmodule.(4)Three experimentalscenes were builtthrough the Unity3 Ddevelopment engine,and in thesethree virtual scenes,the Vanilla Policy Gradient algorithm(VPG)and the Trust Region Policy Optimization algorithm(TRPO)were used to realize the autonomous control of the virtual camera from the start to the target.Finally,the experimental results in each scene are compared and analyzed to verify the feasibility andperformanceofthetwoalgorithmsintheapplicationofthisthesis.This research project uses the object detection algorithm to identify the virtual exhibits in the virtual scene without obtaining the spatial position of the target and the virtual camera,provides the state support required for autonomous control of reinforcement learning,and realizes the autonomous control of the virtual camera through VPG and TRPO algorithms respectively,and displays the exhibits in the virtual scene.The experimental results of the two algorithms in three virtual scenarios show that the TRPO algorithm is more suitable for complex virtual scenarios,and it is more adaptable,higher in cumulative rewardvalueandbetterinperformancethanthe VPGalgorithminthisapplication.

Keywords/Search Tags:

Reinforcementlearning, Autonomouscontrol, Virtualmuseum, Targetdetection

PDF Full Text Request

Related items

1	Research And Implementation Of A Plum Pavilion Based On Mobile Platform
2	Research Of The Interaction With3D Virtual World Using Smartphone
3	Research On Heterogeneous Virtual Machine Platform Of Supporting Hadoop Configuration
4	The Avoidance Obstacle Path Planning Of Mobile Robot In Unknown Environment
5	Agent Sequential Decision-making Approach And Its Application Under Uncertain Enviroment
6	CFAR Detection Of Targets In SAR Images Based On Statistical Model
7	Interference Coordination Based On Reinforcement Learning In Femtocell Networks
8	The Digital Virtualmuseum Of Hong Shan Culture Research And Application Performance
9	Research On Target Detection Theory For Multiple Antenna-based Radar Systems
10	Based On The Three-dimensional Ranging Multi-objective Warning Technology Research