The Research And Design Of Point-based POMDP Value Iteration Algorithm

Posted on:2014-05-20

Degree:Master

Type:Thesis

Country:China

Candidate:T Bai

Full Text:PDF

GTID:2298330467464069

Subject:Software engineering

Abstract/Summary:

PDF Full Text Request

Making sequential decision under uncertainty is one of the most important tasks in artificial intelligence. As the natural extension of Markov Decision Process(MDP), Partially Observable Markov Decision Processes (POMDP) serves as a very competitive model to capture the uncertainty of the world. However, due to the problem of dimension and history, solving exactly POMDP is intractable, people in turn try to look for approximation method. Point-based value iteration algorithms are the most MLOPVIular algorithms among approximation method, whose elegant principle and excellent performance has made POMDP could be applied to much larger practical problems.In this paper, firstly we introduce the mathematical model of POMDP in detail, then we put emphasis on discussion of many point-based value iteration algorithms and further research on these algorithms, the main content of this paper is listed as follows:1. Introduce the math model of Markov Decision Process (MDP) and Partially Observable Markov Decision Processes (POMDP) in detail.2. Introduce the concept and meaning of value function, and the method of value iteration, and simply introduce exact solving method of MDP and POMDP.3. Explain the thought and meaning of several classic point-based value iteration methods, and introduce these methods’ content, advantages and disadvantages.4. Put forward the Most Likely Optimal Policy Value Iteration algorithm, this algorithm uses Monte Carlo Method to calculate the probability of being a part of optimal policy for every action, and gives the priority to those points being visited under the action with the highest probabilities to belief points set. We conduct experiments on several data sets. The result shows our algorithm is superior on these sets comparing with predecessor.5. Introduce the organization and application of spoken dialog system in detail, and put forward a spoken dialog system modeled by POMDP and solved by point-based method.

Keywords/Search Tags:

Sequential Decision Problem, Partially Observable Markov Decision Process, Point-based Value Iteration, Monte Carlo Method

PDF Full Text Request

Related items

1	The Design And Implementation Of Point-based POMDP Policy Iteration Algorithm
2	Markov Theory Based Planning And Sensing Under Uncertainty
3	Agent Sequential Decision-making Approach And Its Application Under Uncertain Enviroment
4	Deep Value Iteration Network For Partially Observable Markov Decision Process
5	Heuristic Learning Model Based On Partially Observable Markov Decision Process
6	Learning partially observable Markov decision processes using abstract actions
7	Increasing scalability in algorithms for centralized and decentralized partially observable Markov decision processes: Efficient decision-making and coordination in uncertain environments
8	Research On Path Planning Based On Markov Decision Process For AUV
9	Research On Path Planning Based On Markov Decision Processes For Palletizing Robot
10	Research On Optimization Of Service Composition Based On Partially Observable Environment