Increasing scalability in algorithms for centralized and decentralized partially observable Markov decision processes: Efficient decision-making and coordination in uncertain environments

Posted on:2011-12-06

Degree:Ph.D

Type:Thesis

University:University of Massachusetts Amherst

Candidate:Amato, Christopher

Full Text:PDF

GTID:2468390011471461

Subject:Operations Research

Abstract/Summary:

PDF Full Text Request

As agents are built for ever more complex environments, methods that consider the uncertainty in the system have strong advantages. This uncertainty is common in domains such as robot navigation, medical diagnosis and treatment, inventory management, sensor networks and e-commerce. When a single decision maker is present, the partially observable Markov decision process (POMDP) model is a popular and powerful choice. When choices are made in a decentralized manner by a set of decision makers, the problem can be modeled as a decentralized partially observable Markov decision process (DEC-POMDP). While POMDPs and DEC-POMDPs offer rich frameworks for sequential decision making under uncertainty, the computational complexity of each model presents an important research challenge.;As a way to address this high complexity, this thesis develops several solution methods based on utilizing domain structure, memory-bounded representations and sampling. These approaches address some of the major bottlenecks for decision-making in real-world uncertain systems. The methods include a more efficient optimal algorithm for DEC-POMDPs as well as scalable approximate algorithms for POMDPs and DEC-POMDPs. Key contributions include optimizing compact representations as well as automatic structure extraction and exploitation. These approaches increase the scalability of algorithms, while also increasing their solution quality.

Keywords/Search Tags:

Partially observable markov decision, Algorithms, Decentralized

PDF Full Text Request

Related items

1	Research On Host Penetration Test Of LAN Based On Partially Observable Markov Decision Process
2	Heuristic Learning Model Based On Partially Observable Markov Decision Process
3	Hierarchical learning and planning in partially observable Markov decision processes
4	Deep Value Iteration Network For Partially Observable Markov Decision Process
5	Learning partially observable Markov decision processes using abstract actions
6	Research On Optimization Of Service Composition Based On Partially Observable Environment
7	Decision-Theoretic Planning For Multi-Agent Systems
8	Algorithms for partially observable Markov decision processes
9	Markov Theory Based Planning And Sensing Under Uncertainty
10	Decision-theoretic meta-reasoning in partially observable and decentralized settings