Automated Hierarchy Discovery for Planning in Partially Observable Domains

Planning in partially observable domains is a notoriously difficult problem. However, in many real-world scenarios, planning can be simplified by decomposing the task into a hierarchy of smaller planning problems which, can then be solved independently of one another. Several approaches, mainly deal...

Full description

Bibliographic Details
Main Author: Charlin, Laurent
Format: Others
Language:en
Published: 2007
Subjects:
Online Access:http://hdl.handle.net/10012/2665
id ndltd-WATERLOO-oai-uwspace.uwaterloo.ca-10012-2665
record_format oai_dc
spelling ndltd-WATERLOO-oai-uwspace.uwaterloo.ca-10012-26652013-01-08T18:49:44ZCharlin, Laurent2007-01-19T20:57:29Z2007-01-19T20:57:29Z2007-01-19T20:57:29Z2006http://hdl.handle.net/10012/2665Planning in partially observable domains is a notoriously difficult problem. However, in many real-world scenarios, planning can be simplified by decomposing the task into a hierarchy of smaller planning problems which, can then be solved independently of one another. Several approaches, mainly dealing with fully observable domains, have been proposed to optimize a plan that decomposes according to a hierarchy specified a priori. Some researchers have also proposed to discover hierarchies in fully observable domains. In this thesis, we investigate the problem of automatically discovering planning hierarchies in partially observable domains. The main advantage of discovering hierarchies is that, for a plan of a fixed size, hierarchical plans can be more expressive than non-hierarchical ones. Our solution frames the discovery and optimization of a hierarchical policy as a non-convex optimization problem. By encoding the hierarchical structure as variables of the optimization problem, we can automatically discover a hierarchy. Successfully solving the optimization problem therefore yields an optimal hierarchy and an optimal policy. We describe several techniques to solve the optimization problem. Namely, we provide results using general non-linear solvers, mixed-integer linear and non-linear solvers or a form of bounded hierarchical policy iteration. Our method is flexible enough to allow any parts of the hierarchy to be specified based on prior knowledge while letting the optimization discover the unknown parts. It can also discover hierarchical policies, including recursive policies, that are more compact (potentially infinitely fewer parameters).578014 bytesapplication/pdfenComputer ScienceArtificial IntelligenceReasoning under uncertaintyDecision MakingPlanningMarkov Decision ProcessPartially ObservableHierarchy DiscoveryHierarchical PolicyOptimizationAutomated Hierarchy Discovery for Planning in Partially Observable DomainsThesis or DissertationSchool of Computer ScienceMaster of MathematicsComputer Science
collection NDLTD
language en
format Others
sources NDLTD
topic Computer Science
Artificial Intelligence
Reasoning under uncertainty
Decision Making
Planning
Markov Decision Process
Partially Observable
Hierarchy Discovery
Hierarchical Policy
Optimization
Computer Science
spellingShingle Computer Science
Artificial Intelligence
Reasoning under uncertainty
Decision Making
Planning
Markov Decision Process
Partially Observable
Hierarchy Discovery
Hierarchical Policy
Optimization
Computer Science
Charlin, Laurent
Automated Hierarchy Discovery for Planning in Partially Observable Domains
description Planning in partially observable domains is a notoriously difficult problem. However, in many real-world scenarios, planning can be simplified by decomposing the task into a hierarchy of smaller planning problems which, can then be solved independently of one another. Several approaches, mainly dealing with fully observable domains, have been proposed to optimize a plan that decomposes according to a hierarchy specified a priori. Some researchers have also proposed to discover hierarchies in fully observable domains. In this thesis, we investigate the problem of automatically discovering planning hierarchies in partially observable domains. The main advantage of discovering hierarchies is that, for a plan of a fixed size, hierarchical plans can be more expressive than non-hierarchical ones. Our solution frames the discovery and optimization of a hierarchical policy as a non-convex optimization problem. By encoding the hierarchical structure as variables of the optimization problem, we can automatically discover a hierarchy. Successfully solving the optimization problem therefore yields an optimal hierarchy and an optimal policy. We describe several techniques to solve the optimization problem. Namely, we provide results using general non-linear solvers, mixed-integer linear and non-linear solvers or a form of bounded hierarchical policy iteration. Our method is flexible enough to allow any parts of the hierarchy to be specified based on prior knowledge while letting the optimization discover the unknown parts. It can also discover hierarchical policies, including recursive policies, that are more compact (potentially infinitely fewer parameters).
author Charlin, Laurent
author_facet Charlin, Laurent
author_sort Charlin, Laurent
title Automated Hierarchy Discovery for Planning in Partially Observable Domains
title_short Automated Hierarchy Discovery for Planning in Partially Observable Domains
title_full Automated Hierarchy Discovery for Planning in Partially Observable Domains
title_fullStr Automated Hierarchy Discovery for Planning in Partially Observable Domains
title_full_unstemmed Automated Hierarchy Discovery for Planning in Partially Observable Domains
title_sort automated hierarchy discovery for planning in partially observable domains
publishDate 2007
url http://hdl.handle.net/10012/2665
work_keys_str_mv AT charlinlaurent automatedhierarchydiscoveryforplanninginpartiallyobservabledomains
_version_ 1716572569263407104