Computing Agent Competency in First Order Markov Processes

Artificial agents are usually designed to achieve specific goals. An agent's competency can be defined as its ability to accomplish its goals under different conditions. This thesis restricts attention to a specific type of goal, namely reaching a desired state without exceeding a tolerance thr...

Full description

Bibliographic Details
Main Author:	Cao, Xuan
Format:	Others
Published:	BYU ScholarsArchive 2021
Subjects:	artificial agents first-order Markov process state-dependent competency total competency Monte Carlo approach Bayesian network Physical Sciences and Mathematics
Online Access:	https://scholarsarchive.byu.edu/etd/9336 https://scholarsarchive.byu.edu/cgi/viewcontent.cgi?article=10345&context=etd

id	ndltd-BGMYU2-oai-scholarsarchive.byu.edu-etd-10345
record_format	oai_dc
spelling	ndltd-BGMYU2-oai-scholarsarchive.byu.edu-etd-103452021-12-23T05:00:54Z Computing Agent Competency in First Order Markov Processes Cao, Xuan Artificial agents are usually designed to achieve specific goals. An agent's competency can be defined as its ability to accomplish its goals under different conditions. This thesis restricts attention to a specific type of goal, namely reaching a desired state without exceeding a tolerance threshold of undesirable events in a first-order Markov process. For such goals, the state-dependent competency for an agent can be defined as the probability of reaching the desired state without exceeding the threshold and within a time limit given an initial state. The thesis further defines total competency as the set of state-dependent competency relationships over all possible initial states. The thesis uses a Monte Carlo approach to establish a baseline for estimating state-dependent competency. The Monte Carlo approach (a) uses trajectories sampled from an agent behaving in the environment, and then (b) uses nonlinear regression over the trajectory samples to estimate the competency curve. The thesis further presents an equation demonstrating recurrent relations for total competency and an algorithm based on that equation for computing total competency whose worst case computation time grows quadratically with the size of the state space. Simple maze-based Markov chains show that the Monte Carlo approach to estimating the competency agrees with the results computed by the proposed algorithm. Lastly, the thesis explores a special case where there are multiple sequential atomic goals that make up a complex goal. The thesis models a set of sequential goals as a Bayesian network and presents an equation based on the chain rule for deriving the competency for the complex goal from the competency for atomic goals. Experiments for the canonical taxi problem with sequential goals show the correctness of the Bayesian network-based decomposition approach. 2021-12-06T08:00:00Z text application/pdf https://scholarsarchive.byu.edu/etd/9336 https://scholarsarchive.byu.edu/cgi/viewcontent.cgi?article=10345&context=etd https://lib.byu.edu/about/copyright/ Theses and Dissertations BYU ScholarsArchive artificial agents first-order Markov process state-dependent competency total competency Monte Carlo approach Bayesian network Physical Sciences and Mathematics
collection	NDLTD
format	Others
sources	NDLTD
topic	artificial agents first-order Markov process state-dependent competency total competency Monte Carlo approach Bayesian network Physical Sciences and Mathematics
spellingShingle	artificial agents first-order Markov process state-dependent competency total competency Monte Carlo approach Bayesian network Physical Sciences and Mathematics Cao, Xuan Computing Agent Competency in First Order Markov Processes
description	Artificial agents are usually designed to achieve specific goals. An agent's competency can be defined as its ability to accomplish its goals under different conditions. This thesis restricts attention to a specific type of goal, namely reaching a desired state without exceeding a tolerance threshold of undesirable events in a first-order Markov process. For such goals, the state-dependent competency for an agent can be defined as the probability of reaching the desired state without exceeding the threshold and within a time limit given an initial state. The thesis further defines total competency as the set of state-dependent competency relationships over all possible initial states. The thesis uses a Monte Carlo approach to establish a baseline for estimating state-dependent competency. The Monte Carlo approach (a) uses trajectories sampled from an agent behaving in the environment, and then (b) uses nonlinear regression over the trajectory samples to estimate the competency curve. The thesis further presents an equation demonstrating recurrent relations for total competency and an algorithm based on that equation for computing total competency whose worst case computation time grows quadratically with the size of the state space. Simple maze-based Markov chains show that the Monte Carlo approach to estimating the competency agrees with the results computed by the proposed algorithm. Lastly, the thesis explores a special case where there are multiple sequential atomic goals that make up a complex goal. The thesis models a set of sequential goals as a Bayesian network and presents an equation based on the chain rule for deriving the competency for the complex goal from the competency for atomic goals. Experiments for the canonical taxi problem with sequential goals show the correctness of the Bayesian network-based decomposition approach.
author	Cao, Xuan
author_facet	Cao, Xuan
author_sort	Cao, Xuan
title	Computing Agent Competency in First Order Markov Processes
title_short	Computing Agent Competency in First Order Markov Processes
title_full	Computing Agent Competency in First Order Markov Processes
title_fullStr	Computing Agent Competency in First Order Markov Processes
title_full_unstemmed	Computing Agent Competency in First Order Markov Processes
title_sort	computing agent competency in first order markov processes
publisher	BYU ScholarsArchive
publishDate	2021
url	https://scholarsarchive.byu.edu/etd/9336 https://scholarsarchive.byu.edu/cgi/viewcontent.cgi?article=10345&context=etd
work_keys_str_mv	AT caoxuan computingagentcompetencyinfirstordermarkovprocesses
_version_	1723965555866599424

Computing Agent Competency in First Order Markov Processes

Similar Items