Markov decision processes in minimization of expected costs

Basics of Markov decision processes will be introduced in order to obtain the optimization goal function for minimizing the long-run expected cost. We focus on mini-mization of such cost of the farmer's policy consisting of different decisions in specic states regarding both milk quality and qu...

Full description

Bibliographic Details
Main Authors: Marija Rukav, Kruno Stražanac, Nenad Šuvak, Zoran Tomljanović
Format: Article
Language:English
Published: Croatian Operational Research Society 2014-12-01
Series:Croatian Operational Research Review
Subjects:
Online Access:http://hrcak.srce.hr/index.php?show=clanak&id_clanak_jezik=197361
id doaj-ccf3035d531a456a8b94031b51222f00
record_format Article
spelling doaj-ccf3035d531a456a8b94031b51222f002020-11-24T23:30:52ZengCroatian Operational Research SocietyCroatian Operational Research Review1848-02251848-99312014-12-015224725710.17535/crorr.2014.0011Markov decision processes in minimization of expected costsMarija Rukav0Kruno Stražanac1Nenad Šuvak2Zoran Tomljanović3Department of Mathematics, Josip Juraj Strossmayer University of OsijekFarmeron d.o.o.Department of Mathematics, Josip Juraj Strossmayer University of OsijekDepartment of Mathematics, Josip Juraj Strossmayer University of OsijekBasics of Markov decision processes will be introduced in order to obtain the optimization goal function for minimizing the long-run expected cost. We focus on mini-mization of such cost of the farmer's policy consisting of different decisions in specic states regarding both milk quality and quantity (lactation states) produced by a dairy cow. The transition probability matrix of the Markov process, used here for modeling of transitions of a dairy cow from one state to another, will be estimated from the data simulated from the lactation model that is often used in practice. We want to choose optimal actions in the states of this Markov process regarding the farmer's costs. This problem can be solved by exhaustive enumeration of all possible cases in order to obtain the optimal policy. How-ever, this is feasible only for a small number of states. Generally, this problem can be approached in the linear programming setting which yields an efficient solution. In order to demonstrate and compare these two approaches, we present an example based on the simulated data regarding milk quality and quantity.http://hrcak.srce.hr/index.php?show=clanak&id_clanak_jezik=197361Linear programming methodslong-run expected costMarkov decision pro-cessstationary distributionoptimal policy
collection DOAJ
language English
format Article
sources DOAJ
author Marija Rukav
Kruno Stražanac
Nenad Šuvak
Zoran Tomljanović
spellingShingle Marija Rukav
Kruno Stražanac
Nenad Šuvak
Zoran Tomljanović
Markov decision processes in minimization of expected costs
Croatian Operational Research Review
Linear programming methods
long-run expected cost
Markov decision pro-cess
stationary distribution
optimal policy
author_facet Marija Rukav
Kruno Stražanac
Nenad Šuvak
Zoran Tomljanović
author_sort Marija Rukav
title Markov decision processes in minimization of expected costs
title_short Markov decision processes in minimization of expected costs
title_full Markov decision processes in minimization of expected costs
title_fullStr Markov decision processes in minimization of expected costs
title_full_unstemmed Markov decision processes in minimization of expected costs
title_sort markov decision processes in minimization of expected costs
publisher Croatian Operational Research Society
series Croatian Operational Research Review
issn 1848-0225
1848-9931
publishDate 2014-12-01
description Basics of Markov decision processes will be introduced in order to obtain the optimization goal function for minimizing the long-run expected cost. We focus on mini-mization of such cost of the farmer's policy consisting of different decisions in specic states regarding both milk quality and quantity (lactation states) produced by a dairy cow. The transition probability matrix of the Markov process, used here for modeling of transitions of a dairy cow from one state to another, will be estimated from the data simulated from the lactation model that is often used in practice. We want to choose optimal actions in the states of this Markov process regarding the farmer's costs. This problem can be solved by exhaustive enumeration of all possible cases in order to obtain the optimal policy. How-ever, this is feasible only for a small number of states. Generally, this problem can be approached in the linear programming setting which yields an efficient solution. In order to demonstrate and compare these two approaches, we present an example based on the simulated data regarding milk quality and quantity.
topic Linear programming methods
long-run expected cost
Markov decision pro-cess
stationary distribution
optimal policy
url http://hrcak.srce.hr/index.php?show=clanak&id_clanak_jezik=197361
work_keys_str_mv AT marijarukav markovdecisionprocessesinminimizationofexpectedcosts
AT krunostrazanac markovdecisionprocessesinminimizationofexpectedcosts
AT nenadsuvak markovdecisionprocessesinminimizationofexpectedcosts
AT zorantomljanovic markovdecisionprocessesinminimizationofexpectedcosts
_version_ 1725539887302574080