Markov decision processes in minimization of expected costs
Basics of Markov decision processes will be introduced in order to obtain the optimization goal function for minimizing the long-run expected cost. We focus on mini-mization of such cost of the farmer's policy consisting of different decisions in specic states regarding both milk quality and qu...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Croatian Operational Research Society
2014-12-01
|
Series: | Croatian Operational Research Review |
Subjects: | |
Online Access: | http://hrcak.srce.hr/index.php?show=clanak&id_clanak_jezik=197361 |
id |
doaj-ccf3035d531a456a8b94031b51222f00 |
---|---|
record_format |
Article |
spelling |
doaj-ccf3035d531a456a8b94031b51222f002020-11-24T23:30:52ZengCroatian Operational Research SocietyCroatian Operational Research Review1848-02251848-99312014-12-015224725710.17535/crorr.2014.0011Markov decision processes in minimization of expected costsMarija Rukav0Kruno Stražanac1Nenad Šuvak2Zoran Tomljanović3Department of Mathematics, Josip Juraj Strossmayer University of OsijekFarmeron d.o.o.Department of Mathematics, Josip Juraj Strossmayer University of OsijekDepartment of Mathematics, Josip Juraj Strossmayer University of OsijekBasics of Markov decision processes will be introduced in order to obtain the optimization goal function for minimizing the long-run expected cost. We focus on mini-mization of such cost of the farmer's policy consisting of different decisions in specic states regarding both milk quality and quantity (lactation states) produced by a dairy cow. The transition probability matrix of the Markov process, used here for modeling of transitions of a dairy cow from one state to another, will be estimated from the data simulated from the lactation model that is often used in practice. We want to choose optimal actions in the states of this Markov process regarding the farmer's costs. This problem can be solved by exhaustive enumeration of all possible cases in order to obtain the optimal policy. How-ever, this is feasible only for a small number of states. Generally, this problem can be approached in the linear programming setting which yields an efficient solution. In order to demonstrate and compare these two approaches, we present an example based on the simulated data regarding milk quality and quantity.http://hrcak.srce.hr/index.php?show=clanak&id_clanak_jezik=197361Linear programming methodslong-run expected costMarkov decision pro-cessstationary distributionoptimal policy |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Marija Rukav Kruno Stražanac Nenad Šuvak Zoran Tomljanović |
spellingShingle |
Marija Rukav Kruno Stražanac Nenad Šuvak Zoran Tomljanović Markov decision processes in minimization of expected costs Croatian Operational Research Review Linear programming methods long-run expected cost Markov decision pro-cess stationary distribution optimal policy |
author_facet |
Marija Rukav Kruno Stražanac Nenad Šuvak Zoran Tomljanović |
author_sort |
Marija Rukav |
title |
Markov decision processes in minimization of expected costs |
title_short |
Markov decision processes in minimization of expected costs |
title_full |
Markov decision processes in minimization of expected costs |
title_fullStr |
Markov decision processes in minimization of expected costs |
title_full_unstemmed |
Markov decision processes in minimization of expected costs |
title_sort |
markov decision processes in minimization of expected costs |
publisher |
Croatian Operational Research Society |
series |
Croatian Operational Research Review |
issn |
1848-0225 1848-9931 |
publishDate |
2014-12-01 |
description |
Basics of Markov decision processes will be introduced in order to obtain the optimization goal function for minimizing the long-run expected cost. We focus on mini-mization of such cost of the farmer's policy consisting of different decisions in specic states regarding both milk quality and quantity (lactation states) produced by a dairy cow. The transition probability matrix of the Markov process, used here for modeling of transitions of a dairy cow from one state to another, will be estimated from the data simulated from the lactation model that is often used in practice. We want to choose optimal actions in the states of this Markov process regarding the farmer's costs. This problem can be solved by exhaustive enumeration of all possible cases in order to obtain the optimal policy. How-ever, this is feasible only for a small number of states. Generally, this problem can be approached in the linear programming setting which yields an efficient solution. In order to demonstrate and compare these two approaches, we present an example based on the simulated data regarding milk quality and quantity. |
topic |
Linear programming methods long-run expected cost Markov decision pro-cess stationary distribution optimal policy |
url |
http://hrcak.srce.hr/index.php?show=clanak&id_clanak_jezik=197361 |
work_keys_str_mv |
AT marijarukav markovdecisionprocessesinminimizationofexpectedcosts AT krunostrazanac markovdecisionprocessesinminimizationofexpectedcosts AT nenadsuvak markovdecisionprocessesinminimizationofexpectedcosts AT zorantomljanovic markovdecisionprocessesinminimizationofexpectedcosts |
_version_ |
1725539887302574080 |