Study on Contextual Bandit Problem with Multiple Actions

碩士 === 國立臺灣大學 === 資訊工程學研究所 === 101 === The contextual bandit problem is usually used to model online applications like article recommendation. Somehow the problem cannot fully meet some needs of these applica- tions, such as making multiple actions at the same time. We propose a new Contextual Bandi...

Full description

Bibliographic Details
Main Authors:	Ya-Hsuan Chang, 張雅軒
Other Authors:	Hsuan-Tien Lin
Format:	Others
Language:	en_US
Published:	2013
Online Access:	http://ndltd.ncl.edu.tw/handle/94665894891939536263

id	ndltd-TW-101NTU05392082
record_format	oai_dc
spelling	ndltd-TW-101NTU053920822015-10-13T23:05:30Z http://ndltd.ncl.edu.tw/handle/94665894891939536263 Study on Contextual Bandit Problem with Multiple Actions 多動作情境式拉霸問題之研究 Ya-Hsuan Chang 張雅軒碩士國立臺灣大學資訊工程學研究所 101 The contextual bandit problem is usually used to model online applications like article recommendation. Somehow the problem cannot fully meet some needs of these applica- tions, such as making multiple actions at the same time. We propose a new Contextual Bandit Problem with Multiple Ac- tions (CBMA), which is an extension of the traditional con- textual bandit problem and fits the online applications better. We adapt some existing contextual bandit algorithms for our CBMA problem, and propose a new Pairwise Regression with Upper Confidence Bound (PairUCB) algorithm which utilizes the new properties of the CBMA problem, The experiment re- sults demostrate that PairUCB outperforms other algorithms. Hsuan-Tien Lin 林軒田 2013 學位論文 ; thesis 37 en_US
collection	NDLTD
language	en_US
format	Others
sources	NDLTD
description	碩士 === 國立臺灣大學 === 資訊工程學研究所 === 101 === The contextual bandit problem is usually used to model online applications like article recommendation. Somehow the problem cannot fully meet some needs of these applica- tions, such as making multiple actions at the same time. We propose a new Contextual Bandit Problem with Multiple Ac- tions (CBMA), which is an extension of the traditional con- textual bandit problem and fits the online applications better. We adapt some existing contextual bandit algorithms for our CBMA problem, and propose a new Pairwise Regression with Upper Confidence Bound (PairUCB) algorithm which utilizes the new properties of the CBMA problem, The experiment re- sults demostrate that PairUCB outperforms other algorithms.
author2	Hsuan-Tien Lin
author_facet	Hsuan-Tien Lin Ya-Hsuan Chang 張雅軒
author	Ya-Hsuan Chang 張雅軒
spellingShingle	Ya-Hsuan Chang 張雅軒 Study on Contextual Bandit Problem with Multiple Actions
author_sort	Ya-Hsuan Chang
title	Study on Contextual Bandit Problem with Multiple Actions
title_short	Study on Contextual Bandit Problem with Multiple Actions
title_full	Study on Contextual Bandit Problem with Multiple Actions
title_fullStr	Study on Contextual Bandit Problem with Multiple Actions
title_full_unstemmed	Study on Contextual Bandit Problem with Multiple Actions
title_sort	study on contextual bandit problem with multiple actions
publishDate	2013
url	http://ndltd.ncl.edu.tw/handle/94665894891939536263
work_keys_str_mv	AT yahsuanchang studyoncontextualbanditproblemwithmultipleactions AT zhāngyǎxuān studyoncontextualbanditproblemwithmultipleactions AT yahsuanchang duōdòngzuòqíngjìngshìlābàwèntízhīyánjiū AT zhāngyǎxuān duōdòngzuòqíngjìngshìlābàwèntízhīyánjiū
_version_	1718083772090417152

Study on Contextual Bandit Problem with Multiple Actions

Similar Items