Study on Contextual Bandit Problem with Multiple Actions

碩士 === 國立臺灣大學 === 資訊工程學研究所 === 101 === The contextual bandit problem is usually used to model online applications like article recommendation. Somehow the problem cannot fully meet some needs of these applica- tions, such as making multiple actions at the same time. We propose a new Contextual Bandi...

Full description

Bibliographic Details
Main Authors: Ya-Hsuan Chang, 張雅軒
Other Authors: Hsuan-Tien Lin
Format: Others
Language:en_US
Published: 2013
Online Access:http://ndltd.ncl.edu.tw/handle/94665894891939536263
id ndltd-TW-101NTU05392082
record_format oai_dc
spelling ndltd-TW-101NTU053920822015-10-13T23:05:30Z http://ndltd.ncl.edu.tw/handle/94665894891939536263 Study on Contextual Bandit Problem with Multiple Actions 多動作情境式拉霸問題之研究 Ya-Hsuan Chang 張雅軒 碩士 國立臺灣大學 資訊工程學研究所 101 The contextual bandit problem is usually used to model online applications like article recommendation. Somehow the problem cannot fully meet some needs of these applica- tions, such as making multiple actions at the same time. We propose a new Contextual Bandit Problem with Multiple Ac- tions (CBMA), which is an extension of the traditional con- textual bandit problem and fits the online applications better. We adapt some existing contextual bandit algorithms for our CBMA problem, and propose a new Pairwise Regression with Upper Confidence Bound (PairUCB) algorithm which utilizes the new properties of the CBMA problem, The experiment re- sults demostrate that PairUCB outperforms other algorithms. Hsuan-Tien Lin 林軒田 2013 學位論文 ; thesis 37 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立臺灣大學 === 資訊工程學研究所 === 101 === The contextual bandit problem is usually used to model online applications like article recommendation. Somehow the problem cannot fully meet some needs of these applica- tions, such as making multiple actions at the same time. We propose a new Contextual Bandit Problem with Multiple Ac- tions (CBMA), which is an extension of the traditional con- textual bandit problem and fits the online applications better. We adapt some existing contextual bandit algorithms for our CBMA problem, and propose a new Pairwise Regression with Upper Confidence Bound (PairUCB) algorithm which utilizes the new properties of the CBMA problem, The experiment re- sults demostrate that PairUCB outperforms other algorithms.
author2 Hsuan-Tien Lin
author_facet Hsuan-Tien Lin
Ya-Hsuan Chang
張雅軒
author Ya-Hsuan Chang
張雅軒
spellingShingle Ya-Hsuan Chang
張雅軒
Study on Contextual Bandit Problem with Multiple Actions
author_sort Ya-Hsuan Chang
title Study on Contextual Bandit Problem with Multiple Actions
title_short Study on Contextual Bandit Problem with Multiple Actions
title_full Study on Contextual Bandit Problem with Multiple Actions
title_fullStr Study on Contextual Bandit Problem with Multiple Actions
title_full_unstemmed Study on Contextual Bandit Problem with Multiple Actions
title_sort study on contextual bandit problem with multiple actions
publishDate 2013
url http://ndltd.ncl.edu.tw/handle/94665894891939536263
work_keys_str_mv AT yahsuanchang studyoncontextualbanditproblemwithmultipleactions
AT zhāngyǎxuān studyoncontextualbanditproblemwithmultipleactions
AT yahsuanchang duōdòngzuòqíngjìngshìlābàwèntízhīyánjiū
AT zhāngyǎxuān duōdòngzuòqíngjìngshìlābàwèntízhīyánjiū
_version_ 1718083772090417152