Study on Contextual Bandit Problem with Multiple Actions
碩士 === 國立臺灣大學 === 資訊工程學研究所 === 101 === The contextual bandit problem is usually used to model online applications like article recommendation. Somehow the problem cannot fully meet some needs of these applica- tions, such as making multiple actions at the same time. We propose a new Contextual Bandi...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2013
|
Online Access: | http://ndltd.ncl.edu.tw/handle/94665894891939536263 |
id |
ndltd-TW-101NTU05392082 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-101NTU053920822015-10-13T23:05:30Z http://ndltd.ncl.edu.tw/handle/94665894891939536263 Study on Contextual Bandit Problem with Multiple Actions 多動作情境式拉霸問題之研究 Ya-Hsuan Chang 張雅軒 碩士 國立臺灣大學 資訊工程學研究所 101 The contextual bandit problem is usually used to model online applications like article recommendation. Somehow the problem cannot fully meet some needs of these applica- tions, such as making multiple actions at the same time. We propose a new Contextual Bandit Problem with Multiple Ac- tions (CBMA), which is an extension of the traditional con- textual bandit problem and fits the online applications better. We adapt some existing contextual bandit algorithms for our CBMA problem, and propose a new Pairwise Regression with Upper Confidence Bound (PairUCB) algorithm which utilizes the new properties of the CBMA problem, The experiment re- sults demostrate that PairUCB outperforms other algorithms. Hsuan-Tien Lin 林軒田 2013 學位論文 ; thesis 37 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立臺灣大學 === 資訊工程學研究所 === 101 === The contextual bandit problem is usually used to model online applications like article recommendation. Somehow the problem cannot fully meet some needs of these applica- tions, such as making multiple actions at the same time. We propose a new Contextual Bandit Problem with Multiple Ac- tions (CBMA), which is an extension of the traditional con- textual bandit problem and fits the online applications better. We adapt some existing contextual bandit algorithms for our CBMA problem, and propose a new Pairwise Regression with Upper Confidence Bound (PairUCB) algorithm which utilizes the new properties of the CBMA problem, The experiment re- sults demostrate that PairUCB outperforms other algorithms.
|
author2 |
Hsuan-Tien Lin |
author_facet |
Hsuan-Tien Lin Ya-Hsuan Chang 張雅軒 |
author |
Ya-Hsuan Chang 張雅軒 |
spellingShingle |
Ya-Hsuan Chang 張雅軒 Study on Contextual Bandit Problem with Multiple Actions |
author_sort |
Ya-Hsuan Chang |
title |
Study on Contextual Bandit Problem with Multiple Actions |
title_short |
Study on Contextual Bandit Problem with Multiple Actions |
title_full |
Study on Contextual Bandit Problem with Multiple Actions |
title_fullStr |
Study on Contextual Bandit Problem with Multiple Actions |
title_full_unstemmed |
Study on Contextual Bandit Problem with Multiple Actions |
title_sort |
study on contextual bandit problem with multiple actions |
publishDate |
2013 |
url |
http://ndltd.ncl.edu.tw/handle/94665894891939536263 |
work_keys_str_mv |
AT yahsuanchang studyoncontextualbanditproblemwithmultipleactions AT zhāngyǎxuān studyoncontextualbanditproblemwithmultipleactions AT yahsuanchang duōdòngzuòqíngjìngshìlābàwèntízhīyánjiū AT zhāngyǎxuān duōdòngzuòqíngjìngshìlābàwèntízhīyánjiū |
_version_ |
1718083772090417152 |