Hokkaido University Collection of Scholarly and Academic Papers (HUSCAP)

KL-UCB-Based Policy for Budgeted Multi-Armed Bandits with Stochastic Action Costs

Files in This Item:
IEICE Trans. Fundam. Electron. Commun. Comput. Sci. 100-A_11_2470-2486.pdf (1.5 MB, PDF)
Please use this identifier to cite or link to this item: http://hdl.handle.net/2115/89513

Title: KL-UCB-Based Policy for Budgeted Multi-Armed Bandits with Stochastic Action Costs
Authors: WATANABE, Ryo
KOMIYAMA, Junpei
NAKAMURA, Atsuyoshi
KUDO, Mineichi
Keywords: budgeted multi-armed bandits
asymptotically optimal policy
regret analysis
Issue Date: 1-Nov-2017
Publisher: IEICE
Journal Title: IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences
Volume: E100.A
Issue: 11
Start Page: 2470
End Page: 2486
Publisher DOI: 10.1587/transfun.E100.A.2470
Rights: copyright©2017 IEICE
Type: article
URI: http://hdl.handle.net/2115/89513
Appears in Collections: Graduate School of Information Science and Technology / Faculty of Information Science and Technology > Peer-reviewed Journal Articles, etc

Submitter: NAKAMURA, Atsuyoshi (中村 篤祥)
