A bad arm existence checking problem: How to utilize asymmetric problem structure?

Tabata, Koji; Nakamura, Atsuyoshi; Honda, Junya; Komatsuzaki, Tamiki

doi:10.1007/s10994-019-05854-7


Hokkaido University \| Library \| HUSCAP	Advanced Search		言語

	Home
	About HUSCAP
	Open Access Policy

	Browse by Author

Browse
	Communities & Collections

	Scholarly Journals
	Theses
	Doctoral Dissertations Listed by Graduate Schools
	Conference Procs.
	Events

	HUSCAP Senior (in Japanese)

	Societies

	Downloads (country)

For university staff
	How to post your papers to HUSCAP
	Publication of theses
	Helpline about theses publication

Open Archives Compliant

You can search our collection also at:
	Google
	Google Scholar
	CiNii
	IRDB
	OAIster
	NDLTD

Hokkaido University Collection of Scholarly and Academic Papers >
Graduate School of Information Science and Technology / Faculty of Information Science and Technology >
Peer-reviewed Journal Articles, etc >

A bad arm existence checking problem: How to utilize asymmetric problem structure?

Files in This Item:

ecmlmlj2019r2c.pdf

577.04 kB

PDF

View/Open

Please use this identifier to cite or link to this item:http://hdl.handle.net/2115/80332

Title:	A bad arm existence checking problem: How to utilize asymmetric problem structure?
Authors:	Tabata, Koji Browse this author
	Nakamura, Atsuyoshi Browse this author →KAKEN DB
	Honda, Junya Browse this author
	Komatsuzaki, Tamiki Browse this author →KAKEN DB
Keywords:	Online learning
	Bandit problem
	Best arm identification
Issue Date:	30-Oct-2019
Publisher:	Springer
Journal Title:	Machine learning
Volume:	109
Start Page:	327
End Page:	372
Publisher DOI:	10.1007/s10994-019-05854-7
Abstract:	We study a bad arm existence checking problem in a stochastic K-armed bandit setting, in which a player's task is to judge whether a positive arm exists or all the arms are negative among given K arms by drawing as small number of arms as possible. Here, an arm is positive if its expected loss suffered by drawing the arm is at least a given threshold theta(U), and it is negative if that is less than another given threshold theta(L) (<= theta(U)). This problem is a formalization of diagnosis of disease or machine failure. An interesting structure of this problem is the asymmetry of positive and negative arms' roles; finding one positive arm is enough to judge positive existence while all the arms must be discriminated as negative to judge whole negativity. In the case with Delta = theta(U) - theta(L) > 0, we propose elimination algorithms with arm selection policy (policy to determine the next arm to draw) and decision condition (condition to conclude positive arm's existence or the drawn arm's negativity) utilizing this asymmetric problem structure and prove its effectiveness theoretically and empirically.
Rights:	This is a post-peer-review, pre-copyedit version of an article published in Machine learning. The final authenticated version is available online at: http://dx.doi.org/10.1007/s10994-019-05854-7
Type:	article (author version)
URI:	http://hdl.handle.net/2115/80332
Appears in Collections:	情報科学院・情報科学研究院 (Graduate School of Information Science and Technology / Faculty of Information Science and Technology) > 雑誌発表論文等 (Peer-reviewed Journal Articles, etc)

Submitter: 中村篤祥

OAI-PMH ( junii2 , jpcoar_1.0 )

- Hokkaido University