History: Seminar28042015


Preview of this version: 1 (current)

April 28th

14:30, R2014 Digiteo Shannon (660) (see location):

Vianney Perchet

Title: Optimal Sample Size in Multi-Phase Learning

Abstract:

Motivated by practical applications, chiefly clinical trials, we study the regret achievable for stochastic multi-armed bandits under the constraint that the employed policy must operate in a small number of phases. Our results show that a very small number of phases already yields regret bounds close to minimax optimal. We also evaluate the number of trials in each phase.

Contact: cyril.furtlehner at inria.fr

History


Information	Version
Tue. 21 Apr, 2015 11:16 furtlehn from 129.175.15.11	1