The nonstochastic multiarmed bandit problem
[9] P. Auer, N. Cesa-Bianchi, Y. Freund, and R. E. Schapire. The nonstochastic multiarmed bandit problem. SIAM journal on computing, 32:48–77, 2002.
PreviousRegret analysis of stochastic and nonstochastic multi-armed bandit problemsNextInformation theory of decisions and actions
Last updated