The nonstochastic multiarmed bandit problem
[9] P. Auer, N. Cesa-Bianchi, Y. Freund, and R. E. Schapire. The nonstochastic multiarmed bandit problem. SIAM journal on computing, 32:48–77, 2002.
Copy link