"Towards Minimax Policies for Online Linear Optimization with Bandit Feedback."

Sébastien Bubeck, Nicolò Cesa-Bianchi, Sham M. Kakade (2012)

Details and statistics

DOI:

access: open

type: Conference or Workshop Paper

metadata version: 2019-05-29

a service of Schloss Dagstuhl - Leibniz Center for Informatics