no code implementations • 1 Feb 2021 • Anne Gael Manegueu, Alexandra Carpentier, Yi Yu
On top of the switching bandit problem (\textbf{Case a}), we are interested in three concrete examples: (\textbf{b}) the means of the arms are local polynomials, (\textbf{c}) the means of the arms are locally smooth, and (\textbf{d}) the gaps of the arms have a bounded number of inflexion points and where the highest arm mean cannot vary too much in a short range.
no code implementations • ICML 2020 • Anne Gael Manegueu, Claire Vernade, Alexandra Carpentier, Michal Valko
Significant work has been recently dedicated to the stochastic delayed bandit setting because of its relevance in applications.