1 code implementation • 7 Dec 2022 • Alberto Maria Metelli, Francesco Trovò, Matteo Pirola, Marcello Restelli
This paper is in the field of stochastic Multi-Armed Bandits (MABs), i. e., those sequential selection techniques able to learn online using only the feedback given by the chosen option (a. k. a.