Search Results for author: Datong P. Zhou

Found 1 paper, 0 papers with code

Budget-Constrained Multi-Armed Bandits with Multiple Plays

no code implementations • 16 Nov 2017 • Datong P. Zhou, Claire J. Tomlin

Secondly, for the adversarial case in which the entire sequence of rewards and costs is fixed in advance, we derive an upper bound on the regret of order $O(\sqrt{NB\log(N/K)})$ utilizing an extension of the well-known $\texttt{Exp3}$ algorithm.
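For orientation, the following is a minimal sketch of the standard single-play, budget-free Exp3 algorithm that the abstract says is extended. It is not the authors' budget-constrained, multiple-play variant. The names `exp3`, `reward_fn`, and `gamma` are hypothetical choices for illustration, and rewards are assumed to lie in [0, 1].

```python
import math
import random


def exp3(reward_fn, n_arms, horizon, gamma=0.1, seed=0):
    """Plain Exp3 (one arm per round, no costs or budget).

    reward_fn(t, arm) is a hypothetical callback returning the reward in
    [0, 1] of `arm` at round `t`; in the adversarial setting the whole
    sequence is fixed in advance.
    """
    rng = random.Random(seed)
    weights = [1.0] * n_arms
    pulls = [0] * n_arms
    total_reward = 0.0

    for t in range(horizon):
        total_w = sum(weights)
        # Mix the exponential-weights distribution with uniform exploration.
        probs = [(1 - gamma) * w / total_w + gamma / n_arms for w in weights]
        arm = rng.choices(range(n_arms), weights=probs)[0]

        reward = reward_fn(t, arm)
        # Importance-weighted estimate: unbiased for the pulled arm's reward.
        estimate = reward / probs[arm]
        weights[arm] *= math.exp(gamma * estimate / n_arms)

        # Rescale so the largest weight is 1; this leaves probs unchanged
        # and avoids floating-point overflow on long horizons.
        max_w = max(weights)
        weights = [w / max_w for w in weights]

        total_reward += reward
        pulls[arm] += 1

    return total_reward, pulls


if __name__ == "__main__":
    # Fixed reward sequence: arm 2 always pays 1, the others pay 0.
    total, pulls = exp3(lambda t, arm: 1.0 if arm == 2 else 0.0,
                        n_arms=3, horizon=2000)
    print(total, pulls)
```

In this sketch the exploration rate `gamma` is a fixed constant; bounds of the kind quoted above typically rely on tuning such a parameter to the problem size, which is not attempted here.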

Multi-Armed Bandits
