no code implementations • 29 Sep 2021 • James A. Grant, David S. Leslie
We consider a variant of online binary classification where a learner sequentially assigns labels ($0$ or $1$) to items with unknown true class.
no code implementations • 7 Sep 2020 • James A. Grant, David S. Leslie
The learning to rank (LTR) framework models this problem as a sequential problem of selecting lists of content and observing where users decide to click.
no code implementations • 20 Jul 2020 • James A. Grant, Roberto Szechtman
We consider a version of the continuum armed bandit where an action induces a filtered realisation of a non-homogeneous Poisson process.
no code implementations • 8 Jan 2020 • James A. Grant, David S. Leslie
Thompson Sampling is a well established approach to bandit and reinforcement learning problems.
no code implementations • 16 May 2019 • James A. Grant, Alexis Boukouvalas, Ryan-Rhys Griffiths, David S. Leslie, Sattar Vakili, Enrique Munoz de Cote
We consider the problem of adaptively placing sensors along an interval to detect stochastically-generated events.
no code implementations • 4 Oct 2018 • James A. Grant, David S. Leslie, Kevin Glazebrook, Roberto Szechtman, Adam N. Letchford
Maximising the detection of intrusions is a fundamental and often critical aim of perimeter surveillance.
no code implementations • 26 May 2017 • James A. Grant, David S. Leslie, Kevin Glazebrook, Roberto Szechtman
Motivated by problems in search and detection we present a solution to a Combinatorial Multi-Armed Bandit (CMAB) problem with both heavy-tailed reward distributions and a new class of feedback, filtered semibandit feedback.