Search Results for author: James A. Grant

Found 7 papers, 0 papers with code

Apple Tasting Revisited: Bayesian Approaches to Partially Monitored Online Binary Classification

no code implementations • 29 Sep 2021 • James A. Grant, David S. Leslie

We consider a variant of online binary classification where a learner sequentially assigns labels ($0$ or $1$) to items with unknown true class.

Binary Classification Thompson Sampling

Paper
Add Code

Learning to Rank under Multinomial Logit Choice

no code implementations • 7 Sep 2020 • James A. Grant, David S. Leslie

The learning to rank (LTR) framework models this problem as a sequential problem of selecting lists of content and observing where users decide to click.

Learning-To-Rank Position

Paper
Add Code

Filtered Poisson Process Bandit on a Continuum

no code implementations • 20 Jul 2020 • James A. Grant, Roberto Szechtman

We consider a version of the continuum armed bandit where an action induces a filtered realisation of a non-homogeneous Poisson process.

Paper
Add Code

On Thompson Sampling for Smoother-than-Lipschitz Bandits

no code implementations • 8 Jan 2020 • James A. Grant, David S. Leslie

Thompson Sampling is a well established approach to bandit and reinforcement learning problems.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Add Code

Adaptive Sensor Placement for Continuous Spaces

no code implementations • 16 May 2019 • James A. Grant, Alexis Boukouvalas, Ryan-Rhys Griffiths, David S. Leslie, Sattar Vakili, Enrique Munoz de Cote

We consider the problem of adaptively placing sensors along an interval to detect stochastically-generated events.

Thompson Sampling

Paper
Add Code

Adaptive Policies for Perimeter Surveillance Problems

no code implementations • 4 Oct 2018 • James A. Grant, David S. Leslie, Kevin Glazebrook, Roberto Szechtman, Adam N. Letchford

Maximising the detection of intrusions is a fundamental and often critical aim of perimeter surveillance.

Paper
Add Code

Combinatorial Multi-Armed Bandits with Filtered Feedback

no code implementations • 26 May 2017 • James A. Grant, David S. Leslie, Kevin Glazebrook, Roberto Szechtman

Motivated by problems in search and detection we present a solution to a Combinatorial Multi-Armed Bandit (CMAB) problem with both heavy-tailed reward distributions and a new class of feedback, filtered semibandit feedback.

Multi-Armed Bandits

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.