no code implementations • 23 Feb 2022 • Suprovat Ghoshal, Aadirupa Saha
We introduce the \emph{Correlated Preference Bandits} problem with random utility-based choice models (RUMs), where the goal is to identify the best item from a given pool of $n$ items through online subsetwise preference feedback.