Search Results for author: Volodymyr Tkachuk

Found 3 papers, 0 papers with code

Regret Minimization via Saddle Point Optimization

no code implementations NeurIPS 2023 Johannes Kirschner, Seyed Alireza Bakhtiari, Kushagra Chandak, Volodymyr Tkachuk, Csaba Szepesvári

A long line of works characterizes the sample complexity of regret minimization in sequential decision-making by min-max programs.

Decision Making

The Effect of Q-function Reuse on the Total Regret of Tabular, Model-Free, Reinforcement Learning

no code implementations7 Mar 2021 Volodymyr Tkachuk, Sriram Ganapathi Subramanian, Matthew E. Taylor

We aim to bridge the gap between theoretical and empirical work in $Q$-function reuse by providing some theoretical insights on the effectiveness of $Q$-function reuse when applied to the $Q$-learning with UCB-Hoeffding algorithm.

Q-Learning Transfer Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.