Search Results for author: Volodymyr Tkachuk

Found 4 papers, 0 papers with code

Trajectory Data Suffices for Statistically Efficient Learning in Offline RL with Linear $q^π$-Realizability and Concentrability

no code implementations27 May 2024 Volodymyr Tkachuk, Gellért Weisz, Csaba Szepesvári

The hope in this setting is that learning a good policy will be possible without requiring a sample size that scales with the number of states in the MDP.

Regret Minimization via Saddle Point Optimization

no code implementations NeurIPS 2023 Johannes Kirschner, Seyed Alireza Bakhtiari, Kushagra Chandak, Volodymyr Tkachuk, Csaba Szepesvári

A long line of works characterizes the sample complexity of regret minimization in sequential decision-making by min-max programs.

Decision Making

The Effect of Q-function Reuse on the Total Regret of Tabular, Model-Free, Reinforcement Learning

no code implementations7 Mar 2021 Volodymyr Tkachuk, Sriram Ganapathi Subramanian, Matthew E. Taylor

We aim to bridge the gap between theoretical and empirical work in $Q$-function reuse by providing some theoretical insights on the effectiveness of $Q$-function reuse when applied to the $Q$-learning with UCB-Hoeffding algorithm.

Q-Learning Transfer Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.