Search Results for author: Orin Levy

Found 4 papers, 0 papers with code

Efficient Rate Optimal Regret for Adversarial Contextual MDPs Using Online Function Approximation

no code implementations2 Mar 2023 Orin Levy, Alon Cohen, Asaf Cassel, Yishay Mansour

To the best of our knowledge, our algorithm is the first efficient rate optimal regret minimization algorithm for adversarial CMDPs that operates under the minimal standard assumption of online function approximation.

regression

Eluder-based Regret for Stochastic Contextual MDPs

no code implementations27 Nov 2022 Orin Levy, Asaf Cassel, Alon Cohen, Yishay Mansour

To the best of our knowledge, our algorithm is the first efficient and rate-optimal regret minimization algorithm for CMDPs that operates under the general offline function approximation setting.

regression

Optimism in Face of a Context: Regret Guarantees for Stochastic Contextual MDP

no code implementations22 Jul 2022 Orin Levy, Yishay Mansour

For the latter, our algorithm obtains regret bound of $\widetilde{O}( (H+{1}/{p_{min}})H|S|^{3/2}\sqrt{|A|T\log(\max\{|\mathcal{G}|,|\mathcal{P}|\}/\delta)})$ with probability $1-\delta$, where $\mathcal{P}$ and $\mathcal{G}$ are finite and realizable function classes used to approximate the dynamics and rewards respectively, $p_{min}$ is the minimum reachability parameter, $S$ is the set of states, $A$ the set of actions, $H$ the horizon, and $T$ the number of episodes.

Learning Efficiently Function Approximation for Contextual MDP

no code implementations2 Mar 2022 Orin Levy, Yishay Mansour

We study learning contextual MDPs using a function approximation for both the rewards and the dynamics.

Cannot find the paper you are looking for? You can Submit a new open access paper.