Search Results for author: Itai Shufaro

Found 2 papers, 0 papers with code

On Bits and Bandits: Quantifying the Regret-Information Trade-off

no code implementations26 May 2024 Itai Shufaro, Nadav Merlis, Nir Weinberger, Shie Mannor

Using this setting, we introduce the first Bayesian regret lower bounds that depend on the information an agent accumulates.

On the Global Convergence of Policy Gradient in Average Reward Markov Decision Processes

no code implementations11 Mar 2024 Navdeep Kumar, Yashaswini Murthy, Itai Shufaro, Kfir Y. Levy, R. Srikant, Shie Mannor

We present the first finite time global convergence analysis of policy gradient in the context of infinite horizon average reward Markov decision processes (MDPs).

Cannot find the paper you are looking for? You can Submit a new open access paper.