Search Results for author: Itai Shufaro

Found 1 paper, 0 papers with code

On the Global Convergence of Policy Gradient in Average Reward Markov Decision Processes

no code implementations • 11 Mar 2024 • Navdeep Kumar, Yashaswini Murthy, Itai Shufaro, Kfir Y. Levy, R. Srikant, Shie Mannor

We present the first finite-time global convergence analysis of policy gradient in the context of infinite-horizon average-reward Markov decision processes (MDPs).
