no code implementations • 3 Aug 2023 • Asen Nachkov, Luchen Li, Giulia Luise, Filippo Valdettaro, Aldo Faisal
To test whether optimistic ensemble method can improve on distributional RL as did on scalar RL, by e. g. Bootstrapped DQN, we implement the BoP approach with a population of distributional actor-critics using Bayesian Distributional Policy Gradients (BDPG).