Search Results for author: Daniela Sánchez Lopera

Found 1 paper, 1 paper with code

MEET: A Monte Carlo Exploration-Exploitation Trade-off for Buffer Sampling

1 code implementation • 24 Oct 2022 • Julius Ott, Lorenzo Servadei, Jose Arjona-Medina, Enrico Rinaldi, Gianfranco Mauro, Daniela Sánchez Lopera, Michael Stephan, Thomas Stadelmayer, Avik Santra, Robert Wille

This is enabled by the uncertainty estimation of the Q-Value function, which guides the sampling to explore more significant transitions and, thus, learn a more efficient policy.

reinforcement-learning, Reinforcement Learning (RL)
