no code implementations • 25 Nov 2023 • Pouya Parsa, Raoof Zare Moayedi, Mohammad Bornosi, Mohammad Mahdi Bejani
The strategy retrieves more informative trajectories, so the agent can learn with fewer trajectory sample.
reinforcement-learning