no code implementations • 1 Jun 2023 • Simon Keizer, Caroline Dockes, Norbert Braunschweiler, Svetlana Stoyanchev, Rama Doddipatla
Reinforcement learning based dialogue policies are typically trained in interaction with a user simulator.