no code implementations • 22 Feb 2023 • Thibault Cordier, Tanguy Urvoy, Fabrice Lefèvre, Lina M. Rojas-Barahona
Reinforcement learning has been widely adopted to model dialogue managers in task-oriented dialogues.
no code implementations • SIGDIAL (ACL) 2022 • Thibault Cordier, Tanguy Urvoy, Fabrice Lefèvre, Lina M. Rojas-Barahona
Task-oriented dialogue systems are designed to achieve specific goals while conversing with humans.
no code implementations • 25 Nov 2020 • Thibault Cordier, Tanguy Urvoy, Lina M. Rojas-Barahona, Fabrice Lefèvre
We notably propose a randomised exploration policy which allows for a seamless hybridisation of the learned policy and the expert.