Search Results for author: Thibault Cordier

Few-Shot Structured Policy Learning for Multi-Domain and Multi-Task Dialogues

Reinforcement learning has been widely adopted to model dialogue managers in task-oriented dialogues.

Paper
Add Code

Task-oriented dialogue systems are designed to achieve specific goals while conversing with humans.

Paper
Add Code

We notably propose a randomised exploration policy which allows for a seamless hybridisation of the learned policy and the expert.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.