no code implementations • 3 Apr 2023 • Tian Huey Teh, Vivian Hu, Devang S Ram Mohan, Zack Hodari, Christopher G. R. Wallis, Tomás Gomez Ibarrondo, Alexandra Torresquintero, James Leoni, Mark Gales, Simon King
Generating expressive speech with rich and varied prosody continues to be a challenge for Text-to-Speech.
no code implementations • 14 Mar 2023 • Dan Andrei Iliescu, Devang Savita Ram Mohan, Tian Huey Teh, Zack Hodari
We address the problem of human-in-the-loop control for generating prosody in the context of text-to-speech synthesis.
no code implementations • 4 Nov 2020 • Sri Karlapati, Ammar Abbas, Zack Hodari, Alexis Moinet, Arnaud Joly, Penny Karanasou, Thomas Drugman
In Stage II, we propose a novel method to sample from this learnt prosodic distribution using the contextual information available in text.
1 code implementation • 14 Mar 2020 • Zack Hodari, Catherine Lai, Simon King
In English, prosody adds a broad range of information to segment sequences, from information structure (e. g. contrast) to stylistic variation (e. g. expression of emotion).
1 code implementation • 10 Jun 2019 • Zack Hodari, Oliver Watts, Simon King
A generative model that can synthesise multiple prosodies will, by design, not model average prosody.