no code implementations • 20 Jun 2023 • Jakub Swiatkowski, Duo Wang, Mikolaj Babianski, Giuseppe Coccia, Patrick Lumban Tobing, Ravichander Vipperla, Viacheslav Klimkov, Vincent Pollet
Speech generation for machine dubbing adds complexity to conventional Text-To-Speech solutions as the generated output is required to match the expressiveness, emotion and speaking rate of the source content.
no code implementations • 20 Jun 2023 • Jakub Swiatkowski, Duo Wang, Mikolaj Babianski, Patrick Lumban Tobing, Ravichander Vipperla, Vincent Pollet
Prosody transfer is well-studied in the context of expressive speech synthesis.
no code implementations • 10 Feb 2021 • Giuseppe Ruggiero, Enrico Zovato, Luigi di Caro, Vincent Pollet
This is the main reason why the TTS models are usually single speaker.