no code implementations • 17 May 2019 • Vincent Wan, Chun-an Chan, Tom Kenter, Jakub Vit, Rob Clark
The prosodic aspects of speech signals produced by current text-to-speech systems are typically averaged over training material, and as such lack the variety and liveliness found in natural speech.