no code implementations • 8 Apr 2021 • Eric Engelhart, Mahsa Elyasi, Gaurav Bharaj
Transformer-based models require significant training data and do not generalize well, especially for dialects with limited data.
no code implementations • 8 Apr 2021 • Mahsa Elyasi, Gaurav Bharaj
In this work, we propose a novel, carefully designed strategy for conditioning Tacotron-2 on two fundamental prosodic features in English -- stress syllable and pitch accent -- that help achieve more natural prosody.