no code implementations • 28 Sep 2021 • Shilun Lin, Wenchao Su, Li Meng, Fenglong Xie, Xinhui Li, Li Lu
Thirdly, a duration predictor, instead of an attention model, connects the above hybrid encoder and decoder.
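
As a rough illustration of how a duration predictor can take the place of attention, the sketch below shows length regulation in the style used by non-attentive TTS models: each encoder output is repeated for its predicted number of frames, so the decoder receives an already aligned sequence. The excerpt above does not say that the paper does exactly this; the function name `expand_by_duration` and the toy inputs are hypothetical, and the durations are assumed to be given as non-negative integers.

```python
from typing import List, Sequence


def expand_by_duration(
    encoder_frames: Sequence[Sequence[float]],
    durations: Sequence[int],
) -> List[List[float]]:
    """Repeat encoder_frames[i] durations[i] times (hypothetical helper).

    This stands in for the alignment that an attention model would
    otherwise learn between encoder and decoder time steps.
    """
    if len(encoder_frames) != len(durations):
        raise ValueError("need exactly one duration per encoder frame")

    expanded: List[List[float]] = []
    for frame, duration in zip(encoder_frames, durations):
        if duration < 0:
            raise ValueError("durations must be non-negative")
        # Copy the vector once per predicted output frame.
        expanded.extend(list(frame) for _ in range(duration))
    return expanded


# Toy input: three phoneme-level vectors with assumed predicted durations.
phoneme_vectors = [[0.1, 0.2], [0.3, 0.4], [0.5, 0.6]]
predicted_durations = [2, 0, 3]
decoder_input = expand_by_duration(phoneme_vectors, predicted_durations)
print(len(decoder_input))
```

Because the alignment is fixed by the durations, it is monotonic by construction, which is the usual reason given for preferring this approach over attention when robustness matters.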
no code implementations • 30 Jan 2021 • Shilun Lin, Fenglong Xie, Li Meng, Xinhui Li, Li Lu
In this work, a robust and efficient text-to-speech (TTS) synthesis system named Triple M is proposed for large-scale online application.