no code implementations • 25 Jan 2024 • Sunghee Jung, Won Jang, Jaesam Yoon, BongWan Kim
Zero-shot TTS demands additional efforts to ensure clear pronunciation and speech quality due to its inherent requirement of replacing a core parameter (speaker embedding or acoustic prompt) with a new one at the inference stage.
2 code implementations • 31 Mar 2022 • Dan Lim, Sunghee Jung, Eesung Kim
In neural text-to-speech (TTS), two-stage system or a cascade of separately learned models have shown synthesis quality close to human speech.
1 code implementation • Interspeech 2020 • Sunghee Jung, Hoirin Kim
To deal with this issue, we propose two models, hard and soft pitchtron and release the toolkit and corpus that we have developed.