no code implementations • 8 Apr 2022 • Eesung Kim, Jae-Jin Jeon, Hyeji Seo, Hoon Kim
Self-supervised learning (SSL) approaches such as wav2vec 2. 0 and HuBERT models have shown promising results in various downstream tasks in the speech community.
2 code implementations • 31 Mar 2022 • Dan Lim, Sunghee Jung, Eesung Kim
In neural text-to-speech (TTS), two-stage system or a cascade of separately learned models have shown synthesis quality close to human speech.
no code implementations • 2 Nov 2020 • Jae-Jin Jeon, Eesung Kim
Recently, several types of end-to-end speech recognition methods named transformer-transducer were introduced.