1 code implementation • 25 Jan 2024 • Samuel Pegg, Kai Li, Xiaolin Hu
TDANet serves as the architectural foundation for the auditory and visual networks within TDFNet, offering an efficient model with fewer parameters.
Ranked #1 on Speech Separation on LRS2
1 code implementation • 29 Sep 2023 • Samuel Pegg, Kai Li, Xiaolin Hu
This is the first time-frequency domain audio-visual speech separation method to outperform all contemporary time-domain counterparts.
Ranked #1 on Speech Separation on VoxCeleb2