1 code implementation • 27 Mar 2022 • Juan Leon Alcazar, Moritz Cordes, Chen Zhao, Bernard Ghanem
Recent advances in the Active Speaker Detection (ASD) problem build upon a two-stage process: feature extraction and spatio-temporal context aggregation.
Ranked #5 on Audio-Visual Active Speaker Detection on AVA-ActiveSpeaker (using extra training data)