no code implementations • 25 Nov 2020 • Jorgen Valk, Tanel Alumae
Speech activity detection and speaker diarization are used to extract segments from the videos that contain speech.
Ranked #1 on Spoken language identification on KALAKA-3
Action Detection Activity Detection +4