Search Results for author: Jean-Hugh Thomas

Channel-Combination Algorithms for Robust Distant Voice Activity and Overlapped Speech Detection

A channel-number invariant loss is proposed to learn a unique feature representation regardless of the number of available microphones.

Voice activity detection (VAD) and overlapped speech detection (OSD) are key pre-processing tasks for speaker diarization.

Pipeline systems rely on speech segmentation to extract speakers' segments and achieve robust speaker diarization.

