no code implementations • 8 Jan 2024 • Tobias Cord-Landwehr, Christoph Boeddeker, Cătălin Zorilă, Rama Doddipatla, Reinhold Haeb-Umbach
We propose a modified teacher-student training for the extraction of frame-wise speaker embeddings that allows for an effective diarization of meeting scenarios containing partially overlapping speech.
no code implementations • 1 Jun 2023 • Tobias Cord-Landwehr, Christoph Boeddeker, Cătălin Zorilă, Rama Doddipatla, Reinhold Haeb-Umbach
We introduce a monaural neural speaker embedding extractor that computes an embedding for each speaker present in a speech mixture.
no code implementations • 1 Jun 2023 • Tobias Cord-Landwehr, Christoph Boeddeker, Cătălin Zorilă, Rama Doddipatla, Reinhold Haeb-Umbach
Using a teacher-student training approach, we developed a speaker embedding extraction system that outputs embeddings at frame rate.