Search Results for author: Imran Ahamad Sheikh

Found 3 papers, 0 papers with code

Improving Speaker Assignment in Speaker-Attributed ASR for Real Meeting Applications

no code implementations • 11 Mar 2024 • Can Cui, Imran Ahamad Sheikh, Mostafa Sadeghi, Emmanuel Vincent

Past studies on end-to-end meeting transcription have focused on model architecture and have mostly been evaluated on simulated meeting data.

Action Detection Activity Detection +2

Paper
Add Code

End-to-end Joint Rich and Normalized ASR with a limited amount of rich training data

no code implementations • 29 Nov 2023 • Can Cui, Imran Ahamad Sheikh, Mostafa Sadeghi, Emmanuel Vincent

End-to-end (E2E) ASR models offer both convenience and the ability to perform such joint transcription of speech.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Paper
Add Code

End-to-end Multichannel Speaker-Attributed ASR: Speaker Guided Decoder and Input Feature Analysis

no code implementations • 16 Oct 2023 • Can Cui, Imran Ahamad Sheikh, Mostafa Sadeghi, Emmanuel Vincent

We present an end-to-end multichannel speaker-attributed automatic speech recognition (MC-SA-ASR) system that combines a Conformer-based encoder with multi-frame crosschannel attention and a speaker-attributed Transformer-based decoder.

Automatic Speech Recognition Decoder +3

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.