Search Results for author: Roman Bedyakin

Found 4 papers, 1 papers with code

Language ID Prediction from Speech Using Self-Attentive Pooling

no code implementations • NAACL (SIGTYP) 2021 • Roman Bedyakin, Nikolay Mikhaylovskiy

This memo describes NTR-TSU submission for SIGTYP 2021 Shared Task on predicting language IDs from speech.

Language Identification speech-recognition +2

Paper
Add Code

Low-Resource Spoken Language Identification Using Self-Attentive Pooling and Deep 1D Time-Channel Separable Convolutions

no code implementations • 31 May 2021 • Roman Bedyakin, Nikolay Mikhaylovskiy

In this memo, we show that a convolutional neural network with a Self-Attentive Pooling layer shows promising results in low-resource setting for the language identification task and set up a SOTA for the Low Resource ASR challenge dataset.

Language Identification speech-recognition +2

Paper
Add Code

Language ID Prediction from Speech Using Self-Attentive Pooling and 1D-Convolutions

no code implementations • 24 Apr 2021 • Roman Bedyakin, Nikolay Mikhaylovskiy

This memo describes NTR-TSU submission for SIGTYP 2021 Shared Task on predicting language IDs from speech.

Language Identification speech-recognition +2

Paper
Add Code

MediaSpeech: Multilanguage ASR Benchmark and Dataset

1 code implementation • 30 Mar 2021 • Rostislav Kolobov, Olga Okhapkina, Olga Omelchishina, Andrey Platunov, Roman Bedyakin, Vyacheslav Moshkin, Dmitry Menshikov, Nikolay Mikhaylovskiy

The performance of automated speech recognition (ASR) systems is well known to differ for varied application domains.

Ranked #1 on Speech Recognition on MediaSpeech

speech-recognition Speech Recognition

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.