Search Results for author: Andrei Andrusenko

Found 8 papers, 3 papers with code

SALM: Speech-augmented Language Model with In-context Learning for Speech Recognition and Translation

1 code implementation • 13 Oct 2023 • Zhehuai Chen, He Huang, Andrei Andrusenko, Oleksii Hrinchuk, Krishna C. Puvvada, Jason Li, Subhankar Ghosh, Jagadeesh Balam, Boris Ginsburg

We present a novel Speech Augmented Language Model (SALM) with {\em multitask} and {\em in-context} learning capabilities.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

10,223

Paper
Code

Uconv-Conformer: High Reduction of Input Sequence Length for End-to-End Speech Recognition

no code implementations • 16 Aug 2022 • Andrei Andrusenko, Rauf Nasretdinov, Aleksei Romanenko

Optimization of modern ASR architectures is among the highest priority tasks since it saves many computational resources for model training and inference.

speech-recognition Speech Recognition

Paper
Add Code

LT-LM: a novel non-autoregressive language model for single-shot lattice rescoring

1 code implementation • 6 Apr 2021 • Anton Mitrofanov, Mariya Korenevskaya, Ivan Podluzhny, Yuri Khokhlov, Aleksandr Laptev, Andrei Andrusenko, Aleksei Ilin, Maxim Korenevsky, Ivan Medennikov, Aleksei Romanenko

We propose a novel rescoring approach, which processes the entire lattice in a single call to the model.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Code

Dynamic Acoustic Unit Augmentation With BPE-Dropout for Low-Resource End-to-End Speech Recognition

no code implementations • 12 Mar 2021 • Aleksandr Laptev, Andrei Andrusenko, Ivan Podluzhny, Anton Mitrofanov, Ivan Medennikov, Yuri Matveev

Researchers and industry prefer to use end-to-end ASR systems for on-device speech recognition tasks.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

Exploration of End-to-End ASR for OpenSTT -- Russian Open Speech-to-Text Dataset

no code implementations • 15 Jun 2020 • Andrei Andrusenko, Aleksandr Laptev, Ivan Medennikov

This paper presents an exploration of end-to-end automatic speech recognition systems (ASR) for the largest open-source Russian language data set -- OpenSTT.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

You Do Not Need More Data: Improving End-To-End Speech Recognition by Text-To-Speech Data Augmentation

no code implementations • 14 May 2020 • Aleksandr Laptev, Roman Korostik, Aleksey Svischev, Andrei Andrusenko, Ivan Medennikov, Sergey Rybin

Data augmentation is one of the most effective ways to make end-to-end automatic speech recognition (ASR) perform close to the conventional hybrid approach, especially when dealing with low-resource tasks.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4