Search Results for author: Vladimir Bataev

Found 5 papers, 1 papers with code

Powerful and Extensible WFST Framework for RNN-Transducer Losses

no code implementations • 18 Mar 2023 • Aleksandr Laptev, Vladimir Bataev, Igor Gitman, Boris Ginsburg

This paper presents a framework based on Weighted Finite-State Transducers (WFST) to simplify the development of modifications for RNN-Transducer (RNN-T) loss.

Paper
Add Code

Text-only domain adaptation for end-to-end ASR using integrated text-to-mel-spectrogram generator

no code implementations • 27 Feb 2023 • Vladimir Bataev, Roman Korostik, Evgeny Shabalin, Vitaly Lavrukhin, Boris Ginsburg

We propose an end-to-end Automatic Speech Recognition (ASR) system that can be trained on transcribed speech data, text-only data, or a mixture of both.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

Digital Peter: Dataset, Competition and Handwriting Recognition Methods

2 code implementations • 16 Mar 2021 • Mark Potanin, Denis Dimitrov, Alex Shonenkov, Vladimir Bataev, Denis Karachev, Maxim Novopoltsev

This paper presents a new dataset of Peter the Great's manuscripts and describes a segmentation procedure that converts initial images of documents into the lines.

BIG-bench Machine Learning Handwriting Recognition +1

Paper
Code

Techniques for Vocabulary Expansion in Hybrid Speech Recognition Systems

no code implementations • 19 Mar 2020 • Nikolay Malkovsky, Vladimir Bataev, Dmitrii Sviridkin, Natalia Kizhaeva, Aleksandr Laptev, Ildar Valiev, Oleg Petrov

The problem of out of vocabulary words (OOV) is typical for any speech recognition system, hybrid systems are usually constructed to recognize a fixed set of words and rarely can include all the words that will be encountered during exploitation of the system.

graph construction speech-recognition +1

Paper
Add Code

Exploring End-to-End Techniques for Low-Resource Speech Recognition

no code implementations • 2 Jul 2018 • Vladimir Bataev, Maxim Korenevsky, Ivan Medennikov, Alexander Zatvornitskiy

In this work we present simple grapheme-based system for low-resource speech recognition using Babel data for Turkish spontaneous speech (80 hours).

speech-recognition Speech Recognition

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.