no code implementations • LREC 2022 • Tomoki Kitagawa, Chee Siang Leow, Hiromitsu Nishizaki
This paper introduces a Y-Autoencoder (Y-AE)-based handwritten character generator to generate multiple Japanese Hiragana characters with a single image to increase the amount of data for training a handwritten character recognizer.
Optical Character Recognition Optical Character Recognition (OCR)
no code implementations • 29 Mar 2022 • Akihiro Dobashi, Chee Siang Leow, Hiromitsu Nishizaki
Furthermore, visualization of the attention weights based on the proposed method suggested that it is possible to transform acoustic features considering the frequency characteristics of each language.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
1 code implementation • 3 Apr 2021 • Yu Wang, Chee Siang Leow, Akio Kobayashi, Takehito Utsuro, Hiromitsu Nishizaki
This paper describes the ExKaldi-RT online automatic speech recognition (ASR) toolkit that is implemented based on the Kaldi ASR toolkit and Python language.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1