Search Results for author: Bingshen Mu

Found 4 papers, 0 papers with code

MMGER: Multi-modal and Multi-granularity Generative Error Correction with LLM for Joint Accent and Speech Recognition

no code implementations6 May 2024 Bingshen Mu, Yangze Li, Qijie Shao, Kun Wei, Xucheng Wan, Naijun Zheng, Huan Zhou, Lei Xie

Accents represent deviations from standard pronunciation norms, and the multi-task learning framework for simultaneous ASR and accent recognition (AR) has effectively addressed the multi-accent scenarios, making it a prominent solution.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

The NPU-ASLP System for Audio-Visual Speech Recognition in MISP 2022 Challenge

no code implementations11 Mar 2023 Pengcheng Guo, He Wang, Bingshen Mu, Ao Zhang, Peikun Chen

This paper describes our NPU-ASLP system for the Audio-Visual Diarization and Recognition (AVDR) task in the Multi-modal Information based Speech Processing (MISP) 2022 Challenge.

Audio-Visual Speech Recognition speech-recognition +1

Cannot find the paper you are looking for? You can Submit a new open access paper.