no code implementations • 29 May 2023 • Yui Sudo, Kazuya Hata, Kazuhiro Nakadai
End-to-end automatic speech recognition (E2E-ASR) has the potential to improve performance, but a specific issue that needs to be addressed is the difficulty it has in handling enharmonic words: named entities (NEs) with the same pronunciation and part of speech that are spelled differently.