1 code implementation • ECCV 2020 • Zhe Niu, Brian Mak
In this paper, we propose novel stochastic modeling of various components of a continuous sign language recognition (CSLR) system that is based on the transformer encoder and connectionist temporal classification (CTC).
Ranked #11 on Sign Language Recognition on RWTH-PHOENIX-Weather 2014 T
no code implementations • 2 May 2024 • Zhe Niu, Ronglai Zuo, Brian Mak, Fangyun Wei
The dataset is collected to enrich resources for HKSL and support research in large-vocabulary continuous sign language recognition (SLR) and translation (SLT).
no code implementations • ICCV 2023 • Zhe Niu, Brian Mak
Most lip-to-speech (LTS) synthesis models are trained and evaluated under the assumption that the audio-video pairs in the dataset are perfectly synchronized.