no code implementations • 29 Nov 2023 • Jinhui Ye, Jiaming Zhou, Hui Xiong, Junwei Liang
Specifically, at the core of GeoDeformer is the Geometric Deformation Predictor, a module designed to identify and quantify potential spatial and temporal geometric deformations within the given video.
no code implementations • 19 Aug 2023 • Jinhui Ye, Junwei Liang
This paper studies introducing viewpoint invariant feature representations in existing action recognition architecture.
1 code implementation • 18 May 2023 • Jinhui Ye, Wenxiang Jiao, Xing Wang, Zhaopeng Tu, Hui Xiong
It has been a challenging task due to the modality gap between sign videos and texts and the data scarcity of labeled data.
Ranked #3 on Sign Language Translation on CSL-Daily
1 code implementation • 13 Oct 2022 • Jinhui Ye, Wenxiang Jiao, Xing Wang, Zhaopeng Tu
In this paper, to overcome the limitation, we propose a Prompt based domain text Generation (PGEN) approach to produce the large-scale in-domain spoken language text data.