no code implementations • 1 Sep 2023 • Jincheng Li, Chunyu Xie, Xiaoyu Wu, Bin Wang, Dawei Leng
A two-stage object detector includes a visual backbone, a region proposal network (RPN), and a region of interest (RoI) head.
1 code implementation • 31 Dec 2022 • Xin Ma, Chang Liu, Chunyu Xie, Long Ye, Yafeng Deng, Xiangyang Ji
Masked image modeling (MIM) has shown great promise for self-supervised learning (SSL) yet been criticized for learning inefficiency.
1 code implementation • 8 May 2022 • Chunyu Xie, Heng Cai, Jincheng Li, Fanjing Kong, Xiaoyu Wu, Jianfei Song, Henrique Morimitsu, Lin Yao, Dexin Wang, Xiangzheng Zhang, Dawei Leng, Baochang Zhang, Xiangyang Ji, Yafeng Deng
In this work, we build a large-scale high-quality Chinese Cross-Modal Benchmark named CCMB for the research community, which contains the currently largest public pre-training dataset Zero and five human-annotated fine-tuning datasets for downstream tasks.
Ranked #3 on Image Retrieval on Flickr30k-CN
1 code implementation • 23 Apr 2018 • Chunyu Xie, Ce Li, Baochang Zhang, Chen Chen, Jungong Han, Changqing Zou, Jianzhuang Liu
Specifically, the TARM is deployed in a residual learning module that employs a novel attention learning network to recalibrate the temporal attention of frames in a skeleton sequence.
Ranked #89 on Skeleton Based Action Recognition on NTU RGB+D
1 code implementation • 12 Jul 2017 • Chunyu Xie, Ce Li, Baochang Zhang, Chen Chen, Jungong Han
Gesture recognition is a challenging problem in the field of biometrics.
Ranked #1 on Hand Gesture Recognition on MGB