1 code implementation • 23 Apr 2024 • Junjie Zhang, Tianci Hu, Xiaoshui Huang, Yongshun Gong, Dan Zeng
Evaluating the performance of Multi-modal Large Language Models (MLLMs), integrating both point cloud and language, presents significant challenges.
no code implementations • 4 Mar 2024 • Weiyi Lv, Yuhang Huang, Ning Zhang, Ruei-Sung Lin, Mei Han, Dan Zeng
In Multiple Object Tracking, objects often exhibit non-linear motion of acceleration and deceleration, with irregular direction changes.
no code implementations • 4 Mar 2024 • P. Bilha Githinji, Xi Yuan, Zhenglin Chen, Ijaz Gul, Dingqi Shang, Wen Liang, Jianming Deng, Dan Zeng, Dongmei Yu, Chenggang Yan, Peiwu Qin
Realizing sufficient separability between the distributions of healthy and pathological samples is a critical obstacle for pathology detection convolutional models.
no code implementations • 3 Jan 2024 • Yuzhou Yang, Yangming Zhou, Qichao Ying, Zhenxing Qian, Dan Zeng, Liang Liu
This paper reviews and summarizes the research results on fact-based fake news from the perspectives of tasks and problems, algorithm strategies, and datasets.
3 code implementations • 26 Dec 2023 • Hansong Zhang, Shikun Li, Pengju Wang, Dan Zeng, Shiming Ge
Optimization-oriented methods have become the primary approach in the field of dataset condensation for achieving SOTA results.
2 code implementations • 12 Dec 2023 • Hansong Zhang, Shikun Li, Dan Zeng, Chenggang Yan, Shiming Ge
Moreover, we cluster the "annotator groups" who share similar expertise so that their confusion matrices could be corrected together.
no code implementations • 22 Aug 2023 • Dan Zeng, Mingliang Zou, Xucheng Wang, Shuiwang Li
Lightweight deep learning (DL)-based trackers can achieve a good balance between efficiency and precision, but performance gains are limited by the compression rate.
1 code implementation • ICCV 2023 • Shuiwang Li, Yangxiang Yang, Dan Zeng, Xucheng Wang
In this paper, we propose an efficient ViT-based tracking framework, Aba-ViTrack, for UAV tracking.
1 code implementation • 26 Oct 2022 • Gongyang Li, Yike Wang, Zhi Liu, Xinpeng Zhang, Dan Zeng
The highlight of LASNet is that we fully consider the characteristics of cross-modal features at different levels, and accordingly propose three specific modules for better segmentation.
Ranked #26 on Thermal Image Segmentation on MFN Dataset
no code implementations • 19 Sep 2022 • Hailin Shi, Hang Du, Yibo Hu, Jun Wang, Dan Zeng, Ting Yao
Such a multi-shot scheme brings an inference burden, and the predefined scales inevitably have a gap from real data.
no code implementations • 2 Sep 2022 • Shuaitao Zhao, Kun Liu, Yuhang Huang, Qian Bao, Dan Zeng, Wu Liu
Human pose estimation aims to figure out the keypoints of all people in different scenes.
Ranked #22 on Pose Estimation on COCO test-dev
1 code implementation • 28 Jul 2022 • Yan Hu, Zhongxi Qiu, Dan Zeng, Li Jiang, Chen Lin, Jiang Liu
Vascular segmentation extracts blood vessels from images and serves as the basis for diagnosing various diseases, like ophthalmic diseases.
no code implementations • 14 Jul 2022 • Daichi Zhang, Fanzhao Lin, Yingying Hua, Pengju Wang, Dan Zeng, Shiming Ge
Existing image-level approaches often focus on a single frame and ignore the spatiotemporal cues hidden in deepfake videos, resulting in poor generalization and robustness.
no code implementations • 5 Jul 2022 • Xucheng Wang, Dan Zeng, Qijun Zhao, Shuiwang Li
Model compression is a promising way to narrow the gap (i.e., efficiency, precision) between DCF- and deep learning-based trackers, which has not caught much attention in UAV tracking.
1 code implementation • 12 Jun 2022 • Qichao Ying, Xiaoxiao Hu, Yangming Zhou, Zhenxing Qian, Dan Zeng, Shiming Ge
Representations from each view are separately used to coarsely predict the fidelity of the whole news, and the multimodal representations are able to predict the cross-modal consistency.
1 code implementation • 27 Mar 2022 • Xiao Wang, Yuhang Huang, Dan Zeng, Guo-Jun Qi
It trains an encoder by distinguishing positive samples from negative ones given query anchors.
Ranked #65 on Self-Supervised Image Classification on ImageNet
1 code implementation • 25 Mar 2022 • Gongyang Li, Zhi Liu, Dan Zeng, Weisi Lin, Haibin Ling
As the key component of ACCoNet, ACCoM activates the salient regions of output features of the encoder and transmits them to the decoder.
1 code implementation • 8 Mar 2022 • Shikun Li, Tongliang Liu, Jiyong Tan, Dan Zeng, Shiming Ge
This raises the following important question: how can we effectively use a small amount of trusted data to facilitate robust classifier learning from multiple annotators?
1 code implementation • CVPR 2022 • Dan Zeng, Zhiyuan Lin, Xiao Yan, YuTing Liu, Fei Wang, Bo Tang
To combat the mismatch between FR and FER data, Meta-Face2Exp uses a circuit feedback mechanism, which improves the base network with the feedback from the adaptation network.
1 code implementation • 19 May 2021 • Jiansheng Fang, Huazhu Fu, Dan Zeng, Xiao Yan, Yuguang Yan, Jiang Liu
When encountering a dubious diagnostic case, medical instance retrieval can help radiologists make evidence-based diagnoses by finding images containing instances similar to a query case from a large image database.
no code implementations • 10 May 2021 • Hailin Shi, Dan Zeng, Yichun Tai, Hang Du, Yibo Hu, ZiCheng Zhang, Tao Mei
However, unlike the existing public face datasets, in many real-world scenarios of face recognition, the depth of the training dataset is shallow, which means only two face images are available for each ID.
no code implementations • 14 Apr 2021 • Hang Du, Hailin Shi, Yinglu Liu, Dan Zeng, Tao Mei
In this paper, we aim to address the challenge of NIR-VIS masked face recognition from the perspectives of training data and training method.
no code implementations • 23 Mar 2021 • Kangkai Zhang, Chunhui Zhang, Shikun Li, Dan Zeng, Shiming Ge
Inspired by that, we propose an evolutionary knowledge distillation approach to improve the transfer effectiveness of teacher knowledge.
1 code implementation • ICCV 2021 • Dan Zeng, Yuhang Huang, Qian Bao, Junjie Zhang, Chi Su, Wu Liu
With the spirit of NAS, we propose to search for an efficient network architecture (NPPNet) to tackle two tasks at the same time.
no code implementations • 28 Sep 2020 • Hang Du, Hailin Shi, Dan Zeng, Xiao-Ping Zhang, Tao Mei
To start with, we present an overview of the end-to-end deep face recognition.
no code implementations • 20 Jul 2020 • Dan Zeng, Hailin Shi, Hang Du, Jun Wang, Zhen Lei, Tao Mei
However, the correlation between hard positive and hard negative is overlooked, and so is the relation between the margins in positive and negative logits.
3 code implementations • ECCV 2020 • Hang Du, Hailin Shi, Yuchi Liu, Jun Wang, Zhen Lei, Dan Zeng, Tao Mei
Extensive experiments on various benchmarks of face recognition show the proposed method significantly improves the training, not only in shallow face learning, but also for conventional deep face data.
no code implementations • 19 Jun 2020 • Dan Zeng, Raymond Veldhuis, Luuk Spreeuwers
As a part of this review, we introduce face detection under occlusion, a preliminary step in face recognition.
no code implementations • 13 May 2020 • Ning Zhang, Jingen Liu, Ke Wang, Dan Zeng, Tao Mei
Inspired by the human "visual tracking" capability which leverages motion cues to distinguish the target from the background, we propose a Two-Stream Residual Convolutional Network (TS-RCN) for visual tracking, which successfully exploits both appearance and motion features for model update.
no code implementations • 12 Dec 2019 • Jui-Hsin Lai, Bo Wu, Xin Wang, Dan Zeng, Tao Mei, Jingen Liu
This model associates themes with the pairwise compatibility with attention, and thus computes the outfit-wise compatibility.
no code implementations • 12 Dec 2019 • Jia Li, Tong Shen, Wei Zhang, Hui Ren, Dan Zeng, Tao Mei
The stunning progress in face manipulation methods has made it possible to synthesize realistic fake face images, which poses potential threats to our society.
no code implementations • 2 Sep 2019 • Zhao Zhang, Yan Zhang, Sheng Li, Guangcan Liu, Dan Zeng, Shuicheng Yan, Meng Wang
For auto-weighting, RFA-LCF jointly preserves the manifold structures in the basis concept space and new coordinate space in an adaptive manner by minimizing the reconstruction errors on clean data, anchor points and coordinates.
1 code implementation • 6 Aug 2019 • Chen Ma, Chenxu Zhao, Hailin Shi, Li Chen, Junhai Yong, Dan Zeng
To solve such a few-shot problem with the evolving attack, we propose a meta-learning based robust detection method to detect new adversarial attacks with limited examples.
no code implementations • 19 Oct 2018 • Han Liu, Dan Zeng, Qi Tian
Secondly, a super-pixel level database is used to train our cloud detection models based on CNN and deep forest.
no code implementations • CVPR 2018 • Feng Liu, Ronghang Zhu, Dan Zeng, Qijun Zhao, Xiaoming Liu
This paper proposes an encoder-decoder network to disentangle shape features during 3D face reconstruction from single 2D images, such that the tasks of reconstructing accurate 3D face shapes and learning discriminative shape features for face recognition can be accomplished simultaneously.
no code implementations • 9 Aug 2017 • Feng Liu, Qijun Zhao, Xiaoming Liu, Dan Zeng
Extensive experiments show that the proposed method can achieve the state-of-the-art accuracy in both face alignment and 3D face reconstruction, and benefit face recognition owing to its reconstructed PEN 3D face.
no code implementations • 20 May 2016 • Wei Shen, Yuan Jiang, Wenjing Gao, Dan Zeng, Xinggang Wang
Contour and skeleton are two complementary representations for shape recognition.
no code implementations • 21 Sep 2015 • Feng Liu, Dan Zeng, Jing Li, Qijun Zhao
Cascaded regression has been recently applied to reconstructing 3D faces from single 2D images directly in shape space, and achieved state-of-the-art performance.