1 code implementation • 20 Feb 2024 • Wenxiao Cai, Wankou Yang
Current methodologies exhibit the ability to preserve local geometric structures, yet fall short in maintaining relationships between these geometric structures.
1 code implementation • 23 Jan 2024 • Fei Xie, Wankou Yang, Chunyu Wang, Lei Chu, Yue Cao, Chao Ma, Wenjun Zeng
Thus, we reformulate the two-branch Siamese tracking as a conceptually simple, fully transformer-based Single-Branch Tracking pipeline, dubbed SBT.
1 code implementation • 11 Dec 2023 • Jifeng Shen, Teng Guo, Xin Zuo, Heng Fan, Wankou Yang
The AFSS module learns to provide reasonable scale prior information for different attribute groups, allowing the model to focus on different levels of feature maps with varying semantic granularity.
no code implementations • 11 Oct 2023 • Ke Jin, Wankou Yang
Later works, such as DenseCLIP and LSeg, extend this paradigm to dense prediction, including semantic segmentation, and have achieved excellent results.
2 code implementations • ICCV 2023 • Lingyu Xiao, Xiang Li, Sen yang, Wankou Yang
In this paper, we revisit the limitations of anchor-based lane detection methods, which have predominantly focused on fixed anchors that stem from the edges of the image, disregarding their versatility and quality.
1 code implementation • ICCV 2023 • Ziyu Li, Jingming Guo, Tongtong Cao, Liu Bingbing, Wankou Yang
LiDAR-based 3D detection has made great progress in recent years.
1 code implementation • 15 Aug 2023 • Jifeng Shen, Yifei Chen, Yue Liu, Xin Zuo, Heng Fan, Wankou Yang
Effective feature fusion of multispectral images plays a crucial role in multi-spectral object detection.
Ranked #2 on Object Detection on VEDAI
1 code implementation • 23 May 2023 • Wenxiao Cai, Ke Jin, Jinyan Hou, Cong Guo, Letian Wu, Wankou Yang
We expect that our dataset will generate considerable interest in drone image segmentation and serve as a foundation for other drone vision tasks.
Ranked #1 on Semantic Segmentation on VDD
1 code implementation • IEEE ROBOTICS AND AUTOMATION LETTERS 2023 • Lineng Chen, Huan Wang, Hui Kong, Wankou Yang, Mingwu Ren
To address this issue, we propose a novel Point-wise Transformer with sparse Convolution (PTC).
no code implementations • 13 Apr 2023 • Letian Wu, Wenyao Zhang, Tengping Jiang, Wankou Yang, Xin Jin, Wenjun Zeng
Based on that, we build upon the CLIP model as a backbone which we extend with a One-Way [CLS] token navigation from text to the visual branch that enables zero-shot dense prediction, dubbed \textbf{ClsCLIP}.
no code implementations • 4 Mar 2023 • Jiren Mai, Fei Zhang, Junjie Ye, Marcus Kalander, Xian Zhang, Wankou Yang, Tongliang Liu, Bo Han
Motivated by this simple but effective learning pattern, we propose a General-Specific Learning Mechanism (GSLM) to explicitly drive a coarse-grained CAM to a fine-grained pseudo mask.
1 code implementation • 1 Mar 2023 • Sen yang, Wen Heng, Gang Liu, Guozhong Luo, Wankou Yang, Gang Yu
In this paper we present a novel method to estimate 3D human pose and shape from monocular videos.
Ranked #35 on 3D Human Pose Estimation on 3DPW
no code implementations • 23 Feb 2023 • Guangtao Wang, Jun Li, Zhijian Wu, Jianhua Xu, Jifeng Shen, Wankou Yang
Besides, this is conducive to estimating the locations of faces and enhancing the descriptive power of face features.
1 code implementation • 31 Oct 2022 • Junlong Tong, Liping Xie, Wankou Yang, Kanjian Zhang
The Transformer is employed to learn temporal patterns and implement primary probabilistic forecasts, while the conditional generative model is used to achieve non-autoregressive hierarchical probabilistic forecasts by introducing latent space feature representations.
no code implementations • 13 Aug 2022 • Ming Dai, Enhui Zheng, ZhenHua Feng, Jiahao Chen, Wankou Yang
To validate the practicality of our framework, we construct a paired dataset, namely UL14, that consists of UAV and satellite views.
1 code implementation • CVPR 2022 • Fei Xie, Chunyu Wang, Guangting Wang, Yue Cao, Wankou Yang, Wenjun Zeng
In contrast to the Siamese-like feature extraction, our network deeply embeds cross-image feature correlation in multiple layers of the feature network.
1 code implementation • 23 Jan 2022 • Ming Dai, Enhui Zheng, ZhenHua Feng, Jiedong Zhuang, Wankou Yang
Last, we enhance the Recall@K metric and introduce a new measurement, SDM@K, to evaluate the performance of a trained model from both the retrieval and localization perspectives simultaneously.
1 code implementation • 5 Dec 2021 • Fei Xie, Chunyu Wang, Guangting Wang, Wankou Yang, Wenjun Zeng
We present a Siamese-like Dual-branch network based on solely Transformers for tracking.
1 code implementation • 25 Nov 2021 • Sen yang, Zhicheng Wang, Ze Chen, YanJie Li, Shoukui Zhang, Zhibin Quan, Shu-Tao Xia, Yiping Bao, Erjin Zhou, Wankou Yang
This paper presents a new method to solve keypoint detection and instance association by using Transformer.
Ranked #10 on Multi-Person Pose Estimation on MS COCO
1 code implementation • 29 Jul 2021 • Ziwei Chen, Yiye Wang, Wankou Yang
Video based fall detection accuracy has been largely improved due to the recent progress on deep convolutional neural networks.
3 code implementations • 7 Jul 2021 • YanJie Li, Sen yang, Peidong Liu, Shoukui Zhang, Yunxiao Wang, Zhicheng Wang, Wankou Yang, Shu-Tao Xia
The 2D heatmap-based approaches have dominated Human Pose Estimation (HPE) for years due to high performance.
1 code implementation • ICCV 2021 • YanJie Li, Shoukui Zhang, Zhicheng Wang, Sen yang, Wankou Yang, Shu-Tao Xia, Erjin Zhou
Most existing CNN-based methods do well in visual representation, however, lacking in the ability to explicitly learn the constraint relationships between keypoints.
1 code implementation • 29 Mar 2021 • Ziyu Li, Yuncong Yao, Zhibin Quan, Wankou Yang, Jin Xie
Specifically, we design the Spatial Information Enhancement (SIE) module to predict the spatial shapes of the foreground points within proposals, and extract the structure information to learn the representative features for further box refinement.
no code implementations • 17 Jan 2021 • Shuangping Jin, ZhenHua Feng, Wankou Yang, Josef Kittler
Different from the standard BN layer that uses all the training data to calculate a single set of parameters, SepBN considers that the samples of a training dataset may belong to different sub-domains.
1 code implementation • ICCV 2021 • Sen yang, Zhibin Quan, Mu Nie, Wankou Yang
While CNN-based models have made remarkable progress on human pose estimation, what spatial dependencies they capture to localize keypoints remains unclear.
Ranked #3 on Pose Estimation on OCHuman (Validation AP metric)
1 code implementation • 21 Sep 2020 • Fei Xie, Wankou Yang, Bo Liu, Kaihua Zhang, Wanli Xue, WangMeng Zuo
Existing visual object tracking usually learns a bounding-box based template to match the targets across frames, which cannot accurately learn a pixel-wise representation, thereby being limited in handling severe appearance variations.
1 code implementation • 16 Sep 2019 • Bingwen Hu, Zhedong Zheng, Ping Liu, Wankou Yang, Mingwu Ren
Given two facial images with and without eyeglasses, the proposed model learns to swap the eye area in two faces.
2 code implementations • 16 Sep 2019 • Sen Yang, Wankou Yang, Zhen Cui
Neural Architecture Search (NAS) technologies have emerged in many domains to jointly learn the architectures and weights of the neural network.
Ranked #13 on Keypoint Detection on MS COCO
no code implementations • 16 Mar 2018 • Feng Liu, Tao Xiang, Timothy M. Hospedales, Wankou Yang, Changyin Sun
The iVQA task is to generate a question that corresponds to a given image and answer pair.
no code implementations • CVPR 2018 • Feng Liu, Tao Xiang, Timothy M. Hospedales, Wankou Yang, Changyin Sun
The iVQA task is to generate a question that corresponds to a given image and answer pair.
no code implementations • 16 Mar 2017 • Yazhou Yao, Jian Zhang, Fumin Shen, Xian-Sheng Hua, Wankou Yang, Zhenmin Tang
To tackle these problems, in this work, we exploit general corpus information to automatically select and subsequently classify web images into semantic rich (sub-)categories.
no code implementations • CVPR 2017 • Feng Liu, Tao Xiang, Timothy M. Hospedales, Wankou Yang, Changyin Sun
We propose a simple modification to the design pattern that makes learning more effective and efficient.
no code implementations • 29 Apr 2016 • Biyun Sheng, Chunhua Shen, Guosheng Lin, Jun Li, Wankou Yang, Changyin Sun
Crowd counting is an important task in computer vision, which has many applications in video surveillance.