Search Results for author: Wankou Yang

Found 33 papers, 22 papers with code

Object-level Geometric Structure Preserving for Natural Image Stitching

1 code implementation • 20 Feb 2024 • Wenxiao Cai, Wankou Yang

Current methods can preserve local geometric structures, yet fall short in maintaining the relationships between these structures.

Image Stitching

Correlation-Embedded Transformer Tracking: A Single-Branch Framework

1 code implementation • 23 Jan 2024 • Fei Xie, Wankou Yang, Chunyu Wang, Lei Chu, Yue Cao, Chao Ma, Wenjun Zeng

Thus, we reformulate the two-branch Siamese tracking as a conceptually simple, fully transformer-based Single-Branch Tracking pipeline, dubbed SBT.

Feature Correlation Visual Object Tracking

SSPNet: Scale and Spatial Priors Guided Generalizable and Interpretable Pedestrian Attribute Recognition

1 code implementation • 11 Dec 2023 • Jifeng Shen, Teng Guo, Xin Zuo, Heng Fan, Wankou Yang

The AFSS module learns to provide reasonable scale prior information for different attribute groups, allowing the model to focus on different levels of feature maps with varying semantic granularity.

Attribute Pedestrian Attribute Recognition

CLIP for Lightweight Semantic Segmentation

no code implementations • 11 Oct 2023 • Ke Jin, Wankou Yang

Later works, such as DenseCLIP and LSeg, extend this paradigm to dense prediction, including semantic segmentation, and have achieved excellent results.

Segmentation Semantic Segmentation

ADNet: Lane Shape Prediction via Anchor Decomposition

2 code implementations • ICCV 2023 • Lingyu Xiao, Xiang Li, Sen Yang, Wankou Yang

In this paper, we revisit the limitations of anchor-based lane detection methods, which have predominantly focused on fixed anchors that stem from the edges of the image, disregarding their versatility and quality.

Lane Detection

GPA-3D: Geometry-aware Prototype Alignment for Unsupervised Domain Adaptive 3D Object Detection from Point Clouds

1 code implementation • ICCV 2023 • Ziyu Li, Jingming Guo, Tongtong Cao, Liu Bingbing, Wankou Yang

LiDAR-based 3D detection has made great progress in recent years.

3D Object Detection object-detection

ICAFusion: Iterative Cross-Attention Guided Feature Fusion for Multispectral Object Detection

1 code implementation • 15 Aug 2023 • Jifeng Shen, Yifei Chen, Yue Liu, Xin Zuo, Heng Fan, Wankou Yang

Effective feature fusion of multispectral images plays a crucial role in multispectral object detection.

Ranked #2 on Object Detection on VEDAI

Multispectral Object Detection object-detection +1

VDD: Varied Drone Dataset for Semantic Segmentation

1 code implementation • 23 May 2023 • Wenxiao Cai, Ke Jin, Jinyan Hou, Cong Guo, Letian Wu, Wankou Yang

We expect that our dataset will generate considerable interest in drone image segmentation and serve as a foundation for other drone vision tasks.

Ranked #1 on Semantic Segmentation on VDD

Image Segmentation Segmentation +1

PTC-Net: Point-Wise Transformer with Sparse Convolution Network for Place Recognition

1 code implementation • IEEE Robotics and Automation Letters 2023 • Lineng Chen, Huan Wang, Hui Kong, Wankou Yang, Mingwu Ren

To address this issue, we propose a novel Point-wise Transformer with sparse Convolution (PTC).

Ranked #3 on Point Cloud Retrieval on Oxford RobotCar (LiDAR 4096 points)

Point Cloud Retrieval Retrieval

[CLS] Token is All You Need for Zero-Shot Semantic Segmentation

no code implementations • 13 Apr 2023 • Letian Wu, Wenyao Zhang, Tengping Jiang, Wankou Yang, Xin Jin, Wenjun Zeng

Based on that, we build upon the CLIP model as a backbone which we extend with a One-Way [CLS] token navigation from text to the visual branch that enables zero-shot dense prediction, dubbed ClsCLIP.

Few-Shot Semantic Segmentation Language Modelling +4

Exploit CAM by itself: Complementary Learning System for Weakly Supervised Semantic Segmentation

no code implementations • 4 Mar 2023 • Jiren Mai, Fei Zhang, Junjie Ye, Marcus Kalander, Xian Zhang, Wankou Yang, Tongliang Liu, Bo Han

Motivated by this simple but effective learning pattern, we propose a General-Specific Learning Mechanism (GSLM) to explicitly drive a coarse-grained CAM to a fine-grained pseudo mask.

General Knowledge Hippocampus +2

Capturing the motion of every joint: 3D human pose and shape estimation with independent tokens

1 code implementation • 1 Mar 2023 • Sen Yang, Wen Heng, Gang Liu, Guozhong Luo, Wankou Yang, Gang Yu

In this paper we present a novel method to estimate 3D human pose and shape from monocular videos.

Ranked #35 on 3D Human Pose Estimation on 3DPW

3D human pose and shape estimation

EfficientFace: An Efficient Deep Network with Feature Enhancement for Accurate Face Detection

no code implementations • 23 Feb 2023 • Guangtao Wang, Jun Li, Zhijian Wu, Jianhua Xu, Jifeng Shen, Wankou Yang

Besides, this is conducive to estimating the locations of faces and enhancing the descriptive power of face features.

Descriptive Face Detection

Probabilistic Decomposition Transformer for Time Series Forecasting

1 code implementation • 31 Oct 2022 • Junlong Tong, Liping Xie, Wankou Yang, Kanjian Zhang

The Transformer is employed to learn temporal patterns and implement primary probabilistic forecasts, while the conditional generative model is used to achieve non-autoregressive hierarchical probabilistic forecasts by introducing latent space feature representations.

Time Series Time Series Forecasting

Finding Point with Image: A Simple and Efficient Method for UAV Self-Localization

no code implementations • 13 Aug 2022 • Ming Dai, Enhui Zheng, ZhenHua Feng, Jiahao Chen, Wankou Yang

To validate the practicality of our framework, we construct a paired dataset, namely UL14, that consists of UAV and satellite views.

Image Retrieval Retrieval +1

Correlation-Aware Deep Tracking

1 code implementation • CVPR 2022 • Fei Xie, Chunyu Wang, Guangting Wang, Yue Cao, Wankou Yang, Wenjun Zeng

In contrast to the Siamese-like feature extraction, our network deeply embeds cross-image feature correlation in multiple layers of the feature network.

Feature Correlation Visual Object Tracking

Vision-Based UAV Self-Positioning in Low-Altitude Urban Environments

1 code implementation • 23 Jan 2022 • Ming Dai, Enhui Zheng, ZhenHua Feng, Jiedong Zhuang, Wankou Yang

Last, we enhance the Recall@K metric and introduce a new measurement, SDM@K, to evaluate the performance of a trained model from both the retrieval and localization perspectives simultaneously.

Metric Learning Representation Learning

Learning Tracking Representations via Dual-Branch Fully Transformer Networks

1 code implementation • 5 Dec 2021 • Fei Xie, Chunyu Wang, Guangting Wang, Wankou Yang, Wenjun Zeng

We present a Siamese-like Dual-branch network based solely on Transformers for tracking.

Object Tracking

Attend to Who You Are: Supervising Self-Attention for Keypoint Detection and Instance-Aware Association

1 code implementation • 25 Nov 2021 • Sen Yang, Zhicheng Wang, Ze Chen, YanJie Li, Shoukui Zhang, Zhibin Quan, Shu-Tao Xia, Yiping Bao, Erjin Zhou, Wankou Yang

This paper presents a new method to solve keypoint detection and instance association by using Transformer.

Ranked #10 on Multi-Person Pose Estimation on MS COCO

Instance Segmentation Keypoint Detection +2

Video Based Fall Detection Using Human Poses

1 code implementation • 29 Jul 2021 • Ziwei Chen, Yiye Wang, Wankou Yang

Video-based fall detection accuracy has improved substantially thanks to recent progress in deep convolutional neural networks.

Action Recognition

SimCC: a Simple Coordinate Classification Perspective for Human Pose Estimation

3 code implementations • 7 Jul 2021 • YanJie Li, Sen Yang, Peidong Liu, Shoukui Zhang, Yunxiao Wang, Zhicheng Wang, Wankou Yang, Shu-Tao Xia

The 2D heatmap-based approaches have dominated Human Pose Estimation (HPE) for years due to high performance.

Classification Pose Estimation +1

TokenPose: Learning Keypoint Tokens for Human Pose Estimation

1 code implementation • ICCV 2021 • YanJie Li, Shoukui Zhang, Zhicheng Wang, Sen Yang, Wankou Yang, Shu-Tao Xia, Erjin Zhou

Most existing CNN-based methods do well in visual representation but lack the ability to explicitly learn the constraint relationships between keypoints.

Pose Estimation

SIENet: Spatial Information Enhancement Network for 3D Object Detection from Point Cloud

1 code implementation • 29 Mar 2021 • Ziyu Li, Yuncong Yao, Zhibin Quan, Wankou Yang, Jin Xie

Specifically, we design the Spatial Information Enhancement (SIE) module to predict the spatial shapes of the foreground points within proposals, and extract the structure information to learn the representative features for further box refinement.

3D Object Detection Autonomous Vehicles +3

Separable Batch Normalization for Robust Facial Landmark Localization with Cross-protocol Network Training

no code implementations • 17 Jan 2021 • Shuangping Jin, ZhenHua Feng, Wankou Yang, Josef Kittler

Different from the standard BN layer that uses all the training data to calculate a single set of parameters, SepBN considers that the samples of a training dataset may belong to different sub-domains.

Face Alignment

TransPose: Keypoint Localization via Transformer

1 code implementation • ICCV 2021 • Sen Yang, Zhibin Quan, Mu Nie, Wankou Yang

While CNN-based models have made remarkable progress on human pose estimation, what spatial dependencies they capture to localize keypoints remains unclear.

Ranked #3 on Pose Estimation on OCHuman (Validation AP metric)

Keypoint Detection Multi-Person Pose Estimation

Learning Spatio-Appearance Memory Network for High-Performance Visual Tracking

1 code implementation • 21 Sep 2020 • Fei Xie, Wankou Yang, Bo Liu, Kaihua Zhang, Wanli Xue, WangMeng Zuo

Existing visual object tracking usually learns a bounding-box based template to match the targets across frames, which cannot accurately learn a pixel-wise representation, thereby being limited in handling severe appearance variations.

Segmentation Semantic Segmentation +5

Unsupervised Eyeglasses Removal in the Wild

1 code implementation • 16 Sep 2019 • Bingwen Hu, Zhedong Zheng, Ping Liu, Wankou Yang, Mingwu Ren

Given two facial images with and without eyeglasses, the proposed model learns to swap the eye area in two faces.

Face Reconstruction Face Verification +3

Pose Neural Fabrics Search

2 code implementations • 16 Sep 2019 • Sen Yang, Wankou Yang, Zhen Cui

Neural Architecture Search (NAS) technologies have emerged in many domains to jointly learn the architectures and weights of the neural network.

Ranked #13 on Keypoint Detection on MS COCO

Image Classification Keypoint Detection +3

Inverse Visual Question Answering: A New Benchmark and VQA Diagnosis Tool

no code implementations • 16 Mar 2018 • Feng Liu, Tao Xiang, Timothy M. Hospedales, Wankou Yang, Changyin Sun

The iVQA task is to generate a question that corresponds to a given image and answer pair.

Question Answering Visual Question Answering

iVQA: Inverse Visual Question Answering

no code implementations • CVPR 2018 • Feng Liu, Tao Xiang, Timothy M. Hospedales, Wankou Yang, Changyin Sun

The iVQA task is to generate a question that corresponds to a given image and answer pair.

Question Answering Question Generation +2

Refining Image Categorization by Exploiting Web Images and General Corpus

no code implementations • 16 Mar 2017 • Yazhou Yao, Jian Zhang, Fumin Shen, Xian-Sheng Hua, Wankou Yang, Zhenmin Tang

To tackle these problems, in this work, we exploit general corpus information to automatically select and subsequently classify web images into semantically rich (sub-)categories.

Image Categorization

Semantic Regularisation for Recurrent Image Annotation

no code implementations • CVPR 2017 • Feng Liu, Tao Xiang, Timothy M. Hospedales, Wankou Yang, Changyin Sun

We propose a simple modification to the design pattern that makes learning more effective and efficient.

General Classification Image Captioning +1

Crowd Counting via Weighted VLAD on Dense Attribute Feature Maps

no code implementations • 29 Apr 2016 • Biyun Sheng, Chunhua Shen, Guosheng Lin, Jun Li, Wankou Yang, Changyin Sun

Crowd counting is an important task in computer vision, which has many applications in video surveillance.

Attribute Crowd Counting
