Search Results for author: Guangliang Cheng

Found 36 papers, 25 papers with code

OV-VG: A Benchmark for Open-Vocabulary Visual Grounding

1 code implementation • 22 Oct 2023 • Chunlei Wang, Wenquan Feng, Xiangtai Li, Guangliang Cheng, Shuchang Lyu, Binghao Liu, Lijiang Chen, Qi Zhao

While current foundational models excel at various visual language tasks, there's a noticeable absence of models specifically tailored for open-vocabulary visual grounding.

Novel Concepts object-detection +2

Paper
Code

Iterative Robust Visual Grounding with Masked Reference based Centerpoint Supervision

1 code implementation • 23 Jul 2023 • Menghao Li, Chunlei Wang, Wenquan Feng, Shuchang Lyu, Guangliang Cheng, Xiangtai Li, Binghao Liu, Qi Zhao

The proposed framework is evaluated on five regular VG datasets and two newly constructed robust VG datasets.

Decoder Visual Grounding

Paper
Code

Change Detection Methods for Remote Sensing in the Last Decade: A Comprehensive Review

no code implementations • 9 May 2023 • Guangliang Cheng, Yunmeng Huang, Xiangtai Li, Shuchang Lyu, Zhaoyang Xu, Qi Zhao, Shiming Xiang

We first introduce some preliminary knowledge for the change detection task, such as problem definition, datasets, evaluation metrics, and transformer basics, as well as provide a detailed taxonomy of existing algorithms from three different perspectives: algorithm granularity, supervision modes, and learning frameworks in the methodology section.

Change Detection Change detection for remote sensing images

Paper
Add Code

Transformer-Based Visual Segmentation: A Survey

2 code implementations • 19 Apr 2023 • Xiangtai Li, Henghui Ding, Haobo Yuan, Wenwei Zhang, Jiangmiao Pang, Guangliang Cheng, Kai Chen, Ziwei Liu, Chen Change Loy

Recently, transformers, a type of neural network based on self-attention originally designed for natural language processing, have considerably surpassed previous convolutional or recurrent approaches in various vision processing tasks.

Autonomous Driving Point Cloud Segmentation +1

587

Paper
Code

Tube-Link: A Flexible Cross Tube Framework for Universal Video Segmentation

2 code implementations • ICCV 2023 • Xiangtai Li, Haobo Yuan, Wenwei Zhang, Guangliang Cheng, Jiangmiao Pang, Chen Change Loy

Our framework is a near-online approach that takes a short subclip as input and outputs the corresponding spatial-temporal tube masks.

Ranked #3 on Video Semantic Segmentation on VSPW

Contrastive Learning Segmentation +4

106

Paper
Code

Local-to-Global Information Communication for Real-Time Semantic Segmentation Network Search

no code implementations • 16 Feb 2023 • Guangliang Cheng, Peng Sun, Ting-Bing Xu, Shuchang Lyu, Peiwen Lin

For local information exchange, a graph convolutional network (GCN) guided module is seamlessly integrated as a communication deliver between cells.

Neural Architecture Search Real-Time Semantic Segmentation

Paper
Add Code

PanopticPartFormer++: A Unified and Decoupled View for Panoptic Part Segmentation

1 code implementation • 3 Jan 2023 • Xiangtai Li, Shilin Xu, Yibo Yang, Haobo Yuan, Guangliang Cheng, Yunhai Tong, Zhouchen Lin, Ming-Hsuan Yang, DaCheng Tao

Third, inspired by Mask2Former, based on our meta-architecture, we propose Panoptic-PartFormer++ and design a new part-whole cross-attention scheme to boost part segmentation qualities further.

Panoptic Segmentation Segmentation

Paper
Code

Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation

2 code implementations • ICCV 2023 • Jianzong Wu, Xiangtai Li, Henghui Ding, Xia Li, Guangliang Cheng, Yunhai Tong, Chen Change Loy

Experiments on the COCO dataset with two settings: Open Vocabulary Instance Segmentation (OVIS) and Open Set Panoptic Segmentation (OSPS) demonstrate the superiority of the CGG.

Caption Generation Instance Segmentation +2

Paper
Code

MMOTU: A Multi-Modality Ovarian Tumor Ultrasound Image Dataset for Unsupervised Cross-Domain Semantic Segmentation

1 code implementation • 14 Jul 2022 • Qi Zhao, Shuchang Lyu, Wenpei Bai, Linghan Cai, Binghao Liu, Guangliang Cheng, Meijing Wu, Xiubo Sang, Min Yang, Lijiang Chen

To solve this problem, we propose a Multi-Modality Ovarian Tumor Ultrasound (MMOTU) image dataset containing 1469 2d ultrasound images and 170 contrast enhanced ultrasonography (CEUS) images with pixel-wise and global-wise annotations.

Decoder Domain Adaptation +2

Paper
Code

SFNet: Faster, Accurate, and Domain Agnostic Semantic Segmentation via Semantic Flow

1 code implementation • 10 Jul 2022 • Xiangtai Li, Jiangning Zhang, Yibo Yang, Guangliang Cheng, Kuiyuan Yang, Yunhai Tong, DaCheng Tao

In this paper, we focus on exploring effective methods for faster, accurate, and domain agnostic semantic segmentation.

Real-Time Semantic Segmentation

354

Paper
Code

Multi-level Domain Adaptation for Lane Detection

no code implementations • 21 Jun 2022 • Chenguang Li, Boheng Zhang, Jia Shi, Guangliang Cheng

We focus on bridging domain discrepancy in lane detection among different scenarios to greatly reduce extra annotation and re-training costs for autonomous driving.

Autonomous Driving Domain Adaptation +2

Paper
Add Code

Reconstruct from BEV: A 3D Lane Detection Approach based on Geometry Structure Prior

no code implementations • 21 Jun 2022 • Chenguang Li, Jia Shi, Ya Wang, Guangliang Cheng

Inspired by previous methods, we first analyze the geometry heuristic between the 3D lane and its 2D representation on the ground and propose to impose explicit supervision based on the structure prior, which makes it achievable to build inter-lane and intra-lane relationships to facilitate the reconstruction of 3D lanes from local to global.

3D Lane Detection Data Augmentation

Paper
Add Code

Reconstruct from Top View: A 3D Lane Detection Approach based on Geometry Structure Prior

no code implementations • CVPR 2022 • Chenguang Li, Jia Shi, Ya Wang, Guangliang Cheng

Ranked #6 on 3D Lane Detection on Apollo Synthetic 3D Lane

3D Lane Detection Data Augmentation

Paper
Add Code

Fashionformer: A simple, Effective and Unified Baseline for Human Fashion Segmentation and Recognition

1 code implementation • 10 Apr 2022 • Shilin Xu, Xiangtai Li, Jingbo Wang, Guangliang Cheng, Yunhai Tong, DaCheng Tao

This focus on joint human fashion segmentation and attribute recognition.

Attribute Fashion Understanding +1

Paper
Code

Video K-Net: A Simple, Strong, and Unified Baseline for Video Segmentation

1 code implementation • CVPR 2022 • Xiangtai Li, Wenwei Zhang, Jiangmiao Pang, Kai Chen, Guangliang Cheng, Yunhai Tong, Chen Change Loy

We hope this simple, yet effective method can serve as a new, flexible baseline in unified video segmentation design.

Ranked #1 on Video Panoptic Segmentation on KITTI-STEP (using extra training data)

Image Segmentation Instance Segmentation +5

150

Paper
Code

Panoptic-PartFormer: Learning a Unified Model for Panoptic Part Segmentation

1 code implementation • 10 Apr 2022 • Xiangtai Li, Shilin Xu, Yibo Yang, Guangliang Cheng, Yunhai Tong, DaCheng Tao

To the best of our knowledge, we are the first to solve the PPS problem via \textit{a unified and end-to-end transformer model.

Ranked #2 on Part-aware Panoptic Segmentation on Pascal Panoptic Parts

Panoptic Segmentation Part-aware Panoptic Segmentation +1

Paper
Code

TransVOD: End-to-End Video Object Detection with Spatial-Temporal Transformers

3 code implementations • 13 Jan 2022 • Qianyu Zhou, Xiangtai Li, Lu He, Yibo Yang, Guangliang Cheng, Yunhai Tong, Lizhuang Ma, DaCheng Tao

Detection Transformer (DETR) and Deformable DETR have been proposed to eliminate the need for many hand-designed components in object detection while demonstrating good performance as previous complex hand-crafted detectors.

Ranked #4 on Video Object Detection on ImageNet VID (using extra training data)

Object object-detection +2

198

Paper
Code

PolyphonicFormer: Unified Query Learning for Depth-aware Video Panoptic Segmentation

1 code implementation • 5 Dec 2021 • Haobo Yuan, Xiangtai Li, Yibo Yang, Guangliang Cheng, Jing Zhang, Yunhai Tong, Lefei Zhang, DaCheng Tao

The Depth-aware Video Panoptic Segmentation (DVPS) is a new challenging vision problem that aims to predict panoptic segmentation and depth in a video simultaneously.

Ranked #1 on Depth-aware Video Panoptic Segmentation on SemKITTI-DVPS

Depth-aware Video Panoptic Segmentation Depth Estimation +4

Paper
Code

PIT: Position-Invariant Transform for Cross-FoV Domain Adaptation

1 code implementation • ICCV 2021 • Qiqi Gu, Qianyu Zhou, Minghao Xu, Zhengyang Feng, Guangliang Cheng, Xuequan Lu, Jianping Shi, Lizhuang Ma

Extensive experiments demonstrate that our method can soundly boost the performance on both cross-domain object detection and segmentation for state-of-the-art techniques.

Domain Adaptation object-detection +4

Paper
Code

Context-Aware Mixup for Domain Adaptive Semantic Segmentation

1 code implementation • 8 Aug 2021 • Qianyu Zhou, Zhengyang Feng, Qiqi Gu, Jiangmiao Pang, Guangliang Cheng, Xuequan Lu, Jianping Shi, Lizhuang Ma

The generated contextual mask is critical in this work and will guide the context-aware domain mixup on three different levels.

Ranked #5 on Image-to-Image Translation on SYNTHIA-to-Cityscapes

Semantic Segmentation Synthetic-to-Real Translation +1

Paper
Code

Improving Video Instance Segmentation via Temporal Pyramid Routing

1 code implementation • 28 Jul 2021 • Xiangtai Li, Hao He, Yibo Yang, Henghui Ding, Kuiyuan Yang, Guangliang Cheng, Yunhai Tong, DaCheng Tao

To incorporate both temporal and scale information, we propose a Temporal Pyramid Routing (TPR) strategy to conditionally align and conduct pixel-level aggregation from a feature pyramid pair of two adjacent frames.

Instance Segmentation Panoptic Segmentation +2

Paper
Code

Global Aggregation then Local Distribution for Scene Parsing

1 code implementation • 28 Jul 2021 • Xiangtai Li, Li Zhang, Guangliang Cheng, Kuiyuan Yang, Yunhai Tong, Xiatian Zhu, Tao Xiang

Modelling long-range contextual relationships is critical for pixel-wise prediction tasks such as semantic segmentation.

Scene Parsing Segmentation +1

344

Paper
Code

BoundarySqueeze: Image Segmentation as Boundary Squeezing

1 code implementation • 25 May 2021 • Hao He, Xiangtai Li, Yibo Yang, Guangliang Cheng, Yunhai Tong, Lubin Weng, Zhouchen Lin, Shiming Xiang

This module is used to squeeze the object boundary from both inner and outer directions, which contributes to precise mask representation.

Image Segmentation Instance Segmentation +2

Paper
Code

End-to-End Video Object Detection with Spatial-Temporal Transformers

1 code implementation • 23 May 2021 • Lu He, Qianyu Zhou, Xiangtai Li, Li Niu, Guangliang Cheng, Xiao Li, Wenxuan Liu, Yunhai Tong, Lizhuang Ma, Liqing Zhang

Recently, DETR and Deformable DETR have been proposed to eliminate the need for many hand-designed components in object detection while demonstrating good performance as previous complex hand-crafted detectors.

Object object-detection +2

198

Paper
Code

Enhanced Boundary Learning for Glass-like Object Segmentation

1 code implementation • ICCV 2021 • Hao He, Xiangtai Li, Guangliang Cheng, Jianping Shi, Yunhai Tong, Gaofeng Meng, Véronique Prinet, Lubin Weng

We use these two modules to design a decoder that generates accurate and clean segmentation results, especially on the object contours.

Ranked #20 on Thermal Image Segmentation on RGB-T-Glass-Segmentation

Decoder Object +4

Paper
Code

PointFlow: Flowing Semantics Through Points for Aerial Image Segmentation

1 code implementation • CVPR 2021 • Xiangtai Li, Hao He, Xia Li, Duo Li, Guangliang Cheng, Jianping Shi, Lubin Weng, Yunhai Tong, Zhouchen Lin

Experimental results on three different aerial segmentation datasets suggest that the proposed method is more effective and efficient than state-of-the-art general semantic segmentation methods.

Image Segmentation Segmentation +1

121

Paper
Code

Embedded Knowledge Distillation in Depth-Level Dynamic Neural Network

no code implementations • 1 Mar 2021 • Qi Zhao, Shuchang Lyu, Zhiwei Zhang, Ting-Bing Xu, Guangliang Cheng

In real applications, different computation-resource devices need different-depth networks (e. g., ResNet-18/34/50) with high-accuracy.

Knowledge Distillation Transfer Learning

Paper
Add Code

Towards Efficient Scene Understanding via Squeeze Reasoning

1 code implementation • 6 Nov 2020 • Xiangtai Li, Xia Li, Ansheng You, Li Zhang, Guangliang Cheng, Kuiyuan Yang, Yunhai Tong, Zhouchen Lin

Instead of propagating information on the spatial map, we first learn to squeeze the input feature into a channel-wise global vector and perform reasoning within the single vector where the computation cost can be significantly reduced.

Instance Segmentation object-detection +4

354

Paper
Code

Search What You Want: Barrier Panelty NAS for Mixed Precision Quantization

no code implementations • ECCV 2020 • Haibao Yu, Qi Han, Jianbo Li, Jianping Shi, Guangliang Cheng, Bin Fan

Learning to find an optimal mixed precision model that can preserve accuracy and satisfy the specific constraints on model size and computation is extremely challenge due to the difficult in training a mixed precision model and the huge space of all possible bit quantizations.

Quantization valid

Paper
Add Code

Improving Semantic Segmentation via Decoupled Body and Edge Supervision

2 code implementations • ECCV 2020 • Xiangtai Li, Xia Li, Li Zhang, Guangliang Cheng, Jianping Shi, Zhouchen Lin, Shaohua Tan, Yunhai Tong

Our insight is that appealing performance of semantic segmentation requires \textit{explicitly} modeling the object \textit{body} and \textit{edge}, which correspond to the high and low frequency of the image.

Object Segmentation +1

8,299

Paper
Code

Uncertainty-Aware Consistency Regularization for Cross-Domain Semantic Segmentation

no code implementations • 19 Apr 2020 • Qianyu Zhou, Zhengyang Feng, Qiqi Gu, Guangliang Cheng, Xuequan Lu, Jianping Shi, Lizhuang Ma

Guided by this mask, we propose a ClassOut strategy to realize effective regional consistency in a fine-grained manner.

Semantic Segmentation Unsupervised Domain Adaptation

Paper
Add Code

DMT: Dynamic Mutual Training for Semi-Supervised Learning

1 code implementation • 18 Apr 2020 • Zhengyang Feng, Qianyu Zhou, Qiqi Gu, Xin Tan, Guangliang Cheng, Xuequan Lu, Jianping Shi, Lizhuang Ma

Instead, leveraging inter-model disagreement between different models is a key to locate pseudo label errors.

Ranked #3 on Semi-Supervised Semantic Segmentation on Pascal VOC 2012 1% labeled

Pseudo Label Semi-Supervised Image Classification +1

134

Paper
Code

Graph-guided Architecture Search for Real-time Semantic Segmentation

1 code implementation • CVPR 2020 • Peiwen Lin, Peng Sun, Guangliang Cheng, Sirui Xie, Xi Li, Jianping Shi

Unlike previous works that use a simplified search space and stack a repeatable cell to form a network, we introduce a novel search mechanism with new search space where a lightweight model can be effectively explored through the cell-level diversity and latencyoriented constraint.

Real-Time Semantic Segmentation

Paper
Code

GDRQ: Group-based Distribution Reshaping for Quantization

no code implementations • 5 Aug 2019 • Haibao Yu, Tuopu Wen, Guangliang Cheng, Jiankai Sun, Qi Han, Jianping Shi

Low-bit quantization is challenging to maintain high performance with limited model capacity (e. g., 4-bit for both weights and activations).

Quantization

Paper
Add Code

OVSNet : Towards One-Pass Real-Time Video Object Segmentation

no code implementations • 24 May 2019 • Peng Sun, Peiwen Lin, Guangliang Cheng, Jianping Shi, Jiawan Zhang, Xi Li

Video object segmentation aims at accurately segmenting the target object regions across consecutive frames.

Object object-detection +6

Paper
Add Code

Accurate Urban Road Centerline Extraction from VHR Imagery via Multiscale Segmentation and Tensor Voting

no code implementations • 25 Aug 2015 • Guangliang Cheng, Feiyun Zhu, Shiming Xiang, Chunhong Pan

Finally, to overcome the ineffectiveness of current methods in the road intersection, a fitting based road centerline connection algorithm is proposed.

Road Segmentation

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.