Search Results for author: Jiarui Xu

Found 26 papers, 11 papers with code

Learning at the Speed of Wireless: Online Real-Time Learning for AI-Enabled MIMO in NextG

no code implementations • 5 Mar 2024 • Jiarui Xu, Shashank Jere, Yifei Song, Yi-Hung Kao, Lizhong Zheng, Lingjia Liu

At the air interface, multiple-input multiple-output (MIMO) and its variants such as multi-user MIMO (MU-MIMO) and massive/full-dimension MIMO have been key enablers across successive generations of cellular networks with evolving complexity and design challenges.

Scheduling

Paper
Add Code

Pixel Aligned Language Models

no code implementations • 14 Dec 2023 • Jiarui Xu, Xingyi Zhou, Shen Yan, Xiuye Gu, Anurag Arnab, Chen Sun, Xiaolong Wang, Cordelia Schmid

When taking locations as inputs, the model performs location-conditioned captioning, which generates captions for the indicated object or region.

Language Modelling

Paper
Add Code

IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks

no code implementations • 4 Dec 2023 • Jiarui Xu, Yossi Gandelsman, Amir Bar, Jianwei Yang, Jianfeng Gao, Trevor Darrell, Xiaolong Wang

Given a textual description of a visual task (e. g. "Left: input image, Right: foreground segmentation"), a few input-output visual examples, or both, the model in-context learns to solve it for a new test input.

Colorization Foreground Segmentation +3

Paper
Add Code

2D-RC: Two-Dimensional Neural Network Approach for OTFS Symbol Detection

no code implementations • 14 Nov 2023 • Jiarui Xu, Karim Said, Lizhong Zheng, Lingjia Liu

Orthogonal time frequency space (OTFS) is a promising modulation scheme for wireless communication in high-mobility scenarios.

Paper
Add Code

Learning to Estimate: A Real-Time Online Learning Framework for MIMO-OFDM Channel Estimation

no code implementations • 22 May 2023 • Lianjun Li, Sai Sree Rayala, Jiarui Xu, Lizhong Zheng, Lingjia Liu

In this paper we introduce StructNet-CE, a novel real-time online learning framework for MIMO-OFDM channel estimation, which only utilizes over-the-air (OTA) pilot symbols for online training and converges within one OFDM subframe.

Binary Classification

Paper
Add Code

Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models

1 code implementation • CVPR 2023 • Jiarui Xu, Sifei Liu, Arash Vahdat, Wonmin Byeon, Xiaolong Wang, Shalini De Mello

Our approach outperforms the previous state of the art by significant margins on both open-vocabulary panoptic and semantic segmentation tasks.

Ranked #2 on Open-World Instance Segmentation on UVO (using extra training data)

Open Vocabulary Panoptic Segmentation Open Vocabulary Semantic Segmentation +4

804

Paper
Code

GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group Propagation

2 code implementations • 13 Dec 2022 • Chenhongyi Yang, Jiarui Xu, Shalini De Mello, Elliot J. Crowley, Xiaolong Wang

In each GP Block, features are first grouped together by a fixed number of learnable group tokens; we then perform Group Propagation where global information is exchanged between the grouped features; finally, global information in the updated grouped features is returned back to the image features through a transformer decoder.

Decoder Image Classification +6

561

Paper
Code

Detect to Learn: Structure Learning with Attention and Decision Feedback for MIMO-OFDM Receive Processing

no code implementations • 17 Aug 2022 • Jiarui Xu, Lianjun Li, Lizhong Zheng, Lingjia Liu

The DF mechanism further enhances detection performance by dynamically tracking the channel changes through detected data symbols.

Paper
Add Code

Learning Implicit Feature Alignment Function for Semantic Segmentation

1 code implementation • 17 Jun 2022 • Hanzhe Hu, Yinbo Chen, Jiarui Xu, Shubhankar Borse, Hong Cai, Fatih Porikli, Xiaolong Wang

As such, IFA implicitly aligns the feature maps at different levels and is capable of producing segmentation maps in arbitrary resolutions.

Segmentation Semantic Segmentation

Paper
Code

GroupViT: Semantic Segmentation Emerges from Text Supervision

2 code implementations • CVPR 2022 • Jiarui Xu, Shalini De Mello, Sifei Liu, Wonmin Byeon, Thomas Breuel, Jan Kautz, Xiaolong Wang

With only text supervision and without any pixel-level annotations, GroupViT learns to group together semantic regions and successfully transfers to the task of semantic segmentation in a zero-shot manner, i. e., without any further fine-tuning.

Ranked #3 on Unsupervised Semantic Segmentation with Language-image Pre-training on PascalVOC-20

Object Detection Scene Understanding +3

125,940

Paper
Code

RC-Struct: A Structure-based Neural Network Approach for MIMO-OFDM Detection

no code implementations • 3 Oct 2021 • Jiarui Xu, Zhou Zhou, Lianjun Li, Lizhong Zheng, Lingjia Liu

The binary classifier enables the efficient utilization of the precious online training symbols and allows an easy extension to high-order modulations without a substantial increase in complexity.

Paper
Add Code

Learning to Equalize OTFS

no code implementations • 17 Jul 2021 • Zhou Zhou, Lingjia Liu, Jiarui Xu, Robert Calderbank

Orthogonal Time Frequency Space (OTFS) is a novel framework that processes modulation symbols via a time-independent channel characterized by the delay-Doppler domain.

Scheduling

Paper
Add Code

Semi-Supervised 3D Hand-Object Poses Estimation with Interactions in Time

no code implementations • CVPR 2021 • Shaowei Liu, Hanwen Jiang, Jiarui Xu, Sifei Liu, Xiaolong Wang

Estimating 3D hand and object pose from a single image is an extremely challenging problem: hands and objects are often self-occluded during interactions, and the 3D annotations are scarce as even humans cannot directly label the ground-truths from a single image perfectly.

Ranked #7 on hand-object pose on HO-3D

hand-object pose Object

Paper
Add Code

Rethinking Self-supervised Correspondence Learning: A Video Frame-level Similarity Perspective

5 code implementations • ICCV 2021 • Jiarui Xu, Xiaolong Wang

To learn generalizable representation for correspondence in large-scale, a variety of self-supervised pretext tasks are proposed to explicitly perform object-level or patch-level similarity learning.

Contrastive Learning Object +5

3,926

Paper
Code

Harnessing Tensor Structures -- Multi-Mode Reservoir Computing and Its Application in Massive MIMO

no code implementations • 25 Jan 2021 • Zhou Zhou, Lingjia Liu, Jiarui Xu

In this paper, we introduce a new neural network (NN) structure, multi-mode reservoir computing (Multi-Mode RC).

Paper
Add Code

P4Neighbor: Efficient Link Failure Recovery With Programmable Switches

no code implementations • IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT 2021 • Jiarui Xu, Sihao Xie, and Jin Zhao

In this article, we analyze why implementing traditional proactive failure recovery mechanism introduces huge switch storage overhead, and discuss the flexibility and limitations of the programmable data plane.

Paper
Add Code

Estimation of Number of Communities in Assortative Sparse Networks

no code implementations • 1 Jan 2021 • Neil Hwang, Jiarui Xu, Shirshendu Chatterjee, Sharmodeep Bhattacharyya

Most community detection algorithms assume the number of communities, K, to be known a priori.

Community Detection Computational Efficiency

Paper
Add Code

Global Context Networks

3 code implementations • 24 Dec 2020 • Yue Cao, Jiarui Xu, Stephen Lin, Fangyun Wei, Han Hu

The Non-Local Network (NLNet) presents a pioneering approach for capturing long-range dependencies within an image, via aggregating query-specific global context to each query position.

Ranked #40 on Instance Segmentation on COCO test-dev

Instance Segmentation Object Detection

29,949

Paper
Code

DRG: Dual Relation Graph for Human-Object Interaction Detection

1 code implementation • ECCV 2020 • Chen Gao, Jiarui Xu, Yuliang Zou, Jia-Bin Huang

We tackle the challenging problem of human-object interaction (HOI) detection.

Ranked #26 on Human-Object Interaction Detection on V-COCO

Human-Object Interaction Detection Object +1

Paper
Code

Fast Video Object Segmentation With Temporal Aggregation Network and Dynamic Template Matching

no code implementations • CVPR 2020 • Xuhua Huang, Jiarui Xu, Yu-Wing Tai, Chi-Keung Tang

Significant progress has been made in Video Object Segmentation (VOS), the video object tracking task in its finest level.

Ranked #71 on Semi-Supervised Video Object Segmentation on DAVIS 2016

Object One-Shot Learning +6

Paper
Add Code

Learning to Group: A Bottom-Up Framework for 3D Part Discovery in Unseen Categories

1 code implementation • ICLR 2020 • Tiange Luo, Kaichun Mo, Zhiao Huang, Jiarui Xu, Siyu Hu, Li-Wei Wang, Hao Su

We address the problem of discovering 3D parts for objects in unseen categories.

Clustering Segmentation

Paper
Code

MMDetection: Open MMLab Detection Toolbox and Benchmark

144 code implementations • 17 Jun 2019 • Kai Chen, Jiaqi Wang, Jiangmiao Pang, Yuhang Cao, Yu Xiong, Xiaoxiao Li, Shuyang Sun, Wansen Feng, Ziwei Liu, Jiarui Xu, Zheng Zhang, Dazhi Cheng, Chenchen Zhu, Tianheng Cheng, Qijie Zhao, Buyu Li, Xin Lu, Rui Zhu, Yue Wu, Jifeng Dai, Jingdong Wang, Jianping Shi, Wanli Ouyang, Chen Change Loy, Dahua Lin

In this paper, we introduce the various features of this toolbox.

Benchmarking Instance Segmentation +2

27,966

Paper
Code

GCNet: Non-local Networks Meet Squeeze-Excitation Networks and Beyond

9 code implementations • 25 Apr 2019 • Yue Cao, Jiarui Xu, Stephen Lin, Fangyun Wei, Han Hu

In this paper, we take advantage of this finding to create a simplified network based on a query-independent formulation, which maintains the accuracy of NLNet but with significantly less computation.

Ranked #25 on Object Detection on COCO-O

Instance Segmentation Object Detection +1

27,966

Paper
Code

Spatial-Temporal Relation Networks for Multi-Object Tracking

no code implementations • ICCV 2019 • Jiarui Xu, Yue Cao, Zheng Zhang, Han Hu

Recent progress in multiple object tracking (MOT) has shown that a robust similarity score is key to the success of trackers.

Multi-Object Tracking Multiple Object Tracking +2

Paper
Add Code

Deep High Dynamic Range Imaging with Large Foreground Motions

1 code implementation • ECCV 2018 • Shangzhe Wu, Jiarui Xu, Yu-Wing Tai, Chi-Keung Tang

In state-of-the-art deep HDR imaging, input images are first aligned using optical flows before merging, which are still error-prone due to occlusion and large motions.

Translation Vocal Bursts Intensity Prediction

180

Paper
Code

STCP: Simplified-Traditional Chinese Conversion and Proofreading

no code implementations • IJCNLP 2017 • Jiarui Xu, Xuezhe Ma, Chen-Tse Tsai, Eduard Hovy

This paper aims to provide an effective tool for conversion between Simplified Chinese and Traditional Chinese.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.