no code implementations • 5 Mar 2024 • Jiarui Xu, Shashank Jere, Yifei Song, Yi-Hung Kao, Lizhong Zheng, Lingjia Liu
At the air interface, multiple-input multiple-output (MIMO) and its variants such as multi-user MIMO (MU-MIMO) and massive/full-dimension MIMO have been key enablers across successive generations of cellular networks with evolving complexity and design challenges.
no code implementations • 14 Dec 2023 • Jiarui Xu, Xingyi Zhou, Shen Yan, Xiuye Gu, Anurag Arnab, Chen Sun, Xiaolong Wang, Cordelia Schmid
When taking locations as inputs, the model performs location-conditioned captioning, which generates captions for the indicated object or region.
no code implementations • 4 Dec 2023 • Jiarui Xu, Yossi Gandelsman, Amir Bar, Jianwei Yang, Jianfeng Gao, Trevor Darrell, Xiaolong Wang
Given a textual description of a visual task (e. g. "Left: input image, Right: foreground segmentation"), a few input-output visual examples, or both, the model in-context learns to solve it for a new test input.
no code implementations • 14 Nov 2023 • Jiarui Xu, Karim Said, Lizhong Zheng, Lingjia Liu
Orthogonal time frequency space (OTFS) is a promising modulation scheme for wireless communication in high-mobility scenarios.
no code implementations • 22 May 2023 • Lianjun Li, Sai Sree Rayala, Jiarui Xu, Lizhong Zheng, Lingjia Liu
In this paper we introduce StructNet-CE, a novel real-time online learning framework for MIMO-OFDM channel estimation, which only utilizes over-the-air (OTA) pilot symbols for online training and converges within one OFDM subframe.
1 code implementation • CVPR 2023 • Jiarui Xu, Sifei Liu, Arash Vahdat, Wonmin Byeon, Xiaolong Wang, Shalini De Mello
Our approach outperforms the previous state of the art by significant margins on both open-vocabulary panoptic and semantic segmentation tasks.
Ranked #2 on Open-World Instance Segmentation on UVO (using extra training data)
Open Vocabulary Panoptic Segmentation Open Vocabulary Semantic Segmentation +4
2 code implementations • 13 Dec 2022 • Chenhongyi Yang, Jiarui Xu, Shalini De Mello, Elliot J. Crowley, Xiaolong Wang
In each GP Block, features are first grouped together by a fixed number of learnable group tokens; we then perform Group Propagation where global information is exchanged between the grouped features; finally, global information in the updated grouped features is returned back to the image features through a transformer decoder.
no code implementations • 17 Aug 2022 • Jiarui Xu, Lianjun Li, Lizhong Zheng, Lingjia Liu
The DF mechanism further enhances detection performance by dynamically tracking the channel changes through detected data symbols.
1 code implementation • 17 Jun 2022 • Hanzhe Hu, Yinbo Chen, Jiarui Xu, Shubhankar Borse, Hong Cai, Fatih Porikli, Xiaolong Wang
As such, IFA implicitly aligns the feature maps at different levels and is capable of producing segmentation maps in arbitrary resolutions.
2 code implementations • CVPR 2022 • Jiarui Xu, Shalini De Mello, Sifei Liu, Wonmin Byeon, Thomas Breuel, Jan Kautz, Xiaolong Wang
With only text supervision and without any pixel-level annotations, GroupViT learns to group together semantic regions and successfully transfers to the task of semantic segmentation in a zero-shot manner, i. e., without any further fine-tuning.
no code implementations • 3 Oct 2021 • Jiarui Xu, Zhou Zhou, Lianjun Li, Lizhong Zheng, Lingjia Liu
The binary classifier enables the efficient utilization of the precious online training symbols and allows an easy extension to high-order modulations without a substantial increase in complexity.
no code implementations • 17 Jul 2021 • Zhou Zhou, Lingjia Liu, Jiarui Xu, Robert Calderbank
Orthogonal Time Frequency Space (OTFS) is a novel framework that processes modulation symbols via a time-independent channel characterized by the delay-Doppler domain.
no code implementations • CVPR 2021 • Shaowei Liu, Hanwen Jiang, Jiarui Xu, Sifei Liu, Xiaolong Wang
Estimating 3D hand and object pose from a single image is an extremely challenging problem: hands and objects are often self-occluded during interactions, and the 3D annotations are scarce as even humans cannot directly label the ground-truths from a single image perfectly.
Ranked #7 on hand-object pose on HO-3D
5 code implementations • ICCV 2021 • Jiarui Xu, Xiaolong Wang
To learn generalizable representation for correspondence in large-scale, a variety of self-supervised pretext tasks are proposed to explicitly perform object-level or patch-level similarity learning.
no code implementations • 25 Jan 2021 • Zhou Zhou, Lingjia Liu, Jiarui Xu
In this paper, we introduce a new neural network (NN) structure, multi-mode reservoir computing (Multi-Mode RC).
no code implementations • IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT 2021 • Jiarui Xu, Sihao Xie, and Jin Zhao
In this article, we analyze why implementing traditional proactive failure recovery mechanism introduces huge switch storage overhead, and discuss the flexibility and limitations of the programmable data plane.
no code implementations • 1 Jan 2021 • Neil Hwang, Jiarui Xu, Shirshendu Chatterjee, Sharmodeep Bhattacharyya
Most community detection algorithms assume the number of communities, K, to be known a priori.
3 code implementations • 24 Dec 2020 • Yue Cao, Jiarui Xu, Stephen Lin, Fangyun Wei, Han Hu
The Non-Local Network (NLNet) presents a pioneering approach for capturing long-range dependencies within an image, via aggregating query-specific global context to each query position.
Ranked #40 on Instance Segmentation on COCO test-dev
1 code implementation • ECCV 2020 • Chen Gao, Jiarui Xu, Yuliang Zou, Jia-Bin Huang
We tackle the challenging problem of human-object interaction (HOI) detection.
Ranked #26 on Human-Object Interaction Detection on V-COCO
no code implementations • CVPR 2020 • Xuhua Huang, Jiarui Xu, Yu-Wing Tai, Chi-Keung Tang
Significant progress has been made in Video Object Segmentation (VOS), the video object tracking task in its finest level.
Ranked #71 on Semi-Supervised Video Object Segmentation on DAVIS 2016
1 code implementation • ICLR 2020 • Tiange Luo, Kaichun Mo, Zhiao Huang, Jiarui Xu, Siyu Hu, Li-Wei Wang, Hao Su
We address the problem of discovering 3D parts for objects in unseen categories.
144 code implementations • 17 Jun 2019 • Kai Chen, Jiaqi Wang, Jiangmiao Pang, Yuhang Cao, Yu Xiong, Xiaoxiao Li, Shuyang Sun, Wansen Feng, Ziwei Liu, Jiarui Xu, Zheng Zhang, Dazhi Cheng, Chenchen Zhu, Tianheng Cheng, Qijie Zhao, Buyu Li, Xin Lu, Rui Zhu, Yue Wu, Jifeng Dai, Jingdong Wang, Jianping Shi, Wanli Ouyang, Chen Change Loy, Dahua Lin
In this paper, we introduce the various features of this toolbox.
9 code implementations • 25 Apr 2019 • Yue Cao, Jiarui Xu, Stephen Lin, Fangyun Wei, Han Hu
In this paper, we take advantage of this finding to create a simplified network based on a query-independent formulation, which maintains the accuracy of NLNet but with significantly less computation.
Ranked #25 on Object Detection on COCO-O
no code implementations • ICCV 2019 • Jiarui Xu, Yue Cao, Zheng Zhang, Han Hu
Recent progress in multiple object tracking (MOT) has shown that a robust similarity score is key to the success of trackers.
1 code implementation • ECCV 2018 • Shangzhe Wu, Jiarui Xu, Yu-Wing Tai, Chi-Keung Tang
In state-of-the-art deep HDR imaging, input images are first aligned using optical flows before merging, which are still error-prone due to occlusion and large motions.
no code implementations • IJCNLP 2017 • Jiarui Xu, Xuezhe Ma, Chen-Tse Tsai, Eduard Hovy
This paper aims to provide an effective tool for conversion between Simplified Chinese and Traditional Chinese.