1 code implementation • 29 Mar 2024 • Yunhao Li, Xiaodong Wang, Ping Wang, Xin Yuan, Peidong Liu
Snapshot compressive imaging (SCI) is a cost-effective method that records high-dimensional data, such as hyperspectral or temporal information, in a single image using low-cost 2D imaging sensors.
1 code implementation • 29 Mar 2024 • Yunhao Li, Jing Wu, Lingzhe Zhao, Peidong Liu
Images captured through glass in rainy or snowy weather often contain waterdrops adhering to the glass surface, which significantly degrade image quality and the performance of many computer vision algorithms.
no code implementations • 8 Mar 2024 • Yunhao Li, Hao Wang, Xue Ma, Jiali Yao, Shaohua Dong, Heng Fan, Libo Zhang
Current multi-object tracking (MOT) aims to predict trajectories of targets (i.e., "where") in videos.
no code implementations • 7 Dec 2023 • Sijing Wu, Yunhao Li, Weitian Zhang, Jun Jia, Yucheng Zhu, Yichao Yan, Guangtao Zhai
Extensive comparative experiments with both SOTA 3D facial animation and 2D portrait animation methods demonstrate the necessity of singing-specific datasets in singing head animation tasks and the promising performance of our unified facial animation framework.
no code implementations • 15 Aug 2023 • Yunhao Li, Zhen Xiao, Lin Yang, Dan Meng, Xin Zhou, Heng Fan, Libo Zhang
To the best of our knowledge, AttMOT is the first MOT dataset with semantic attributes.
2 code implementations • 1 Jun 2023 • Wenjin Wang, Yunhao Li, Yixin Ou, Yin Zhang
Instead, in this paper, we find that instruction-tuned language models like Claude and ChatGPT can understand layout expressed through spaces and line breaks.
Ranked #8 on Visual Question Answering (VQA) on DocVQA test
no code implementations • CVPR 2023 • Sijing Wu, Yichao Yan, Yunhao Li, Yuhao Cheng, Wenhan Zhu, Ke Gao, Xiaobo Li, Guangtao Zhai
To bring digital avatars into people's lives, there is strong demand for efficiently generating complete, realistic, and animatable head avatars.
no code implementations • 10 Oct 2022 • Xiaoyu Huang, Zhongyu Li, Yanzhen Xiang, Yiming Ni, Yufeng Chi, Yunhao Li, Lizhi Yang, Xue Bin Peng, Koushil Sreenath
We present a reinforcement learning (RL) framework that enables quadrupedal robots to perform soccer goalkeeping tasks in the real world.
1 code implementation • 9 Oct 2022 • Yunhao Li, Zhenbo Yu, Yucheng Zhu, Bingbing Ni, Guangtao Zhai, Wei Shen
Stage I introduces a test-time adaptation strategy, which improves the physical plausibility of synthesized human skeleton motions by optimizing skeleton joint locations.
1 code implementation • Findings (ACL) 2021 • Yunhao Li, Yunyi Yang, Xiaojun Quan, Jianxing Yu
In this paper, we propose a retrieve-and-memorize framework to enhance the learning of system actions.
no code implementations • 31 Jan 2021 • Merlin A. Nau, Florian Schiffers, Yunhao Li, Bingjie Xu, Andreas Maier, Jack Tumblin, Marc Walton, Aggelos K. Katsaggelos, Florian Willomitzer, Oliver Cossairt
Computational photography is becoming increasingly essential in the medical field.
1 code implementation • ICCV 2021 • Yunhao Li, Wei Shen, Zhongpai Gao, Yucheng Zhu, Guangtao Zhai, Guodong Guo
Specifically, the local region is obtained as a 2D cone-shaped field along the 2D projection of the sight line starting at the human subject's head position, and the distant region is obtained by searching along the sight line in 3D sphere space.
1 code implementation • 7 Dec 2020 • Yunyi Yang, Yunhao Li, Xiaojun Quan
This paper presents UBAR, our task-oriented dialog system, which models task-oriented dialogs at the dialog-session level.