Search Results for author: Yunhao Li

Found 13 papers, 7 papers with code

SCINeRF: Neural Radiance Fields from a Snapshot Compressive Image

1 code implementation • 29 Mar 2024 • Yunhao Li, Xiaodong Wang, Ping Wang, Xin Yuan, Peidong Liu

SCI is a cost-effective method that enables the recording of high-dimensional data, such as hyperspectral or temporal information, into a single image using low-cost 2D imaging sensors.

Image Generation Image Reconstruction

Paper
Code

DerainNeRF: 3D Scene Estimation with Adhesive Waterdrop Removal

1 code implementation • 29 Mar 2024 • Yunhao Li, Jing Wu, Lingzhe Zhao, Peidong Liu

When capturing images through the glass during rainy or snowy weather conditions, the resulting images often contain waterdrops adhered on the glass surface, and these waterdrops significantly degrade the image quality and performance of many computer vision algorithms.

Paper
Code

Beyond MOT: Semantic Multi-Object Tracking

no code implementations • 8 Mar 2024 • Yunhao Li, Hao Wang, Xue Ma, Jiali Yao, Shaohua Dong, Heng Fan, Libo Zhang

Current multi-object tracking (MOT) aims to predict trajectories of targets (i. e.,"where") in videos.

Multi-Object Tracking Object +1

Paper
Add Code

SingingHead: A Large-scale 4D Dataset for Singing Head Animation

no code implementations • 7 Dec 2023 • Sijing Wu, Yunhao Li, Weitian Zhang, Jun Jia, Yucheng Zhu, Yichao Yan, Guangtao Zhai

Extensive comparative experiments with both SOTA 3D facial animation and 2D portrait animation methods demonstrate the necessity of singing-specific datasets in singing head animation tasks and the promising performance of our unified facial animation framework.

Paper
Add Code

AttMOT: Improving Multiple-Object Tracking by Introducing Auxiliary Pedestrian Attributes

no code implementations • 15 Aug 2023 • Yunhao Li, Zhen Xiao, Lin Yang, Dan Meng, Xin Zhou, Heng Fan, Libo Zhang

To the best of our knowledge, AttMOT is the first MOT dataset with semantic attributes.

Attribute Multi-Object Tracking +1

Paper
Add Code

Layout and Task Aware Instruction Prompt for Zero-shot Document Image Question Answering

2 code implementations • 1 Jun 2023 • Wenjin Wang, Yunhao Li, Yixin Ou, Yin Zhang

Instead, in this paper, we find that instruction-tuning language models like Claude and ChatGPT can understand layout by spaces and line breaks.

Ranked #8 on Visual Question Answering (VQA) on DocVQA test

Optical Character Recognition (OCR) Question Answering +2

Paper
Code

GANHead: Towards Generative Animatable Neural Head Avatars

no code implementations • CVPR 2023 • Sijing Wu, Yichao Yan, Yunhao Li, Yuhao Cheng, Wenhan Zhu, Ke Gao, Xiaobo Li, Guangtao Zhai

To bring digital avatars into people's lives, it is highly demanded to efficiently generate complete, realistic, and animatable head avatars.

Paper
Add Code

Creating a Dynamic Quadrupedal Robotic Goalkeeper with Reinforcement Learning

no code implementations • 10 Oct 2022 • Xiaoyu Huang, Zhongyu Li, Yanzhen Xiang, Yiming Ni, Yufeng Chi, Yunhao Li, Lizhi Yang, Xue Bin Peng, Koushil Sreenath

We present a reinforcement learning (RL) framework that enables quadrupedal robots to perform soccer goalkeeping tasks in the real world.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Skeleton2Humanoid: Animating Simulated Characters for Physically-plausible Motion In-betweening

1 code implementation • 9 Oct 2022 • Yunhao Li, Zhenbo Yu, Yucheng Zhu, Bingbing Ni, Guangtao Zhai, Wei Shen

Stage I introduces a test time adaptation strategy, which improves the physical plausibility of synthesized human skeleton motions by optimizing skeleton joint locations.

Motion Synthesis Reinforcement Learning (RL) +1

Paper
Code

Retrieve & Memorize: Dialog Policy Learning with Multi-Action Memory

1 code implementation • Findings (ACL) 2021 • Yunhao Li, Yunyi Yang, Xiaojun Quan, Jianxing Yu

In this paper, we propose a retrieve-and-memorize framework to enhance the learning of system actions.

Decoder Response Generation +2

Paper
Code

SkinScan: Low-Cost 3D-Scanning for Dermatologic Diagnosis and Documentation

no code implementations • 31 Jan 2021 • Merlin A. Nau, Florian Schiffers, Yunhao Li, Bingjie Xu, Andreas Maier, Jack Tumblin, Marc Walton, Aggelos K. Katsaggelos, Florian Willomitzer, Oliver Cossairt

The utilization of computational photography becomes increasingly essential in the medical field.

Paper
Add Code

Looking Here or There? Gaze Following in 360-Degree Images

1 code implementation • ICCV 2021 • Yunhao Li, Wei Shen, Zhongpai Gao, Yucheng Zhu, Guangtao Zhai, Guodong Guo

Specifically, the local region is obtained as a 2D cone-shaped field along the 2D projection of the sight line starting at the human subject's head position, and the distant region is obtained by searching along the sight line in 3D sphere space.

Paper
Code

UBAR: Towards Fully End-to-End Task-Oriented Dialog Systems with GPT-2

1 code implementation • 7 Dec 2020 • Yunyi Yang, Yunhao Li, Xiaojun Quan

This paper presents our task-oriented dialog system UBAR which models task-oriented dialogs on a dialog session level.

Language Modelling Response Generation

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.