Search Results for author: Yuhao Cheng

Found 9 papers, 5 papers with code

Rethink Predicting the Optical Flow with the Kinetics Perspective

1 code implementation • 21 May 2024 • Yuhao Cheng, Siru Zhang, Yiqiang Yan

Furthermore, comprehensive experiments and ablation studies prove that the proposed novel insight into how to predict the optical flow can achieve the better performance of the state-of-the-art methods, and in some metrics, the proposed method outperforms the correlation-based method, especially in situations containing occlusion and fast moving.

Optical Flow Estimation

Paper
Code

TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation

1 code implementation • 29 Apr 2024 • Junhao Cheng, Baiqiao Yin, Kaixin Cai, Minbin Huang, Hanhui Li, Yuxin He, Xi Lu, Yue Li, Yifei Li, Yuhao Cheng, Yiqiang Yan, Xiaodan Liang

To address this issue, we introduce TheaterGen, a training-free framework that integrates large language models (LLMs) and text-to-image (T2I) models to provide the capability of multi-turn image generation.

Denoising Image Generation +2

Paper
Code

ConsistentID: Portrait Generation with Multimodal Fine-Grained Identity Preserving

1 code implementation • 25 Apr 2024 • Jiehui Huang, Xiao Dong, Wenhui Song, Hanhui Li, Jun Zhou, Yuhao Cheng, Shutao Liao, Long Chen, Yiqiang Yan, Shengcai Liao, Xiaodan Liang

ConsistentID comprises two key components: a multimodal facial prompt generator that combines facial features, corresponding facial descriptions and the overall facial context to enhance precision in facial details, and an ID-preservation network optimized through the facial attention localization strategy, aimed at preserving ID consistency in facial regions.

591

Paper
Code

Monocular Identity-Conditioned Facial Reflectance Reconstruction

no code implementations • 30 Mar 2024 • Xingyu Ren, Jiankang Deng, Yuhao Cheng, Jia Guo, Chao Ma, Yichao Yan, Wenhan Zhu, Xiaokang Yang

We first learn a high-quality prior for facial reflectance.

3D Face Reconstruction

Paper
Add Code

LSCD: A Large-Scale Screen Content Dataset for Video Compression

no code implementations • 18 Aug 2023 • Yuhao Cheng, Siru Zhang, Yiqiang Yan, Rong Chen, Yun Zhang

Multimedia compression allows us to watch videos, see pictures and hear sounds within a limited bandwidth, which helps the flourish of the internet.

Video Compression

Paper
Add Code

GANHead: Towards Generative Animatable Neural Head Avatars

no code implementations • CVPR 2023 • Sijing Wu, Yichao Yan, Yunhao Li, Yuhao Cheng, Wenhan Zhu, Ke Gao, Xiaobo Li, Guangtao Zhai

To bring digital avatars into people's lives, it is highly demanded to efficiently generate complete, realistic, and animatable head avatars.

Paper
Add Code

Head3D: Complete 3D Head Generation via Tri-plane Feature Distillation

no code implementations • 28 Mar 2023 • Yuhao Cheng, Yichao Yan, Wenhan Zhu, Ye Pan, Bowen Pan, Xiaokang Yang

Head generation with diverse identities is an important task in computer vision and computer graphics, widely used in multimedia applications.

Paper
Add Code

Simple and Robust Loss Design for Multi-Label Learning with Missing Labels

2 code implementations • 13 Dec 2021 • Youcai Zhang, Yuhao Cheng, Xinyu Huang, Fei Wen, Rui Feng, Yaqian Li, Yandong Guo

Multi-label learning in the presence of missing labels (MLML) is a challenging problem.

Missing Labels Multi-Label Image Classification

Paper
Code

Semantic Role Labeling with Associated Memory Network

1 code implementation • NAACL 2019 • Chaoyu Guan, Yuhao Cheng, Hai Zhao

Semantic role labeling (SRL) is a task to recognize all the predicate-argument pairs of a sentence, which has been in a performance improvement bottleneck after a series of latest works were presented.

Semantic Role Labeling Sentence

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.