Search Results for author: Yaole Wang

Found 2 papers, 1 paper with code

Vidu: a Highly Consistent, Dynamic and Skilled Text-to-Video Generator with Diffusion Models

no code implementations • 7 May 2024 • Fan Bao, Chendong Xiang, Gang Yue, Guande He, Hongzhou Zhu, Kaiwen Zheng, Min Zhao, Shilong Liu, Yaole Wang, Jun Zhu

We introduce Vidu, a high-performance text-to-video generator that is capable of producing 1080p videos up to 16 seconds in a single generation.

Video Generation, Video Prediction


One Transformer Fits All Distributions in Multi-Modal Diffusion at Scale

3 code implementations • 12 Mar 2023 • Fan Bao, Shen Nie, Kaiwen Xue, Chongxuan Li, Shi Pu, Yaole Wang, Gang Yue, Yue Cao, Hang Su, Jun Zhu

Inspired by the unified view, UniDiffuser learns all distributions simultaneously with a minimal modification to the original diffusion model: it perturbs data in all modalities instead of a single modality, takes an individual timestep for each modality as input, and predicts the noise of all modalities instead of a single modality.
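The three modifications named in this summary can be illustrated with a small sketch. It is not the authors' implementation: the linear noise schedule, the step count `T`, the two toy modalities, and the linear `toy_predictor` are all assumptions chosen only to keep the example self-contained.

```python
import math
import random

T = 1000  # number of diffusion steps (assumed value)

# Assumed linear beta schedule; alpha_bar[t] is the cumulative signal fraction.
betas = [1e-4 + (2e-2 - 1e-4) * i / (T - 1) for i in range(T)]
alpha_bar = []
prod = 1.0
for b in betas:
    prod *= 1.0 - b
    alpha_bar.append(prod)


def perturb(x0, t, rng):
    """Forward-diffuse one vector x0 to timestep t; returns (x_t, eps)."""
    eps = [rng.gauss(0.0, 1.0) for _ in x0]
    a = alpha_bar[t]
    x_t = [math.sqrt(a) * v + math.sqrt(1.0 - a) * e for v, e in zip(x0, eps)]
    return x_t, eps


def toy_predictor(x_t, y_t, t_x, t_y, W):
    """Hypothetical stand-in for the joint network: a single linear map.

    It receives both noisy modalities and both timesteps, and returns a
    noise estimate for each modality.
    """
    feats = x_t + y_t + [t_x / T, t_y / T]
    out = [sum(w * f for w, f in zip(row, feats)) for row in W]
    return out[:len(x_t)], out[len(x_t):]


def joint_loss(x0, y0, W, rng):
    """Squared error on the noise of both modalities for one data pair."""
    # Each modality gets its own independently sampled timestep.
    t_x = rng.randrange(T)
    t_y = rng.randrange(T)
    # Both modalities are perturbed, not just one.
    x_t, eps_x = perturb(x0, t_x, rng)
    y_t, eps_y = perturb(y0, t_y, rng)
    # The predictor outputs noise for both modalities at once.
    pred_x, pred_y = toy_predictor(x_t, y_t, t_x, t_y, W)
    return sum((p - e) ** 2 for p, e in zip(pred_x + pred_y, eps_x + eps_y))
```

Here `W` has one row per output dimension (the combined length of both modalities) and one column per input feature (both modalities plus the two scaled timesteps). A real model would replace `toy_predictor` with a trained network and average the loss over a batch.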

Text-to-Image Generation

