Search Results for author: Shen Nie

Found 4 papers, 3 papers with code

Unifying Bayesian Flow Networks and Diffusion Models through Stochastic Differential Equations

1 code implementation • 24 Apr 2024 • Kaiwen Xue, Yuhao Zhou, Shen Nie, Xu Min, Xiaolu Zhang, Jun Zhou, Chongxuan Li

Bayesian flow networks (BFNs) iteratively refine the parameters, instead of the samples in diffusion models (DMs), of distributions at various noise levels through Bayesian inference.

Bayesian Inference

Paper
Code

The Blessing of Randomness: SDE Beats ODE in General Diffusion-based Image Editing

no code implementations • 2 Nov 2023 • Shen Nie, Hanzhong Allan Guo, Cheng Lu, Yuhao Zhou, Chenyu Zheng, Chongxuan Li

We present a unified probabilistic formulation for diffusion-based image editing, where a latent variable is edited in a task-specific manner and generally deviates from the corresponding marginal distribution induced by the original stochastic or ordinary differential equation (SDE or ODE).

Image-to-Image Translation

Paper
Add Code

One Transformer Fits All Distributions in Multi-Modal Diffusion at Scale

3 code implementations • 12 Mar 2023 • Fan Bao, Shen Nie, Kaiwen Xue, Chongxuan Li, Shi Pu, Yaole Wang, Gang Yue, Yue Cao, Hang Su, Jun Zhu

Inspired by the unified view, UniDiffuser learns all distributions simultaneously with a minimal modification to the original diffusion model -- perturbs data in all modalities instead of a single modality, inputs individual timesteps in different modalities, and predicts the noise of all modalities instead of a single modality.

Text-to-Image Generation

6,114

Paper
Code

All are Worth Words: A ViT Backbone for Diffusion Models

3 code implementations • CVPR 2023 • Fan Bao, Shen Nie, Kaiwen Xue, Yue Cao, Chongxuan Li, Hang Su, Jun Zhu

We evaluate U-ViT in unconditional and class-conditional image generation, as well as text-to-image generation tasks, where U-ViT is comparable if not superior to a CNN-based U-Net of a similar size.

Ranked #4 on Text-to-Image Generation on MS COCO

Conditional Image Generation Text-to-Image Generation

756

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.