1 code implementation • 3 Apr 2024 • Keyu Tian, Yi Jiang, Zehuan Yuan, Bingyue Peng, LiWei Wang
We present Visual AutoRegressive modeling (VAR), a new generation paradigm that redefines the autoregressive learning on images as coarse-to-fine "next-scale prediction" or "next-resolution prediction", diverging from the standard raster-scan "next-token prediction".
Ranked #7 on Image Generation on ImageNet 256x256