no code implementations • 16 Aug 2023 • Reza Pourreza, Apratim Bhattacharyya, Sunny Panchal, Mingu Lee, Pulkit Madan, Roland Memisevic
In this work, we apply LLMs to image generation tasks by directly generating the virtual brush strokes to paint an image.
no code implementations • 30 Jun 2023 • Apratim Bhattacharyya, Sunny Panchal, Mingu Lee, Reza Pourreza, Pulkit Madan, Roland Memisevic
Multi-modal language models (LMs) have recently shown promising performance in high-level reasoning tasks on videos.