1 code implementation • 17 Oct 2021 • Zai Shi, Zhao Meng, Yiran Xing, Yunpu Ma, Roger Wattenhofer
3D-RETR is capable of 3D reconstruction from a single view or multiple views.
1 code implementation • ACL 2021 • Yiran Xing, Zai Shi, Zhao Meng, Gerhard Lakemeyer, Yunpu Ma, Roger Wattenhofer
We present Knowledge Enhanced Multimodal BART (KM-BART), which is a Transformer-based sequence-to-sequence model capable of reasoning about commonsense knowledge from multimodal inputs of images and texts.