no code implementations • 17 Sep 2023 • Xiangrui Su, Qi Zhang, Chongyang Shi, Jiachang Liu, Liang Hu
Existing VQA methods integrate vision modeling and language understanding to explore the deep semantics of the question.
Question Answering Visual Question Answering