no code implementations • 28 Apr 2024 • Zirui Song, Yaohang Li, Meng Fang, Zhenhao Chen, Zecheng Shi, Yuan Huang, Ling Chen
Autonomous virtual agents are often limited by their singular mode of interaction with real-world environments, restricting their versatility.
1 code implementation • 26 Dec 2023 • Jingpu Yang, Helin Wang, Qirui Zhao, Zhecheng Shi, Zirui Song, Miao Fang
To address this, we have introduced an additional optimistic Actor to enhance the model's exploration ability, while employing a more constrained pessimistic Actor for performance evaluation.
2 code implementations • 5 Dec 2023 • Rizhao Cai, Zirui Song, Dayan Guan, Zhenhao Chen, Xing Luo, Chenyu Yi, Alex Kot
Large Multimodal Models (LMMs) such as GPT-4V and LLaVA have shown remarkable capabilities in visual reasoning with common image styles.
Ranked #1000000000 on Visual Question Answering on MS COCO
no code implementations • 14 Jan 2022 • Paul Goldsmith-Pinkham, Karen Jiang, Zirui Song, Jacob Wallace
We propose a method for reporting how program evaluations reduce gaps between groups, such as the gender or Black-white gap.