1 code implementation • 3 Dec 2023 • Andrés Villa, Juan Carlos León Alcázar, Alvaro Soto, Bernard Ghanem
This paper introduces MERLIM, a Multi-modal Evaluation Benchmark that serves as a scalable test-bed for assessing the performance of Instruction Tuning Large Vision and Language Models (IT-LVLMs) on fundamental computer vision tasks.