no code implementations • 9 Feb 2024 • João Daniel Silva, João Magalhães, Devis Tuia, Bruno Martins
In this work, we propose RS-CapRet, a Vision and Language method for remote sensing tasks, in particular image captioning and text-image retrieval.
Cross-Modal Retrieval • Decoder