Search Results for author: Rita Ramos

Found 3 papers, 3 papers with code

LMCap: Few-shot Multilingual Image Captioning by Retrieval Augmented Language Model Prompting

1 code implementation • 31 May 2023 • Rita Ramos, Bruno Martins, Desmond Elliott

Multilingual image captioning has recently been tackled by training with large-scale machine-translated data, which is an expensive, noisy, and time-consuming process.

Decoder • Image Captioning • +2

Retrieval-augmented Image Captioning

1 code implementation • 16 Feb 2023 • Rita Ramos, Desmond Elliott, Bruno Martins

The encoder in our model jointly processes the image and retrieved captions using a pretrained V&L BERT, while the decoder attends to the multimodal encoder representations, benefiting from the extra textual evidence from the retrieved captions.

Decoder • Image Captioning • +2

SmallCap: Lightweight Image Captioning Prompted with Retrieval Augmentation

1 code implementation • CVPR 2023 • Rita Ramos, Bruno Martins, Desmond Elliott, Yova Kementchedjhieva

Recent advances in image captioning have focused on scaling the data and model size, substantially increasing the cost of pre-training and finetuning.

Decoder • Image Captioning • +1
