Search Results for author: Rita Ramos

Found 3 papers, 3 papers with code

LMCap: Few-shot Multilingual Image Captioning by Retrieval Augmented Language Model Prompting

1 code implementation31 May 2023 Rita Ramos, Bruno Martins, Desmond Elliott

Multilingual image captioning has recently been tackled by training with large-scale machine translated data, which is an expensive, noisy, and time-consuming process.

Decoder Image Captioning +2

Retrieval-augmented Image Captioning

1 code implementation16 Feb 2023 Rita Ramos, Desmond Elliott, Bruno Martins

The encoder in our model jointly processes the image and retrieved captions using a pretrained V&L BERT, while the decoder attends to the multimodal encoder representations, benefiting from the extra textual evidence from the retrieved captions.

Decoder Image Captioning +2

SmallCap: Lightweight Image Captioning Prompted with Retrieval Augmentation

1 code implementation CVPR 2023 Rita Ramos, Bruno Martins, Desmond Elliott, Yova Kementchedjhieva

Recent advances in image captioning have focused on scaling the data and model size, substantially increasing the cost of pre-training and finetuning.

Decoder Image Captioning +1

Cannot find the paper you are looking for? You can Submit a new open access paper.