Search Results for author: Malvina Nikandrou

Found 5 papers, 2 papers with code

Learning To See But Forgetting To Follow: Visual Instruction Tuning Makes LLMs More Prone To Jailbreak Attacks

1 code implementation7 May 2024 Georgios Pantazopoulos, Amit Parekh, Malvina Nikandrou, Alessandro Suglia

Augmenting Large Language Models (LLMs) with image-understanding capabilities has resulted in a boom of high-performing Vision-Language models (VLMs).

Multitask Multimodal Prompted Training for Interactive Embodied Task Completion

no code implementations7 Nov 2023 Georgios Pantazopoulos, Malvina Nikandrou, Amit Parekh, Bhathiya Hemanthage, Arash Eshghi, Ioannis Konstas, Verena Rieser, Oliver Lemon, Alessandro Suglia

Interactive and embodied tasks pose at least two fundamental challenges to existing Vision & Language (VL) models, including 1) grounding language in trajectories of actions and observations, and 2) referential disambiguation.

Decoder Text Generation

Quality-agnostic Image Captioning to Safely Assist People with Vision Impairment

no code implementations28 Apr 2023 Lu Yu, Malvina Nikandrou, Jiali Jin, Verena Rieser

In this paper, we propose a quality-agnostic framework to improve the performance and robustness of image captioning models for visually impaired people.

Data Augmentation Image Captioning

Going for GOAL: A Resource for Grounded Football Commentaries

1 code implementation8 Nov 2022 Alessandro Suglia, José Lopes, Emanuele Bastianelli, Andrea Vanzo, Shubham Agarwal, Malvina Nikandrou, Lu Yu, Ioannis Konstas, Verena Rieser

As the course of a game is unpredictable, so are commentaries, which makes them a unique resource to investigate dynamic language grounding.

Moment Retrieval Retrieval

Cannot find the paper you are looking for? You can Submit a new open access paper.