1 code implementation • DCASE workshop 2021 • F ́elix Gontier, Romain Serizel, Christophe Cerisara
utomated audio captioning is the multimodal task of describing environmental audio recordings with fluent natural language.
Ranked #5 on Retrieval-augmented Few-shot In-context Audio Captioning on AudioCaps
AudioCaps Caption Generation +2