no code implementations • 29 Aug 2023 • Etienne Labbé, Thomas Pellegrini, Julien Pinquier
For ATR, we propose using the standard Cross-Entropy loss values obtained for any audio/caption pair.
1 code implementation • 14 Nov 2022 • Etienne Labbé, Thomas Pellegrini, Julien Pinquier
For this reason, several complementary metrics, such as BLEU, CIDEr, SPICE and SPIDEr, are used to compare a single automatic caption to one or several captions of reference, produced by a human annotator.
1 code implementation • 16 Feb 2021 • Léo Cances, Etienne Labbé, Thomas Pellegrini
In all but one cases, MM, RMM, and FM outperformed MT and DCT significantly, MM and RMM being the best methods in most experiments.