1 code implementation • 5 Jun 2023 • Hervé Déjean, Stéphane Clinchant, Carlos Lassance, Simon Lupart, Thibault Formal
We compare both dense and sparse approaches under various finetuning protocols and middle training on different collections (MS MARCO, Wikipedia or Tripclick).
no code implementations • 25 Apr 2023 • Carlos Lassance, Simon Lupart, Hervé Dejean, Stéphane Clinchant, Nicola Tonellotto
Sparse neural retrievers, such as DeepImpact, uniCOIL and SPLADE, have been introduced recently as an efficient and effective way to perform retrieval with inverted indexes.
no code implementations • 25 Jan 2023 • Simon Lupart, Stéphane Clinchant
Neural retrieval models have acquired significant effectiveness gains over the last few years compared to term-based methods.
1 code implementation • 5 May 2022 • Simon Lupart, Thibault Formal, Stéphane Clinchant
To this end, we build three query-based distribution shifts within MS MARCO (query-semantic, query-intent, query-length), which are used to evaluate the three main families of neural retrievers based on BERT: sparse, dense, and late-interaction -- as well as a monoBERT re-ranker.
no code implementations • 9 Jan 2022 • Simon Lupart, Benoit Favre, Vassilina Nikoulina, Salah Ait-Mokhtar
MeSH (Medical Subject Headings) is a large thesaurus created by the National Library of Medicine and used for fine-grained indexing of publications in the biomedical domain.