no code implementations • JEP/TALN/RECITAL 2021 • Christel Gérardin, Pascal Vaillant, Perceval Wajsbürt, Clément Gilavert, Ali Bellamine, Emmanuelle Kempf, Xavier Tannier
Nous avons également proposé un modèle « bout-enbout », avec une première phase d’extraction d’entités nommées également basée sur un transformer de type camembert-large et un classifieur de genre sur un modèle Adaboost.
no code implementations • 2 May 2024 • Jamil Zaghir, Marco Naguib, Mina Bjelogrlic, Aurélie Névéol, Xavier Tannier, Christian Lovis
PD is the most prevalent (78 articles).
no code implementations • 28 Mar 2024 • Nesrine Bannour, Christophe Servan, Aurélie Névéol, Xavier Tannier
Objective: This paper presentsan evaluation of masked language models for biomedical French on the task of clinical named entity recognition. Material and methods: We evaluate biomedical models CamemBERT-bio and DrBERT and compare them tostandard French models CamemBERT, FlauBERT and FrALBERT as well as multilingual mBERT using three publicallyavailable corpora for clinical named entity recognition in French.
no code implementations • 20 Feb 2024 • Marco Naguib, Xavier Tannier, Aurélie Névéol
Results are consistent over the three languages and suggest that few-shot learning using Large language models is not production ready for named entity recognition in the clinical domain.
no code implementations • 3 Jun 2023 • Christel Gérardin, Yuhan Xiong, Perceval Wajsbürt, Fabrice Carrat, Xavier Tannier
The objective of our study is to determine whether using English tools to extract and normalize French medical concepts on translations provides comparable performance to French models trained on a set of annotated French clinical notes.
no code implementations • 23 May 2023 • Christel Gérardin, Perceval Wajsbürt, Basile Dura, Alice Calliger, Alexandre Moucher, Xavier Tannier, Romain Bey
The precision, recall, and F1 score per document for the acute infection detection algorithm were 82. 54 (95CI 72. 86-91. 60), 85. 24 (95CI 76. 61-93. 70), 83. 87 (95CI 76, 92-90. 08) with exploitation of the results of the advanced body extraction algorithm, respectively.
no code implementations • 23 Mar 2023 • Xavier Tannier, Perceval Wajsbürt, Alice Calliger, Basile Dura, Alexandre Mouchet, Martin Hilka, Romain Bey
The objective of this study is to address the critical issue of de-identification of clinical reports in order to allow access to data for research purposes, while ensuring patient privacy.
no code implementations • 26 Jul 2022 • Basile Dura, Charline Jean, Xavier Tannier, Alice Calliger, Romain Bey, Antoine Neuraz, Rémi Flicoteaux
We used two French annotated medical datasets to compare our language models to the original CamemBERT network, evaluating the statistical significance of improvement with the Wilcoxon test.
1 code implementation • 1 Nov 2021 • Adrian Ahne, Vivek Khetan, Xavier Tannier, Md Imbessat Hassan Rizvi, Thomas Czernichow, Francisco Orchard, Charline Bour, Andrew Fano, Guy Fagherazzi
A cause-effect-tweet dataset was manually labeled and used to train 1) a fine-tuned Bertweet model to detect causal sentences containing a causal association 2) a CRF model with BERT based features to extract possible cause-effect associations.
no code implementations • 2 Apr 2021 • Perceval Wajsburt, Yoann Taillé, Xavier Tannier
We provide a set of experiments to study the model's capabilities and the effects of the order on performance.
no code implementations • JEPTALNRECITAL 2020 • Perceval Wajsb{\"u}rt, Yoann Taill{\'e}, Guillaume Lain{\'e}, Xavier Tannier
Nous pr{\'e}sentons dans cet article les m{\'e}thodes con{\c{c}}ues et les r{\'e}sultats obtenus lors de notre participation {\`a} la t{\^a}che 3 de la campagne d{'}{\'e}valuation DEFT 2020, consistant en la reconnaissance d{'}entit{\'e}s nomm{\'e}es du domaine m{\'e}dical.
no code implementations • JEPTALNRECITAL 2020 • Julien Tourille, Olivier Ferret, Aur{\'e}lie N{\'e}v{\'e}ol, Xavier Tannier
La r{\'e}solution de la cor{\'e}f{\'e}rence est un {\'e}l{\'e}ment essentiel pour la constitution automatique de chronologies m{\'e}dicales {\`a} partir des dossiers m{\'e}dicaux {\'e}lectroniques.
no code implementations • JEPTALNRECITAL 2019 • Jacques Hilbey, Louise Del{\'e}ger, Xavier Tannier
Nous pr{\'e}sentons dans cet article les m{\'e}thodes con{\c{c}}ues et les r{\'e}sultats obtenus lors de notre participation {\`a} la t{\^a}che 3 de la campagne d{'}{\'e}valuation DEFT 2019.
no code implementations • 25 Apr 2019 • Ivan Lerner, Nicolas Paris, Xavier Tannier
On APcNER corpus, the micro-average F-measure of the hybrid system on the 5 entities was 69. 5% in exact match, and 84. 1% in non-exact match.
no code implementations • 11 Apr 2019 • Charlotte Rudnik, Thibault Ehrhart, Olivier Ferret, Denis Teyssou, Raphaël Troncy, Xavier Tannier
News agencies produce thousands of multimedia stories describing events happening in the world that are either scheduled such as sports competitions, political summits and elections, or breaking events such as military conflicts, terrorist attacks, natural disasters, etc.
no code implementations • 19 Mar 2019 • Xavier Tannier, Nicolas Paris, Hugo Cisneros, Christel Daniel, Matthieu Doutreligne, Catherine Duclos, Nicolas Griffon, Claire Hassen-Khodja, Ivan Lerner, Adrien Parrot, Éric Sadou, Cyrina Saussol, Pascal Vaillant
Materials and Methods: The first method is a weakly supervised method using an unlabeled corpus (MIMIC) to build a silver standard, by producing semi-automatically a small and very precise set of rules to detect some samples of positive and negative patients.
1 code implementation • WS 2018 • Julien Tourille, Matthieu Doutreligne, Olivier Ferret, Aur{\'e}lie N{\'e}v{\'e}ol, Nicolas Paris, Xavier Tannier
Many applications in biomedical natural language processing rely on sequence tagging as an initial step to perform more complex analysis.
no code implementations • WS 2017 • Swen Ribeiro, Olivier Ferret, Xavier Tannier
In this paper, we present an unsupervised pipeline approach for clustering news articles based on identified event instances in their content.
no code implementations • SEMEVAL 2017 • Julien Tourille, Olivier Ferret, Xavier Tannier, Aur{\'e}lie N{\'e}v{\'e}ol
In this paper we present our participation to SemEval 2017 Task 12.
no code implementations • ACL 2017 • Julien Tourille, Olivier Ferret, Aur{\'e}lie N{\'e}v{\'e}ol, Xavier Tannier
We present a neural architecture for containment relation identification between medical events and/or temporal expressions.
no code implementations • JEPTALNRECITAL 2017 • Jos{\'e} Moreno, Romaric Besan{\c{c}}on, Romain Beaumont, Eva D{'}hondt, Anne-Laure Ligozat, Sophie Rosset, Xavier Tannier, Brigitte Grau
La d{\'e}sambigu{\"\i}sation d{'}entit{\'e}s (ou liaison d{'}entit{\'e}s), qui consiste {\`a} relier des mentions d{'}entit{\'e}s d{'}un texte {\`a} des entit{\'e}s d{'}une base de connaissance, est un probl{\`e}me qui se pose, entre autre, pour le peuplement automatique de bases de connaissances {\`a} partir de textes.
no code implementations • EACL 2017 • Julien Tourille, Olivier Ferret, Xavier Tannier, Aur{\'e}lie N{\'e}v{\'e}ol
In this paper, we present a method for temporal relation extraction from clinical narratives in French and in English.
no code implementations • JEPTALNRECITAL 2016 • Julien Tourille, Olivier Ferret, Aur{\'e}lie N{\'e}v{\'e}ol, Xavier Tannier
Cette analyse repose sur l{'}extraction d{'}{\'e}v{\'e}nements, d{'}expressions temporelles et des relations entre eux.
no code implementations • SEMEVAL 2016 • Maria Pontiki, Dimitris Galanis, Haris Papageorgiou, Ion Androutsopoulos, Man, Suresh har, Mohammad AL-Smadi, Mahmoud Al-Ayyoub, Yanyan Zhao, Bing Qin, Orph{\'e}e De Clercq, V{\'e}ronique Hoste, Marianna Apidianaki, Xavier Tannier, Natalia Loukachevitch, Evgeniy Kotelnikov, Nuria Bel, Salud Mar{\'\i}a Jim{\'e}nez-Zafra, G{\"u}l{\c{s}}en Eryi{\u{g}}it
Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +2
no code implementations • LREC 2016 • Kiem-Hieu Nguyen, Xavier Tannier, Olivier Ferret, Romaric Besan{\c{c}}on
We detail the methodology used for building the corpus and evaluate some existing systems on this new data.
no code implementations • LREC 2016 • Marianna Apidianaki, Xavier Tannier, C{\'e}cile Richart
Aspect Based Sentiment Analysis (ABSA) is the task of mining and summarizing opinions from text about specific entities and their aspects.
Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA)
no code implementations • JEPTALNRECITAL 2015 • Kiem-Hieu Nguyen, Xavier Tannier, Olivier Ferret, Romaric Besan{\c{c}}on
Les pr{\'e}c{\'e}dentes m{\'e}thodes de la litt{\'e}rature utilisent uniquement les t{\^e}tes des syntagmes pour repr{\'e}senter les entit{\'e}s. Pourtant, le groupe complet (par exemple, {''}un homme arm{\'e}{''}) apporte une information plus discriminante (que {''}homme{''}).
no code implementations • JEPTALNRECITAL 2015 • Mike Donald Tapi Nzali, Aur{\'e}lie N{\'e}v{\'e}ol, Xavier Tannier
En s{'}appuyant sur un corpus de documents issus de plusieurs dossiers {\'e}lectroniques patient d{\'e}sidentifi{\'e}s, nous d{\'e}crivons la construction d{'}une ressource annot{\'e}e en expressions temporelles selon la norme TimeML.
no code implementations • LREC 2014 • Cl{\'e}ment de Groc, Xavier Tannier, Claude de Loupy
This graph can be interpreted as a recommendation graph, where two terms occurring in a same document means that they recommend each other.
no code implementations • LREC 2014 • V{\'e}ronique Moriceau, Xavier Tannier
French resources have been evaluated in two different ways: on the French TimeBank corpus, a corpus of newspaper articles in French annotated according to the ISO-TimeML standard, and on a user application for automatic building of event timelines.
no code implementations • LREC 2014 • Cl{\'e}ment de Groc, Xavier Tannier
This article introduces a novel protocol and resource to evaluate Web-as-corpus topical document retrieval.
no code implementations • LREC 2014 • Xavier Tannier
Web pages do not offer reliable metadata concerning their creation date and time.
no code implementations • 16 Jan 2014 • Xavier Tannier, Philippe Muller
Temporal information has been the focus of recent attention in information extraction, leading to some standardization effort, in particular for the task of relating events in a text.
no code implementations • LREC 2014 • Cyril Grouin, Jeremy Leixa, Aurélie Névéol, Sophie Rosset, Xavier Tannier, Pierre Zweigenbaum
Overall, a total of 26, 409 entity annotations were mapped to 5, 797 unique UMLS concepts.
no code implementations • LREC 2012 • B{\'e}atrice Arnulphy, Xavier Tannier, Anne Vilnat
As our application domain is information extraction, we follow a named entity approach to describe and annotate events.
no code implementations • LREC 2012 • Andr{\'e} Bittar, Caroline Hag{\`e}ge, V{\'e}ronique Moriceau, Xavier Tannier, Charles Teiss{\`e}dre
We provide results of an initial application of these guidelines to real news-wire texts in French over several iterations of the annotation process.
no code implementations • LREC 2012 • Xavier Tannier, V{\'e}ronique Moriceau, B{\'e}atrice Arnulphy, Ruixin He
In this article, we present our methodology concerning the study of the evolution of event designations in French documents from the news agency AFP.
no code implementations • LREC 2012 • Xavier Tannier
The HTML rendering fully preserved and all annotations consist in new HTML spans with specific styles.
no code implementations • LREC 2012 • Patrick Paroubek, Xavier Tannier
In this paper, we present the founding elements of a formal model of the evaluation paradigm in natural language processing.