1 code implementation • 20 Dec 2023 • Tannon Kew, Florian Schottmann, Rico Sennrich
The vast majority of today's large language models are English-centric, having been pretrained predominantly on English text.
1 code implementation • 28 Nov 2023 • Noëmi Aepli, Chantal Amrhein, Florian Schottmann, Rico Sennrich
For sensible progress in natural language processing, it is important that we are aware of the limitations of the evaluation metrics we use.
1 code implementation • 18 May 2023 • Chantal Amrhein, Florian Schottmann, Rico Sennrich, Samuel Läubli
We hypothesise that creating training data in the reverse direction, i.e. starting from gender-fair text, is easier for morphologically complex languages and show that it matches the performance of state-of-the-art rewriting models for English.
no code implementations • 6 Oct 2022 • Dieuwke Hupkes, Mario Giulianelli, Verna Dankers, Mikel Artetxe, Yanai Elazar, Tiago Pimentel, Christos Christodoulopoulos, Karim Lasri, Naomi Saphra, Arabella Sinclair, Dennis Ulmer, Florian Schottmann, Khuyagbaatar Batsuren, Kaiser Sun, Koustuv Sinha, Leila Khalatbari, Maria Ryskina, Rita Frieske, Ryan Cotterell, Zhijing Jin
We present a taxonomy for characterising and understanding generalisation research in NLP.