On the importance of pre-training data volume for compact language models

Recent advances in language modeling have led to computationally intensive and resource-demanding state-of-the-art models. In an effort towards sustainable practices, we study the impact of pre-training data volume on compact language models. Multiple BERT-based models are trained on gradually increasing amounts of French text. Through fine-tuning on the French Question Answering Dataset (FQuAD), we observe that well-performing models are obtained with as little as 100 MB of text. In addition, we show that, past critically low amounts of pre-training data, an intermediate pre-training step on the task-specific corpus does not yield substantial improvements.

EMNLP 2020
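The evaluation protocol described in the abstract (fine-tune a pre-trained compact French encoder on FQuAD, which is distributed in SQuAD v1.1 JSON format, and measure extractive QA quality) can be sketched with the Hugging Face transformers Trainer. This is only an illustrative sketch, not the authors' code: the camembert-base checkpoint stands in for the paper's compact models, the file paths and hyperparameters are placeholders, and the FQuAD JSON is assumed to have been flattened to question/context/answers records.

```python
# Illustrative sketch: fine-tune a (stand-in) French encoder on FQuAD for extractive QA.
from datasets import load_dataset
from transformers import (AutoTokenizer, AutoModelForQuestionAnswering,
                          TrainingArguments, Trainer, default_data_collator)

checkpoint = "camembert-base"  # stand-in; the paper pre-trains its own compact models
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForQuestionAnswering.from_pretrained(checkpoint)

# Assumes the nested FQuAD JSON has been flattened to records with
# id, question, context, answers={"text": [...], "answer_start": [...]}
squad = load_dataset("json", data_files={"train": "fquad_train_flat.json",
                                         "validation": "fquad_valid_flat.json"})

def preprocess(examples):
    enc = tokenizer(examples["question"], examples["context"],
                    truncation="only_second", max_length=384,
                    padding="max_length", return_offsets_mapping=True)
    start_positions, end_positions = [], []
    for i, offsets in enumerate(enc["offset_mapping"]):
        answer = examples["answers"][i]
        start_char = answer["answer_start"][0]
        end_char = start_char + len(answer["text"][0])
        seq_ids = enc.sequence_ids(i)
        # First and last token belonging to the context (sequence id 1)
        ctx_start = seq_ids.index(1)
        ctx_end = len(seq_ids) - 1 - seq_ids[::-1].index(1)
        if offsets[ctx_start][0] > start_char or offsets[ctx_end][1] < end_char:
            # Answer was truncated away: point both labels at position 0
            start_positions.append(0)
            end_positions.append(0)
        else:
            idx = ctx_start
            while idx <= ctx_end and offsets[idx][0] <= start_char:
                idx += 1
            start_positions.append(idx - 1)
            idx = ctx_end
            while idx >= ctx_start and offsets[idx][1] >= end_char:
                idx -= 1
            end_positions.append(idx + 1)
    enc["start_positions"] = start_positions
    enc["end_positions"] = end_positions
    enc.pop("offset_mapping")
    return enc

tokenized = squad.map(preprocess, batched=True,
                      remove_columns=squad["train"].column_names)

args = TrainingArguments(output_dir="fquad-finetune", learning_rate=3e-5,
                         num_train_epochs=3, per_device_train_batch_size=16)
Trainer(model=model, args=args, train_dataset=tokenized["train"],
        eval_dataset=tokenized["validation"],
        data_collator=default_data_collator).train()
```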

Datasets

FQuAD

Results from the Paper


Task                 Dataset   Model     Metric   Value   Global Rank
Question Answering   FQuAD     LePetit   EM       57.2    # 7
                                         F1       70.71   # 7
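The EM and F1 figures above are the usual SQuAD-style answer-level metrics. A minimal sketch of how they are typically computed per question is given below; the normalization (lowercasing, punctuation stripping, removal of a small set of French articles) follows the spirit of the SQuAD script, but the exact article list used by the official FQuAD evaluation is an assumption here.

```python
# Sketch of SQuAD-style EM and token-level F1 between a predicted and a gold answer span.
import re
import string
from collections import Counter

def normalize(text):
    """Lowercase, strip punctuation, drop (assumed) French articles, squeeze whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in set(string.punctuation))
    text = re.sub(r"\b(le|la|les|un|une|du|de|des)\b", " ", text)  # article list is an assumption
    return " ".join(text.split())

def exact_match(prediction, gold):
    """EM: 1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize(prediction) == normalize(gold))

def f1_score(prediction, gold):
    """Token-level F1 between the normalized prediction and gold answer."""
    pred_tokens = normalize(prediction).split()
    gold_tokens = normalize(gold).split()
    overlap = sum((Counter(pred_tokens) & Counter(gold_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(gold_tokens)
    return 2 * precision * recall / (precision + recall)

# Corpus-level EM / F1 are the averages over all questions
# (taking the max over gold answers when several references exist).
print(exact_match("le 14 juillet 1789", "14 juillet 1789"))                     # 1.0 after normalization
print(round(f1_score("prise de la Bastille en 1789", "la prise de la Bastille"), 2))  # 0.67
```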

Methods


No methods listed for this paper.