no code implementations • 17 Oct 2020 • Çağla Aksoy, Alper Ahmetoğlu, Tunga Güngör
In this work, we adopt hierarchical multitask learning approaches for BERT pre-training.
Language Modelling • Natural Language Inference • +2