1 code implementation • 10 Jun 2020 • Luigi Carratino, Moustapha Cissé, Rodolphe Jenatton, Jean-Philippe Vert
We show that Mixup can be interpreted as a standard empirical risk minimization estimator subject to a combination of data transformation and random perturbation of the transformed data.
Ranked #75 on Image Classification on ObjectNet (using extra training data)
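For readers unfamiliar with Mixup, the sketch below shows the commonly described training-time augmentation: each example is replaced by a convex combination of itself and a randomly chosen partner, with the same weight applied to the labels. It illustrates only the basic Mixup step, not the paper's reinterpretation of it. The function name `mixup_batch`, the default `alpha=0.2`, and the list-of-lists data layout are assumptions made for this illustration.

```python
import random


def mixup_batch(xs, ys, alpha=0.2, rng=None):
    """Mix each example with a random partner from the same batch.

    xs: list of feature vectors (lists of floats).
    ys: list of label vectors (e.g. one-hot lists of floats).
    alpha: Beta(alpha, alpha) shape parameter (assumed default).
    """
    rng = rng or random.Random()
    # One mixing coefficient in [0, 1] shared by the whole batch.
    lam = rng.betavariate(alpha, alpha)
    # Random partner index for every example.
    perm = list(range(len(xs)))
    rng.shuffle(perm)

    def mix(a, b):
        # Element-wise convex combination of two vectors.
        return [lam * u + (1.0 - lam) * v for u, v in zip(a, b)]

    x_mix = [mix(xs[i], xs[j]) for i, j in enumerate(perm)]
    y_mix = [mix(ys[i], ys[j]) for i, j in enumerate(perm)]
    return x_mix, y_mix, lam


# Usage: three 2-d inputs with one-hot labels over three classes.
xs = [[0.0, 1.0], [2.0, 3.0], [4.0, 5.0]]
ys = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
x_mix, y_mix, lam = mixup_batch(xs, ys, rng=random.Random(0))
```

Because inputs and labels share the same coefficient, each mixed label vector still sums to one when the original labels are one-hot.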
12 code implementations • ICML 2017 • Edouard Grave, Armand Joulin, Moustapha Cissé, David Grangier, Hervé Jégou
We propose an approximate strategy to efficiently train neural-network-based language models over very large vocabularies.