Search Results for author: Aritra Das

Found 5 papers, 3 papers with code

Grokking Modular Polynomials

no code implementations5 Jun 2024 Darshil Doshi, Tianyu He, Aritra Das, Andrey Gromov

Neural networks readily learn a subset of the modular arithmetic tasks, while failing to generalize on the rest.

To grok or not to grok: Disentangling generalization and memorization on corrupted algorithmic datasets

1 code implementation19 Oct 2023 Darshil Doshi, Aritra Das, Tianyu He, Andrey Gromov

Robust generalization is a major challenge in deep learning, particularly when the number of trainable parameters is very large.

Memorization

Cannot find the paper you are looking for? You can Submit a new open access paper.