Search Results for author: Alexandru Crăciun

Found 1 papers, 0 papers with code

On the Stability of Gradient Descent for Large Learning Rate

no code implementations20 Feb 2024 Alexandru Crăciun, Debarghya Ghoshdastidar

There currently is a significant interest in understanding the Edge of Stability (EoS) phenomenon, which has been observed in neural networks training, characterized by a non-monotonic decrease of the loss function over epochs, while the sharpness of the loss (spectral norm of the Hessian) progressively approaches and stabilizes around 2/(learning rate).

Cannot find the paper you are looking for? You can Submit a new open access paper.