Search Results for author: Dayal Singh Kalra

Found 1 papers, 0 papers with code

Universal Sharpness Dynamics in Neural Network Training: Fixed Point Analysis, Edge of Stability, and Route to Chaos

no code implementations • 3 Nov 2023 • Dayal Singh Kalra, Tianyu He, Maissam Barkeshli

In gradient descent dynamics of neural networks, the top eigenvalue of the Hessian of the loss (sharpness) displays a variety of robust phenomena throughout training.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.