no code implementations • 28 May 2024 • Gongyue Zhang, Honghai Liu
We have identified a potential method for unifying first-order optimizers through the use of variable Second-Moment Exponential Scaling(SMES).
1 code implementation • 5 Sep 2023 • Gongyue Zhang, Dinghuang Zhang, Shuwen Zhao, Donghan Liu, Carrie M. Toptan, Honghai Liu
It not only can accelerates slow-changing parameters for sparse gradients, similar to adaptive optimizers, but also can choose to accelerates frequently-changing parameters for non-sparse gradients, thus being adaptable to all types of datasets.