no code implementations • 22 Mar 2024 • Shokichi Takakura, Taiji Suzuki
In this paper, we study the feature learning ability of two-layer neural networks in the mean-field regime through the lens of kernel methods.
no code implementations • 30 May 2023 • Shokichi Takakura, Taiji Suzuki
Despite the great success of Transformer networks in various applications such as natural language processing and computer vision, their theoretical aspects are not well understood.