Search Results for author: Matvey Mikhalchuk

Found 3 papers, 2 papers with code

Your Transformer is Secretly Linear

1 code implementation19 May 2024 Anton Razzhigaev, Matvey Mikhalchuk, Elizaveta Goncharova, Nikolai Gerasimenko, Ivan Oseledets, Denis Dimitrov, Andrey Kuznetsov

This regularization improves performance metrics on benchmarks like Tiny Stories and SuperGLUE and as well successfully decreases the linearity of the models.

The Shape of Learning: Anisotropy and Intrinsic Dimensions in Transformer-Based Models

no code implementations10 Nov 2023 Anton Razzhigaev, Matvey Mikhalchuk, Elizaveta Goncharova, Ivan Oseledets, Denis Dimitrov, Andrey Kuznetsov

In this study, we present an investigation into the anisotropy dynamics and intrinsic dimension of embeddings in transformer architectures, focusing on the dichotomy between encoders and decoders.

Cannot find the paper you are looking for? You can Submit a new open access paper.