Search Results for author: Alexandre Variengien

Found 5 papers, 5 papers with code

Look Before You Leap: A Universal Emergent Decomposition of Retrieval Tasks in Language Models

1 code implementation13 Dec 2023 Alexandre Variengien, Eric Winsor

We find that LMs internally decompose retrieval tasks in a modular way: middle layers at the last token position process the request, while late layers retrieve the correct entity from the context.

Attribute Question Answering +1

Interpretability in the Wild: a Circuit for Indirect Object Identification in GPT-2 small

3 code implementations1 Nov 2022 Kevin Wang, Alexandre Variengien, Arthur Conmy, Buck Shlegeris, Jacob Steinhardt

Research in mechanistic interpretability seeks to explain behaviors of machine learning models in terms of their internal components.

Language Modelling

Towards self-organized control: Using neural cellular automata to robustly control a cart-pole agent

1 code implementation29 Jun 2021 Alexandre Variengien, Stefano Nichele, Tom Glover, Sidney Pontes-Filho

The observations of the environment are transmitted in input cells, while the values of output cells are used as a readout of the system.

Q-Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.