Search Results for author: Alexandre Variengien

Found 5 papers, 5 papers with code

Look Before You Leap: A Universal Emergent Decomposition of Retrieval Tasks in Language Models

1 code implementation • 13 Dec 2023 • Alexandre Variengien, Eric Winsor

We find that LMs internally decompose retrieval tasks in a modular way: middle layers at the last token position process the request, while late layers retrieve the correct entity from the context.

Attribute Question Answering +1

How does GPT-2 compute greater-than?: Interpreting mathematical abilities in a pre-trained language model

1 code implementation • NeurIPS 2023 • Michael Hanna, Ollie Liu, Alexandre Variengien

Concretely, we use mechanistic interpretability techniques to explain the (limited) mathematical abilities of GPT-2 small.

Language Modelling

Interpretability in the Wild: a Circuit for Indirect Object Identification in GPT-2 small

3 code implementations • 1 Nov 2022 • Kevin Wang, Alexandre Variengien, Arthur Conmy, Buck Shlegeris, Jacob Steinhardt

Research in mechanistic interpretability seeks to explain behaviors of machine learning models in terms of their internal components.

Language Modelling

Towards self-organized control: Using neural cellular automata to robustly control a cart-pole agent

1 code implementation • 29 Jun 2021 • Alexandre Variengien, Stefano Nichele, Tom Glover, Sidney Pontes-Filho

The observations of the environment are transmitted in input cells, while the values of output cells are used as a readout of the system.

Q-Learning

A journey in ESN and LSTM visualisations on a language task

1 code implementation • 3 Dec 2020 • Alexandre Variengien, Xavier Hinaut

In this work, we trained ESNs and LSTMs on a Cross-Situational Learning (CSL) task.

Dimensionality Reduction Sentence

