Search Results for author: Michael Lan

Found 1 papers, 0 papers with code

Interpreting Shared Circuits for Ordered Sequence Prediction in a Large Language Model

no code implementations • 7 Nov 2023 • Michael Lan, Fazl Barez

While transformer models exhibit strong capabilities on linguistic tasks, their complex architectures make them difficult to interpret.

Language Modelling Large Language Model

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.