Search Results for author: Jan H. Kirchner

Found 2 papers, 1 papers with code

Understanding polysemanticity in neural networks through coding theory

no code implementations • 31 Jan 2024 • Simon C. Marshall, Jan H. Kirchner

Despite substantial efforts, neural network interpretability remains an elusive goal, with previous research failing to provide succinct explanations of most single neurons' impact on the network output.

Paper
Add Code

Researching Alignment Research: Unsupervised Analysis

1 code implementation • 6 Jun 2022 • Jan H. Kirchner, Logan Smith, Jacques Thibodeau, Kyle McDonell, Laria Reynolds

We looked at the subfields and identified the prominent researchers, recurring topics, and different modes of communication in each.

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.