Search Results for author: Jan H. Kirchner

Found 2 papers, 1 papers with code

Understanding polysemanticity in neural networks through coding theory

no code implementations31 Jan 2024 Simon C. Marshall, Jan H. Kirchner

Despite substantial efforts, neural network interpretability remains an elusive goal, with previous research failing to provide succinct explanations of most single neurons' impact on the network output.

Researching Alignment Research: Unsupervised Analysis

1 code implementation6 Jun 2022 Jan H. Kirchner, Logan Smith, Jacques Thibodeau, Kyle McDonell, Laria Reynolds

We looked at the subfields and identified the prominent researchers, recurring topics, and different modes of communication in each.

Cannot find the paper you are looking for? You can Submit a new open access paper.