Search Results for author: Sander Land

Found 1 papers, 1 papers with code

Fishing for Magikarp: Automatically Detecting Under-trained Tokens in Large Language Models

2 code implementations8 May 2024 Sander Land, Max Bartolo

The disconnect between tokenizer creation and model training in language models has been known to allow for certain inputs, such as the infamous SolidGoldMagikarp token, to induce unwanted behaviour.

Language Modelling Large Language Model

Cannot find the paper you are looking for? You can Submit a new open access paper.