2 code implementations • 8 May 2024 • Sander Land, Max Bartolo
The disconnect between tokenizer creation and model training in language models has been known to allow for certain inputs, such as the infamous SolidGoldMagikarp token, to induce unwanted behaviour.