1 May 2024 • Leonardo Ranaldi, Andrè Freitas
The alignment of reasoning abilities between smaller and larger Language Models is largely conducted via Supervised Fine-Tuning (SFT) on demonstrations generated by robust Large Language Models (LLMs).