Search Results for author: Andrè Freitas

Found 1 papers, 0 papers with code

Self-Refine Instruction-Tuning for Aligning Reasoning in Language Models

no code implementations1 May 2024 Leonardo Ranaldi, Andrè Freitas

The alignments of reasoning abilities between smaller and larger Language Models are largely conducted via Supervised Fine-Tuning (SFT) using demonstrations generated from robust Large Language Models (LLMs).

Math

Cannot find the paper you are looking for? You can Submit a new open access paper.