4 code implementations • 16 Oct 2023 • Zhangir Azerbayev, Hailey Schoelkopf, Keiran Paster, Marco Dos Santos, Stephen Mcaleer, Albert Q. Jiang, Jia Deng, Stella Biderman, Sean Welleck
We present Llemma, a large language model for mathematics.
Ranked #6 on Automated Theorem Proving on miniF2F-test
2 code implementations • 10 Oct 2023 • Keiran Paster, Marco Dos Santos, Zhangir Azerbayev, Jimmy Ba
We hope that our dataset, openly released on the Hugging Face Hub, will help spur advances in the reasoning abilities of large language models.
no code implementations • NeurIPS 2023 • Shalev Lifshitz, Keiran Paster, Harris Chan, Jimmy Ba, Sheila Mcilraith
Constructing AI models that respond to text instructions is challenging, especially for sequential decision-making tasks.
2 code implementations • 3 Nov 2022 • Yongchao Zhou, Andrei Ioan Muresanu, Ziwen Han, Keiran Paster, Silviu Pitis, Harris Chan, Jimmy Ba
By conditioning on natural language instructions, large language models (LLMs) have displayed impressive capabilities as general-purpose computers.
no code implementations • 31 May 2022 • Keiran Paster, Sheila Mcilraith, Jimmy Ba
In all tested domains, ESPER achieves significantly better alignment between the target return and achieved return than simply conditioning on returns.
1 code implementation • NeurIPS 2021 • Beining Han, Chongyi Zheng, Harris Chan, Keiran Paster, Michael R. Zhang, Jimmy Ba
These changes are often spurious and unrelated to the underlying problem, such as background shifts for visual input agents.
no code implementations • ICLR 2021 • Keiran Paster, Sheila A. McIlraith, Jimmy Ba
Learning task-agnostic dynamics models in high-dimensional observation spaces can be challenging for model-based RL agents.