no code implementations • 19 Apr 2024 • Stephen Choi, William Gazeley
This paper presents the LLM-ADE framework, a novel methodology for continued pre-training of large language models (LLMs) that addresses the challenges of catastrophic forgetting and double descent.
1 code implementation • 6 Oct 2023 • Stephen Choi, William Gazeley, Siu Ho Wong, TingTing Li
With the exponential growth in large language models (LLMs), leveraging their emergent properties for specialized domains like finance merits exploration.