1 code implementation • 19 Feb 2024 • Zouying Cao, Yifei Yang, Hai Zhao
In this paper, we present a perspective on head-wise shareable attention for large language models.
1 code implementation • 17 Feb 2024 • Yifei Yang, Zouying Cao, Hai Zhao
Large language models (LLMs) based on the transformer architecture keep growing in size, which brings considerable costs to both model training and inference.
no code implementations • 30 Sep 2023 • Zouying Cao, Yifei Yang, Hai Zhao
While large language models (LLMs) have been widely applied across various domains thanks to their powerful language understanding and generation capabilities, methods for detecting non-factual or hallucinatory content generated by LLMs remain scarce.