Search Results for author: Zhenyang Xiao

Found 3 papers, 2 papers with code

Learning to Check: Unleashing Potentials for Self-Correction in Large Language Models

1 code implementation • 20 Feb 2024 • Che Zhang, Zhenyang Xiao, Chengcheng Han, Yixin Lian, Yuejian Fang

After integrating the original CoT data and checking-correction data for training, we observe that models could improve their self-checking capabilities, thereby enhancing their self-correction capacity and eliminating the need for external feedback or ground truth labels to ascertain the endpoint of correction.

Mathematical Reasoning

LoMA: Lossless Compressed Memory Attention

no code implementations • 16 Jan 2024 • Yumeng Wang, Zhenyang Xiao

We introduce Lossless Compressed Memory Attention (LoMA), a novel approach that enables lossless compression of the KV cache, thereby reducing the memory and computational demands during autoregressive generation.
