Search Results for author: Jiangcun Du

Found 1 papers, 0 papers with code

A Comprehensive Evaluation of Quantization Strategies for Large Language Models

no code implementations • 26 Feb 2024 • Renren Jin, Jiangcun Du, Wuwei Huang, Wei Liu, Jian Luan, Bin Wang, Deyi Xiong

Our experimental results indicate that LLMs with 4-bit quantization can retain performance comparable to their non-quantized counterparts, and perplexity can serve as a proxy metric for quantized LLMs on most benchmarks.

Language Modelling Quantization

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.