Search Results for author: Guangyan Li

Found 1 papers, 0 papers with code

LoRAP: Transformer Sub-Layers Deserve Differentiated Structured Compression for Large Language Models

no code implementations15 Apr 2024 Guangyan Li, Yongqiang Tang, Wensheng Zhang

With this regard, we design a mixed compression model, which organically combines Low-Rank matrix approximation And structured Pruning (LoRAP).

Cannot find the paper you are looking for? You can Submit a new open access paper.