15 Apr 2024 • Guangyan Li, Yongqiang Tang, Wensheng Zhang
In this regard, we design a mixed compression model that combines Low-Rank matrix approximation And structured Pruning (LoRAP).
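To make the two ingredients concrete, here is a minimal NumPy sketch of each one in isolation. It is a generic illustration, not the LoRAP algorithm: the excerpt does not say how the two techniques are assigned to layers or how importance is scored, so the truncated SVD, the column-norm pruning criterion, and the names `low_rank_approx`, `prune_columns`, `w1`, and `w2` are all assumptions chosen for the example.

```python
import numpy as np


def low_rank_approx(weight: np.ndarray, rank: int) -> tuple[np.ndarray, np.ndarray]:
    """Factor `weight` (m x n) into A (m x rank) and B (rank x n) via truncated SVD."""
    u, s, vt = np.linalg.svd(weight, full_matrices=False)
    a = u[:, :rank] * s[:rank]  # scale the kept left singular vectors
    b = vt[:rank, :]
    return a, b


def prune_columns(weight: np.ndarray, keep: int) -> tuple[np.ndarray, np.ndarray]:
    """Structured pruning: keep the `keep` columns with the largest L2 norm.

    The norm criterion is an assumption for illustration only.
    """
    norms = np.linalg.norm(weight, axis=0)
    kept = np.sort(np.argsort(norms)[-keep:])  # surviving column indices, in order
    return weight[:, kept], kept


# Hypothetical weight matrices standing in for two layers of a model.
rng = np.random.default_rng(0)
w1 = rng.standard_normal((8, 8))
w2 = rng.standard_normal((8, 16))

# Low-rank path: 8*8 = 64 parameters become 8*2 + 2*8 = 32.
a, b = low_rank_approx(w1, rank=2)

# Pruning path: whole columns are removed, so the matrix shrinks to 8 x 8.
w2_pruned, kept = prune_columns(w2, keep=8)

print(a.shape, b.shape, w2_pruned.shape)
```

Both paths yield smaller dense matrices, which is what distinguishes structured pruning from unstructured (element-wise) sparsity: no sparse kernels are needed to realize the savings.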