Search Results for author: Ke Zeng

Found 4 papers, 2 papers with code

CLAQ: Pushing the Limits of Low-Bit Post-Training Quantization for LLMs

no code implementations • 27 May 2024 • Haoyu Wang, Bei Liu, Hang Shao, Bo Xiao, Ke Zeng, Guanglu Wan, Yanmin Qian

In this paper, we present a novel and effective Column-Level Adaptive weight Quantization (CLAQ) framework by introducing three different types of adaptive strategies for LLM quantization.

Learning or Self-aligning? Rethinking Instruction Fine-tuning

no code implementations • 28 Feb 2024 • Mengjie Ren, Boxi Cao, Hongyu Lin, Cao Liu, Xianpei Han, Ke Zeng, Guanglu Wan, Xunliang Cai, Le Sun

Instruction Fine-tuning (IFT) is a critical phase in building large language models (LLMs).

World Knowledge

One-Shot Sensitivity-Aware Mixed Sparsity Pruning for Large Language Models

1 code implementation • 14 Oct 2023 • Hang Shao, Bei Liu, Bo Xiao, Ke Zeng, Guanglu Wan, Yanmin Qian

Various Large Language Models (LLMs) from the Generative Pretrained Transformer (GPT) family have achieved outstanding performance in a wide range of text generation tasks.

Quantization, Text Generation

A Task-oriented Dialog Model with Task-progressive and Policy-aware Pre-training

1 code implementation • 1 Oct 2023 • Lucen Zhong, Hengtong Lu, Caixia Yuan, Xiaojie Wang, Jiashen Sun, Ke Zeng, Guanglu Wan

A global policy consistency task is designed to capture the multi-turn dialog policy sequential relation, and an act-based contrastive learning task is designed to capture similarities among samples with the same dialog policy.

Contrastive Learning
