Search Results for author: Shilin Zhang

Found 5 papers, 3 papers with code

PyramidInfer: Pyramid KV Cache Compression for High-throughput LLM Inference

no code implementations • 21 May 2024 • Dongjie Yang, Xiaodong Han, Yan Gao, Yao Hu, Shilin Zhang, Hai Zhao

To accelerate inference, we store computed keys and values (KV cache) in the GPU memory.

Paper
Add Code

Continual Offline Reinforcement Learning via Diffusion-based Dual Generative Replay

1 code implementation • 16 Apr 2024 • Jinmei Liu, Wenbin Li, Xiangyu Yue, Shilin Zhang, Chunlin Chen, Zhi Wang

Finally, by interleaving pseudo samples with real ones of the new task, we continually update the state and behavior generators to model progressively diverse behaviors, and regularize the multi-head critic via behavior cloning to mitigate forgetting.

Continual Learning reinforcement-learning

Paper
Code

MLLM-as-a-Judge: Assessing Multimodal LLM-as-a-Judge with Vision-Language Benchmark

1 code implementation • 7 Feb 2024 • Dongping Chen, Ruoxi Chen, Shilin Zhang, Yinuo Liu, Yaochen Wang, Huichi Zhou, Qihui Zhang, Pan Zhou, Yao Wan, Lichao Sun

Drawing inspiration from the concept of LLM-as-a-Judge within LLMs, this paper introduces a novel benchmark, termed MLLM-as-a-Judge, to assess the ability of MLLMs in assisting judges across diverse modalities, encompassing three distinct tasks: Scoring Evaluation, Pair Comparison, and Batch Ranking.

Paper
Code

LLM-as-a-Coauthor: Can Mixed Human-Written and Machine-Generated Text Be Detected?

2 code implementations • 11 Jan 2024 • Qihui Zhang, Chujie Gao, Dongping Chen, Yue Huang, Yixin Huang, Zhenyang Sun, Shilin Zhang, Weiye Li, Zhengyan Fu, Yao Wan, Lichao Sun

With the rapid development and widespread application of Large Language Models (LLMs), the use of Machine-Generated Text (MGT) has become increasingly common, bringing with it potential risks, especially in terms of quality and integrity in fields like news, education, and science.

Paper
Code

Density Distribution-based Learning Framework for Addressing Online Continual Learning Challenges

no code implementations • 22 Nov 2023 • Shilin Zhang, Jiahui Wang

CL, especially the Class Incremental Learning, enables adaptation to new test distributions while continuously learning from a single-pass training data stream, which is more in line with the practical application requirements of real-world scenarios.

Class Incremental Learning Density Estimation +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.