1 code implementation • 13 Jan 2024 • Zhengxin Zhang, Dan Zhao, Xupeng Miao, Gabriele Oliaro, Qing Li, Yong Jiang, Zhihao Jia
Experiments show that QST can reduce the total memory footprint by up to 2. 3 $\times$ and speed up the finetuning process by up to 3 $\times$ while achieving competent performance compared with the state-of-the-art.
3 code implementations • 16 May 2023 • Xupeng Miao, Gabriele Oliaro, Zhihao Zhang, Xinhao Cheng, Zeyu Wang, Zhengxin Zhang, Rae Ying Yee Wong, Alan Zhu, Lijie Yang, Xiaoxiang Shi, Chunan Shi, Zhuoming Chen, Daiyaan Arfeen, Reyna Abhyankar, Zhihao Jia
Our evaluation shows that SpecInfer outperforms existing LLM serving systems by 1. 5-2. 8x for distributed LLM inference and by 2. 6-3. 5x for offloading-based LLM inference, while preserving the same generative performance.
no code implementations • 22 Nov 2021 • Zhengxin Zhang, Youssef Mroueh, Ziv Goldfeld, Bharath K. Sriperumbudur
Discrepancy measures between probability distributions are at the core of statistical inference and machine learning.
no code implementations • 11 Mar 2021 • Sreejith Sreekumar, Zhengxin Zhang, Ziv Goldfeld
Statistical distances (SDs), which quantify the dissimilarity between probability distributions, are central to machine learning and statistics.
no code implementations • SEMEVAL 2019 • Qimin Zhou, Zhengxin Zhang, Hao Wu, Linmao Wang
In our system, the input of convolutional neural network is the embedding vectors which are drawn from the pre-trained BERT model.
no code implementations • SEMEVAL 2018 • Zhengxin Zhang, Qimin Zhou, Hao Wu
We participate in two subtasks for English tweets: EI-reg and V-reg.
13 code implementations • 29 Nov 2017 • Zhengxin Zhang, Qingjie Liu, Yunhong Wang
Road extraction from aerial images has been a hot research topic in the field of remote sensing image analysis.