Search Results for author: Liwei Guo

Found 4 papers, 0 papers with code

The CAP Principle for LLM Serving: A Survey of Long-Context Large Language Model Serving

no code implementations • 18 May 2024 • Pai Zeng, Zhenyu Ning, Jieru Zhao, Weihao Cui, Mengwei Xu, Liwei Guo, Xusheng Chen, Yizhou Shan

We survey the large language model (LLM) serving area to understand the intricate dynamics between cost-efficiency and accuracy, which is magnified by the growing need for longer contextual understanding when deploying models at a massive scale.

Language Modelling Large Language Model

Paper
Add Code

EdgeMoE: Fast On-Device Inference of MoE-based Large Language Models

no code implementations • 28 Aug 2023 • Rongjie Yi, Liwei Guo, Shiyun Wei, Ao Zhou, Shangguang Wang, Mengwei Xu

Large Language Models (LLMs) such as GPTs and LLaMa have ushered in a revolution in machine intelligence, owing to their exceptional capabilities in a wide range of machine learning tasks.

Computational Efficiency

Paper
Add Code

STI: Turbocharge NLP Inference at the Edge via Elastic Pipelining

no code implementations • 11 Jul 2022 • Liwei Guo, Wonkyo Choe, Felix Xiaozhu Lin

Yet, the unprecedented size of an NLP model stresses both latency and memory, creating a tension between the two key resources of a mobile device.

Management

Paper
Add Code

Let the Cloud Watch Over Your IoT File Systems

no code implementations • 17 Feb 2019 • Liwei Guo, Yiying Zhang, Felix Xiaozhu Lin

To safeguard such data on smart devices, we present a novel storage stack architecture that i) protects file data in a trusted execution environment (TEE); ii) outsources file system logic and metadata out of TEE; iii) running a metadata-only file system replica in the cloud for continuously verifying the on-device file system behaviors.

Cryptography and Security

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.