Search Results for author: Yang Yong

Found 5 papers, 4 papers with code

LLM-QBench: A Benchmark Towards the Best Practice for Post-training Quantization of Large Language Models

1 code implementation 9 May 2024 Ruihao Gong, Yang Yong, Shiqiao Gu, Yushi Huang, Yunchen Zhang, Xianglong Liu, DaCheng Tao

Recent advancements in large language models (LLMs) are propelling us toward artificial general intelligence, thanks to their remarkable emergent abilities and reasoning capabilities.

Benchmarking, Computational Efficiency

Fast and Controllable Post-training Sparsity: Learning Optimal Sparsity Allocation with Global Constraint in Minutes

1 code implementation 9 May 2024 Ruihao Gong, Yang Yong, Zining Wang, Jinyang Guo, Xiuying Wei, Yuqing Ma, Xianglong Liu

Previous methods for finding sparsity rates mainly focus on the training-aware scenario, which usually fails to converge stably under the PTS setting with limited data and much less training cost.

Compressing Models with Few Samples: Mimicking then Replacing

1 code implementation CVPR 2022 Huanyu Wang, Junjie Liu, Xin Ma, Yang Yong, Zhenhua Chai, Jianxin Wu

Hence, previous methods optimize the compressed model layer-by-layer and try to make every layer have the same outputs as the corresponding layer in the teacher model, which is cumbersome.

Design and implementation of smart cooking based on amazon echo

no code implementations 4 Dec 2018 Lin Xiaoguang, Yang Yong, Zhang Ju

Smart cooking based on Amazon Echo uses the internet of things and cloud computing to assist in cooking food.

Cloud Computing
