no code implementations • 31 Jan 2024 • Dingyi Dai, Yichi Zhang, Jiahao Zhang, Zhanqiu Hu, Yaohui Cai, Qi Sun, Zhiru Zhang
Quantization is a crucial technique for deploying deep learning models on resource-constrained devices, such as embedded FPGAs.
no code implementations • 13 Oct 2019 • Ritchie Zhao, Jordan Dotzel, Zhanqiu Hu, Preslav Ivanov, Christopher De Sa, Zhiru Zhang
Specialized hardware for handling activation outliers can enable low-precision neural networks, but at the cost of nontrivial area overhead.