1 code implementation • 26 Jan 2024 • Saleh Ashkboos, Maximilian L. Croci, Marcelo Gennari do Nascimento, Torsten Hoefler, James Hensman
Large language models have become the cornerstone of natural language processing, but their use comes with substantial costs in terms of compute and memory resources.
1 code implementation • ECCV 2020 • Marcelo Gennari do Nascimento, Theo W. Costain, Victor Adrian Prisacariu
We propose a novel method for neural network quantization that casts the neural architecture search problem as one of hyperparameter search to find non-uniform bit distributions throughout the layers of a CNN.
no code implementations • ICCV 2019 • Marcelo Gennari do Nascimento, Roger Fawcett, Victor Adrian Prisacariu
Quantization is a popular way of increasing the speed and lowering the memory usage of Convolution Neural Networks (CNNs).