Search Results for author: Tommaso Pegolotti

Found 2 papers, 2 papers with code

QIGen: Generating Efficient Kernels for Quantized Inference on Large Language Models

1 code implementation7 Jul 2023 Tommaso Pegolotti, Elias Frantar, Dan Alistarh, Markus Püschel

We present ongoing work on a new automatic code generation approach for supporting quantized generative inference on LLMs such as LLaMA or OPT on off-the-shelf CPUs.

Code Generation

SparseProp: Efficient Sparse Backpropagation for Faster Training of Neural Networks

1 code implementation9 Feb 2023 Mahdi Nikdan, Tommaso Pegolotti, Eugenia Iofinova, Eldar Kurtic, Dan Alistarh

We provide a new efficient version of the backpropagation algorithm, specialized to the case where the weights of the neural network being trained are sparse.

Transfer Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.