Search Results for author: Jorg K. H. Franke

Found 1 papers, 1 papers with code

HW-GPT-Bench: Hardware-Aware Architecture Benchmark for Language Models

1 code implementation16 May 2024 Rhea Sanjay Sukthanker, Arber Zela, Benedikt Staffler, Jorg K. H. Franke, Frank Hutter

To this end, we propose HW-GPT-Bench, a hardware-aware language model surrogate benchmark, where we leverage weight-sharing techniques from Neural Architecture Search (NAS) to efficiently train a supernet proxy, encompassing language models of varying scales in a single model.

Language Modelling Neural Architecture Search

Cannot find the paper you are looking for? You can Submit a new open access paper.