13 code implementations • 18 Nov 2018 • Jonathan Lew, Deval Shah, Suchita Pati, Shaylin Cattell, Mengchi Zhang, Amruth Sandhupatla, Christopher Ng, Negar Goli, Matthew D. Sinclair, Timothy G. Rogers, Tor Aamodt
Most deep neural networks deployed today are trained using GPUs via high-level frameworks such as TensorFlow and PyTorch.
Distributed, Parallel, and Cluster Computing
9 code implementations • 16 Oct 2018 • Mahmoud Khairy, Jain Akshay, Tor Aamodt, Timothy G. Rogers
Our enhanced GPU model is able to describe the NVIDIA Volta architecture in sufficient detail to reduce error in memory system event counters by as much as 66X.
Hardware Architecture