no code implementations • 8 Nov 2019 • Zihan Jiang, Jiansong Li, Jiangfeng Zhan
To reveal this pitfall, we evaluates several frequently-used optimizations on a typical AI accelerator and quantifies their impact on accuracy and throughout under representative DL inference workloads.