no code implementations • 8 Apr 2023 • Shinkook Choi, Junkyeong Choi
As deep learning advances, edge devices and lightweight neural networks are becoming more important.
no code implementations • 11 Feb 2022 • Junkyeong Choi, Hyucksung Kwon, Woongkyu Lee, Jungwook Choi, Jieun Lim
In this method, we devise a search space that explores the thread tile and warp sizes to increase the data reuse despite a large matrix operand of reduced-precision MMA.