29 Nov 2018 • Samuel Matzek, Max Grossman, Minsik Cho, Anar Yusifov, Bryant Nelson, Amit Juneja
GPUs have limited memory, which makes it difficult to train wide and/or deep models: their memory requirements can cause the training process to run out of memory.