no code implementations • 1 Jun 2023 • Christian H. X. Ali Mehmeti-Göpel, Michael Wand
Recent studies have shown that high disparities in effective learning rates (ELRs) across layers in deep neural networks can negatively affect trainability.
no code implementations • 30 Nov 2022 • Christian H. X. Ali Mehmeti-Göpel, Jan Disselhoff
We perform an empirical study of the behaviour of deep networks when fully linearizing some of their feature channels through a sparsity prior on the overall number of nonlinear units in the network.