no code implementations • ICLR 2020 • George Stamatescu, Federica Gerace, Carlo Lucibello, Ian Fuss, Langford B. White
Moreover, we predict theoretically and confirm numerically, that common weight initialisation schemes used in standard continuous networks, when applied to the mean values of the stochastic binary weights, yield poor training performance.