no code implementations • 18 May 2022 • Anton Trusov, Elena Limonova, Dmitry Nikolaev, Vladimir V. Arlazarov
In this paper, we propose novel fast algorithms of ternary, ternary-binary, and binary matrix multiplication for mobile devices with ARM architecture.
1 code implementation • 15 Sep 2020 • Elena Limonova, Daniil Alfonso, Dmitry Nikolaev, Vladimir V. Arlazarov
In the paper, we introduce a bipolar morphological ResNet (BM-ResNet) model obtained from a much more complex ResNet architecture by converting its layers to bipolar morphological ones.
no code implementations • 14 Sep 2020 • Anton Trusov, Elena Limonova, Dmitry Slugin, Dmitry Nikolaev, Vladimir V. Arlazarov
We introduce an efficient implementation of 4-bit matrix multiplication for quantized neural networks and perform time measurements on a mobile ARM processor.
no code implementations • 19 Feb 2020 • Elena Limonova, Arseny Terekhin, Dmitry Nikolaev, Vladimir Arlazarov
Experiments showed 3 times efficiency increase for final implementation of erosion and dilation compared to van Herk/Gil-Werman algorithm without SIMD, 5. 7 times speedup for 8x8 matrix transpose and 12 times speedup for 16x16 matrix transpose compared to transpose without SIMD.
no code implementations • 18 Feb 2020 • Elena Limonova, Alexander Sheshkus, Dmitry Nikolaev
This paper considers a convolutional neural network transformation that reduces computation complexity and thus speedups neural network processing.
no code implementations • 3 Dec 2019 • Anton Trusov, Elena Limonova
In this work we apply commonly known methods of non-adaptive interpolation (nearest pixel, bilinear, B-spline, bicubic, Hermite spline) and sampling (point sampling, supersampling, mip-map pre-filtering, rip-map pre-filtering and FAST) to the problem of projective image transformation.
no code implementations • 5 Nov 2019 • Elena Limonova, Daniil Matveev, Dmitry Nikolaev, Vladimir V. Arlazarov
To demonstrate efficiency of the proposed model we consider classical convolutional neural networks and convert the pre-trained convolutional layers to the bipolar morphological layers.