1 code implementation • 12 Jul 2020 • Marko Smilevski
The neural networks we propose use two different fusion modules, the Recurrent Neural Network + Convolutional Neural Network fusion module and the Stacked Attention Network fusion module, that jointly combine the visual and the textual data of the records.
1 code implementation • 15 May 2018 • Marko Smilevski, Ilija Lalkovski, Gjorgji Madjarov
As a novelty, our encoder model is composed of two separate encoders, one that models the behaviour of the image sequence and other that models the sentence-story generated for the previous image in the sequence of images.