no code implementations • 21 Oct 2019 • Jianri Li, Jae-whan Lee, Woo-sang Song, Ki-young Shin, Byung-Hyun Go
Then we compose candidate image vector and text representation into a single vector which is exptected to be biased toward target image vector.