no code implementations • CVPR 2022 • Salvador Medina, Denis Tome, Carsten Stoll, Mark Tiede, Kevin Munhall, Alexander G. Hauptmann, Iain Matthews
In this work, we introduce a large-scale speech and mocap dataset that focuses on capturing tongue, jaw, and lip motion.
no code implementations • 22 Oct 2018 • Salvador Medina, Zhuyun Dai, Yingkai Gao
In this paper, we propose a family of voting-based methods that aggregate frame-wise geolocation results to improve video-level geolocation.
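
To make the idea of voting over frame-wise results concrete, here is a minimal sketch in Python. It is not the authors' method: the listing does not specify which voting schemes the paper uses, so the two functions below (`majority_vote` and `weighted_vote`), the use of discrete location labels, and the per-frame confidence scores are all illustrative assumptions.

```python
from collections import Counter, defaultdict


def majority_vote(frame_predictions):
    """Return the location label predicted by the most frames.

    Hypothetical helper: assumes each frame yields one discrete label.
    """
    if not frame_predictions:
        raise ValueError("need at least one frame prediction")
    return Counter(frame_predictions).most_common(1)[0][0]


def weighted_vote(frame_predictions, confidences):
    """Return the label with the largest summed per-frame confidence.

    Hypothetical helper: assumes a confidence score is available per frame.
    """
    if not frame_predictions:
        raise ValueError("need at least one frame prediction")
    if len(frame_predictions) != len(confidences):
        raise ValueError("one confidence score is required per frame")
    scores = defaultdict(float)
    for label, confidence in zip(frame_predictions, confidences):
        scores[label] += confidence
    return max(scores, key=scores.get)


# Illustrative frame-wise outputs for a single video (made-up values).
frames = ["paris", "rome", "paris"]
print(majority_vote(frames))                    # paris
print(weighted_vote(frames, [0.2, 0.9, 0.3]))   # rome
```

The two calls disagree on purpose: a plain majority favors the label seen most often, while a confidence-weighted vote can let a single high-confidence frame outweigh several low-confidence ones.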