1 code implementation • 2 Dec 2022 • Fangxun Shu, Biaolong Chen, Yue Liao, Shuwen Xiao, Wenyu Sun, Xiaobo Li, Yousong Zhu, Jinqiao Wang, Si Liu
Our MAC aims to reduce video representation's spatial and temporal redundancy in the VidLP model by a mask sampling mechanism to improve pre-training efficiency.
Ranked #37 on Video Retrieval on MSR-VTT-1kA (using extra training data)
2 code implementations • 31 Jan 2020 • Shuwen Xiao, Zhou Zhao, Zijian Zhang, Xiaohui Yan, Min Yang
This paper addresses the task of query-focused video summarization, which takes user's query and a long video as inputs and aims to generate a query-focused video summary.