no code implementations • 31 Mar 2022 • Liyu Wu, Can Zhang, Yuexian Zou
Inspired by the recent attention mechanism, we propose a multi-grain contextual focus module, termed MCF, to capture the action associated relation information from the body joints and parts.
no code implementations • 16 Sep 2021 • Zhenzhi Wang, Liyu Wu, Zhimin Li, Jiangfeng Xiong, Qinglin Lu
Our challenge includes two tasks: video structuring in the temporal dimension and multi-modal video classification.
no code implementations • 30 Jun 2021 • Liyu Wu, Yuexian Zou, Can Zhang
Efficient long-short temporal modeling is key for enhancing the performance of action recognition task.