1 code implementation • 6 Sep 2022 • Han Wang, Jun Tang, Xiaodong Liu, Shanyan Guan, Rong Xie, Li Song
Temporal information is introduced by the temporal feature aggregation model (TFAM), which applies an attention mechanism between the context frames and the target frame (i.e., the frame to be detected).
Ranked #5 on Video Object Detection on ImageNet VID
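As a rough illustration of attention between context frames and a target frame, the sketch below weights each context feature by its scaled dot-product similarity to the target feature and returns the weighted sum. It is not the paper's TFAM: the function name `aggregate_target_feature`, the flat per-frame feature vectors, and the choice of scaled dot-product scoring are all assumptions made for the example.

```python
import math

def aggregate_target_feature(target, contexts):
    """Hypothetical helper: attention-weighted sum of context-frame features.

    target   -- feature vector of the frame to be detected (list of floats)
    contexts -- feature vectors of the context frames (list of lists)
    """
    dim = len(target)
    # Scaled dot-product similarity between the target and each context frame
    # (an assumed scoring function, not taken from the paper).
    scores = [
        sum(t * c for t, c in zip(target, ctx)) / math.sqrt(dim)
        for ctx in contexts
    ]
    # Softmax over the context frames; subtracting the max keeps exp() stable.
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]
    # Weighted sum of the context features, one output value per dimension.
    aggregated = [
        sum(w * ctx[i] for w, ctx in zip(weights, contexts))
        for i in range(dim)
    ]
    return aggregated, weights

# Example: the first context frame resembles the target, so it gets more weight.
aggregated, weights = aggregate_target_feature(
    [1.0, 0.0], [[1.0, 0.0], [0.0, 1.0]]
)
```

Context frames that resemble the target contribute more to the aggregated feature, which is the general idea behind attention-based temporal aggregation.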
no code implementations • 3 Mar 2022 • Shanyan Guan, Huayu Deng, Yunbo Wang, Xiaokang Yang
Deep learning has shown great potential for modeling the physical dynamics of complex particle systems such as fluids.
1 code implementation • 7 Nov 2021 • Shanyan Guan, Jingwei Xu, Michelle Z. He, Yunbo Wang, Bingbing Ni, Xiaokang Yang
We consider a new problem of adapting a human mesh reconstruction model to out-of-domain streaming videos, where the performance of existing SMPL-based models is significantly affected by the distribution shift represented by different camera parameters, bone lengths, backgrounds, and occlusions.
Ranked #1 on 3D Absolute Human Pose Estimation on Surreal
1 code implementation • CVPR 2021 • Shanyan Guan, Jingwei Xu, Yunbo Wang, Bingbing Ni, Xiaokang Yang
This paper considers a new problem of adapting a pre-trained model of human mesh reconstruction to out-of-domain streaming videos.
Ranked #39 on 3D Human Pose Estimation on 3DPW
no code implementations • 3 Jul 2020 • Shanyan Guan, Ying Tai, Bingbing Ni, Feida Zhu, Feiyue Huang, Xiaokang Yang
The latent code of the recent popular model StyleGAN has learned disentangled representations thanks to the multi-layer style-based generator.