no code implementations • 14 Nov 2023 • Yating Xu, Conghui Hu, Gim Hee Lee
Existing works on weakly-supervised audio-visual video parsing adopt hybrid attention network (HAN) as the multi-modal embedding to capture the cross-modal context.
no code implementations • 13 Nov 2023 • Ruolin Yang, Da Li, Conghui Hu, Timothy Hospedales, Honggang Zhang, Yi-Zhe Song
Reference-based video object segmentation is an emerging topic which aims to segment the corresponding target object in each video frame referred by a given reference, such as a language expression or a photo mask.
1 code implementation • ICCV 2023 • Yating Xu, Conghui Hu, Na Zhao, Gim Hee Lee
Existing fully-supervised point cloud segmentation methods suffer in the dynamic testing environment with emerging new classes.
1 code implementation • ICCV 2023 • Conghui Hu, Can Zhang, Gim Hee Lee
This limitation motivates us to present the first attempt at domain-generalized unsupervised cross-domain image retrieval (DG-UCDIR) aiming at facilitating image retrieval between any two unseen domains in an unsupervised way.
no code implementations • 9 Dec 2022 • Yating Xu, Conghui Hu, Gim Hee Lee
The existing state-of-the-art method for audio-visual conditioned video prediction uses the latent codes of the audio-visual frames from a multimodal stochastic network and a frame encoder to predict the next visual frame.
1 code implementation • 20 Jul 2022 • Conghui Hu, Gim Hee Lee
Current supervised cross-domain image retrieval methods can achieve excellent performance.
no code implementations • 18 May 2021 • Conghui Hu, Yongxin Yang, Yunpeng Li, Timothy M. Hospedales, Yi-Zhe Song
The practical value of existing supervised sketch-based image retrieval (SBIR) algorithms is largely limited by the requirement for intensive data collection and labeling.
1 code implementation • 11 Nov 2020 • Hao Wen, Xiongjie Chen, Georgios Papagiannis, Conghui Hu, Yunpeng Li
Recent advances in incorporating neural networks into particle filters provide the desired flexibility to apply particle filters in large-scale real-world applications.
no code implementations • CVPR 2018 • Conghui Hu, Da Li, Yi-Zhe Song, Tao Xiang, Timothy M. Hospedales
Contemporary deep learning techniques have made image recognition a reasonably reliable technology.