no code implementations • 8 Mar 2024 • Yiding Liu, Jingjing Wang, Jiamin Luo, Tao Zeng, Guodong Zhou
Specifically, this TSA treats the ACR task as an auxiliary task to boost the performance of the primary ASU task, and further integrates trusted learning into reflexion mechanisms to alleviate the LLM-intrinsic factual hallucination problem in TSA.
no code implementations • 4 Mar 2024 • Jiamin Luo, Jingjing Wang, Guodong Zhou
Multimodal Conversational Emotion (MCE) detection, generally spanning the acoustic, vision, and language modalities, has attracted increasing interest in the multimedia community.
no code implementations • 29 Feb 2024 • Jiamin Luo, Jianing Zhao, Jingjing Wang, Guodong Zhou
Weakly-supervised Phrase Grounding (WPG) is an emerging task of inferring fine-grained phrase-region matching while leveraging only coarse-grained sentence-image pairs for training.