no code implementations • 12 Dec 2023 • Peiwen Sun, Yifan Zhang, Zishan Liu, Donghao Chen, Honggang Zhang
The vanilla fusion methods still dominate a large percentage of mainstream audio-visual tasks.