no code implementations • 26 Nov 2023 • Jixuan Leng, Yijiang Li, Haohan Wang
SCMD leverages the capabilities of large vision-language models, specifically CLIP, to train a more efficient model, ensuring it acquires robust generalization capabilities across unseen domains.