[CVPR 2026] - IsoCLIP: Decomposing CLIP Projectors for Efficient Intra-modal Alignment
image-classification clip image-retrieval text-retrieval vision-language contrastive-learning intra-modal-misalignment perception-encoder cvpr2026
-
Updated
Mar 23, 2026 - Python