arXiv | April 8, 2026 at 4:00 AM | 1 min read
Is CLIP Cross-Eyed? Revealing and Mitigating Center Bias in the CLIP Family
arXiv:2604.05971v1 Announce Type: cross Abstract: Recent research has shown that contrastive vision-language models such as CLIP often lack fine-grained understanding of visual content. While a growing body of work has sought to address this limitation, we identify a distinct failure mode in the CLI