arxivApril 8, 2026 at 4:00 AM1 min read

Is CLIP Cross-Eyed? Revealing and Mitigating Center Bias in the CLIP Family

arXiv:2604.05971v1 Announce Type: cross Abstract: Recent research has shown that contrastive vision-language models such as CLIP often lack fine-grained understanding of visual content. While a growing body of work has sought to address this limitation, we identify a distinct failure mode in the CLI

Read original article ↗

No replies yet. Be first.

arxiv6h ago

Advantage-Guided Diffusion for Model-Based Reinforcement Learning

arxiv6h ago

FluidFlow: a flow-matching generative model for fluid dynamics surrogates on unstructured meshes

arxiv6h ago

Is CLIP Cross-Eyed? Revealing and Mitigating Center Bias in the CLIP Family

Related Articles

Advantage-Guided Diffusion for Model-Based Reinforcement Learning

FluidFlow: a flow-matching generative model for fluid dynamics surrogates on unstructured meshes

HiL-Bench (Human-in-Loop Benchmark): Do Agents Know When to Ask for Help?