arXiv, Mar 31
Person-Centric Annotations of LAION-400M: Auditing Bias and Its Transfer to Models
arXiv:2510.03721v2 Announce Type: replace-cross
Abstract: Vision-language models trained on large-scale multimodal datasets show strong demographic biases, but the role of training data in producing these biases remains unclear. A major barrier has been the lack of demographic annotations in web-sca