arxiv
PublishedJune 15, 2026 at 4:00 AM
When Sample Selection Bias Precipitates Model Collapse
Publisher summary· verbatim
arXiv:2606.13732v1 Announce Type: new Abstract: The proliferation of recursive training on synthetic data can alleviate data scarcity but risks model collapse, where repeated training erodes distributional tails and homogenizes outputs. Data selection is widely viewed as a remedy, yet its reliabilit
Stay posted· Newsletter
A 5-min weekly brief — top movers, price watch, story of the week.
Discussion
No replies yet. Be first.
Related coverage
More from ARXIV
arxivGAGPO: Generalized Advantage Grouped Policy Optimization1harxivUP-NRPA: User Portrait based Nested Rollout Policy Adaptation for Planning with Large Language Models in Goal-oriented Dialogue Systems1harxivOrchestra-o1: Omnimodal Agent Orchestration1harxivHybrid Open-Ended Tri-Evolution Makes Better Deep Researcher1hThe Bubble Brief
WEEKLYRead AI insights every Tuesday — top movers, new releases, story of the week.
Originally published on arxiv ↗