arxiv
PublishedMay 26, 2026 at 4:00 AM
—neutral
Spiking the training data to correct for test set contamination
Publisher summary· verbatim
arXiv:2605.24818v1 Announce Type: cross Abstract: The literature on test set contamination largely focuses on detection, but the correction of contaminated test scores is underexplored. Our core proposal is to spike the training data by intentionally contaminating some test examples at known rates.
Stay posted· Newsletter
A 5-min weekly brief — top movers, price watch, story of the week.
Discussion
No replies yet. Be first.
The Bubble Brief
WEEKLYRead AI insights every Tuesday — top movers, new releases, story of the week.
Originally published on arxiv ↗