arxiv
PublishedJune 12, 2026 at 4:00 AM
Detecting Functional Memorization in Code Language Models
Publisher summary· verbatim
arXiv:2606.12764v1 Announce Type: cross Abstract: Large language models (LLMs) are increasingly used to generate code at scale. Meanwhile, prior work has investigated whether training data may be recoverable from model outputs, by auditing the textual overlap between training examples and model gene
Stay posted· Newsletter
A 5-min weekly brief — top movers, price watch, story of the week.
Discussion
No replies yet. Be first.
Related coverage
More from ARXIV
arxivLoHoSearch: Benchmarking Long-Horizon Search Agents Beyond the Human Difficulty Ceiling1harxivMulti-Turn Reasoning When Context Arrives in Pieces: Scalable Sharding and Memory-Augmented RL1harxivSkillChain: Closing the Loop on Skill Evolution for Image-Based E-Commerce AI Assistants1harxivNo Hidden Prompts Needed! You Can Game AI Peer Review with Presentation-Only Revisions1hThe Bubble Brief
WEEKLYRead AI insights every Tuesday — top movers, new releases, story of the week.
Originally published on arxiv ↗