arxiv
PublishedJune 26, 2026 at 4:00 AM
Erase-then-Delta Attention: Decoupling Erase and Write Addresses in Delta-Rule Linear Attention
Publisher summary· verbatim
arXiv:2606.26560v1 Announce Type: new Abstract: Delta-rule linear attention improves recurrent memory updates by correcting what is already stored at the current write address before writing new content. However, the active correction is still anchored to that same write address. As a result, stale
Stay posted· Newsletter
A 5-min weekly brief — top movers, price watch, story of the week.
Discussion
No replies yet. Be first.
Related coverage
More from ARXIV
arxivMMGist: A Comprehensive Multimodal Benchmark for 20272harxivVisualizing "We the People": Bridging the Perception Gap through Pluralistic Data Storytelling2harxivSmall edits, large models: How Wikipedia advocacy shapes LLM values2harxivNoise-Aware Boundary-Enhanced Generative Learning for Ultrasound Speckle Reduction2hThe Bubble Brief
WEEKLYRead AI insights every Tuesday — top movers, new releases, story of the week.
Originally published on arxiv ↗