arxiv
PublishedApril 22, 2026 at 4:00 AM
FG$^2$-GDN: Enhancing Long-Context Gated Delta Networks with Doubly Fine-Grained Control
Publisher summary· verbatim
arXiv:2604.19021v1 Announce Type: new Abstract: Linear attention mechanisms have emerged as promising alternatives to softmax attention, offering linear-time complexity during inference. Recent advances such as Gated DeltaNet (GDN) and Kimi Delta Attention (KDA) have demonstrated that the delta rule
Discussion
No replies yet. Be first.
Related coverage
More from ARXIV
arxivConsequentialist Objectives and Catastrophe8harxivEgoMAGIC- An Egocentric Video Field Medicine Dataset for Training Perception Algorithms8harxivReCast: Recasting Learning Signals for Reinforcement Learning in Generative Recommendation8harxivA Probabilistic Framework for Hierarchical Goal Recognition8hOriginally published on arxiv ↗