·
DataBubble
  • Home
  • Models
  • News
  • Compare
  • Boards
  • Pricing
  • About
  • Newsletter
  • Methodology
  • Contact
Latest
From Idea to Prototype in an Afternoon: Scaffolded, AI-Assisted Rapid VA Prototyping7h◆3D HAMSTER: Bridging Planning and Control in Hierarchical Vision Language Action Models through 3D Trajectory Guidance7h◆Improving LLM Reasoning with Homophily-aware Structural and Semantic Text-Attributed Graph Compression7h◆Paper2Rebuttal: A Multi-Agent Framework for Transparent Author Response Assistance7h◆When to Re-Plan: Subgoal Persistence in Hierarchical Latent Reasoning7h◆DSH-Bench: A Difficulty- and Scenario-Aware Benchmark with Hierarchical Subject Taxonomy for Subject-Driven Text-to-Image Generation7h◆PSCT-Net: Geometry-Aware Pediatric Skull CT Reconstruction via Differentiable Back-Projection and Attention-Guided Refinement7h◆Topological Neural Dynamics: A Neuron-wise Framework for Sequence Modeling7h◆Representing Research Attention as Contextually Structured Flows7h◆BLUEX v2: Benchmarking LLMs on Open-Ended Questions from Brazilian University Entrance Exams7h◆Fund2Persona: A Framework for Building and Refining Financial Advisor Personas from Fund Disclosure Data7h◆Multistage Defer Trees for Hybrid Interpretability: If at First You Can't Succeed, Tree Again7h◆Constrained Online Convex Optimization without Slater's Condition7h◆Beyond the Expressivity-Trainability Paradox: A Dynamical Lie Algebra Perspective on Navigating Barren Plateaus in Quantum Machine Learning7h◆Introduction to Stochastic Differential Equations for Generative Machine Learning: A Variational Perspective7h◆Robustness of neural networks to random noise perturbations of their inputs7h◆Diffusion-warm sampling of the XY model enables fast thermalization at scale7h◆Evaluation of Population Initialization Methods for Genetic Programming-based Symbolic Regression7h◆Tailored minimal reservoir computing: on the bidirectional connection between nonlinearities in the reservoir and in data7h◆Private Rate-Constrained Optimization with Applications to Fair Learning7h◆From Idea to Prototype in an Afternoon: Scaffolded, AI-Assisted Rapid VA Prototyping7h◆3D HAMSTER: Bridging Planning and Control in Hierarchical Vision Language Action Models through 3D Trajectory Guidance7h◆Improving LLM Reasoning with Homophily-aware Structural and Semantic Text-Attributed Graph Compression7h◆Paper2Rebuttal: A Multi-Agent Framework for Transparent Author Response Assistance7h◆When to Re-Plan: Subgoal Persistence in Hierarchical Latent Reasoning7h◆DSH-Bench: A Difficulty- and Scenario-Aware Benchmark with Hierarchical Subject Taxonomy for Subject-Driven Text-to-Image Generation7h◆PSCT-Net: Geometry-Aware Pediatric Skull CT Reconstruction via Differentiable Back-Projection and Attention-Guided Refinement7h◆Topological Neural Dynamics: A Neuron-wise Framework for Sequence Modeling7h◆Representing Research Attention as Contextually Structured Flows7h◆BLUEX v2: Benchmarking LLMs on Open-Ended Questions from Brazilian University Entrance Exams7h◆Fund2Persona: A Framework for Building and Refining Financial Advisor Personas from Fund Disclosure Data7h◆Multistage Defer Trees for Hybrid Interpretability: If at First You Can't Succeed, Tree Again7h◆Constrained Online Convex Optimization without Slater's Condition7h◆Beyond the Expressivity-Trainability Paradox: A Dynamical Lie Algebra Perspective on Navigating Barren Plateaus in Quantum Machine Learning7h◆Introduction to Stochastic Differential Equations for Generative Machine Learning: A Variational Perspective7h◆Robustness of neural networks to random noise perturbations of their inputs7h◆Diffusion-warm sampling of the XY model enables fast thermalization at scale7h◆Evaluation of Population Initialization Methods for Genetic Programming-based Symbolic Regression7h◆Tailored minimal reservoir computing: on the bidirectional connection between nonlinearities in the reservoir and in data7h◆Private Rate-Constrained Optimization with Applications to Fair Learning7h◆
News/Multipole Semantic Attention: A Fast Approximation of Softmax Attention for Pretraining
arxiv
PublishedJuly 1, 2026 at 4:00 AM

Multipole Semantic Attention: A Fast Approximation of Softmax Attention for Pretraining

Source
arxiv.orgfull article ↗
Read on arxiv→
Publisher summary· verbatim

arXiv:2509.10406v4 Announce Type: replace Abstract: Pretraining transformers on long sequences (entire code repositories, collections of related documents) is bottlenecked by quadratic attention costs. We present Multipole Semantic Attention (MuSe), which accelerates 64k-context pretraining by 36% w

Stay posted· Newsletter

A 5-min weekly brief — top movers, price watch, story of the week.

// no spam · unsubscribe one-click · free forever

Discussion
Source
↗
arxiv
Read original ↗All from arxiv →

No replies yet. Be first.

Source
↗
arxiv
Read original ↗All from arxiv →

Related coverage

More from ARXIV
arxivFrom Idea to Prototype in an Afternoon: Scaffolded, AI-Assisted Rapid VA Prototyping7harxiv3D HAMSTER: Bridging Planning and Control in Hierarchical Vision Language Action Models through 3D Trajectory Guidance7harxivImproving LLM Reasoning with Homophily-aware Structural and Semantic Text-Attributed Graph Compression7harxivPaper2Rebuttal: A Multi-Agent Framework for Transparent Author Response Assistance7h
The Bubble Brief
WEEKLY

Read AI insights every Tuesday — top movers, new releases, story of the week.

// no spam · unsubscribe one-click · free forever

Originally published on arxiv ↗
HomeModelsNews