arxiv6d agobullish

Incomplete Observations Boost Evolutionary Performance in Ocean Modeling

arXiv:2607.19147v1 Announce Type: cross Abstract: Data-driven methods have revolutionized ocean modeling, yet current approaches rely heavily on complete reanalysis datasets, imposing computational constraints and limiting model performance to that of the training data. Here, we present a generative

HI1 model #machine-learning #ocean-modeling #earth-system Read on arxiv →

arxiv6d agobullish

AI Tool Discovery at Scale: All You Need is DNS

arXiv:2607.18242v1 Announce Type: new Abstract: The coming era of autonomous AI agents demands a discovery mechanism capable of navigating millions of tools, yet existing solutions buckle under O(N) complexity and centralized governance. Instead of building another fragile overlay, we propose ToolDN

#autonomous-agents #ai-interoperability #dns Read on arxiv →

mit-tech-reviewJul 7bullish

The foundational elements of AI architecture that IT leaders need to scale

With the rapid progress of AI capabilities and the move to agentic systems, organizations are expanding their use cases as the technology continues to grow. That constant evolution also introduces risk, leaving IT leaders to wonder which investments will prove valuable even six months into the futur

#data-quality #ai-architecture #governance Read on mit-tech-review →

arxivJul 3

Scaling Laws for Grid-Based Approximate Nearest Neighbor Search in High Dimensions

arXiv:2607.01283v1 Announce Type: cross Abstract: Grid-based approaches to approximate nearest neighbor (ANN) search have been absent from modern scaling analyses. We present a systematic characterization of a multiprobe grid algorithm with respect to dataset size $N$ and dimensionality $d$. Our exp

#ann #scalability #machine-learning Read on arxiv →

arxivJun 12bullish

MiniMax Sparse Attention

arXiv:2606.13392v1 Announce Type: new Abstract: Ultra-long-context capability is becoming indispensable for frontier LLMs: agentic workflows, repository-scale code reasoning, and persistent memory all require the model to jointly attend over hundreds of thousands to millions of tokens, yet the quadr

#attention-mechanisms #llms #optimization Read on arxiv →

arxivMay 22bullish

Billion-Scale Graph Foundation Models

arXiv:2602.04768v2 Announce Type: replace Abstract: Graph-structured data underpins many critical applications. While foundation models have transformed language and vision via large-scale pretraining and lightweight adaptation, extending this paradigm to general, real-world graphs is challenging. I

GR1 model #graph-learning #foundation-models #pretraining Read on arxiv →

arxivMay 21bullish

Fast and Featureless Node Representation Learning with Partial Pairwise Supervision

arXiv:2605.19916v1 Announce Type: cross Abstract: We introduce Contrastive FUSE, a fast and unified framework for scalable node representation learning in graphs with partially available pairwise node labels and no available node features. Unlike existing methods, we directly optimize a spectral con

CO1 model #machine-learning #graph-learning #optimization Read on arxiv →

arxivMay 16bullish

Krause Synchronization Transformers

arXiv:2602.11534v3 Announce Type: replace-cross Abstract: Self-attention in Transformers relies on globally normalized softmax weights, causing all tokens to compete for influence at every layer. When composed across depth, this interaction pattern induces strong synchronization dynamics that favor

MEQWVI3 models #transformers #attention #efficiency Read on arxiv →

arxivMay 11

When Stored Evidence Stops Being Usable: Scale-Conditioned Evaluation of Agent Memory

arXiv:2605.07313v1 Announce Type: new Abstract: Memory-agent evaluations report fixed-snapshot accuracy or retrieval quality, but these scores do not show whether evidence remains usable as irrelevant sessions (sessions not annotated as task-relevant evidence for the query) accumulate. We present a

HILIQW5 models · +2 #evaluation #memory #agents Read on arxiv →

arxivMay 1bullish

ScaleBox: Enabling High-Fidelity and Scalable Code Verification for Large Language Models

arXiv:2604.27467v1 Announce Type: cross Abstract: Code sandboxes have emerged as a critical infrastructure for advancing the coding capabilities of large language models, providing verifiable feedback for both RL training and evaluation. However, existing systems fail to provide accurate verificatio

#research #large-language-models #code-training Read on arxiv →

arxivApr 30bullish

Efficient Traffic Forecasting on Large-Scale Road Network by Regularized Adaptive Graph Convolution

arXiv:2506.07179v2 Announce Type: replace-cross Abstract: Traffic prediction is a critical task in spatial-temporal forecasting with broad applications in travel planning and urban management. To model the complex spatial-temporal dependencies in traffic data, Spatial-Temporal Graph Convolutional Ne

SPRE2 models #machine-learning #traffic-prediction #graph-convolution Read on arxiv →

arxivApr 27

A Nationwide Japanese Medical Claims Foundation Model: Balancing Model Scaling and Task-Specific Computational Efficiency

arXiv:2604.22348v1 Announce Type: new Abstract: Clinical risk prediction using longitudinal medical data supports individualized care. Self-supervised foundation models have emerged as a promising approach for leveraging large-scale unlabeled healthcare records. In natural language processing, scali

TR1 model #healthcare #foundation-models #medical-research Read on arxiv →

arxivApr 16bullish

METRO: Towards Strategy Induction from Expert Dialogue Transcripts for Non-collaborative Dialogues

arXiv:2604.11427v2 Announce Type: replace-cross Abstract: Developing non-collaborative dialogue agents traditionally requires the manual, unscalable codification of expert strategies. We propose \ours, a method that leverages large language models to autonomously induce both strategy actions and pla

ME1 model #dialogue-agents #language-models #scalability Read on arxiv →

arxivApr 16bullish

Neural Two-Stage Stochastic Optimization for Solving Unit Commitment Problem

arXiv:2507.09503v4 Announce Type: replace-cross Abstract: This paper proposes a neural stochastic optimization method for efficiently solving the two-stage stochastic unit commitment (2S-SUC) problem under high-dimensional uncertainty scenarios. The proposed method approximates the second-stage reco

NE1 model #optimization #machine-learning #scalability Read on arxiv →

arxivApr 7bullish

Neuromorphic Computing for Low-Power Artificial Intelligence

arXiv:2604.04727v1 Announce Type: cross Abstract: Classical computing is beginning to encounter fundamental limits of energy efficiency. This presents a challenge that can no longer be solved by strategies such as increasing circuit density or refining standard semiconductor processes. The growing c

#neuromorphic #hardware #energy-efficiency Read on arxiv →

arxivApr 3bullish

annbatch unlocks terabyte-scale training of biological data in anndata

arXiv:2604.01949v1 Announce Type: new Abstract: The scale of biological datasets now routinely exceeds system memory, making data access rather than model computation the primary bottleneck in training machine-learning models. This bottleneck is particularly acute in biology, where widely used commu

#machine-learning #genomics #scalability Read on arxiv →