·
DataBubble
  • Home
  • Models
  • News
  • Compare
  • Boards
  • Pricing
  • About
  • Newsletter
  • Methodology
  • Contact
Latest
SpaceX officially prices shares at $135 in the largest IPO ever5h◆Our new community investments in Virginia support local jobs and expand energy affordability.5h◆SpaceX SPV investors won’t know their true holdings until post-IPO lock-ups lift5h◆Amazon’s data centers used 2.5 billion gallons of water last year8h◆Deezer’s new tool can identify AI music from Spotify, Apple Music, and others9h◆Pool’s new app turns your screenshots into something useful10h◆DoorDash’s new AI chatbot lets you order with prompts and photos11h◆Anthropic apologizes for invisible Claude Fable guardrails14h◆Google DeepMind is worried about what happens when millions of agents start to interact14h◆Deezer launches an AI music detector for other streaming services17h◆Opendoor’s India exit is fueling a bigger conversation about AI and outsourcing21h◆MODF-SIR: A Multi-agent Omni-modal Distilled Framework for Social Intelligence Reasoning21h◆Position: Stop Anthropomorphizing Intermediate Tokens as Reasoning/Thinking Traces!21h◆ARGUS: Stacked Multi-View Identity Mosaic Injection for Subject-Preserving Video Generation21h◆Generalizing Beyond Suboptimality: Offline Reinforcement Learning Learns Effective Scheduling through Random Solutions21h◆The Impossibility of Eliciting Latent Knowledge21h◆Mapping Scientific Literature with Large Language Models and Topic Modeling21h◆Grounding Computer Use Agents on Human Demonstrations21h◆Embodied-R1.5: Evolving Physical Intelligence via Embodied Foundation Models21h◆LSTM based IoT Device Identification21h◆SpaceX officially prices shares at $135 in the largest IPO ever5h◆Our new community investments in Virginia support local jobs and expand energy affordability.5h◆SpaceX SPV investors won’t know their true holdings until post-IPO lock-ups lift5h◆Amazon’s data centers used 2.5 billion gallons of water last year8h◆Deezer’s new tool can identify AI music from Spotify, Apple Music, and others9h◆Pool’s new app turns your screenshots into something useful10h◆DoorDash’s new AI chatbot lets you order with prompts and photos11h◆Anthropic apologizes for invisible Claude Fable guardrails14h◆Google DeepMind is worried about what happens when millions of agents start to interact14h◆Deezer launches an AI music detector for other streaming services17h◆Opendoor’s India exit is fueling a bigger conversation about AI and outsourcing21h◆MODF-SIR: A Multi-agent Omni-modal Distilled Framework for Social Intelligence Reasoning21h◆Position: Stop Anthropomorphizing Intermediate Tokens as Reasoning/Thinking Traces!21h◆ARGUS: Stacked Multi-View Identity Mosaic Injection for Subject-Preserving Video Generation21h◆Generalizing Beyond Suboptimality: Offline Reinforcement Learning Learns Effective Scheduling through Random Solutions21h◆The Impossibility of Eliciting Latent Knowledge21h◆Mapping Scientific Literature with Large Language Models and Topic Modeling21h◆Grounding Computer Use Agents on Human Demonstrations21h◆Embodied-R1.5: Evolving Physical Intelligence via Embodied Foundation Models21h◆LSTM based IoT Device Identification21h◆
Tag

#machine-learning

100 articles tagged #machine-learning

arxiv21h ago

Minimal surfaces, Knots, and Neural Networks

arXiv:2605.26234v2 Announce Type: replace-cross Abstract: A recent conjecture by Joel Fine posits a relationship between the coefficients of the HOMFLY polynomial of a knot $K$ in the 3-sphere $S^3$, and the signed count of minimal surfaces in hyperbolic 4-space $\mathrm{H}^4$ meeting the sphere at

PH1 model#mathematics#machine-learning#geometryRead on arxiv →
arxiv1d ago

Bidirectional Random Projections

arXiv:2606.10377v1 Announce Type: cross Abstract: This paper analyzes bidirectional random projections for ordinary least squares (OLS) regression under the fixed design setting. Let $(X,Y) \in \mathbb{R}^{n \times p} \times \mathbb{R}^n$ be a sample and $R \in \mathbb{R}^{n_1 \times n}, W \in \math

#statistics#machine-learning#researchRead on arxiv →
arxiv5d agobullish

Reformulating Neural Operators in $d+1$ Dimensions for Embedding Evolution

arXiv:2505.11766v4 Announce Type: replace-cross Abstract: Neural Operators (NOs) are powerful architectures for learning mappings between function spaces. While most advances focus on refining kernel parameterizations over the $d$-dimensional physical domain, the evolution of lifted embeddings remai

#machine-learning#artificial-intelligence#quantum-physicsRead on arxiv →
arxiv6d agobullish

ADAPTOOD: Uncertainty-Aware Fine-Tuning for Out-of-Distribution ECG Time Series Models

arXiv:2606.04164v1 Announce Type: cross Abstract: Data samples used for training often differ from those encountered during fine-tuning and deployment, and while ML models show promise, their performance remains limited when only small annotated datasets are available. Performance often degrades und

#machine-learning#adaptation#robustnessRead on arxiv →
arxiv6d ago

Dead Directions: Geometric Singular Learning

arXiv:2606.05957v1 Announce Type: new Abstract: Singular learning theory and information geometry have studied the same parameter spaces in mostly separate vocabularies: the former computes Bayesian invariants in resolved coordinates, the latter works in original coordinates under a non-degeneracy a

#machine-learning#information-geometry#singular-learningRead on arxiv →
arxiv6d agobullish

Toward Scalable and Valid Conditional Independence Testing with Spectral Representations

arXiv:2512.19510v2 Announce Type: replace Abstract: Conditional independence (CI) is central to causal inference, feature selection, and graphical modeling, yet it is untestable in many settings without additional assumptions. Existing CI tests often rely on restrictive structural conditions, limiti

#machine-learning#representation-learning#causal-inferenceRead on arxiv →
arxivJun 4

Reconciling Causality and Non-Equilibrium Thermodynamics with Hamiltonian Causal Models

arXiv:2606.04822v1 Announce Type: new Abstract: Causal modeling of physical temporal phenomena must handle interventions that act along trajectories, nonstationary induced laws, path-dependent effects, and feedback mediated by dynamics, all challenging in standard causal models. We introduce Hamilto

HA1 model#causal-modeling#non-equilibrium-thermodynamics#machine-learningRead on arxiv →
arxivJun 3

Localized, High-resolution Geographic Representations with Slepian Functions

arXiv:2602.00392v2 Announce Type: replace Abstract: Geographic data is fundamentally local. Disease outbreaks cluster in population centers, ecological patterns emerge along coastlines, and economic activity concentrates within country borders. Machine learning models that encode geographic location

#machine-learning#geographic-data#localizationRead on arxiv →
arxivJun 3bullish

Your Autoregressive Model Already Reveals the Causal Graph

arXiv:2602.01135v3 Announce Type: replace Abstract: Autoregressive models trained via next-token prediction implicitly learn the conditional independence structure of their data-generating process. We exploit this observation to perform scalable causal discovery from a single observed sequence of di

TR1 model#causal-discovery#autoregressive-models#sequence-predictionRead on arxiv →
arxivJun 2

How Much Orthogonalization Does Muon Need?

arXiv:2606.00371v1 Announce Type: new Abstract: Muon optimizers improve neural-network training by replacing ill-conditioned momentum updates with approximately semi-orthogonal updates. This motivates a practical question: how much orthogonalization does Muon actually require? We study this question

NAGPMA4 models · +1#machine-learning#optimization#neural-networksRead on arxiv →
arxivJun 2

Adaptive Exploration for Latent-State Bandits

arXiv:2602.05139v3 Announce Type: replace Abstract: We study bandits whose rewards depend on an unobserved Markov state that evolves independently of the learner's actions. The optimal arm can change even though the learner observes only past actions and rewards. We propose algorithms that feed LinU

LI1 model#bandits#machine-learning#markov-stateRead on arxiv →
arxivJun 2

Edge-aware Decoding for Neural Asymmetric Routing

arXiv:2606.02136v1 Announce Type: new Abstract: Neural asymmetric routing models increasingly encode directionality through matrix representations and asymmetry-aware attention. The final routing action, however, is not a node in isolation but a directed transition chosen under the current partial r

RA1 model#machine-learning#routing#asymmetric-routingRead on arxiv →
arxivJun 2

OmniEEG-Bench: A Standardized Evaluation Benchmark for EEG Foundation Models

arXiv:2606.00815v1 Announce Type: new Abstract: Electroencephalography (EEG) supports a variety of brain-computer interface (BCI) tasks ranging from brain-state monitoring to human-LLM interactions. EEG foundation models are emerging, but evaluation remains fragmented due to heterogeneous datasets a

#benchmark#machine-learning#neuroscienceRead on arxiv →
arxivJun 2bullish

Margin Adaptive DPO: Leveraging Reward Model for Granular Control in Preference Optimization

arXiv:2510.05342v2 Announce Type: replace-cross Abstract: Direct Preference Optimization (DPO) has emerged as a simple and effective method for aligning large language models. However, its reliance on a fixed temperature parameter leads to suboptimal training on diverse preference data, causing over

DIIP$\4 models · +1#machine-learning#optimization#language-modelsRead on arxiv →
arxivJun 1bullish

Deep-learning-based low-energy trigger algorithms for the Hyper-Kamiokande experiment

arXiv:2605.31391v1 Announce Type: cross Abstract: Modern machine learning techniques have become increasingly important in particle physics because of their powerful pattern-recognition capabilities, including in real-time data acquisition where stringent runtime constraints apply. This paper detail

AUMACU3 models#particle-physics#machine-learning#real-time-processingRead on arxiv →
arxivJun 1bearish

Multi-Agent Teams Hold Experts Back

arXiv:2602.01011v4 Announce Type: replace-cross Abstract: Multi-agent LLM systems are increasingly deployed as autonomous collaborators, where agents interact freely rather than execute fixed, pre-specified workflows. In such settings, effective coordination cannot be fully designed in advance and m

#multiagent-systems#artificial-intelligence#machine-learningRead on arxiv →
arxivJun 1

The Fundamental Limits of Fraud Detection in Card Payment Networks

arXiv:2605.27557v2 Announce Type: replace Abstract: Card payment fraud detection is usually framed as a supervised classification problem. Although this approach has generated practical progress, improvement has remained incremental despite major advances in model architecture. We argue that this is

#machine-learning#fraud-detection#payment-ecosystemRead on arxiv →
arxivJun 1

Representation Collapse in Sequential Post-Training of Large Language Models

arXiv:2605.30524v1 Announce Type: new Abstract: Large language models are now adapted through chains of post-training stages rather than through a single instruction-tuning pass. This paper studies whether such sequential post-training gradually compresses internal representations into low-rank, ani

#machine-learning#post-training#representationRead on arxiv →
arxivJun 1bullish

Fixed Universal Transformers

arXiv:2605.31423v1 Announce Type: new Abstract: We introduce \emph{universal transformers}: fixed transformers that can simulate any transformer in a given class via a suitable input embedding. Analogous to a universal Turing machine, the input embedding encodes a description of the target model whi

TR1 model#machine-learning#transformers#universalityRead on arxiv →
arxivMay 29

OVA-IB: One vs All Information Bottleneck for Multi-Modal Alignment

arXiv:2605.29900v1 Announce Type: new Abstract: Contrastive learning is effective for aligning paired views or modalities, but alignment beyond two modalities remains non-trivial and comparatively underexplored. Pairwise CLIP-style losses decompose multi-modal alignment into independent two-way comp

#multi-modal#contrastive-learning#information-theoryRead on arxiv →
arxivMay 29

Calibrating Generative Models to Distributional Constraints

arXiv:2510.10020v4 Announce Type: replace-cross Abstract: Generative models frequently suffer miscalibration, wherein statistics of the sampling distribution, such as the fraction of generations in a given class, deviate from desired values. We frame calibration as a constrained optimization problem

#machine-learning#calibration#optimizationRead on arxiv →
arxivMay 29

Approximate Proportionality in Online Fair Division

arXiv:2508.03253v2 Announce Type: replace-cross Abstract: We study the online fair division problem, where indivisible goods arrive sequentially and must be allocated immediately and irrevocably. Prior work establishes strong impossibility results for approximating classic notions such as envy-freen

#fair-division#online-algorithms#machine-learningRead on arxiv →
arxivMay 29

Collaborative Threshold Watermarking

arXiv:2602.10765v2 Announce Type: replace Abstract: In federated learning (FL), $K$ clients jointly train a model without sharing raw data. Because each participant invests data and compute, clients need mechanisms to later prove the provenance of a jointly trained model. Model watermarking embeds a

#federated-learning#model-watermarking#machine-learningRead on arxiv →
arxivMay 29

Evolving Features vs Evolving Entire Trees with GP for Interpretable Survival Analysis

arXiv:2605.30119v1 Announce Type: cross Abstract: Survival analysis concerns the task of predicting the time until an event occurs. Often used in the medical field, survival analysis deals with incomplete (i.e., censored) data, for instance, from patients who did not experience the event during the

#machine-learning#survival-analysis#evolutionary-computingRead on arxiv →
arxivMay 28bullish

Learning Compositional Latent Structure with Vector Networks

arXiv:2605.28007v1 Announce Type: cross Abstract: Deep networks are powerful function approximators, but they typically store many different computations in shared weight matrices, making it difficult to selectively reuse or adapt parts of them when a familiar structure appears in novel combinations

VE1 model#machine-learning#artificial-intelligence#neural-networksRead on arxiv →
arxivMay 28

Compositional Generalization in Autoregressive Models via Logit Composition

arXiv:2605.28304v1 Announce Type: new Abstract: Composing autoregressive models remains a core challenge in understanding how large language models can combine behaviors or skills learned across tasks. We introduce a new and principled composition strategy for autoregressive systems, inspired by com

#machine-learning#autoregressive-models#diffusion-modelsRead on arxiv →
arxivMay 28

Worker Disagreement Reveals Sharp Directions in Local SGD

arXiv:2605.27739v1 Announce Type: cross Abstract: Deep neural network training often exhibits highly anisotropic loss geometry, where a few sharp dominant Hessian directions coexist with a large flatter bulk. Gradients tend to align disproportionately with these dominant directions, although stable

MLCNTR3 models#machine-learning#deep-learning#optimizationRead on arxiv →
arxivMay 28

DSSE: a drone swarm search environment

arXiv:2307.06240v2 Announce Type: replace-cross Abstract: The Drone Swarm Search project is an environment, based on \textsc{PettingZoo}, that is to be used in conjunction with multi-agent (or single-agent) reinforcement learning algorithms. It is an environment in which the agents (drones), have to

#reinforcement-learning#multi-agent#machine-learningRead on arxiv →
arxivMay 28

LiDDA: Data Driven Attribution at LinkedIn

arXiv:2505.09861v3 Announce Type: replace-cross Abstract: Data Driven Attribution, which assigns conversion credits to marketing interactions based on causal patterns learned from data, is the foundation of modern marketing intelligence and vital to any marketing business and advertising platform. I

TR1 model#machine-learning#marketing#ad-techRead on arxiv →
arxivMay 26bullish

'Si'multaneous 'S'patial-'T'emporal Message Passing for Dynamic Graph Representation Learning

arXiv:2605.25548v1 Announce Type: cross Abstract: Dynamic graph neural networks (DGNNs) that operate on snapshot sequences typically fall into one of two categories. \emph{Temporal-first} approaches build per-node temporal embeddings and only afterwards perform spatial aggregation, whereas \emph{Spa

SI1 model#machine-learning#graph-neural-networks#link-predictionRead on arxiv →
arxivMay 26bullish

Kolmogorov-Arnold Fourier Networks

arXiv:2502.06018v3 Announce Type: replace-cross Abstract: Although Kolmogorov-Arnold-based interpretable networks (KANs) possess strong theoretical expressiveness, they suffer from severe parameter explosion and limited ability to capture high-frequency features in high-dimensional tasks. To address

KO1 model#machine-learning#neural-networks#spectral-reparameterizationRead on arxiv →
arxivMay 26

Reinforcement Learning for Reachability: Guaranteeing Asymptotic Optimality

arXiv:2605.24740v1 Announce Type: new Abstract: Reinforcement learning (RL) for reachability specifications is fundamental in sequential decision-making, yet theoretical guarantees remain less explored. A recent work achieves asymptotic convergence to optimal policies. However, this approach provide

#reinforcement-learning#machine-learning#convergenceRead on arxiv →
arxivMay 26

Active Query Synthesis for Preference Learning

arXiv:2605.26072v1 Announce Type: new Abstract: Efficient learning of user preferences is crucial for many modern decision making systems but typically requires costly labeled data. Active learning reduces this cost, yet standard methods are computationally expensive due to pool-based evaluation. Fu

#active-learning#machine-learning#optimizationRead on arxiv →
arxivMay 25

MELT: A Behavioral Trace Dataset for High-Risk Memecoin Launch Detection

arXiv:2602.13480v2 Announce Type: cross Abstract: Launchpads have become the dominant mechanism for issuing memecoins, exposing investors to a new class of high-risk launches that existing rug-pull detection methods cannot capture. We argue that detecting these threats requires structured behavioral

#blockchain#cryptography#machine-learningRead on arxiv →
arxivMay 25bullish

PaP-NF: Probabilistic Long-Term Time Series Forecasting via Prefix-as-Prompt Reprogramming and Normalizing Flows

arXiv:2605.23219v1 Announce Type: cross Abstract: Time series forecasting plays a central role in many real-world applications and has been extensively studied. Most existing approaches rely on deterministic models. However, real-world environments exhibit inherently uncertain and complex future beh

PA1 model#probabilistic-forecasting#time-series#machine-learningRead on arxiv →
arxivMay 25

Does Your Wildfire Prediction Model Actually Work, or Just Score Well?

arXiv:2605.18911v2 Announce Type: replace-cross Abstract: Wildfire prediction is important for early warning and resource allocation, yet existing Earth foundation models (Earth FMs) are pretrained for general atmospheric and geophysical objectives rather than wildfire forecasting. To address this g

WI1 model#wildfire-prediction#earth-foundation-models#machine-learningRead on arxiv →
arxivMay 25bullish

Task-Awareness Improves LLM Generations and Uncertainty

arXiv:2601.21500v2 Announce Type: replace Abstract: In many applications of LLMs, natural language responses often have an underlying structure such as representing discrete labels, numerical values, or graphs. Yet, existing decoding and uncertainty estimation methods operate only in language space

#llms#machine-learning#uncertainty-estimationRead on arxiv →
arxivMay 25bullish

Transform-Invariant Generative Ray Path Sampling for Efficient Radio Propagation Modeling

arXiv:2603.01655v2 Announce Type: replace Abstract: Ray tracing has become a standard for accurate radio propagation modeling, but suffers from exponential computational complexity, as the number of candidate paths scales with the number of objects raised to the interaction order. This bottleneck li

GE1 model#machine-learning#signal-processing#optimizationRead on arxiv →
arxivMay 22bullish

Towards Autonomous Mechanistic Reasoning in Virtual Cells

arXiv:2604.11661v3 Announce Type: replace-cross Abstract: Large language models (LLMs) have recently gained significant attention as a promising approach to accelerate scientific discovery. However, their application in open-ended scientific domains such as biology remains limited, primarily due to

#machine-learning#artificial-intelligence#biologyRead on arxiv →
arxivMay 22

Discovering Entity-Conditioned Lag Heterogeneity: A Lag-Gated Neural Audit Framework for Panel Time Series

arXiv:2605.21542v1 Announce Type: new Abstract: Country-level temporal panels are widely used in empirical analysis. Researchers often need to audit how different entities respond to historical signals over different time horizons. Current approaches typically do not provide directly auditable entit

AC1 model#machine-learning#time-series#auditRead on arxiv →
arxivMay 22bullish

Distribution-Aware Reward: Reinforcement Learning over Predictive Distributions for LLM Regression

arXiv:2605.20740v1 Announce Type: cross Abstract: Large language models can predict real-valued quantities from heterogeneous inputs such as text, code, and molecular strings, but most training objectives score each decoded floating-point number independently, improving point estimates without ensur

#machine-learning#regression#reinforcement-learningRead on arxiv →
arxivMay 22

How Many Different Outputs Can a Transformer Generate?

arXiv:2605.22223v1 Announce Type: new Abstract: We study how we can leverage only a handful of characteristics of a transformer's architecture to closely predict the number of different sequences it can output, both qualitatively and quantitatively. We provide an upper bound depending on the length

TR1 model#machine-learning#research#sequence-modelingRead on arxiv →
arxivMay 22

Expectation Consistency Loss: Rethink Confidence Calibration under Covariate Shift

arXiv:2605.21552v1 Announce Type: new Abstract: Confidence calibration for classification models is vital in safety-critical decision-making scenarios and has received extensive attention. General confidence calibration methods assume training and test data are independent and identically distribute

#calibration#covariate-shift#domain-adaptationRead on arxiv →
arxivMay 22

The Secretary Problem with a Stochastic Precursor

arXiv:2605.22653v1 Announce Type: cross Abstract: In learning-augmented online algorithms, predictions are usually valued for what they say: a value estimate, a solution, or an algorithmic recommendation. This paper shows that predictions can also be valuable solely due to their arrival time. We stu

#online-algorithms#stochastic-processes#decision-makingRead on arxiv →
arxivMay 22bullish

Spectra as Language: Large Language Models for Scalable Stellar Parameter and Abundance Inference

arXiv:2605.22162v1 Announce Type: cross Abstract: Stellar spectra encode key information on the physical properties and chemical compositions of stars. Accurate stellar parameter determination is essential for addressing major questions such as galaxy and stellar evolution. Large-scale spectroscopic

#astronomy#machine-learning#spectroscopyRead on arxiv →
arxivMay 22

Internal narratives parameterise affective states

arXiv:2502.09487v3 Announce Type: replace Abstract: Characterising how we verbalise our feelings is central to psychological assessment and intervention, yet the mapping between narrative and affective state remains poorly understood. Across two large studies (n=1257), we parameterised the structure

LA1 model#psychology#narrative#affective-stateRead on arxiv →
arxivMay 22bullish

Choose Wisely and Privately: Proactive Client Selection for Fair and Efficient Federated Learning

arXiv:2605.20975v2 Announce Type: replace Abstract: Federated Learning enables collaborative model training across decentralized data sources without data transfer. Averaging-based FL is limited by the presence of non-IID data, which negatively impacts convergence speed and final model accuracy. Con

#federated-learning#machine-learning#cryptographyRead on arxiv →
arxivMay 21

Batched Single-Index Global Multi-Armed Bandits with Covariates

arXiv:2503.00565v3 Announce Type: replace-cross Abstract: The multi-armed bandits (MAB) framework is a widely used approach for sequential decision-making, where a decision-maker selects an arm in each round with the goal of maximizing long-term rewards. In many practical applications, such as perso

#machine-learning#bandits#recommendation-systemsRead on arxiv →
arxivMay 21bullish

CASCADE Conformal Prediction: Uncertainty-Adaptive Prediction Intervals for Two-Stage Clinical Decision Support

arXiv:2605.20468v1 Announce Type: new Abstract: Effective medication management in Parkinson's Disease (PD) is challenging due to heterogeneous disease progression, variable patient response, and medication side effects. While AI models can forecast levodopa equivalent daily dose (LEDD) as a measure

CA1 model#healthcare#machine-learning#uncertainty-quantificationRead on arxiv →
arxivMay 21bullish

INSHAPE: Instance-Level Shapelets for Interpretable Time-Series Classification

arXiv:2605.20088v1 Announce Type: cross Abstract: Discovering shapelets -- i.e., discriminative temporal patterns within time series -- has been widely studied to address the inherent complexity of time-series classification (TSC) and to make model decision-making processes more transparent. However

IN1 model#time-series#classification#interpretabilityRead on arxiv →
arxivMay 21

Adversarial Robustness in One-Stage Learning-to-Defer

arXiv:2510.10988v2 Announce Type: replace-cross Abstract: Learning-to-Defer (L2D) enables hybrid decision-making by routing inputs either to a predictor or to external experts. While promising, L2D is highly vulnerable to adversarial perturbations, which can not only flip predictions but also manipu

#adversarial-robustness#machine-learning#hybrid-decision-makingRead on arxiv →
arxivMay 21bullish

Fast and Featureless Node Representation Learning with Partial Pairwise Supervision

arXiv:2605.19916v1 Announce Type: cross Abstract: We introduce Contrastive FUSE, a fast and unified framework for scalable node representation learning in graphs with partially available pairwise node labels and no available node features. Unlike existing methods, we directly optimize a spectral con

CO1 model#machine-learning#graph-learning#optimizationRead on arxiv →
arxivMay 21bullish

Cross-Subject Intracranial EEG Reconstruction from Scalp Recordings Using Multi-Scale Cross-Attention Transformers

arXiv:2605.18897v1 Announce Type: cross Abstract: Intracranial EEG (iEEG) provides high-fidelity neural recordings essential for clinical and brain-computer interface applications, but acquiring these signals requires invasive surgery. While recent studies have attempted to estimate iEEG from non-in

CA1 model#neural-recordings#brain-computer#machine-learningRead on arxiv →
arxivMay 21bullish

Dynamic Shapley Computation

arXiv:2605.20620v1 Announce Type: new Abstract: Shapley-based data valuation provides a principled way to quantify the contribution of training data, but its high computational cost makes it impractical in dynamic settings where tasks and training players evolve. Existing methods treat Shapley compu

#machine-learning#valuation#efficiencyRead on arxiv →
arxivMay 20bullish

Cubit: Token Mixer with Kernel Ridge Regression

arXiv:2605.06501v2 Announce Type: replace-cross Abstract: Since its introduction in 2017, the Transformer has become one of the most widely adopted architectures in modern deep learning. Despite extensive efforts to improve positional encoding, attention mechanisms, and feed-forward networks, the co

TRCU2 models#machine-learning#architecture#regressionRead on arxiv →
arxivMay 19

Ready from Day 1: Population-Aware Coordination for Large-Scale Constrained Multi-Agent Systems

arXiv:2605.13900v2 Announce Type: replace-cross Abstract: In large-scale multi-agent systems with shared resource constraints, an upstream planner must iteratively evaluate candidate resource plans -- assessing feasibility, aggregate response, and marginal cost -- before committing to one. Lagrangia

#multi-agent#machine-learning#supply-chainRead on arxiv →
arxivMay 19

Enabling Off-Policy Imitation Learning with Deep Actor Critic Stabilization

arXiv:2511.07288v2 Announce Type: replace-cross Abstract: Learning complex policies with Reinforcement Learning (RL) is often hindered by instability and slow convergence, a problem exacerbated by the difficulty of reward engineering. Imitation Learning (IL) from expert demonstrations bypasses this

GATR2 models#reinforcement-learning#imitation-learning#machine-learningRead on arxiv →
arxivMay 18

Preconditioned Regularized Wasserstein Proximal Sampling

arXiv:2509.01685v2 Announce Type: replace-cross Abstract: We consider sampling from a Gibbs distribution by evolving finitely many particles. We propose a preconditioned version of a recently proposed noise-free sampling method, governed by approximating the score function with the numerically tract

TR1 model#machine-learning#optimization#sampling-methodsRead on arxiv →
arxivMay 18

Rethinking Neural Network Learning Rates: A Stackelberg Perspective

arXiv:2605.15530v1 Announce Type: new Abstract: Neural networks are typically trained with a single learning rate across all layers. While recent empirical evidence suggests that assigning layer-specific learning rates can accelerate training, a principled understanding of the conditions and mechani

#machine-learning#optimization#neural-networksRead on arxiv →
arxivMay 16bullish

Beyond What to Select: A Plug-and-play Oscillatory Data-Volume Scheduling for Efficient Model Training

arXiv:2605.14773v1 Announce Type: cross Abstract: Data selection accelerates training by identifying representative training data while preserving model performance. However, existing methods mainly focus on designing sample-importance criteria, i.e., deciding what to select, while typically fixing

#optimization#machine-learning#efficiencyRead on arxiv →
arxivMay 16

Trapping Attacker in Dilemma: Examining Internal Correlations and External Influences of Trigger for Defending GNN Backdoors

arXiv:2605.08278v2 Announce Type: replace-cross Abstract: GNNs have become a standard tool for learning on relational data, yet they remain highly vulnerable to backdoor attacks. Prior defenses often depend on inspecting specific subgraph patterns or node features, and thus can be circumvented by ad

#graph-neural-networks#backdoor-attacks#securityRead on arxiv →
arxivMay 16

Interestingness as an Inductive Heuristic for Future Compression Progress

arXiv:2605.14831v1 Announce Type: new Abstract: One of the bottlenecks on the way towards recursively self-improving systems is the challenge of interestingness: the ability to prospectively identify which tasks or data hold the potential for future progress. We formalize interestingness as an induc

#artificial-intelligence#machine-learning#complexity-theoryRead on arxiv →
arxivMay 16

FutureSim: Replaying World Events to Evaluate Adaptive Agents

arXiv:2605.15188v1 Announce Type: cross Abstract: AI agents are being increasingly deployed in dynamic, open-ended environments that require adapting to new information as it arrives. To efficiently measure this capability for realistic use-cases, we propose building grounded simulations that replay

#benchmark#adaptation#machine-learningRead on arxiv →
arxivMay 15bullish

Focused PU learning from imbalanced data

arXiv:2605.14467v1 Announce Type: new Abstract: We propose a new method of learning from positive and unlabeled (PU) examples in highly imbalanced datasets. Many real-world problems, such as disease gene identification, targeted marketing, fraud detection, and recommender systems, are hard to addres

#machine-learning#imbalanced-datasets#binary-classificationRead on arxiv →
arxivMay 15

What if Tomorrow is the World Cup Final? Counterfactual Time Series Forecasting with Textual Conditions

arXiv:2605.14422v1 Announce Type: new Abstract: Time series forecasting has become increasingly critical in real-world scenarios, where future sequences are influenced not only by historical patterns but also by forthcoming events. In this context, forecasting must dynamically adapt to complex and s

#time-series#forecasting#machine-learningRead on arxiv →
arxivMay 15

To discretize continually: Mean shift interacting particle systems for Bayesian inference

arXiv:2605.14142v1 Announce Type: cross Abstract: Integration against a probability distribution given its unnormalized density is a central task in Bayesian inference and other fields. We introduce new methods for approximating such expectations with a small set of weighted samples -- i.e., a quadr

#machine-learning#bayesian-inference#samplingRead on arxiv →
arxivMay 15bullish

Lang2MLIP: End-to-End Language-to-Machine Learning Interatomic Potential Development with Autonomous Agentic Workflows

arXiv:2605.14527v1 Announce Type: new Abstract: Developing machine learning interatomic potentials (MLIPs) for complex materials systems remains challenging because it requires expertise in atomistic simulations, machine learning, and workflow design, as well as iterative active learning procedures.

LALA2 models#machine-learning#materials-science#automated-pipelinesRead on arxiv →
arxivMay 15bullish

K-Models: a Flexible and Interpretable Method for Ordinal Clustering with Application to Antigen-Antibody Interaction Profiles

arXiv:2605.14828v1 Announce Type: cross Abstract: Existing clustering methods for functional data often prioritize partitioning accuracy over interpretability, making it challenging to extract meaningful insights when the data-generating process follows a specific underlying structure and an ordinal

K-1 model#clustering#interpretability#machine-learningRead on arxiv →
arxivMay 15bullish

R-DMesh: Video-Guided 3D Animation via Rectified Dynamic Mesh Flow

arXiv:2605.13838v2 Announce Type: replace-cross Abstract: Video-guided 3D animation holds immense potential for content creation, offering intuitive and precise control over dynamic assets. However, practical deployment faces a critical yet frequently overlooked hurdle: the pose misalignment dilemma

REVATR4 models · +1#animation#computer-vision#machine-learningRead on arxiv →
arxivMay 15

Generative Bayesian Optimization: Generative Models as Acquisition Functions

arXiv:2510.25240v3 Announce Type: replace-cross Abstract: We present a general strategy for turning generative models into candidate solution samplers for batch Bayesian optimization (BO). The use of generative models for BO enables large batch scaling as generative sampling, optimization of non-con

#optimization#machine-learning#researchRead on arxiv →
arxivMay 15bullish

Test-Time Learning with an Evolving Library

arXiv:2605.14477v1 Announce Type: new Abstract: We introduce EvoLib, a test-time learning framework that enables large language models to accumulate, reuse, and evolve knowledge across problem instances without parameter updates or external supervision. Instead of adapting model parameters, our appr

#machine-learning#test-time-learning#knowledge-accumulationRead on arxiv →
arxivMay 14bullish

ASAP: Amortized Doubly-Stochastic Attention via Sliced Dual Projection

arXiv:2605.12879v1 Announce Type: new Abstract: Doubly-stochastic attention has emerged as a transport-based alternative to row-softmax attention, with recent Transformer variants using it to reduce attention sinks and rank collapse while improving performance. In this family, the standard approach

SIAS2 models#transformer#attention#machine-learningRead on arxiv →
arxivMay 13bullish

The Attacker in the Mirror: Breaking Self-Consistency in Safety via Anchored Bipolicy Self-Play

arXiv:2605.08427v1 Announce Type: new Abstract: Self-play red team is an established approach to improving AI safety in which different instances of the same model play attacker and defender roles in a zero-sum game, i.e., where the attacker tries to jailbreak the defender; if self-play converges to

QW1 model#ai-safety#self-play#machine-learningRead on arxiv →
arxivMay 13bullish

TTCD:Transformer Integrated Temporal Causal Discovery from Non-Stationary Time Series Data

arXiv:2605.08111v1 Announce Type: cross Abstract: The widespread availability of complex time series data in various domains such as environmental science, epidemiology, and economics demands robust causal discovery methods that can identify intricate contemporaneous and lagged relationships in non-

TR1 model#time-series#causal-discovery#machine-learningRead on arxiv →
arxivMay 13

Sparse-Aware Neural Networks for Nonlinear Functionals: Mitigating the Exponential Dependence on Dimension

arXiv:2604.06774v2 Announce Type: replace-cross Abstract: Deep neural networks have emerged as powerful tools for learning operators defined over infinite-dimensional function spaces. However, existing theories frequently encounter difficulties related to dimensionality and limited interpretability.

#machine-learning#functional-analysis#deep-learningRead on arxiv →
arxivMay 13bullish

Bridging Sequence and Graph Structure for Epigenetic Age Prediction

arXiv:2605.10541v1 Announce Type: new Abstract: Epigenetic clocks based on DNA methylation have emerged as powerful tools for estimating biological age, with broad applications in aging research, age-related disease studies, and longevity science. Despite advances across machine learning approaches

PEDERE4 models · +1#aging-research#machine-learning#epigeneticsRead on arxiv →
arxivMay 13bullish

Drop the Act: Probe-Filtered RL for Faithful Chain-of-Thought Reasoning

arXiv:2605.11467v1 Announce Type: new Abstract: Reasoning models post-hoc rationalize answers they have already committed to internally, producing chains of *reasoning theater*: deliberative-looking steps that contribute nothing to correctness. This wastes inference tokens, pollutes interpretability

MEQWCL3 models#reasoning#reinforcement-learning#interpretabilityRead on arxiv →
arxivMay 12

Mistake-Bounded Language Generation

arXiv:2605.10809v1 Announce Type: new Abstract: We investigate the learning task of language generation in the limit, but shift focus from the traditional time-of-last-mistake metric of a generator's success to a new notion of "mistake-bounded generation." While existing results for language generat

#language-generation#machine-learning#algorithmsRead on arxiv →
arxivMay 11bullish

When Does Embedding Magnitude Matter? A Cross-Task Functional-Symmetry Framework

arXiv:2602.09229v3 Announce Type: replace Abstract: Cosine similarity normalizes both sides; dot product normalizes neither. We propose a 2x2 framework that independently controls query-side and document-side normalization, exposing two intermediate variants (QNorm, DNorm) that have not been previou

PR1 model#retrieval#normalization#machine-learningRead on arxiv →
arxivMay 11bullish

EviDep: Trustworthy Multimodal Depression Estimation via Disentangled Evidential Learning

arXiv:2604.16579v2 Announce Type: replace-cross Abstract: Automated multimodal depression estimation in unconstrained environments is inherently challenged by naturalistic noise and complex behavioral variability. Prevailing deterministic methods, however, produce uncalibrated point estimates withou

EV1 model#machine-learning#artificial-intelligence#mental-healthRead on arxiv →
arxivMay 11bullish

Simple KNN-Based Outlier Detection Achieves Robust Clustering

arXiv:2605.07130v1 Announce Type: new Abstract: Being robust to the presence of outliers is crucial for applying clustering algorithms in practice. In the $\textit{robust $k$-Means}$ problem (i.e., $k$-Means with outliers), the goal is to remove $z$ outliers and minimize the $k$-Means cost on the re

#clustering#outlier-detection#machine-learningRead on arxiv →
arxivMay 11bullish

CommFuse: Hiding Tail Latency via Communication Decomposition and Fusion for Distributed LLM Training

arXiv:2604.24013v2 Announce Type: cross Abstract: The rapid growth in the size of large language models has necessitated the partitioning of computational workloads across accelerators such as GPUs, TPUs, and NPUs. However, these parallelization strategies incur substantial data communication overhe

#distributed-training#parallelization#optimizationRead on arxiv →
arxivMay 8

Dynamic Controlled Variables Based Dynamic Self-Optimizing Control

arXiv:2605.06469v1 Announce Type: cross Abstract: Self-optimizing control is a strategy for selecting controlled variables, where the economic objective guides the selection and design of controlled variables, with the expectation that maintaining the controlled variables at constant values can achi

DE1 model#optimization#control#machine-learningRead on arxiv →
arxivMay 8

Estimating Implicit Regularization in Deep Learning

arXiv:2605.05436v1 Announce Type: cross Abstract: Deep learning systems are known to exhibit implicit regularization (alt. implicit bias), favoring simple solutions instead of merely minimizing the loss function. In some cases, we can analytically derive the implicit regularization -- connecting it

#deep-learning#regularization#machine-learningRead on arxiv →
arxivMay 8bullish

Benchmarking PNW Model for MedMNIST to 100% Accuracy

arXiv:2604.18916v4 Announce Type: replace Abstract: In this paper, we introduce a new concept called Artificial Special Intelligence by which Machine Learning models for the classification problem can be trained error-free, thus acquiring the capability of not making repeated mistakes. The method is

#machine-learning#classification#biomedicalRead on arxiv →
arxivMay 8

A renormalization-group inspired lattice-based framework for piecewise generalized linear models

arXiv:2605.05493v1 Announce Type: cross Abstract: We formally introduce a class of models inspired by renormalization group (RG) theory, built on additive hierarchical expansions analogous to those appearing in functional ANOVA and mixed-effects models. Like ReLU convolutional neural networks, they

REPIHI4 models · +1#interpretable-ai#statistical-physics#machine-learningRead on arxiv →
arxivMay 8

SMI: Statistical Membership Inference for Reliable Unlearned Model Auditing

arXiv:2602.01150v2 Announce Type: replace-cross Abstract: Machine unlearning (MU) is essential for enforcing the right to be forgotten in machine learning systems. A key challenge of MU is how to reliably audit whether a model has truly forgotten specified training data. Membership Inference Attacks

#machine-learning#unlearning#auditingRead on arxiv →
arxivMay 8

Discovering What You Can Control: Interventional Boundary Discovery for Reinforcement Learning

arXiv:2603.18257v2 Announce Type: replace-cross Abstract: When an RL agent's observations contain distractors driven by the same confounders as its true state, observational data alone cannot identify which dimensions the agent controls. In our benchmarks, even state-conditioned observational select

SA1 model#machine-learning#artificial-intelligence#reinforcement-learningRead on arxiv →
arxivMay 8

Online Localized Conformal Prediction

arXiv:2605.05497v1 Announce Type: new Abstract: Conformal prediction is a framework that provides valid uncertainty quantification for general models with exchangeable data. However, in the online learning and time-series settings, exchangeability is not satisfied. Existing online conformal methods,

ONOLAD3 models#conformal-prediction#online-learning#time-seriesRead on arxiv →
arxivMay 8bullish

Towards Self-Explainable Document Visual Question Answering with Chain-of-Explanation Predictions

arXiv:2605.06058v1 Announce Type: new Abstract: Document Visual Question Answering (DocVQA) requires vision-language models to reason not only about what information in a document is relevant to a question, but also where the answer is grounded on the page. Existing DocVQA models entangle question-r

CO1 model#explainability#document-visual-question-answering#machine-learningRead on arxiv →
arxivMay 8

Structural Instability of Feature Composition

arXiv:2605.05223v1 Announce Type: cross Abstract: Sparse Autoencoders (SAEs) have emerged as a powerful paradigm for disentangling feature superposition in transformer-based architectures, enabling precise control via activation steering. However, the theoretical foundations of compositional steerin

#machine-learning#artificial-intelligence#researchRead on arxiv →
arxivMay 7bullish

Adaptive Ensemble Aggregation for Actor-Critics

arXiv:2507.23501v2 Announce Type: replace Abstract: Ensembles are ubiquitous in off-policy actor-critic learning, yet their efficacy depends critically on how they are aggregated. Current methods typically rely on static rules or task-specific hyperparameters to balance overestimation bias and varia

#reinforcement-learning#ensemble-methods#machine-learningRead on arxiv →
arxivMay 6

Data driven approach for Outdoor Channel Prediction in 5G and Beyond

arXiv:2605.01777v1 Announce Type: cross Abstract: An evolution of Wireless Communications towards 5G and beyond provides improved user experience in terms of quality of services. Understanding and estimating Channel information plays crucial role in providing better user experience. Traditional meth

LISUDE3 models#wireless-communications#5g#machine-learningRead on arxiv →
arxivMay 5bullish

Linking spatial biology and clinical histology via Haiku

arXiv:2605.00925v1 Announce Type: new Abstract: Integrating molecular, morphological, and clinical data is essential for basic and translational biomedical research, yet systematic frameworks for jointly modeling these modalities remain limited. Here we present Haiku, a tri-modal contrastive learnin

HA1 model#biomedical#research#machine-learningRead on arxiv →
arxivMay 5bullish

Anomaly-Preference Image Generation

arXiv:2605.02439v1 Announce Type: cross Abstract: Synthesizing realistic and diverse anomalous samples from limited data is vital for robust model generalization. However, existing methods struggle to reconcile fidelity and diversity, often hampered by distribution misalignment and overfitting, resp

#anomaly-detection#machine-learning#computer-visionRead on arxiv →
arxivMay 5

TimesNet-Gen: Deep Learning-based Site Specific Strong Motion Generation

arXiv:2512.04694v3 Announce Type: replace-cross Abstract: Effective earthquake risk reduction relies on accurate site-specific evaluations, which require models capable of representing the influence of local site conditions on ground motion characteristics. We address strong ground motion generation

TICO2 models#machine-learning#earthquake-risk#generative-modelsRead on arxiv →
arxivMay 4bullish

Learning Locally, Revising Globally: Global Reviser for Federated Learning with Noisy Labels

arXiv:2412.00452v2 Announce Type: replace Abstract: Conventioanl federated learning (FL) heavily depends on high-quality labels, which are often impractical in the real world, leading to the federated label-noise (F-LN) problem. Worsely, the F-LN problem is exacerbated by the heterogeneity of FL, wh

#federated-learning#label-noise#machine-learningRead on arxiv →
arxivMay 4

A unified convergence theory for adaptive first-order methods in the nonconvex case, including AdaNorm, full and diagonal AdaGrad, Shampoo and Muo

arXiv:2604.17423v2 Announce Type: replace Abstract: A unified framework for first-order optimization algorithms fornonconvex unconstrained optimization is proposed that uses adaptivelypreconditioned gradients and includes popular methods such as full anddiagonal AdaGrad, AdaNorm, as well as adpative

ADADSH4 models · +1#optimization#machine-learning#researchRead on arxiv →
arxivMay 4bullish

NRGPT: An Energy-based Alternative for GPT

arXiv:2512.16762v3 Announce Type: replace Abstract: Generative Pre-trained Transformer (GPT) architectures are the most popular design for language modeling. Energy-based modeling is a different paradigm that views inference as a dynamical process operating on an energy landscape. We propose a minim

GPEN2 models#language-modeling#energy-based-modeling#machine-learningRead on arxiv →
arxivMay 4bullish

Bridging Graph Drawing and Dimensionality Reduction with Stochastic Stress Optimization

arXiv:2605.00641v1 Announce Type: new Abstract: Both Dimensionality Reduction (DR) and Graph Drawing (GD) aim to visualize abstract, non-linear structures, yet rely on different optimization paradigms. This contrast is evident in Multidimensional Scaling (MDS), which typically depends on the SMACOF

#optimization#dimensionality-reduction#machine-learningRead on arxiv →
HomeModelsNews