Tag

#optimization

100 articles tagged #optimization

arxiv · 4d ago · bullish

AI-Driven Surrogate Models for Predicting Electrode-Scale Discharge Behavior in Lithium-Ion Batteries

arXiv:2607.20577v1 Announce Type: new Abstract: Physics-based simulations are essential for understanding the electrode-scale discharge behavior of lithium-ion batteries (LIBs) but suffer from prohibitive computational costs. To address this, we introduce a novel deep learning surrogate pipeline bas

SW1 model #deep-learning #battery-design #optimization Read on arxiv →

arxiv · 4d ago · bullish

Routing Without Training: Controllable-Ratio LLM Offloading via Reliability Gating

arXiv:2607.20481v1 Announce Type: new Abstract: Local-cloud collaboration is a practical way to deploy large language models under resource constraints, but existing methods often rely on trained routers or collaboration-aware finetuning that tie routing behavior to a particular operating regime. In

#collaboration #local-cloud #language-models Read on arxiv →

arxiv · 5d ago · bullish

Convergence-Latency-Aware Adaptive Modulation and Resource Allocation in RIS-Assisted Wireless Federated Learning

arXiv:2607.19759v1 Announce Type: cross Abstract: Federated learning (FL) over wireless networks suffers from significant training latency and degraded convergence due to unreliable wireless transmission, especially under blocked propagation environments. Although reconfigurable intelligent surfaces

MNCISP3 models #federated-learning #wireless-networks #optimization Read on arxiv →

arxiv · 6d ago

Integro-differential equations in angular stabilization of drone motion by distributed feedback control

arXiv:2607.18251v1 Announce Type: new Abstract: In this paper, we propose angular stabilization of drone motion using distributed feedback control in the form of an integral operator. It should be stressed that the memory of this integral operator could be unbounded. It is intuitively clear that lar

#control #stabilization #drones Read on arxiv →

arxiv · 6d ago · bullish

Beyond Accuracy and Cost: Latency-Aware LLM Query Routing for Dynamic Workloads

arXiv:2607.18253v1 Announce Type: new Abstract: Modern language query routers improve inference efficiency by assigning each query to a model that balances response quality and monetary cost. However, current query routers are largely latency-agnostic and do not consider the generation latency exper

#optimization #latency #inference Read on arxiv →

arxiv · Jul 20 · bullish

Evolutionary Algorithm-Guided LLMs for Physics-Informed Neural Network Design

arXiv:2607.15560v1 Announce Type: cross Abstract: Physics-informed neural networks (PINNs) are unusually sensitive to interacting choices of architecture, activation, loss weighting, collocation, optimization, and constraint enforcement. Large language models (LLMs) can propose these choices, but in

PILL2 models #neural-networks #evolutionary-algorithm #optimization Read on arxiv →

arxiv · Jul 20 · bullish

LLM-Guided Transportation Hub Capacity Planning with Textual Business Inputs

arXiv:2607.03651v2 Announce Type: replace Abstract: While traditional hub capacity planning models optimize effectively for quantitative inputs, they often fail to digest qualitative business context. We propose a novel framework where a large language model (LLM) agent iteratively proposes hub capa

LA1 model #optimization #machine-learning #operations-research Read on arxiv →

arxiv · Jul 18 · bullish

Smarter and Cheaper at Once: Byte-Exact KV-Cache Grafting Turns a Frozen Small Model into a Verified-Knowledge Flywheel

arXiv:2607.14431v1 Announce Type: cross Abstract: We report a way to make a frozen small language model both more capable and dramatically cheaper at once, without changing any weights. Verified knowledge is deposited once as a byte-exact key-value (KV) state artifact and later restored, by graft, i

GE1 model #language-model #optimization #performance Read on arxiv →

arxiv · Jul 18 · bullish

Counterfactual Optimal Action Trees (COAT): Interpretable Prescriptive Policies from Observational Data

arXiv:2607.14318v1 Announce Type: new Abstract: We introduce COAT (Counterfactual Optimal Action Tree), a framework for learning interpretable prescriptive policies from observational data. COAT combines counterfactual outcome estimation with large-scale mixed-integer optimization, using column gene

CO1 model #machine-learning #optimization #revenue Read on arxiv →

arxiv · Jul 18

Models Can Model, But Can't Bind: Structured Grounding in Text-to-Optimization

arXiv:2605.21751v2 Announce Type: replace Abstract: Text-to-optimization requires two separable capabilities: modeling -- choosing the right optimization structure -- and binding -- grounding every coefficient, index, and parameter in the concrete problem data. We study this via Text2Opt-Bench, a sc

#optimization #machine-learning #benchmark Read on arxiv →

arxiv · Jul 18 · bullish

Deep-learning Causal Retrieval Optimization for Efficient e-commerce Distribution in Pinterest

arXiv:2607.14161v1 Announce Type: cross Abstract: Pinterest is where people turn inspiration into action as users browse ideas, then take steps toward realization, often by discovering shoppable content. To support this journey, we must distribute commerce content when it helps, not when it distract

#e-commerce #personalization #recommendation Read on arxiv →

arxiv · Jul 18 · bullish

Branching Policy Optimization: Sandbox-Native Language Agent Reinforcement Learning

arXiv:2607.14171v1 Announce Type: new Abstract: Reinforcement learning has emerged as the dominant paradigm for training large language model (LLM) agents that interact with executable sandboxes. State-of-the-art algorithms such as PPO, RLOO, and GRPO inherit their rollout topology from RLHF: for ea

PPRLGR5 models · +2 #reinforcement-learning #language-models #optimization Read on arxiv →

arxiv · Jul 16 · bullish

NSNQuant: A Double Normalization Approach for Calibration-Free Low-Bit Vector Quantization of KV Cache

arXiv:2505.18231v3 Announce Type: replace-cross Abstract: Large Language Model (LLM) inference is typically memory-intensive, especially when processing large batch sizes and long sequences, due to the large size of key-value (KV) cache. Vector Quantization (VQ) is recently adopted to alleviate this

#machine-learning #vector-quantization #optimization Read on arxiv →

arxiv · Jul 15 · bullish

Self-Evolving In-Context Learning for Direct Pilot-to-Beamformer Design in MU-MISO Systems

arXiv:2607.11970v1 Announce Type: cross Abstract: We develop an enhanced in-context learning (ICL) framework to improve the performance of pilot-based beamforming in multi-user multiple-input single-output (MU-MISO) systems. The proposed scheme integrates the ICL-Transformer backbone with the pilot

ICTR2 models #machine-learning #beamforming #communication Read on arxiv →

arxiv · Jul 14

Optimizing ARDL Models for Retail Sales Forecasting and Fair Pricing

arXiv:2607.09956v1 Announce Type: new Abstract: Pricing food products to balance profitability with consumer welfare is a central challenge for retailers. Dynamic pricing is widely used to maximize revenue, yet most pricing models optimize business objectives while overlooking consumer fairness. Thi

AULISI5 models · +2 #pricing #fairness #forecasting Read on arxiv →

arxiv · Jul 14 · bullish

PM-KVQ: Progressive Mixed-precision KV Cache Quantization for Long-CoT LLMs

arXiv:2505.18610v2 Announce Type: replace Abstract: Recently, significant progress has been made in developing reasoning-capable Large Language Models (LLMs) through long Chain-of-Thought (CoT) techniques. However, this long-CoT reasoning process imposes substantial memory overhead due to the large

LA1 model #quantization #compression #language-models Read on arxiv →

arxiv · Jul 13

Pitfalls and Remedies for Multi-Task Bayesian Optimization

arXiv:2607.09073v1 Announce Type: new Abstract: Bayesian optimization routinely warm-starts a target experiment with data from related source tasks, and the multi-task Gaussian process is the textbook surrogate for the job. We revisit this default in a controlled setting and find that it misestimate

GA1 model #machine-learning #optimization #transfer-learning Read on arxiv →

arxiv · Jul 10

($\theta_l, \theta_u$)-Parametric Multi-Task Optimization: Joint Search in Solution and Infinite Task Spaces

arXiv:2503.08394v5 Announce Type: replace-cross Abstract: Multi-task optimization is typically characterized by a fixed and finite set of tasks. The present paper relaxes this condition by considering a non-fixed and potentially infinite set of optimization tasks defined in a parameterized, continuo

#optimization #machine-learning #evolutionary-computing Read on arxiv →

arxiv · Jul 10

Optimal uncertainty bounds for multivariate kernel regression under bounded noise: A Gaussian process-based dual function

arXiv:2603.16481v3 Announce Type: replace Abstract: Non-conservative uncertainty bounds are essential for making reliable predictions about latent functions from noisy data, and thus, a key enabler for safe learning-based control. In this domain, kernel methods such as Gaussian process regression ar

GA1 model #machine-learning #optimization #control Read on arxiv →

arxiv · Jul 10 · bullish

Efficient Long-Horizon Learning for Learned Optimization

arXiv:2607.06772v2 Announce Type: replace Abstract: Learned optimization aims to improve upon hand-designed optimizers (e.g., Adam and Muon) by meta-learning small neural network optimizers over a distribution of tasks. While recent work has greatly advanced the architectural design and inductive bi

ADMUGP6 models · +3 #optimization #meta-learning #language-modeling Read on arxiv →

arxiv · Jul 10

The Regularization Parameter: Sparse Precision Matrix Estimation

arXiv:2607.07735v1 Announce Type: cross Abstract: Sparse precision matrix estimation provides an interpretable and computationally efficient framework for modeling conditional dependencies in high-dimensional, low-sample-size data. A recurring challenge is appropriately selecting the regularization

#machine-learning #estimation #optimization Read on arxiv →

arxiv · Jul 3 · bullish

Evergreen: Efficient Claim Verification for Semantic Aggregates

arXiv:2604.26180v2 Announce Type: replace-cross Abstract: With recent semantic query processing engines, semantic aggregation has become a primitive operator, enabling the reduction of a relation into a natural language aggregate using an LLM. However, the resulting semantic aggregate may contain cl

LL1 model #databases #optimization #verification Read on arxiv →

arxiv · Jul 3 · bullish

The risk of KV cache compression

arXiv:2607.01520v1 Announce Type: new Abstract: Transformer inference on long sequences is expensive because softmax attention repeatedly reads from a large KV cache. The prevalent approach to this bottleneck is KV cache compression, which replaces the full cache with a compact summary. Despite its

#machine-learning #optimization #compression Read on arxiv →

arxiv · Jul 3

Parameter Golf: What Really Works?

arXiv:2607.01517v1 Announce Type: new Abstract: How far can a language model improve under a strict artifact budget? Parameter Golf posed this question as an open community challenge in which participants trained the best language model, with the complete artifact (training code + compressed weights

#optimization #language-models #benchmark Read on arxiv →

arxiv · Jul 2 · bullish

Neural Certificate Pricing for Combinatorial Optimization Problems

arXiv:2607.01185v1 Announce Type: new Abstract: Combinatorial optimization (CO) problems are difficult because certifiable discrete structure induces exponential search. One needs to search over the set of exponentially many candidates to certify optimality; however, the structural feasibility of a pat

NE1 model #optimization #machine-learning #research Read on arxiv →

arxiv · Jun 30 · bullish

Randomized neural operator for parametric PDEs with fast training and conformal uncertainty quantification

arXiv:2606.29440v1 Announce Type: new Abstract: Repeatedly solving parametric PDEs is essential for uncertainty quantification, design optimization and inverse problems, but conventional neural operators require expensive non-convex training. We introduce PCA--RaNN, a randomized latent neural operat

PC1 model #machine-learning #numerical-analysis #optimization Read on arxiv →

arxiv · Jun 29 · bullish

Layerwise Progressive Freezing: A Training Scaffold for Depth-Scalable Binary Networks

arXiv:2606.27759v1 Announce Type: new Abstract: Training binary neural networks (BNNs) from scratch is dominated by the straight-through estimator (STE), whose forward/backward mismatch produces severe accuracy degradation as networks deepen. We study an orthogonal axis: when and where binarization

RERERE5 models · +2 #binary-neural-networks #training-methods #deep-learning Read on arxiv →

arxiv · Jun 29 · bullish

OGPO: Sample Efficient Full-Finetuning of Generative Control Policies

arXiv:2605.03065v4 Announce Type: replace Abstract: Generative control policies (GCPs), such as diffusion- and flow-based control policies, have emerged as effective parameterizations for robot learning. This work introduces Off-policy Generative Policy Optimization (OGPO), a sample-efficient algori

#robotics #machine-learning #optimization Read on arxiv →

arxiv · Jun 29 · bullish

Optimizing Teacher-Student Partitioning for Scalable Knowledge Distillation on HPC Systems

arXiv:2606.27797v1 Announce Type: cross Abstract: Knowledge Distillation (KD) enables training smaller student models under the guidance of larger teacher models, and the widely adopted TRL library implements it. Yet, TRL treats both models symmetrically, missing opportunities to exploit their prono

#distributed #parallelism #optimization Read on arxiv →

arxiv · Jun 29

Categorical Optimization with Bayesian Anchored Latent Trust Regions for Structural Design under High-Dimensional Uncertainty

arXiv:2604.25241v2 Announce Type: replace Abstract: Categorical structural optimization under aleatoric uncertainty is challenging because each design variable must be selected from a finite catalog of admissible instances, while each candidate design may require expensive stochastic finite-element

CO1 model #optimization #machine-learning #uncertainty Read on arxiv →

arxiv · Jun 19 · bullish

Deep-Unfolded Coordination

arXiv:2606.19920v1 Announce Type: cross Abstract: Distributed optimization is a highly scalable and structurally transparent technique to solve multi-agent robotics problems; however, such methods often suffer from the need for highly-specialized, problem-specific hyperparameter tunings. In this wor

DE1 model #optimization #robotics #machine-learning Read on arxiv →

arxiv · Jun 19 · bullish

Physics-Informed Neural Network with Squeeze-Excitation-like Attention

arXiv:2606.19853v1 Announce Type: new Abstract: We introduce SEA-PINN, a novel architecture that incorporates a Squeeze-Excitation-like attention mechanism into physics-informed neural networks to dynamically recalibrate the importance of neurons across layers. A key feature of SEA-PINN is its highl

SEFNTS3 models #physics-informed-neural-networks #machine-learning #benchmark Read on arxiv →

arxiv · Jun 19 · bullish

VIMPO: Value-Implicit Policy Optimization for LLMs

arXiv:2606.20008v1 Announce Type: new Abstract: Reinforcement learning with verifiable rewards has become a central tool for improving the reasoning ability of large language models, but current methods face a trade-off between simplicity and credit assignment. Group-relative methods such as GRPO av

VIGRPP3 models #reinforcement-learning #language-models #optimization Read on arxiv →

arxiv · Jun 18 · bullish

Optimal scenario design for climate emulation

arXiv:2606.19302v1 Announce Type: cross Abstract: As deep learning for physical systems continues to grow in popularity, efforts to improve generalizability have primarily focused on designing architectures that embed physical constraints. However, for machine-learning surrogate climate models (emul

SI1 model #climate-modeling #machine-learning #optimization Read on arxiv →

arxiv · Jun 16

Representation Costs in Data Science: Foundations and the Quasi-Banach Spaces of Deep Neural Networks

arXiv:2606.14954v1 Announce Type: cross Abstract: We develop a general framework for analyzing representation costs of parametric data-fitting methods through their parameter-space regularizers. From this abstract perspective, we define representation costs for arbitrary parametric models and reveal

DE1 model #machine-learning #optimization #functional-analysis Read on arxiv →

arxiv · Jun 12

Phi-Actor-Critic: Steering General-Sum Games to Pareto-Efficient Correlated Equilibria

arXiv:2606.11284v1 Announce Type: cross Abstract: Real-world multi-agent systems, from traffic coordination to resource allocation, are often modeled as general-sum games where individual incentives conflict with collective welfare. In these settings, the central challenge is not merely finding an e

PH1 model #multi-agent #reinforcement-learning #game-theory Read on arxiv →

arxiv · Jun 12

Position: Generative Engine Optimization Creates Underexamined Risks, Governance Must Target Concentration, Disclosure, and Academic Blind Spots

arXiv:2606.12439v1 Announce Type: cross Abstract: Large language model (LLM) answer engines are increasingly used for information seeking, shifting visibility from ranked lists to synthesized answers. This enables Generative Engine Optimization (GEO), which targets LLM answer engines' evidence pool

#optimization #governance #artificial-intelligence Read on arxiv →

arxiv · Jun 12 · bullish

A Physics-Inspired Optimizer: Velocity Regularized Adam

arXiv:2505.13196v3 Announce Type: replace Abstract: We introduce Velocity-Regularized Adam (VRAdam), a physics-inspired optimizer for training deep neural networks that draws on ideas from quartic terms for kinetic energy with its stabilizing effects on various system dynamics. Previous algorithms,

VEADAD3 models #optimization #deep-learning #machine-learning Read on arxiv →

arxiv · Jun 12 · bullish

Arbor: Tree Search as a Cognition Layer for Autonomous Agents

arXiv:2606.12563v1 Announce Type: new Abstract: Arbor is a multi-agent framework that introduces structured tree search as a cognition layer for autonomous agents operating in large, stateful action spaces. Prior autonomous optimization systems operate on isolated targets with stateless evaluation.

#autonomous-agents #optimization #artificial-intelligence Read on arxiv →

arxiv · Jun 12

Small Experiments, Cheaper Decisions: A Case Study in Staged Promotion for Micro-Pretraining

arXiv:2606.11387v1 Announce Type: cross Abstract: Short pretraining runs can reduce experimental cost, but they can also over-promote configurations that only look strong at tiny budgets. We study an auditable staged-promotion protocol for a fixed micro-pretraining runner on two heterogeneous host b

#pretraining #optimization #machine-learning Read on arxiv →

arxiv · Jun 12 · bullish

MiniMax Sparse Attention

arXiv:2606.13392v1 Announce Type: new Abstract: Ultra-long-context capability is becoming indispensable for frontier LLMs: agentic workflows, repository-scale code reasoning, and persistent memory all require the model to jointly attend over hundreds of thousands to millions of tokens, yet the quadr

#attention-mechanisms #llms #optimization Read on arxiv →

arxiv · Jun 10 · bullish

TRACE: A Unified Rollout Budget Allocation Framework for Efficient Agentic Reinforcement Learning

arXiv:2606.11119v1 Announce Type: cross Abstract: Reinforcement learning with verifiable rewards (RLVR) is a promising approach for enhancing reasoning and agentic behavior in large language models. However, rollout-intensive policy optimization is often limited by insufficient reward contrast, aris

QW1 model #reinforcement-learning #language-models #optimization Read on arxiv →

arxiv · Jun 10 · bullish

HMAF: A Hierarchical Multi-Slot GD-RTB Allocation Framework

arXiv:2606.09896v1 Announce Type: cross Abstract: In modern online advertising platforms, Guaranteed Delivery (GD) contracts coexist and bid with Real-Time Bidding (RTB) auctions. Recent approaches either decouple GD and RTB optimization or rely on heuristic priority rules, and thus fail to effectiv

#advertising #optimization #revenue Read on arxiv →

arxiv · Jun 10 · bullish

Sim2Schedule: A Simulator-Guided LLM Framework for Autonomous Open-Pit Mine Scheduling

arXiv:2606.10286v1 Announce Type: new Abstract: Open-pit mine scheduling is a critical process for maximizing economic return under complex geotechnical and operational constraints. While Mixed-Integer Linear Programming (MILP) provides mathematically optimal baselines, its exponential computational

LA1 model #optimization #scheduling #industrial-applications Read on arxiv →

arxiv · Jun 10 · bullish

Operator Fusion for LLM Inference on the Tensix Architecture

arXiv:2606.09879v1 Announce Type: new Abstract: This study addresses on-device inference bottlenecks of Transformer models on Tenstorrent's Tensix architecture and proposes an operator fusion strategy that enhances data locality. RMSNorm is fused with matrix multiplication in self-attention and in t

TRQWQW4 models · +1 #machine-learning #optimization #parallelism Read on arxiv →

arxiv · Jun 6 · bullish

Value-and-Structure Alignment for Routing-Consistent Quantization of Mixture-of-Experts Models

arXiv:2606.05688v1 Announce Type: cross Abstract: Mixture-of-Experts (MoE) models scale foundation models efficiently by activating only a subset of experts for each token, but their large number of expert parameters still makes quantization essential for practical deployment. Unlike dense models, h

MI1 model #quantization #moe #foundation-models Read on arxiv →

arxiv · Jun 3

Decentralized Stochastic Nonconvex Optimization under the $(L_0,L_1)$-Smoothness

arXiv:2509.08726v3 Announce Type: replace-cross Abstract: This paper focuses on the decentralized stochastic optimization problem $f(\mathbf{x})=\frac{1}{m}\sum_{i=1}^m f_i(\mathbf{x})$ over a connected network of $n$ agents, where each local function has the form of $f_i(\mathbf{x}) = {\mathbb E}\l

#optimization #stochastic #nonconvex Read on arxiv →

arxiv · Jun 3 · bullish

Filter, Then Reweight: Rethinking Optimization Granularity in On-Policy Distillation

arXiv:2606.02684v1 Announce Type: cross Abstract: On-Policy distillation (OPD) in large language models is shifting from full-trace KL supervision toward more selective training paradigms. Recent OPD methods increasingly focus on selecting which trajectories to learn from, which tokens are most info

FI1 model #on-policy #distillation #optimization Read on arxiv →

arxiv · Jun 3 · bullish

Experience-Driven Dynamic Exits for LLMs with Reinforcement Learning

arXiv:2606.03113v1 Announce Type: new Abstract: Large Language Models suffer from slow autoregressive inference. While self-speculative decoding accelerates this process, its efficiency is hampered by static configurations like fixed exit layers and speculation lengths. We reframe this optimization

MEME2 models #optimization #reinforcement-learning #language-models Read on arxiv →

arxiv · Jun 3 · bullish

Before Fusion, Ask What to Keep: Contextual Calibration of Multimodal Signals

arXiv:2606.02679v1 Announce Type: new Abstract: Multimodal systems often benefit from combining information across language, sound, and visual streams, but this benefit is not guaranteed. A modality that is useful for one input may become distracting for another, and local feature responses within t

#multimodal #fusion #calibration Read on arxiv →

arxiv · Jun 2

How Much Orthogonalization Does Muon Need?

arXiv:2606.00371v1 Announce Type: new Abstract: Muon optimizers improve neural-network training by replacing ill-conditioned momentum updates with approximately semi-orthogonal updates. This motivates a practical question: how much orthogonalization does Muon actually require? We study this question

NAGPMA4 models · +1 #machine-learning #optimization #neural-networks Read on arxiv →

arxiv · Jun 2 · bullish

Revisiting Reinforcement Learning with Verifiable Rewards from a Contrastive Perspective

arXiv:2605.12969v3 Announce Type: replace-cross Abstract: Group Relative Policy Optimization (GRPO) is one of the most widely adopted RLVR algorithms for post-training large language models on reasoning tasks. We first show that GRPO admits an equivalent discriminative reformulation, in which policy

GRCO2 models #reinforcement-learning #language-models #optimization Read on arxiv →

arxiv · Jun 2 · bullish

Margin Adaptive DPO: Leveraging Reward Model for Granular Control in Preference Optimization

arXiv:2510.05342v2 Announce Type: replace-cross Abstract: Direct Preference Optimization (DPO) has emerged as a simple and effective method for aligning large language models. However, its reliance on a fixed temperature parameter leads to suboptimal training on diverse preference data, causing over

DIIP$\4 models · +1 #machine-learning #optimization #language-models Read on arxiv →

arxiv · Jun 2 · bullish

Efficient Test-time Inference for Generative Planning Models

arXiv:2606.00618v1 Announce Type: new Abstract: Generative models have emerged as a powerful paradigm for AI planning, yet their performance remains constrained by the training data distribution. One approach is to improve generated solutions during inference by scaling test-time compute. A more eff

GEHE2 models #planning #inference #optimization Read on arxiv →

arxiv · May 29

Gesture-Aware Indoor THz ISAC Systems for Adaptive Resource Allocation

arXiv:2605.29913v1 Announce Type: cross Abstract: This paper investigates a multi-user indoor integrated sensing and communication (ISAC) system operating in the terahertz (THz) band, designed for adaptive communication based on gesture recognition. Leveraging gesture tracking through an extended Ka

EX1 model #terahertz #gesture-recognition #optimization Read on arxiv →

arxiv · May 29

Calibrating Generative Models to Distributional Constraints

arXiv:2510.10020v4 Announce Type: replace-cross Abstract: Generative models frequently suffer miscalibration, wherein statistics of the sampling distribution, such as the fraction of generations in a given class, deviate from desired values. We frame calibration as a constrained optimization problem

#machine-learning #calibration #optimization Read on arxiv →

arxiv · May 29 · bullish

HARP: Hadamard-Preconditioned Adaptive Rotation Processor for Extreme LLM Quantization

arXiv:2605.29843v1 Announce Type: cross Abstract: Post-training quantization (PTQ) is essential for deploying LLMs under memory and bandwidth constraints. However, extreme low-bit quantization remains highly sensitive to activation outliers and anisotropic weight curvature. Existing incoherence-base

LL1 model #quantization #machine-learning #optimization Read on arxiv →

arxiv · May 28

Worker Disagreement Reveals Sharp Directions in Local SGD

arXiv:2605.27739v1 Announce Type: cross Abstract: Deep neural network training often exhibits highly anisotropic loss geometry, where a few sharp dominant Hessian directions coexist with a large flatter bulk. Gradients tend to align disproportionately with these dominant directions, although stable

MLCNTR3 models #machine-learning #deep-learning #optimization Read on arxiv →

arxiv · May 26 · bullish

Local MAP Sampling for Diffusion Models

arXiv:2510.07343v3 Announce Type: replace-cross Abstract: Diffusion Posterior Sampling (DPS) provides a principled Bayesian approach to inverse problems by sampling from $p(x_0 \mid y)$. While posterior sampling is valuable for capturing uncertainty and multi-modality, many classical and practical i

#image-restoration #scientific-applications #bayesian-inference Read on arxiv →

arxiv · May 26

Active Query Synthesis for Preference Learning

arXiv:2605.26072v1 Announce Type: new Abstract: Efficient learning of user preferences is crucial for many modern decision making systems but typically requires costly labeled data. Active learning reduces this cost, yet standard methods are computationally expensive due to pool-based evaluation. Fu

#active-learning #machine-learning #optimization Read on arxiv →

arxiv · May 25 · bullish

AGZO: Activation-Guided Zeroth-Order Optimization for LLM Fine-Tuning

arXiv:2601.17261v4 Announce Type: replace Abstract: Zeroth-Order (ZO) optimization has emerged as a promising solution for fine-tuning LLMs under strict memory constraints, as it avoids the prohibitive memory cost of storing activations for backpropagation. However, existing ZO methods typically emp

QWPA2 models #optimization #llms #fine-tuning Read on arxiv →

arxiv · May 25 · bullish

Transform-Invariant Generative Ray Path Sampling for Efficient Radio Propagation Modeling

arXiv:2603.01655v2 Announce Type: replace Abstract: Ray tracing has become a standard for accurate radio propagation modeling, but suffers from exponential computational complexity, as the number of candidate paths scales with the number of objects raised to the interaction order. This bottleneck li

GE1 model #machine-learning #signal-processing #optimization Read on arxiv →

arxiv · May 22 · bullish

Token-weighted Direct Preference Optimization with Attention

arXiv:2605.21883v1 Announce Type: new Abstract: Direct Preference Optimization (DPO) aligns Large Language Models with human preferences without the need for a separate reward model. However, DPO treats all tokens in responses equally, neglecting the differing importance of individual tokens. Existi

LA1 model #optimization #language-models #reinforcement-learning Read on arxiv →

arxiv · May 22 · bullish

Retrospective Sparse Attention for Efficient Long-Context Generation

arXiv:2508.09001v2 Announce Type: replace-cross Abstract: Large Language Models (LLMs) are increasingly deployed in long-context tasks such as reasoning, code generation, and multi-turn dialogue. However, inference over extended contexts is bottlenecked by the Key-Value (KV) cache, whose memory foot

#large-language-models #optimization #attention-mechanisms Read on arxiv →

arxiv · May 21 · bullish

From SGD to Muon: Adaptive Optimization via Schatten-p Norms

arXiv:2605.19781v1 Announce Type: new Abstract: Modern optimizers, like Muon, impose matrix-wise geometry constraints on their updates. These matrix-wise constraints can be unified under Linear Minimization Oracle (LMO) theory. However, all current methods impose fixed LMO geometries for the update

MUSGAD5 models · +2 #optimization #deep-learning #neural-networks Read on arxiv →

arxiv · May 21 · bullish

Fast and Featureless Node Representation Learning with Partial Pairwise Supervision

arXiv:2605.19916v1 Announce Type: cross Abstract: We introduce Contrastive FUSE, a fast and unified framework for scalable node representation learning in graphs with partially available pairwise node labels and no available node features. Unlike existing methods, we directly optimize a spectral con

CO1 model #machine-learning #graph-learning #optimization Read on arxiv →

arxiv · May 19 · bullish

Latent Heuristic Search: Continuous Optimization for Automated Algorithm Design

arXiv:2605.17137v1 Announce Type: new Abstract: The integration of Large Language Models (LLMs) into evolutionary frameworks has established a new paradigm for automated heuristic discovery. Despite their promise, these methods typically search in the discrete space of program syntax, relying on sto

LA1 model #optimization #automated-heuristic-discovery #evolutionary-frameworks Read on arxiv →

arxiv · May 19

Ready from Day 1: Population-Aware Coordination for Large-Scale Constrained Multi-Agent Systems

arXiv:2605.13900v2 Announce Type: replace-cross Abstract: In large-scale multi-agent systems with shared resource constraints, an upstream planner must iteratively evaluate candidate resource plans -- assessing feasibility, aggregate response, and marginal cost -- before committing to one. Lagrangia

#multi-agent #machine-learning #supply-chain Read on arxiv →

arxivMay 18

Preconditioned Regularized Wasserstein Proximal Sampling

arXiv:2509.01685v2 Announce Type: replace-cross Abstract: We consider sampling from a Gibbs distribution by evolving finitely many particles. We propose a preconditioned version of a recently proposed noise-free sampling method, governed by approximating the score function with the numerically tract

TR1 model #machine-learning #optimization #sampling-methods Read on arxiv →

arxivMay 18

Rethinking Neural Network Learning Rates: A Stackelberg Perspective

arXiv:2605.15530v1 Announce Type: new Abstract: Neural networks are typically trained with a single learning rate across all layers. While recent empirical evidence suggests that assigning layer-specific learning rates can accelerate training, a principled understanding of the conditions and mechani

#machine-learning #optimization #neural-networks Read on arxiv →

arxivMay 16bullish

Beyond What to Select: A Plug-and-play Oscillatory Data-Volume Scheduling for Efficient Model Training

arXiv:2605.14773v1 Announce Type: cross Abstract: Data selection accelerates training by identifying representative training data while preserving model performance. However, existing methods mainly focus on designing sample-importance criteria, i.e., deciding what to select, while typically fixing

#optimization #machine-learning #efficiency Read on arxiv →

arxivMay 16

Numerical exploration of the range of shape functionals using neural networks

arXiv:2602.14881v2 Announce Type: replace-cross Abstract: We introduce a novel numerical framework for the exploration of Blaschke–Santaló diagrams, which are efficient tools characterizing the possible inequalities relating some given shape functionals. We introduce a parametrization of convex b

IN1 model #optimization #neural-networks #geometry Read on arxiv →

arxivMay 16

Adapting Dijkstra for Buffers and Unlimited Transfers

arXiv:2603.11729v3 Announce Type: replace-cross Abstract: In recent years, RAPTOR based algorithms have been considered the state-of-the-art for path-finding with unlimited transfers without preprocessing. However, this status largely stems from the evolution of routing research, where Dijkstra-base

#routing #algorithms #optimization Read on arxiv →

arxivMay 15

To discretize continually: Mean shift interacting particle systems for Bayesian inference

arXiv:2605.14142v1 Announce Type: cross Abstract: Integration against a probability distribution given its unnormalized density is a central task in Bayesian inference and other fields. We introduce new methods for approximating such expectations with a small set of weighted samples -- i.e., a quadr

#machine-learning #bayesian-inference #sampling Read on arxiv →

arxivMay 15

Generative Bayesian Optimization: Generative Models as Acquisition Functions

arXiv:2510.25240v3 Announce Type: replace-cross Abstract: We present a general strategy for turning generative models into candidate solution samplers for batch Bayesian optimization (BO). The use of generative models for BO enables large batch scaling as generative sampling, optimization of non-con

#optimization #machine-learning #research Read on arxiv →

arxivMay 14bullish

ASAP: Amortized Doubly-Stochastic Attention via Sliced Dual Projection

arXiv:2605.12879v1 Announce Type: new Abstract: Doubly-stochastic attention has emerged as a transport-based alternative to row-softmax attention, with recent Transformer variants using it to reduce attention sinks and rank collapse while improving performance. In this family, the standard approach

SIAS2 models #transformer #attention #machine-learning Read on arxiv →

arxivMay 14bullish

Attention Once Is All You Need: Efficient Streaming Inference with Stateful Transformers

arXiv:2605.13784v1 Announce Type: new Abstract: Conventional transformer inference engines are request-driven, paying an O(n) prefill cost on every query. In streaming workloads, where data arrives continuously and queries probe an ever-growing context, this cost is prohibitive. We introduce a data-

VLSGTE3 models #streaming #inference #optimization Read on arxiv →

arxivMay 13

Constructive conditional normalizing flows

arXiv:2602.08606v3 Announce Type: replace-cross Abstract: Motivated by applications in conditional sampling, given a probability measure $\mu$ and a diffeomorphism $\phi$, we consider the problem of simultaneously approximating $\phi$ and the pushforward $\phi_{\#}\mu$ by means of the flow of a cont

PE1 model #optimization #machine learning #probability Read on arxiv →

arxivMay 11

A Rod Flow Model for Adam at the Edge of Stability

arXiv:2605.06821v1 Announce Type: cross Abstract: Cohen et al. (arXiv:2207.14484) observed that adaptive gradient methods such as Adam operate at the edge of stability. While there has been significant work on continuous-time modeling of gradient descent at the edge of stability, extending these mod

ADRMNA5 models · +2 #optimization #machine learning #momentum methods Read on arxiv →

arxivMay 11bullish

CommFuse: Hiding Tail Latency via Communication Decomposition and Fusion for Distributed LLM Training

arXiv:2604.24013v2 Announce Type: cross Abstract: The rapid growth in the size of large language models has necessitated the partitioning of computational workloads across accelerators such as GPUs, TPUs, and NPUs. However, these parallelization strategies incur substantial data communication overhe

#distributed-training #parallelization #optimization Read on arxiv →

arxivMay 8

Dynamic Controlled Variables Based Dynamic Self-Optimizing Control

arXiv:2605.06469v1 Announce Type: cross Abstract: Self-optimizing control is a strategy for selecting controlled variables, where the economic objective guides the selection and design of controlled variables, with the expectation that maintaining the controlled variables at constant values can achi

DE1 model #optimization #control #machine-learning Read on arxiv →

arxivMay 7bullish

Adaptive Ensemble Aggregation for Actor-Critics

arXiv:2507.23501v2 Announce Type: replace Abstract: Ensembles are ubiquitous in off-policy actor-critic learning, yet their efficacy depends critically on how they are aggregated. Current methods typically rely on static rules or task-specific hyperparameters to balance overestimation bias and varia

#reinforcement-learning #ensemble-methods #machine-learning Read on arxiv →

arxivMay 5bullish

FG²-GDN: Enhancing Long-Context Gated Delta Networks with Doubly Fine-Grained Control

arXiv:2604.19021v2 Announce Type: replace Abstract: Linear attention mechanisms have emerged as promising alternatives to softmax attention, offering linear-time complexity during inference. Recent advances such as Gated DeltaNet (GDN) and Kimi Delta Attention (KDA) have demonstrated that the delta

GAKIFG4 models · +1 #machine learning #attention mechanisms #optimization Read on arxiv →

arxivMay 4

A unified convergence theory for adaptive first-order methods in the nonconvex case, including AdaNorm, full and diagonal AdaGrad, Shampoo and Muo

arXiv:2604.17423v2 Announce Type: replace Abstract: A unified framework for first-order optimization algorithms for nonconvex unconstrained optimization is proposed that uses adaptively preconditioned gradients and includes popular methods such as full and diagonal AdaGrad, AdaNorm, as well as adaptive

ADADSH4 models · +1 #optimization #machine-learning #research Read on arxiv →

arxivMay 4bullish

Bridging Graph Drawing and Dimensionality Reduction with Stochastic Stress Optimization

arXiv:2605.00641v1 Announce Type: new Abstract: Both Dimensionality Reduction (DR) and Graph Drawing (GD) aim to visualize abstract, non-linear structures, yet rely on different optimization paradigms. This contrast is evident in Multidimensional Scaling (MDS), which typically depends on the SMACOF

#optimization #dimensionality-reduction #machine-learning Read on arxiv →

arxivMay 1bullish

Hinge Regression Tree: A Newton Method for Oblique Regression Tree Splitting

arXiv:2602.05371v3 Announce Type: replace Abstract: Oblique decision trees combine the transparency of trees with the power of multivariate decision boundaries, but learning high-quality oblique splits is NP-hard, and practical methods still rely on slow search or theory-free heuristics. We present

HI1 model #machine-learning #decision-trees #optimization Read on arxiv →

arxivApr 30bullish

Adaptive Scaling of Policy Constraints for Offline Reinforcement Learning

arXiv:2508.19900v2 Announce Type: replace Abstract: Offline reinforcement learning (RL) enables learning effective policies from fixed datasets without any environment interaction. Existing methods typically employ policy constraints to mitigate the distribution shift encountered during offline RL t

#offline-rl #reinforcement-learning #machine-learning Read on arxiv →

arxivApr 30bullish

Generative Bid Shading in Real-Time Bidding Advertising

arXiv:2508.06550v3 Announce Type: replace-cross Abstract: Bid shading plays a crucial role in Real-Time Bidding (RTB) by adaptively adjusting the bid to avoid advertisers overspending. Existing mainstream two-stage methods, which first model bid landscapes and then optimize surplus using operations

GEAUCH3 models #real-time-bidding #advertising #machine-learning Read on arxiv →

arxivApr 30bullish

Test-Time Safety Alignment

arXiv:2604.26167v1 Announce Type: cross Abstract: Recent work has shown that a model's input word embeddings can serve as effective control variables for steering its behavior toward outputs that satisfy desired properties. However, this has only been demonstrated for pretrained text-completion mode

#safety #language-models #optimization Read on arxiv →

arxivApr 29bullish

MTServe: Efficient Serving for Generative Recommendation Models with Hierarchical Caches

arXiv:2604.22881v1 Announce Type: cross Abstract: Generative recommendation (GR) offers superior modeling capabilities but suffers from prohibitive inference costs due to the repeated encoding of long user histories. While cross-request Key-Value (KV) cache reuse presents a significant optimization

#optimization #machine-learning #cache-management Read on arxiv →

arxivApr 29bullish

Accelerating Eigenvalue Dataset Generation via Chebyshev Subspace Filter

arXiv:2510.23215v2 Announce Type: replace-cross Abstract: Eigenvalue problems are among the most important topics in many scientific disciplines. With the recent surge and development of machine learning, neural eigenvalue methods have attracted significant attention as a forward pass of inference r

#machine-learning #eigenvalue-problems #numerical-analysis Read on arxiv →

arxivApr 29bullish

Dr. RTL: Autonomous Agentic RTL Optimization through Tool-Grounded Self-Improvement

arXiv:2604.14989v2 Announce Type: replace Abstract: Recent advances in large language models (LLMs) have sparked growing interest in automatic RTL optimization for better performance, power, and area (PPA). However, existing methods are still far from realistic RTL optimization. Their evaluation set

DR1 model #optimization #eda #rtl Read on arxiv →

arxivApr 27bullish

Robust Fuzzy local k-plane clustering with mixture distance of hinge loss and L1 norm

arXiv:2604.22405v1 Announce Type: new Abstract: K-plane clustering (KPC), hyperplane clustering, and mixture regression all essentially fall within the same class of problems. This problem can be conceptualized as clustering in relatively high-dimensional K subspaces or K linear manifolds. Tradition

RF1 model #clustering #machine-learning #robustness Read on arxiv →

arxivApr 27bullish

A general optimization solver based on OP-to-MaxSAT reduction

arXiv:2604.21961v1 Announce Type: cross Abstract: Optimization problems are fundamental in diverse fields, such as engineering, economics, and scientific computing. However, current algorithms are mostly designed for specific problem types and exhibit limited generality in solving multiple types of

#optimization #algorithm #research Read on arxiv →

arxivApr 23bullish

LoRA-FA: Efficient and Effective Low Rank Representation Fine-tuning

arXiv:2308.03303v2 Announce Type: replace Abstract: Fine-tuning large language models (LLMs) is crucial for improving their performance on downstream tasks, but full-parameter fine-tuning (Full-FT) is computationally expensive and memory-intensive. Parameter-efficient fine-tuning (PEFT) methods, suc

LOLO2 models #fine-tuning #language-models #optimization Read on arxiv →

arxivApr 23

LayerTracer: A Joint Task-Particle and Vulnerable-Layer Analysis framework for Arbitrary Large Language Model Architectures

arXiv:2604.20556v1 Announce Type: cross Abstract: Currently, Large Language Models (LLMs) feature a diversified architectural landscape, including traditional Transformer, GateDeltaNet, and Mamba. However, the evolutionary laws of hierarchical representations, task knowledge formation positions, and

TRGAMA3 models #large-language-models #architecture #interpretability Read on arxiv →

arxivApr 18bullish

Calibrate-Then-Delegate: Safety Monitoring with Risk and Budget Guarantees via Model Cascades

arXiv:2604.14251v1 Announce Type: new Abstract: Monitoring LLM safety at scale requires balancing cost and accuracy: a cheap latent-space probe can screen every input, but hard cases should be escalated to a more expensive expert. Existing cascades delegate based on probe uncertainty, but uncertaint

CA1 model #safety #machine-learning #optimization Read on arxiv →

arxivApr 18

Safe Reinforcement Learning using Action Projection: Safeguard the Policy or the Environment?

arXiv:2509.12833v2 Announce Type: replace Abstract: Projection-based safety filters, which modify unsafe actions by mapping them to the closest safe alternative, are widely used to enforce safety constraints in reinforcement learning (RL). Two integration strategies are commonly considered: Safe env

#reinforcement-learning #safety #optimization Read on arxiv →

arxivApr 17bullish

Preconditioned Test-Time Adaptation for Out-of-Distribution Debiasing in Narrative Generation

arXiv:2603.13683v2 Announce Type: replace Abstract: Although debiased large language models (LLMs) excel at handling known or low-bias prompts, they often fail on unfamiliar and high-bias prompts. We demonstrate via out-of-distribution (OOD) detection that these high-bias prompts cause a distributio

#debiasing #optimization #language-models Read on arxiv →

arxivApr 16bullish

Neural Two-Stage Stochastic Optimization for Solving Unit Commitment Problem

arXiv:2507.09503v4 Announce Type: replace-cross Abstract: This paper proposes a neural stochastic optimization method for efficiently solving the two-stage stochastic unit commitment (2S-SUC) problem under high-dimensional uncertainty scenarios. The proposed method approximates the second-stage reco

NE1 model #optimization #machine-learning #scalability Read on arxiv →