arxiv Jul 21 bullish

Certified Training for Convolutional Perturbations

arXiv:2607.18195v1 Announce Type: cross Abstract: Vision models have been found to be susceptible to perturbations such as motion blur induced at runtime by a shaking camera. This impedes their deployment in critical applications since phenomena such as slightly blurred vision might lead to failures

#computer-vision #robustness #adversarial-training

arxiv Jul 3

Quantifying the Uncertainty of Blindly Estimated Room Embeddings Using a Dispersion-Calibrated Score

arXiv:2607.01527v1 Announce Type: cross Abstract: Room embeddings derived from reverberant speech are often unreliable: speech content and recording degradation can alter the representation even when speaker, room, and source-receiver geometry remain unchanged, degrading downstream task performance.

#speech-processing #machine-learning #audio

arxiv Jun 29 bullish

Not All Relations Rotate Alike: Transformation-Aware Decoupling for Viewpoint-Robust 3D Scene Graph Generation

arXiv:2606.27412v1 Announce Type: cross Abstract: 3D Scene Graph Generation (3DSGG) represents 3D scenes as structured object-relation-object graphs, providing a compact relational abstraction for spatial understanding. In embodied intelligence settings, the same 3D scene may be observed by agents f

TR1 model #computer-vision #3d-scene-graph #robustness

arxiv Jun 25

Alternate loss functions and regression models that achieve robustness to outliers by modulating the learning rate

arXiv:2606.22068v2 Announce Type: replace-cross Abstract: Most real-world datasets used for training supervised learning models are contaminated with noisy data and outliers leading to large prediction errors. This paper proposes a new approach for achieving robustness where the learning rate is mod

SQSM2 models #robustness #outliers #regression

arxiv Jun 18

Robust Detection of Planted Subgraphs in Semi-Random Models

arXiv:2508.02158v2 Announce Type: replace-cross Abstract: Detection of planted subgraphs in Erdős-Rényi random graphs has been extensively studied, leading to a rich body of results characterizing both statistical and computational thresholds. However, most prior work assumes a purely random gen

#graph-inference #semi-random-models #robustness

arxiv Jun 5 bullish

ADAPTOOD: Uncertainty-Aware Fine-Tuning for Out-of-Distribution ECG Time Series Models

arXiv:2606.04164v1 Announce Type: cross Abstract: Data samples used for training often differ from those encountered during fine-tuning and deployment, and while ML models show promise, their performance remains limited when only small annotated datasets are available. Performance often degrades und

#machine-learning #adaptation #robustness

arxiv May 29 bullish

From Meta-Thought to Execution: Cognitively Aligned Post-Training for Generalizable and Reliable LLM Reasoning

arXiv:2601.21909v2 Announce Type: replace Abstract: Current LLM post-training methods optimize complete reasoning trajectories through Supervised Fine-Tuning (SFT) followed by outcome-based Reinforcement Learning (RL). While effective, a closer examination reveals a fundamental gap: this approach do

#llm #reinforcement-learning #cognitive-architecture

arxiv May 29 bullish

AliMark: Enhancing Robustness of Sentence-Level Watermarking Against Text Paraphrasing

arXiv:2605.29434v1 Announce Type: cross Abstract: Existing sentence-level watermarking methods enhance robustness to paraphrasing by anchoring watermarks in sentence semantics. However, their prefix-based designs remain vulnerable to structural perturbations, such as sentence splitting and merging,

DIOP2 models #watermarking #paraphrasing #robustness

arxiv May 29 bullish

Recovering Policy-Induced Errors: Benchmarking and Trajectory Synthesis for Robust GUI Agents

arXiv:2605.29447v1 Announce Type: cross Abstract: While GUI agents have advanced rapidly, they often lack the robustness to recover from their own errors, hindering real-world deployment. To bridge this gap at both the evaluation and data levels, we introduce GUI-RobustEval and propose Robustness-dr

RORO2 models #gui #robustness #evaluation

arxiv May 25

Lipschitz Optimization for Formal Verification of Homographies

arXiv:2605.23203v1 Announce Type: cross Abstract: The adoption of vision neural networks in regulated industries requires formal robustness guarantees, especially in safety-critical domains such as healthcare, autonomous vehicles, and aerospace. However, current approaches are confined to incomplete

#computer-vision #safety #verification

arxiv May 22

Robust Reasoning Benchmark

arXiv:2604.08571v2 Announce Type: replace-cross Abstract: While Large Language Models (LLMs) achieve high performance on standard mathematical benchmarks, their problem-solving abilities depend on the context and textual formatting. We introduce the Robust Reasoning Benchmark (RRB), a pipeline of 13

CL1 model #benchmark #mathematical reasoning #large language models

arxiv May 19 bullish

UniAlign: A Model-Agnostic Framework for Robust Network Traffic Classification under Distribution Shifts

arXiv:2605.17575v1 Announce Type: cross Abstract: Network traffic classification (NTC) models often suffer severe performance degradation when deployed in real-world environments due to distribution shifts caused by changing network conditions. Existing robustness-enhancing approaches are commonly c

#network-traffic #classification #robustness

arxiv Apr 29

Out of Spuriousity: Improving Robustness to Spurious Correlations without Group Annotations

arXiv:2407.14974v2 Announce Type: replace-cross Abstract: Machine learning models are known to learn spurious correlations, i.e., features having strong relations with class labels but no causal relation. Relying on those correlations leads to poor performance in the data groups without these correl

#machine-learning #robustness #generalization

arxiv Apr 27 bullish

Robust Fuzzy local k-plane clustering with mixture distance of hinge loss and L1 norm

arXiv:2604.22405v1 Announce Type: new Abstract: K-plane clustering (KPC), hyperplane clustering, and mixture regression all essentially fall within the same class of problems. This problem can be conceptualized as clustering in relatively high-dimensional K subspaces or K linear manifolds. Tradition

RF1 model #clustering #machine-learning #robustness

arxiv Apr 17 bullish

Shuffle the Context: RoPE-Perturbed Self-Distillation for Long-Context Adaptation

arXiv:2604.14339v1 Announce Type: new Abstract: Large language models (LLMs) increasingly operate in settings that require reliable long-context understanding, such as retrieval-augmented generation and multi-document reasoning. A common strategy is to fine-tune pretrained short-context models at th

MEQW2 models #long-context #language-models #self-distillation

arxiv Apr 10

Corruption-robust Offline Multi-agent Reinforcement Learning From Human Feedback

arXiv:2603.28281v2 Announce Type: replace Abstract: We consider robustness against data corruption in offline multi-agent reinforcement learning from human feedback (MARLHF) under a strong-contamination model: given a dataset $D$ of trajectory-preference tuples (each preference being an $n$-dimensio

#machine-learning #reinforcement-learning #robustness

arxiv Apr 9 bearish

Non-identifiability of Explanations from Model Behavior in Deep Networks of Image Authenticity Judgments

arXiv:2604.07254v1 Announce Type: cross Abstract: Deep neural networks can predict human judgments, but this does not imply that they rely on human-like information or reveal the cues underlying those judgments. Prior work has addressed this issue using attribution heatmaps, but their explanatory va

VGEFBA3 models #computer-vision #machine-learning #explanability

arxiv Apr 6

Towards Realistic Class-Incremental Learning with Free-Flow Increments

arXiv:2604.02765v1 Announce Type: new Abstract: Class-incremental learning (CIL) is typically evaluated under predefined schedules with equal-sized tasks, leaving more realistic and complex cases unexplored. However, a practical CIL system should learn immediately when any number of new classes arr

#class-incremental-learning #machine-learning #robustness