DataBubble
arxiv
Published May 13, 2026 at 4:00 AM

DarkQA: Benchmarking Vision-Language Models on Visual-Primitive Question Answering in Low-Light Indoor Scenes

Source: arxiv.org
Publisher summary (verbatim)

arXiv:2512.24985v4 Announce Type: replace-cross Abstract: Vision Language Models (VLMs) are increasingly adopted as central reasoning modules for embodied agents. Existing benchmarks evaluate their capabilities under ideal, well-lit conditions, yet robust 24/7 operation demands performance under a w




Originally published on arxiv