·
DataBubble
  • Home
  • Models
  • News
  • Compare
  • Boards
  • Pricing
  • About
  • Newsletter
  • Methodology
  • Contact
Latest
LLMs as Noisy Channels: A Shannon Perspective on Model Capacity and Scaling Laws9h◆Bridging AI and Clinical Reasoning: Abductive Explanations for Alignment on Critical Symptoms9h◆CHASD: Language Increment-Calibrated Contrastive Decoding against Hallucination in LVLMs9h◆Prudent-Banker: No Extra Fees for Baseline Safety in Adversarial Bandits With and Without Delays9h◆InfiGFusion: Graph-on-Logits Distillation via Efficient Gromov-Wasserstein for Model Fusion9h◆Representational Alignment with Chemical Induced Fit for Molecular Relational Learning9h◆One Policy, Infinite NPCs: Persona-Traceable Shared RL Policies for Scalable Game Agents9h◆RAG4Outcome: A Retrieval-Augmented Multimodal Framework for Prognostic Prediction in Chronic Osteomyelitis9h◆Uncovering the Latent Potential of Deep Intermediate Representations9h◆OpenSkillEval: Automatically Auditing the Open Skill Ecosystem for LLM Agents9h◆Beyond Log Likelihood: Probability-Based Objectives for Supervised Fine-Tuning across the Model Capability Continuum9h◆Debiased Negative Mining Improves Out-of-distribution Detection with Pre-trained Vision-Language Models9h◆FATHOMS-RAG: A Framework for the Assessment of Thinking and Observation in Multimodal Systems that use Retrieval Augmented Generation9h◆SCRIPT: Scalable Diffusion Policy with Multi-stage Training for Language-driven Physics-Based Humanoid Control9h◆Detecting Drunk Driving Using Off-the-Shelf Smartwatches9h◆Decision-Aware Quadratic ReLU Replacement for HE-Friendly Inference9h◆MAS-Orchestra: Understanding and Improving Multi-Agent Reasoning Through Holistic Orchestration and Controlled Benchmarks9h◆NeuroNL2LTL: A Neurosymbolic Framework for Natural Language Translation of Linear Temporal Logic9h◆Agentic Proving for Program Verification9h◆Co-ReAct: Rubrics as Step-Level Collaborators for ReAct Agents9h◆LLMs as Noisy Channels: A Shannon Perspective on Model Capacity and Scaling Laws9h◆Bridging AI and Clinical Reasoning: Abductive Explanations for Alignment on Critical Symptoms9h◆CHASD: Language Increment-Calibrated Contrastive Decoding against Hallucination in LVLMs9h◆Prudent-Banker: No Extra Fees for Baseline Safety in Adversarial Bandits With and Without Delays9h◆InfiGFusion: Graph-on-Logits Distillation via Efficient Gromov-Wasserstein for Model Fusion9h◆Representational Alignment with Chemical Induced Fit for Molecular Relational Learning9h◆One Policy, Infinite NPCs: Persona-Traceable Shared RL Policies for Scalable Game Agents9h◆RAG4Outcome: A Retrieval-Augmented Multimodal Framework for Prognostic Prediction in Chronic Osteomyelitis9h◆Uncovering the Latent Potential of Deep Intermediate Representations9h◆OpenSkillEval: Automatically Auditing the Open Skill Ecosystem for LLM Agents9h◆Beyond Log Likelihood: Probability-Based Objectives for Supervised Fine-Tuning across the Model Capability Continuum9h◆Debiased Negative Mining Improves Out-of-distribution Detection with Pre-trained Vision-Language Models9h◆FATHOMS-RAG: A Framework for the Assessment of Thinking and Observation in Multimodal Systems that use Retrieval Augmented Generation9h◆SCRIPT: Scalable Diffusion Policy with Multi-stage Training for Language-driven Physics-Based Humanoid Control9h◆Detecting Drunk Driving Using Off-the-Shelf Smartwatches9h◆Decision-Aware Quadratic ReLU Replacement for HE-Friendly Inference9h◆MAS-Orchestra: Understanding and Improving Multi-Agent Reasoning Through Holistic Orchestration and Controlled Benchmarks9h◆NeuroNL2LTL: A Neurosymbolic Framework for Natural Language Translation of Linear Temporal Logic9h◆Agentic Proving for Program Verification9h◆Co-ReAct: Rubrics as Step-Level Collaborators for ReAct Agents9h◆
News/Market-Bench: Benchmarking Large Language Models on Economic and Trade Competition
arxiv
PublishedApril 21, 2026 at 4:00 AM

Market-Bench: Benchmarking Large Language Models on Economic and Trade Competition

Source
arxiv.orgfull article ↗
Read on arxiv→
Publisher summary· verbatim

arXiv:2604.05523v2 Announce Type: replace Abstract: The ability of large language models (LLMs) to manage and acquire economic resources remains unclear. In this paper, we introduce \textbf{Market-Bench}, a comprehensive benchmark that evaluates the capabilities of LLMs in economically-relevant task

Stay posted· Newsletter

A 5-min weekly brief — top movers, price watch, story of the week.

// no spam · unsubscribe one-click · free forever

Discussion
Source
↗
arxiv
Read original ↗All from arxiv →

No replies yet. Be first.

Source
↗
arxiv
Read original ↗All from arxiv →

Related coverage

More from ARXIV
arxivLLMs as Noisy Channels: A Shannon Perspective on Model Capacity and Scaling Laws9harxivBridging AI and Clinical Reasoning: Abductive Explanations for Alignment on Critical Symptoms9harxivCHASD: Language Increment-Calibrated Contrastive Decoding against Hallucination in LVLMs9harxivPrudent-Banker: No Extra Fees for Baseline Safety in Adversarial Bandits With and Without Delays9h
The Bubble Brief
WEEKLY

Read AI insights every Tuesday — top movers, new releases, story of the week.

// no spam · unsubscribe one-click · free forever

Originally published on arxiv ↗
HomeModelsNews