·
DataBubble
  • Home
  • Models
  • News
  • Compare
  • Boards
  • Pricing
  • About
  • Newsletter
  • Methodology
  • Contact
Latest
Anthropic’s Claude is winning over paid consumers, a market owned by ChatGPT3h◆General Intuition’s $2.3B bet that video games can train AI agents for the real world3h◆Databricks’ former AI chief thinks he can cut AI’s power bill by 1,000x4h◆Which tokens does a hybrid model predict better?4h◆Our latest Google Finance upgrades, including a new app4h◆Netris raises $15M Series A from a16z to help AI neoclouds go live faster5h◆Repositioning retail for the AI era6h◆2 days left to save up to $190: Join 1,000+ founders and investors at TechCrunch Founder Summit6h◆Adobe acquires image and video enhancement tool maker Topaz Labs7h◆Amazon ups India bet with fresh $13B AI infrastructure investment8h◆Ford had to hire back former engineers to fix mistakes made by its automated systems8h◆Facebook’s Creator Studio has been revived as an AI companion app11h◆Can Aggregate Invariants Accelerate Continuous Subgraph Matching? Limits, Laws, and a Dynamic Spectral Index16h◆ScaleToT: Generalizing Structured LLM Reasoning for Billion-Scale Low-Activity User Modeling16h◆Critique of Agent Model16h◆LemonHarness Technical Report16h◆The Measurable Majority16h◆Fast and Slow Variational Continual Learning16h◆Real-Time Interactive Music Generation via Data-Free Streaming Consistency Distillation16h◆A specialized reasoning large language model for accelerating rare disease diagnosis: a randomized AI physician assistance trial16h◆Anthropic’s Claude is winning over paid consumers, a market owned by ChatGPT3h◆General Intuition’s $2.3B bet that video games can train AI agents for the real world3h◆Databricks’ former AI chief thinks he can cut AI’s power bill by 1,000x4h◆Which tokens does a hybrid model predict better?4h◆Our latest Google Finance upgrades, including a new app4h◆Netris raises $15M Series A from a16z to help AI neoclouds go live faster5h◆Repositioning retail for the AI era6h◆2 days left to save up to $190: Join 1,000+ founders and investors at TechCrunch Founder Summit6h◆Adobe acquires image and video enhancement tool maker Topaz Labs7h◆Amazon ups India bet with fresh $13B AI infrastructure investment8h◆Ford had to hire back former engineers to fix mistakes made by its automated systems8h◆Facebook’s Creator Studio has been revived as an AI companion app11h◆Can Aggregate Invariants Accelerate Continuous Subgraph Matching? Limits, Laws, and a Dynamic Spectral Index16h◆ScaleToT: Generalizing Structured LLM Reasoning for Billion-Scale Low-Activity User Modeling16h◆Critique of Agent Model16h◆LemonHarness Technical Report16h◆The Measurable Majority16h◆Fast and Slow Variational Continual Learning16h◆Real-Time Interactive Music Generation via Data-Free Streaming Consistency Distillation16h◆A specialized reasoning large language model for accelerating rare disease diagnosis: a randomized AI physician assistance trial16h◆
News/Minimax Optimal Variance-Aware Regret Bounds for Multinomial Logistic MDPs
arxiv
PublishedMay 21, 2026 at 4:00 AM
—neutral

Minimax Optimal Variance-Aware Regret Bounds for Multinomial Logistic MDPs

Source
arxiv.orgfull article ↗
Read on arxiv→
Publisher summary· verbatim

arXiv:2605.19768v1 Announce Type: new Abstract: We study reinforcement learning for episodic Markov Decision Processes (MDPs) whose transitions are modelled by a multinomial logistic (MNL) model. Existing algorithms for MNL mixture MDPs yield a regret of $\smash{\tilde{O}(dH^2\sqrt{T})}$ (Li et al.,

Stay posted· Newsletter

A 5-min weekly brief — top movers, price watch, story of the week.

// no spam · unsubscribe one-click · free forever

Discussion
Source
↗
arxiv
Read original ↗All from arxiv →

No replies yet. Be first.

Source
↗
arxiv
Read original ↗All from arxiv →

Related coverage

More from ARXIV
arxivCan Aggregate Invariants Accelerate Continuous Subgraph Matching? Limits, Laws, and a Dynamic Spectral Index16harxivScaleToT: Generalizing Structured LLM Reasoning for Billion-Scale Low-Activity User Modeling16harxivCritique of Agent Model16harxivLemonHarness Technical Report16h
The Bubble Brief
WEEKLY

Read AI insights every Tuesday — top movers, new releases, story of the week.

// no spam · unsubscribe one-click · free forever

Originally published on arxiv ↗
HomeModelsNews