arxiv
PublishedMay 14, 2026 at 4:00 AM
▲bullish
State-Centric Decision Process
Publisher summary· verbatim
arXiv:2605.12755v1 Announce Type: new Abstract: Language environments such as web browsers, code terminals, and interactive simulations emit raw text rather than states, and provide none of the runtime structure that MDP analysis requires. No explicit state space, no observation-to-state mapping, no
Stay posted· Newsletter
A 5-min weekly brief — top movers, price watch, story of the week.
Discussion
No replies yet. Be first.
Related coverage
More from ARXIV
arxivMODF-SIR: A Multi-agent Omni-modal Distilled Framework for Social Intelligence Reasoning4harxivPosition: Stop Anthropomorphizing Intermediate Tokens as Reasoning/Thinking Traces!4harxivGeneralizing Beyond Suboptimality: Offline Reinforcement Learning Learns Effective Scheduling through Random Solutions4harxivThe Impossibility of Eliciting Latent Knowledge4hThe Bubble Brief
WEEKLYRead planning insights every Tuesday — top movers, new releases, story of the week.
Originally published on arxiv ↗