arxivJul 18bullish

DriftWorld: Fast World Modeling through Drifting

arXiv:2607.15065v1 Announce Type: cross Abstract: Predictive world models enable robots to plan by imagining the outcomes of their actions, but their value for control hinges on generating many rollouts quickly. This creates a bottleneck for diffusion-based world models: multistep sampling makes eac

DR1 model #robotics #world-models #diffusion Read on arxiv →

arxivJun 10bullish

Business World Model

arXiv:2606.10044v1 Announce Type: new Abstract: Businesses are increasingly adopting AI-enabled tools to improve productivity, reduce costs, and enhance products and services. However, the transformative potential of AI extends beyond automating predefined tasks: it lies in enabling intelligent syst

#artificial-intelligence #business #planning Read on arxiv →

arxivJun 2bullish

Efficient Test-time Inference for Generative Planning Models

arXiv:2606.00618v1 Announce Type: new Abstract: Generative models have emerged as a powerful paradigm for AI planning, yet their performance remains constrained by the training data distribution. One approach is to improve generated solutions during inference by scaling test-time compute. A more eff

GEHE2 models #planning #inference #optimization Read on arxiv →

arxivMay 29

ScheduleStream: Temporal Planning with Samplers for GPU-Accelerated Multi-Arm Task and Motion Planning & Scheduling

arXiv:2511.04758v2 Announce Type: replace-cross Abstract: Bimanual and humanoid robots are appealing because of their human-like ability to leverage multiple arms to efficiently complete tasks. However, controlling multiple arms at once is computationally challenging due to the growth in the hybrid

#robotics #planning #scheduling Read on arxiv →

arxivMay 14bullish

State-Centric Decision Process

arXiv:2605.12755v1 Announce Type: new Abstract: Language environments such as web browsers, code terminals, and interactive simulations emit raw text rather than states, and provide none of the runtime structure that MDP analysis requires. No explicit state space, no observation-to-state mapping, no

#planning #scientific-exploration #question-answering Read on arxiv →

arxivMay 11bullish

End-to-end PDDL Planning with Hardcoded and Dynamic Agents

arXiv:2512.09629v2 Announce Type: replace Abstract: We present an end-to-end framework for planning supported by verifiers. An orchestrator receives a human specification written in natural language and converts it into a PDDL (Planning Domain Definition Language) model, where the domain and problem

OPGPGP5 models · +2 #planning #natural-language-processing #large-language-models Read on arxiv →

arxivMay 6bullish

MINT: Minimal Information Neuro-Symbolic Tree for Objective-Driven Knowledge-Gap Reasoning and Active Elicitation

arXiv:2602.05048v2 Announce Type: replace Abstract: Joint planning through language-based interactions is a key area of human-AI teaming. Planning problems in the open world often involve various aspects of incomplete information and unknowns, e.g., objects involved, human goals/intents -- thus lead

MILL2 models #human-ai-teaming #planning #neuro-symbolic Read on arxiv →

arxivApr 18

Tight Sample Complexity Bounds for Best-Arm Identification Under Bounded Systematic Bias

arXiv:2604.14345v1 Announce Type: new Abstract: As search depth increases in autonomous reasoning and embodied planning, the candidate action space expands exponentially, heavily taxing computational budgets. While heuristic pruning is a common countermeasure, it operates without formal safety guara

LL1 model #autonomous-reasoning #planning #safety Read on arxiv →

arxivApr 3bearish

Can LLMs Perceive Time? An Empirical Investigation

arXiv:2604.00010v1 Announce Type: cross Abstract: Large language models cannot estimate how long their own tasks take. We investigate this limitation through four experiments across 68 tasks and four model families. Pre-task estimates overshoot actual duration by 4--7$\times$ ($p < 0.001$), with mod

GP1 model #language-models #benchmark #safety Read on arxiv →