arxiv
PublishedApril 27, 2026 at 4:00 AM
Cross-Stage Coherence in Hierarchical Driving VQA: Explicit Baselines and Learned Gated Context Projectors
Publisher summary· verbatim
arXiv:2604.22560v1 Announce Type: cross Abstract: Graph Visual Question Answering (GVQA) for autonomous driving organizes reasoning into ordered stages, namely Perception, Prediction, and Planning, where planning decisions should remain consistent with the model's own perception. We present a compar
Stay posted· Newsletter
A 5-min weekly brief — top movers, price watch, story of the week.
Discussion
No replies yet. Be first.
Related coverage
More from ARXIV
arxivLoHoSearch: Benchmarking Long-Horizon Search Agents Beyond the Human Difficulty Ceiling8harxivAPPO: Agentic Procedural Policy Optimization8harxivALIGNBEAM : Inference-Time Alignment Transfer via Cross-Vocabulary Logit Mixing8harxivGrounding Computer Use Agents on Human Demonstrations8hThe Bubble Brief
WEEKLYRead AI insights every Tuesday — top movers, new releases, story of the week.
Originally published on arxiv ↗