arxiv
PublishedApril 24, 2026 at 4:00 AM
—neutral
HiCrew: Hierarchical Reasoning for Long-Form Video Understanding via Question-Aware Multi-Agent Collaboration
Publisher summary· verbatim
arXiv:2604.21444v1 Announce Type: new Abstract: Long-form video understanding remains fundamentally challenged by pervasive spatiotemporal redundancy and intricate narrative dependencies that span extended temporal horizons. While recent structured representations compress visual information effecti
Discussion
No replies yet. Be first.
Originally published on arxiv ↗