arXiv, May 1
MiniCPM-o 4.5: Towards Real-Time Full-Duplex Omni-Modal Interaction
arXiv:2604.27393v1 Announce Type: new
Abstract: Recent progress in multimodal large language models (MLLMs) has brought AI capabilities from static offline data processing to real-time streaming interaction, yet these models remain far from human-level multimodal interaction. The key bottlenecks are n