arxivApril 7, 2026 at 4:00 AM1 min read
Voxtral Realtime
arXiv:2602.11298v3 Announce Type: replace Abstract: We introduce Voxtral Realtime, a natively streaming automatic speech recognition model that matches offline transcription quality at sub-second latency. Unlike approaches that adapt offline models through chunking or sliding windows, Voxtral Realti
No replies yet. Be first.