arxiv
Published June 10, 2026 at 4:00 AM
Swivuriso: The South African Next Voices Multilingual Speech Dataset
Publisher summary (verbatim)
arXiv:2512.02201v3 Announce Type: replace Abstract: This paper introduces Swivuriso, a 3000-hour multilingual speech dataset developed as part of the African Next Voices project, to support the development and benchmarking of automatic speech recognition (ASR) technologies in seven South African lan
Originally published on arxiv ↗