·
DataBubble
  • Home
  • Models
  • News
  • Compare
  • Boards
  • Pricing
  • About
  • Newsletter
  • Methodology
  • Contact
Latest
Pool’s new app turns your screenshots into something useful59m◆DoorDash’s new AI chatbot lets you order with prompts and photos2h◆Anthropic apologizes for invisible Claude Fable guardrails4h◆Google DeepMind is worried about what happens when millions of agents start to interact5h◆Deezer launches an AI music detector for other streaming services8h◆Opendoor’s India exit is fueling a bigger conversation about AI and outsourcing12h◆MODF-SIR: A Multi-agent Omni-modal Distilled Framework for Social Intelligence Reasoning12h◆Position: Stop Anthropomorphizing Intermediate Tokens as Reasoning/Thinking Traces!12h◆ARGUS: Stacked Multi-View Identity Mosaic Injection for Subject-Preserving Video Generation12h◆Generalizing Beyond Suboptimality: Offline Reinforcement Learning Learns Effective Scheduling through Random Solutions12h◆The Impossibility of Eliciting Latent Knowledge12h◆Mapping Scientific Literature with Large Language Models and Topic Modeling12h◆Grounding Computer Use Agents on Human Demonstrations12h◆Embodied-R1.5: Evolving Physical Intelligence via Embodied Foundation Models12h◆LSTM based IoT Device Identification12h◆StanceNakba Shared Task: Actor and Topic-Aware Stance Detection in Public Discourse12h◆Breaking the Ice: Analyzing Cold Start Latency in vLLM12h◆Modeling Complex Behaviors: Multi-Personality Composition and Dynamic Switching in Vision-Language Models12h◆DIRECT: When and Where Should You Allocate Test-Time Compute in Embodied Planners?12h◆Higher order PCA-like rotation-invariant features for detailed shape descriptors modulo rotation12h◆Pool’s new app turns your screenshots into something useful59m◆DoorDash’s new AI chatbot lets you order with prompts and photos2h◆Anthropic apologizes for invisible Claude Fable guardrails4h◆Google DeepMind is worried about what happens when millions of agents start to interact5h◆Deezer launches an AI music detector for other streaming services8h◆Opendoor’s India exit is fueling a bigger conversation about AI and outsourcing12h◆MODF-SIR: A Multi-agent Omni-modal Distilled Framework for Social Intelligence Reasoning12h◆Position: Stop Anthropomorphizing Intermediate Tokens as Reasoning/Thinking Traces!12h◆ARGUS: Stacked Multi-View Identity Mosaic Injection for Subject-Preserving Video Generation12h◆Generalizing Beyond Suboptimality: Offline Reinforcement Learning Learns Effective Scheduling through Random Solutions12h◆The Impossibility of Eliciting Latent Knowledge12h◆Mapping Scientific Literature with Large Language Models and Topic Modeling12h◆Grounding Computer Use Agents on Human Demonstrations12h◆Embodied-R1.5: Evolving Physical Intelligence via Embodied Foundation Models12h◆LSTM based IoT Device Identification12h◆StanceNakba Shared Task: Actor and Topic-Aware Stance Detection in Public Discourse12h◆Breaking the Ice: Analyzing Cold Start Latency in vLLM12h◆Modeling Complex Behaviors: Multi-Personality Composition and Dynamic Switching in Vision-Language Models12h◆DIRECT: When and Where Should You Allocate Test-Time Compute in Embodied Planners?12h◆Higher order PCA-like rotation-invariant features for detailed shape descriptors modulo rotation12h◆
News/BaltiVoice: A Speech Corpus and Fine-tuned Whisper ASR System for the Balti Language
arxiv
PublishedJune 11, 2026 at 4:00 AM

BaltiVoice: A Speech Corpus and Fine-tuned Whisper ASR System for the Balti Language

Source
arxiv.orgfull article ↗
Read on arxiv→
Publisher summary· verbatim

arXiv:2606.03504v2 Announce Type: replace-cross Abstract: We present BaltiVoice, a 16.8-hour read-speech corpus for Balti (ISO 639-3: bft), a Tibetic language spoken in Gilgit-Baltistan, Pakistan, with no prior publicly available ASR resources. The corpus contains 10,060 validated utterances in nati

Stay posted· Newsletter

A 5-min weekly brief — top movers, price watch, story of the week.

// no spam · unsubscribe one-click · free forever

Discussion
Source
↗
arxiv
Read original ↗All from arxiv →

No replies yet. Be first.

Source
↗
arxiv
Read original ↗All from arxiv →

Related coverage

More from ARXIV
arxivMODF-SIR: A Multi-agent Omni-modal Distilled Framework for Social Intelligence Reasoning12harxivPosition: Stop Anthropomorphizing Intermediate Tokens as Reasoning/Thinking Traces!12harxivARGUS: Stacked Multi-View Identity Mosaic Injection for Subject-Preserving Video Generation12harxivGeneralizing Beyond Suboptimality: Offline Reinforcement Learning Learns Effective Scheduling through Random Solutions12h
The Bubble Brief
WEEKLY

Read AI insights every Tuesday — top movers, new releases, story of the week.

// no spam · unsubscribe one-click · free forever

Originally published on arxiv ↗
HomeModelsNews