·
DataBubble
  • Home
  • Models
  • News
  • Compare
  • Boards
  • Pricing
  • About
  • Newsletter
  • Methodology
  • Contact
Latest
Startup Battlefield 200 applications officially close in 3 days1h◆Google will pay SpaceX $920M per month for compute2h◆The most interesting startups right now want to get you off your phone3h◆This is your laptop… on AI4h◆New York lawmakers pass one-year ban on new data centers5h◆The token bill comes due: Inside the industry scramble to manage AI’s runaway costs6h◆The latest AI news we announced in May 20266h◆The ‘together tech’ wave might be the most intriguing startup bet of 20267h◆This AI startup says it can tell if a script will make a hit film7h◆AirTrunk commits $30B to build 5GW of AI data centers in India8h◆The Meta hack shows there’s more to AI security than Mythos12h◆Mira Murati steps back into the spotlight, carefully16h◆SFMambaNet: Spectral-Frequency Enhanced Selective State Space Model for Correspondence Pruning17h◆Optical-Guided Neural Collapse for SAR Few-Shot Class Incremental Learning17h◆Dynamic Infilling Anchors for Format-Constrained Generation in Diffusion Large Language Models17h◆Temporal Order Matters for Agentic Memory: Segment Trees for Long-Horizon Agents17h◆Why Muon Outperforms Adam: A Curvature Perspective17h◆Vision Hopfield Memory Networks17h◆Provably Auditable and Safe LLM Agents from Human-Authored Ontologies17h◆FlexRank: Nested Low-Rank Knowledge Decomposition for Adaptive Model Deployment17h◆Startup Battlefield 200 applications officially close in 3 days1h◆Google will pay SpaceX $920M per month for compute2h◆The most interesting startups right now want to get you off your phone3h◆This is your laptop… on AI4h◆New York lawmakers pass one-year ban on new data centers5h◆The token bill comes due: Inside the industry scramble to manage AI’s runaway costs6h◆The latest AI news we announced in May 20266h◆The ‘together tech’ wave might be the most intriguing startup bet of 20267h◆This AI startup says it can tell if a script will make a hit film7h◆AirTrunk commits $30B to build 5GW of AI data centers in India8h◆The Meta hack shows there’s more to AI security than Mythos12h◆Mira Murati steps back into the spotlight, carefully16h◆SFMambaNet: Spectral-Frequency Enhanced Selective State Space Model for Correspondence Pruning17h◆Optical-Guided Neural Collapse for SAR Few-Shot Class Incremental Learning17h◆Dynamic Infilling Anchors for Format-Constrained Generation in Diffusion Large Language Models17h◆Temporal Order Matters for Agentic Memory: Segment Trees for Long-Horizon Agents17h◆Why Muon Outperforms Adam: A Curvature Perspective17h◆Vision Hopfield Memory Networks17h◆Provably Auditable and Safe LLM Agents from Human-Authored Ontologies17h◆FlexRank: Nested Low-Rank Knowledge Decomposition for Adaptive Model Deployment17h◆
DataBubble·

Model Detail

pyannote logo

speaker-diarization-community-1

▼ 0.4%
Provider: pyannoteCategory: audioPipeline: automatic-speech-recognition
DB Score
22.6
Downloads
2.8M
Likes
475
Day
-0.4%
Week
+0.0%
Month
-0.2%
Overview

speaker-diarization-community-1 is an audio model released by pyannote. The model is registered under the automatic-speech-recognition pipeline tag on Hugging Face, distributed under the permissive cc-by-4.0 license.

Technical

The cc-by-4.0 license is permissive, allowing commercial deployment and derivative work without per-seat fees, though attribution requirements still apply.

Trending Signal

Downloads of speaker-diarization-community-1 have moved -0.4% over the past 24 hours, -0.2% over the trailing thirty days. That is a slight downtrend, consistent with normal cooling as newer models compete for the same workloads. These numbers are signal, not guarantee — week-over-week download counts on Hugging Face also reflect mirror traffic, CI scrapes, and one-off benchmarking runs.

Read about databubble_score →
Use Cases

speaker-diarization-community-1 is best fit for speech recognition, transcription, or speech synthesis depending on the task head. Treat this as a starting matrix rather than a benchmark verdict — the right deployment usually depends on the specific evaluation suite that mirrors your workload.

Download History
Research Paper
arXiv: 2104.03603→
Model Info
Licensecc-by-4.0
Citations152 (20 influential)
Recent newsView all news →
Related News
arxiv3d ago

Echo: A Joint-Embedding Predictive Architecture for Speaker Diarization and Speech Recognition in a Shared Latent Space

arXiv:2606.01909v1 Announce Type: cross Abstract: We present Echo, a proof-of-concept audio system built around a single 25 M-parameter ViT encoder. The encoder is pretrained with a JEPA objective and then specialised by stages to carry speaker identity, phonetic content, and dynamic source routing

arxivneutral4d ago

G-STAR: End-to-End Global Speaker-Tracking Attributed Recognition

arXiv:2603.10468v2 Announce Type: replace-cross Abstract: We study timestamped speaker-attributed automatic speech recognition (SA-ASR) for long-form, multi-party speech with overlap. In this setting, chunk-wise inference must preserve meeting-level speaker identity consistency while producing time-

arxiv4d ago

Targeted Speaker Poisoning Framework in Zero-Shot Text-to-Speech

arXiv:2603.07551v2 Announce Type: replace-cross Abstract: Zero-shot Text-to-Speech (TTS) voice cloning poses severe privacy risks, demanding the removal of specific speaker identities from trained TTS models. Conventional machine unlearning is insufficient in this context, as zero-shot TTS can dynam

arxiv7d ago

Survey of End-to-End Multi-Speaker Automatic Speech Recognition for Monaural Audio

arXiv:2505.10975v3 Announce Type: replace-cross Abstract: Monaural multi-speaker automatic speech recognition (ASR) remains challenging due to data scarcity and the intrinsic difficulty of recognizing and attributing words to individual speakers, particularly in overlapping speech. Recent advances h

arxiv7d ago

Explainable AI in Speaker Recognition -- Making Latent Representations Understandable

arXiv:2604.23354v2 Announce Type: replace-cross Abstract: Neural networks can be trained to learn task-relevant representations from data. Understanding how these networks make decisions falls within the Explainable AI (XAI) domain. This paper proposes to study an XAI topic: uncovering the unknown o

arxivneutral7d ago

Interactive In-Meeting Speaker Correction with Human Feedback

arXiv:2509.18377v2 Announce Type: replace Abstract: Most automatic speech processing systems operate in ``open loop'' mode without user feedback about who said what, yet human-in-the-loop workflows can potentially enable higher accuracy. We propose an LLM-assisted in-meeting speaker correction syste

Related Models
pyannote logo
speaker-diarization-3.1
pyannote · 9.5M downloads
pyannote logo
segmentation-3.0
pyannote · 8.4M downloads
hexgrad logo
Kokoro-82M
hexgrad · 13.8M downloads
coqui logo
XTTS-v2
coqui · 10.0M downloads
HomeModelsNews