Model Detail
XTTS-v2
XTTS-v2 is an audio model released by coqui. It is registered under the text-to-speech pipeline tag on Hugging Face and distributed under a license listed as "other".
The "other" tag indicates a custom license rather than one of the standard license identifiers on Hugging Face, so review the exact terms before commercial deployment.
Downloads of XTTS-v2 have changed by -0.1% over the past 24 hours, +5.5% over the trailing seven days, and +31.2% over the trailing thirty days. The trend is mildly positive, consistent with a model that is being picked up incrementally rather than going viral. Treat these numbers as a signal, not a guarantee: download counts on Hugging Face also reflect mirror traffic, CI scrapes, and one-off benchmarking runs.
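The three windows are easier to compare once they are put on the same per-day footing. The sketch below is one way to do that, assuming each percentage is a cumulative change that compounds evenly across its window; that assumption is ours, not something this page states about how the figures are computed. The helper name `daily_rate` and the `windows` mapping are hypothetical and exist only for illustration.

```python
# Hypothetical helper: convert a cumulative percentage change over a window
# into the per-day rate that would compound to the same change.
# Assumption: growth compounds evenly across the window.
def daily_rate(pct_change: float, days: int) -> float:
    """Return the per-day percentage rate equivalent to pct_change over days."""
    growth = 1.0 + pct_change / 100.0
    return (growth ** (1.0 / days) - 1.0) * 100.0


# Figures quoted in the paragraph above: (window length in days, % change).
windows = {"24h": (1, -0.1), "7d": (7, 5.5), "30d": (30, 31.2)}

rates = {name: daily_rate(pct, days) for name, (days, pct) in windows.items()}

for name, rate in rates.items():
    print(f"{name}: {rate:+.2f}% per day")
```

Under that assumption, the seven-day and thirty-day figures both normalise to a bit under one percent per day, which fits the "incremental pickup" reading, while the single-day dip is small enough to be ordinary day-to-day noise.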
XTTS-v2 is best suited to speech synthesis, in line with its text-to-speech pipeline tag; it is not a speech recognition or transcription model. Treat this as a starting point rather than a benchmark verdict: the right deployment usually depends on an evaluation suite that mirrors your workload.