Model Detail
VoxCPM2
VoxCPM2 is an audio model with 1.1B parameters released by openbmb. It is registered under the text-to-speech pipeline tag on Hugging Face and distributed under the permissive apache-2.0 license.
The total weight footprint of VoxCPM2 is approximately 2.3 GB. That is the baseline figure when planning VRAM for local inference; activations and runtime buffers add to it. The apache-2.0 license is permissive, allowing commercial deployment and derivative work without per-seat fees, though attribution requirements still apply.
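The relationship between parameter count and weight footprint can be sketched with simple arithmetic. The snippet below is a minimal illustration, not a measurement of the actual checkpoint: the helper name `weight_footprint_gb` is hypothetical, the byte sizes are the standard widths of each numeric format, and the storage format of the VoxCPM2 weights is an assumption.

```python
# Standard storage width, in bytes, of one parameter in each numeric format.
BYTES_PER_PARAM = {"float32": 4, "float16": 2, "bfloat16": 2, "int8": 1}


def weight_footprint_gb(num_params: float, dtype: str) -> float:
    """Return the raw weight size in decimal gigabytes (1 GB = 1e9 bytes).

    Hypothetical helper: it counts weights only and ignores activations,
    runtime buffers, and any non-weight files shipped with a checkpoint.
    """
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


if __name__ == "__main__":
    params = 1.1e9  # rounded parameter count quoted on this page
    for dtype in BYTES_PER_PARAM:
        print(f"{dtype}: {weight_footprint_gb(params, dtype):.1f} GB")
```

Under these assumptions, 1.1B parameters at 2 bytes each come to about 2.2 GB, which is close to the quoted 2.3 GB; the gap may come from rounding of the parameter count or from files other than the weights.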
Downloads of VoxCPM2 have moved +17.7% over the trailing seven days and +2288.6% over the trailing thirty days. The seven-day trend is moderately positive, while the thirty-day figure is a steep rise of the kind that typically follows a recent release or a very low starting baseline. These numbers are a signal, not a guarantee: download counts on Hugging Face also reflect mirror traffic, CI scrapes, and one-off benchmarking runs.
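Percentage changes of very different sizes are easier to compare as multipliers. The sketch below shows the conversion; the function names `growth_multiplier` and `implied_baseline` are hypothetical, and no actual download counts are assumed.

```python
def growth_multiplier(pct_change: float) -> float:
    """Convert a percent change (e.g. 17.7 for +17.7%) into new/old ratio."""
    return 1.0 + pct_change / 100.0


def implied_baseline(current: float, pct_change: float) -> float:
    """Return the earlier value implied by a current value and its percent change."""
    return current / growth_multiplier(pct_change)


if __name__ == "__main__":
    for label, pct in [("7-day", 17.7), ("30-day", 2288.6)]:
        print(f"{label}: x{growth_multiplier(pct):.3f}")
```

Read this way, +17.7% is a multiplier of about 1.18, while +2288.6% is a multiplier of about 23.9, meaning the starting count in the thirty-day comparison was roughly one twenty-fourth of the current one.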
Given its text-to-speech pipeline tag, VoxCPM2 is best suited to speech synthesis rather than speech recognition or transcription. Treat this as a starting point rather than a benchmark verdict: the right deployment choice usually depends on an evaluation suite that mirrors your workload.