Model Detail
s2-pro
s2-pro is an audio model with 2.3B parameters released by fishaudio. On Hugging Face it is registered under the text-to-speech pipeline tag and distributed under a license tagged "other".
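The total weight footprint is approximately 4.6 GB, which works out to about 2 bytes per parameter for 2.3B parameters and is consistent with 16-bit weights. This is the baseline figure when planning local-inference VRAM; runtime memory for activations and buffers comes on top of it. Distribution is governed by the "other" license tag, so review the exact terms before commercial deployment.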
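The footprint arithmetic can be sketched as follows. This is a minimal illustration, not a measurement: the function name `weight_footprint_gb` is hypothetical, and the bytes-per-parameter values are assumptions about common storage precisions rather than anything stated on the model page.

```python
def weight_footprint_gb(num_params: float, bytes_per_param: float) -> float:
    """Estimate weight storage in decimal gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bytes_per_param / 1e9


# Parameter count taken from the model page.
S2_PRO_PARAMS = 2.3e9

# Assumed storage precisions (hypothetical; the page does not state the dtype).
PRECISIONS = {"fp32": 4, "fp16/bf16": 2, "int8": 1}

if __name__ == "__main__":
    for name, nbytes in PRECISIONS.items():
        gb = weight_footprint_gb(S2_PRO_PARAMS, nbytes)
        print(f"{name:>10}: ~{gb:.1f} GB of weights")
```

Under the 2-bytes-per-parameter assumption the estimate matches the roughly 4.6 GB quoted above. Treat it as a lower bound for VRAM planning, since inference also needs memory beyond the weights themselves.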
Downloads of s2-pro have moved +2.4% over the past 24 hours and +78.5% over the trailing thirty days. Both figures are positive, so they point to growing interest rather than a cooling trend. These numbers are a signal, not a guarantee: download counts on Hugging Face also reflect mirror traffic, CI scrapes, and one-off benchmarking runs.
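As a quick sanity check on how to read such percentages, the sketch below converts a percentage change into a growth factor and a direction label. The helper names `growth_factor` and `direction` are hypothetical, and the code assumes each percentage compares the current window against the preceding window of the same length, which the page does not state.

```python
def growth_factor(percent_change: float) -> float:
    """Convert a percentage change into a multiplier (e.g. +78.5% -> 1.785)."""
    return 1.0 + percent_change / 100.0


def direction(percent_change: float) -> str:
    """Label the sign of a percentage change."""
    if percent_change > 0:
        return "up"
    if percent_change < 0:
        return "down"
    return "flat"


if __name__ == "__main__":
    # Figures quoted on the model page.
    for window, pct in [("24 hours", 2.4), ("30 days", 78.5)]:
        print(f"{window}: {direction(pct)}, factor {growth_factor(pct):.3f}")
```

A factor above 1 means downloads rose over the window; a factor below 1 would be needed to describe a downtrend.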
Given its text-to-speech pipeline tag, s2-pro is best suited to speech synthesis; the tag does not indicate support for speech recognition or transcription. Treat this as a starting point rather than a benchmark verdict: the right deployment usually depends on an evaluation suite that mirrors your workload.