Model Detail
GLM-5-FP8
—GLM-5-FP8 is a large language model with 377.0B parameters released by zai-org. The model is registered under the text-generation pipeline tag on Hugging Face, distributed under the permissive mit license.
GLM-5-FP8 ships with 377.0B parameters. Total weight footprint is approximately 753.9 GB, which is the relevant figure when planning local-inference VRAM. The mit license is permissive, allowing commercial deployment and derivative work without per-seat fees, though attribution requirements still apply.
GLM-5-FP8 is best fit for general-purpose chat and instruction-following workloads. Treat this as a starting matrix rather than a benchmark verdict — the right deployment usually depends on the specific evaluation suite that mirrors your workload.