DataBubble·
Model Detail
Qwen2.5-0.5B-Instruct
—Provider: QwenCategory: llmPipeline: text-generationParameters: 0.5B
DB Score
5.3
Downloads
6.1M
Likes
478
Day
+0.0%
Week
+0.0%
Month
+0.0%
Overview
Qwen2.5-0.5B-Instruct is a large language model with 0.5B parameters released by Qwen. The model is registered under the text-generation pipeline tag on Hugging Face.
Performance
Open-LLM-Leaderboard scoring places it at MMLU-Pro 8, GPQA 1, IFEval 32, BBH 8, giving a sense of how it handles instruction following, reasoning, and graduate-level QA in absolute terms.
Technical
Qwen2.5-0.5B-Instruct ships as a Qwen2ForCausalLM / 💬 chat models (RLHF, DPO, IFT, ...) architecture with 0.5B parameters.
Use Cases
Qwen2.5-0.5B-Instruct is best fit for general-purpose chat and instruction-following workloads. Treat this as a starting matrix rather than a benchmark verdict — the right deployment usually depends on the specific evaluation suite that mirrors your workload.
Download History
Research Paper
Benchmark Scores
IFEval
31.5
BBH
8.2
GPQA
1.2
MMLU-Pro
8.0
MATH
10.3
MUSR
1.4
Average
10.1
Model Info
ArchitectureQwen2ForCausalLM
Type💬 chat models (RLHF, DPO, IFT, ...)
Citations2,088 (257 influential)
Recent newsView all news →