DataBubble·

Model Detail

Qwen2.5-0.5B-Instruct

—

Provider: QwenCategory: llmPipeline: text-generationParameters: 0.5B

DB Score

27.8

Downloads

6.1M

Likes

478

Day

+0.0%

Week

+0.0%

Month

+0.0%

Overview

Qwen2.5-0.5B-Instruct is a large language model with 0.5B parameters released by Qwen. The model is registered under the text-generation pipeline tag on Hugging Face.

Performance

Open-LLM-Leaderboard scoring places it at MMLU-Pro 8, GPQA 1, IFEval 32, BBH 8, giving a sense of how it handles instruction following, reasoning, and graduate-level QA in absolute terms.

How we score this →

Technical

Qwen2.5-0.5B-Instruct ships as a Qwen2ForCausalLM / 💬 chat models (RLHF, DPO, IFT, ...) architecture with 0.5B parameters.

Use Cases

Qwen2.5-0.5B-Instruct is best fit for general-purpose chat and instruction-following workloads. Treat this as a starting matrix rather than a benchmark verdict — the right deployment usually depends on the specific evaluation suite that mirrors your workload.

Download History

Research Paper

arXiv: 2407.10671→

Benchmark Scores

IFEval

31.5

BBH

8.2

GPQA

1.2

MMLU-Pro

8.0

MATH

10.3

MUSR

1.4

Average

10.1

Model Info

ArchitectureQwen2ForCausalLM

Type💬 chat models (RLHF, DPO, IFT, ...)

Citations2,239 (268 influential)

Recent newsView all news →

System Report for CCL25-Eval Task 5: New Dataset and LoRA-Fine-Tuned Qwen2.5

arXiv:2606.12392v1 Announce Type: cross Abstract: Recently, large language models (LLMs) have achieved promising progress in the fields of classical Chinese translation and the generation of classical poetry. However, domain-specific research on precise translation and affective-semantic understandi

arxivneutral98d ago

Tuning Qwen2.5-VL to Improve Its Web Interaction Skills

arXiv:2604.09571v1 Announce Type: cross Abstract: Recent advances in vision-language models (VLMs) have sparked growing interest in using them to automate web tasks, yet their feasibility as independent agents that reason and act purely from visual input remains underexplored. We investigate this se

Related Models

Qwen3-0.6B

Qwen · 25.3M downloads

Qwen3-VL-2B-Instruct

Qwen · 22.5M downloads

bert-base-uncased

google-bert · 69.6M downloads

paraphrase-multilingual-MiniLM-L12-v2

SBERT · 48.6M downloads

DataBubble·

Model Detail

Qwen2.5-0.5B-Instruct

—

Provider: QwenCategory: llmPipeline: text-generationParameters: 0.5B

DB Score

27.8

Downloads

6.1M

Likes

478

Day

+0.0%

Week

+0.0%

Month

+0.0%

Overview

Qwen2.5-0.5B-Instruct is a large language model with 0.5B parameters released by Qwen. The model is registered under the text-generation pipeline tag on Hugging Face.

Performance

Open-LLM-Leaderboard scoring places it at MMLU-Pro 8, GPQA 1, IFEval 32, BBH 8, giving a sense of how it handles instruction following, reasoning, and graduate-level QA in absolute terms.

How we score this →

Technical

Qwen2.5-0.5B-Instruct ships as a Qwen2ForCausalLM / 💬 chat models (RLHF, DPO, IFT, ...) architecture with 0.5B parameters.

Use Cases

Download History

Research Paper

arXiv: 2407.10671→

Benchmark Scores

IFEval

31.5

BBH

8.2