·
DataBubble
  • Home
  • Models
  • News
  • Compare
  • Boards
  • Pricing
  • About
  • Newsletter
  • Methodology
  • Contact
Latest
Startup Battlefield 200 applications officially close in 3 days1h◆Google will pay SpaceX $920M per month for compute2h◆The most interesting startups right now want to get you off your phone3h◆This is your laptop… on AI4h◆New York lawmakers pass one-year ban on new data centers5h◆The token bill comes due: Inside the industry scramble to manage AI’s runaway costs6h◆The latest AI news we announced in May 20266h◆The ‘together tech’ wave might be the most intriguing startup bet of 20267h◆This AI startup says it can tell if a script will make a hit film7h◆AirTrunk commits $30B to build 5GW of AI data centers in India8h◆The Meta hack shows there’s more to AI security than Mythos12h◆Mira Murati steps back into the spotlight, carefully15h◆SFMambaNet: Spectral-Frequency Enhanced Selective State Space Model for Correspondence Pruning17h◆Optical-Guided Neural Collapse for SAR Few-Shot Class Incremental Learning17h◆Dynamic Infilling Anchors for Format-Constrained Generation in Diffusion Large Language Models17h◆Temporal Order Matters for Agentic Memory: Segment Trees for Long-Horizon Agents17h◆Why Muon Outperforms Adam: A Curvature Perspective17h◆Vision Hopfield Memory Networks17h◆Provably Auditable and Safe LLM Agents from Human-Authored Ontologies17h◆FlexRank: Nested Low-Rank Knowledge Decomposition for Adaptive Model Deployment17h◆Startup Battlefield 200 applications officially close in 3 days1h◆Google will pay SpaceX $920M per month for compute2h◆The most interesting startups right now want to get you off your phone3h◆This is your laptop… on AI4h◆New York lawmakers pass one-year ban on new data centers5h◆The token bill comes due: Inside the industry scramble to manage AI’s runaway costs6h◆The latest AI news we announced in May 20266h◆The ‘together tech’ wave might be the most intriguing startup bet of 20267h◆This AI startup says it can tell if a script will make a hit film7h◆AirTrunk commits $30B to build 5GW of AI data centers in India8h◆The Meta hack shows there’s more to AI security than Mythos12h◆Mira Murati steps back into the spotlight, carefully15h◆SFMambaNet: Spectral-Frequency Enhanced Selective State Space Model for Correspondence Pruning17h◆Optical-Guided Neural Collapse for SAR Few-Shot Class Incremental Learning17h◆Dynamic Infilling Anchors for Format-Constrained Generation in Diffusion Large Language Models17h◆Temporal Order Matters for Agentic Memory: Segment Trees for Long-Horizon Agents17h◆Why Muon Outperforms Adam: A Curvature Perspective17h◆Vision Hopfield Memory Networks17h◆Provably Auditable and Safe LLM Agents from Human-Authored Ontologies17h◆FlexRank: Nested Low-Rank Knowledge Decomposition for Adaptive Model Deployment17h◆
DataBubble·

Model Detail

Qwen logo

Qwen2.5-3B-Instruct

—
Provider: QwenCategory: llmPipeline: text-generationParameters: 3B
DB Score
31.2
Downloads
13.8M
Likes
486
Day
+0.0%
Week
+19.5%
Month
+46.8%
Overview

Qwen2.5-3B-Instruct is a large language model with 3B parameters released by Qwen. The model is registered under the text-generation pipeline tag on Hugging Face, distributed under a other license.

Performance

Open-LLM-Leaderboard scoring places it at MMLU-Pro 25, GPQA 3, IFEval 65, BBH 26, giving a sense of how it handles instruction following, reasoning, and graduate-level QA in absolute terms.

How we score this →
Technical

Qwen2.5-3B-Instruct ships as a Qwen2ForCausalLM / 💬 chat models (RLHF, DPO, IFT, ...) architecture with 3B parameters. Total weight footprint is approximately 3.1 GB, which is the relevant figure when planning local-inference VRAM. Distribution is governed by the other license — review the exact terms before commercial deployment.

Trending Signal

Downloads of Qwen2.5-3B-Instruct have moved +19.5% over the trailing seven days, +46.8% over the trailing thirty days. The trend is mildly positive, consistent with a model that is being picked up incrementally rather than going viral. These numbers are signal, not guarantee — week-over-week download counts on Hugging Face also reflect mirror traffic, CI scrapes, and one-off benchmarking runs.

Read about databubble_score →
Use Cases

Qwen2.5-3B-Instruct is best fit for general-purpose chat and instruction-following workloads. Treat this as a starting matrix rather than a benchmark verdict — the right deployment usually depends on the specific evaluation suite that mirrors your workload.

Download History
Research Paper
arXiv: 2407.10671→
Benchmark Scores
IFEval
64.7
BBH
25.8
GPQA
3.0
MMLU-Pro
25.1
MATH
36.8
MUSR
7.6
Average
27.2
Model Info
Licenseother
ArchitectureQwen2ForCausalLM
Type💬 chat models (RLHF, DPO, IFT, ...)
Citations2,088 (257 influential)
Recent newsView all news →
Related News
arxivneutral52d ago

Tuning Qwen2.5-VL to Improve Its Web Interaction Skills

arXiv:2604.09571v1 Announce Type: cross Abstract: Recent advances in vision-language models (VLMs) have sparked growing interest in using them to automate web tasks, yet their feasibility as independent agents that reason and act purely from visual input remains underexplored. We investigate this se

Related Models
Qwen logo
Qwen3-VL-2B-Instruct
Qwen · 22.5M downloads
Qwen logo
Qwen3-0.6B
Qwen · 21.2M downloads
google-bert logo
bert-base-uncased
google-bert · 69.6M downloads
sentence-transformers logo
paraphrase-multilingual-MiniLM-L12-v2
SBERT · 50.1M downloads
HomeModelsNews