·
DataBubble
  • Home
  • Models
  • News
  • Compare
  • Boards
  • Pricing
  • About
  • Newsletter
  • Methodology
  • Contact
Latest
Startup Battlefield 200 applications officially close in 3 days2h◆Google will pay SpaceX $920M per month for compute3h◆The most interesting startups right now want to get you off your phone4h◆This is your laptop… on AI5h◆New York lawmakers pass one-year ban on new data centers6h◆The token bill comes due: Inside the industry scramble to manage AI’s runaway costs7h◆The latest AI news we announced in May 20267h◆The ‘together tech’ wave might be the most intriguing startup bet of 20268h◆This AI startup says it can tell if a script will make a hit film8h◆AirTrunk commits $30B to build 5GW of AI data centers in India9h◆The Meta hack shows there’s more to AI security than Mythos13h◆Mira Murati steps back into the spotlight, carefully17h◆SFMambaNet: Spectral-Frequency Enhanced Selective State Space Model for Correspondence Pruning18h◆Optical-Guided Neural Collapse for SAR Few-Shot Class Incremental Learning18h◆Dynamic Infilling Anchors for Format-Constrained Generation in Diffusion Large Language Models18h◆Temporal Order Matters for Agentic Memory: Segment Trees for Long-Horizon Agents18h◆Why Muon Outperforms Adam: A Curvature Perspective18h◆Vision Hopfield Memory Networks18h◆Provably Auditable and Safe LLM Agents from Human-Authored Ontologies18h◆FlexRank: Nested Low-Rank Knowledge Decomposition for Adaptive Model Deployment18h◆Startup Battlefield 200 applications officially close in 3 days2h◆Google will pay SpaceX $920M per month for compute3h◆The most interesting startups right now want to get you off your phone4h◆This is your laptop… on AI5h◆New York lawmakers pass one-year ban on new data centers6h◆The token bill comes due: Inside the industry scramble to manage AI’s runaway costs7h◆The latest AI news we announced in May 20267h◆The ‘together tech’ wave might be the most intriguing startup bet of 20268h◆This AI startup says it can tell if a script will make a hit film8h◆AirTrunk commits $30B to build 5GW of AI data centers in India9h◆The Meta hack shows there’s more to AI security than Mythos13h◆Mira Murati steps back into the spotlight, carefully17h◆SFMambaNet: Spectral-Frequency Enhanced Selective State Space Model for Correspondence Pruning18h◆Optical-Guided Neural Collapse for SAR Few-Shot Class Incremental Learning18h◆Dynamic Infilling Anchors for Format-Constrained Generation in Diffusion Large Language Models18h◆Temporal Order Matters for Agentic Memory: Segment Trees for Long-Horizon Agents18h◆Why Muon Outperforms Adam: A Curvature Perspective18h◆Vision Hopfield Memory Networks18h◆Provably Auditable and Safe LLM Agents from Human-Authored Ontologies18h◆FlexRank: Nested Low-Rank Knowledge Decomposition for Adaptive Model Deployment18h◆
DataBubble·

Model Detail

Qwen logo

Qwen2.5-Coder-32B-Instruct

—
Provider: QwenCategory: codePipeline: text-generationParameters: 32B
DB Score
0.6
Downloads
1.4M
Likes
2K
GitHub Stars
17K
Day
+0.0%
Week
+0.0%
Month
+0.0%
Overview

Qwen2.5-Coder-32B-Instruct is a code generation model with 32B parameters released by Qwen. The model is registered under the text-generation pipeline tag on Hugging Face, and supports text->text inputs, distributed under the permissive apache-2.0 license.

Performance

Open-LLM-Leaderboard scoring places it at MMLU-Pro 38, GPQA 13, IFEval 73, BBH 52, giving a sense of how it handles instruction following, reasoning, and graduate-level QA in absolute terms.

How we score this →
Pricing & Throughput

Qwen2.5-Coder-32B-Instruct is priced at $0.12/M input tokens and $0.3/M output tokens. Operationally the model offers a 33K-token context window, which matters when sizing it for prompt-heavy or latency-sensitive workloads. At this input rate the model sits in the commodity tier and is suitable for high-volume workloads where per-call cost dominates the decision.

Technical

Qwen2.5-Coder-32B-Instruct ships as a Qwen2ForCausalLM / 💬 chat models (RLHF, DPO, IFT, ...) architecture with 32B parameters. The published knowledge cutoff is 2024-06-30, so newer events will not be reflected in zero-shot answers without retrieval. Total weight footprint is approximately 32.8 GB, which is the relevant figure when planning local-inference VRAM. The apache-2.0 license is permissive, allowing commercial deployment and derivative work without per-seat fees, though attribution requirements still apply.

Use Cases

Qwen2.5-Coder-32B-Instruct is best fit for code completion, repository-scale Q&A, and pair-programming integrations, and high-volume batch jobs where per-call cost dominates the budget. It is a less obvious choice for one-shot generation of security-critical code without review. Treat this as a starting matrix rather than a benchmark verdict — the right deployment usually depends on the specific evaluation suite that mirrors your workload.

Download History
Pricing
Input ($/M tokens)
$0.12
Output ($/M tokens)
$0.3
Context Window
33K
Research Paper
arXiv: 2409.12186→
Benchmark Scores
IFEval
72.7
BBH
52.3
GPQA
13.2
MMLU-Pro
37.9
MATH
49.5
MUSR
13.7
Average
39.9
Model Info
Licenseapache-2.0
ArchitectureQwen2ForCausalLM
Type💬 chat models (RLHF, DPO, IFT, ...)
Modalitytext->text
Knowledge Cutoff2024-06-30
Recent newsView all news →
Related News
arxivneutral52d ago

Tuning Qwen2.5-VL to Improve Its Web Interaction Skills

arXiv:2604.09571v1 Announce Type: cross Abstract: Recent advances in vision-language models (VLMs) have sparked growing interest in using them to automate web tasks, yet their feasibility as independent agents that reason and act purely from visual input remains underexplored. We investigate this se

Related Models
Qwen logo
Qwen3-VL-2B-Instruct
Qwen · 22.5M downloads
Qwen logo
Qwen3-0.6B
Qwen · 22.2M downloads
sentence-transformers logo
all-MiniLM-L6-v2
SBERT · 254.9M downloads
nomic-ai logo
nomic-embed-text-v1.5
nomic-ai · 17.1M downloads
HomeModelsNews