DataBubble

Model Detail

gpt-oss-120b

Provider: OpenAI
Category: llm
Pipeline: text-generation
Parameters: 120B

DB Score

7.5

Downloads

4.4M

Likes

Day

+0.0%

Week

+10.2%

Month

+0.0%

Overview

gpt-oss-120b is a large language model with 120B parameters released by OpenAI. It is registered under the text-generation pipeline tag on Hugging Face, takes text as input and produces text as output, and is distributed under the permissive apache-2.0 license.

Performance

gpt-oss-120b has been evaluated on multiple task suites; the result listed on this page is SWE-Bench, where the model resolves 26.0% of tasks.

Pricing & Throughput

gpt-oss-120b is priced at $0.15 per million input tokens and $0.60 per million output tokens. The model offers a 131K-token context window, which bounds how much prompt and output fit in a single call and so matters when sizing it for prompt-heavy workloads. At this input rate the model sits in the commodity tier and suits high-volume workloads where per-call cost dominates the decision.
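To make the pricing arithmetic concrete, here is a minimal sketch that estimates spend at the listed rates and checks a request against the listed context window. The function names are hypothetical, the window is taken as 131,000 tokens because the page only says "131K", and the assumption that prompt and completion share one window should be checked against the serving provider.

```python
# Rates as listed on this page, in US dollars per million tokens.
INPUT_USD_PER_M = 0.15
OUTPUT_USD_PER_M = 0.60

# "131K" as listed; the exact token limit may differ (assumption).
CONTEXT_WINDOW_TOKENS = 131_000


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Return the cost in USD for the given token counts at the listed rates."""
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000


def fits_context(input_tokens: int, output_tokens: int) -> bool:
    """Return True if prompt plus completion stays within the listed window.

    Assumes both share a single window, which is common but not stated here.
    """
    return input_tokens + output_tokens <= CONTEXT_WINDOW_TOKENS


if __name__ == "__main__":
    # Hypothetical batch: 10,000 calls, 2,000 input and 500 output tokens each.
    calls = 10_000
    cost = estimate_cost_usd(calls * 2_000, calls * 500)
    print(f"Estimated batch cost: ${cost:.2f}")
    print("Single call fits window:", fits_context(2_000, 500))
```

In this hypothetical batch the input and output sides each contribute about half of the total, which shows how the fourfold higher output rate can dominate even when completions are much shorter than prompts.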

Technical

gpt-oss-120b ships with 120B parameters. The published knowledge cutoff is 2024-06-30, so the model will not reflect later events in its answers unless they are supplied through retrieval or the prompt. The total weight footprint is approximately 120.4 GB; treat this as the starting figure when planning memory for local inference, since KV cache and activations need additional headroom. The apache-2.0 license is permissive: it allows commercial deployment and derivative works without per-seat fees, provided the license text and attribution notices are retained.
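As a rough illustration of memory planning, the sketch below compares a weight footprint plus a headroom allowance against total accelerator memory. It takes the 120.4 GB figure listed on this page at face value; a differently packaged or quantized checkpoint would change that input. The 20% overhead default and the function name are assumptions for illustration, not measured values.

```python
LISTED_WEIGHTS_GB = 120.4  # weight footprint as listed on this page


def fits_in_memory(
    gpu_count: int,
    gb_per_gpu: float,
    weights_gb: float = LISTED_WEIGHTS_GB,
    overhead_fraction: float = 0.2,
) -> bool:
    """Rough check of weights plus headroom against total accelerator memory.

    overhead_fraction is an assumed allowance for KV cache, activations and
    runtime buffers; real overhead depends on batch size and context length.
    """
    required_gb = weights_gb * (1.0 + overhead_fraction)
    return gpu_count * gb_per_gpu >= required_gb


if __name__ == "__main__":
    # Hypothetical hardware configurations with 80 GB per accelerator.
    for count in (1, 2, 4):
        print(count, "x 80 GB:", fits_in_memory(count, 80.0))
```

Passing `weights_gb` explicitly keeps the check usable if the checkpoint you actually download has a different size from the listed one.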

Trending Signal

Downloads of gpt-oss-120b have risen 10.2% over the trailing seven days. The trend is mildly positive, consistent with a model that is being picked up incrementally rather than going viral. These numbers are a signal, not a guarantee: week-over-week download counts on Hugging Face also reflect mirror traffic, CI scrapes, and one-off benchmarking runs.
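A trailing-window change like the one quoted above is a simple percentage difference between two download totals. The sketch below shows that calculation; the download counts are made-up illustrative values, and the function names are hypothetical rather than part of any DataBubble or Hugging Face API.

```python
def percent_change(previous: int, current: int) -> float:
    """Return the percentage change from previous to current."""
    if previous <= 0:
        raise ValueError("previous must be positive to compute a percentage change")
    return (current - previous) / previous * 100.0


def format_change(pct: float) -> str:
    """Format with an explicit sign and one decimal place, as on this page."""
    return f"{pct:+.1f}%"


if __name__ == "__main__":
    # Hypothetical weekly totals, chosen only to illustrate the arithmetic.
    last_week, this_week = 1_000_000, 1_102_000
    print(format_change(percent_change(last_week, this_week)))
```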

Use Cases

gpt-oss-120b is best suited to general-purpose chat and instruction-following workloads, and to high-volume batch jobs where per-call cost dominates the budget. Treat this as a starting point rather than a benchmark verdict: the right deployment usually depends on an evaluation suite that mirrors your workload.

Pricing

Input ($/M tokens)

$0.15

Output ($/M tokens)

$0.60

Context Window

131K

Research Paper

arXiv: 2508.10925
