DataBubble·

Model Detail

gpt2

—

Provider: openai-communityCategory: llmPipeline: text-generation

DB Score

24.7

Downloads

13.4M

Likes

Day

+0.0%

Week

+0.0%

Month

+0.0%

Overview

gpt2 is a large language model with 69M parameters released by openai-community. The model is registered under the text-generation pipeline tag on Hugging Face, distributed under the permissive mit license.

Performance

Open-LLM-Leaderboard scoring places it at MMLU-Pro 2, GPQA 1, IFEval 18, BBH 3, giving a sense of how it handles instruction following, reasoning, and graduate-level QA in absolute terms.

How we score this →

Technical

gpt2 ships as a GPT2LMHeadModel / 🟢 pretrained architecture with 69M parameters. The mit license is permissive, allowing commercial deployment and derivative work without per-seat fees, though attribution requirements still apply.

Use Cases

gpt2 is best fit for general-purpose chat and instruction-following workloads. Treat this as a starting matrix rather than a benchmark verdict — the right deployment usually depends on the specific evaluation suite that mirrors your workload.

Download History

Benchmark Scores

IFEval

17.8

BBH

2.8

GPQA

1.1

MMLU-Pro

1.8

MATH

0.5

MUSR

13.9

Average

6.3

Model Info

Licensemit

ArchitectureGPT2LMHeadModel

Type🟢 pretrained

Recent newsView all news →

Benchmarking EngGPT2-16B-A3B against Comparable Italian and International Open-source LLMs

arXiv:2605.07731v2 Announce Type: replace-cross Abstract: This report benchmarks the performance of ENGINEERING Ingegneria Informatica S.p.A.'s EngGPT2MoE-16B-A3B LLM, a 16B parameter Mixture of Experts (MoE) model with 3B active parameters. Performance is investigated across a wide variety of repre

arxiv112d ago

EngGPT2: Sovereign, Efficient and Open Intelligence

arXiv:2603.16430v3 Announce Type: replace-cross Abstract: EngGPT2-16B-A3B is the latest iteration of Engineering Group's Italian LLM and it's built to be a Sovereign, Efficient and Open model. EngGPT2 is trained on 2.5 trillion tokens - less than Qwen3's 36T or Llama3's 15T - and delivers performanc

huggingface1320d ago

From GPT2 to Stable Diffusion: Hugging Face arrives to the Elixir community

Related Models

bert-base-uncased

google-bert · 69.6M downloads

paraphrase-multilingual-MiniLM-L12-v2

SBERT · 48.6M downloads

DataBubble·

Model Detail

gpt2

—

Provider: openai-communityCategory: llmPipeline: text-generation

DB Score

24.7

Downloads

13.4M

Likes

Day

+0.0%

Week

+0.0%

Month

+0.0%

Overview

Performance

Open-LLM-Leaderboard scoring places it at MMLU-Pro 2, GPQA 1, IFEval 18, BBH 3, giving a sense of how it handles instruction following, reasoning, and graduate-level QA in absolute terms.

How we score this →

Technical

Use Cases

Download History

Benchmark Scores

IFEval

17.8

BBH

2.8

GPQA

1.1

MMLU-Pro