·
DataBubble
  • Home
  • Models
  • News
  • Compare
  • Boards
  • Pricing
  • About
  • Newsletter
  • Methodology
  • Contact
Latest
Startup Battlefield 200 applications officially close in 3 days1h◆Google will pay SpaceX $920M per month for compute2h◆The most interesting startups right now want to get you off your phone3h◆This is your laptop… on AI4h◆New York lawmakers pass one-year ban on new data centers5h◆The token bill comes due: Inside the industry scramble to manage AI’s runaway costs6h◆The latest AI news we announced in May 20266h◆The ‘together tech’ wave might be the most intriguing startup bet of 20267h◆This AI startup says it can tell if a script will make a hit film7h◆AirTrunk commits $30B to build 5GW of AI data centers in India8h◆The Meta hack shows there’s more to AI security than Mythos12h◆Mira Murati steps back into the spotlight, carefully15h◆SFMambaNet: Spectral-Frequency Enhanced Selective State Space Model for Correspondence Pruning17h◆Optical-Guided Neural Collapse for SAR Few-Shot Class Incremental Learning17h◆Dynamic Infilling Anchors for Format-Constrained Generation in Diffusion Large Language Models17h◆Temporal Order Matters for Agentic Memory: Segment Trees for Long-Horizon Agents17h◆Why Muon Outperforms Adam: A Curvature Perspective17h◆Vision Hopfield Memory Networks17h◆Provably Auditable and Safe LLM Agents from Human-Authored Ontologies17h◆FlexRank: Nested Low-Rank Knowledge Decomposition for Adaptive Model Deployment17h◆Startup Battlefield 200 applications officially close in 3 days1h◆Google will pay SpaceX $920M per month for compute2h◆The most interesting startups right now want to get you off your phone3h◆This is your laptop… on AI4h◆New York lawmakers pass one-year ban on new data centers5h◆The token bill comes due: Inside the industry scramble to manage AI’s runaway costs6h◆The latest AI news we announced in May 20266h◆The ‘together tech’ wave might be the most intriguing startup bet of 20267h◆This AI startup says it can tell if a script will make a hit film7h◆AirTrunk commits $30B to build 5GW of AI data centers in India8h◆The Meta hack shows there’s more to AI security than Mythos12h◆Mira Murati steps back into the spotlight, carefully15h◆SFMambaNet: Spectral-Frequency Enhanced Selective State Space Model for Correspondence Pruning17h◆Optical-Guided Neural Collapse for SAR Few-Shot Class Incremental Learning17h◆Dynamic Infilling Anchors for Format-Constrained Generation in Diffusion Large Language Models17h◆Temporal Order Matters for Agentic Memory: Segment Trees for Long-Horizon Agents17h◆Why Muon Outperforms Adam: A Curvature Perspective17h◆Vision Hopfield Memory Networks17h◆Provably Auditable and Safe LLM Agents from Human-Authored Ontologies17h◆FlexRank: Nested Low-Rank Knowledge Decomposition for Adaptive Model Deployment17h◆
DataBubble·

Model Detail

meta-llama logo

Llama-3.3-70B-Instruct

▲ 3.0%
Provider: MetaCategory: llmPipeline: text-generationParameters: 70B
DB Score
34.4
Downloads
875K
Likes
3K
GitHub Stars
29K
Day
+3.0%
Week
+0.0%
Month
+81.9%
Overview

Llama-3.3-70B-Instruct is a large language model with 70B parameters released by Meta. The model is registered under the text-generation pipeline tag on Hugging Face, and supports text->text inputs, released under the llama3.3 license.

Performance

Open-LLM-Leaderboard scoring places it at MMLU-Pro 48, GPQA 11, IFEval 90, BBH 57, giving a sense of how it handles instruction following, reasoning, and graduate-level QA in absolute terms.

How we score this →
Pricing & Throughput

Llama-3.3-70B-Instruct is priced at $0.71/M input tokens and $0.71/M output tokens. Operationally the model offers a 131K-token context window, which matters when sizing it for prompt-heavy or latency-sensitive workloads. Pricing in this range is the working middle of the API market — neither the cheapest nor the most expensive option per token, so cost-fit is usually a function of how much output you generate.

Technical

Llama-3.3-70B-Instruct ships as a LlamaForCausalLM / 💬 chat models (RLHF, DPO, IFT, ...) architecture with 70B parameters. The published knowledge cutoff is 2023-12-31, so newer events will not be reflected in zero-shot answers without retrieval. Total weight footprint is approximately 70.6 GB, which is the relevant figure when planning local-inference VRAM. Access is gated on Hugging Face under the llama3.3 license, which means a manual approval step before weights can be downloaded.

Trending Signal

Downloads of Llama-3.3-70B-Instruct have moved +3.0% over the past 24 hours, +81.9% over the trailing thirty days. That is a slight downtrend, consistent with normal cooling as newer models compete for the same workloads. These numbers are signal, not guarantee — week-over-week download counts on Hugging Face also reflect mirror traffic, CI scrapes, and one-off benchmarking runs.

Read about databubble_score →
Use Cases

Llama-3.3-70B-Instruct is best fit for general-purpose chat and instruction-following workloads. Treat this as a starting matrix rather than a benchmark verdict — the right deployment usually depends on the specific evaluation suite that mirrors your workload.

Download History
Pricing
Input ($/M tokens)
$0.71
Output ($/M tokens)
$0.71
Context Window
131K
Research Paper
arXiv: 2204.05149→
Benchmark Scores
IFEval
90.0
BBH
56.6
GPQA
10.5
MMLU-Pro
48.1
MATH
48.3
MUSR
15.6
Average
44.8
Model Info
Licensellama3.3
ArchitectureLlamaForCausalLM
Type💬 chat models (RLHF, DPO, IFT, ...)
Modalitytext->text
Knowledge Cutoff2023-12-31
Citations15,603 (2948 influential)
Recent newsView all news →
Related News
arxiv11d ago

LLAMA LIMA: A Living Meta-Analysis on the Effects of Generative AI on Learning Mathematics

arXiv:2601.18685v3 Announce Type: replace-cross Abstract: The capabilities of generative AI in mathematics education are rapidly evolving, posing significant challenges for research to keep pace. Research syntheses remain scarce and risk being outdated by the time of publication. To address this iss

arxivneutral14d ago

Llamas on the Web: Memory-Efficient, Performance-Portable, and Multi-Precision LLM Inference with WebGPU

arXiv:2605.20706v1 Announce Type: cross Abstract: Running language models in the browser presents a unique opportunity to build efficient, private, and portable AI applications, but requires contending with constrained memory availability and heterogeneous hardware targets. To realize this opportuni

arxiv16d ago

From Llama to Cria: Scaling Down Neural Networks via Neuron-Level Spectral Structural Importance Evaluation

arXiv:2605.18860v1 Announce Type: new Abstract: This paper proposes a neuron pruning framework based on neuron-level spectral structural importance evaluation. Given a trained neural network, we record the hidden states of each hidden layer during inference and model neurons as graph nodes, with hid

arxiv29d ago

Fragile Knowledge, Robust Instruction-Following: The Width Pruning Dichotomy in Llama-3.2

arXiv:2512.22671v2 Announce Type: replace Abstract: Structured width pruning of GLU-MLP layers, guided by the Maximum Absolute Weight (MAW) criterion, reveals a systematic dichotomy in how reducing the expansion ratio affects different model capabilities. While performance on tasks relying on parame

arxiv30d ago

Arithmetic in the Wild: Llama uses Base-10 Addition to Reason About Cyclic Concepts

arXiv:2605.01148v1 Announce Type: new Abstract: Does structure in representations imply structure in computation? We study how Llama-3.1-8B reasons over cyclic concepts (e.g., "what month is six months after August?"). Even though Llama-3.1-8B's representations for these concepts are circularly stru

arxivneutral36d ago

A Practice of Post-Training on Llama-3 70B with Optimal Selection of Additional Language Mixture Ratio

arXiv:2409.06624v4 Announce Type: replace-cross Abstract: Large Language Models (LLM) often need to be Continual Pre-Trained (CPT) to obtain unfamiliar language skills or adapt to new domains. The huge training cost of CPT often asks for cautious choice of key hyper-parameters such as the mixture ra

Related Models
meta-llama logo
Llama-3.1-8B-Instruct
Meta · 10.7M downloads
meta-llama logo
Llama-3.2-1B-Instruct
Meta · 8.2M downloads
google-bert logo
bert-base-uncased
google-bert · 69.6M downloads
sentence-transformers logo
paraphrase-multilingual-MiniLM-L12-v2
SBERT · 50.1M downloads
HomeModelsNews