·
DataBubble
  • Home
  • Models
  • News
  • Compare
  • Boards
  • Pricing
  • About
  • Newsletter
  • Methodology
  • Contact
Latest
Startup Battlefield 200 applications officially close in 3 days2h◆Google will pay SpaceX $920M per month for compute3h◆The most interesting startups right now want to get you off your phone4h◆This is your laptop… on AI5h◆New York lawmakers pass one-year ban on new data centers6h◆The token bill comes due: Inside the industry scramble to manage AI’s runaway costs7h◆The latest AI news we announced in May 20267h◆The ‘together tech’ wave might be the most intriguing startup bet of 20268h◆This AI startup says it can tell if a script will make a hit film8h◆AirTrunk commits $30B to build 5GW of AI data centers in India8h◆The Meta hack shows there’s more to AI security than Mythos13h◆Mira Murati steps back into the spotlight, carefully16h◆SFMambaNet: Spectral-Frequency Enhanced Selective State Space Model for Correspondence Pruning18h◆Optical-Guided Neural Collapse for SAR Few-Shot Class Incremental Learning18h◆Dynamic Infilling Anchors for Format-Constrained Generation in Diffusion Large Language Models18h◆Temporal Order Matters for Agentic Memory: Segment Trees for Long-Horizon Agents18h◆Why Muon Outperforms Adam: A Curvature Perspective18h◆Vision Hopfield Memory Networks18h◆Provably Auditable and Safe LLM Agents from Human-Authored Ontologies18h◆FlexRank: Nested Low-Rank Knowledge Decomposition for Adaptive Model Deployment18h◆Startup Battlefield 200 applications officially close in 3 days2h◆Google will pay SpaceX $920M per month for compute3h◆The most interesting startups right now want to get you off your phone4h◆This is your laptop… on AI5h◆New York lawmakers pass one-year ban on new data centers6h◆The token bill comes due: Inside the industry scramble to manage AI’s runaway costs7h◆The latest AI news we announced in May 20267h◆The ‘together tech’ wave might be the most intriguing startup bet of 20268h◆This AI startup says it can tell if a script will make a hit film8h◆AirTrunk commits $30B to build 5GW of AI data centers in India8h◆The Meta hack shows there’s more to AI security than Mythos13h◆Mira Murati steps back into the spotlight, carefully16h◆SFMambaNet: Spectral-Frequency Enhanced Selective State Space Model for Correspondence Pruning18h◆Optical-Guided Neural Collapse for SAR Few-Shot Class Incremental Learning18h◆Dynamic Infilling Anchors for Format-Constrained Generation in Diffusion Large Language Models18h◆Temporal Order Matters for Agentic Memory: Segment Trees for Long-Horizon Agents18h◆Why Muon Outperforms Adam: A Curvature Perspective18h◆Vision Hopfield Memory Networks18h◆Provably Auditable and Safe LLM Agents from Human-Authored Ontologies18h◆FlexRank: Nested Low-Rank Knowledge Decomposition for Adaptive Model Deployment18h◆
DataBubble·

Model Detail

meta-llama logo

Meta-Llama-3-8B-Instruct

—
Provider: MetaCategory: llmPipeline: text-generationParameters: 8B
DB Score
14.9
Downloads
1.7M
Likes
5K
Day
+0.0%
Week
+0.0%
Month
+28.3%
Overview

Meta-Llama-3-8B-Instruct is a large language model with 8B parameters released by Meta. The model is registered under the text-generation pipeline tag on Hugging Face, and supports text->text inputs, released under the llama3 license.

Performance

Open-LLM-Leaderboard scoring places it at MMLU-Pro 29, GPQA 6, IFEval 48, BBH 27, giving a sense of how it handles instruction following, reasoning, and graduate-level QA in absolute terms.

How we score this →
Pricing & Throughput

Meta-Llama-3-8B-Instruct is priced at $0.15/M input tokens and $0.15/M output tokens. Operationally the model offers a 8K-token context window, which matters when sizing it for prompt-heavy or latency-sensitive workloads. At this input rate the model sits in the commodity tier and is suitable for high-volume workloads where per-call cost dominates the decision.

Technical

Meta-Llama-3-8B-Instruct ships as a LlamaForCausalLM / 💬 chat models (RLHF, DPO, IFT, ...) architecture with 8B parameters. The published knowledge cutoff is 2023-12-31, so newer events will not be reflected in zero-shot answers without retrieval. Total weight footprint is approximately 8.0 GB, which is the relevant figure when planning local-inference VRAM. Access is gated on Hugging Face under the llama3 license, which means a manual approval step before weights can be downloaded.

Trending Signal

Downloads of Meta-Llama-3-8B-Instruct have moved +28.3% over the trailing thirty days. That is a slight downtrend, consistent with normal cooling as newer models compete for the same workloads. These numbers are signal, not guarantee — week-over-week download counts on Hugging Face also reflect mirror traffic, CI scrapes, and one-off benchmarking runs.

Read about databubble_score →
Use Cases

Meta-Llama-3-8B-Instruct is best fit for general-purpose chat and instruction-following workloads, and high-volume batch jobs where per-call cost dominates the budget. Treat this as a starting matrix rather than a benchmark verdict — the right deployment usually depends on the specific evaluation suite that mirrors your workload.

Download History
Pricing
Input ($/M tokens)
$0.15
Output ($/M tokens)
$0.15
Context Window
8K
Research Paper
arXiv: 2407.21783→
Benchmark Scores
IFEval
47.8
BBH
26.8
GPQA
5.7
MMLU-Pro
28.8
MATH
9.1
MUSR
5.4
Average
20.6
Model Info
Licensellama3
ArchitectureLlamaForCausalLM
Type💬 chat models (RLHF, DPO, IFT, ...)
Modalitytext->text
Knowledge Cutoff2023-12-31
Citations15,603 (2948 influential)
Recent newsView all news →
Related News
techcrunchneutral1d ago

Meta steals a tactic from Tesla and builds data centers in tents

Meta may have found one way to slash its massive data center bill: tents.

techcrunchneutral1d ago

Meta rolls out a new AI creator assistant on Facebook

Creators often have to parse through charts and dashboards to understand their performance, but with the new AI assistant, they can get quick answers to questions like "When should I post?" and "What are people saying in my comments?"

arxivneutral1d ago

R-APS: Compositional Reasoning and In-Context Meta-Learning for Constrained Design via Reflective Adversarial Pareto Search

arXiv:2606.04823v1 Announce Type: new Abstract: Large language models (LLMs) are fluent on open-ended tasks, yet in agentic settings, where a system must plan, use tools, and act over extended horizons, fluency does not ensure reliable delivery. We trace this gap to three coupled structural failures

arxivneutral1d ago

Learning to Evaluate: Cost-Effective Model Evaluation on Unlabeled Data with Meta-Learning

arXiv:2605.23595v2 Announce Type: replace-cross Abstract: The rapid advancement of machine learning has led to an unprecedented expansion of model ecosystems, making it increasingly difficult to assess the reliability of newly released models on unseen and unlabeled data. Existing evaluation pipelin

arxivneutral1d ago

Smart Picks in the Dark: Towards Efficient RLVR for Reasoning via Tracing Metacognitive Pivots

arXiv:2606.04503v1 Announce Type: cross Abstract: Reinforcement learning with verifiable rewards (RLVR) has greatly advanced large reasoning models (LRMs), but it requires timely training on a huge fully-annotated dataset. To this end, data-efficient RLVR methods have been widely studied from two pe

arxivneutral1d ago

MENTOR: A Metacognition-Driven Self-Evolution Framework for Uncovering and Mitigating Implicit Domain Risks in LLMs

arXiv:2511.07107v3 Announce Type: replace Abstract: Ensuring the safety of Large Language Models (LLMs) is critical for real-world deployment. However, current safety measures often fail to address implicit, domain-specific risks. To investigate this gap, we introduce a dataset of 3,000 annotated qu

Related Models
meta-llama logo
Llama-3.1-8B-Instruct
Meta · 10.4M downloads
meta-llama logo
Llama-3.2-1B-Instruct
Meta · 8.0M downloads
google-bert logo
bert-base-uncased
google-bert · 69.6M downloads
sentence-transformers logo
paraphrase-multilingual-MiniLM-L12-v2
SBERT · 49.8M downloads
HomeModelsNews