·
DataBubble
  • Home
  • Models
  • News
  • Compare
  • Boards
  • Pricing
  • About
  • Newsletter
  • Methodology
  • Contact
Latest
Startup Battlefield 200 applications officially close in 3 days1h◆Google will pay SpaceX $920M per month for compute2h◆The most interesting startups right now want to get you off your phone4h◆This is your laptop… on AI5h◆New York lawmakers pass one-year ban on new data centers6h◆The token bill comes due: Inside the industry scramble to manage AI’s runaway costs7h◆The latest AI news we announced in May 20267h◆The ‘together tech’ wave might be the most intriguing startup bet of 20267h◆This AI startup says it can tell if a script will make a hit film7h◆AirTrunk commits $30B to build 5GW of AI data centers in India8h◆The Meta hack shows there’s more to AI security than Mythos12h◆Mira Murati steps back into the spotlight, carefully16h◆SFMambaNet: Spectral-Frequency Enhanced Selective State Space Model for Correspondence Pruning17h◆Optical-Guided Neural Collapse for SAR Few-Shot Class Incremental Learning17h◆Dynamic Infilling Anchors for Format-Constrained Generation in Diffusion Large Language Models17h◆Temporal Order Matters for Agentic Memory: Segment Trees for Long-Horizon Agents17h◆Why Muon Outperforms Adam: A Curvature Perspective17h◆Vision Hopfield Memory Networks17h◆Provably Auditable and Safe LLM Agents from Human-Authored Ontologies17h◆FlexRank: Nested Low-Rank Knowledge Decomposition for Adaptive Model Deployment17h◆Startup Battlefield 200 applications officially close in 3 days1h◆Google will pay SpaceX $920M per month for compute2h◆The most interesting startups right now want to get you off your phone4h◆This is your laptop… on AI5h◆New York lawmakers pass one-year ban on new data centers6h◆The token bill comes due: Inside the industry scramble to manage AI’s runaway costs7h◆The latest AI news we announced in May 20267h◆The ‘together tech’ wave might be the most intriguing startup bet of 20267h◆This AI startup says it can tell if a script will make a hit film7h◆AirTrunk commits $30B to build 5GW of AI data centers in India8h◆The Meta hack shows there’s more to AI security than Mythos12h◆Mira Murati steps back into the spotlight, carefully16h◆SFMambaNet: Spectral-Frequency Enhanced Selective State Space Model for Correspondence Pruning17h◆Optical-Guided Neural Collapse for SAR Few-Shot Class Incremental Learning17h◆Dynamic Infilling Anchors for Format-Constrained Generation in Diffusion Large Language Models17h◆Temporal Order Matters for Agentic Memory: Segment Trees for Long-Horizon Agents17h◆Why Muon Outperforms Adam: A Curvature Perspective17h◆Vision Hopfield Memory Networks17h◆Provably Auditable and Safe LLM Agents from Human-Authored Ontologies17h◆FlexRank: Nested Low-Rank Knowledge Decomposition for Adaptive Model Deployment17h◆
DataBubble·

Model Detail

nvidia logo

NVIDIA-Nemotron-3-Nano-30B-A3B-BF16

—
Provider: NVIDIACategory: codePipeline: text-generationParameters: 30B
DB Score
1.4
Downloads
1.6M
Likes
749
Day
+0.0%
Week
+0.0%
Month
+0.0%
Overview

NVIDIA-Nemotron-3-Nano-30B-A3B-BF16 is a code generation model with 30B parameters released by NVIDIA. The model is registered under the text-generation pipeline tag on Hugging Face, and supports text->text inputs, distributed under a other license.

Pricing & Throughput

NVIDIA-Nemotron-3-Nano-30B-A3B-BF16 is priced at $0.05/M input tokens and $0.2/M output tokens. Operationally the model offers a 262K-token context window, which matters when sizing it for prompt-heavy or latency-sensitive workloads. At this input rate the model sits in the commodity tier and is suitable for high-volume workloads where per-call cost dominates the decision.

Technical

NVIDIA-Nemotron-3-Nano-30B-A3B-BF16 ships with 30B parameters. Total weight footprint is approximately 31.6 GB, which is the relevant figure when planning local-inference VRAM. Distribution is governed by the other license — review the exact terms before commercial deployment.

Use Cases

NVIDIA-Nemotron-3-Nano-30B-A3B-BF16 is best fit for code completion, repository-scale Q&A, and pair-programming integrations, high-volume batch jobs where per-call cost dominates the budget, and long-context tasks such as full-codebase analysis or book-length summarization (262K tokens). It is a less obvious choice for one-shot generation of security-critical code without review. Treat this as a starting matrix rather than a benchmark verdict — the right deployment usually depends on the specific evaluation suite that mirrors your workload.

Download History
Pricing
Input ($/M tokens)
$0.05
Output ($/M tokens)
$0.2
Context Window
262K
Research Paper
arXiv: 2512.20848→
Model Info
Licenseother
Modalitytext->text
Citations38 (5 influential)
Recent newsView all news →
Related News
arxiv2d ago

FlashMLA-ETAP: Efficient Transpose Attention Pipeline for Accelerating MLA Inference on NVIDIA H20 GPUs

arXiv:2506.01969v3 Announce Type: replace-cross Abstract: Efficient inference of Multi-Head Latent Attention (MLA) is challenged by deploying the DeepSeek-R1 671B model on a single Multi-GPU server. This paper introduces FlashMLA-ETAP, a novel framework that enhances MLA inference for the single-ins

arxiv2d ago

NVIDIA OmniDreams: Real-Time Generative World Model for Closed-Loop Autonomous Vehicle Simulation

arXiv:2606.03159v1 Announce Type: cross Abstract: As autonomous vehicle capabilities advance, the safe evaluation of driving policies in long-tail scenarios remains a critical bottleneck. In closed-loop simulation, the driving policy model actively interacts with the environment, where its actions d

arxivneutral3d ago

How Much Progress Has There Been in NVIDIA Datacenter GPUs?

arXiv:2601.20115v3 Announce Type: replace-cross Abstract: As the role of modern Graphics Processing Units (GPUs) becomes increasingly essential for several computing tasks, analyzing their past and current progress is paramount for determining future constraints on scientific research. This is parti

techcrunchneutral4d ago

Nvidia chases $200B CPU market with AI agent PCs from Microsoft, Dell, and HP

If Nvidia has cracked a way to bring AI agents easily, safely, and usefully to the masses, it could — and should — be big.

huggingfaceneutral4d ago

Welcome NVIDIA Cosmos 3: The First Open Omni-model for Physical AI Reasoning and Action

techcrunchneutral7d ago

After Nvidia’s $20B not-acqui-hire, AI chip startup Groq reportedly raising $650M

Chipmaker Groq is looking to raise $650 million in internal funding as it pivots from hardware to focus more on AI inference, the process of refining the way AI models respond to prompted requests, per Axios.

Related Models
nvidia logo
Gemma-4-31B-IT-NVFP4
NVIDIA · 2.5M downloads
nvidia logo
NVIDIA-Nemotron-3-Super-120B-A12B-NVFP4
NVIDIA · 1.6M downloads
sentence-transformers logo
all-MiniLM-L6-v2
SBERT · 256.0M downloads
nomic-ai logo
nomic-embed-text-v1.5
nomic-ai · 17.1M downloads
HomeModelsNews