Model Details
Nemotron-Cascade-2-30B-A3B-i1-GGUF
Accelerating PayPal's Commerce Agent with Speculative Decoding: An Empirical Study on EAGLE3 with Fine-Tuned Nemotron Models
arXiv:2604.19767v1 Announce Type: new Abstract: We evaluate speculative decoding with EAGLE3 as an inference-time optimization for PayPal's Commerce Agent, powered by a fine-tuned llama3.1-nemotron-nano-8B-v1 model. Building on prior work (NEMO-4-PAYPAL) that reduced latency and cost through domain-…
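The abstract names EAGLE3-style speculative decoding as the optimization. As a quick illustration of the underlying draft-then-verify idea, here is a minimal, self-contained Python sketch: a cheap draft model proposes a short block of tokens, and the target model accepts or rejects them in a way that preserves the target's output distribution. The toy `target_probs`/`draft_probs` functions and the tiny vocabulary are placeholder assumptions for illustration; EAGLE3's feature-level draft head and batched verification are not modeled here.

```python
import random

# Toy vocabulary and two "models". The draft is a cheaper, noisier
# approximation of the target. Both return a probability distribution
# over the vocabulary given a context. These are stand-ins, NOT the
# fine-tuned Nemotron target or an EAGLE3 draft head.
VOCAB = list(range(8))

def target_probs(ctx):
    # Target strongly prefers token (last + 1) mod 8.
    last = ctx[-1] if ctx else 0
    p = [0.02] * len(VOCAB)
    p[(last + 1) % len(VOCAB)] = 1.0 - 0.02 * (len(VOCAB) - 1)
    return p

def draft_probs(ctx):
    # Draft mostly agrees with the target but is flatter.
    last = ctx[-1] if ctx else 0
    p = [0.05] * len(VOCAB)
    p[(last + 1) % len(VOCAB)] = 1.0 - 0.05 * (len(VOCAB) - 1)
    return p

def sample(p):
    return random.choices(VOCAB, weights=p, k=1)[0]

def speculative_step(ctx, k=4):
    """One draft-then-verify round; returns the tokens emitted."""
    # 1) Draft model proposes k tokens autoregressively (cheap).
    drafted, draft_dists, c = [], [], list(ctx)
    for _ in range(k):
        p = draft_probs(c)
        t = sample(p)
        drafted.append(t); draft_dists.append(p); c.append(t)
    # 2) Target verifies (one batched forward pass in practice).
    accepted, c = [], list(ctx)
    for t, p_d in zip(drafted, draft_dists):
        p_t = target_probs(c)
        # Accept with probability min(1, p_target / p_draft).
        if random.random() < min(1.0, p_t[t] / p_d[t]):
            accepted.append(t); c.append(t)
        else:
            # Rejected: resample from the residual max(0, p_t - p_d),
            # which keeps the overall output distributed as the target.
            resid = [max(0.0, a - b) for a, b in zip(p_t, p_d)]
            z = sum(resid) or 1.0
            accepted.append(sample([r / z for r in resid]))
            break
    else:
        # All k drafts accepted: the target's scores at the next
        # position come for free, so emit one bonus token from it.
        accepted.append(sample(target_probs(c)))
    return accepted

random.seed(0)
ctx = [0]
for _ in range(5):
    ctx += speculative_step(ctx)
print(ctx)
```

The speedup comes from step 2: the target scores all k drafted positions in a single forward pass, so each verification round can emit several tokens for roughly the cost of one target step, which is the latency reduction the paper measures.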
Nemotron 3 Super: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning
arXiv:2604.12374v1 Announce Type: cross Abstract: We describe the pre-training, post-training, and quantization of Nemotron 3 Super, a 120-billion-parameter (12 billion active) hybrid Mamba-Attention Mixture-of-Experts model. Nemotron 3 Super is the first model in the Nemotron 3 family to 1) be pre-…
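The "120 billion total, 12 billion active" split follows from the Mixture-of-Experts design: each token is routed to only a few experts, so most parameters sit idle on any given forward pass. The PyTorch sketch below shows top-k expert routing with illustrative sizes; the dimensions, expert count, and k are assumptions for demonstration, not Nemotron 3 Super's actual configuration, and the hybrid Mamba-Attention backbone is omitted.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class TopKMoE(nn.Module):
    """Sparse MoE layer: route each token to its top-k experts and mix
    their outputs by renormalized gate weights. Sizes are illustrative."""

    def __init__(self, d_model=64, n_experts=8, k=2, d_ff=128):
        super().__init__()
        self.k = k
        self.gate = nn.Linear(d_model, n_experts, bias=False)
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_ff), nn.GELU(),
                          nn.Linear(d_ff, d_model))
            for _ in range(n_experts)
        )

    def forward(self, x):                     # x: (tokens, d_model)
        logits = self.gate(x)                 # (tokens, n_experts)
        w, idx = logits.topk(self.k, dim=-1)  # per-token top-k experts
        w = F.softmax(w, dim=-1)              # renormalize over chosen k
        out = torch.zeros_like(x)
        # Dense dispatch loop for clarity; production MoE kernels use
        # grouped/batched dispatch instead of iterating over experts.
        for slot in range(self.k):
            for e, expert in enumerate(self.experts):
                mask = idx[:, slot] == e
                if mask.any():
                    out[mask] += w[mask, slot].unsqueeze(-1) * expert(x[mask])
        return out

x = torch.randn(10, 64)
print(TopKMoE()(x).shape)  # torch.Size([10, 64])
```

With 8 experts and k=2, each token exercises only a quarter of the expert parameters; this per-token sparsity is the same mechanism that lets a 120B-total model run with a much smaller active-parameter budget.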
Nemotron-Cascade: Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models
arXiv:2512.13607v2 Announce Type: replace-cross Abstract: Building general-purpose reasoning models with reinforcement learning (RL) entails substantial cross-domain heterogeneity, including large variation in inference-time response lengths and verification latency. Such variability complicates the…