huggingface
Published August 4, 2025 at 7:51 PM

Measuring Open-Source Llama Nemotron Models on DeepResearch Bench

Source: huggingface.co