·
DataBubble
  • Home
  • Models
  • News
  • Compare
  • Boards
  • Pricing
  • About
  • Newsletter
  • Methodology
  • Contact
Latest
Meta rolls out a new AI creator assistant on Facebook1h◆What to expect from WWDC 2026: Siri’s highly anticipated revamp and Apple Intelligence updates1h◆Is Silicon Valley ready to put robots in people’s homes? Hello Robot is.3h◆TSMC struggles to keep up with AI demand: ‘We can only support so much’4h◆Apple touts $1.4 trillion in App Store billings and sales, 90% without a commission4h◆Elon Musk is steamrolling Wall Street to become a trillionaire4h◆How to Fine-Tune Nemotron 3.5 ASR for Your Language, Domain, or Accent5h◆Let us filter AI slop, you cowards6h◆EVA-Bench Data 2.0: 3 Domains, 121 Tools, 213 Scenarios6h◆AI leaders call for tougher protections against AI-aided bioweapons6h◆How Endava is redesigning software delivery around AI agents6h◆Task-Seeded Synthetic Q&A Generation for Nemotron Pretraining7h◆How courts are coping with a flood of AI-generated lawsuits7h◆Amazon develops a warehouse robot that workers can speak to8h◆Dreaming: Better memory for a more helpful ChatGPT9h◆SFMambaNet: Spectral-Frequency Enhanced Selective State Space Model for Correspondence Pruning14h◆Optical-Guided Neural Collapse for SAR Few-Shot Class Incremental Learning14h◆Dynamic Infilling Anchors for Format-Constrained Generation in Diffusion Large Language Models14h◆Temporal Order Matters for Agentic Memory: Segment Trees for Long-Horizon Agents14h◆Why Muon Outperforms Adam: A Curvature Perspective14h◆Meta rolls out a new AI creator assistant on Facebook1h◆What to expect from WWDC 2026: Siri’s highly anticipated revamp and Apple Intelligence updates1h◆Is Silicon Valley ready to put robots in people’s homes? Hello Robot is.3h◆TSMC struggles to keep up with AI demand: ‘We can only support so much’4h◆Apple touts $1.4 trillion in App Store billings and sales, 90% without a commission4h◆Elon Musk is steamrolling Wall Street to become a trillionaire4h◆How to Fine-Tune Nemotron 3.5 ASR for Your Language, Domain, or Accent5h◆Let us filter AI slop, you cowards6h◆EVA-Bench Data 2.0: 3 Domains, 121 Tools, 213 Scenarios6h◆AI leaders call for tougher protections against AI-aided bioweapons6h◆How Endava is redesigning software delivery around AI agents6h◆Task-Seeded Synthetic Q&A Generation for Nemotron Pretraining7h◆How courts are coping with a flood of AI-generated lawsuits7h◆Amazon develops a warehouse robot that workers can speak to8h◆Dreaming: Better memory for a more helpful ChatGPT9h◆SFMambaNet: Spectral-Frequency Enhanced Selective State Space Model for Correspondence Pruning14h◆Optical-Guided Neural Collapse for SAR Few-Shot Class Incremental Learning14h◆Dynamic Infilling Anchors for Format-Constrained Generation in Diffusion Large Language Models14h◆Temporal Order Matters for Agentic Memory: Segment Trees for Long-Horizon Agents14h◆Why Muon Outperforms Adam: A Curvature Perspective14h◆
News/Procedural-skill SFT across capacity tiers: A W-Shaped pre-SFT Trajectory and Regime-Asymmetric Mechanism on 0.8B-4B Qwen3.5 Models
arxiv
PublishedMay 15, 2026 at 4:00 AM
—neutral

Procedural-skill SFT across capacity tiers: A W-Shaped pre-SFT Trajectory and Regime-Asymmetric Mechanism on 0.8B-4B Qwen3.5 Models

Source
arxiv.orgfull article ↗
Read on arxiv→
Publisher summary· verbatim

arXiv:2605.11907v2 Announce Type: replace Abstract: We measure procedural-skill SFT contribution across three Qwen3.5 dense scales (0.8B, 2B, 4B) on a 200-task / 40-skill holdout, with Claude Haiku 4.5 as a frontier reference. The corpus is 353 rows of (task + procedural-skill block, Opus chain-of-t

Stay posted· Newsletter

A 5-min weekly brief — top movers, price watch, story of the week.

// no spam · unsubscribe one-click · free forever

Discussion
Source
↗
arxiv
Read original ↗All from arxiv →

No replies yet. Be first.

Source
↗
arxiv
Read original ↗All from arxiv →

Related coverage

More from ARXIV
arxivSFMambaNet: Spectral-Frequency Enhanced Selective State Space Model for Correspondence Pruning14harxivOptical-Guided Neural Collapse for SAR Few-Shot Class Incremental Learning14harxivDynamic Infilling Anchors for Format-Constrained Generation in Diffusion Large Language Models14harxivTemporal Order Matters for Agentic Memory: Segment Trees for Long-Horizon Agents14h
The Bubble Brief
WEEKLY

Read AI insights every Tuesday — top movers, new releases, story of the week.

// no spam · unsubscribe one-click · free forever

Originally published on arxiv ↗
HomeModelsNews