·
DataBubble
  • Home
  • Models
  • News
  • Compare
  • Boards
  • Pricing
  • About
  • Newsletter
  • Methodology
  • Contact
Latest
Startup Battlefield 200 applications officially close in 3 days1h◆Google will pay SpaceX $920M per month for compute2h◆The most interesting startups right now want to get you off your phone4h◆This is your laptop… on AI5h◆New York lawmakers pass one-year ban on new data centers6h◆The token bill comes due: Inside the industry scramble to manage AI’s runaway costs7h◆The latest AI news we announced in May 20267h◆The ‘together tech’ wave might be the most intriguing startup bet of 20267h◆This AI startup says it can tell if a script will make a hit film7h◆AirTrunk commits $30B to build 5GW of AI data centers in India8h◆The Meta hack shows there’s more to AI security than Mythos12h◆Mira Murati steps back into the spotlight, carefully16h◆SFMambaNet: Spectral-Frequency Enhanced Selective State Space Model for Correspondence Pruning17h◆Optical-Guided Neural Collapse for SAR Few-Shot Class Incremental Learning17h◆Dynamic Infilling Anchors for Format-Constrained Generation in Diffusion Large Language Models17h◆Temporal Order Matters for Agentic Memory: Segment Trees for Long-Horizon Agents17h◆Why Muon Outperforms Adam: A Curvature Perspective17h◆Vision Hopfield Memory Networks17h◆Provably Auditable and Safe LLM Agents from Human-Authored Ontologies17h◆FlexRank: Nested Low-Rank Knowledge Decomposition for Adaptive Model Deployment17h◆Startup Battlefield 200 applications officially close in 3 days1h◆Google will pay SpaceX $920M per month for compute2h◆The most interesting startups right now want to get you off your phone4h◆This is your laptop… on AI5h◆New York lawmakers pass one-year ban on new data centers6h◆The token bill comes due: Inside the industry scramble to manage AI’s runaway costs7h◆The latest AI news we announced in May 20267h◆The ‘together tech’ wave might be the most intriguing startup bet of 20267h◆This AI startup says it can tell if a script will make a hit film7h◆AirTrunk commits $30B to build 5GW of AI data centers in India8h◆The Meta hack shows there’s more to AI security than Mythos12h◆Mira Murati steps back into the spotlight, carefully16h◆SFMambaNet: Spectral-Frequency Enhanced Selective State Space Model for Correspondence Pruning17h◆Optical-Guided Neural Collapse for SAR Few-Shot Class Incremental Learning17h◆Dynamic Infilling Anchors for Format-Constrained Generation in Diffusion Large Language Models17h◆Temporal Order Matters for Agentic Memory: Segment Trees for Long-Horizon Agents17h◆Why Muon Outperforms Adam: A Curvature Perspective17h◆Vision Hopfield Memory Networks17h◆Provably Auditable and Safe LLM Agents from Human-Authored Ontologies17h◆FlexRank: Nested Low-Rank Knowledge Decomposition for Adaptive Model Deployment17h◆
DataBubble·

Model Detail

unsloth logo

granite-4.1-8b-GGUF

▲ 6.5%
Provider: unslothCategory: otherParameters: 8B
DB Score
3.8
Downloads
18K
Likes
31
Day
+6.5%
Week
+182.8%
Month
+0.0%
Overview

granite-4.1-8b-GGUF is an AI model with 8B parameters released by unsloth. And supports text->text inputs, distributed under the permissive apache-2.0 license.

Pricing & Throughput

granite-4.1-8b-GGUF is priced at $0.05/M input tokens and $0.1/M output tokens. Operationally the model offers a 131K-token context window, which matters when sizing it for prompt-heavy or latency-sensitive workloads. At this input rate the model sits in the commodity tier and is suitable for high-volume workloads where per-call cost dominates the decision.

Technical

granite-4.1-8b-GGUF ships with 8B parameters, distributed as a quantized weight variant for lower-VRAM inference. The apache-2.0 license is permissive, allowing commercial deployment and derivative work without per-seat fees, though attribution requirements still apply.

Trending Signal

Downloads of granite-4.1-8b-GGUF have moved +6.5% over the past 24 hours, +182.8% over the trailing seven days. That puts the model in active uptrend territory; a sustained move of this size usually reflects a recent release, a viral integration, or a benchmark surprise rather than steady-state demand. These numbers are signal, not guarantee — week-over-week download counts on Hugging Face also reflect mirror traffic, CI scrapes, and one-off benchmarking runs.

Read about databubble_score →
Use Cases

granite-4.1-8b-GGUF is best fit for general-purpose AI workloads, and high-volume batch jobs where per-call cost dominates the budget. Treat this as a starting matrix rather than a benchmark verdict — the right deployment usually depends on the specific evaluation suite that mirrors your workload.

Download History
Pricing
Input ($/M tokens)
$0.05
Output ($/M tokens)
$0.1
Context Window
131K
Research Paper
arXiv: 0000.00000→
Model Info
Licenseapache-2.0
Modalitytext->text
Recent newsView all news →
Related News
arxiv3d ago

GRANITE : a Byzantine-Resilient Dynamic Gossip Learning Framework

arXiv:2504.17471v2 Announce Type: replace-cross Abstract: Gossip Learning (GL) is a decentralized learning paradigm where users iteratively exchange and aggregate models with a small set of neighboring peers. Recent approaches rely on dynamic communication graphs built using Random Peer Sampling (RP

huggingfaceneutral22d ago

Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality

huggingfaceneutral37d ago

Granite 4.1 LLMs: How They’re Built

huggingface66d ago

Granite 4.0 3B Vision: Compact Multimodal Intelligence for Enterprise Documents

huggingface77d ago

What's New in Mellea 0.4.0 + Granite Libraries Release

Related Models
unsloth logo
Qwen3-Coder-Next-GGUF
unsloth · 3.1M downloads
unsloth logo
gemma-4-26B-A4B-it-GGUF
unsloth · 2.4M downloads
openai logo
clip-vit-large-patch14
OpenAI · 33.1M downloads
openai logo
clip-vit-base-patch32
OpenAI · 21.4M downloads
HomeModelsNews