·
DataBubble
  • Home
  • Models
  • News
  • Compare
  • Boards
  • Pricing
  • About
  • Newsletter
  • Methodology
  • Contact
Latest
Thousand Token Wood: shipping a multi-agent economy on a 3B model1h◆Startup Battlefield 200 applications officially close in 3 days3h◆Google will pay SpaceX $920M per month for compute4h◆The most interesting startups right now want to get you off your phone6h◆This is your laptop… on AI6h◆New York lawmakers pass one-year ban on new data centers7h◆The token bill comes due: Inside the industry scramble to manage AI’s runaway costs8h◆The latest AI news we announced in May 20268h◆The ‘together tech’ wave might be the most intriguing startup bet of 20269h◆This AI startup says it can tell if a script will make a hit film9h◆AirTrunk commits $30B to build 5GW of AI data centers in India10h◆The Meta hack shows there’s more to AI security than Mythos14h◆Mira Murati steps back into the spotlight, carefully18h◆SFMambaNet: Spectral-Frequency Enhanced Selective State Space Model for Correspondence Pruning19h◆Optical-Guided Neural Collapse for SAR Few-Shot Class Incremental Learning19h◆Dynamic Infilling Anchors for Format-Constrained Generation in Diffusion Large Language Models19h◆Temporal Order Matters for Agentic Memory: Segment Trees for Long-Horizon Agents19h◆Why Muon Outperforms Adam: A Curvature Perspective19h◆Vision Hopfield Memory Networks19h◆Provably Auditable and Safe LLM Agents from Human-Authored Ontologies19h◆Thousand Token Wood: shipping a multi-agent economy on a 3B model1h◆Startup Battlefield 200 applications officially close in 3 days3h◆Google will pay SpaceX $920M per month for compute4h◆The most interesting startups right now want to get you off your phone6h◆This is your laptop… on AI6h◆New York lawmakers pass one-year ban on new data centers7h◆The token bill comes due: Inside the industry scramble to manage AI’s runaway costs8h◆The latest AI news we announced in May 20268h◆The ‘together tech’ wave might be the most intriguing startup bet of 20269h◆This AI startup says it can tell if a script will make a hit film9h◆AirTrunk commits $30B to build 5GW of AI data centers in India10h◆The Meta hack shows there’s more to AI security than Mythos14h◆Mira Murati steps back into the spotlight, carefully18h◆SFMambaNet: Spectral-Frequency Enhanced Selective State Space Model for Correspondence Pruning19h◆Optical-Guided Neural Collapse for SAR Few-Shot Class Incremental Learning19h◆Dynamic Infilling Anchors for Format-Constrained Generation in Diffusion Large Language Models19h◆Temporal Order Matters for Agentic Memory: Segment Trees for Long-Horizon Agents19h◆Why Muon Outperforms Adam: A Curvature Perspective19h◆Vision Hopfield Memory Networks19h◆Provably Auditable and Safe LLM Agents from Human-Authored Ontologies19h◆
DataBubble·

Model Detail

dealignai logo

Gemma-4-26B-A4B-JANG_2L-CRACK

—
Provider: dealignaiCategory: multimodalPipeline: image-text-to-textParameters: 26B
DB Score
1.4
Downloads
26K
Likes
63
Day
+0.0%
Week
+0.0%
Month
+0.0%
Overview

Gemma-4-26B-A4B-JANG_2L-CRACK is a multimodal model with 26B parameters released by dealignai. The model is registered under the image-text-to-text pipeline tag on Hugging Face, released under the gemma license.

Technical

Gemma-4-26B-A4B-JANG_2L-CRACK ships with 26B parameters. Total weight footprint is approximately 3.3 GB, which is the relevant figure when planning local-inference VRAM. Distribution is governed by the gemma license — review the exact terms before commercial deployment.

Use Cases

Gemma-4-26B-A4B-JANG_2L-CRACK is best fit for mixed text-and-image reasoning tasks such as document understanding. Treat this as a starting matrix rather than a benchmark verdict — the right deployment usually depends on the specific evaluation suite that mirrors your workload.

Download History
Research Paper
arXiv: 2403.08295→
Model Info
Licensegemma
Citations1,052 (131 influential)
Recent newsView all news →
Related News
arxiv2d ago

Fine-Tuning and Serving Gemma 4 31B on Google Cloud TPU: A Technical Comparison with GPU Baselines

arXiv:2605.25645v2 Announce Type: replace-cross Abstract: We present the first end-to-end demonstration of fine-tuning and serving Google's Gemma 4 31B model on TPU hardware, providing an empirical comparison of TPU and GPU platforms for large language model adaptation. Using LoRA on a Google TPU v5

arxiv16d ago

Borrowed Geometry: Cross-Distribution Head-Importance Fingerprints of Frozen Pretrained Gemma 4 31B

arXiv:2605.00333v2 Announce Type: replace-cross Abstract: Frozen Gemma 4 31B weights pretrained exclusively on text, unmodified, transfer through a thin trainable interface to non-text modalities the substrate has never processed. On the L24--L29 slice (192 attention heads), an English-text TxtCopy

arxiv29d ago

PSK at SemEval-2026 Task 9: Multilingual Polarization Detection Using Ensemble Gemma Models with Synthetic Data Augmentation

arXiv:2605.05159v1 Announce Type: new Abstract: We present our system for SemEval-2026 Task 9: Multilingual Polarization Detection, a binary classification task spanning 22 languages. Our approach fine-tunes separate Gemma~3 models (12B and 27B parameters) per language using Low-Rank Adaptation (LoR

arxiv30d ago

MedGemma 1.5 Technical Report

arXiv:2604.05081v2 Announce Type: replace Abstract: We introduce MedGemma 1.5 4B, the latest model in the MedGemma collection. MedGemma 1.5 expands on MedGemma 1 by integrating additional capabilities: high-dimensional medical imaging (CT/MRI volumes and histopathology whole slide images), anatomica

arxivneutral37d ago

Distilling Self-Consistency into Verbal Confidence: A Pre-Registered Negative Result and Post-Hoc Rescue on Gemma 3 4B

arXiv:2604.24070v1 Announce Type: cross Abstract: Small instruct-tuned LLMs produce degenerate verbal confidence under minimal elicitation: ceiling rates above 95%, near-chance Type-2 AUROC, and Invalid validity profiles. We test whether confidence-conditioned supervised fine-tuning (CSFT) with self

arxivbullish21d ago

A Large Language Model Based Pipeline for Review of Systems Entity Recognition from Clinical Notes

arXiv:2506.11067v3 Announce Type: replace Abstract: Objective: Develop a cost-effective, large language model (LLM)-based pipeline for automatically extracting Review of Systems (ROS) entities from clinical notes. Materials and Methods: The pipeline extracts ROS section from the clinical note using

Related Models
dealignai logo
Gemma-4-31B-JANG_4M-CRACK
dealignai · 59K downloads
dealignai logo
Gemma-4-26B-A4B-JANG_4M-CRACK
dealignai · 41K downloads
Qwen logo
Qwen3-VL-2B-Instruct
Qwen · 22.5M downloads
google logo
gemma-4-26B-A4B-it
Google · 11.8M downloads
HomeModelsNews