DataBubble·

Model Detail

gemma-3-12b-it-qat-q4_0-unquantized

—

Provider: GoogleCategory: multimodalPipeline: image-text-to-textParameters: 12B

DB Score

20.7

Downloads

70K

Likes

118

Day

+0.0%

Week

+0.0%

Month

+0.0%

Overview

gemma-3-12b-it-qat-q4_0-unquantized is a multimodal model with 12B parameters released by Google. The model is registered under the image-text-to-text pipeline tag on Hugging Face, released under the gemma license.

Technical

gemma-3-12b-it-qat-q4_0-unquantized ships with 12B parameters, distributed as a quantized weight variant for lower-VRAM inference. Total weight footprint is approximately 12.2 GB, which is the relevant figure when planning local-inference VRAM. Access is gated on Hugging Face under the gemma license, which means a manual approval step before weights can be downloaded.

Use Cases

gemma-3-12b-it-qat-q4_0-unquantized is best fit for mixed text-and-image reasoning tasks such as document understanding. Treat this as a starting matrix rather than a benchmark verdict — the right deployment usually depends on the specific evaluation suite that mirrors your workload.

Download History

Research Paper

arXiv: 2403.08295→

Model Info

Licensegemma

Recent newsView all news →

Fine-Tuning and Serving Gemma 4 31B on Google Cloud TPU: A Technical Comparison with GPU Baselines

arXiv:2605.25645v2 Announce Type: replace-cross Abstract: We present the first end-to-end demonstration of fine-tuning and serving Google's Gemma 4 31B model on TPU hardware, providing an empirical comparison of TPU and GPU platforms for large language model adaptation. Using LoRA on a Google TPU v5

arxiv16d ago

Borrowed Geometry: Cross-Distribution Head-Importance Fingerprints of Frozen Pretrained Gemma 4 31B

arXiv:2605.00333v2 Announce Type: replace-cross Abstract: Frozen Gemma 4 31B weights pretrained exclusively on text, unmodified, transfer through a thin trainable interface to non-text modalities the substrate has never processed. On the L24--L29 slice (192 attention heads), an English-text TxtCopy

arxiv29d ago

PSK at SemEval-2026 Task 9: Multilingual Polarization Detection Using Ensemble Gemma Models with Synthetic Data Augmentation

arXiv:2605.05159v1 Announce Type: new Abstract: We present our system for SemEval-2026 Task 9: Multilingual Polarization Detection, a binary classification task spanning 22 languages. Our approach fine-tunes separate Gemma~3 models (12B and 27B parameters) per language using Low-Rank Adaptation (LoR

arxiv30d ago

MedGemma 1.5 Technical Report

arXiv:2604.05081v2 Announce Type: replace Abstract: We introduce MedGemma 1.5 4B, the latest model in the MedGemma collection. MedGemma 1.5 expands on MedGemma 1 by integrating additional capabilities: high-dimensional medical imaging (CT/MRI volumes and histopathology whole slide images), anatomica

arxivneutral37d ago

Distilling Self-Consistency into Verbal Confidence: A Pre-Registered Negative Result and Post-Hoc Rescue on Gemma 3 4B

arXiv:2604.24070v1 Announce Type: cross Abstract: Small instruct-tuned LLMs produce degenerate verbal confidence under minimal elicitation: ceiling rates above 95%, near-chance Type-2 AUROC, and Invalid validity profiles. We test whether confidence-conditioned supervised fine-tuning (CSFT) with self

Related Models

gemma-4-26B-A4B-it

Google · 11.9M downloads

gemma-4-31B-it

Google · 11.1M downloads

Qwen3-VL-2B-Instruct

Qwen · 22.5M downloads

gemma-4-26B-A4B-it

Google · 11.9M downloads