DataBubble·

Model Detail

Qwen3.5-35B-A3B-FP8

—

Provider: QwenCategory: multimodalPipeline: image-text-to-textParameters: 35B

DB Score

1.1

Downloads

1.9M

Likes

146

Day

+0.0%

Week

+0.0%

Month

+0.0%

Overview

Qwen3.5-35B-A3B-FP8 is a multimodal model with 35B parameters released by Qwen. The model is registered under the image-text-to-text pipeline tag on Hugging Face, distributed under the permissive apache-2.0 license.

Technical

Qwen3.5-35B-A3B-FP8 ships with 35B parameters. Total weight footprint is approximately 36.0 GB, which is the relevant figure when planning local-inference VRAM. The apache-2.0 license is permissive, allowing commercial deployment and derivative work without per-seat fees, though attribution requirements still apply.

Use Cases

Qwen3.5-35B-A3B-FP8 is best fit for mixed text-and-image reasoning tasks such as document understanding. Treat this as a starting matrix rather than a benchmark verdict — the right deployment usually depends on the specific evaluation suite that mirrors your workload.

Download History

Research Paper

arXiv: 2309.16609→

Model Info

Licenseapache-2.0

Citations3,938 (424 influential)

Recent newsView all news →

TUDUM: A Turkish-Thinking Reasoning Pipeline for Qwen3.5-27B

arXiv:2607.01927v1 Announce Type: cross Abstract: This paper presents TUDUM (T\"urk\c{c}e D\"u\c{s}\"unen \"Uretken Model), a project pipeline for adapting a Qwen-family 27B thinking model toward Turkish reasoning. The central problem is not only to answer Turkish prompts in Turkish, but to make the

arxivneutral67d ago

Procedural-skill SFT across capacity tiers: A W-Shaped pre-SFT Trajectory and Regime-Asymmetric Mechanism on 0.8B-4B Qwen3.5 Models

arXiv:2605.11907v2 Announce Type: replace Abstract: We measure procedural-skill SFT contribution across three Qwen3.5 dense scales (0.8B, 2B, 4B) on a 200-task / 40-skill holdout, with Claude Haiku 4.5 as a frontier reference. The corpus is 353 rows of (task + procedural-skill block, Opus chain-of-t

arxiv90d ago

Qwen3.5-Omni Technical Report

arXiv:2604.15804v2 Announce Type: replace Abstract: In this work, we present Qwen3.5-Omni, the latest advancement in the Qwen-Omni model family. Representing a significant evolution over its predecessor, Qwen3.5-Omni scales to hundreds of billions of parameters and supports a 256k context length. By

Related Models

Qwen3-0.6B

Qwen · 26.1M downloads

Qwen3-VL-2B-Instruct

Qwen · 22.5M downloads

Qwen3-VL-2B-Instruct

Qwen · 22.5M downloads

gemma-4-26B-A4B-it

Google · 13.6M downloads