·
DataBubble
  • Home
  • Models
  • News
  • Compare
  • Boards
  • Pricing
  • About
  • Newsletter
  • Methodology
  • Contact
Latest
Startup Battlefield 200 applications officially close in 3 days2h◆Google will pay SpaceX $920M per month for compute3h◆The most interesting startups right now want to get you off your phone4h◆This is your laptop… on AI5h◆New York lawmakers pass one-year ban on new data centers6h◆The token bill comes due: Inside the industry scramble to manage AI’s runaway costs7h◆The latest AI news we announced in May 20267h◆The ‘together tech’ wave might be the most intriguing startup bet of 20268h◆This AI startup says it can tell if a script will make a hit film8h◆AirTrunk commits $30B to build 5GW of AI data centers in India8h◆The Meta hack shows there’s more to AI security than Mythos13h◆Mira Murati steps back into the spotlight, carefully16h◆SFMambaNet: Spectral-Frequency Enhanced Selective State Space Model for Correspondence Pruning18h◆Optical-Guided Neural Collapse for SAR Few-Shot Class Incremental Learning18h◆Dynamic Infilling Anchors for Format-Constrained Generation in Diffusion Large Language Models18h◆Temporal Order Matters for Agentic Memory: Segment Trees for Long-Horizon Agents18h◆Why Muon Outperforms Adam: A Curvature Perspective18h◆Vision Hopfield Memory Networks18h◆Provably Auditable and Safe LLM Agents from Human-Authored Ontologies18h◆FlexRank: Nested Low-Rank Knowledge Decomposition for Adaptive Model Deployment18h◆Startup Battlefield 200 applications officially close in 3 days2h◆Google will pay SpaceX $920M per month for compute3h◆The most interesting startups right now want to get you off your phone4h◆This is your laptop… on AI5h◆New York lawmakers pass one-year ban on new data centers6h◆The token bill comes due: Inside the industry scramble to manage AI’s runaway costs7h◆The latest AI news we announced in May 20267h◆The ‘together tech’ wave might be the most intriguing startup bet of 20268h◆This AI startup says it can tell if a script will make a hit film8h◆AirTrunk commits $30B to build 5GW of AI data centers in India8h◆The Meta hack shows there’s more to AI security than Mythos13h◆Mira Murati steps back into the spotlight, carefully16h◆SFMambaNet: Spectral-Frequency Enhanced Selective State Space Model for Correspondence Pruning18h◆Optical-Guided Neural Collapse for SAR Few-Shot Class Incremental Learning18h◆Dynamic Infilling Anchors for Format-Constrained Generation in Diffusion Large Language Models18h◆Temporal Order Matters for Agentic Memory: Segment Trees for Long-Horizon Agents18h◆Why Muon Outperforms Adam: A Curvature Perspective18h◆Vision Hopfield Memory Networks18h◆Provably Auditable and Safe LLM Agents from Human-Authored Ontologies18h◆FlexRank: Nested Low-Rank Knowledge Decomposition for Adaptive Model Deployment18h◆
DataBubble·

Model Detail

Qwen logo

Qwen3.5-35B-A3B

—
Provider: QwenCategory: multimodalPipeline: image-text-to-textParameters: 35B
DB Score
20.8
Downloads
3.1M
Likes
1K
Day
+0.0%
Week
+0.0%
Month
+0.0%
Overview

Qwen3.5-35B-A3B is a multimodal model with 35B parameters released by Qwen. The model is registered under the image-text-to-text pipeline tag on Hugging Face, and supports text+image+video->text inputs, distributed under the permissive apache-2.0 license.

Pricing & Throughput

Qwen3.5-35B-A3B is priced at $0.1625/M input tokens and $1.3/M output tokens. Operationally the model offers a 262K-token context window, which matters when sizing it for prompt-heavy or latency-sensitive workloads. At this input rate the model sits in the commodity tier and is suitable for high-volume workloads where per-call cost dominates the decision.

Technical

Qwen3.5-35B-A3B ships with 35B parameters. Total weight footprint is approximately 36.0 GB, which is the relevant figure when planning local-inference VRAM. The apache-2.0 license is permissive, allowing commercial deployment and derivative work without per-seat fees, though attribution requirements still apply.

Use Cases

Qwen3.5-35B-A3B is best fit for mixed text-and-image reasoning tasks such as document understanding, high-volume batch jobs where per-call cost dominates the budget, and long-context tasks such as full-codebase analysis or book-length summarization (262K tokens). Treat this as a starting matrix rather than a benchmark verdict — the right deployment usually depends on the specific evaluation suite that mirrors your workload.

Download History
Pricing
Input ($/M tokens)
$0.1625
Output ($/M tokens)
$1.3
Context Window
262K
Research Paper
arXiv: 2309.16609→
Model Info
Licenseapache-2.0
Modalitytext+image+video->text
Citations3,775 (409 influential)
Recent newsView all news →
Related News
arxivneutral21d ago

Procedural-skill SFT across capacity tiers: A W-Shaped pre-SFT Trajectory and Regime-Asymmetric Mechanism on 0.8B-4B Qwen3.5 Models

arXiv:2605.11907v2 Announce Type: replace Abstract: We measure procedural-skill SFT contribution across three Qwen3.5 dense scales (0.8B, 2B, 4B) on a 200-task / 40-skill holdout, with Claude Haiku 4.5 as a frontier reference. The corpus is 353 rows of (task + procedural-skill block, Opus chain-of-t

arxiv44d ago

Qwen3.5-Omni Technical Report

arXiv:2604.15804v2 Announce Type: replace Abstract: In this work, we present Qwen3.5-Omni, the latest advancement in the Qwen-Omni model family. Representing a significant evolution over its predecessor, Qwen3.5-Omni scales to hundreds of billions of parameters and supports a 256k context length. By

Related Models
Qwen logo
Qwen3-VL-2B-Instruct
Qwen · 155.7M downloads
Qwen logo
Qwen3-0.6B
Qwen · 18.4M downloads
Qwen logo
Qwen3-VL-2B-Instruct
Qwen · 155.7M downloads
google logo
gemma-4-31B-it
Google · 10.3M downloads
HomeModelsNews