DataBubble·

Model Detail

MiMo-V2-Flash

—

Provider: XiaomiMiMoCategory: codePipeline: text-generation

DB Score

36.9

Downloads

95K

Likes

726

Day

+0.0%

Week

+0.0%

Month

+0.0%

Overview

MiMo-V2-Flash is a code generation model with 154.9B parameters released by XiaomiMiMo. The model is registered under the text-generation pipeline tag on Hugging Face, and supports text->text inputs, distributed under the permissive mit license.

Pricing & Throughput

MiMo-V2-Flash is priced at $0.09/M input tokens and $0.29/M output tokens. Operationally the model offers a 262K-token context window, which matters when sizing it for prompt-heavy or latency-sensitive workloads. At this input rate the model sits in the commodity tier and is suitable for high-volume workloads where per-call cost dominates the decision.

Technical

MiMo-V2-Flash ships with 154.9B parameters. Total weight footprint is approximately 309.8 GB, which is the relevant figure when planning local-inference VRAM. The mit license is permissive, allowing commercial deployment and derivative work without per-seat fees, though attribution requirements still apply.

Use Cases

MiMo-V2-Flash is best fit for code completion, repository-scale Q&A, and pair-programming integrations, high-volume batch jobs where per-call cost dominates the budget, and long-context tasks such as full-codebase analysis or book-length summarization (262K tokens). It is a less obvious choice for one-shot generation of security-critical code without review. Treat this as a starting matrix rather than a benchmark verdict — the right deployment usually depends on the specific evaluation suite that mirrors your workload.

Download History

Pricing

Input ($/M tokens)

$0.09

Output ($/M tokens)

$0.29

Context Window

262K

Model Info

Licensemit

Modalitytext->text

Recent newsView all news →

DeeperRadar: End-to-End MIMO Radar Design and Multi-Modal Fusion for Autonomous Vehicle Perception

arXiv:2607.17351v1 Announce Type: new Abstract: DeeperRadar is a radar-centric, sensor-stack-conditioned framework that co-designs radar sensing and multi-modal 3D detection for autonomous mobility by learning a sparse acquisition pattern end-to-end with the fusion model. A learnable MIMO design mod

arxivneutral5d ago

Full-Pipeline Inference Optimization for MiMo-V2.5 Series: Pushing Hybrid SWA Efficiency to the Limit

arXiv:2607.13095v1 Announce Type: cross Abstract: We present a full-pipeline inference optimization for the MiMo-V2.5 model family, which combines Hybrid Sliding Window Attention (Hybrid SWA), sparse Mixture-of-Experts (MoE), and multimodal encoders. While Hybrid SWA can ideally reduce both attentio

arxiv21d ago

Bridging Neural Networks and Wireless Systems with MIMO-OFDM Semantic Communications

arXiv:2501.16726v2 Announce Type: replace-cross Abstract: Semantic communications aim to enhance transmission efficiency by jointly optimizing source coding, channel coding, and modulation. While prior research has demonstrated promising performance in simulations, real-world implementations often f

arxiv41d ago

Structure from Reasoning, Numbers from Search: On-Premise Open LLMs as Structural Priors for Coupled MIMO Controller Tuning

arXiv:2606.11015v1 Announce Type: new Abstract: Tuning controllers for strongly coupled multi-input multi-output (MIMO) industrial processes is hard: decentralized classical auto-tuning ignores loop interaction, and local numerical optimization from natural initializations stalls in the resulting no

arxiv48d ago

Non-Identical Diffusion Models in MIMO-OFDM Channel Generation

arXiv:2509.01641v3 Announce Type: replace-cross Abstract: We propose a novel diffusion model, termed the non-identical diffusion model, and investigate its application to wireless orthogonal frequency division multiplexing (OFDM) channel generation. Unlike the standard diffusion model that uses a sc

arxiv49d ago

ReFLEX: Length-Generalizable CSI Denoising for MIMO-OFDM via Relative-Frequency Bias

arXiv:2606.00263v1 Announce Type: cross Abstract: This letter studies CSI denoising for MIMO--OFDM with variable NR resource block (RB) allocations. ReFLEX is a length-generalizable Transformer whose frequency attention uses a relative-frequency position bias (RFPB) generated from subcarrier offsets

Related Models

MiMo-V2.5

XiaomiMiMo · 216K downloads

MiMo-V2.5-Pro

XiaomiMiMo · 87K downloads

all-MiniLM-L6-v2

SBERT · 240.6M downloads

nomic-embed-text-v1.5

nomic-ai · 17.1M downloads