Model Detail
MiMo-V2-Flash
MiMo-V2-Flash is a code generation model with 154.9B parameters released by XiaomiMiMo. The model is registered under the text-generation pipeline tag on Hugging Face, takes text input and produces text output, and is distributed under the permissive MIT license.
MiMo-V2-Flash is priced at $0.09/M input tokens and $0.29/M output tokens. Operationally the model offers a 262K-token context window, which matters when sizing it for prompt-heavy or latency-sensitive workloads. At this input rate the model sits in the commodity tier and is suitable for high-volume workloads where per-call cost dominates the decision.
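To make the per-call economics concrete, here is a minimal cost sketch using the listed rates. The function name and the batch shape (10,000 calls of 8,000 input and 500 output tokens) are hypothetical, chosen only for illustration; actual billing may differ by provider.

```python
# Listed rates, in USD per million tokens.
INPUT_USD_PER_M = 0.09
OUTPUT_USD_PER_M = 0.29


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of a workload from its total token counts."""
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000


# Hypothetical batch: 10,000 calls, each with 8,000 input and 500 output tokens.
calls = 10_000
batch_cost = estimate_cost_usd(calls * 8_000, calls * 500)
print(f"Estimated batch cost: ${batch_cost:.2f}")  # prints "Estimated batch cost: $8.65"
```

In this example the input side accounts for $7.20 of the $8.65 total, which is why the input rate tends to dominate the budget for prompt-heavy workloads.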
MiMo-V2-Flash ships with 154.9B parameters. Total weight footprint is approximately 309.8 GB, which works out to 2 bytes per parameter and is the baseline figure when planning local-inference VRAM; KV cache and activations come on top of it. The MIT license is permissive, allowing commercial deployment and derivative work without per-seat fees, provided the copyright and license notice is retained in copies of the software.
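The footprint figure can be reproduced with simple arithmetic. The sketch below assumes the 309.8 GB figure corresponds to 16-bit weights (2 bytes per parameter), which matches the numbers but is not stated by the listing. The 8-bit and 4-bit rows are hypothetical quantization scenarios and do not imply that such checkpoints are published.

```python
PARAMS_BILLION = 154.9  # parameter count from the listing


def weight_footprint_gb(params_billion: float, bytes_per_param: float) -> float:
    """Weight-only footprint in decimal GB (1 GB = 1e9 bytes)."""
    # (params_billion * 1e9 params) * bytes_per_param / 1e9 bytes per GB
    return params_billion * bytes_per_param


# Assumed precisions; only the 16-bit row matches the listed 309.8 GB.
for label, bytes_per_param in [("16-bit", 2.0), ("8-bit", 1.0), ("4-bit", 0.5)]:
    size_gb = weight_footprint_gb(PARAMS_BILLION, bytes_per_param)
    print(f"{label}: {size_gb:.1f} GB")
```

These are weight-only estimates. KV cache, activations, and runtime overhead are excluded, and they grow with context length and batch size.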
Downloads of MiMo-V2-Flash rose 58.0% over the trailing thirty days. That is a marked uptrend, not the cooling usually seen as newer models compete for the same workloads. These numbers are a signal, not a guarantee: download counts on Hugging Face also reflect mirror traffic, CI scrapes, and one-off benchmarking runs.
MiMo-V2-Flash is best suited to code completion, repository-scale Q&A, and pair-programming integrations; high-volume batch jobs where per-call cost dominates the budget; and long-context tasks such as full-codebase analysis or book-length summarization (up to 262K tokens). It is a less obvious choice for one-shot generation of security-critical code without review. Treat this as a starting matrix rather than a benchmark verdict: the right deployment usually depends on an evaluation suite that mirrors your workload.
Non-Identical Diffusion Models in MIMO-OFDM Channel Generation
arXiv:2509.01641v3 Announce Type: replace-cross Abstract: We propose a novel diffusion model, termed the non-identical diffusion model, and investigate its application to wireless orthogonal frequency division multiplexing (OFDM) channel generation. Unlike the standard diffusion model that uses a sc
ReFLEX: Length-Generalizable CSI Denoising for MIMO-OFDM via Relative-Frequency Bias
arXiv:2606.00263v1 Announce Type: cross Abstract: This letter studies CSI denoising for MIMO-OFDM with variable NR resource block (RB) allocations. ReFLEX is a length-generalizable Transformer whose frequency attention uses a relative-frequency position bias (RFPB) generated from subcarrier offsets
MIMO: Multilingual Information Retrieval via Monolingual Objectives
arXiv:2605.31171v1 Announce Type: cross Abstract: Multilingual Information Retrieval (MLIR) reflects real-world search environments in which queries and relevant documents may appear in different languages within a mixed-language corpus. However, existing embedding models are primarily optimized for
Deep Learning-Based Channel Extrapolation for Dual-Band Massive MIMO Systems
arXiv:2601.06858v2 Announce Type: replace-cross Abstract: Future wireless communication systems will increasingly rely on the integration of millimeter wave (mmWave) and sub-6 GHz bands to meet heterogeneous demands on high-speed data transmission and extensive coverage. To fully exploit the benefit
Analog RF Computing: A New Paradigm for Energy-Efficient Edge AI Over MU-MIMO Systems
arXiv:2605.14331v1 Announce Type: cross Abstract: Modern edge devices increasingly rely on neural networks for intelligent applications. However, conventional digital computing-based edge inference requires substantial memory and energy consumption. In analog radio frequency (RF) computing, a base s
Multi-Block Attention for Efficient Channel Estimation in IRS-Assisted mmWave MIMO
arXiv:2605.15032v1 Announce Type: cross Abstract: Intelligent Reflecting Surfaces (IRSs) are a promising technology for enhancing the spectral and energy efficiency of millimeter-wave (mmWave) multiple-input multiple-output (MIMO) systems. In these systems, accurate channel estimation remains challe