Model Detail
MiMo-V2.5
▲ 1.3%MiMo-V2.5 is a code generation model with 155.4B parameters released by XiaomiMiMo. And supports text+image+audio+video->text inputs, distributed under the permissive mit license.
MiMo-V2.5 is priced at $0.4/M input tokens and $2/M output tokens. Operationally the model offers a 1049K-token context window, which matters when sizing it for prompt-heavy or latency-sensitive workloads. At this input rate the model sits in the commodity tier and is suitable for high-volume workloads where per-call cost dominates the decision.
MiMo-V2.5 ships with 155.4B parameters. Total weight footprint is approximately 310.8 GB, which is the relevant figure when planning local-inference VRAM. The mit license is permissive, allowing commercial deployment and derivative work without per-seat fees, though attribution requirements still apply.
Downloads of MiMo-V2.5 have moved +1.3% over the past 24 hours, +346.2% over the trailing thirty days. That is a slight downtrend, consistent with normal cooling as newer models compete for the same workloads. These numbers are signal, not guarantee — week-over-week download counts on Hugging Face also reflect mirror traffic, CI scrapes, and one-off benchmarking runs.
MiMo-V2.5 is best fit for code completion, repository-scale Q&A, and pair-programming integrations, high-volume batch jobs where per-call cost dominates the budget, and long-context tasks such as full-codebase analysis or book-length summarization (1049K tokens). It is a less obvious choice for one-shot generation of security-critical code without review. Treat this as a starting matrix rather than a benchmark verdict — the right deployment usually depends on the specific evaluation suite that mirrors your workload.
Non-Identical Diffusion Models in MIMO-OFDM Channel Generation
arXiv:2509.01641v3 Announce Type: replace-cross Abstract: We propose a novel diffusion model, termed the non-identical diffusion model, and investigate its application to wireless orthogonal frequency division multiplexing (OFDM) channel generation. Unlike the standard diffusion model that uses a sc
ReFLEX: Length-Generalizable CSI Denoising for MIMO-OFDM via Relative-Frequency Bias
arXiv:2606.00263v1 Announce Type: cross Abstract: This letter studies CSI denoising for MIMO--OFDM with variable NR resource block (RB) allocations. ReFLEX is a length-generalizable Transformer whose frequency attention uses a relative-frequency position bias (RFPB) generated from subcarrier offsets
MIMO: Multilingual Information Retrieval via Monolingual Objectives
arXiv:2605.31171v1 Announce Type: cross Abstract: Multilingual Information Retrieval (MLIR) reflects real-world search environments in which queries and relevant documents may appear in different languages within a mixed-language corpus. However, existing embedding models are primarily optimized for
Deep Learning-Based Channel Extrapolation for Dual-Band Massive MIMO Systems
arXiv:2601.06858v2 Announce Type: replace-cross Abstract: Future wireless communication systems will increasingly rely on the integration of millimeter wave (mmWave) and sub-6 GHz bands to meet heterogeneous demands on high-speed data transmission and extensive coverage. To fully exploit the benefit
Analog RF Computing: A New Paradigm for Energy-Efficient Edge AI Over MU-MIMO Systems
arXiv:2605.14331v1 Announce Type: cross Abstract: Modern edge devices increasingly rely on neural networks for intelligent applications. However, conventional digital computing-based edge inference requires substantial memory and energy consumption. In analog radio frequency (RF) computing, a base s
Multi-Block Attention for Efficient Channel Estimation in IRS-Assisted mmWave MIMO
arXiv:2605.15032v1 Announce Type: cross Abstract: Intelligent Reflecting Surfaces (IRSs) are a promising technology for enhancing the spectral and energy efficiency of millimeter-wave (mmWave) multiple-input multiple-output (MIMO) systems. In these systems, accurate channel estimation remains challe