·
DataBubble
  • Home
  • Models
  • News
  • Compare
  • Boards
  • Pricing
  • About
  • Newsletter
  • Methodology
  • Contact
Latest
MMGist: A Comprehensive Multimodal Benchmark for 20272h◆Visualizing "We the People": Bridging the Perception Gap through Pluralistic Data Storytelling2h◆Small edits, large models: How Wikipedia advocacy shapes LLM values2h◆Noise-Aware Boundary-Enhanced Generative Learning for Ultrasound Speckle Reduction2h◆Wan-Streamer v0.1: End-to-end Real-time Interactive Foundation Models2h◆MiniOpt: Reasoning to Model and Solve General Optimization Problems with Limited Resources2h◆HierBias: Context-Conditioned Hierarchical Media Bias Detection with Multi-Task Type Classification2h◆Where Larger Models Excel: The Primacy of Constraint-Guided Reasoning2h◆Dynamic-dLLM: Dynamic Cache-Budget and Adaptive Parallel Decoding for Training-Free Acceleration of Diffusion LLM2h◆Phonetic and semantic analyses of spoken corpora of Beijing and Taiwan Mandarin indicate that the neutral tone is a lexical tone2h◆ProfileFoundry: A Synthetic Person-Object Substrate for Privacy, Memory, and Tool-Use Evaluation in LLM Agent2h◆AnySimLite: A Lightweight Few-Shot Similarity Encoder for On-Device Speech-Adjacent Classification2h◆Extracting Problem and Method Sentence from Scientific Papers: A Context-enhanced Transformer Using Formulaic Expression Desensitization2h◆Utilizing Cognitive Signals Generated during Human Reading to Enhance Keyphrase Extraction from Microblogs2h◆Comparing BERT Sentence-Pair Classification and Few-Shot LLM Prompting for Detecting Threat and Solution Framing in German Climate News2h◆Nemotron-TwoTower: Diffusion Language Modeling with Pretrained Autoregressive Context2h◆Assessing Post-Reform Changes in Risk Disclosure Quality with a Multidimensional Text Analysis Approach2h◆Erase-then-Delta Attention: Decoupling Erase and Write Addresses in Delta-Rule Linear Attention2h◆Zero-shot Tweet-Level Stance Detection Enhanced by External Knowledge and Reflective Chain-of-Thought Reasoning2h◆Closing the Quality Gap in Low-Resource Text-to-Speech: LoRA Fine-Tuning of VoxCPM2 for Khmer and Korean2h◆MMGist: A Comprehensive Multimodal Benchmark for 20272h◆Visualizing "We the People": Bridging the Perception Gap through Pluralistic Data Storytelling2h◆Small edits, large models: How Wikipedia advocacy shapes LLM values2h◆Noise-Aware Boundary-Enhanced Generative Learning for Ultrasound Speckle Reduction2h◆Wan-Streamer v0.1: End-to-end Real-time Interactive Foundation Models2h◆MiniOpt: Reasoning to Model and Solve General Optimization Problems with Limited Resources2h◆HierBias: Context-Conditioned Hierarchical Media Bias Detection with Multi-Task Type Classification2h◆Where Larger Models Excel: The Primacy of Constraint-Guided Reasoning2h◆Dynamic-dLLM: Dynamic Cache-Budget and Adaptive Parallel Decoding for Training-Free Acceleration of Diffusion LLM2h◆Phonetic and semantic analyses of spoken corpora of Beijing and Taiwan Mandarin indicate that the neutral tone is a lexical tone2h◆ProfileFoundry: A Synthetic Person-Object Substrate for Privacy, Memory, and Tool-Use Evaluation in LLM Agent2h◆AnySimLite: A Lightweight Few-Shot Similarity Encoder for On-Device Speech-Adjacent Classification2h◆Extracting Problem and Method Sentence from Scientific Papers: A Context-enhanced Transformer Using Formulaic Expression Desensitization2h◆Utilizing Cognitive Signals Generated during Human Reading to Enhance Keyphrase Extraction from Microblogs2h◆Comparing BERT Sentence-Pair Classification and Few-Shot LLM Prompting for Detecting Threat and Solution Framing in German Climate News2h◆Nemotron-TwoTower: Diffusion Language Modeling with Pretrained Autoregressive Context2h◆Assessing Post-Reform Changes in Risk Disclosure Quality with a Multidimensional Text Analysis Approach2h◆Erase-then-Delta Attention: Decoupling Erase and Write Addresses in Delta-Rule Linear Attention2h◆Zero-shot Tweet-Level Stance Detection Enhanced by External Knowledge and Reflective Chain-of-Thought Reasoning2h◆Closing the Quality Gap in Low-Resource Text-to-Speech: LoRA Fine-Tuning of VoxCPM2 for Khmer and Korean2h◆
DataBubble·

Model Detail

lukealonso logo

MiniMax-M2.7-NVFP4

—
Provider: lukealonsoCategory: code
DB Score
1.4
Downloads
39K
Likes
42
Day
+0.0%
Week
+0.0%
Month
+0.0%
Overview

MiniMax-M2.7-NVFP4 is a code generation model with 65.2B parameters released by lukealonso. Distributed under the permissive mit license.

Technical

MiniMax-M2.7-NVFP4 ships with 65.2B parameters. Total weight footprint is approximately 130.4 GB, which is the relevant figure when planning local-inference VRAM. The mit license is permissive, allowing commercial deployment and derivative work without per-seat fees, though attribution requirements still apply.

Use Cases

MiniMax-M2.7-NVFP4 is best fit for code completion, repository-scale Q&A, and pair-programming integrations. It is a less obvious choice for one-shot generation of security-critical code without review. Treat this as a starting matrix rather than a benchmark verdict — the right deployment usually depends on the specific evaluation suite that mirrors your workload.

Download History
Model Info
Licensemit
Recent newsView all news →
Related News
arxiv1d ago

Black-Box Assisted Regression: Phase Transitions and Minimax Optimality

arXiv:2606.25743v1 Announce Type: new Abstract: Foundation models are often used as fixed black-box predictors for downstream tasks with limited labeled data, but their predictions may be biased and unsafe to trust blindly. We study this setting through black-box assisted nonparametric regression: a

arxiv1d ago

Minimax PAC Bounds for Learning in Exogenous Contextual MDPs

arXiv:2606.25170v1 Announce Type: cross Abstract: We study PAC learning in tabular discounted Markov decision processes with exogenous i.i.d. contexts, with discount factor $\gamma$, finite state space $\mathcal X$, action space $\mathcal A$, and context space $\mathcal Z$. At each time step, a cont

arxivneutral7d ago

Quantile of Means: A Bonus-Free Ensemble Method for Minimax Optimal Reinforcement Learning

arXiv:2606.20107v1 Announce Type: new Abstract: Optimal Reinforcement Learning (RL) algorithms typically rely on carefully constructed count-based uncertainty estimates to drive exploration. Although theoretically sound, such estimates are hard to compute in practical settings and therefore offer li

arxiv9d ago

Learning from Biased and Costly Data Sources: Minimax-optimal Data Collection under a Budget

arXiv:2602.17894v2 Announce Type: replace-cross Abstract: Data collection is a critical component of modern statistical and machine learning pipelines, particularly when data must be gathered from multiple heterogeneous sources to study a target population of interest. In many use cases, such as med

arxiv10d ago

Enhancing LLM Safety Through a Theoretical Minimax Game Lens

arXiv:2502.05163v2 Announce Type: replace Abstract: The rapid advancement of large language models (LLMs) necessitates effective mechanisms to ensure their responsible deployment by accurately distinguishing unsafe content from benign content. While substantial safety datasets are available in Engli

arxivneutral11d ago

MiniMax Sparse Attention

arXiv:2606.13392v2 Announce Type: replace Abstract: Ultra-long-context capability is becoming indispensable for frontier LLMs: agentic workflows, repository-scale code reasoning, and persistent memory all require the model to jointly attend over hundreds of thousands to millions of tokens, yet the q

Related Models
lukealonso logo
GLM-5.2-NVFP4
lukealonso · 49K downloads
lukealonso logo
GLM-5.1-NVFP4
lukealonso · 15K downloads
sentence-transformers logo
all-MiniLM-L6-v2
SBERT · 245.3M downloads
nomic-ai logo
nomic-embed-text-v1.5
nomic-ai · 17.1M downloads
HomeModelsNews