The MiniMax-M2 Series: Mini Activations Unleashing Max Real-World Intelligence
Abstract
The MiniMax-M2 series introduces Mixture-of-Experts language models with minimal activated parameters that achieve high performance in agentic tasks through specialized training and deployment systems.
We introduce the MiniMax-M2 series, a family of Mixture-of-Experts language models built around the principle that mini activations can unleash maximum real-world intelligence. The flagship M2 contains 229.9B total parameters with only 9.8B activated per token. Designed end-to-end for agentic deployment, the M2 series rests on three components: (i) agent-driven data pipelines producing large-scale, verifiable trajectories across agentic coding and agentic cowork, each grounded in an executable workspace and an artifact-aligned reward; (ii) Forge, a scalable agent-native RL system that adapts to long-horizon agent trajectories, paired with windowed-FIFO scheduling, prefix-tree merging, inference optimization, and a clean training-inference-agent decoupling that supports both white-box and black-box agents; (iii) the latest M2.7 checkpoint takes an early step toward self-evolution -- autonomously debugging training runs and modifying its own scaffold. Across M2 through M2.7, this combination translates a mini-activation footprint into frontier-tier performance on agentic coding, deep search, office-task, and reasoning benchmarks.
Community
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- KAT-Coder-V2 Technical Report (2026)
- Mind DeepResearch Technical Report (2026)
- Yet Even Less Is Even Better For Agentic, Reasoning, and Coding LLMs (2026)
- Terminal-World: Scaling Terminal-Agent Environments via Agent Skills (2026)
- Marco DeepResearch: Unlocking Efficient Deep Research Agents via Verification-Centric Design (2026)
- TerminalWorld: Benchmarking Agents on Real-World Terminal Tasks (2026)
- Synthetic Sandbox for Training Machine Learning Engineering Agents (2026)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: @librarian-bot recommend
Get this paper in your agent:
hf papers read 2605.26494 Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper