Xiaomi has formally entered the artificial intelligence space with the release of its first open-source reasoning model, MiMo 7B. Developed by the company’s newly established Large Model Core Team, MiMo represents a strategic shift from Xiaomi’s traditional hardware focus to advanced AI research and development.
Despite having just 7 billion parameters, MiMo has demonstrated strong performance in complex reasoning tasks, surpassing much larger models such as OpenAI’s o1-mini and Alibaba’s 32B QwQ-Preview. This performance has put Xiaomi in the spotlight for producing a highly efficient, compact model capable of rivaling more resource-intensive competitors.
Benchmark Performance and Design Strategy
MiMo 7B has outperformed its peers on benchmarks such as AIME 24-25 (a test of mathematical reasoning) and LiveCodeBench v5 (a programming challenge dataset). This success is attributed to a well-structured development process that combines pre-training and post-training innovations.
Pre-training Strategies Included:
- Rich Reasoning Corpus: Focused on extracting and integrating complex reasoning data.
- Synthetic Data Generation: Produced roughly 200 billion tokens of expert-level reasoning data to improve training depth.
- Progressive Difficulty Training: Employed a three-phase training method with increasing levels of difficulty.
- Extensive Token Exposure: Trained on 25 trillion tokens for broad coverage.
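To make the idea of progressive difficulty training concrete, here is a minimal sketch of a generic three-phase curriculum. It is an illustration only, not Xiaomi's actual data pipeline: the function name `three_phase_curriculum`, the per-example difficulty scores, and the equal-thirds split are all assumptions made for the example.

```python
from typing import List, Tuple


def three_phase_curriculum(examples: List[Tuple[str, float]]) -> List[List[str]]:
    """Split (text, difficulty) pairs into three phases of increasing difficulty.

    Hypothetical helper: the difficulty scores and the equal-thirds split
    are assumptions for illustration, not details from the MiMo release.
    """
    # Order examples from easiest to hardest.
    ordered = sorted(examples, key=lambda pair: pair[1])
    n = len(ordered)
    # Cut points at one third and two thirds of the sorted data.
    cuts = [0, n // 3, (2 * n) // 3, n]
    return [
        [text for text, _ in ordered[cuts[i]:cuts[i + 1]]]
        for i in range(3)
    ]


if __name__ == "__main__":
    data = [("a", 0.9), ("b", 0.1), ("c", 0.5), ("d", 0.3), ("e", 0.7), ("f", 0.2)]
    for phase_index, phase in enumerate(three_phase_curriculum(data), start=1):
        print(f"phase {phase_index}: {phase}")
```

A training loop would then consume the phases in order, so the model sees easier material before harder material.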
Post-training Enhancements:
- Test Difficulty-Driven Rewards: Introduced a novel approach for addressing reward sparsity in complex algorithmic tasks.
- Data Re-sampling Techniques: Applied to stabilize reinforcement learning processes.
- Seamless Rollout System: Increased training speed by 2.29 times and validation speed by 1.96 times, streamlining the reinforcement learning (RL) pipeline.
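Reward sparsity arises when a coding task pays out only if every test case passes, so most attempts earn nothing. The sketch below shows one way a difficulty-aware reward could give partial credit instead. It is a hedged illustration, not MiMo's published reward function: the name `difficulty_weighted_reward`, the use of historical pass rates as a difficulty proxy, and the weighting `1 - pass_rate` are all assumptions.

```python
from typing import Dict


def difficulty_weighted_reward(
    passed: Dict[str, bool], pass_rates: Dict[str, float]
) -> float:
    """Return a reward in [0, 1] that gives partial credit per test case.

    Assumption for illustration: each test is weighted by its difficulty,
    estimated as 1 - (historical pass rate), so harder tests earn more.
    """
    if not passed:
        return 0.0
    weights = {name: 1.0 - pass_rates[name] for name in passed}
    total = sum(weights.values())
    if total == 0:
        # Every test is trivially easy: fall back to the plain pass fraction.
        return sum(passed.values()) / len(passed)
    earned = sum(weight for name, weight in weights.items() if passed[name])
    return earned / total


if __name__ == "__main__":
    results = {"t1": True, "t2": True, "t3": False}
    rates = {"t1": 0.9, "t2": 0.5, "t3": 0.1}
    # An all-or-nothing reward would be 0 here; this scheme still gives a signal.
    print(difficulty_weighted_reward(results, rates))
```

Under an all-or-nothing scheme, the attempt in the usage example would score zero despite passing two of three tests. The weighted version returns a nonzero value, which gives the RL policy a denser learning signal.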
A Broader Vision for AI
The release of MiMo underscores Xiaomi’s broader ambitions in the AI sector. While previously known for consumer electronics and smart devices, Xiaomi is now positioning itself as a serious player in AI model development. By open-sourcing MiMo, Xiaomi is contributing to the collaborative AI research ecosystem, fostering innovation beyond proprietary development.
Developers and AI researchers can now access MiMo 7B and its full technical documentation via Xiaomi’s official Hugging Face repository, a valuable resource for further experimentation and development. This move reflects Xiaomi’s intent to build a strong presence in AI while supporting the open-source community.