China
- MiniMax M2.7: Self-Evolving RL and the End of China's Open-Source Playbook
MiniMax M2.7 used earlier model versions to handle 30-50% of its own RL research pipeline -- log-reading, failure analysis, code modification across 100+ iteration loops. The model is also proprietary, marking a strategic shift from Chinese AI's open-source playbook. What the self-evolving loop actually means and why the strategy change matters.