This is a Hierarchical Controller-Worker Architecture.
Think of Kimi K2.5 not as a chatbot, but as a Distributed Operating System Kernel (the Orchestrator) managing a pool of Serverless Functions (the Sub-agents).
tl;dr
- Orchestrator + Frozen Sub-agents
- Parallel Agent RL training