Prime Intellect Releases Prime-RL 0.6.0 Enabling Trillion-Parameter RL Training on 28 Servers

2026-06-23 20:33

Prime Intellect reports the deployment of prime-rl version 0.6.0, a distributed reinforcement learning framework that has breached the training threshold for trillion-parameter mixed-expert models in super-long-context agent tasks. This release enables the training of GLM-5 with 131k context lengths using only 28 H200 servers, maintaining single-step processing times below five minutes, a significant reduction from previous requirements of thousands of GPUs.

The framework implements a fully decoupled asynchronous RL architecture to mitigate GPU idle time during complex code generation, allowing real-time weight updates without waiting for trial task completion. To resolve logic confusion from asynchronous updates, Routing Replay (R3) technology stabilizes expert data distribution, reducing discrepancies between training and inference to one-tenth.

Additionally, the system leverages Mooncake technology to aggregate idle memory into a shared cache pool and employs DeepGEMM with block scaling FP8 training to eliminate precision deviation crashes, ensuring efficient resource utilization across distributed clusters.

Disclaimer: Views are the author's own and do not represent the platform. Do not reproduce without permission. Content is for reference only, not investment advice. Trade at your own risk.

Trending News

Strategy sells 32 BTC to validate liquidity while holding 845k BTC and targeting STRC par value recovery

Solana spot dominance contrasts with Perp DEX lag due to execution unpredictability despite 30% market share

TRUMP memecoin gains 3.15% to $1.90 with $281M volume while facing $2.27 resistance and weak ADX

Kalshi expands restricted jurisdictions to 55 including India amid global regulatory crackdown

Former BIS chief Agustín Carstens endorses stablecoin coexistence with fiat currency amid 100% reserve mandates

Bitcoin ETF inflows show $1B arbitrage versus $55B cumulative with 0.70 correlation to CME shorts

Bitcoin supply-in-profit breaches 15-year trendline as 10.2M coins fall below acquisition price

Binance co-founder Yi He exposes Zhu Pan impersonation scam while CoinUp denies ties

SpaceX stock drops 16.5% post-IPO as September 44% insider unlock looms over $75B valuation

THORChain resumes operations after $10.7M exploit and 1-month security overhaul