Coinbase AI Spend Drops 50% as Token Usage Grows via GLM Models

2026-06-29 10:10

Woofun AI reports that Coinbase CEO Brian Armstrong announced a near 50% reduction in AI expenditure alongside rising token usage. The firm achieved this by optimizing default settings, routing, and caching strategies. Specific actions include switching defaults to open-weight models such as GLM 5.2 and Kimi 2.7, noting that 91% of staff had not hit usage limits. The company also implemented custom prompt preprocessing for task-specific routing, boosted the LibreChat cache hit rate from 5% to 60%, and streamlined context management by initiating new sessions for different tasks.

Additionally, engineers retain model selection freedom but must account for the associated cost impacts.

Disclaimer: Views are the author's own and do not represent the platform. Do not reproduce without permission. Content is for reference only, not investment advice. Trade at your own risk.

Trending News

Arthur Hayes Buys $2.2M SYN Tokens to Challenge Deribit's 85% Market Share

South Korea Bets 1,000 Trillion Won on Chips While BIS Warns of AI Bubble

Loopring DEX Closes as TVL Plummets 99% From $760 Million Peak

Sharplink Resumes Aggressive ETH Accumulation with $62.4M Purchase Amid Market Decline

South Korea Pledges $1.3 Trillion While BIS Warns AI Spending Surpasses Profits

OpenAI Luna Model Triggers 43% LUNA2 Open Interest Surge Despite Zero Spot Activity

a16z Crypto Backs Ornn With $33M to Trade GPU Hash Rate as Commodity

Hyperliquid Dominates DeFi Perpetuals with 59% Market Share and $873M Revenue

Arthur Hayes-Linked Wallet Buys $2.2M Synapse Tokens, Price Jumps 27%

EBA Proposes Fines Up to 12.5% of Revenue for Non-Compliant Crypto Issuers