Claude Opus 4.8 Costs 44 Times More Than DeepSeek V4 Pro in New AI Benchmark

2026-06-16 18:23

Data compiled by Woofun AI shows that Artificial Analysis has overhauled its AI Intelligence Index to prioritize autonomous planning and complex task resolution over simple instruction following. The revised methodology introduces high-difficulty scenarios, such as simulated bank customer service interactions, with the primary metric focusing on the cost and time required to complete tasks successfully.

In the latest rankings, Claude Opus 4.8 secured the top position among available models with a score of 56, narrowly edging out GPT-5.5 at 55 points.

However, a stark cost divergence emerged: executing identical tasks with Claude Opus 4.8 cost $1.78, whereas DeepSeek V4 Pro completed the same work for just $0.04, making Claude roughly 44 times as expensive. Completion times also varied significantly, with xAI Grok 4.3 finishing in 1.5 minutes compared to Claude Sonnet 4.6's 13.5 minutes. The updated GDPval-AA test now accounts for 20% of the total evaluation, and the update raises the human benchmark to 1000 and extends conversation limits to 250 rounds.
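The multiples behind these figures can be checked with quick arithmetic. The sketch below uses only the numbers quoted in this article; the variable names are illustrative, and the per-task definitions behind the cost and time figures are assumed to be comparable across models.

```python
# Figures as quoted in the article; variable names are illustrative only.
opus_cost_usd = 1.78       # Claude Opus 4.8, cost for the task set
deepseek_cost_usd = 0.04   # DeepSeek V4 Pro, cost for the same task set
sonnet_minutes = 13.5      # Claude Sonnet 4.6, completion time
grok_minutes = 1.5         # xAI Grok 4.3, completion time

# How many times more expensive / slower one model is than the other.
cost_ratio = opus_cost_usd / deepseek_cost_usd
time_ratio = sonnet_minutes / grok_minutes

print(f"Cost ratio: {cost_ratio:.1f}x")   # prints "Cost ratio: 44.5x"
print(f"Time ratio: {time_ratio:.1f}x")   # prints "Time ratio: 9.0x"
```

The cost ratio works out to about 44.5, which the headline rounds down to 44. Note that the time comparison involves a different pair of models than the cost comparison, so the two ratios should not be combined.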

