Anthropic's Claude rose to 91 in Implicator's weekly LLM Meter for the week of May 31, retaking first place after two weeks behind Google's Gemini. On May 28, Anthropic shipped Claude Opus 4.8 and disclosed a $65 billion Series H funding round that valued the company at $965 billion. The meter scores six large language models each week from the perspective of enterprise and business buyers.
The round, led by Altimeter, Dragoneer, Greenoaks, and Sequoia, made Anthropic the most valuable AI lab on paper, past OpenAI's reported $852 billion mark, according to the company's announcement. Anthropic said its annualized revenue run rate has roughly tripled to about $47 billion since February, when it raised at a $380 billion valuation.
Key Takeaways
- Claude rose to 91 in Implicator's weekly LLM Meter, retaking first place after Anthropic's May 28 announcements.
- Anthropic raised $65 billion at a $965 billion valuation, passing OpenAI to become the most valuable AI lab.
- Claude Opus 4.8 scored 69.2% on SWE-bench Pro, about ten points ahead of OpenAI's GPT-5.5.
- Gemini slipped to 88 and ChatGPT to 84; Mistral rose to 73 on an Airbus deal, while Grok and DeepSeek sit at 30 and 17.
AI-generated summary, reviewed by an editor. More on our AI guidelines.
What Opus 4.8 changed
Opus 4.8 scored 69.2% on the SWE-bench Pro coding benchmark, up from 64.3% on Opus 4.7 and about ten points ahead of GPT-5.5, according to Vellum's testing. Anthropic shipped the model across Amazon Bedrock, Google Vertex AI, and Microsoft Foundry on the day of release, the three clouds most regulated buyers already contract through, and held pricing at $5 and $25 per million input and output tokens. Bridgewater Associates, an early customer, told Anthropic that the model flags problems in its own analysis more often before review.
The drags Anthropic carried into the week did not disappear. A June 15 change moves programmatic Claude usage to a separate metered credit pool, and the company still has not published the token sizes behind its subscription caps.
Stay ahead of the curve
Strategic AI news from San Francisco. No hype, no "AI will change everything" throat clearing. Just what moved, who won, and why it matters. Daily at 6am PST.
No spam. Unsubscribe anytime.
Gemini and ChatGPT slip
Gemini fell two points to 88. Google's I/O launch wave from May 19 still anchors its enterprise case, and the former Vertex AI finished folding into the Gemini Enterprise Agent Platform on May 21. The flagship Gemini 3.5 Pro slipped to June, leaving the lower-cost 3.5 Flash to carry the lineup the week Anthropic put a new model at the top of the quality field.
ChatGPT fell one point to 84. OpenAI hardened its enterprise controls, adding governance for Skills and Windows support for its Codex agent, and enterprise now accounts for more than 40% of its revenue. OpenAI lost ground only in relative terms. GPT-5.5 now trails Opus 4.8 on the SWE-bench Pro benchmark, and the Anthropic round ended OpenAI's run as the most valuable AI lab.
Know someone who'd find this useful? ✉️ Email it to a friend in one click, or they can subscribe free here.
Lower in the field
Mistral rose one point to 73 on a partnership with Airbus that spans the planemaker's commercial aircraft, helicopter, defense, and space divisions, alongside a new enterprise platform, Vibe, built from its Le Chat assistant. Grok fell one point to 30; xAI finished training its 1.5-trillion-parameter Grok V9-Medium model on May 25 for a release expected in mid-June, but reported no enterprise or compliance progress. DeepSeek rose one point to 17 as its first external funding round moved toward a roughly $44 billion valuation, with China's Big Fund in talks to lead. US government-device bans on the Chinese model remain in force.
The next meter publishes the week of June 7. Gemini 3.5 Pro and Grok V9-Medium are both scheduled to ship before then.
Frequently Asked Questions
What is the Implicator LLM Meter?
It is Implicator's weekly scorecard rating six large language models from an enterprise-buyer perspective, weighing compliance, model quality, reliability, ecosystem maturity, vendor stability, and pricing. Scores run from 0 to 100 and update each week.
Why did Claude retake the lead?
Anthropic shipped Claude Opus 4.8 on May 28 and disclosed a $65 billion funding round at a $965 billion valuation. The model leads on coding benchmarks and the raise made Anthropic the most valuable AI lab, the strongest pairing of quality and vendor stability in the field this week.
How does Opus 4.8 compare with GPT-5.5?
Opus 4.8 scored 69.2% on the SWE-bench Pro coding benchmark, about ten points ahead of OpenAI's GPT-5.5, according to Vellum's testing. Anthropic also shipped it across Amazon Bedrock, Google Vertex AI, and Microsoft Foundry on release day.
Why did Gemini and ChatGPT fall?
Both slipped on relative momentum. Gemini's flagship 3.5 Pro slipped to June, and ChatGPT's GPT-5.5 now trails Opus 4.8 on coding while Anthropic's raise ended OpenAI's run as the most valuable lab. Neither company's position weakened on its own terms.
Where do Mistral, Grok, and DeepSeek stand?
Mistral rose to 73 on an Airbus partnership and its new Vibe platform. Grok fell to 30 with no enterprise progress ahead of its mid-June V9-Medium release. DeepSeek rose to 17 as its funding round neared a $44 billion valuation; US government-device bans remain in force.
AI-generated summary, reviewed by an editor. More on our AI guidelines.



IMPLICATOR