---CLAUDE---
score: 90
trend: down
change: -1
+ Filed a confidential S-1 with the SEC on June 1, putting an October IPO and audited financials on the table days after the $65B raise at a $965B valuation, the strongest vendor-longevity signal on the board
+ Deepened the Snowflake partnership at Snowflake Summit 26 around governed enterprise AI, plus a Partner Hub and Services Track backed by $100M in partner investment
+ Claude Managed Agents now run in a customer-controlled sandbox against private MCP servers, keeping agent execution inside enterprise security boundaries
- Two outages in four days: a June 2 incident traced to a Claude Code sub-agent bug that looped infinitely and drained Pro/Max quotas, plus a June 5 error spike
- Still a clear #1 on coding quality and compliance (ISO 42001, FedRAMP, HIPAA), but the outages are the fresh, self-inflicted drag
---GEMINI---
score: 87
trend: down
change: -1
+ The consolidated Gemini Enterprise Agent Platform (formerly Vertex AI) is still the deepest agent-orchestration stack on the board, with Salesforce, Databricks, Ramp, and Xero as launch adopters
+ Gemini 3.5 Flash continues to anchor the lineup on production price/performance
- Flagship Gemini 3.5 Pro slipped again, still in limited Vertex preview as of June 6 with GA "weeks away" for a second straight week
- A quiet week against rivals that filed to IPO, landed on AWS, and shipped a flagship reads as lost relative momentum
- Pentagon classified-network and DeepMind defense-work questions remain unresolved
---CHATGPT---
score: 86
trend: up
change: +2
+ OpenAI frontier models and Codex went live on AWS (June 1), ending the Azure-only constraint and giving buyers true three-cloud sourcing to match Claude's multi-cloud reach
+ Shipped a Secure MCP Tunnel so ChatGPT, Codex, and the Responses API can reach private and on-prem MCP servers through a customer-hosted client
+ Expanded Codex "for every role, tool, and workflow," plus Enterprise/Business governance and GPT-5.5 workspace-agent controls
+ Enterprise is now north of 40% of revenue with named demand (Goldman Sachs, State Farm, Phillips), on track for consumer parity by year-end
- Still trails Opus 4.8 by about 10 points on SWE-bench Pro and carries ~$14B in projected 2026 losses, though the distribution win narrows its score gap with Gemini to a single point
---MISTRAL---
score: 72
trend: down
change: -1
+ Its strongest asset remains the open-weight Mistral Large 3 (675B-parameter MoE, 41B active, Apache 2.0), live on Amazon Bedrock and Azure Foundry since its December debut and ranked #2 among open non-reasoning models on LMArena
+ The EU-sovereign procurement case still rests on the Airbus reference account and the €4B France/Sweden data-center build
- No fresh enterprise catalyst this week while rivals moved (Anthropic's IPO filing, OpenAI's Bedrock GA, DeepSeek's raise), leaving Mistral a step behind on relative momentum
- Top-end benchmarks still trail Opus 4.8, GPT-5.5, and Gemini 3.5, and Mistral remains outside the Pentagon classified-network roster
---GROK---
score: 31
trend: up
change: +1
+ Shipped Grok Voice and Grok Imagine 1.5 Preview via API (June 4), and added worktrees plus a core model improvement to the Grok Build 0.1 coding beta (June 5)
+ V9-Medium (1.5T parameters, trained on Cursor developer data) is on track for a mid-June coding release, with the SpaceX/xAI right to acquire Cursor's maker Anysphere for $60B giving it a proprietary code-data pipeline
- This week's releases were all consumer- and developer-facing, with no federal, compliance, or procurement progress in a week when rivals advanced on IPO and cloud distribution
- Colossus economics still cast xAI as a GPU landlord (the ~$1.25B/month Anthropic compute deal) more than an enterprise model vendor, against persistent cash-burn concerns
---DEEPSEEK---
score: 18
trend: up
change: +1
+ First external round is firming up fast, now reported at ~$7.4B (about 50B yuan) at a valuation of up to $59B, versus a ~$45B valuation two weeks ago, with Tencent and CATL in and founder Liang Wenfeng funding ~40% himself
+ Permanent V4-Pro price cuts keep it the cost-leadership floor of the market, at under a tenth of GPT-5.5's price on input tokens
- The round is still "in talks," and deeper Chinese-state-adjacent backing sharpens rather than relieves the US-procurement compliance problem
- V4 quality still trails Western flagships, and US government-device bans plus the broader compliance perimeter remain in force