---CHATGPT---
score: 89
trend: up
change: +1
+ GPT-5.6 (Sol, Terra, Luna) shipped June 26 with gains in coding, biology, and cybersecurity and a new max-reasoning tier, the field's fastest cadence as Gemini's flagship slipped to July
+ GPT-5.5 stays fully live across Plus, Business, and Enterprise on AWS, Azure, and on-prem MCP, the deployable workhorse buyers can run today
+ Tiered pricing holds predictable at Sol $5/$30, Terra half that, Luna $1/$6 per 1M tokens, with Gemini co-lead Noam Shazeer joining as a talent win
- GPT-5.6 launched government-gated to about 20 pre-approved organizations at the Trump administration's request, so buyers cannot deploy the flagship yet, the same freeze that hit Claude
- The model still loses roughly 70% of head-to-head enterprise deals to Claude, and projected 2026 losses near $14B keep the financial overhang in place
---GEMINI---
score: 85
trend: down
change: -2
+ Gemini's enterprise agent stack stays the deepest on the board (Agentforce, Databricks, Ramp, Xero) and consumer share surged toward 27%, the clearest adoption momentum of any challenger
+ With GPT-5.6 government-gated and Claude's flagship only partly restored, Gemini 3 is the most broadly deployable frontier-class model buyers can run right now
- Gemini 3.5 Pro (2M context, Deep Think) slipped general availability to July, a fourth straight miss that leaves it in limited Vertex preview
- DeepMind lost Nobel laureate John Jumper, Gemini contributors Jonas Adler and Alexander Pritzel, and engineer Arthur Conmy to Anthropic, with Bloomberg reporting two more poised to leave and Alphabet stock sliding
- Pentagon classified-network and DeepMind defense-work questions remain unresolved
---CLAUDE---
score: 80
trend: up
change: +2
+ Anthropic became the industry's talent magnet, landing DeepMind's John Jumper, two Gemini model contributors, and a senior safety engineer, the strongest vendor-direction signal on the board
+ Opus 4.8 keeps the enterprise coding crown (about 54% of coding LLM spend per Menlo) and wins roughly 70% of head-to-head deals against OpenAI, with revenue past $47B and an October IPO listing on track
+ The US government cleared Mythos 5 for redeployment to critical-infrastructure operators June 27, the first step back from the export freeze, with Fable 5 general availability still being negotiated
- The June 14 Max class action advanced, alleging the $200 20x plan delivers only six to eight times Pro usage and the $100 5x plan three-and-a-half times, a live consumer-trust and procurement risk
- Fable 5 remains offline for general use while the compliance stack (ISO 42001, FedRAMP, HIPAA) carries durability with the flagship half-dark
---MISTRAL---
score: 76
trend: up
change: +1
+ With both US flagships now restricted (Claude's export ban and GPT-5.6's government gate), Mistral's no-export-risk, open-weight, European-sovereign pitch gets its strongest validation yet
+ The reported about-€3B raise at a roughly €20B valuation (Bloomberg, June 12) would nearly double the prior mark and fund the compute race, with some reports putting it at $3.5B/$23B
+ Mistral Large 3 stays live on Amazon Bedrock and Azure Foundry, anchored by Airbus and the €4B France/Sweden build
- The raise is still early-stage talks, not closed, with amount and valuation movable
- Top-end benchmarks trail Opus 4.8, GPT-5.6, and Gemini 3.5, and Mistral remains off the Pentagon classified-network roster
---GROK---
score: 34
trend: up
change: +1
+ Grok V9-Medium shipped mid-June at 1.5 trillion parameters, roughly triple the prior production model, trained on Cursor developer data to close the coding gap against Claude and GPT-5.5
+ SpaceXAI secured an option to acquire Cursor maker Anysphere for $60B, or $10B for collaboration, a rare enterprise-data and developer-workflow anchor
- No federal, compliance, or procurement progress, and the New Republic report on Grok in Iran strike targeting keeps reliability and credibility concerns live for buyers
- The structure still reads as GPU landlord, with Grok 5 (6T parameters) only now training on the Colossus 2 supercluster in Memphis
---DEEPSEEK---
score: 20
trend: down
change: -1
+ The $7.4B round at a $50B-plus valuation makes DeepSeek China's most valuable AI startup, and V4-Pro holds the cost floor at about $0.45 per 1M input tokens
- The round handed voting rights and direct equity only to China's state AI fund while Tencent and CATL got none, deepening the state-adjacency that blocks US procurement
- US government-device bans (Navy, NASA, and more) plus Australian, Taiwanese, and South Korean restrictions hold, and a more security-charged Washington only hardens the wall
- V4-Pro still trails the leading proprietary systems from Anthropic, OpenAI, and Google in absolute quality