GLM-5.2 Edges Kimi K2.7 Code in Early Coding Tests
Five reviewers ran GLM-5.2 and Kimi K2.7 Code head-to-head on real coding tasks. GLM-5.2 came out with a narrow edge on design, cost, and code that held up to a second look.
Read full story →
IMPLICATOR.ai
Five reviewers ran GLM-5.2 and Kimi K2.7 Code head-to-head on real coding tasks. GLM-5.2 came out with a narrow edge on design, cost, and code that held up to a second look.
Read full story →---CHATGPT---
score: 87
trend: up
change: +1
+ OpenAI is weighing aggressive token price cuts (WSJ) that lower enterprise costs, pressing its advantage as Anthropic's enterprise run stumbles
+ Three-cloud sourcing across AWS, Azure, and on-prem MCP is exactly the multi-cloud reach the former leader just lost to a government order
+ GPT-5.5 (Instant, Thinking, Pro) is fully live across Plus, Business, and Enterprise, with GPT-5.2 retired June 12 on clean auto-migration
- GPT-5.5 still trails Opus 4.8 by about ten points on SWE-bench Pro, ceding the coding-agent quality lead
- About $14B in projected 2026 losses against a vast compute commitment keeps the financial overhang in place
---GEMINI---
score: 86
trend: down
change: -1
+ The Gemini Enterprise Agent Platform remains the deepest enterprise stack on the board, with Salesforce Agentforce, Databricks, Ramp, and Xero aboard
+ Gemini 3.5 Flash anchors production price/performance and is undercutting both US rivals on token cost
- Gemini 3.5 Pro (2M context, Deep Think) slips to general availability a third straight week, still in limited Vertex preview
- A quiet week cedes the top spot to ChatGPT even as Claude stumbles, a momentum gap rather than a platform one
- Pentagon classified-network and DeepMind defense-work questions remain unresolved
---CLAUDE---
score: 82
trend: down
change: -8
+ Opus 4.8 keeps the enterprise coding crown and the compliance stack (ISO 42001, FedRAMP, HIPAA) is unmatched
+ Every other Claude model stayed online across multiple clouds, and the October IPO track is the strongest durability signal on the board
- The Commerce Department ordered Fable 5 and Mythos 5 pulled June 12, and Anthropic disabled both globally, the first forced takedown of a generally available US model
- Top-tier models that vanish on a government letter are a continuity risk no SLA covers, with foreign-national access, including Anthropic's own staff, now blocked
- The pull compounds the unresolved Pentagon blacklisting suit, making Washington risk the first question on any Claude deployment
---MISTRAL---
score: 74
trend: up
change: +2
+ Reportedly raising about €3B at a roughly €20B valuation (Bloomberg, June 12), nearly double its September Series C
+ The European-sovereign, open-weight pitch gains real weight the week Washington yanked a US lab's flagship models
+ Mistral Large 3 stays live on Amazon Bedrock and Azure Foundry, anchored by the Airbus account and the €4B France/Sweden build
- The raise is early-stage talks, not closed, and the amount and valuation could still change
- Top-end benchmarks still trail Opus 4.8, GPT-5.5, and Gemini 3.5, and Mistral remains outside the Pentagon classified-network roster
---GROK---
score: 32
trend: up
change: +1
+ Grok V9 finished training at 1.5 trillion parameters, about three times the prior model, with the coding-focused V9-Medium targeted for mid-June
+ Grok Voice and the Grok Imagine 1.5 preview shipped via API, rounding out the assistant stack
- No federal, compliance, or procurement progress, nothing that touches the metrics enterprise buyers score
- The structure still reads as GPU landlord to rivals, its largest deal the roughly $1.25B/month Colossus compute contract with Anthropic
---DEEPSEEK---
score: 19
trend: up
change: +1
+ The first external raise is firming at about $7.4B and a $52-59B valuation (Tencent, CATL, with founder Liang at roughly 40%)
+ Permanent V4-Pro price cuts keep it under a tenth of GPT-5.5 on input tokens, holding the cost-leadership floor
- The round is still in talks, and deeper China-state-adjacent backing sharpens the US-procurement compliance problem
- US government-device bans and the full compliance perimeter hold, and a more security-charged Washington only hardens the wall
A Claude Code skill is just a folder with a SKILL.md file, but the GitHub tooling around it has split into distinct jobs. anthropics/skills and skill-creator handle authoring and testing, claude-plugins-official handles versioned distribution, runkids/skillshare syncs one library across Codex and OpenCode, and mattpocock/skills packages engineering-process patterns. Start local, and review any third-party skill like a code dependency before install.
Artificial Analysis named Z.ai's GLM-5.2 the top open-weight model on its Intelligence Index, scoring 51 to lead every open peer. Independent reviewers clocked it near Anthropic's Opus 4.8 on build tasks at roughly five times lower cost, though the model burns 43,000 output tokens per benchmark task. GLM-5.2 ships under an MIT license with a 1-million-token context window and open weights on Hugging Face.
Repo Radar #10: as coding agents push GitHub to machine scale, the week's trending repos are the control layer builders wrap around them. NVIDIA's SkillSpector scans agent skills for malicious code, agentsview gives every coding agent a local cost-and-session ledger, codebase-memory-mcp turns a repo into a queryable memory graph, agentgateway fronts agents and MCP servers with auth and budgets, and LMCache reuses the KV cache to cut inference latency. Scored on stars, license, and setup difficul
Weekly deep dives on AI power, strategy, and market shifts.
For founders, operators, investors, and decision-makers who need signal over noise.
Voice dictation became the rare AI tool that changed how people write. As the speech models commoditize, the fight has moved to the layer above them, where a free app built by a developer who cannot type competes with the $81 million leader and the operating systems closing in.
NVIDIA's RTX Spark drops CUDA and 128GB of unified memory into mainstream Windows PCs, sharpening a local-AI buying decision that now turns on what fails first: memory capacity, bandwidth, or model quality. What an individual and a startup should actually buy in 2026, and where to stop spending.
A Nobel Prize winner just walked out of Google's top AI lab. John Jumper, who shared the 2024 chemistry Nobel for AlphaFold, is leaving Google DeepMind for Anthropic, one day after another Google star left for OpenAI. What Anthropic wants with a protein scientist is the surprise.
A Nobel Prize winner just walked out of Google's crown-jewel AI lab. John Jumper, who shared the 2024 chemistry Nobel for AlphaFold and led the team that mapped more than 200 million protein structures, is leaving Google DeepMind for Anthropic after nearly nine years. His exit comes one day after another Google star defected to OpenAI, and it points straight at a weakness the company has been fighting to fix. What Anthropic wants with a protein scientist is the part nobody saw coming.