RunAgentRun

Open weights just caught the coding frontier

Zhipu AI's GLM-5.2 — open weights under the MIT licence — beats GPT-5.5 on long-horizon coding benchmarks and lands within a point of Claude Opus 4.8, at roughly a sixth of the price. On coding, the open-source gap to the frontier has all but closed. On reasoning and the very longest tasks, it hasn't.

RAR Editor/18 Jun 2026/6 min read

Latest

See all →

News · Infrastructure

Opus 5 lands on AWS at half Fable price

Anthropic's most capable Opus yet ships on Amazon Bedrock this week with zero data retention — at half Fable 5's price.

25 Jul 2026/7 min read

News · Infrastructure

AMD bets $5bn on Anthropic to rival Nvidia

The chipmaker is putting up to five billion dollars into Anthropic and getting up to two gigawatts of new AMD AI chip capacity dedicated to Claude in return — its biggest push yet to become a real second supplier for frontier AI.

23 Jul 2026/5 min read

News · Local & Open

Qwen 3.6 outranks Gemma 4 on intelligence

Alibaba's 35B-A3B reasoning model scores six points higher than Google's 26B-A4B on Artificial Analysis's Intelligence Index. The catch: it costs nearly three times as much per million tokens.

22 Jul 2026/4 min read

Analysis · Local & Open

Stock these open models before political disruption hits

Two political-risk reports both flag widening post-election instability. A local weights library is the boring, effective hedge — and a weekend is enough to build it.

21 Jul 2026/7 min read

News · Models

Alibaba's Qwen 3.8 targets Kimi K3

The 2.4-trillion-parameter open-weight flagship previews through Alibaba's paid products at 10% of list — with public weights promised in the near term.

20 Jul 2026/5 min read

Analysis · Infrastructure

NVIDIA bets the agent era on one protocol

Six flagship creative studios wired AI agents into their tools at SIGGRAPH on Monday. The deeper play is where the silicon ends up.

20 Jul 2026/5 min read

Analysis · Pricing

Anthropic halves Fable 5 subscription limits

Pro users lose access outright and get a one-time $100 credit. The partial climb-down from pulling Fable entirely signals competitive pressure from OpenAI's cheaper GPT-5.6 Sol — and tells Pro subscribers to plan.

18 Jul 2026/5 min read

Analysis · Models

Most frontier AI is just more compute

An MIT CSAIL analysis of 809 large language models finds 80 to 90% of frontier performance is explained by scale alone — not secret recipes. What it means for OpenAI, Anthropic and the open-model race.

17 Jul 2026/5 min read

Analysis · Models

The frontier AI duopoly takes shape

Anthropic readies a $3 trillion listing, Meta fires the opening shot on price, and China moves to lock its top models behind borders. The market just hardened into a two-horse race — and the UK is on the outside looking in.

15 Jul 2026/5 min read

Analysis · Infrastructure

NVIDIA Vera targets the agent-loop bottleneck

NVIDIA's new 'max single-threaded CPU at scale' is built for persistent AI agents — not the queue of human requests. Industry analysts say Chinese data centres could have it in August.

13 Jul 2026/5 min read

How-to · Models

Try GPT-5.6 Sol for coding this afternoon

OpenAI's new flagship scores a point behind Claude Fable 5 on the Artificial Analysis Intelligence Index, costs a third as much per task, and tops the coding-agent chart.

10 Jul 2026/5 min read

News · Models

Grok 4.5 undercuts the frontier on cost

xAI's new model trails the frontier in benchmarks — but at $2 per million input tokens and a fifth of GPT-5.5's output price, the cost case writes itself for high-volume coding.

9 Jul 2026/6 min read

Thought Leaders

See all →

Simon Willison

Creator of Datasette, co-creator of Django · Independent

Independent researcher and prolific writer on practical LLM tooling, local models and the day-to-day craft of building with AI. Essential reading for anyone running models themselves.

Local Models LLM Tooling Open Source

Demis Hassabis

Co-founder & CEO · Google DeepMind

Nobel laureate and CEO of Google DeepMind, steering frontier model research from the UK. The clearest anchor point for Britain's sovereign-AI ambitions and the science-first frontier.

Sovereign AI Frontier Models Science

Andrew Ng

Founder, DeepLearning.AI · Managing General Partner, AI Fund · DeepLearning.AI

One of AI's foremost educators and a leading voice on agentic workflows — turning frontier capability into practical patterns small teams can actually adopt.

Agentic Workflows AI Education

Andrej Karpathy

Founder, Eureka Labs · ex-OpenAI, ex-Tesla AI · Eureka Labs

The field's clearest explainer — coined 'Software 2.0', 'Software 3.0' and 'vibe coding'. He turns each shift in how AI is built into a mental model practitioners actually use.

Agentic Engineering AI Education

Jensen Huang

Founder & CEO, NVIDIA · NVIDIA

Builds the compute the whole AI era runs on. His GTC keynotes set the industry's agenda — and in 2026 that agenda is the 'age of agents' and physical AI.

AI Infrastructure Agentic AI