Latest

Dispatches from the frontier

Weekly analysis, honest takes, and hidden gems. No engagement bait.

Discovery2 minJul 9, 2026

Muse Spark 1.1 Replaces Qwen3.7-Max in helloai's Frontier Set

Meta ships Muse Spark 1.1 with a public API at $1.25/$4.25. helloai drops Qwen3.7-Max and adds Meta as its budget agentic pick at 1487 Elo.

Read article →

Discovery2 minJul 9, 2026

Grok 4.5 Ships as xAI's Default Chat Model

xAI launched Grok 4.5 on July 8 — 83.3% on Terminal-Bench 2.1, 4.2× token efficiency, and $2/$6 pricing. helloai's tracked Grok entry moves from 4.3 to 4.5.

Read article →

Discovery2 minJul 9, 2026

GPT-5.6 Sol Reaches General Availability

OpenAI ends the partner-only preview on July 9. GPT-5.6 Sol replaces GPT-5.5 in helloai's tracked set at the same $5/$30 rate — the upgrade path teams waited on since June 26.

Read article →

Analysis2 minJul 4, 2026

How Caching and Batching Cut Frontier API Costs by 90%

helloai's leaderboard shows nominal per-token rates. Prompt caching and batch APIs stack underneath — turning a $5/MTok flagship into $0.25 on repeated context.

Read article →

Opinion2 minJul 4, 2026

Ollama, Hermes, and GLM-5.2:cloud Are Rewriting the Margin Map

One command routes a full agent harness to MIT-licensed weights at Ollama Pro prices. That stack does not kill frontier labs, but it is eating their easiest revenue.

Read article →

Analysis2 minJul 4, 2026

Frontier Models Score Under 1% on ARC-AGI-3

ARC-AGI-3 launched March 25 with humans at 100% and frontier AI at 0.51%. GPT-5.5 and Opus 4.7 barely move the needle — exposing a gap arena Elo cannot see.

Read article →

Opinion2 minJul 1, 2026

Frontier Releases Now Run Through Government Gates

Fable 5 returns globally July 1 after an 18-day suspension; GPT-5.6 Sol ships only to vetted partners. June 2026 made frontier availability a policy variable, not just an engineering milestone.

Read article →

Discovery3 minJun 22, 2026

Grok 4.3 Lands on Amazon Bedrock

xAI's frontier model is now GA on Amazon Bedrock with the same $1.25/$2.50 pricing, 1M context, and configurable reasoning — the cheapest tracked frontier model just became enterprise-deployable.

Read article →

Analysis3 minJun 22, 2026

Best Model for Coding Right Now

The tracked frontier spans 28 Elo points on coding, but input prices run 4x and output prices 12x. Here is how helloai's scoring picks a winner for engineering work — and when to trade rank for cost.

Read article →

Analysis2 minJun 13, 2026

Fable 5 Suspended: Government Directive Removes Mythos from All Users

Three days after public launch, a US export control order forced Anthropic to disable Fable 5 and Mythos 5 worldwide. A narrow reported jailbreak on vulnerability discovery triggered the recall.

Read article →