Frontier providers are rationing.

Anthropic 98.95% uptime

Claude API uptime over the 90 days ending April 8 was 98.95%. That's ~5x the industry-standard outage budget for a cloud service. Enterprise customers started switching to OpenAI for reliability — WSJ.

Peak-hours rate limits

In late March 2026 Anthropic capped Pro/Max session limits during weekday peak hours (5-11am PT). Power users hit 5-hour limits in 20 minutes. Claude Code's prompt-cache TTL was cut from 1 hour to 5 minutes, inflating quota burn for long sessions.

$9B → $30B in 4 months

Anthropic's annualized revenue run-rate roughly tripled from end-2025 to March 2026. Data centers take 1-2 years to build. The math means the rationing is structural for at least the next 12-24 months.

OpenAI killed Sora

In April OpenAI shut down Sora's consumer app to redirect GPU cycles toward coding and enterprise. Token throughput on the OpenAI API went from 6B/min in October to 15B/min by end of March.

Consumption billing

Anthropic moved enterprise customers from flat-rate seats to consumption-based token billing in April 2026 — and killed the 10-15% volume discounts that applied to larger accounts. Spending commitments now apply whether you use them or not.

$4.08/hr Blackwell

The spot price for one hour of an Nvidia Blackwell GPU rose 48% in two months to $4.08 (per the Ornn Compute Price Index). CoreWeave is requiring 3-year contracts from smaller customers.

The grid can't take the load.

Compute is the visible bottleneck. Underneath, the binding constraint is electricity. The All-In podcast has been beating this drum for months. They're right.

“There's no such thing as a dark GPU right now. Every GPU that's being put in a data center is getting used.”
— David Sacks, All-In Podcast
“We are absolutely compute constrained. … It moves from an AI race to a power race.”
— Chamath Palihapitiya, All-In Podcast
PJM capacity prices: 9.3x

The Eastern US's capacity auction cleared at 9.3x the prior year's price for the 2025/26 service period — and hit the federal price cap in the most recent auction. Households in the PJM region are seeing ~15% bill increases attributed to the data center buildout.

Shortages projected by 2027

PJM Interconnection — the largest US regional grid — is projecting supply shortages as early as 2027 if data center demand continues growing at current pace. EPRI expects data centers to consume up to 17% of US electricity by decade end.

56 GW going off-grid

46 planned US data centers totaling 56 GW will bypass the grid entirely with on-site generation. They're tired of waiting for hookups. Tech companies signed a White House “Ratepayer Protection Pledge” in March 2026.

$281 utility bills

Northern Virginia residents reported January 2026 electricity bills triple their previous norm — $281 vs ~$100 — and ~75% in a state survey blame data centers. Local political pushback is mounting.

Capacity online: 2027+

Anthropic's 1 GW Google TPU deal comes online “starting 2027.” OpenAI's 2 GW AWS Trainium deal is multi-year. Even with capital pouring in, the build timelines mean today's rationing posture is structural through 2027 at minimum.

Transmission bottleneck

Even where generation is sited, transmission capacity hasn't kept pace. Nationwide 5-year peak-load growth expectations rose from ~24 GW in 2022 to ~150 GW in 2025. Interconnection no longer guarantees deliverability.

None of this affects a Mac on your desk.

Cloud constraints compound. Each one — capacity, power, transmission, billing — multiplies the others. A local 32B model on a Mac Mini M4 Pro is on the wrong side of all of them.

The trade is real: open-weight models still trail frontier APIs on the hardest reasoning tasks. But for coding, drafting, agentic workflows, and most everyday uses, that gap is now small enough that the asymmetry above wins.

What you actually need.

The hedge is only useful if it works on the first try and stays up for the long run. That's a tooling problem.

The rational hedge has a toolchain.

DevPulse is the menubar app + CLI for running local AI on a Mac without OOM.

Download for macOS

macOS 14+ · Apple Silicon & Intel · Free during launch