Living Document · Updated as new articles publish

AI
Strategy
Frameworks
& Views

A running analysis of AI's winners and losers — covering infrastructure, company outlooks, key predictions, and the strongest counterarguments.

Last Updated
May 11, 2026
Core Thesis on AI
Three interlocking claims

AI Is an Infrastructure War

Value flows to physical chokepoints and demand aggregators — not model builders or traditional software vendors.

01

Infrastructure Determines Everything

The binding constraints on AI are physical: fabs, memory, power — not algorithmic. $700B+ in annual hyperscaler CapEx is the investment phase of a multi-decade build-out. A chip shortage ~2029 is near-certainty.

02

AI Is Sustaining for Incumbents

Google's ads get more targeted. Meta's feed gets sharper. Shopify's merchants get more tools. The "AI startup kills incumbent" narrative is mostly a startup funding mechanism, not serious analysis.

03

Aggregation Theory Strengthens

AI increases supply of content, code, and services. Whoever controls the demand side — users — gets stronger. Commoditized supply empowers aggregators above all others.

Infrastructure & Memory Crisis
Physical bottlenecks

Physical Constraints Drive Everything

HBM memory and TSMC fab capacity are the real binding constraints — not model quality or software capability.

Company2026 CapExAssessment
Google$175–185BJustified. Cloud 48% growth, 30% margins. Direct ad payoff is the clearest case.
Meta$135B+Most enthusiastic. Ad ROI is most direct and measurable of any hyperscaler.
MicrosoftPaused →Pausing was a mistake. Right to redirect compute to own products vs Azure third-party.
Amazon$200BCautious. CapEx exceeds projected operating cash flow. Requires compute commoditization.
Combined>$700BAnnual hyperscaler capital expenditure
The Memory Crisis

HBM demand for AI is crowding out consumer electronics across the entire industry simultaneously.

Sony — considering delaying PS6 to 2028–2029.

Samsung — reviewing contracts quarterly instead of annually.

Chinese OEMs — cut 2026 shipment targets by up to 20%.

Nintendo — making customers supply their own SD cards.

Valve — delisted Steam Deck in the US.

Inference Architecture — Three Distinct Markets
Training
Serial steps, massive parallelism within each step. Every GPU must sync before the next step.
Key constraint: HBM + networking
Winner: Nvidia
Answer Inference
Human waits for response. Token speed = user experience. Decode is serial and memory-bandwidth bound — KV cache + model weights read per token.
Key constraint: bandwidth & latency
Winners: Cerebras, Groq
Agentic Inference ★
No human in the loop. Latency is irrelevant overnight jobs. Needs context, state, history — KV cache, DRAM, SSDs, databases, embeddings. CPU speed for tool use matters more than GPU speed.
Key constraint: memory hierarchy
Market: Largest by far — scales with compute, not humans
Nvidia launched Dynamo (inference disaggregation framework) and standalone memory/CPU racks to stay relevant in agentic workloads — but cost and simplicity increasingly favor alternatives. China has everything it needs for agentic inference despite lacking leading-edge chips.
Quarterly CapEx Tracker
Hyperscaler Capital Expenditure — Q1 2024 to Q1 2026
USD billions · Per company · Quarterly
Sources: Alphabet, Meta, Amazon, Microsoft earnings releases (Q1 2026, reported Apr 29 2026). Some quarters derived from annual totals where quarterly disclosures unavailable. Microsoft converted to calendar quarters; Q2ʼ25 derived from FY25 annual total. ★ 2026E ÷4 = full-year management guidance divided by 4 (quarterly equivalent): Google $180–190B, Meta $125–145B, Amazon $200B, Microsoft $190B.
Key Predictions
10 forward-looking calls
PREDICTIONS — CONFIDENCE LEVEL
01

Massive chip shortage ~2029 due to TSMC's conservative capacity expansion relative to AI demand.

High
HIGH
02

Microsoft exits first-party console hardware entirely. Xbox becomes a publishing and Game Pass brand.

High
HIGH
03

Memory prices squeeze consumer electronics for years — phones, PCs, and consoles all simultaneously.

High
HIGH
04

Government demands control of AI regardless of who funded its development. Inevitable, not normative.

High
HIGH
05

The AI-native business model — the "feed" equivalent — has not yet been invented. OpenAI's banner ads are primitive.

High
HIGH
06

OpenAI must find product-market fit in 2026 or face commoditization. Shallow weekly usage is not a moat.

Medium
MED
07

Netflix eventually acquires both Paramount and Warner Bros. — possibly simultaneously — within the decade.

Medium
MED
08

AI shopping benefits long-tail merchants (Shopify, Etsy) more than Amazon, by surfacing niche products agents can discover.

Medium
MED
09

PE opportunity in beaten-down SaaS — buy at low multiples, restructure for profitability, run as cash generators.

Medium
MED
10

EA's long-term play is live sports rights — a single platform for watching, playing, betting, and fantasy.

Low–Med
LOW
11

Agentic inference will become the largest compute market — and it won't look like today's GPU clusters. Memory hierarchy (DRAM, SSDs, databases) beats raw bandwidth; CPU speed for tool use matters more than GPU speed; latency is irrelevant without a human in the loop. Nvidia's dominance is training + answer inference, not agentic.

High
HIGH
The Strongest Objections
Where this analysis may be wrong

Five Arguments Against This View

His framework is internally consistent — but these counterpoints deserve serious weight.

"Sustaining" is what incumbents always believe right before disruption

IBM embraced the PC. Newspapers launched websites. The sustaining phase is real and temporary. The analysis itself admits the AI-native business model hasn't been invented. You can't call the game at halftime.

$700B in CapEx with no proven native model rhymes with the telecom bubble

WorldCom and Global Crossing had real technology and real revenue too. Cisco lost 90% despite being the "picks and shovels" play. When CapEx cycles crack, supply-chain winners get hit too.

TSMC is a monopolist exercising pricing power, not a conservative actor

Gross margins expanded from ~53% (2022) to ~58–60% (2026). Supply constraint is deliberate. The thesis can't have TSMC be both admirably rational and problematically insufficient.

Google's AI Mode destroys its own query volume over time

Better answers mean fewer follow-up queries. AI Mode costs 5–10x more per query. Agents won't click ads. Antitrust stripping default status puts $40–60B in annual revenue at risk.

The thin client thesis may be architecturally backwards

Apple Intelligence runs on-device. NPUs with 40+ TOPS are shipping at scale. Round-trip latency (50–200ms) makes real-time agents unusable in many scenarios. Privacy regulation pushes local.

The Key Unexamined Axiom

"AI capabilities will continue to improve on a predictable trajectory."

This is treated as axiomatic. S-curves are only identifiable in retrospect. If improvement stalls: $700B+ CapEx becomes the largest misallocation in history, SaaS recovers, and supply-chain plays crash. This is the scenario this framework fails to model.