Living Document · Updated as new articles publish

AI
Strategy
Frameworks
& Views

A running analysis of AI's winners and losers — covering infrastructure, company outlooks, key predictions, and the strongest counterarguments.

Last Updated

May 11, 2026

►

Core Thesis on AI

Three interlocking claims

AI Is an Infrastructure War

Value flows to physical chokepoints and demand aggregators — not model builders or traditional software vendors.

Infrastructure Determines Everything

The binding constraints on AI are physical: fabs, memory, power — not algorithmic. $700B+ in annual hyperscaler CapEx is the investment phase of a multi-decade build-out. A chip shortage ~2029 is near-certainty.

AI Is Sustaining for Incumbents

Google's ads get more targeted. Meta's feed gets sharper. Shopify's merchants get more tools. The "AI startup kills incumbent" narrative is mostly a startup funding mechanism, not serious analysis.

Aggregation Theory Strengthens

AI increases supply of content, code, and services. Whoever controls the demand side — users — gets stronger. Commoditized supply empowers aggregators above all others.

►

Infrastructure & Memory Crisis

Physical bottlenecks

Physical Constraints Drive Everything

HBM memory and TSMC fab capacity are the real binding constraints — not model quality or software capability.

Company2026 CapExAssessment

Google$175–185BJustified. Cloud 48% growth, 30% margins. Direct ad payoff is the clearest case.

Meta$135B+Most enthusiastic. Ad ROI is most direct and measurable of any hyperscaler.

MicrosoftPaused →Pausing was a mistake. Right to redirect compute to own products vs Azure third-party.

Amazon$200BCautious. CapEx exceeds projected operating cash flow. Requires compute commoditization.

Combined>$700BAnnual hyperscaler capital expenditure

The Memory Crisis

HBM demand for AI is crowding out consumer electronics across the entire industry simultaneously.

Sony — considering delaying PS6 to 2028–2029.

Samsung — reviewing contracts quarterly instead of annually.

Chinese OEMs — cut 2026 shipment targets by up to 20%.

Nintendo — making customers supply their own SD cards.

Valve — delisted Steam Deck in the US.

Inference Architecture — Three Distinct Markets

Training

Serial steps, massive parallelism within each step. Every GPU must sync before the next step.

Key constraint: HBM + networking

Winner: Nvidia

Answer Inference

Human waits for response. Token speed = user experience. Decode is serial and memory-bandwidth bound — KV cache + model weights read per token.

Key constraint: bandwidth & latency

Winners: Cerebras, Groq

Agentic Inference ★

No human in the loop. Latency is irrelevant overnight jobs. Needs context, state, history — KV cache, DRAM, SSDs, databases, embeddings. CPU speed for tool use matters more than GPU speed.

Key constraint: memory hierarchy

Market: Largest by far — scales with compute, not humans

Nvidia launched Dynamo (inference disaggregation framework) and standalone memory/CPU racks to stay relevant in agentic workloads — but cost and simplicity increasingly favor alternatives. China has everything it needs for agentic inference despite lacking leading-edge chips.

Quarterly CapEx Tracker

Hyperscaler Capital Expenditure — Q1 2024 to Q1 2026

USD billions · Per company · Quarterly

Sources: Alphabet, Meta, Amazon, Microsoft earnings releases (Q1 2026, reported Apr 29 2026). Some quarters derived from annual totals where quarterly disclosures unavailable. Microsoft converted to calendar quarters; Q2ʼ25 derived from FY25 annual total. ★ 2026E ÷4 = full-year management guidance divided by 4 (quarterly equivalent): Google $180–190B, Meta $125–145B, Amazon $200B, Microsoft $190B.

►

Key Predictions

10 forward-looking calls

PREDICTIONS — CONFIDENCE LEVEL

Massive chip shortage ~2029 due to TSMC's conservative capacity expansion relative to AI demand.

High

HIGH

Microsoft exits first-party console hardware entirely. Xbox becomes a publishing and Game Pass brand.

High

HIGH

Memory prices squeeze consumer electronics for years — phones, PCs, and consoles all simultaneously.

High

HIGH

Government demands control of AI regardless of who funded its development. Inevitable, not normative.

High

HIGH

The AI-native business model — the "feed" equivalent — has not yet been invented. OpenAI's banner ads are primitive.

High

HIGH

OpenAI must find product-market fit in 2026 or face commoditization. Shallow weekly usage is not a moat.

Medium

MED

Netflix eventually acquires both Paramount and Warner Bros. — possibly simultaneously — within the decade.

Medium

MED

AI shopping benefits long-tail merchants (Shopify, Etsy) more than Amazon, by surfacing niche products agents can discover.

Medium

MED

PE opportunity in beaten-down SaaS — buy at low multiples, restructure for profitability, run as cash generators.

Medium

MED

EA's long-term play is live sports rights — a single platform for watching, playing, betting, and fantasy.

Low–Med

LOW

Agentic inference will become the largest compute market — and it won't look like today's GPU clusters. Memory hierarchy (DRAM, SSDs, databases) beats raw bandwidth; CPU speed for tool use matters more than GPU speed; latency is irrelevant without a human in the loop. Nvidia's dominance is training + answer inference, not agentic.

High

HIGH

►

The Strongest Objections

Where this analysis may be wrong

Five Arguments Against This View

His framework is internally consistent — but these counterpoints deserve serious weight.

"Sustaining" is what incumbents always believe right before disruption

IBM embraced the PC. Newspapers launched websites. The sustaining phase is real and temporary. The analysis itself admits the AI-native business model hasn't been invented. You can't call the game at halftime.

$700B in CapEx with no proven native model rhymes with the telecom bubble

WorldCom and Global Crossing had real technology and real revenue too. Cisco lost 90% despite being the "picks and shovels" play. When CapEx cycles crack, supply-chain winners get hit too.

TSMC is a monopolist exercising pricing power, not a conservative actor

Gross margins expanded from ~53% (2022) to ~58–60% (2026). Supply constraint is deliberate. The thesis can't have TSMC be both admirably rational and problematically insufficient.

Google's AI Mode destroys its own query volume over time

Better answers mean fewer follow-up queries. AI Mode costs 5–10x more per query. Agents won't click ads. Antitrust stripping default status puts $40–60B in annual revenue at risk.

The thin client thesis may be architecturally backwards

Apple Intelligence runs on-device. NPUs with 40+ TOPS are shipping at scale. Round-trip latency (50–200ms) makes real-time agents unusable in many scenarios. Privacy regulation pushes local.

The Key Unexamined Axiom

"AI capabilities will continue to improve on a predictable trajectory."

This is treated as axiomatic. S-curves are only identifiable in retrospect. If improvement stalls: $700B+ CapEx becomes the largest misallocation in history, SaaS recovers, and supply-chain plays crash. This is the scenario this framework fails to model.

AIStrategyFrameworks& Views

AI Is an Infrastructure War

Infrastructure Determines Everything

AI Is Sustaining for Incumbents

Aggregation Theory Strengthens

Physical Constraints Drive Everything

Five Arguments Against This View

"Sustaining" is what incumbents always believe right before disruption

$700B in CapEx with no proven native model rhymes with the telecom bubble

TSMC is a monopolist exercising pricing power, not a conservative actor

Google's AI Mode destroys its own query volume over time

The thin client thesis may be architecturally backwards

The Key Unexamined Axiom

AI
Strategy
Frameworks
& Views