NVIDIA Is Expanding the AI Factory and Narrowing the Competitive Window
Our integrated post-GTC framework on token economics, platform strategy, and why the unit of competition has moved up the stack.
Executive takeaway. GTC 2026 reinforced our view that NVIDIA is shifting the competitive field away from raw performance and toward the economics (TCO) of the AI factory as a whole. We believe the buying decision is actively shifting to cost per token, tokens per second per watt, deployment speed, utilization, and revenue per megawatt. On those terms the moat widens materially, because the advantages compound across the full stack rather than resting on any single component. The Vera CPU racks and Groq LPX racks sit inside that framework: they expand NVIDIA’s direct addressable market, but the more important contribution is that they occupy the adjacencies competitors had hoped to use as entry points. When we look at the post-GTC story in its entirety, we see something broader than faster silicon. We see a company that is systematically extending its control of the AI factory, and the competitive attack surface is getting narrower with each product cycle.
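To make the decision metrics above concrete, here is a minimal back-of-envelope sketch of how they relate. Every input number is an illustrative placeholder we chose for the example, not an NVIDIA or market figure; the point is only the arithmetic linking capex, power, and utilization to cost per token, tokens per second per watt, and revenue per megawatt.

```python
# Hypothetical AI-factory unit economics. All inputs are illustrative
# placeholders, NOT disclosed NVIDIA or customer figures.

def factory_metrics(capex_usd, amort_years, power_mw, power_cost_per_mwh,
                    tokens_per_sec, price_per_m_tokens, utilization):
    """Derive the three decision metrics from basic deployment inputs."""
    secs_per_year = 365 * 24 * 3600
    tokens_per_year = tokens_per_sec * utilization * secs_per_year
    annual_capex = capex_usd / amort_years                  # straight-line
    annual_power = power_mw * power_cost_per_mwh * 24 * 365  # MWh * $/MWh
    cost_per_m_tokens = (annual_capex + annual_power) / (tokens_per_year / 1e6)
    tokens_per_sec_per_watt = tokens_per_sec / (power_mw * 1e6)
    revenue_per_mw = (tokens_per_year / 1e6) * price_per_m_tokens / power_mw
    return cost_per_m_tokens, tokens_per_sec_per_watt, revenue_per_mw

# Placeholder deployment: $500M capex over 4 years, 10 MW, $80/MWh power,
# 50M tokens/sec peak, $1 per million tokens, 60% utilization.
cost, tps_w, rev_mw = factory_metrics(
    capex_usd=500e6, amort_years=4, power_mw=10, power_cost_per_mwh=80,
    tokens_per_sec=50e6, price_per_m_tokens=1.0, utilization=0.6)
```

Note how, under these assumptions, capex amortization dwarfs the power bill, which is why utilization and deployment speed move cost per token more than electricity prices do.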
NVIDIA disclosed visibility into more than one trillion dollars of Blackwell and Rubin purchase orders through 2027. That figure covers only GPU systems and likely the associated networking, but not all networking. It does not include the Groq LPX low-latency inference racks, standalone Vera CPU systems (which Jensen said could be a multi-billion-dollar business), storage, or the enterprise RTX data center business, all of which are incremental. Our work suggests the total revenue opportunity through 2027 sits meaningfully above current consensus, though the exact magnitude depends on how quickly the newer product categories ramp and how broadly customers adopt the full five-rack factory architecture. This maps to one of our key questions and research topics: what is the attach rate of Vera CPU racks and Groq racks to Vera Rubin superpods, and in what ratio?
In line with NVIDIA’s theme of “extreme co-optimization,” the centerpiece of GTC was a complete five-rack AI factory design. NVIDIA now ships five distinct rack types, all liquid-cooled and sharing common power and cooling assumptions: the Vera Rubin NVL72 GPU rack, the Vera CPU rack, the Groq 3 LPX rack, the BlueField-4 STX storage rack, and the Spectrum-6 SPX Ethernet rack. Ratio and attach rate aside, when you look at the reference superpod, the point is not only the sheer compute density (more than 1,100 Rubin GPUs, over 2,500 LPUs, and roughly 1,400 Vera CPUs in a single deployment) but how deliberately that compute is organized as one system. What this shows is how NVIDIA’s roadmap is evolving into a tightly integrated design in which the whole system, not the individual chip, is the unit of compute.
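The quoted superpod counts already imply rough attach ratios, the very question flagged earlier. A quick sanity check, treating the quoted figures as approximate floors rather than precise configuration data:

```python
# Attach ratios implied by the reference superpod counts quoted above
# (>1,100 Rubin GPUs, >2,500 LPUs, ~1,400 Vera CPUs). Approximations only.

pod = {"rubin_gpus": 1100, "lpus": 2500, "vera_cpus": 1400}

lpu_per_gpu = pod["lpus"] / pod["rubin_gpus"]       # roughly 2.3 LPUs per GPU
cpu_per_gpu = pod["vera_cpus"] / pod["rubin_gpus"]  # roughly 1.3 CPUs per GPU
```

If the shipping configurations hold anywhere near these ratios, LPX and Vera CPU attach would be a meaningful multiplier on GPU-rack revenue, which is why we treat the attach rate as a key research variable rather than a rounding item.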
We view the strategy here not purely as incremental revenue from new categories, but as NVIDIA turning adjacent layers of the stack into extensions of the broader platform. This is not portfolio expansion for its own sake: these layers become more valuable when designed to work together, improving the performance and economics of the overall system while reducing the number of viable entry points for competitors. What were once potential wedges for alternative vendors increasingly become integrated parts of the NVIDIA architecture, and that materially raises the bar for competition.
The full report goes significantly deeper on all of these dimensions, including specific revenue estimates by product category, the monetization framework for LPX and Vera, a detailed competitive map, the optical transition sequence, infrastructure bottleneck analysis, and what would need to go wrong for the thesis to weaken.
What subscribers get in the full report
• Revenue bridge by bucket for LPX, Vera CPU, storage, and networking through 2027, with assumptions and confidence intervals for each category
• LPX monetization framework mapping three value creation paths (ARPU expansion, utilization uplift, workflow throughput) to specific buyer segments and workloads
• Vera CPU attach economics including our estimate of CPU-resident compute share in agentic workloads and why customers prefer NVIDIA-supplied CPU over mixed-vendor alternatives
• Competitive map with three explicit buckets: where hyperscaler ASICs still make sense, where merchant GPU competitors remain relevant, and where NVIDIA is raising the bar with platform integration
• Software TCO analysis tying CUDA, Dynamo, Spectrum-X800, BlueField STX, DSX Air, and NemoClaw into measurable deployment economics
• Optical transition sequencing with supply chain estimates for optical engine demand, OCS revenue trajectory, and the path to the $80-100B AI optical TAM by 2030
• Five detailed watchpoints with risk scenarios explaining what a negative outcome on each variable would mean for the broader thesis
• Stock catalyst framework identifying the three proof points that move NVIDIA from a strong-quarter story to a durable-platform story, with near-term datapoints to track
• Full superpod architecture breakdown with GPU, LPU, and CPU counts per pod, generational performance comparisons, and factory output metrics per gigawatt