Every discussion about AI infrastructure eventually turns into a story about Jensen Huang outsmarting the hyperscalers.
The narrative goes something like this: NVIDIA deliberately routed scarce GPUs to upstarts like CoreWeave and Crusoe, creating a new class of AI cloud providers that would prevent Microsoft, Amazon, and Google from monopolizing AI infrastructure.
It’s a compelling story. It’s also an incomplete one.
If you’ve spent time building or financing large-scale enterprise infrastructure, this doesn’t look like a battle. It looks like a capital allocation strategy that happened to benefit everyone involved — for a while.
The rise of the neoclouds wasn’t about defeating the hyperscalers. It was about solving a problem they all had.
The Balance Sheet Problem
Building infrastructure for an AI hardware cycle is fundamentally different from building traditional cloud infrastructure.
The hardware is expensive, demand is unpredictable, and the depreciation curve is unlike anything we’ve seen before. Buy a fleet of GPUs today, and there’s a good chance something significantly faster arrives before you’ve fully recovered your investment.
Public companies don’t like carrying that kind of uncertainty on their balance sheets.
This is where the neoclouds fit into the picture. Take Microsoft’s relationship with CoreWeave: Microsoft has historically accounted for well over half of CoreWeave’s revenue (62% in 2024, 71% in Q2 2025) – not because Microsoft couldn’t build the capacity itself, but because it didn’t want that capacity, and its depreciation schedule, sitting on its own books. A specialized provider takes on the risk of rapidly depreciating hardware, while the hyperscaler keeps investing in assets with much longer economic lives – land, fiber, power infrastructure, enterprise cloud platforms.
That structure has since been used well beyond Microsoft. OpenAI has stacked up roughly $22 billion in commitments to CoreWeave across three separate expansions, and Meta has committed to roughly $35 billion combined across two agreements of its own, most recently a $21 billion expansion. These are different customers solving the same problem in the same way: none of them wanted rapidly depreciating GPU fleets sitting on their own balance sheets, so they paid someone else to hold that risk.
Not every neocloud got there through the same door, either. Crusoe and CoreWeave both count as “neoclouds,” but they didn’t start from the same assets. CoreWeave built its position through GPU-collateralized debt. Crusoe, like several other entrants, grew out of the Bitcoin mining industry and repurposed cheap, already-built power infrastructure toward AI workloads once crypto economics soured. Different starting points, same underlying trade: someone besides the hyperscaler absorbs the capital risk of the hardware cycle.
The hyperscalers didn’t lose the first phase of the AI infrastructure race. They found a way to participate without absorbing all of its financial volatility.
Why This Worked
This model only works if the underlying assets retain value.
GPUs are unusual because they aren’t purpose-built appliances with a single use case. NVIDIA spent more than a decade building CUDA into the default software platform for AI development, which is why demand for GPU compute extends across research labs, startups, enterprises, and cloud providers rather than sitting locked to one buyer.
Developers weren’t choosing neoclouds because Jensen allocated them chips. They were choosing them because their existing software stacks already ran there. That’s what gave NVIDIA hardware a liquidity that few infrastructure assets enjoy, and that liquidity is what made the financing model possible in the first place. (Frameworks like PyTorch and Triton will keep chipping away at that advantage, but not this cycle.)
NVIDIA’s incentive here is simpler than the “multipolar strategy” framing suggests. NVIDIA doesn’t need Microsoft, Amazon, or Google to lose share , it just needs GPUs sold, wherever the check comes from. CoreWeave is the only major cloud provider buying from NVIDIA that isn’t also building its own competing silicon, which makes it a cleaner customer than the hyperscalers, not a rebel one. Backing the neoclouds wasn’t a play against Microsoft and Google. It was a way to keep selling chips to buyers with no reason to ever build their own.
The Case Against It
None of this means the model is free of risk, and it’s worth stating the bear case plainly, especially since two of its central assumptions cracked within the same few months.
The financing math is straining even as the backlog grows. CoreWeave’s Q1 2026 results, reported in May, showed revenue backlog reaching $99.4 billion — up nearly 50% in a single quarter. But that growth came with a GAAP net loss of $740 million and $536 million in net interest expense for the quarter, against total debt approaching $25 billion. Backlog is a measure of future revenue promised, not cash in hand, and the cost of carrying the debt that built the capacity is rising faster than the revenue that capacity has generated so far.
Customer concentration remains a structural overhang. Meta and OpenAI together represent close to two-thirds of CoreWeave’s guaranteed revenue backlog, and contract cancellation clauses tied to delivery schedules create real execution risk on both sides. If any one of those customers delays a buildout or scales back, the neocloud carrying that exposure feels it immediately – the risk didn’t disappear when it moved off the hyperscaler’s balance sheet, it just concentrated somewhere else.
And the assumption that hyperscalers would stay on the sidelines just failed, publicly, in real time. On July 1, 2026, Bloomberg reported that Meta is building a cloud business called Meta Compute to sell its own excess AI infrastructure – both raw GPU capacity and hosted model access – to outside customers, directly competing with the neoclouds it currently pays. The market didn’t treat this as a distant hypothetical: CoreWeave fell 14% and Nebius fell 17% the same day, even though Meta is a paying customer of both. Investors read it correctly – a customer with enough spare capacity to become a competitor changes the pricing power in the relationship, immediately.
That doesn’t make the underlying financing thesis wrong. Neoclouds still solve a real balance-sheet problem hyperscalers don’t want to carry. But it does mean two of the risks that were previously theoretical : debt service outpacing revenue, and hyperscaler self-competition, are now showing up in earnings reports and single-day stock moves, not just bear-case footnotes.
The Second Life of AI Infrastructure
One criticism of the neocloud model is that it depends on borrowing billions against hardware that inevitably becomes obsolete.
That assumes GPUs lose most of their value once the next generation arrives.
Reality is more nuanced. Training frontier models requires the newest hardware connected in enormous, tightly synchronized clusters. Inference is a different business. As models become more efficient and enterprises deploy them at scale, older GPU generations remain perfectly capable of serving inference workloads, fine-tuning models, and running enterprise AI applications. CoreWeave is already leaning on this: much of its long-term business model depends on running older Hopper-generation chips for years after the newest Blackwell and Rubin systems come online, selling that capacity to enterprises and startups who don’t need the bleeding edge.
The question isn’t whether older GPUs continue to have value. They almost certainly will. The question is whether that value exceeds the combined cost of financing, power, cooling, and operations.
That calculation will determine which neoclouds become durable infrastructure companies and which were simply vehicles for financing the first wave of AI demand.
The Next Bottleneck Isn’t GPUs
Much of the discussion around AI infrastructure still assumes GPUs are the scarce resource.
That was true. It’s becoming less true.
The next constraint is power. A company can buy thousands of the latest GPUs and still need hundreds of megawatts of reliable capacity, years of utility planning, transmission infrastructure, substations, cooling, and land that can support all of it. CoreWeave itself surpassed 1 gigawatt of active power in its Q1 2026 results, with total contracted power near 3.5 gigawatts — and getting there took years of buildout, not a single GPU purchase order.
Those assets aren’t built overnight.
This is where the hyperscalers have a structural advantage that’s much harder to replicate than access to silicon. Decades of investment in land, utility relationships, permitting, and physical infrastructure are becoming increasingly valuable as AI data centers grow larger.
The industry’s bottleneck is shifting from semiconductors to energy infrastructure. When bottlenecks move, so does leverage.
Net-Net
The AI cloud isn’t becoming multipolar because NVIDIA wanted to weaken the hyperscalers.
It became multipolar because a new class of providers solved a financing problem during the most capital-intensive phase of AI infrastructure buildout. The neoclouds absorbed risk that public cloud providers had good reasons not to carry on their own balance sheets. NVIDIA sold more GPUs. The hyperscalers gained flexibility. AI companies got access to compute when they needed it most.
That arrangement is now being tested from two directions at once, and they’re not the same threat. Power is a slow-moving constraint : it rewards whoever spent the last decade on utility relationships and permitting, which mostly still favors the hyperscalers over the neoclouds they fund. Hyperscaler self-competition is a fast-moving one – it doesn’t require new infrastructure at all, just a company like Meta deciding its existing excess capacity is worth more sold than idle. A neocloud can plan around the first. The second can reprice a company in a single trading session, as CoreWeave and Nebius just learned.
For the first phase of the AI boom, GPUs were the scarce resource, and the neoclouds won by absorbing financing risk the hyperscalers didn’t want. The next phase will be decided by two separate questions: who controls power, and whether the hyperscalers’ own spare capacity ends up competing with the companies that were built to serve them. Right now, neither question is settled — and the neoclouds don’t control the answer to either one.