It Was Never Jensen vs. the Hyperscalers. It Was a Balance Sheet Problem – And Power Is Next.


Every discussion about AI infrastructure eventually turns into a story about Jensen Huang outsmarting the hyperscalers.

The narrative goes something like this: NVIDIA deliberately routed scarce GPUs to upstarts like CoreWeave and Crusoe, creating a new class of AI cloud providers that would prevent Microsoft, Amazon, and Google from monopolizing AI infrastructure.

It’s a compelling story. It’s also an incomplete one.

If you’ve spent time building or financing large-scale enterprise infrastructure, this doesn’t look like a battle. It looks like a capital allocation strategy that happened to benefit everyone involved — for a while.

The rise of the neoclouds wasn’t about defeating the hyperscalers. It was about solving a problem they all had.

The Balance Sheet Problem

Building infrastructure for an AI hardware cycle is fundamentally different from building traditional cloud infrastructure.

The hardware is expensive, demand is unpredictable, and the depreciation curve is unlike anything we’ve seen before. Buy a fleet of GPUs today, and there’s a good chance something significantly faster arrives before you’ve fully recovered your investment.

Public companies don’t like carrying that kind of uncertainty on their balance sheets.

This is where the neoclouds fit into the picture. Take Microsoft’s relationship with CoreWeave: Microsoft has historically accounted for well over half of CoreWeave’s revenue (62% in 2024, 71% in Q2 2025) – not because Microsoft couldn’t build the capacity itself, but because it didn’t want that capacity, and its depreciation schedule, sitting on its own books. A specialized provider takes on the risk of rapidly depreciating hardware, while the hyperscaler keeps investing in assets with much longer economic lives – land, fiber, power infrastructure, enterprise cloud platforms.

That structure has since been used well beyond Microsoft. OpenAI has stacked up roughly $22 billion in commitments to CoreWeave across three separate expansions, and Meta has committed to roughly $35 billion combined across two agreements of its own, most recently a $21 billion expansion. These are different customers solving the same problem in the same way: none of them wanted rapidly depreciating GPU fleets sitting on their own balance sheets, so they paid someone else to hold that risk.

Not every neocloud got there through the same door, either. Crusoe and CoreWeave both count as “neoclouds,” but they didn’t start from the same assets. CoreWeave built its position through GPU-collateralized debt. Crusoe, like several other entrants, grew out of the Bitcoin mining industry and repurposed cheap, already-built power infrastructure toward AI workloads once crypto economics soured. Different starting points, same underlying trade: someone besides the hyperscaler absorbs the capital risk of the hardware cycle.

The hyperscalers didn’t lose the first phase of the AI infrastructure race. They found a way to participate without absorbing all of its financial volatility.

Why This Worked

This model only works if the underlying assets retain value.

GPUs are unusual because they aren’t purpose-built appliances with a single use case. NVIDIA spent more than a decade building CUDA into the default software platform for AI development, which is why demand for GPU compute extends across research labs, startups, enterprises, and cloud providers rather than sitting locked to one buyer.

Developers weren’t choosing neoclouds because Jensen allocated them chips. They were choosing them because their existing software stacks already ran there. That’s what gave NVIDIA hardware a liquidity that few infrastructure assets enjoy, and that liquidity is what made the financing model possible in the first place. (Frameworks like PyTorch and Triton will keep chipping away at that advantage, but not this cycle.)

NVIDIA’s incentive here is simpler than the “multipolar strategy” framing suggests. NVIDIA doesn’t need Microsoft, Amazon, or Google to lose share , it just needs GPUs sold, wherever the check comes from. CoreWeave is the only major cloud provider buying from NVIDIA that isn’t also building its own competing silicon, which makes it a cleaner customer than the hyperscalers, not a rebel one. Backing the neoclouds wasn’t a play against Microsoft and Google. It was a way to keep selling chips to buyers with no reason to ever build their own.

The Case Against It

None of this means the model is free of risk, and it’s worth stating the bear case plainly, especially since two of its central assumptions cracked within the same few months.

The financing math is straining even as the backlog grows. CoreWeave’s Q1 2026 results, reported in May, showed revenue backlog reaching $99.4 billion — up nearly 50% in a single quarter. But that growth came with a GAAP net loss of $740 million and $536 million in net interest expense for the quarter, against total debt approaching $25 billion. Backlog is a measure of future revenue promised, not cash in hand, and the cost of carrying the debt that built the capacity is rising faster than the revenue that capacity has generated so far.

Customer concentration remains a structural overhang. Meta and OpenAI together represent close to two-thirds of CoreWeave’s guaranteed revenue backlog, and contract cancellation clauses tied to delivery schedules create real execution risk on both sides. If any one of those customers delays a buildout or scales back, the neocloud carrying that exposure feels it immediately – the risk didn’t disappear when it moved off the hyperscaler’s balance sheet, it just concentrated somewhere else.

And the assumption that hyperscalers would stay on the sidelines just failed, publicly, in real time. On July 1, 2026, Bloomberg reported that Meta is building a cloud business called Meta Compute to sell its own excess AI infrastructure – both raw GPU capacity and hosted model access – to outside customers, directly competing with the neoclouds it currently pays. The market didn’t treat this as a distant hypothetical: CoreWeave fell 14% and Nebius fell 17% the same day, even though Meta is a paying customer of both. Investors read it correctly – a customer with enough spare capacity to become a competitor changes the pricing power in the relationship, immediately.

That doesn’t make the underlying financing thesis wrong. Neoclouds still solve a real balance-sheet problem hyperscalers don’t want to carry. But it does mean two of the risks that were previously theoretical : debt service outpacing revenue, and hyperscaler self-competition, are now showing up in earnings reports and single-day stock moves, not just bear-case footnotes.

The Second Life of AI Infrastructure

One criticism of the neocloud model is that it depends on borrowing billions against hardware that inevitably becomes obsolete.

That assumes GPUs lose most of their value once the next generation arrives.

Reality is more nuanced. Training frontier models requires the newest hardware connected in enormous, tightly synchronized clusters. Inference is a different business. As models become more efficient and enterprises deploy them at scale, older GPU generations remain perfectly capable of serving inference workloads, fine-tuning models, and running enterprise AI applications. CoreWeave is already leaning on this: much of its long-term business model depends on running older Hopper-generation chips for years after the newest Blackwell and Rubin systems come online, selling that capacity to enterprises and startups who don’t need the bleeding edge.

The question isn’t whether older GPUs continue to have value. They almost certainly will. The question is whether that value exceeds the combined cost of financing, power, cooling, and operations.

That calculation will determine which neoclouds become durable infrastructure companies and which were simply vehicles for financing the first wave of AI demand.

The Next Bottleneck Isn’t GPUs

Much of the discussion around AI infrastructure still assumes GPUs are the scarce resource.

That was true. It’s becoming less true.

The next constraint is power. A company can buy thousands of the latest GPUs and still need hundreds of megawatts of reliable capacity, years of utility planning, transmission infrastructure, substations, cooling, and land that can support all of it. CoreWeave itself surpassed 1 gigawatt of active power in its Q1 2026 results, with total contracted power near 3.5 gigawatts — and getting there took years of buildout, not a single GPU purchase order.

Those assets aren’t built overnight.

This is where the hyperscalers have a structural advantage that’s much harder to replicate than access to silicon. Decades of investment in land, utility relationships, permitting, and physical infrastructure are becoming increasingly valuable as AI data centers grow larger.

The industry’s bottleneck is shifting from semiconductors to energy infrastructure. When bottlenecks move, so does leverage.

Net-Net

The AI cloud isn’t becoming multipolar because NVIDIA wanted to weaken the hyperscalers.

It became multipolar because a new class of providers solved a financing problem during the most capital-intensive phase of AI infrastructure buildout. The neoclouds absorbed risk that public cloud providers had good reasons not to carry on their own balance sheets. NVIDIA sold more GPUs. The hyperscalers gained flexibility. AI companies got access to compute when they needed it most.

That arrangement is now being tested from two directions at once, and they’re not the same threat. Power is a slow-moving constraint : it rewards whoever spent the last decade on utility relationships and permitting, which mostly still favors the hyperscalers over the neoclouds they fund. Hyperscaler self-competition is a fast-moving one – it doesn’t require new infrastructure at all, just a company like Meta deciding its existing excess capacity is worth more sold than idle. A neocloud can plan around the first. The second can reprice a company in a single trading session, as CoreWeave and Nebius just learned.

For the first phase of the AI boom, GPUs were the scarce resource, and the neoclouds won by absorbing financing risk the hyperscalers didn’t want. The next phase will be decided by two separate questions: who controls power, and whether the hyperscalers’ own spare capacity ends up competing with the companies that were built to serve them. Right now, neither question is settled — and the neoclouds don’t control the answer to either one.

Systems Over Scale: What Bridgewater Teaches Us About the Enterprise AI Plateau


I have lost count of how many client conversations this year have gone the same way. Someone tells me the model isn’t accurate enough yet for what they want to do, and the plan is to just wait for the next release. GPT whatever. Claude whatever. Gemini whatever. Someone bigger and smarter is always around the corner, so why do the hard work now?

Bridgewater just published a paper that quietly pokes a hole in that thinking, and I think it deserves more attention than it’s getting outside finance circles.

They took an open weight model, Qwen3-235B, and ran it through a serious reinforcement learning and distillation pipeline built with Thinking Machines Lab. The result was 84.7% accuracy on their internal financial evaluation suite at a fraction of the inference cost of the big commercial models. Those are impressive numbers.

But the numbers aren’t really the story.

The story is how they got there.

Everyone’s first assumption will be that Bridgewater won because they have proprietary data nobody else has. Sure, that helps. But I think the more interesting thing they built is the feedback loop around the data, not the data itself.

They didn’t have their best investment people label every single example. That would be a waste of very expensive time. Instead they trained a baseline model on cheaper vendor-labeled data first. Only when the model disagreed with the vendor label did it get routed to an experienced investment professional for a second opinion.

So the expensive human judgment gets spent exactly where it matters most, on the cases that are genuinely ambiguous, not on the easy 90 percent that any reasonable process would get right anyway.

Then on the training side, they didn’t just keep distilling from the same fixed teacher model forever. The student gets promoted to teacher only once it proves it’s actually better on validation. That’s a small design choice that I think matters a lot. It keeps the whole system improving instead of plateauing around whatever the original teacher model was capable of.

If I had to boil the lesson down to one sentence, it’s this.

Good enterprise AI usually comes from a better feedback loop, not from more data or a bigger model.

I also think that’s why so many enterprise AI projects seem to plateau around the same accuracy range. Once you’ve exhausted prompt engineering and upgraded to the latest foundation model, the next gains usually don’t come from a smarter model. They come from better supervision, better routing, better feedback, and better systems.

Now, a few things I’d push back on if I were reviewing this paper with a client.

The cost savings headline needs a footnote. A 235 billion parameter model doesn’t run itself. You still need GPUs, batching, latency tuning, people who know how to keep the thing running. If you’re processing enormous volumes every day, owning that infrastructure can absolutely pay off. If your workload is lumpy or unpredictable, a commercial API that turns fixed infrastructure cost into a variable line item might still be the smarter bet.

This isn’t a universal answer. It depends entirely on how much you actually use the thing.

I’d also gently push back on the framing of “replicating expert judgment.” Many of the evaluated tasks focus on document segmentation, filtering, classification, and finding the needle in a haystack of financial text. That’s genuinely useful work and it saves analysts a ton of time. But it is not the same as a model independently coming up with a macro thesis or an investment idea nobody has had yet.

Parsing information well and synthesizing new insight are two different skills. I’d want any vendor or internal team to be honest about which one they’re actually selling me.

And specialization has a cost that doesn’t show up in the benchmark table. A model tuned tightly to today’s financial reporting formats and today’s regulatory language will need care and feeding when those things change, and they always change.

That’s not a knock on the approach. It’s just the maintenance bill nobody talks about until the invoice shows up.

A lot of IT organizations aren’t set up yet to treat retraining and re-distillation as an ongoing operational cost the same way they’d treat patching a production system.

Here’s where I land on all this.

The Bridgewater paper isn’t proof that the big frontier models are becoming irrelevant. It’s evidence that enterprise AI is becoming an architectural discipline.

The organizations that win won’t necessarily be the ones with access to the biggest models. They’ll be the ones that build the best systems around them.

Use specialized models for the high-volume, close-to-the-data work. Save expensive frontier reasoning for the small slice of problems that are genuinely hard and ambiguous.

That’s a tiered architecture. It’s a lot more work than pointing everything at one API. But it’s also a lot harder for a competitor to copy, and that’s usually the kind of advantage worth building.

Your AI Strategy Is About to Become the Most Expensive Commodity Bet in Your Company



In my last post, I argued that boards face a growing geopolitical risk in their AI strategy, that a single regulatory action or export control directive could sever access to a mission-critical workflow with zero warning.

If that is true, it raises an immediate question: why are so many enterprises doubling down on deep dependency on a small number of frontier providers?

The answer is economic. And it deserves immediate scrutiny.


It isn’t regulation that concerns me most right now.

It’s economics !!!

Many organizations are investing in AI as though the cost of intelligence will remain permanently scarce and permanently expensive. That assumption is beginning to break down, and the enterprises built on it are about to find out what happens when pricing assumptions change mid-cycle.

Think about signing a five-year cloud infrastructure agreement just before hyperscale providers permanently reset the economics of compute. You were not buying a lasting advantage. You were locking yourself into yesterday’s pricing model. Many boards are about to make that same mistake with AI.

There is an important nuance here. Not all AI dependency is equal. An enterprise with an orchestration and governance layer between itself and model providers is in a fundamentally different position from an enterprise that has wired core workflows directly to a single API. Optionality does not require full in-house control. It requires the ability to switch, and someone in your AI value chain must demonstrably own that capability. If you are using a managed services partner to run agentic workflows, the question is not just whether they are using the best model today. It is whether their architecture allows them to substitute models without rebuilding your workflows from scratch.

That distinction matters more than which model is currently running.


The Great Commoditization of Intelligence

For the past two years, the dominant assumption has been simple: frontier intelligence would remain scarce, and scarcity would justify premium pricing. That assumption is starting to break.

Across the ecosystem, capable models continue to improve at speed. Open-weight systems are advancing rapidly. Governments increasingly treat AI infrastructure as strategic capability rather than commercial software, which means they are actively funding alternatives to frontier providers.

China’s AI ecosystem illustrates this clearly. Systems such as DeepSeek, alongside models from Alibaba’s Qwen family, show that careful systems engineering and open-weight development can deliver highly competitive performance without relying exclusively on cutting-edge semiconductor supply chains.

Whether this is industrial policy or competitive pressure matters less than the outcome. The economics are shifting.

We have seen this before.

Servers became commodities. Operating systems became commodities. Cloud infrastructure became utilities.

Open source repeatedly compressed margins at the infrastructure layer while expanding value creation above it. AI is following the same trajectory.

That does not mean frontier innovation stops. It almost certainly will not. But it does mean boards need to ask a more uncomfortable question: will your business still pay frontier prices once today’s breakthrough becomes tomorrow’s baseline capability?


The Enterprise Contract Trap

Many CFOs are locking in multi-year AI agreements based on today’s economics. Frontier providers argue this is rational. They point to rapid capability gains in reasoning, multimodal systems, and autonomous agents as justification for sustained premium pricing.

They may be right. But that increases the risk rather than reducing it.

If capability leadership shifts every six to twelve months, long-term contracts become structurally fragile. Your architecture will evolve faster than your procurement cycle.

Every enterprise AI contract is implicitly a bet on scarcity. You are betting that the cost of intelligence will remain high enough to justify today’s terms. That is a historically fragile position in any technology that becomes foundational.

Some will argue open-weight systems are not meaningfully cheaper once governance, engineering, security, and compliance are included. That is true today. But it confuses current friction with structural cost.

Managed services compressed complexity in cloud infrastructure; managed inference is beginning to do the same for AI. The direction of travel matters more than the current state.

The honest board conversation is not “open-weight versus closed.” It is: at what capability and cost threshold does open-weight become the rational choice for each of our core workflows, and are we tracking that threshold over time?

Most organizations are not. Most managed service agreements do not surface it either, which means the question has to be owned explicitly at board level.

As intelligence becomes more accessible, advantage shifts away from the model and toward workflows, data, distribution, and execution.


The Metric Your Board Isn’t Measuring

Most boards still anchor on cost per million tokens. That is already the wrong unit.

Enterprise AI is moving into multi-agent systems where a single request triggers planning, reasoning, tool use, validation, and repeated self-correction before a result is produced. The user sees one interaction. The system may execute dozens of internal cycles.

This breaks the linear relationship between price and output. Token costs can fall while system costs rise if orchestration is poor.

The KPI that matters is not cost per token. It is cost per completed business outcome.

That requires measuring the full workflow: inference, orchestration, failure handling, human review, and retry logic.

It also exposes a second-order issue most boards do not yet track: whether their AI stack, or their partner’s AI stack, is genuinely provider-agnostic, or quietly optimized around a single vendor’s architecture.

That distinction matters because pricing and capability will not move in lockstep. When they diverge, lock-in becomes visible in operating costs rather than contracts.

An orchestration layer designed for model substitution is structurally more valuable than one optimized for a single provider, even if the single-provider version performs slightly better today.


The Geography Arbitrage Myth

A common response from executive teams is straightforward:

“We’ll build a team in Singapore. We’ll route inference through Dubai. We’ll separate legal entities.”

The assumption is that geography solves dependency risk. That assumption is weakening.

Regulators are increasingly focused on ownership, control, technology transfer, and operational influence rather than corporate domicile alone. Major economies are tightening rules around strategic technologies and data flows regardless of where a company is incorporated.

This does not mean global AI is splitting into fully isolated systems. But it is forming partially distinct ecosystems with increasing friction between them. For multinational companies, navigating those boundaries is becoming a capability, not an implementation detail — and it is doubly fragile when the underlying model economics are also in motion.


Where Competitive Advantage Actually Lives

None of this implies enterprises should abandon frontier AI systems. Many organizations will continue to rely on them for reliability, compliance, and legal protection. That is rational.

The mistake is assuming this architecture is permanent.

As intelligence becomes more accessible, advantage shifts away from the model and toward the things competitors cannot easily replicate: proprietary workflows, institutional knowledge, high-quality internal data, customer trust, and execution speed.

The same logic applies to AI partners. A partner whose value is deep integration with a single provider’s stack carries a different risk profile from one whose value is workflow design, domain expertise, and cross-model orchestration. Both can be valid choices. But they are not equivalent when infrastructure economics shift.

Models may become interchangeable. Operating systems of work will not.


Every Major Technology Shift Follows The Same Pattern


The scarce, expensive thing becomes abundant. The companies that treated scarcity as strategy lose. The companies that built differentiated capability on top of abundance win.

The geopolitical risk in the first post can sever AI access overnight. The economic risk in this post can erode AI advantage over time. Both point to the same conclusion: resilience comes from optionality, not dependency.

The question for your board is no longer which AI model to bet on. It is this: what happens to your competitive position when intelligence becomes widely available and competitively priced — and how quickly can you adapt when it does?

The biggest mistake a board can make today is assuming intelligence will remain the scarce asset.

The companies that win this cycle will be the ones that build everything around the moment it stops being one.