The End of Compute Scarcity? Not So Fast

I think there are two big questions hitting AI markets right now:

If $SpaceX(SPCX)$ (via xAI) and Meta are all of a sudden turning around and renting out their compute capacity, are we really in a compute crunch or is the system starting to fill up with excess?
What does the rise of open source models and the end of tokenmaxing mean for Anthropic and OpenAI?

On point 1 - SpaceX famously created ~$2.32B of monthly revenue by selling (ie renting out) about 450k of GPUs to Anthropic, Google and Reflection. Details below.

SpaceX lands another computing deal, this time with Reflection, an open source model development company. $150m / month for GB300s. SpaceX the Neocloud! Deal 1 with Anthropic Colossus 1 and Colossus 2. Anthropic took all of Colossus 1 $1.25b / month ~325k total chips, split

9:43 AM · Jun 22, 2026 · 49.9K Views

4 Replies · 8 Reposts · 120 Likes

Then this week that $Meta Platforms, Inc.(META)$ is planning to start a cloud computing business selling compute capacity. Looking at both of these trends (SpaceX and Meta selling capacity) you can build a bear narrative:

There’s excess compute capacity that two of the largest buyers no longer need. This in turn translates to the end of “scarcity of compute”
This is the canary in the coal mine for downward capex revisions for the hyperscalers
Maybe demand simply isn’t there for what’s being built with AI (Zuckerberg also said this week AI agent tech is developing slower than anticipated)

I don’t think this bear thesis has legs for a few reasons. First, let’s look at some of the details of these deals. Meta hasn’t actually sold anything yet. On SpaceX (xAI), all of their deals are VERY short term in nature. Each side (both seller and buyer) have a 90 day “out” where at any point in the contract they can tap out and stop paying (or providing the compute) within 90 days. The price SpaceX was able to charge was also extraordinarily high. I believe SpaceX structured the deals this way because they plan on using the compute themselves in the future. And the buyers (Anthropic, Google, Reflection) were willing to pay such high prices because they couldn’t get the capacity anywhere else.

I also think you need to look at who’s selling the capacity (SpaceX and Meta). Dario had a great segment on a Dwarkesh podcast talking about the economics of compute buildouts. In summary, there’s broadly two ways to use compute: training and inference. And there’s a magic ratio of training / compute that economically makes sense for any given business (and that ratio will differ by company). Let’s look at either extreme. If you spend a lot of money on compute, and then use 100% of it for training, you won’t have any left to sell inference and make money. Or if you use 100% of it for inference you’ll have no capacity left for training and you’re R&D progress is kneecapped and stuck in time. Neither of these make sense, so there is some middle ground each comapny should deploy.

Looking at both SpaceX (xAI) and Meta. The unfortunate reality is their models were struggling…xAI basically lost their entire team and usage of their models fell off a cliff. They themselves had no demand for their own models, so they couldn’t even use their own compute to sell inference. This mean they had 100% of their compute on the shelf for training, but were also in process of rebuilding their entire team (I’m speaking in absolutes, but of course there’s more nuance than this, but I think it’s directionally accurate).

And as I mentioned above, using 100% of your compute for training is a tough economic proposition. You’re going to spend a TON of money on the build, but then generate no revenue on the backend. Not a great trade. So what did they decide? Again, I’m speaking in conjecture here… They decided until they actually had models to sell, it made economic sense to rent out their capacity so they could bring money in and balance the cashflow equation. And the short term nature of these compute deals validates this - they want a 90 day out to take the capacity back when they’re ready for it. So on SpaceX selling compute, this felt like an idiosyncratic issue they themselves faced. AND the price they were willing to charge supports that scarcity still exists.

On Meta - we don’t exactly know what they’re plans are or what the deals could look like (so it’s hard to really comment), but we do know their own AI efforts have been struggling. Llama models have fallen behind meaningfully (especially compared to chinese open source models), and the team / morale issues they’re facing are well documented. This also could be a “we spent a ton of money on this compute, but aren’t generating enough revenue on it, let’s sell some capacity until we get our act together.” Again, we don’t actually know what their plans are so it’s hard to comment. I also think it’s entirely likely they see the business of AWS, Azure and GCP and just want to get into that game.

So in summary, I really don’t think there is any signal here on “scarcity ending” as from my vantage point there is no excess compute in the system. Anyone who’s willing to sell capacity (hyperscalers, neolabs, etc) finds buyers immediately. So not to be too glib about it, but I think these deals are nothingburgers on the “scarcity is ending” comments. AI laggards are just wanting to turn a cost center into a revenue generating opportunity.

Then we have point two. Does the rise of open source models and the end of token maxing spell the end of Anthropic and OpenAI’s reign?? No, I don’t think so, but the world is certainly evolving.

Yippit data also supposedly points to no slow down in the labs revenue trajectory (but to be fair, if there’s going to be an effect it may just not have hit the P&L yet).

My main reaction to this “fear” is that we have to break down AI usage into two buckets

Who is generating the tokens
Who is generating the revenue

It would seem these two should be directly correlated, but I actually see a future emerging where 80% of the tokens go to “low cost budget” options (like cheaper smaller open source models), and 20% of the tokens go to the “frontier SOTA” tokens from the labs (where these tokens are way more expensive). In this scenario, I think the 20% will drive a significant portion of the revenue (because those tokens will be more expensive and for more mission critical work). To date, the “expensive” tokens have capture all of hte market share AND wallet share, but I see these bifurcating in the future. And this is a good and healthy evolution. But I deeply believe there will always be a market for frontier tokens, and even if that market is a smaller percentage of total tokens, it will be a big percentage of revenue.

To some extent we're already seeing something like this show up in enterprise data. Ramp's latest AI Index shows the top 1% of firms spending ~$7,500 per employee per month on AI while the median firm spends $11.38 (which is a huge ~680x gap). The spend is super concentrated in a small group of power users doing serious, mission-critical work, which is the group I’d imagine reaches for the frontier tokens. The tokens probably eventually spread out across cheap open source options, but the dollars aggregate / cluster at the top, which is a pattern I’d expect to hold.

And then there’s the end of tokenmaxing. This is ABSOLUTELY a trend. BUT - I think the tidal wave is too big. There will certainly be a flushing of wasteful spend that will provide a short term headwind to spending on OpenAI / Anthropic. BUT I think the macro trend of spending more on intelligence is so large it will overwhelm any optimizations. I do realize this paragraph is starting to sound a lot like 2021 when a prevailing attitude was “yes software stocks have high multiples, but they’re growing so fast that their growth will overwhelm the inevitable multiple compression.” (I probably even said something like that myself…). I’m saying something similar here - I think we’re so early in the AI S-Curve that optimizations along the way will look like small blips on the overall trend.

I think there’s a bigger risk to the large labs that I’ll write about next week. A broader trend of “own your weights” is emerging that provides an opportunity (and challenge) for the big labs.

Disclaimer: Investing carries risk. This is not financial advice. The above content should not be regarded as an offer, recommendation, or solicitation on acquiring or disposing of any financial products, any associated discussions, comments, or posts by author or other users should not be considered as such either. It is solely for general information purpose only, which does not consider your own investment objectives, financial situations or needs. TTM assumes no responsibility or warranty for the accuracy and completeness of the information, investors should do their own research and may seek professional advice before investing.

463

Report

Comment

Top
Latest

No comments yet

To The Moon

The End of Compute Scarcity? Not So Fast

Comment

Most Discussed