Zitron is begging for a collapse at this point. Yes, his macro analysis correctly identifies a massive financial risk but his incessant pessimism completely misses the incredible ground-level utility that many of us on HN celebrate every day through undeniable, massive productivity gains.
At this point I'm trying to believe there's a middle ground where the level of individual capability this unlocks, leads to major discoveries.
He has also consistently demonstrated, at least to me, that he doesn't really understand how inference works from a technical perspective, which weakens much of his core thesis for why there should be a collapse.
I do value having some naysayers in the mix generally, because we do need balanced critique in what is otherwise a very frothy hype cycle. I just don't think he's making sound arguments, and that's even assuming you even agree with his premises in the first place.
My biggest gripe with his napkin math is that he treats inference gross margins as something novel that you can't compare to normal SaaS margins. He's right in part: the constant carousel of R&D costs from model training, related infrastructure buildout, and other adjacent costs required to stay competitive do change the analysis a bit.
But he takes this way too far when he says this is structurally different from normal SaaS margins. The business model definitely doesn't look like Dropbox, but it absolutely looks a lot like AWS, especially early AWS, CDNs, telecom, etc. I can speak to the telecom bit personally, since it's been over half of my professional career as an engineer and, in this specific case, also as a founder. You can have a brutally capital-intensive infra business where profitability depends on utilization, oversubscription, peak-capacity planning, segmentation, and recovering capex over time.
The math he presents gets even more questionable as we see explicit segmentation happening for cost-saving reasons. Many forward-thinking orgs are waking up to the fact that they don't need to use the best, most expensive model for every task. They can route easier tasks to cheaper models, use caching, batch non-urgent workloads, and reserve frontier models for the subset of work that actually needs frontier intelligence. That directly undermines his claim that providers always need to chase frontier intelligence in order to maintain current demand, utilization, and pricing curves.
He’s been continuously predicting that the collapse was just around the corner, that progress was slowing, and that there was no market for inference, since 2024.
The fact he’s never reflected on the glaring failures in his analysis tells what we need to know about his intellectual integrity. There’s truth in some of his words about financial risk, but if you can’t acknowledge that there’s upside too, you can’t evaluate risk properly either.
Do you think it's not slowing? Do I miss anything really important?
My understanding is that we have now is incremental improvement on thinking models which appeared more than a year ago. Of course, a breakthrough might happen, but I don't see one yet.
Productivity is not value. It's quite possible for you to experience productivity improvements, and actual value to not be created. That is what I think the most robust data is showing.
That's possible, sure. But I think the answer is more likely in the numbers, not in just qualitatively saying AI isn't worth anything. Like if I pay $30k for an ounce of gold, I got value. Gold is worth something. But that amount of gold wasn't worth what I spent.
EDIT: In fact, parent comment has a link to some numbers.
[EDIT: Most] people don't want to go through the numbers. Ok. But there's a history here. When people don't want to see the numbers, certain kinds of things tend to happen.
I've posted numbers that indicate that productivity is becoming decoupled from value delivery. If you follow the link in my comment it reviews a pretty robust study of 4000 teams over 2 years. There is no product throughput increase.
Code acceleration is great, but.... something precedes that. Vision and strategy re. expansion of offerings and businesses. Once a firm reaches maturity in what it offers and is only touching the edges - this code acceleration is literally useless when you factor in all of the trade-offs.
This is a good thing - it means fat and slow incumbents are sitting ducks to be out-witted by creative and imaginative founders, which is healthy for a well-functioning economy.
Now the economics of existing frontier models are not sustainable - its looking like a mix of the airline (supersonic vs subsonic) and EV industry with China in the background providing decent offerings at much lower prices.
Also, supposed productivity gains are dubious. I personally experience at best no productivity gains when using LLMs to write code, and sometimes it's an active drain on my productivity. There was that one study a year or so ago showing similar results. People are trying to say the productivity gains are there and undeniable, but that is not true. It is very much a subject of controversy whether AI helps productivity.
I can see an argument that the productivity gains are illusory / don’t translate to economic productivity. I’m not denying the possibility.
However, most of the engineers I respect have gone from being skeptics a year ago to convinced today. I don’t personally know any true holdouts any more. If there are studies that disprove productivity gains more than six months ago, I’m happy to believe that it was true of the AIs that were available at the time. But I’m going to need something much more recent before I disbelieve my lyin’ eyes where it pertains to the AIs available today.
There is an observational study that was published in March 2026 that followed 4000 teams over 2 years. It shows, in my view, exactly that the productivity gains don't translate into economic value.
I do not disagree with what you are saying, but I honestly still believe that most of the utility we experience are honestly gonna become very boring very soon that we can just run local... Even if it's a bit more slow who cares, can just run in background while you work on other stuff yourself, read up on things, review other work...
It's not that the utility of it put in question. What is however a giant question mark is how the heck any of the big AI companies are ever gonna get that ROI? Given how many of us are becoming more and more fine with local models that run just fine especially on a good enough computer which most developers have anyway...
Even more dangerous to the big 2 AI companies is the fact that the 20 different Chinese companies are catching up fast and for a lot lower cost.
Why should someone pick Opus 4.8 when Qwen3.7 Plus produces similar results for about 1/20th the cost.
That sort of pricing disparity is across the board. But further it's becoming more and more apparent that they are doing more with less parameters. That's what's giving the local models their super powers.
Even if we assume that everything you said holds true, how is that we as a crowd can make viable a service that eats some $300bn annually in infrastructure costs? Where would that money come from? Most tech companies these days are cutting their AI budgets because the per token pricing is killing them.
Zitron’s gonna keep playing the hits. People pay him money for his brain damaged takes on AI and he’s certainly not going to stop now. He is getting better at saying stuff that’s not immediately disprovable though so good on him.
I really like some good drama slop that reads like a thriller, it is entertaining.
I don't take any of it THAT serious, but lately with the IPOs that are about to hit the indizes, he has gained a lot of attention.
If you look around the internet, most people publish a negative angle on something and then extrapolate it into some grand conspiracy, which is really captivating.
Its crazy when you enter some echo chamber you never engage with (movies, gaming, art/comics) and they have their own head cannon for why the world is bad and collapsing. It puts your echo chamber into perspective to see the same patterns of argumentation and presentation spin out in a different way
Every day people here debate whether or not there are any actual productivity gains from LLM, and it's only in the limited context of software development. While I understand that this place obviously skews heavily towards the software industry, the notion that LLMs are anywhere near as useful in other industries is hubristic (at best).
Yes. Zitron has been predicting and begging for collapse since 2024. It's not just his brand at this point. It's his entire identity. As such, he cannot back down, he cannot question himself, and he cannot accept any other viewpoint. And he will keep moving his goal posts until something happens that can make him go "aha! I told you guys!!"
This, combined with his extreme ignorance, makes him unreadable. The only reason people read his stuff is because it validates and confirms their own anti-AI beliefs. It's why every time he publishes an article, it reaches the front page in an hour or less.
How are they undeniable? They're very deniable. One example is the (seemingly) increasing maintenance costs for AI-generated code[1]. Another is the cost incurred by everybody reading AI slop instead of actual communication.
I don't have hard data as to whether these cancel out the benefits, but it's not as rosy as some seem to think.
[1] After years of people understanding that LOC is not only a poor productivity metric but also a negative indicator of code quality (shorter code for the same thing is better), we now have people touting how many LOC their LLM agent is generating. It's like everyone forgot what LOC actually represents and what it means for long term maintenance costs.
No, he's not, he's making tons of money every month from his Substack subscriptions. In fact, the AI bubble popping would be the worse thing ever for him, he would be out of a job.
Just like the who have predicated the US dollar will collapse any-moment-now and which pushed gold for decades.
Funny how people always say "oh, you are an AI lab, of course you are going to hype AI", but never "oh, you make sooo much money from predicting the collapse of the AI bubble..."
Before you spend 20 minutes reading this article, it's worth understanding that the writer has been posting popular but consistently wrong takes for 2+ years (e.g. https://www.wheresyoured.at/peakai/ from March 2024) arguing that AI is failing, is a waste of money, is bad, will never work, etc.
Can you point to anything specific from the article that you'd describe as consistently wrong? Not disagreeing with you, but nothing popped out to me after skimming the article.
I didn't read the posted article (I don't read this author anymore because I think it's basically anti-AI ideological propaganda).
But from the article I linked back in March 2024:
"Generative AI models are expensive and compute-intensive without providing obvious, tangible mass-market use cases. Murati and Altman's futures depend heavily on keeping the world believing that development and improvement of their models' capabilities will continue a rapacious pace of progress that has unquestionably slowed, with OpenAI admitting that GPT-4 may be worse on some tasks.
As I've written before, hallucinations are a feature not a bug. These models do not "know" anything. They are mathematical behemoths generating a best guess based on training data and labeling, and thus do not "know" what you are asking it to do. You simply cannot fix them. Hallucinations are not going away."
Since then:
- hallucinations are dramatically less of a problem
- several mass market use cases have emerged, most notably coding
> I believe that artificial intelligence has three quarters to prove itself before the apocalypse comes, and when it does, it will be that much worse, savaging the revenues of the biggest companies in tech. Once usage drops, so will the remarkable amounts of revenue that have flowed into big tech, and so will acres of data centers sit unused, the cloud equivalent of the massive overhiring we saw in post-lockdown Silicon Valley.
We have seen 8 quarters since. Has any of that come to pass?
if you can't predict when it will pop then you should really not predict anything. I can also predict that Google will pop. I won't tell you when but I'll tell you that it will. I'll remain thoroughly unfalsifiable and I'll keep pushing the dates.
The quality of AI doomerism takes is matched only by the quality of AI boosterism takes. Ed's kind of interesting as a temperature sensor but I don't feel like you can really take anything he writes seriously.
Yeah they seem clickable because anything Anti-AI is a bit soothing right now, but he is constantly wrong and usually is pushing the angle of "these businesses aren't even profitable!"
Instantly close the tab as soon as the popup to subscribe to his newsletter pops up.
What if you phrase the question from "will AI ever be useful" (a term as utterly vague as "IT") to "will it ever be able to promise the financial gains these companies are hoping? Especially with local models eating their lunch :shrug:
Zitron is in the business of content creation and not successful predictions. It doesn't matter how many times he (and several others around) will say the end is here, they have to be right only once.
BTW, one thing for sure he is right about are the economics, as of today there is no way these massive investments are gone be paid.
Ed is an interesting character. His financial analysis of the AI industry makes logical sense to me (though I am not knowledgeable enough to actually know if it is correct.) However, he seems to be so angry at AI in general, that he misses the obvious areas where LLMs are actually changing the State of the Art.
Coding seems to be one of the core use-cases for LLMs (as Simon Willison pointed out recently) and even if that's the only real use-case for LLMs, they're wildly useful. I do understand that useful != profitable and that's where I think Ed has a real point: until inference becomes much cheaper these companies cannot be profitable. Some mega-players will pay the API token price, but most will not.
I don't think whether "LLMs are actually changing the State of the Art" or not matters for anything he wrote.
If the AI companies need $X billion in revenue to stay afloat, it doesn't matter if 0.5% or 5% or 50% of that revenue is from transforming the State of the Art. It's 100% irrelevant: what matters is that, transformation or no, these companies won't have the income to pay their bills. And if they can't pay their bills, a whole lot of other companies can't either.
So again, transformation or no, it's still a house of cards waiting to collapse. The only thing that would change that is not more "transformation" ... it's a feature set that lets them multiply their current user base (or multiply how much they charge them) several times over.
He's got subscribers. Maybe the attitude is one he's found plays well with them.
I find it quite refreshing in some ways. Lots of people, when they start complaining about this or that aspect of this AI stuff, are wont to add in a little disclaimer that, despite all of the above, they actually really like AI and use it all the time. I assume this is to avoid the scenario of a bunch of pragmatic builders turning up and calmly shipping nuance in the comments (or whatever you call it these days when you get brigaded by a pile of angry keyboard warriors with chips on their shoulder) - and it sure is tiring having to wade through the equivocation.
That's a criticism that'd be hard to level at Zitron! Say what you like about the man, but he's unafraid to appear to take a side.
> until inference becomes much cheaper these companies cannot be profitable. Some mega-players will pay the API token price, but most will not.
This is often repeated but comes from ignorance mostly. You have * zero * reason to believe inference is costly other than just vibes. If you go by data and intuitions - the margins are high.
This kind of thinking really reinforces my belief that people have no idea and are using this whole [AI is not profitable and too costly] thing as a cathartic way to deal with immense progress.
I find it difficult to separate this piece’s tone from its content. The tone puts me off and makes it hard for me to judge it on its merits, despite some of the arguments seeming sound and well supported.
Ed's posts are peak preaching to the choir, they're usually factually correct but he is really bad at convincing anyone who doesn't already strongly agree with him.
Have you seen his recent Bloomberg appearance? He's calm, collected, and matter-of-fact -- the complete opposite of how he presents himself on his newsletter and podcasts, but with the same argument. You wouldn't know from listening to him how spicy he usually is.
It's tuned to the audience. Bloomberg was traditionally for people who actually wanted information. People who were fallible and had limited knowledge.
Of course that mentality is obsolete. Now we all have infinite access to perfectly correct information via the internet.
Perhaps that’s it. I would tend to agree with his position, I think, but don’t appreciate being preached to. Even less so when I agree with what’s being said.
You can disagree. Sarcastically, or otherwise. But I think you may be reading more into my comment than I put there.
I’m not attacking the piece. I’m not saying it’s right. I’m not saying it’s wrong.
What I’m saying is, the tone made it hard for me to judge the arguments fairly, despite finding some of them convincing. And as much as I dislike it, persuasion does partly depend on how an argument is made.
Agreed. I am open to the possibility of the bubble bursting or whatever, but this piece is like 3,000 words and cites everything as evidence the sky is falling. It's just as bad as the pro-AI grifters, just in the other direction.
Does the truth normally lie somewhere in the middle of it all?
Probably. Although I feel more inclined to forgive Ed in this case because it's sort of fighting fire with fire, the insanely hyperbolic and obscenely misleading drivel that's coming out of the most ardent AI boosters is continually unchallenged in the public eye. In a world where we had a more realistic view of AI/ML/LLMs, the limits to its capabilities, and the negative externalities of its widespread adoption in places where it quite frankly does not belong, then I'd be more critical of the Chicken Little sort of writing style
Some people seem to see the world only through bubbles. But if you look at human history, despite the ups and downs, we have a trajectory; generally speaking, human-created systems evolve toward ever-increasing complexity, impact, and efficiency.
The current wave of AI unlocked language - the tools are now speaking and understanding. This, on its own, is astonishing progress. Language is the foundation of our culture and society; it is the very technology that got us, as a species, to where we are today. To have tools that can understand, manipulate, and produce it is a massive leap forward.
Once you see things that way, it is clear that we are not in a bubble; we are in a transition. Yes, there is tons of hype and over-investment, but the demand is real, and so is the impact. Unless you are deep in the tech and have that structural depth, it is easy to dismiss. This is like the invention of the personal computer, but with 100x the impact and speed.
The only "bubble" with AI is that the initial build out is cyclical, and many of the high flying chip stocks with no software arms (ala Nvidia's CUDA) will come back to Earth. I think anyone that thinks AI is going away or won't have massive impact (though maybe not in the doomsday scenario) are in complete denial.
RTFA; it's not about AI's massive impact or lack thereof ... it's about these businesses not having a viable business model that will sustain them (beyond the next couple years).
I think Zitron's problem is he's equating AI to OpenAI and Anthropic. I'd agree with him that both those businesses are in a dangerous position given how fast they've burnt through cash. However, that's not the entirety of the industry and there are a lot smaller labs doing more for a lot less capital.
The business model does appear to be viable for these labs. But that viability comes because they aren't wasting a bunch of R&D money developing worthless products like AI video production.
I admit, I didn't read the whole article; I read a few paragraphs and extrapolated the mindset from which the author operates.
Regarding your comment about the business model—the people in Silicon Valley are not stupid. They know the playbook; we've seen it with social networks. The issue isn't the business model itself; it's that these companies need to dominate the market, and the big players are competing for that on a global scale. It's the exact same playbook that played out in financial systems and social networks, and now it's happening with AI. Once these technologies are deeply integrated into enterprises and the global economy, these players will dominate the market for decades to come.
I can assure you, the people running those companies are smarter than you, me, and the author of this article."
What I suspect isn't that AI goes somewhere, but I do think that the cutting edge companies like Anthropic and OpenAI are in a very precarious position. They don't have very much of a moat and the competition has been catching up quick while spending a lot less doing so. IMO, the main thing keeping them alive right now is name recognition.
If I were to make a prediction, it's that ultimately these cheaper models are going end up eating their lunch. I don't think they'll make back the money they've invested and once that reality hits investors, those two companies are sunk.
That, however, is not the end of AI. Nor will it be the end of Nvidia/micron/etc. It will more just be a localized bubble pop that doesn't eliminate the product from the market.
It is not just about cheaper models; it is about integration with the economy.
These models are building deep integrations into companies and the entire economy. Once that stabilizes, it will be like the electricity grid—pumping tokens to fuel decision-making across the entire global society. Good luck unplugging from that.
Furthermore, there is a massive geopolitical aspect to it: those who are already on the Western financial and technical stack will get integrated even deeper now.
> These models are building deep integrations into companies and the entire economy. Once that stabilizes, it will be like the electricity grid—pumping tokens to fuel decision-making across the entire global society. Good luck unplugging from that.
Much like the electric grid, what we are seeing is a convergence on standard APIs. For example, most of these cheaper models are hosted using APIs compatible with OpenAI. It's not a matter of rewiring your electric plug to work with a different socket standard, instead it's just the process of plugging it into a new socket.
> Furthermore, there is a massive geopolitical aspect to it: those who are already on the Western financial and technical stack will get integrated even deeper now.
Certainly the Chinese models appear to be some of the best when it comes to competition, but they aren't the only ones. There are European models and other US based models which all run for cheaper.
The moat is the infrastructure and lock-in. Similar to AWS or anything else. Small data centers can't compete, and similarly people without massive compute won't be able to either (at least not on the enterprise level.) You might get a few edge models, but for huge businesses they will be using OpenAI and Anthropic (and Google/Microsoft/Amazon, etc).
The biggest competitors aren't small models, they are just the traditional players that already have an "in" with enterprises. That I think will start to show its face once this initial round of buildout is complete, which may not be for another 5+ years.
I disagree. Mainly because those small models are exactly what erode away the moat of needing a giant data center. Those smaller models have been proving themselves to not be far of from the SOTA models.
As OpenAI and Anthropic look to raise their prices, businesses will be much more compelled to looking at cheaper models. And if the narrative is "do the same as you did with OpenAI at 1/20th the cost" that's going to sell to a lot of businesses.
It certainly cuts into what exactly these companies can sell in general. For example, if I wanted to integrate AI into a product I'd almost certainly not chose OpenAI or Anthropic. That's because they are simply way too expensive and what they'd give me is a lot less. We've actually ran into just this. We needed a classifier for a lot of records, we picked a free model because, as you can imagine, we didn't need something as good as what OpenAI and Anthopic offered and free works.
> The current wave of AI unlocked language - the tools are now speaking and understanding. This, on its own, is astonishing progress. Language is the foundation of our culture and society; it is the very technology that got us, as a species, to where we are today.
Buried lede (if the title is the actual promise), the sources don't seem to back the title either. Someone with more patience can correct me if I accidentally missed a bombshell anyway.
Edit:
> If you’re wondering what the story is, [...] I expect it to be out in the next two weeks [...] I can guarantee you it’ll be worth it, and you’ll be stunned by what I report.
Ok, this takes clickbait to new lows. The headline is trying to sell the teaser here, with very limited meat in the middle of the sandwich.
His rhetoric is a bit obsessive and frankly biased against AI.
That said, I think his voice is useful as a counter to the mainstream opinion.
Given the amount of investments, approaching AI from the angle of economics seems correct.
We all have some level of personal experience using AI/LLMs, both chatbots and coding tools, and I personally enjoy using them, but I am sure this experience is relevant in this discussion.
I also enjoy luxury hotels, gourmet food, jet skis and helicopters, but this is not something I indulge in often because of the cost-utility ratio.
The real cost of AI may or may not be lower than its utility. The bet is that utility is increasing while cost is falling.
Whenever I read these kind of articles about AI financials, I'm reminded of identical screeds I read about Uber a few years ago. They were angrily insistent that Uber was a scam company run by criminals and charlatans and could never, ever become profitable or make money for its investors. It was a house of cards that would come crashing down sooner or later, and take everyone's money with it. Now it's 2026. Uber still exists, has revenues of $50bn and is apparently a highly profitable business. I don't know if the original investors have made their money back yet, but Uber certainly hasn't collapsed.
Maybe AI is different. Certainly, the level scale of investment is on a different order of magnitude. But I'm wary of believing anything about the financial impossibility of AI being sustainable when I've seen such similarly confident arguments proved wrong in the past.
Uber used the classic triple-E philosophy of Microsoft and entered a market that was ripe for disruption -- many cities lacked reliable taxi service entirely, others were cartels that fixed prices. They undercut prices to an extreme degree, subsidized fares, and when it either drove local taxi companies out of business and spurred widespread adoption as the default, it had a captive market and duopoly with Lyft which allowed them to raise fares without losing any market share whatsoever.
It's a pretty classic business strategy, and not directly comparable to any of the AI companies. There's a reason people compare the current situation to the dotcom era and not Uber. Also, don't take Uber as an example of a slam-dunk VC success story and leave it at that -- plenty of dumb ideas get pitched and funded and go bankrupt for every Uber.
Absolutely. Even these days, Uber really only has one or two viable competitors. With any 3rd one in a far distant 3rd. Meanwhile, swapping which AI I’m using is as easy as clicking a dropdown. Hardly comparable to a physical car ride.
Yeah, people forget the risk to Uber was real in the early days. If municipalities had enforced their taxi laws, the company would have died and all those millions invested would have been lost (or pivoted into something else).
It was only because Uber successfully bulldozed over all regulations that it was able to succeed ... and that was hard to predict before it happened.
Ed Zitron speaks to a particular type of angry tech conservative. He’s not speaking truth or exposing anything. He’s the soothing voice the tech nerds of yesterday year are yearning for.
The angry polemic that goes on and on and on with cuss words used liberally is just meant to evoke emotion and cathartic resolution to the type of people mentioned above. Not truth.
The thing is, there are a lot of people that find comfort in what he’s writing - primarily because it’s a coping mechanism against how quickly things are moving and a way to deal with being left behind. When you spend time, years, building institutional knowledge and making a whole identity out of it, you obviously will feel bad with the threat of it being commoditised.
I would write against the content of the article but I find it easier and more illuminating to write what he has said before instead. Then it shows how incorrect the guy has been and with what confidence he keeps speaking with.
I'm collecting many kinds of predictions Ed Zitron made so that you can see for yourself whether he has a good track record.
-------
> While complex, generative AI is a technology that probabilistically generates answers, and has no "intelligence." It is inherently limited by its architecture, and in turn can only get "better" in a linear fashion. I see no signs that the transformer-based architecture can do significantly more than it currently does.
He wrote this in 2024 before reasoning models came out. Remember how ChatGPT was in 2024? Do you think this person is someone who gets predictions right?
> Furthermore, I hypothesize a race to the bottom in generative AI will significantly hamper OpenAI's ability to expand revenue, compounded by the fact that we're approaching the limits of transformer-based architecture.
He wrote this in 2024 and since then Anthropic's revenue increased by 160x to $40 B dollars a year and OpenAI's increased by 6x. Do you think this person gets predictions right still?
> I believe we're reaching the upper limits about what generative AI can do and how accurate its outputs can be,
He wrote this in 2024, do you really think we have reached upper limits? Huh?? What I'm using today is significantly more accurate and 2 tiers above what we had.
> And if there are true industry-changing possibilities waiting for us on the other side, I am yet to hear them outside of the fan fiction of Silicon Valley hucksters.
He says this about AI when we have with all honesty have had industry changing possibilities like agentic coding.
> There are indications that consumers have also lost interest. As pointed out by Alex Kantrowitz’ Big Technology newsletter, traffic to ChatGPT on both mobile and web has started to stagnate, if not decline. In January 2024, ChatGPT had 1.6 billion visits — 11% below the all-time peak of 1.8 billion. This makes it only modestly more popular than Bing, which had 1.3 billion unique visits during that period. On the mobile front, ChatGPT has an estimated 6.3 million US users — or 1.7 times less than the total of new Snapchat users added during Q4 2023.
He agrees with the claim that the consumer interest has declined. Since he said this, there was a 9x growth in active users.
"A.I bubble is bursting with Ed Zitron" (1 year back)
He's been constantly crying bubble for years now.
-----
> AI video won’t get truly fixed just by waiting a year.
This is what he had said in 2024, and you just need to compare video from then and now to check whether the predictions came true. Why would anyone trust what this guy has to say?
It's like someone arguing that cheese isn't real. Yes I can go to the grocery store and take a picture of cheese and show it, but what's the point? They can live in their own world. It doesn't change any of our lives. The world is what it is.
Lol... in this case, cheese imports from China are much cheaper, just not quite as good.
And for those who are all "but dur CCP get all ur data" you can use things like AWS Bedrock (at least for earlier versions of Deepseek and Qwen for now) and have more familiar people get all your data. Or buy (at obnoxiously inflated prices) your own HW and not send your data to anyone.
The funniest part of this is that people are often talking about how LLMs are now writing 100% of their code, then also saying that they don't want to expose their code to foreign government exfiltration by using foreign models.
But, uh, if an LLM is writing 100% of your code you have no actual secret sauce to hide from anyone, so why worry about it.
Perfect for idea people. All the value is in the prompt. Ideas are important, not execution. A decade or two ago, they would have been looking for a technical co-founder.
I think we're going to see a lot of craziness in the future in this regard. Not just "secrets", but hypocrites trying to copyright and patent all the AI outputs. All kinds of rabid attempts at constructing monopolies for every half-baked idea they have tried to utter as a prompt.
Meanwhile, like I think you suggest, I would assume everyone can generate similar outputs themselves. The idea that you can claim priority on your dream prompt and lock up the market on prompt responses sounds delusional to me. It's not novel invention when you're spit-balling at the same level of abstraction as every fantasy/scifi writer who ever was.
So I also have doubts about the sustainable business model. How long will it take for this fantasy to unravel, as people discover they cannot monetize their AI outputs as much as they dreamed, and in turn cannot afford to pay the AI services they use?
My absolute nightmare is that this becomes a "too big to fail" thing and oppressive/fascist governments decide to back full regulatory capture. That instead of letting it unwind, they grant and support enforcement of an increasingly absurd and arbitrary copyright/patent regime to support this monetization scheme.
At this point I'm trying to believe there's a middle ground where the level of individual capability this unlocks, leads to major discoveries.
I do value having some naysayers in the mix generally, because we do need balanced critique in what is otherwise a very frothy hype cycle. I just don't think he's making sound arguments, and that's even assuming you even agree with his premises in the first place.
My biggest gripe with his napkin math is that he treats inference gross margins as something novel that you can't compare to normal SaaS margins. He's right in part: the constant carousel of R&D costs from model training, related infrastructure buildout, and other adjacent costs required to stay competitive do change the analysis a bit.
But he takes this way too far when he says this is structurally different from normal SaaS margins. The business model definitely doesn't look like Dropbox, but it absolutely looks a lot like AWS, especially early AWS, CDNs, telecom, etc. I can speak to the telecom bit personally, since it's been over half of my professional career as an engineer and, in this specific case, also as a founder. You can have a brutally capital-intensive infra business where profitability depends on utilization, oversubscription, peak-capacity planning, segmentation, and recovering capex over time.
The math he presents gets even more questionable as we see explicit segmentation happening for cost-saving reasons. Many forward-thinking orgs are waking up to the fact that they don't need to use the best, most expensive model for every task. They can route easier tasks to cheaper models, use caching, batch non-urgent workloads, and reserve frontier models for the subset of work that actually needs frontier intelligence. That directly undermines his claim that providers always need to chase frontier intelligence in order to maintain current demand, utilization, and pricing curves.
Could you share what tells about it? I.e. where he was wrong about it?
The fact he’s never reflected on the glaring failures in his analysis tells what we need to know about his intellectual integrity. There’s truth in some of his words about financial risk, but if you can’t acknowledge that there’s upside too, you can’t evaluate risk properly either.
I find it difficult to take him seriously.
Do you think it's not slowing? Do I miss anything really important?
My understanding is that we have now is incremental improvement on thinking models which appeared more than a year ago. Of course, a breakthrough might happen, but I don't see one yet.
https://unessays.substack.com/p/talk-is-cheap
EDIT: In fact, parent comment has a link to some numbers.
[EDIT: Most] people don't want to go through the numbers. Ok. But there's a history here. When people don't want to see the numbers, certain kinds of things tend to happen.
Code acceleration is great, but.... something precedes that. Vision and strategy re. expansion of offerings and businesses. Once a firm reaches maturity in what it offers and is only touching the edges - this code acceleration is literally useless when you factor in all of the trade-offs.
This is a good thing - it means fat and slow incumbents are sitting ducks to be out-witted by creative and imaginative founders, which is healthy for a well-functioning economy.
Now the economics of existing frontier models are not sustainable - its looking like a mix of the airline (supersonic vs subsonic) and EV industry with China in the background providing decent offerings at much lower prices.
However, most of the engineers I respect have gone from being skeptics a year ago to convinced today. I don’t personally know any true holdouts any more. If there are studies that disprove productivity gains more than six months ago, I’m happy to believe that it was true of the AIs that were available at the time. But I’m going to need something much more recent before I disbelieve my lyin’ eyes where it pertains to the AIs available today.
Here is the report:
https://www.faros.ai/blog/ai-acceleration-whiplash-takeaways
And my commentary:
https://unessays.substack.com/p/talk-is-cheap
It's not that the utility of it put in question. What is however a giant question mark is how the heck any of the big AI companies are ever gonna get that ROI? Given how many of us are becoming more and more fine with local models that run just fine especially on a good enough computer which most developers have anyway...
Why should someone pick Opus 4.8 when Qwen3.7 Plus produces similar results for about 1/20th the cost.
That sort of pricing disparity is across the board. But further it's becoming more and more apparent that they are doing more with less parameters. That's what's giving the local models their super powers.
This, combined with his extreme ignorance, makes him unreadable. The only reason people read his stuff is because it validates and confirms their own anti-AI beliefs. It's why every time he publishes an article, it reaches the front page in an hour or less.
Extreme ignorance?
How are they undeniable? They're very deniable. One example is the (seemingly) increasing maintenance costs for AI-generated code[1]. Another is the cost incurred by everybody reading AI slop instead of actual communication.
I don't have hard data as to whether these cancel out the benefits, but it's not as rosy as some seem to think.
[1] After years of people understanding that LOC is not only a poor productivity metric but also a negative indicator of code quality (shorter code for the same thing is better), we now have people touting how many LOC their LLM agent is generating. It's like everyone forgot what LOC actually represents and what it means for long term maintenance costs.
Which LLM company are you affiliated with/evangelizing for? Please disclose.
No.
> Yes, his macro analysis correctly identifies a massive financial risk but
So, he's right.
> but his incessant pessimism
Realism. You already said he's right.
> the incredible ground-level utility that many of us on HN celebrate every day through undeniable, massive productivity gains.
No.
> At this point I'm trying to believe
Why are you trying to believe at all when things are so great? I thought you were busy celebrating all that undeniable, massive productivity gains.
No, he's not, he's making tons of money every month from his Substack subscriptions. In fact, the AI bubble popping would be the worse thing ever for him, he would be out of a job.
Just like the who have predicated the US dollar will collapse any-moment-now and which pushed gold for decades.
Funny how people always say "oh, you are an AI lab, of course you are going to hype AI", but never "oh, you make sooo much money from predicting the collapse of the AI bubble..."
But from the article I linked back in March 2024:
"Generative AI models are expensive and compute-intensive without providing obvious, tangible mass-market use cases. Murati and Altman's futures depend heavily on keeping the world believing that development and improvement of their models' capabilities will continue a rapacious pace of progress that has unquestionably slowed, with OpenAI admitting that GPT-4 may be worse on some tasks.
As I've written before, hallucinations are a feature not a bug. These models do not "know" anything. They are mathematical behemoths generating a best guess based on training data and labeling, and thus do not "know" what you are asking it to do. You simply cannot fix them. Hallucinations are not going away."
Since then:
- hallucinations are dramatically less of a problem
- several mass market use cases have emerged, most notably coding
- rate of progress has increased
> I believe that artificial intelligence has three quarters to prove itself before the apocalypse comes, and when it does, it will be that much worse, savaging the revenues of the biggest companies in tech. Once usage drops, so will the remarkable amounts of revenue that have flowed into big tech, and so will acres of data centers sit unused, the cloud equivalent of the massive overhiring we saw in post-lockdown Silicon Valley.
We have seen 8 quarters since. Has any of that come to pass?
Instantly close the tab as soon as the popup to subscribe to his newsletter pops up.
BTW, one thing for sure he is right about are the economics, as of today there is no way these massive investments are gone be paid.
Coding seems to be one of the core use-cases for LLMs (as Simon Willison pointed out recently) and even if that's the only real use-case for LLMs, they're wildly useful. I do understand that useful != profitable and that's where I think Ed has a real point: until inference becomes much cheaper these companies cannot be profitable. Some mega-players will pay the API token price, but most will not.
If the AI companies need $X billion in revenue to stay afloat, it doesn't matter if 0.5% or 5% or 50% of that revenue is from transforming the State of the Art. It's 100% irrelevant: what matters is that, transformation or no, these companies won't have the income to pay their bills. And if they can't pay their bills, a whole lot of other companies can't either.
So again, transformation or no, it's still a house of cards waiting to collapse. The only thing that would change that is not more "transformation" ... it's a feature set that lets them multiply their current user base (or multiply how much they charge them) several times over.
I find it quite refreshing in some ways. Lots of people, when they start complaining about this or that aspect of this AI stuff, are wont to add in a little disclaimer that, despite all of the above, they actually really like AI and use it all the time. I assume this is to avoid the scenario of a bunch of pragmatic builders turning up and calmly shipping nuance in the comments (or whatever you call it these days when you get brigaded by a pile of angry keyboard warriors with chips on their shoulder) - and it sure is tiring having to wade through the equivocation.
That's a criticism that'd be hard to level at Zitron! Say what you like about the man, but he's unafraid to appear to take a side.
This is often repeated but comes from ignorance mostly. You have * zero * reason to believe inference is costly other than just vibes. If you go by data and intuitions - the margins are high.
This kind of thinking really reinforces my belief that people have no idea and are using this whole [AI is not profitable and too costly] thing as a cathartic way to deal with immense progress.
Of course that mentality is obsolete. Now we all have infinite access to perfectly correct information via the internet.
He's in the media business... its in his interest to amp things up.
I’m not attacking the piece. I’m not saying it’s right. I’m not saying it’s wrong.
What I’m saying is, the tone made it hard for me to judge the arguments fairly, despite finding some of them convincing. And as much as I dislike it, persuasion does partly depend on how an argument is made.
Does the truth normally lie somewhere in the middle of it all?
Usually does when you decide what constitutes extreme.
The current wave of AI unlocked language - the tools are now speaking and understanding. This, on its own, is astonishing progress. Language is the foundation of our culture and society; it is the very technology that got us, as a species, to where we are today. To have tools that can understand, manipulate, and produce it is a massive leap forward.
Once you see things that way, it is clear that we are not in a bubble; we are in a transition. Yes, there is tons of hype and over-investment, but the demand is real, and so is the impact. Unless you are deep in the tech and have that structural depth, it is easy to dismiss. This is like the invention of the personal computer, but with 100x the impact and speed.
The business model does appear to be viable for these labs. But that viability comes because they aren't wasting a bunch of R&D money developing worthless products like AI video production.
Regarding your comment about the business model—the people in Silicon Valley are not stupid. They know the playbook; we've seen it with social networks. The issue isn't the business model itself; it's that these companies need to dominate the market, and the big players are competing for that on a global scale. It's the exact same playbook that played out in financial systems and social networks, and now it's happening with AI. Once these technologies are deeply integrated into enterprises and the global economy, these players will dominate the market for decades to come.
I can assure you, the people running those companies are smarter than you, me, and the author of this article."
If I were to make a prediction, it's that ultimately these cheaper models are going end up eating their lunch. I don't think they'll make back the money they've invested and once that reality hits investors, those two companies are sunk.
That, however, is not the end of AI. Nor will it be the end of Nvidia/micron/etc. It will more just be a localized bubble pop that doesn't eliminate the product from the market.
These models are building deep integrations into companies and the entire economy. Once that stabilizes, it will be like the electricity grid—pumping tokens to fuel decision-making across the entire global society. Good luck unplugging from that.
Furthermore, there is a massive geopolitical aspect to it: those who are already on the Western financial and technical stack will get integrated even deeper now.
Much like the electric grid, what we are seeing is a convergence on standard APIs. For example, most of these cheaper models are hosted using APIs compatible with OpenAI. It's not a matter of rewiring your electric plug to work with a different socket standard, instead it's just the process of plugging it into a new socket.
> Furthermore, there is a massive geopolitical aspect to it: those who are already on the Western financial and technical stack will get integrated even deeper now.
Certainly the Chinese models appear to be some of the best when it comes to competition, but they aren't the only ones. There are European models and other US based models which all run for cheaper.
The biggest competitors aren't small models, they are just the traditional players that already have an "in" with enterprises. That I think will start to show its face once this initial round of buildout is complete, which may not be for another 5+ years.
I disagree. Mainly because those small models are exactly what erode away the moat of needing a giant data center. Those smaller models have been proving themselves to not be far of from the SOTA models.
As OpenAI and Anthropic look to raise their prices, businesses will be much more compelled to looking at cheaper models. And if the narrative is "do the same as you did with OpenAI at 1/20th the cost" that's going to sell to a lot of businesses.
It certainly cuts into what exactly these companies can sell in general. For example, if I wanted to integrate AI into a product I'd almost certainly not chose OpenAI or Anthropic. That's because they are simply way too expensive and what they'd give me is a lot less. We've actually ran into just this. We needed a classifier for a lot of records, we picked a free model because, as you can imagine, we didn't need something as good as what OpenAI and Anthopic offered and free works.
This is fire erasure
/s
Edit:
> If you’re wondering what the story is, [...] I expect it to be out in the next two weeks [...] I can guarantee you it’ll be worth it, and you’ll be stunned by what I report.
Ok, this takes clickbait to new lows. The headline is trying to sell the teaser here, with very limited meat in the middle of the sandwich.
That said, I think his voice is useful as a counter to the mainstream opinion.
Given the amount of investments, approaching AI from the angle of economics seems correct.
We all have some level of personal experience using AI/LLMs, both chatbots and coding tools, and I personally enjoy using them, but I am sure this experience is relevant in this discussion.
I also enjoy luxury hotels, gourmet food, jet skis and helicopters, but this is not something I indulge in often because of the cost-utility ratio.
The real cost of AI may or may not be lower than its utility. The bet is that utility is increasing while cost is falling.
Maybe AI is different. Certainly, the level scale of investment is on a different order of magnitude. But I'm wary of believing anything about the financial impossibility of AI being sustainable when I've seen such similarly confident arguments proved wrong in the past.
It's a pretty classic business strategy, and not directly comparable to any of the AI companies. There's a reason people compare the current situation to the dotcom era and not Uber. Also, don't take Uber as an example of a slam-dunk VC success story and leave it at that -- plenty of dumb ideas get pitched and funded and go bankrupt for every Uber.
It was only because Uber successfully bulldozed over all regulations that it was able to succeed ... and that was hard to predict before it happened.
The angry polemic that goes on and on and on with cuss words used liberally is just meant to evoke emotion and cathartic resolution to the type of people mentioned above. Not truth.
The thing is, there are a lot of people that find comfort in what he’s writing - primarily because it’s a coping mechanism against how quickly things are moving and a way to deal with being left behind. When you spend time, years, building institutional knowledge and making a whole identity out of it, you obviously will feel bad with the threat of it being commoditised.
I would write against the content of the article but I find it easier and more illuminating to write what he has said before instead. Then it shows how incorrect the guy has been and with what confidence he keeps speaking with.
-------
> While complex, generative AI is a technology that probabilistically generates answers, and has no "intelligence." It is inherently limited by its architecture, and in turn can only get "better" in a linear fashion. I see no signs that the transformer-based architecture can do significantly more than it currently does.
He wrote this in 2024 before reasoning models came out. Remember how ChatGPT was in 2024? Do you think this person is someone who gets predictions right?
> Furthermore, I hypothesize a race to the bottom in generative AI will significantly hamper OpenAI's ability to expand revenue, compounded by the fact that we're approaching the limits of transformer-based architecture.
He wrote this in 2024 and since then Anthropic's revenue increased by 160x to $40 B dollars a year and OpenAI's increased by 6x. Do you think this person gets predictions right still?
> I believe we're reaching the upper limits about what generative AI can do and how accurate its outputs can be,
He wrote this in 2024, do you really think we have reached upper limits? Huh?? What I'm using today is significantly more accurate and 2 tiers above what we had.
> And if there are true industry-changing possibilities waiting for us on the other side, I am yet to hear them outside of the fan fiction of Silicon Valley hucksters.
He says this about AI when we have with all honesty have had industry changing possibilities like agentic coding.
> There are indications that consumers have also lost interest. As pointed out by Alex Kantrowitz’ Big Technology newsletter, traffic to ChatGPT on both mobile and web has started to stagnate, if not decline. In January 2024, ChatGPT had 1.6 billion visits — 11% below the all-time peak of 1.8 billion. This makes it only modestly more popular than Bing, which had 1.3 billion unique visits during that period. On the mobile front, ChatGPT has an estimated 6.3 million US users — or 1.7 times less than the total of new Snapchat users added during Q4 2023.
He agrees with the claim that the consumer interest has declined. Since he said this, there was a 9x growth in active users.
-----
https://www.youtube.com/watch?v=_wStScmT748&t=1s
"AI Bubble Already Bursting?" (8 months back)
https://www.youtube.com/watch?v=T8ByoAt5gCA&t=1s
"A.I bubble is bursting with Ed Zitron" (1 year back)
He's been constantly crying bubble for years now.
-----
> AI video won’t get truly fixed just by waiting a year.
This is what he had said in 2024, and you just need to compare video from then and now to check whether the predictions came true. Why would anyone trust what this guy has to say?
It's like someone arguing that cheese isn't real. Yes I can go to the grocery store and take a picture of cheese and show it, but what's the point? They can live in their own world. It doesn't change any of our lives. The world is what it is.
And for those who are all "but dur CCP get all ur data" you can use things like AWS Bedrock (at least for earlier versions of Deepseek and Qwen for now) and have more familiar people get all your data. Or buy (at obnoxiously inflated prices) your own HW and not send your data to anyone.
The funniest part of this is that people are often talking about how LLMs are now writing 100% of their code, then also saying that they don't want to expose their code to foreign government exfiltration by using foreign models.
But, uh, if an LLM is writing 100% of your code you have no actual secret sauce to hide from anyone, so why worry about it.
Meanwhile, like I think you suggest, I would assume everyone can generate similar outputs themselves. The idea that you can claim priority on your dream prompt and lock up the market on prompt responses sounds delusional to me. It's not novel invention when you're spit-balling at the same level of abstraction as every fantasy/scifi writer who ever was.
So I also have doubts about the sustainable business model. How long will it take for this fantasy to unravel, as people discover they cannot monetize their AI outputs as much as they dreamed, and in turn cannot afford to pay the AI services they use?
My absolute nightmare is that this becomes a "too big to fail" thing and oppressive/fascist governments decide to back full regulatory capture. That instead of letting it unwind, they grant and support enforcement of an increasingly absurd and arbitrary copyright/patent regime to support this monetization scheme.
> It's like someone arguing that cheese isn't real
I agree with your first statement (any being you) because of your second statement.