I don’t know how OpenAI screwed this up. They had the best tech, the largest installed base, the best brand recognition.
And somehow instead of prosecuting the lead in all areas, they got all hubristic and sloppy and just failed to iterate on the core product, while also failing to respond quickly when Anthropic showed that coding agents are the flywheel that makes the whole company faster.
It’s like they thought they had an unassailable monopoly and speedran to the lazy incumbent position, all in a matter of months.
Anecdotally, I would actually argue the opposite - Anthropic is overrated, ass-kissed way too much here for mediocre coding abilities (especially for Elixir). ChatGPT one-shots complex solutions most of the time in comparison. The only reason people shit on OpenAI so much is the defence deal, but it's not like Anthropic is a saint either: https://www.cnbc.com/2026/02/12/anthropic-gives-20-million-t...
OpenAI is fine with those as long as they are "legal"... So pretty much they don't care at all.
I agree Anthropic is no saint but it's much, much better than OpenAI.
Why pick elixir specifically here? I’m using opus/sonnet via Claude code for a moderately complex personal project built on phoenix and have had a good experience
Claude is good, I'm definitely not saying it's bad. But if you work with LiveView, it tends to choose complexity over simplicity. Weirdly enough, I have a feeling it's trained more on Python/Ruby-style (object-oriented) code than functional code, so it tries to get things done in a less functional way.
Yeah, I've been building a fairly complex app with Claude and it has been great. The backend stack is a Go service, with a TS front end and a solver running or-tools in Python.
I do think I do a good job of being very structured about breaking down my requirements and acceptance criteria (thanks to dual lives as a DevOps/SRE guy and then PM): extensive unit testing, discipline in the use of sessions and memories, and asking it to think of questions it should be asking me before even formulating a plan.
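For what it's worth, the "questions before the plan" step is easy to script. A minimal sketch, assuming an OpenAI-compatible chat API; the model name and the requirements text are made up for illustration, not the commenter's actual setup:

```python
# Minimal sketch: make the model surface its clarifying questions
# before it is allowed to propose a plan. Model name and requirements
# below are illustrative assumptions.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

requirements = """\
Feature: nightly export of audit logs to object storage.
Acceptance criteria:
  1. Runs at 02:00 UTC on a schedule.
  2. Exports only records changed in the last 24 hours.
  3. Fails loudly (non-zero exit, alert) on partial upload.
"""

# Step 1: no plan yet -- just surface the unknowns.
resp = client.chat.completions.create(
    model="gpt-5.4",  # placeholder; any chat model works here
    messages=[{
        "role": "user",
        "content": requirements
        + "\nBefore proposing any plan, list the questions you would "
          "need answered. Do not write a plan or code yet.",
    }],
)
print(resp.choices[0].message.content)
```

You answer those questions in a follow-up turn, and only then ask for the plan.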
Claude Code is IMO the benchmark today. For all of the various contexts I've used it in, it has mostly one-shot the tasks I've given it, and it is very user friendly for someone who is not a professional software engineer. To the extent it fails, I can usually figure out quickly why and correct it at a high level.
I think this is where we might have differing opinions. I'm a CTO by profession and I know what bad code is, so it is quite easy for me, based on my professional experience, to point out when Claude generates bad code. And when you point it out, or ask it why it didn't take the correct/simpler approach - the response is always along the lines of "Oops, sorry!" or "You're absolutely right to question that..."
I think Codex is a better fit for professional software engineers. It's able to one-shot larger, more complex tasks than Claude and also does better context management which is really important in a large codebase.
On the other hand, I think Claude is more friendly/readable and also still better at producing out-of-the-box nice looking frontend.
Aside from the fabricated drama and the trend chasing, OpenAI still has the best overall model and API service. Anthropic is really good, no doubt. But gpt-5.4 is a better model than even Opus, even if it's a marginal advantage. I use both.
I dunno, my experience mirrors the parent poster's: we use opus for all our coding, but gpt 5.4 for all of our enterprise agentic work via api (a much bigger volume of tokens). It just seems to be more optimized for this.
Sam lost the plot for me. He took too many interviews which led me to not trust him. Last straw came with him standing by Anthropic one day then throwing them under the bus the next. He showed little awareness on why that is problematic.
That's why I changed as well. I got really irritated by how Altman tried to get social credit for having principles, only to change them the moment it was convenient.
On podcasts his attitude is basically “oh yeah all of you are basically fucked our products will take everyone’s jobs in a couple years.”
Altman is a lot more coy and comes across as saying what’s politically expedient at any given point in time.
> On podcasts his attitude is basically “oh yeah all of you are basically fucked our products will take everyone’s jobs in a couple years.”
I also appreciate his honesty, and don't really understand why the others don't emulate it because there's no cost to them to be honest. At every level of society we've decided to stick our heads in the sand and pretend like this very large tsunami isn't racing toward the coast, so as someone producing this technology you can be honest (and mostly ignored by people in denial), or be cagey and mistrusted (like Sam Altman).
Because it isn’t honest, it is investor hype that these frontier labs need people to believe despite obviously hitting the sublinear part of the improvement curve.
“It’s so dangerous, we’ve reached AGI, we just have to release models that are obviously incapable of abstraction for your safety”
> Sam lost the plot for me. He took too many interviews which led me to not trust him. Last straw came with him standing by Anthropic one day then throwing them under the bus the next. He showed little awareness on why that is problematic.
It should have become clear to all that he was an untrustworthy person when he was fired from OpenAI by its then-board. My understanding is their complaint was he was lying, untrustworthy, and manipulative; and enough stories came out at the time to confirm that.
It's clearly because they didn't hire me after I applied :)
In all seriousness, I use Codex for work and Claude at home, and I feel like nowadays they're actually pretty competitive with each other. I don't know that it's that far behind.
I agree that they clearly, and erroneously, assumed no one would be able to catch up with them, though. OpenAI had such a head start that it should have been a moat by itself.
Just like OpenAI's original moat, I don't think that's particularly durable. I've already seen plenty of people swing back to preferring codex, and it'll probably swap again with the next model drop. Openclaw is potentially better integrated with ChatGPT at this point because of the explicit subscription support.
Also not like it’s a particularly good piece of tech. It was the first to show a new category. But jeebus the design and security are a nightmare. Any of the numerous other claws are better choices for anything serious.
Yeah. In my opinion the value provided for $20 is better. I wonder how much of Anthropic's value is the hype around Claude Code coming from every snake oil salesman promoting Claude Code as the best thing to use with OpenClaw to summarize your emails.
Coding assistants won't win this game. They sure will win the hearts of developers, but to scale you need mass adoption and products for which users want to pay substantially. OpenAI is falling behind on the small features in their chat and app offering and has failed to innovate in their expensive offerings.
Codex btw is getting very competitive. It is fast and no longer far behind.
The strategic playbook of the web era said: Get a huge userbase of normies, then figure out how to monetise them (usually via advertising). OpenAI stumbled into the userbase via ChatGPT, but it's unclear if the strategy or the economics apply to AI. Anthropic tried to compete in the consumer market, but couldn't, so focussed on coding and enterprise, and it looks like that's actually turning into a smart choice, at least right now, because it turns out people will pay subscription costs for agents that do their job for them.
There are three possible paths that sort of substantiate current valuations:
1) Business: LLMs become essential to every company, and you become rich by selling the best enterprise tools to everyone.
2) Consumer: LLMs cannibalize search and a good chunk of the internet, so people end up interacting with your AI assistant instead of opening any websites. You start serving ads and take Google's lunch.
3) Superhuman AGI: you beat everyone else to the punch to build a life form superior to humans, this doesn't end up in a disaster, and you then steal underpants, ???, profit.
Anthropic is clearly betting on #1. Google decided to beat everyone else to #2, and they can probably do it better and more cheaply than others because of their existing infra and the way they're plugged into people's digital lives. And OpenAI... I guess banked on #3 and this is perhaps looking less certain now?
Any publicly available evidence to back that up? There have been post-exit blog posts from OpenAI employees on HN before and it did sound like the only black magic they use there is that many employees work 16 hrs a day during launch of new features. I know that some current Claude Code devs are doing interviews where they claim that they use Claude Code extensively but they clearly have a conflict of interest while they are still employed at Anthropic, so it would be like asking a barber if you need a haircut.
Is there any equivalent for turning an LLM into a sticky platform? Right now it seems pretty easy to use harnesses and tooling across models, including open weight and locally running ones.
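That portability is real in practice: most providers and local runners expose OpenAI-compatible endpoints, so swapping models is often just a base_url/model change. A hedged sketch; the localhost URL assumes something like Ollama is running, and the model names are placeholders:

```python
# Sketch of model portability: the same client code pointed at two
# OpenAI-compatible backends. URLs and model names are illustrative.
from openai import OpenAI

backends = {
    "hosted": (OpenAI(), "gpt-5.4"),  # api.openai.com, key from env
    "local": (OpenAI(base_url="http://localhost:11434/v1",  # e.g. Ollama
                     api_key="unused"),  # local runners ignore the key
              "qwen3.5"),
}

for name, (client, model) in backends.items():
    reply = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": "Say hi in five words."}],
    )
    print(name, reply.choices[0].message.content)
```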
Despite what the folks here like to believe about themselves, I think the reality is we are as attuned to what is in fashion and on trend as everyone else, just about different stuff. Last year it was ChatGPT, this year Claude is the new hotness. Things move so fast we barely have time to form our own opinions, so we fall back on what we read or hear from others. In 12 months who knows what it will be... Gemini? ¯\_(ツ)_/¯
Long term, my feeling is Anthropic's focus on enterprise is the most obviously lucrative but also least defensible application of LLMs. If (more likely when) open source models reach the point of being "good enough" then it's a race to the bottom on pricing. Maybe it will be like AWS vs GCP et al, but I kinda doubt it.
Investors do not care about the product, the users, etc. They care about cash. There are lots of ways to make cash that don't involve having a good product. But if you commit to spending a trillion dollars on hardware, then borrow hundreds of billions in the short term, and it turns out there's no way to recoup the cost, the investors go looking for better returns. This would've worked back in the old days of a bull market, angels looking for the next whale (with "modest" $5BN investments), and startups with no rivals. But in a bear market with multiple competitors trading on a commodity? Lol. Finally the bubble bursts.
Classic SV hubris. Talk to OpenAI people and they’re so convinced they’re untouchable, they don’t bother worrying about things like revenue, or product strategy. All they cared about was being the first to AGI. Well it looks like that isn’t happening soon enough. And now they have zero moat except brand recognition, which is quickly getting eroded.
I've finished (as in: it's done, it works, and I may never need to change it again) entire projects with ChatGPT and Codex. Sometimes it takes a lot of hand-holding to get there, but it does get there and (with the exception of 4o) it's been improving since the beginning.
In contrast: I can't even get Gemini Pro to give me any answers to the most primitive questions that aren't caked in prima facie lies without at least 4 interactions, in any context, ever. The output is consistently and ridiculously garish with its incessant self-contradictions. It seems to be impossible to actually get anywhere with it.
What am I doing wrong here?
Some of that, sure. But realistically, a lot of people just don't want to pay for every frontier model provider out there as they're released. Not just money, but also time spent trying them out. (Recommend people at least try out their multimodal model.)
It doesn't help that Google offers a bunch of confusing plans in multiple places. I ended up just pasting all their AI plan URLs, at least that I could find, into Claude so I didn't have to figure it out.
I think Antigravity w/Gemini is a great product; it's been super useful on a bunch of my hobby projects. It's especially wonderful when writing firmware and needing to add support for a new chip. I can point it at a PDF datasheet and it'll do a much better job of reading it and parsing out all of the register fields than anything else. Saves me enormous amounts of time.
> The large gap between OpenAI’s $852-billion valuation and Anthropic’s $380 billion has investors rushing to grab equity in the latter before it rises, according to Augment co-founder Adam Crawley.
Interesting, so there are a lot of people still eager to invest at valuations well above a quarter-trillion, but OpenAI's latest raise has sucked up all the oxygen for enthusiasm about that valuation going even higher.
Which could be a "dumb money" move ("competitor number lower, already-big-number is scary") or a "smart money" move ("Anthropic is gaining position-wise, and currently is lower valued, let's bet on the one we think is better positioned") or some mix of both.
OpenAI just raised a shit-ton, so clearly there is plenty of money out there that doesn't think there's a bubble or even a blown opportunity there. But the wider community doesn't think they have the competition in the bag, while still being willing to invest in big-AI-cos at absolutely enormous valuations.
If local hardware/models get good enough to take 80%-90% of what people use subscriptions for today... hoo boy. Big-AI is a bet I wouldn't be confident placing billions on. Unless your horizon is more "wait for IPO or next raise or positive news, then get out ASAP" than "hold for 5+ years."
Did OpenAI really “raise” that much? The startup world is not my area of expertise, but I remember reading language in the announcements that implied those dollar amounts were more of a conditional promise of money in the future than a check today.
Yeah, if they sold 1/800th of the company for a billion dollars then they are valued at $800B even if they only have a billion dollars. It's advantageous for investors to buy in as cheaply as possible but also have future investors buy in as expensively as possible, to prop up a, perhaps inaccurate, valuation.
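The arithmetic, spelled out with the toy numbers from that example:

```python
# Implied post-money valuation from a small stake sale (toy numbers).
raised = 1_000_000_000   # a $1B check
fraction_sold = 1 / 800  # stake the investor received
implied_valuation = raised / fraction_sold
print(f"${implied_valuation:,.0f}")  # $800,000,000,000
```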
It's raised in the sense that some people made a pinky promise to give them cash. But those people also don't have the money and have to raise it from other places. It's largely SoftBank, Oracle, Microsoft and Nvidia, all of whom don't have big piggybanks full of hundreds of billions. They ask for loans based on the promise of making cash to pay for it, and that cash is based on people wanting to use OpenAI. So it's kind of a big financial circle jerk. (Debt, SPVs, loans from Nvidia (at high interest rates), etc)
> It's largely SoftBank, Oracle, Microsoft and Nvidia, all of whom don't have big piggybanks full of hundreds of billions.
Actually SoftBank, Microsoft and Nvidia literally have free cash sitting there.
NVIDIA for example had over $60B in audited, reported free cash flow in 2025[1]
> loans from Nvidia (at high interest rates),
Is this just something you are making up?
"NVIDIA intends to invest up to $100 billion in OpenAI as the new NVIDIA systems are deployed. The first phase is targeted to come online in the second half of 2026 using the NVIDIA Vera Rubin platform."[2]
The closest thing to what you are saying is reporting that NVIDIA has discussed guaranteeing some of the loans OpenAI is taking to build data centers:
"Nvidia is discussing guaranteeing some of the loans that OpenAI is planning to take out in order to build its own data centers, The Wall Street Journal reported, citing people familiar with the matter."[3]
This of course is the opposite of NVIDIA loaning OpenAI money - if they did this they would be liable for OpenAI's debts.
[1] https://nvidianews.nvidia.com/news/nvidia-announces-financia...
[2] https://nvidianews.nvidia.com/news/openai-and-nvidia-announc...
[3] https://archive.is/Gpvq2#selection-1299.0-1301.181
I am playing around with this at home right now. I think a lot of the latest improvements came from the harness rather than the model.
The part I am working on is having better tools and data to search over, curated for my needs. Similar to the Karpathy post yesterday about his wiki. I am trying something similar, and even Qwen 3.5 is totally fine for most of what I do.
Disclaimer: I bought memory before the crisis started. Not sure I would build my PC as-is now.
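"Curated data to search over" can start very small. A minimal sketch, assuming a folder of markdown notes; the ~/wiki path and the term-overlap scoring are illustrative assumptions, not how the commenter actually does it:

```python
# Naive term-overlap search over a local wiki of markdown notes.
# Good enough as a tool for a small local model to call.
from pathlib import Path

def search_notes(query: str, root: str = "~/wiki", top_k: int = 5):
    terms = set(query.lower().split())
    scored = []
    for path in Path(root).expanduser().rglob("*.md"):
        words = set(path.read_text(errors="ignore").lower().split())
        hits = len(terms & words)  # how many query terms appear
        if hits:
            scored.append((hits, str(path)))
    scored.sort(reverse=True)
    return [p for _, p in scored[:top_k]]

if __name__ == "__main__":
    print(search_notes("qwen local inference"))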
Vibe coding requires the sota models to work at all, but someone who knows what they are doing and uses the AI more responsibly can absolutely use the cheaper Chinese models for coding, and they’re often faster too. If I was one of the big players my entire focus would be on lobbying for regulation and outright banning of local models.
Yeah, Qwen3 coder for Claude Code and 3.5 for OpenClaw replaced my full-stack use of Opus 4.6 already; it's fine for basic web apps, k8s/docker infra setup, optimizing AI models etc. with only slightly higher error rate than Opus. Upcoming 3.6 together with Gemma4 might make it even better (still to test). OpenAI's memory spot market play might have been directed at local inference as well.
Look for Deepseek 4 when it drops, I’m curious how good it will be.
The thing is, if you’re using AI responsibly today you’re already breaking down tasks to such a granular level that you don’t need the power of Opus. You can save that for deeper research tasks.
Based on the current DeepSeek website I suspect it's not going to be great as their current model (V3.4? V4-mini?) often forgets or changes facts explicitly mentioned in the conversation which R1 never did. It's better than R1 at math or coding, but nearly unusable for deep conversation. I suspect they pushed MLA or linear attention too much, or quantize a lot more than before.
Anthropic is not meaningfully better. Their stance is “the good guys have to make money to be in the fight with the bad guys” and so they do all the things their perceived bad guys do. I don’t know how they can do any different, but we just trust them to be good? What is the difference?
I just appreciate the honesty of Amodei telling us pretty much straight up we’re all fucked because the AIs are taking all the jobs in a couple years or less.
He's just another con-man. Fear of missing out is an amazing motivator for investors. The more he shouts doom, the more people with very deep pockets throw money at him. It's all evil.
My loose understanding as someone adjacent to the AI model space is that you have good models that are costly and cheap models that are decent, so a lot of the publicly visible fights where Claude and ChatGPT leapfrog each other is the companies doing cost-benefit of how much optimization to do on the models before your userbase revolts because the agent "used to be great and now kinda sucks".
As a small business owner whose team is entirely in Google Workspace (Drive, Gmail, Chat -- so inbuilt RAG right there), I wonder if Gemini will be the dark horse. As a user, Gemini's a distinct third in "AI smarts", but most business owners aren't power users who are gonna set up Codex or Claude Code to slurp up their work emails and internal docs/SOPs.
The article feels a touch clickbait-y since people love a good fight between the top players and OAI's lost a buncha public goodwill over the past year.
As a government-based person who has witnessed multiple states (not in the US) move all operations off GCP because Google doesn't address sovereign risk, local data and hosting privacy requirements, and contracts well at all, whereas Microsoft and AWS do, I doubt Gemini will have as large a dark horse moment as it could. Copilot Enterprise can span similar domains, and whilst it is very expensive per user it has the benefit of having existing contracts in place which it can bolt into.
Let me know when they scrap the data centers; I'd love to get some good deals on HVAC equipment. These companies cannot possibly make enough money when you can run something on your own computer that works mostly as well.
Both of these valuations are absolutely absurd. I guess Anthropic looks good in comparison, but I don't want to hold that bag.
The Chinese models are catching up in quality while being a fraction of the price. The market will speak: how many devices that contributed to this thread were made in the USA?
Sure you can argue the Chinese companies are heavily subsidized, but no major LLM lab is remotely close to making a profit this decade.
The latest Opus routinely tells me the latest GPT Pro responses are much better. The GPT Pro responses cost at least 10x more, and GPT Pro takes tens of minutes. So unless and until I'm needing and ready for a really expensive “math checker” it gets left alone.
I am not a finance pro, but is it not normal and expected for secondary equity markets to be hesitant about large unloads of pre-IPO company stock, even in cases where the company in question isn't in hot water like OpenAI is? Does this not say more about the failure of the investors themselves to balance their portfolios properly than about OpenAI itself?
Odd timing... Everything I've read about Claude the last several days suggests that its users are disappointed, even furious, at what's happened to its performance.
Same, mostly. However today in particular Claude can't do front-end development with any competence. Never seen this before. I think the rumor is they are rolling out a new model and have to divide their infra across the new model vs the current model.
I wonder how much of this is associated with Scam Altman's personal negative PR and Anthropic's recent PR wins.
I'm inclined to think there isn't much of an association because investors don't seem very concerned with morality, but I know about a dozen developers who either switched to or started using Claude in the past month or so, while not knowing anyone who uses Codex.
They want to see the CEO communicate a path to profitability. Anthropic has - purely by focusing.
OAI in contrast is all over the place and they haven’t shown they’ve learned.
Zuckerberg got punished for his metaverse nonsense and investors were correct to be skeptical and reflect that in the stock price. Altman thinks he’s a god and the rules don’t apply to him. More fool him.
> Zuckerberg got punished for his metaverse nonsense and investors were correct to be skeptical and reflect that in the stock price.
Did they? It's been a little less than 5 years since Zuckerberg announced the all-in plan on the metaverse. In that 5-year period, Meta's stock price has gone up by 84%... which is middle of the pack for MAGMA, less than Apple (91%) and Google (157%), but above Amazon (24%) and Microsoft (47%). It's also comfortably above the S&P 500 (59%).
Not to mention the drama that's followed the company from the very beginning. I think it's getting to a point where the character issues can no longer be ignored, because they're directly affecting business.
I think the IPO and subsequent quarterly earnings where they will be pressured by analysts will pop it all.
I was watching a recent Jensen Huang Q&A with analysts and it was essentially “just trust me bro”. They’re all interconnected - once there’s a correction for one player, all get affected.
Can’t wait to get this over with so we can finally move on.
The gap between hype and reality needs to be corrected.
Check dev spaces like Twitter and Discord and all anyone talks about is Claude Code, OpenClaw, Opus 4.6, etc.
The mindshare went to Anthropic.
Even now when I hear Codex I have to stop and think “oh yeah that’s OpenAI’s competitor to Claude Code.”
Token generation is the metric Jensen Huang keeps pushing to temper analysts, which also affects Nvidia’s future expected cash flows, of course.
If increasing the price causes that metric to drop, the whole narrative falls apart and fear will spread in the stock market.
They’re all racing very close to the edge. Some closer than others.
We’re talking on the level of Meta, Google, and probably more if they keep raising money.
They really went all in with hubris and they’re gonna get punished eventually.
5.4 Extra high >> Opus 4.6
I find that for human in the loop Gemini beats both.
But I’m not necessarily gushing about it online — vocal minority and all.
The Sam curse. Took down Bankman-Fried before him...
“We literally couldn’t find anyone in our pool of hundreds of institutional investors to take these shares”
This doesn’t bode well for an IPO. The market is smelling a stinker.
Get your popcorn ready for a mad scramble to salvage investments if indeed the shark has been jumped.
ChatGPT's chat quality has recently dropped hard. While Claude is pricier, it actually puts in the effort to think through complex tasks.
All the while, Chinese models are providing cheaper alternatives.