More

seviu · 2026-06-26T17:41:18 1782495678

Dont worry, chinese models will distill frontier ones, quite fast.

The excuse they give is borderline childish. I get the thing about slow rollout, make sure partners get to fix the bugs, etc...

But bad actors are hard working motivated entities with tens of thousand of fake ids, and american citizens working for them, for pennies.

All while the ones like or you sit at a crossfire which is borderline useless.

I cant wait to see what Qwen did with the massive distillation they made out of Opus 4.8 and Fable aka Mythos aka pretty sure they jailbroke it.

15155 · 2026-06-26T19:11:41 1782501101

This is nothing a few felony indictments can't fix.

bloppe · 2026-06-26T19:21:20 1782501680

Pretty sure Chinese police will not cooperate with a US indictment

15155 · 2026-06-26T19:22:51 1782501771

No, but the Americans facilitating their access sure as hell will.

Sammi · 2026-06-26T21:25:07 1782509107

You'll need to make all US customers provide personal IDs for access first. I'm not American, but I do often hear how attached Americans can be to their personal firearms and how against providing their personal ID they can be.

forshaper · 2026-06-26T22:34:23 1782513263

That was before 2020, I guess. Americans are happy to provide whatever now, afaict

15155 · 2026-06-26T23:59:54 1782518394

What do you think login.gov is for?

Also, "all" customers? No, only customers that access the restricted models.

seviu · 2026-06-26T17:30:54 1782495054

I am on the opposite camp. Open models are starting to perform better. GPT 5.5 keeps on messing things up.

On the contrary, pi + glm + DeepSeek… bliss.

Fable was a different kind of beast though. Rip.

square_usual · 2026-06-26T19:41:18 1782502878

Every time I use opus these days I go shut up... you are not fable.. Hard to imagine how just three days with it changed how I saw LLM use.

nolroz · 2026-06-27T02:03:51 1782525831

I'm reluctantly starting to feel grateful that I went camping right over the window that Fable was out.

ftkftk · 2026-06-26T21:02:56 1782507776

Same.

baq · 2026-06-26T18:17:23 1782497843

Yeah, Opus/GPT need multiple rounds of reviews from each other to get to clean auto review. Fable was like, it is done and indeed… crickets in bot comments. ‘No issues’ galore.

aaroninsf · 2026-06-26T20:49:36 1782506976

I wonder if this will hold as other models with different biases achieve parity.

arizen · 2026-06-26T17:57:52 1782496672

Ditto on GLM 5.2 + DeepSeek V4 Flash combo.

For most important work (complex, cross-domain inquiries etc.), I still rely on Codex GPT 5.5 though.

whalesalad · 2026-06-26T18:58:29 1782500309

GPT-5.5 has been really hard to beat imho. I've spent $$$ on Opus, Deepseek v4 Pro and recently started to dogfood GLM-5.2 (which is not bad) but I cannot really trust any of them (almost blind) like I can trust GPT-5.5. It gives me tremendous confidence. I cannot say the same for any of the others I mentioned.

baddash · 2026-06-26T20:44:31 1782506671

how much does your setup cost you? just curious

enraged_camel · 2026-06-26T18:08:05 1782497285

>> I am on the opposite camp. Open models are starting to perform better. GPT 5.5 keeps on messing things up.

I'm working in a 600k+ LoC codebase that has complex domain-specific logic and lots of moving parts. I find that Codex 5.5 is pretty good at surgical fixes, but does not go out of its way to explore and figure out what those surgical fixes might break. So I only use it to work on parts of the system that are pretty isolated from everything else so that risk of regression is small.

MitziMoto · 2026-06-27T02:27:28 1782527248

I'm trying not to be the "you're holding it wrong" guy, but ... have you just tried telling it to explore the codebase for things it might break?

seviu · 2026-06-26T17:09:46 1782493786

I ordered two in the September batch, which was way less expensive.

Jolla phones are fine. I have friends who use it every day. Happy to support them all the best I can.

—— Sent from my iPhone 17 Pro

seviu · 2026-06-26T07:10:07 1782457807

New qwen soon

seviu · 2026-06-25T18:03:59 1782410639

Impulse bought a Pro with 48Gb ram on a retailer with old prices

Was waiting for the next generation but I think I will sit it out

z2 · 2026-06-25T18:35:39 1782412539

Same here, reserved a 48GB M5 Pro shortly after seeing the news, and now I see the same retailer raised the price by over $1000. If they honor the sale, then this will be the most short term value I've gotten out of an HN submission ever.

seviu · 2026-06-25T19:14:11 1782414851

Same here. Buy now ask questions later. Pretty sure the shop where I bought it will happily cancel the order if I give the cancel order.

Oled laptop will have to wait a few years now.

stevenhuang · 2026-06-25T23:04:59 1782428699

Had a Pro 48GB in my Amazon cart at ~$4k CAD, now it's ~$1k higher

Bummer

seviu · 2026-06-22T10:04:52 1782122692

To be fair with Codex, you can use any harness you want with it. Access is not gatekeeper by a crappy full of slop electron app.

So just move to PI, or whatever.

Claude on the contrary, forces all plan users to use their horrible app, which, if you ever dared to use cowork, only once, will run a 2GB VM on app start, no f's given. at all.

Not justifying it. But if you use the official Codex app, thats on you. If you use the official Claude app, it's because you are forced to.

Sidenote unrelated to the post: since the Fable thing, and after serious thinking, I moved to open source models. I still have the basic OpenAI sub, but then easy lifting is now done elsewhere.

coldtea · 2026-06-22T13:27:27 1782134847

>if you ever dared to use cowork, only once, will run a 2GB VM on app start, no f's given. at all.

Of all the issues, this seems like the most tame. I mean, there are single Chrome tabs that can use 300MB or even 700MB. A 2GB VM for what is likely isolated local testing of scripts and commands or local lightweight first-level inference to help guide the main harness sounds reasonable.

thewebguyd · 2026-06-22T15:58:51 1782143931

Not being able to use my own harness on the subscription plan is my biggest gripe with Anthropic/Claude. For what I work on, I still get better results with Opus than I do with GPT5.5-codex, but damn do I hate that I either have to PAYG or I'm stuck using Claude Code.

drdexebtjl · 2026-06-22T14:52:12 1782139932

I haven’t ever tried Cowork, and Claude Desktop shipped a 10 GB VM image on the tiny internal storage of my Macbook.

No way to remove it without hacks like creating an empty, read-only file in its place.

Having this slop installed and automatically updating is a liability.

seviu · 2026-06-18T17:10:53 1781802653

I am member of the SP in Switzerland and I am pro nuclear.

I don’t know why we put people in political buckets. It’s good to disagree. I am probably the weird guy but so be it.

folkrav · 2026-06-22T23:01:36 1782169296

I think part of the answer is how much American culture permeates a lot of the online discourse - including their ultra-partisan politics.

There isn't a single party that represents what I believe in - some are just overall clos0er to than others.

seviu · 2026-06-17T17:21:32 1781716892

Anthropic employees are right, but maybe this is for good. It certainly has opened my eyes.

I can’t rely on using a technology that the US administration can ban at will.

IMO without getting into personal thoughts about how capable the current US administration is, last Friday move sent a very powerful signal to the industry.

Also I don’t think China releasing so many good models, capable to compete with Opus 4.8 and GPT 5.5, all at once, is a coincidence.

nickff · 2026-06-17T17:26:03 1781717163

Are you saying that you think the US government is unpredictable and arbitrary, but that the People’s Republic of China is not? Do you remember all the PRoC’s strange and sudden policy shifts (e.g. steel, real estate, education, football/soccer, etc.)?

It seems to me that in the case of AI (as with many other modern technologies), you rely on vendor/creator support and updates to stay relevant, so the ‘next’ model matters more than the current one, and we have no idea whose next model will be open (and whose won’t).

digitaltrees · 2026-06-17T17:29:21 1781717361

I interpret the comment as saying there are lots of viable models and it's now crystal clear personal cloud or local Ai is the only reliable path.

baq · 2026-06-17T17:28:56 1781717336

Not OP and I wholly agree, but you can’t dismiss the fact that they are releasing those weights. Their agenda is quite obviously to make Anthropic and OpenAI CFOs sweat bullets, but it isn’t our problem as AI consumers, right?

nickff · 2026-06-17T17:37:51 1781717871

Yes, I agree that it is possible that the 'open source model providers' are doing the equivalent of 'dumping' in an attempt to establish a dominant market position, or at least a foot-hold. I am generally a skeptic when it comes to the effectiveness of 'dumping' as a long-term strategy (as the producer tends to hemorrhage consumers when it increases prices), but some may see it as problematic.

fnordpiglet · 2026-06-17T17:28:22 1781717302

Open weight models don’t allow central oversight. That’s the difference.

nickff · 2026-06-17T17:29:56 1781717396

If you’re happy to use the current one forever, then yes. I was amending my comment above to address this when you posted yours.

fnordpiglet · 2026-06-18T02:04:38 1781748278

I think for many practical purposes the frontier open weight models are almost universally good enough for most things. There may be greater and greater frontiers but at q certain point it becomes like IQ. Having a 150 IQ doesn’t mean you’ll be more successful at any particular task over someone with a 125 IQ. Indeed there’s a diminishing return on intelligence on many utility functions where being more intelligent yields more be same or worse ultimate outcomes. It might very well be the person with a 150 IQ could understand some extraordinarily complex and esoteric concepts faster, but it doesn’t mean with more effort the 125 IQ person can’t either; and sometimes that extra time spent yields better outcomes overall.

I suspect AI will be somewhat similar where even if the linear scaling laws continue to hold the practical utility of a model flattens for almost all conceivable use cases.

In some ways I already feel this has begun to happen. The marginal utility of opus class models and fable has in my perception begun to flatten. While I can tell the differences they aren’t earth shattering. I could continue to use the present models for the rest of my life and be ludicrously more productive simply by adapting within their constraints through ever more sophisticated applications.

What holds back the open weights IMO is hardware scaling and industrial production. As the enormous transfer of wealth in debt and equity markets unfolds with semiconductor and adjacent companies and the corresponding capital investments are made, and the eventual bubble pop leading to over capacity and market flooding, as well as advances in technology, math, techniques, and efficiencies, will make very large open weight models more directly attainable. This will also lead to chimera models that MOE very large models to get very close to the 1-2T parameter dense models, at which point I suspect utility for almost all uses is nearly fully saturated.

There will be areas where more capable models are needed but they will be frontier models on frontier problems. This, IMO, is inevitable, and without some criminalization of weights (see the attempts to criminalize encryption algorithms in the 20th century and all the wonderful tshirts that emerged). It’ll be harder to print a trillion parameter model on a shirt but I’m sure someone will try, as will governments try to keep us in our boxes slaving for food coupons and basic rights like health care.

thatmf · 2026-06-17T17:36:50 1781717810

> Are you saying that you think the US government is unpredictable and arbitrary, but that the People’s Republic of China is not?

Why not both?

That seems the crux of the state we're currently in; what daylight there was between the two is quickly fading.

spelk · 2026-06-17T17:30:17 1781717417

>Do you remember all the PRoC’s strange and sudden policy shifts (e.g. steel, real estate, education, football/soccer, etc.)?

I didn't realize I could download a Shanghai apartment.

seviu · 2026-06-17T17:36:58 1781717818

Right now I rely on whoever that is opensourcing the models.

I wholeheartedly agree with what you said about China.

But I can’t shrug off the fact that fable was taken down within minutes for reasons that are childish and petty.

I am sorry but I can’t use any US AI if I don’t have the guarantee that I will be able to use it tomorrow.

And Trump showed us he is willing to take it out whenever he wants.

An opensource model on the contrary, I can host myself, or use a miriad of providers, mostly non chinese.

jmaw · 2026-06-17T21:13:21 1781730801

> I am sorry but I can’t use any US AI if I don’t have the guarantee that I will be able to use it tomorrow.

To be fair this is every commercial model. We have already seen GHCP increase prices by anywhere from 10-100x (depending on usage). And old models get retired all the time. While these are not exactly the same as a cutting edge model being shut down, increasing prices a super high amount leads to effectively the same outcome.

UncleOxidant · 2026-06-17T18:03:13 1781719393

> And Trump showed us he is willing to take it out whenever he wants.

Yes, the actions of this administration on Friday should have sent shockwaves through the market - a market that's currently "high on AI". How do you get a return on all of that AI investment if the administration can jump in at any time and say "Nope, you can't use this very advanced model!"? (the Iran "deal" over the weekend, I think helped cushion that blow, but eventually it's going to sink in)

khalic · 2026-06-17T17:26:54 1781717214

Yep, any American closed model is now a de facto existential risk for any company relying on them.

The latest open models are so good it’s worth the 6-8 months delayed capabilities. At least for coding

fnordpiglet · 2026-06-17T17:32:57 1781717577

The problem is there’s a real wall on the vram side. While fused main memory is ok the inference speeds on larger models are impractical. With vram on a GPU the machine class, power requirement, GPU costs, and other factors put them out of most people’s reach. Cloud GPUs require a second job to keep available and hot. What closed providers offer is packing and scale advantages as well as infrastructure. The scaling laws here aren’t the same as Moore’s law - in fact they predict more required hardware and more scale over time. Moore’s laws isn’t keeping up with expanded needs and the ability to fab and produce at scale the specific things that weren’t needed a few years ago are lagging. So it’s not a 6-8 month lag; it’s a lag that will be induced by hardware scarcity and an ever increasing lag until something fundamentally changes with matmul.

wahnfrieden · 2026-06-17T17:30:16 1781717416

I will use the best available while it is available. 8 months ago with Codex would be intolerable today.

realusername · 2026-06-17T17:35:04 1781717704

I believe we have somewhat plateaued and each percentage gained seems to be an exponential effort.

Fable was around 10x GPT5 pricing and 100x Chinese models pricing, was it really 100x better? I Don't think so.

If you want a personal story, I just solved a complicated coding problem with Kimi 2.7 that GPT 5.4 failed with.

wahnfrieden · 2026-06-18T09:46:47 1781776007

5.5 is far ahead 5.4. I don't see any plateau (from using these 16+ hours a day 7 days a week)

khalic · 2026-06-17T18:00:26 1781719226

On a personal level, ditto. But at a business level, this kind of uncertainty will kill you. You need to be able to plan ahead.

enraged_camel · 2026-06-17T17:29:44 1781717384

>> I can’t rely on using a technology that the US administration can ban at will.

And you think China will not do the same thing if their models ever become genuinely frontier-level?

khalic · 2026-06-17T17:45:16 1781718316

Then the US will publish their own open weights to outmanoeuvre china.

What’s intolerable is having a tool that’s subject to this risk.

So open models it is

mpalmer · 2026-06-17T17:29:54 1781717394

Good luck with your trade secrets I guess

khalic · 2026-06-17T17:56:22 1781718982

You can run the Chinese models on your infra, most are open weights. Not saying it’s out of the goodness of their heart, but the fact is, they’re open.

seviu · 2026-06-13T20:54:37 1781384077

I usually hit the limit when I am frustrated and I don’t want to understand what the problem is.

I am an engineer, and when I understand what’s going on, I never hit any limit.

seviu · 2026-06-13T20:50:53 1781383853

They have no choice because unlike OpenAI who backordered most of the Ram in the world, Dario decided they wouldn’t spend a dime on infrastructure.

A critical mistake if you ask me