This isn't a hardware feat, this is a software triumph. They didn't make special...

pdpi · 2026-03-23T15:12:00 1774278720

It's both.

We haven't had phones running laptop-grade CPUs/GPUs for that long, and that is a very real hardware feat. Likewise, nobody would've said running a 400b LLM on a low-end laptop was feasible, and that is very much a software triumph.

bigyabai · 2026-03-23T16:27:12 1774283232

> We haven't had phones running laptop-grade CPUs/GPUs for that long

Agree to disagree, we've had laptop-grade smartphone hardware for longer than we've had LLMs.

pdpi · 2026-03-23T17:29:42 1774286982

Kind of.

We've had solid CPUs for a while, but GPUs have lagged behind (and they're the ones that matter for this particular application). iPhones still lead by a comfortable margin on this front, but have historically been pretty limited on the IO front (only supported USB2 speeds until recently).

bigyabai · 2026-03-23T23:03:12 1774306992

The GPUs are perfectly solid. Cheap Android handsets have shipped with Vulkan compliance for almost a decade now; the GPUs are equally-featured to consoles and PCs. The same goes for Apple handsets that run byte-identical Metal Compute Shaders to the Mac. For desktop use they are perfectly amenable. The hardware lacks nothing required for inference or gaming that dGPUs ordinarily support.

And even if you raise the requirements, we still have to contend with cheap CUDA-capable GPUs like the one in the ($300!!!) Nintendo Switch, or the Jetson SOCs. The mobile market has had tons of high-speed/low-power options for a very long time now.

mnkyprskbd · 2026-03-23T23:32:51 1774308771

We had LLMs for about 5 minutes or so. Hardly a measure of time for an industry that goes back half a century and then some.

smallerize · 2026-03-23T15:30:42 1774279842

The iPhone 17 Pro launched 8 months ago with 50% more RAM and about double the inference performance of the previous iPhone Pro (also 10x prompt processing speed).

SV_BubbleTime · 2026-03-23T16:58:40 1774285120

>triumph

It’s been a lot of years, but all I can hear after reading that is … I’m making a note here, huge success

GorbachevyChase · 2026-03-23T17:13:36 1774286016

There’s no use crying over every mistake. You just keep on trying until you run out of cake.

breggles · 2026-03-23T17:11:44 1774285904

It's hard to overstate my satisfaction!

anemll · 2026-03-23T17:51:47 1774288307

both, tbh