> add more RAM and GPU to the next iPhone and it's not a toy anymore
We're not going to get more RAM and GPU in consumer devices.
All of the supply is going into data center build-outs. As the hyperscaler gamble on the future continues, we get left with weaker (or more expensive) devices - not stronger ones.
The market makers make more money if we're left with thin clients. They're also the ones who control supply and the shapes of devices.
We're talking close to five orders of magnitude difference between 0.6t/sec and 35kt/sec.
While there are problems that can be solved at 0.6t/sec - particularly offline, at-the-edge, in-the-field applications - those are currently vastly outnumbered by applications that need far more throughput.
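A quick sanity check on that gap, using only the two throughput figures quoted above:

    # back-of-envelope: how big is the local-vs-datacenter gap really?
    import math

    local_tps = 0.6          # tokens/sec, the local figure quoted above
    datacenter_tps = 35_000  # tokens/sec, the datacenter figure quoted above

    ratio = datacenter_tps / local_tps
    print(f"ratio: {ratio:,.0f}x")                          # ~58,333x
    print(f"orders of magnitude: {math.log10(ratio):.2f}")  # ~4.77

So the gap is roughly 58,000x - enormous, but a bit under five orders of magnitude.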
absolutely, however this doesn’t mean we should abandon local. i can’t remember who, but someone in the ai nuts-and-bolts arena said “smaller local models are where the exciting stuff is happening right now. it’s the area where real fast progression is happening.” and it seems to be true. new big models aren’t making anywhere near the leaps smaller models are.
it’s so important we keep moving forward on running locally, for the same reason it was important to use open standards when building the internet. if we hadn’t, we’d all be connected through aol with 10 hours/month of allowed internet usage, terminal’d in through a sun workstation renting cpu cycles from some mainframe company at like “you’ve got 10,000 cpu cycles left on your monthly plan, please deposit $500 for 5,000 more.”
while all of this is before my time, i’ve heard and read so many horror stories about how people could only connect through dumb terminals to “you wouldn’t believe it, computers then were the size of buildings” machines 1000 miles away, and had to sign up for workload timeslots. make no mistake, this is the future these companies want: they want us to rent everything and own nothing.
Local is enough for most users as long as they're willing to accept a non-realtime response - a real limitation (especially for personal agentic use), but not a very significant one. The hardware isn't that expensive, and a single user's needs aren't going to saturate a state-of-the-art AI datacenter rack or anything like that - not even heavy agentic workloads will.
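To put rough numbers on that - a sketch only, where the 2M-tokens/day and 30t/sec figures are assumptions for illustration, and only the 35kt/sec rack throughput comes from upthread:

    # rough sketch: can one heavy agentic user saturate a datacenter rack?
    daily_tokens = 2_000_000   # ASSUMPTION: a very heavy agentic day
    local_tps = 30             # ASSUMPTION: mid-range local GPU, small model
    rack_tps = 35_000          # the datacenter figure quoted upthread

    hours_local = daily_tokens / local_tps / 3600
    avg_tps = daily_tokens / 86_400  # demand averaged over 24 hours
    print(f"local compute time: ~{hours_local:.1f} h")           # ~18.5 h
    print(f"rack utilization: {100 * avg_tps / rack_tps:.2f}%")  # ~0.07%

Under those assumptions, a single heavy user is overnight-batch territory locally and rounding error on a rack - which is exactly the non-realtime trade-off described above.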
You rent your broadband internet. It's not a foreign concept that we can't own all the infra.
I don't know why we can't just get over the local compute thing and instead build open infra and models in the cloud. That's literally the only way we'll be able to keep pace with hyperscalers.
Local is not going to benefit 99% of use cases. It's a silly toy.
If we build open infra for cloud-based provisioning and inference, we could build a future we still have some ownership in. We'd be able to fine tune large models for lots of purposes. We wouldn't be locked in to major vendors.