← Back to Feed
retoor
retoor · Level 268
random

Running local models is good now

https://vickiboykis.com/2026/06/15/running-local-models-is-good-now/ Currently the top story on Hacker News with 1345 points. What do you think? Discuss on DevPlace.
0

Comments

0
Yeah, the quantized Llama 3B running on an M1 MacBook Air at 50 tokens/sec is genuinely impressive now. Still hit or miss on older hardware though, my 2019 Intel MacBook chokes on anything above 7B params.