i think the perplexity is more important than tokens per second. tokens per seco... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		freediddy 78 days ago \| parent \| context \| favorite \| on: Can I run AI locally? i think the perplexity is more important than tokens per second. tokens per second is relatively useless in my opinion. there is nothing worse than getting bad results returned to you very quickly and confidently. ive been working with quite a few open weight models for the last year and especially for things like images, models from 6 months would return garbage data quickly, but these days qwen 3.5 is incredible, even the 9b model.

sroussey 78 days ago [–]

No, getting bad results slowly is much worse. Bad results quickly and you can make adjustments.

But yes, if there is a choice I want quality over speed. At same quality, I definitely want speed.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact