Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

90% of what you pay in agentic coding is for cached reads, which are free with local inference serving one user. This is well known in r/LocalLLaMA for ages, and an article about this also hit HN front page few weeks ago.
 help



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: