Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

He's burning Claude tokens to slightly improve his tiny and not very capable LLM? It's fun, I bet, but wake me up when it leads to a research breakthrough.
 help



Please don't fulminate or post snarky, shallow dismissals on HN. The guidelines make it clear we're trying for something better here. https://news.ycombinator.com/newsguidelines.html

I suspect Ant is already doing this for Claude. Takes a sh*t ton of compute though.

nanochat is super capable, the d34 (2.2b) variant is competitive with qwens of that size. Andrej is I assume building out the improvements in preparation for bigger training runs. We desperately need a truly open model, so i think this is incredibly important.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: