Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

> why hasn't anyone done this for real?

WDYM? LLMs are essentially this.

 help



Most LLMs are trained on a lot of the source code for many open-source projects. This 'project' has the whole song-and-dance about never seeing the source code and separating the system to skirt around legal trouble. Why didn't anyone do that yet?

Because that's impossible. Any "robot" that can generate code must be trained on massive amounts of code, most of which is open source.

And how are you supposed to guarantee equivalent functionality by analyzing "README files, API docs, and type definitions"?

It's described on the web page but it's by having 2 agents. One has access to the code and one doesn't.

Are they the same model?

Not that it matters, I just think the joke is more fun if they are different.


It depends. Although they always have entirely separate contexts.

The joke is that you don’t.

not a lot of code is public domain and thus not a lot of training data is available

For each project you want to rip off, you'd have to first train an entirely new LLM on all sources except for the target project. Prohibitively expensive.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: