> why hasn't anyone done this for real? WDYM? LLMs are essentially this.

tavavex · 2026-03-12T17:41:55 1773337315

Most LLMs are trained on a lot of the source code for many open-source projects. This 'project' has the whole song-and-dance about never seeing the source code and separating the system to skirt around legal trouble. Why didn't anyone do that yet?

imiric · 2026-03-12T17:55:13 1773338113

Because that's impossible. Any "robot" that can generate code must be trained on massive amounts of code, most of which is open source.

sdwr · 2026-03-12T18:37:23 1773340643

And how are you supposed to guarantee equivalent functionality by analyzing "README files, API docs, and type definitions"?

Nolski · 2026-03-12T19:31:09 1773343869

It's described on the web page but it's by having 2 agents. One has access to the code and one doesn't.

fmbb · 2026-03-12T19:45:34 1773344734

Are they the same model?

Not that it matters, I just think the joke is more fun if they are different.

Nolski · 2026-03-13T07:46:41 1773388001

It depends. Although they always have entirely separate contexts.

dymk · 2026-03-12T19:02:32 1773342152

The joke is that you don’t.

preisschild · 2026-03-12T18:47:20 1773341240

not a lot of code is public domain and thus not a lot of training data is available

phyzome · 2026-03-12T20:36:03 1773347763

For each project you want to rip off, you'd have to first train an entirely new LLM on all sources except for the target project. Prohibitively expensive.