I think this is an interesting direction, but the natural next step would be to formulate some conjectures about the geometry of other LLMs, or testable hypotheses about how information flows with respect to character counting. Even checking some intermediate training checkpoints of Haiku would be interesting, since those would still be working off of the same architecture.
The biology metaphor they draw is interesting, because I think a biologist would be the first to tell you that you need more than one data point.
The point is to see how LLMs implement algorithms internally, starting with this simple, easily understood algorithm.
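For concreteness, here is a toy sketch of what that algorithm looks like stated outside a model (the segmentation below is a made-up stand-in, not any real tokenizer, and this is only my guess at the kind of internal computation being hypothesized):

```python
# Toy version of the character-counting task an LLM would have to
# implement internally: the model sees tokens, not characters, so it
# must in effect know a per-token character count and accumulate it
# across positions.
toy_tokens = ["straw", "berry"]  # hypothetical segmentation, for illustration only

# Per-token character counts the model would need to represent.
per_token = [len(t) for t in toy_tokens]

# Running accumulation across positions -- the candidate internal algorithm.
running, total = [], 0
for n in per_token:
    total += n
    running.append(total)

print(per_token)  # [5, 5]
print(running)    # [5, 10]
```

The appeal of starting here is exactly that the ground-truth algorithm is this short, so any internal circuit the model learns can be compared against it directly.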