Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
simonw
12 months ago
|
parent
|
context
|
favorite
| on:
Jagged AGI: o3, Gemini 2.5, and everything after
Right: we effectively all need our own evals for the tasks that matter to us... but writing those evals continues to be one of the least well documented areas of how to effectively use LLMs.
Consider applying for YC's Summer 2026 batch! Applications are open till May 4
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: