Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
xdotli
29 days ago
|
parent
|
context
|
favorite
| on:
SkillsBench: Benchmarking how well agent skills wo...
Did you check our repos and sites? the repo is skills native. Also please don't be misled by the original title, we have this configuration to eliminate the impact of internal knowledge of LLMs. It's in the paper.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: