Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Would it make this exercise even more interesting if we add that for every 25%+ improvement in val_bpb, existing limits (5 minute and VRAM usage) are also increased (by certain percentages)? This can simuate human-like dev iterations much more closely. Infra can be auto-scaled using a platform like Modal.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: