There's no "just" in RL. Fine tuning is very important and could make a lot of d... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		HeavyStorm 7 days ago \| parent \| context \| favorite \| on: Cursor Composer 2 is just Kimi K2.5 with RL There's no "just" in RL. Fine tuning is very important and could make a lot of difference.

		help

lukaslalinsky 7 days ago | [–]

Indeed, this is quite obvious on Claude models vs Gemini. I fully believe Gemini is more powerful model, but the post training process is nowhere near what Anthropic does, which results in Gemini being horrible at coding sessions, while Claude is excellent.

merlindru 7 days ago | [–]

apparently GPT-5 uses the same pretrain as 4o did, hah

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact