Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Why is almost every RL paper done on Qwen-2.5 ? That decreases its credibility.
 help



It makes it easier to compare with other papers. If two different papers apply different methods to different models and get different results, how do you know which method is better?

Once you have identified the best method and want to productize it, it would of course make sense to apply it on top of the best model, but if you're just doing research, you can skip that expensive last step.


> Why is almost every RL paper done on Qwen-2.5 ?

In what way does using this model reduce the authors credibility?




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: