they dont change the model weights (no frontier lab does). if you have evals and all prompts, tool calls the same, I'm curious how you are saying performance decreased..
It's well worth the $20 to not deal with any limits and have it handle all the boilerplate repetitive BS us programmers seem forced to deal with. I think 80% of the benefit comes from spending that $20 (20%? :P) and just having it do the lame shit that we probably shouldn't have to do but somehow need to.
I know. I feel troubled for all providers. (I have never used Grok, for example, for obvious ethical reasons.) But Anthropic seems, currently, better than OpenAI. I always want to encourage the better behaviour and discourage the worst.
reply