
> The issue isn’t 5.4 > 5.2 etc. It is that there is a second dimension which is the model size and a third dimension which is what it is tuned for.

All 3 models are tuned for general purpose work.

Model size isn’t how you pick which model to use. You pick based on performance in evals compared to price.
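To make that concrete, here is a rough sketch of picking on evals versus price: take the cheapest model that clears whatever eval score you need. The model names, scores, prices, and the `pick_model` helper are all made-up placeholders for illustration, not real benchmark results or real pricing.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class Model:
    name: str
    eval_score: float  # hypothetical eval score, higher is better
    price: float       # hypothetical price per million tokens


def pick_model(models: Sequence[Model], min_score: float) -> Optional[Model]:
    """Return the cheapest model whose eval score meets min_score, or None."""
    good_enough = [m for m in models if m.eval_score >= min_score]
    return min(good_enough, key=lambda m: m.price, default=None)


# Placeholder numbers only; substitute the evals and prices you care about.
CANDIDATES = [
    Model("small", eval_score=60.0, price=1.0),
    Model("medium", eval_score=75.0, price=4.0),
    Model("large", eval_score=82.0, price=15.0),
]

choice = pick_model(CANDIDATES, min_score=70.0)
print(choice.name if choice else "no model meets the bar")
```

Note that parameter count never appears as an input. Size only matters indirectly, through whatever it does to the score and the price.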

It’s not hard to imagine that the more expensive models are probably larger or have higher compute requirements.


