
They probably shrank the model from 175B to 17B parameters. That's your 10:1 price reduction.


Wouldn't that almost certainly lead to measurable loss of capabilities?


If the model was quantized or distilled correctly, not for a large swath of use cases and problem domains. For anything where the loss was not measured during distillation, very likely.
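To make "loss measured during distillation" concrete, here is a minimal sketch of one common distillation objective: the KL divergence between the teacher's and the student's temperature-softened output distributions. Nothing in the thread says this is what was actually done. The function names, the temperature, and the logits are all made up for illustration.

```python
import math

def softmax(logits, temperature=1.0):
    """Numerically stable softmax over a list of logits."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """KL(teacher || student) on temperature-softened distributions."""
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

# Hypothetical logits, not real model outputs.
teacher = [4.0, 1.0, 0.5]
student_seen = [3.9, 1.1, 0.4]    # input resembling the distillation data
student_unseen = [0.5, 1.0, 4.0]  # input the distillation data never covered

in_domain = distillation_loss(teacher, student_seen)
out_of_domain = distillation_loss(teacher, student_unseen)
print(in_domain < out_of_domain)
```

The point is only that this loss is computed on whatever inputs the distillation set contains. On inputs outside that set, nothing constrains the student to track the teacher, which is where the capability loss would show up.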



