Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

From what I understand, alignment and interpretability were rewarded as part of the optimization function. I think it is prudent that we bake in these "guardrails" early on.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: