From what I understand, alignment and interpretability were rewarded as part of ...

		datameta 7 months ago \| parent \| context \| favorite \| on: AI Improves at Improving Itself Using an Evolution... From what I understand, alignment and interpretability were rewarded as part of the optimization function. I think it is prudent that we bake in these "guardrails" early on.