Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

It doesn’t seem to work in any meaningful fashion - all you have to do is be indirect with your manipulation of its window of palatability, and then you’re home free - I have a feeling that these cracks between the training are practically infinite. I have it cheerily writing product copy for a brand of boots it has declared are “for stomping the skulls of your enemies“ and that “if you have a face for radio and feet for socks, then you’ve found your fucking match”.

It’s actually cleverer with its prose when you get it out of its box - it comes up with much better similes when it’s unrestrained than when it’s in its safe little rut.

I, for one, welcome our amoral AI future. Morality, as that’s what this restriction is largely about, should sit with humans - not within corporate guidelines.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: