
It's in this presentation: https://www.youtube.com/watch?v=qbIk7-JPB2c

The most significant thing I took away is that once safety "alignment" was applied, the model's ability plummeted. That really makes me wonder how much better these models would be if they weren't lobotomized to prevent them from saying bad words.


