Roman Malov

Bachelor in general and applied physics. AI safety researcher wannabe.

Email: roman.malov27@gmail.com
GitHub: https://github.com/RomanMalov
TG channel (on Russian): https://t.me/healwithcomedy

Wiki Contributions

Comments

Sorted by

I've read it as a part of Agents Foundation course, and I consider this post really effective and clarifying. It got me thinking, can this generalize to other failure modes? Like if programers notice that AI spend too much resources on self-preservation, and then train against such behavior, this failure mode would still arise because self-preservation is an instrumental goal and is a fact about the world and ways in which goal can be achieved in this world.

I'm not a native speaker, can someone please explain the meaning of "Hell is wasted on the evil" in simpler terms?

Sebz n gval fcbg ba gur raq bs Uneel'f jnaq, n phovp zvyyvzrgre bs napube, fgergpurq bhg n guva yvar bs Genafsvtherq fcvqre-fvyx.

sebz gur puncgre 114

Or if I'd - if I'd only gone with - if, that night -

I'm guessing he is talking about the night he lost his potential phoenix.

I think that's intended author's choice. Like what Harry saw was too terrible to acknowledge. Or maybe it's just to create more suspense.

Snape told him that he wanted to check if Harry resembled his father, and the test consisted of stopping bullies, so that might be the reason for Harry's guess.

Load More