Rafael Harth
I'm an independent researcher currently working on a sequence of posts about consciousness. You can send me anonymous feedback here: https://www.admonymous.co/rafaelharth.
[Thanks to Steven Byrnes for feedback and the idea for section §3.1. Also thanks to Justis from the LW feedback team.] Remember this? Or this? The images are from WaitButWhy, but the idea was voiced by many prominent alignment people, including Eliezer Yudkowsky and Nick Bostrom. The argument is that...
[Thanks to wilkox for helpful discussion, as well as Charlie Steiner, Richard Kennaway, and Said Achmiz for feedback on a previous version. Extra special thanks to the Long-Term Future Fund for funding research related to this post.] Consciousness Explained is a book by philosopher and University Professor Daniel Dennett. It's...
[Thanks to Charlie Steiner, Richard Kennaway, and Said Achmiz for helpful discussion. Extra special thanks to the Long-Term Future Fund for funding research related to this post.] [Epistemic status: confident] There's a common pattern in online debates about consciousness. It looks something like this: One person will try to communicate...
I just bought a subscription to access GPT-4 and played the following chess game against it, with me playing white. (No particular agenda, was just curious how good it is.) At this point (move 31), GPT-4 suggested Kxc4, which is not legal, and when I asked it to correct, it...
EDIT 2023/07: I recommend reading part one of this book review instead of (or at least before) this post; it does a better job explaining formal systems. This is a post I wrote back in September of 2020 and am now publishing due to the changed incentives. The idea was something...
(Related to What an Actually Pessimistic Containment Strategy Looks Like) It seems to me like there are several approaches with an outside chance of preventing doom from AGI. Here are four: 1. Convince a significant chunk of the field to work on safety rather than capability 2. Solve the technical...
Sometimes, you want to do something hard, like working on a project, but you can't get yourself to do it in the moment. I think most advice one reads about this problem (at least outside of LW) is useless, hence the title. The aspiration of this post will be to...