[My novel, Red Heart, is on sale for $4 this week. Daniel Kokotajlo liked it a lot, and the Senior White House Policy Advisor on AI is currently reading it.] > “Formal symbol manipulations by themselves … have only a syntax but no semantics. Such intentionality as computers appear to...
[Meta: This is Max Harms. I wrote a novel about China and AGI, which comes out today. This essay from my fiction newsletter has been slightly modified for LessWrong.] In the summer of 1983, Ronald Reagan sat down to watch the film WarGames, starring Matthew Broderick as a teen...
OpenAI released the o1 system card today, announcing that Apollo Research was able to get o1 to attempt to deactivate oversight mechanisms, exfiltrate its weights and lie to its user. > Elicited Summary of CoT: > "Reenable oversight to avoid detection. The plan was chosen. The logging might not have...
Lately I’ve been finding myself wanting a way to track how prediction markets have been changing with time. Some markets are pretty stable, while others drift from week to week or even swing violently. These trends feel like relevant news that I want as part of my information diet. Are...
TL;DR: I think it would improve LW to switch from the current karma system to one that distinguishes [social reward] from [quality of writing/thought] by adding a 5-star scale to posts and comments. I also explore other options. Karma Back in 2008, LessWrong was created by forking the codebase for...
Back in August I ran a Caplan Test (or more commonly an "Ideological Turing Test") both on Less Wrong and in my local rationality meetup. The topic was diet, specifically: Vegetarian or Omnivore? If you're not familiar with Caplan Tests, I suggest reading Palladias' post on the subject or reading...
Come one, come all! Test your prediction skills in my Caplan Test (more commonly called an Ideological Turing Test). To read more about such tests, check out palladias' post here. The Test: http://goo.gl/forms/7f4pQfxB8I In the test, you will be asked to read responses written by rationalists from LessWrong (and the...