Benya

The sin of updating when you can change whether you exist

Trigger warning: In a thought experiment in this post, I used a hypothetical torture scenario without thinking, even though it wasn't necessary to make my point. Apologies, and thanks to an anonymous user for pointing this out. I'll try to be more careful in the future. Should you pay up...

Feb 28, 2014 (17)

SUDT: A toy decision theory for updateless anthropics

The best approach I know for thinking about anthropic problems is Wei Dai's Updateless Decision Theory (UDT). We aren't yet able to solve all problems that we'd like to—for example, when it comes to game theory, the only games we have any idea how to solve are very symmetric ones—but...

Feb 23, 2014 (27)

I like simplicity, but not THAT much

Followup to: L-zombies! (L-zombies?)
Reply to: Coscott's Preferences without Existence; Paul Christiano's comment on my l-zombies post
In my previous post, I introduced the idea of an "l-zombie", or logical philosophical zombie: A Turing machine that would simulate a conscious human being if it were run, but that is never...

Feb 14, 2014 (38)

L-zombies! (L-zombies?)

Reply to: Benja2010's Self-modification is the correct justification for updateless decision theory; Wei Dai's Late great filter is not bad news
"P-zombie" is short for "philosophical zombie", but here I'm going to re-interpret it as standing for "physical philosophical zombie", and contrast it to what I call an "l-zombie", for...

Feb 7, 2014 (55)

Results from MIRI's December workshop

Last week (Dec. 14-20), MIRI ran its 6th research workshop on logic, probability, and reflection. Writing up mathematical results takes time, and in the past, it's taken quite a while for results from these workshops to become available even in draft form. Because of this, at the December workshop, we...

Jan 15, 2014 (73)

Naturalistic trust among AIs: The parable of the thesis advisor's theorem

Eliezer and Marcello's article on tiling agents and the Löbian obstacle discusses several things that you intuitively would expect a rational agent to be able to do that, because of Löb's theorem, are problematic for an agent using logical reasoning. One of these desiderata is naturalistic trust: Imagine that you...

Dec 15, 2013 (37)

Meetup : Bristol meetup

Discussion article for the meetup : Bristol meetup
WHEN: 20 October 2013 02:00:00PM (+0100)
WHERE: Hodgkin House, 3 Meridian Place, Bristol BS8 1JG
We'll have another meetup in Bristol this upcoming Sunday, October 20, at the student house where I live. We'll officially start at 2pm to hopefully make it...

Oct 15, 2013 (8)

Results from MIRI's December workshop

A model of UDT with a concrete prior over logical statements

L-zombies! (L-zombies?)

An angle of attack on Open Problem #1
