twanvl comments on A model of UDT without proof limits - Less Wrong

13 Post author: cousin_it 20 March 2012 07:41PM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (37)

You are viewing a single comment's thread. Show more comments above.

Comment author: cousin_it 20 March 2012 08:12:35PM *  3 points [-]

If A's proof system is sound, that directly implies that A won't generate "spurious" proofs.

Nope. In the example problem from the post, A's proof system will happily prove statements like "A()==1 implies U()==100". After all, A() finishes in finite time and returns 2, so any proof system worth its salt will eventually prove that A()==1 is false by simulating A, and a false statement implies anything. The challenge is to not let the agent stumble upon spurious proofs before it returns a value.

Comment author: twanvl 22 March 2012 12:07:32PM 0 points [-]

any proof system worth its salt will eventually prove that A()==1 is false by simulating A,

Not if A includes the proof search! No sound proof system will be able to say anything interesting about programs that use the system itself.

Comment author: cousin_it 22 March 2012 12:26:24PM *  1 point [-]

If the program doesn't terminate, the proof system may be unable to prove that. But if the program terminates in finitely many steps, any good enough proof system allows for a proof that just simulates all the steps one by one. Of course the program itself won't reach that particular proof because the proof is too long.