V_V comments on An angle of attack on Open Problem #1 - Less Wrong

30 Post author: Benja 18 August 2012 12:08PM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (84)

You are viewing a single comment's thread. Show more comments above.

Comment author: Eliezer_Yudkowsky 12 March 2013 01:10:26AM 1 point [-]

(Requested reply.)

I think there'd be a wide variety of systems where, so long as the "parent" agent knows the exact strategy that its child will deploy in all relevant situations at "compile time", the parent will trust the child. The point of the Lob problem is that it arises when we want the parent to trust the child generally, without knowing exactly what the child will do. For the parent to precompute the child's exact actions implies that the child can't be smarter than the parent, so it's not the kind of situation we would encounter when e.g. Agent A wants to build Agent B which has more RAM and faster CPUs than Agent A while still sharing Agent A's goals. This, of course, is the kind of "agents building agents" scenario that I am most interested in.

Comment author: V_V 20 December 2013 02:02:00AM *  2 points [-]

That's not the case.

Consider a resource-constrained variant of the original game:

Each program receives as input the round number n and the next program, encrypted by repeated xoring with the output of a monotonic computable function f(n).
Let T_f(n) be the runtime of the fastest algorithm that computes f. Note that T_f(n) is monotonically increasing.

At round n, the current program has a time limit T(n) = C_0 + C_1 * T_f(n). Quirrell never submits programs that exceed the time limit.
In this variant of the game, you have to submit the first program, which has to obey a time limit T(0).

The initial program will not be able to compute the relevant strategy of any of its successors (except at most finitely many of them).
And yet, my quining solution still works (just add a decryption step), and I think Wei Dai's solution also works.

Comment author: cousin_it 20 December 2013 10:49:53AM *  0 points [-]

It seems to me that the variant with time limits has a simple quining solution:

1) Get the time limit as input.

2) Spend almost all available time trying to prove in some formal theory that the next program is equivalent to this one.

3) Double down if a proof is found, otherwise take winnings.

That's similar to your idea, right? I'm not sure if it addresses Eliezer's objection, because I no longer understand his objection...

Comment author: V_V 20 December 2013 06:05:07PM 0 points [-]

That's similar to your idea, right?

Yes.