You're looking at Less Wrong's discussion board. This includes all posts, including those that haven't been promoted to the front page yet. For more information, see About Less Wrong.

Jan_Rzymkowski comments on Surprising examples of non-human optimization - Less Wrong Discussion

19 Post author: Jan_Rzymkowski 14 June 2015 05:05PM

You are viewing a comment permalink. View the original post to see all comments and the full post content.

Comments (9)

You are viewing a single comment's thread. Show more comments above.

Comment author: Douglas_Knight 14 June 2015 09:01:49PM 4 points [-]

Note that (2) and (3) are formal tasks where the optimizer has access to the full set of rules. My understanding is that a lot of chips designed by optimizing a simulator have been pretty lousy in the real world, either being complete failures that only worked in the simulator because of bugs, or being real solutions, but not being usefully robust.

Comment author: Jan_Rzymkowski 14 June 2015 10:24:37PM 3 points [-]

Actually for (2) the optimizer didn't know the set of rules, it played the game as if it were normal player, controlling only keyboard. It has in fact started exploiting "bugs" of which its creator were unaware. (Eg. in Supermario, Mario can stomp enemies in mid air, from below, as long as in the moment of collision it is already falling)

Comment author: Douglas_Knight 14 June 2015 10:44:05PM 2 points [-]

It knows the rules in the sense that the game is built into the optimizer. There's a reason "time travel" is in the title of the paper.