Interesting! My interpretation was different: that the maps were the same, and both the master and the novice used checking methods that falsely accepted a generally bad map. The master ended up wrong about the map, while the novice ended up wrong about the map and also dead.
The Jones Act is definitely not helping, but shipping a 1 lb package 8 miles is currently $13.77 by UPS. Shipping from Japan is $5.
Flagging this one as worth re-reading if you don't catch it. It took me three rounds (the first was admittedly skimming).
Second on Tux Paint
Tux Racer (penguin sledding) and SuperTux (platformer) are games with level editors. My three-year-old loves SuperTux and its level editor, but it is a well-enough put-together game that it is starting to be addicting for him.
Whenever he sees me working, I'm on a terminal, and he wanted to learn how to use a terminal. I taught him how to type:
```
sl
sl -a
sl; sl
sl | lolcat
cowsay hi
```
etc., and he found this very amusing. He will often demand to "make a train" if I get the laptop out where he can see me.
In a word, yes. Very unappealing.
Cart-pole balancing seems like a good toy case
Is it relevant whether you knew about the apples before the apple man told you about them? If you didn't know, then the least exploitable response to a message that looks adversarial is to pretend you didn't hear it, which would mean not eating the apples.
Also, Pascal's mugging is worth coordinating against: if everyone gives the 5 dollars, the stranger rapidly accumulates wealth via dishonesty. If no one eats the apples, then the stranger just keeps the same tree with fewer and fewer of its apples getting eaten, which is less caustic.
One way I could write a computer program that, e.g., lands a rocket ship is to simulate many landings that could happen after possible control inputs, pick the simulated landing that has properties I like (such as not exploding and staying far from actuator limits), and then run a low-latency loop that locally makes reality track that simulation, counting on the simulation to reach a globally pleasing end.
Is this what you mean by loading something into your pseudo prediction?
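To make that concrete, here is a rough toy sketch of what I have in mind, not a real guidance system. Everything in it is my own assumption for illustration: a 1-D lander, a "free fall, then constant thrust" family of plans, the cost weights, the PD gains, and the hypothetical names `simulate`, `pick_plan`, and `fly`. Reality is modeled as the same dynamics with 5% stronger gravity than the simulation expected.

```python
G = 9.81          # gravity assumed by the simulator, m/s^2
U_MAX = 30.0      # actuator limit on thrust acceleration, m/s^2
DT = 0.005        # time step, s
H0 = 20.0         # starting height, m
SAFE_SPEED = 2.0  # touchdown speed I treat as "not exploding", m/s


def simulate(ignite_step, thrust):
    """Roll out one plan: free fall, then constant thrust from ignite_step on.

    Returns a list of (height, velocity, thrust) per step, or None if the
    rocket starts climbing instead of landing.
    """
    h, v, step, traj = H0, 0.0, 0, []
    while h > 0.0:
        u = thrust if step >= ignite_step else 0.0
        traj.append((h, v, u))
        v += (u - G) * DT
        h += v * DT
        if v > 0.0:
            return None
        step += 1
    traj.append((0.0, v, u))  # touchdown state
    return traj


def pick_plan():
    """Simulate many candidate landings and keep the one I like best."""
    best, best_cost = None, float("inf")
    for thrust in (12.0, 16.0, 20.0, 24.0, 28.0):
        for ignite_step in range(400):
            traj = simulate(ignite_step, thrust)
            if traj is None:
                continue
            speed = -traj[-1][1]
            if speed > SAFE_SPEED:
                continue  # this one explodes
            # Prefer gentle touchdowns that stay far from the actuator limit.
            cost = speed + 10.0 * thrust / U_MAX
            if cost < best_cost:
                best, best_cost = traj, cost
    return best


def fly(plan, real_g, kp, kv):
    """Tracking loop: feed forward the planned thrust, correct toward the plan."""
    h, v, step = H0, 0.0, 0
    while h > 0.0 and step < 100_000:
        ref_h, ref_v, ref_u = plan[min(step, len(plan) - 1)]
        u = ref_u + kp * (ref_h - h) + kv * (ref_v - v)
        u = min(max(u, 0.0), U_MAX)  # respect actuator limits
        v += (u - real_g) * DT
        h += v * DT
        step += 1
    return -v  # touchdown speed


plan = pick_plan()
tracked = fly(plan, real_g=1.05 * G, kp=4.0, kv=4.0)    # loop tracks the plan
open_loop = fly(plan, real_g=1.05 * G, kp=0.0, kv=0.0)  # replay inputs blindly
print(f"tracked: {tracked:.2f} m/s, open loop: {open_loop:.2f} m/s")
```

The tracking loop never reasons about landing at all. It only pulls reality toward the chosen simulation, and the margin from `U_MAX` that the plan was picked for is what gives it room to do that.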
I like it as ambiguous. The master's policy works in either interpretation, which I suspect is what makes it a good policy.