A simple version of this is done for panoramic photos. If he looked at the city from a consistent direction throughout the flight, that's all that's needed. If the direction varied, it can't have varied a lot - he had to at least see the sides of the building he was drawing, if maybe from a different angle, and not all the buildings would have been parallel. That kind of rotation seems doable with current image transformers (and that's only neccesary if the drawing has accurate angles even over long distances).
In any case, I don't think it matters to my argument if current ML can do it. All the parts that might be difficult for the computer are doable even for normal humans, and therefore not magical. The only thing that's added to the normal human skill here is perfect memory, which we know is easy for computers and have known for a long time.
To clarify the question: I agree that there is variation in talent and that some very talented people can do things most could never. My question is, if you look at the distribution of talent among normal people, and then check how many standard deviations out our savant candidate is, then what's the chance at least one person with that talent would exist? Basically, is this just the normal right tail that's expected from additive genetic reshuffling, or an "X-man".
Example 3: Stephen Wiltshire. He made a nineteen-foot-long drawing of New York City after flying on a helicopter for 20 minutes, and he got the number of windows and floors of all the buildings correct.
I think ~everyone understands that computers can do this. The "magical" part is doing it with a human brain, not doing it at all. Similarly, blindfolded chess is not more difficult than normal chess for computers. That may take a little knowledge to see. And "doing it faster" is again clear. So the threshold for magic you describe is not the one even the most naive use for AI.
Sentence lengths have declined.
Data: I looked for similar data on sentence lengths in german, and the first result I found covering a similar timeframe was wikipedia referencing Kurt Möslein: Einige Entwicklungstendenzen in der Syntax der wissenschaftlich-technischen Literatur seit dem Ende des 18. Jahrhunderts. (1974), which does not find the same trend:
Year | wps |
1770 | 24,50 |
1800 | 25,54 |
1850 | 32,00 |
1900 | 23,58 |
1920 | 22,72 |
1940 | 19,60 |
1960 | 19,90 |
This data on scientific writing starts lower than any of your english examples from that time, and increases initially, but arrives in the same place (insofar as wps are comparably across languages, which I think is fine for english and german).
6 picolightcones as well, don't think that changed.
Before logging in I had 200 LW-Bux, and 3 virtues. Now I have 50 LW and 8 virtues, and I didn't do anything. Whats that? Is there any explanation of how this stuff works?
I think your disagreement can be made clear with more formalism. First, the point for your opponents:
When the animals are in a cold place, they are selected for a long fur coat, and also for IGF, (and other things as well). To some extent, these are just different ways of describing the same process. Now, if they move to a warmer place, they are now selected for a shorter fur instead, and they are still selected for IGF. And there's also a more concrete correspondence to this: they have also been selected for "IF cold long fur, ELSE short fur" the entire time. Notice especially that there are animals actually implementing this dependent property - it can be evolved just fine, in the same way as the simple properties. And in fact, you could "unroll" the concept of IGF into a humongous environment-dependent strategy, which would then always be selected for, because all the environment-dependence is already baked in.
Now on the other hand, if you train an AI first on one thing, and then on another, wouldn't we expect it to get worse at the first again? Indeed, we would also expect a species living in the cold for very long to lose those adaptations relevant to the heat. The reason for this in both cases are, broadly speaking, limits and penalties to complexity. I'm not sure very many people would have bought the argument in the previous paragraph - we all know unused genetic code decays over time. But in the behavioral/cognitive version with intentionally maximizing IGF that makes it easy to ignore the problems, we're not used to remembering the physical correlates of thinking. Of course, a dragonfly couldn't explicitly maximize IGF, because its brain is to small to even understand what that is, and developing that brain has demands for space and energy incompatible with the general dragonfly life strategy. The costs of cognition are also part of the demands of fitness, and the dragonfly is more fit the way it is, and similarly I think a human explicitly maximizing IGF would have done worse for most of our evolution[1] because the odds you get something wrong are just too high with current expenditure on cognition, better to hardcode some right answers..
I don't share your optimistic conclusion however. Because the part about selecting for multiple things simultanuously, that's true. You are always selecting for everything thats locally extensionally equivalent to the intended selection criteria. There is not a move you could have done in evolutions place, to actually select for IGF instead of [various particular things], this already is what happens when you select for IGF, because it's the complexity, rather than different intent, that lead to the different result[2]. Similarly, reinforcement learning for human values will result is whatever is the simplest[3] way to match human values on the training data.
for AIs, more robust adversarial examples - especially ones that work on AIs trained on different datasets - do seem to look more "reasonable" to humans.
Then I would expect they are also more objectively similar. In any case that finding is strong evidence against manipulative adversarial examples for humans - your argument is basically "there's just this huge mess of neurons, surely somewhere in there is a way", but if the same adversarial examples work on minds with very different architectures, then that's clearly not why they exist. Instead, they have to be explained by some higher-level cognitive factors shared by ~anyone who gets good at interpreting a wide range of visual data.
The really obvious adversarial example of this kind in human is like, cults, or so
Cults use much stronger means than is implied by adversarial examples. For one, they can react to and reinforce your behaviour - is a screen with text promising you things for doing what it wants, with escalating impact and building a track record an adversarial example? No. Its potentially worrying, but not really distinct from generic powerseeking problems. The cult also controls a much larger fraction of your total sensory input over an extended time. Cult members spreading the cult also use tactics that require very little precision - there isn't information transmitted to them on how to do this, beyond simple verbal instructions. Even if there are more precision-needing ways of manipulating individuals, its another thing entirely to manipulate them into repeating those high precision strategies that they couldn't themselves execute correctly on purpose.
if you're not personally familiar with hypnosis
I think I am a little bit. I don't think that means what you think it does. Listening-to-action still requires comprehension of the commands, which is much lower bandwidth than vision, and its a structure thats specifically there to be controllable by others, so it's not an indication that we are controllable to others in other bizzare ways. And you are deliberately not being so critical - you haven't, actually, been circumvented, and there isn't really a path to escalating power - just the fact youre willing to oey someone in a specific context. Hypnosis also ends on its own - the brain naturally tends back towards baseline, implanting a mechanism that keeps itself active indefinitely is high-precision.
Ok, thats mostly what I've heard before. I'm skeptical because:
No. That formula would imply that, if the coin is 30% for sure and you buy it for 0.3, you make 0.2 in expectation, which you don't, you make 0 regardless of what price you buy at.
Note that this kind of problem has also shown up in decision theory more generally. This is a good place to start. In particular, it seems like your problem can be fixed with epsilon exploration (if it doesn't do so automatically, as per Soares), both the EDT and CDT variant should work.