Yes, I’m aware of that; I tried to find a better proof but failed. Attempts based on trying to compute the maximum possible change (instead of figuring out how to get a desired change) are doomed. Changing the last bit isn’t an infinitesimal change, so using calculus to bound the change via the maximum derivative won’t work. EfficientNets use swish activations rather than ReLUs, and swish isn’t piecewise linear, so we will have to deal with the chaotic dynamics that show up whenever non-linear functions are iteratively applied. The sigmoid inside the swish does eventually saturate...
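To make the point about local behavior concrete, here is a minimal sketch of the swish activation, assuming the standard definition swish(x) = x·σ(x). Unlike ReLU, it is smooth everywhere and only approaches linearity far from the origin, once the sigmoid factor saturates:

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def swish(x):
    # swish(x) = x * sigmoid(x); smooth, not piecewise linear like ReLU
    return x * sigmoid(x)

# The sigmoid factor saturates: for x >> 0, swish(x) ~ x, and for
# x << 0, swish(x) ~ 0. Near the origin it is genuinely non-linear,
# which is where derivative-based bounds on a finite input change break.
xs = np.array([-10.0, -1.0, 0.0, 1.0, 10.0])
print(swish(xs))
```

So a finite perturbation (like a flipped bit) passing through many such layers can't be tracked with a single worst-case slope.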
That might actually be easy to prove with some effort (or it might not). Consider this strategy:
Let’s assume that the inputs to the system are PNG images with 8-bit values between 0 and 255, that they are converted into floating-point tensors before entering the net, and that the bits you are talking about are those of the original images. And let’s also assume that the upscaling from the small MNIST images to the input of the net is such that each float of the tensor corresponds to exactly one value in the original image (that is, there is no interpolation). And...
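Under those assumptions, the effect of flipping the last bit of one pixel is easy to pin down. Here is a sketch of the assumed pipeline; the 28→224 upscale factor and the division by 255 are my illustrative choices, not anything established upthread:

```python
import numpy as np

def to_input_tensor(img_u8, scale=8):
    # Nearest-neighbor upscaling (no interpolation): each original pixel
    # is replicated over a scale x scale block, then mapped to [0, 1].
    up = np.kron(img_u8, np.ones((scale, scale), dtype=np.uint8))
    return up.astype(np.float32) / 255.0

img = np.zeros((28, 28), dtype=np.uint8)   # stand-in for an MNIST image
base = to_input_tensor(img)                # 224 x 224 float tensor

flipped = img.copy()
flipped[0, 0] ^= 1                         # flip the least significant bit
delta = to_input_tensor(flipped) - base

# The perturbation is exactly 1/255 on one scale x scale block and
# zero everywhere else, so its size at the net's input is known exactly.
print(delta.max())
```

So the question reduces to how a known, localized perturbation of size 1/255 propagates through the network, which is where the non-linearity discussed above bites.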
Since this comment is being upvoted, I have to ask, how would being autistic affect your decision-making in that situation?
I think (perhaps incorrectly) that everyone, except maybe children and drunk people, would remain quiet, and would either get angry or not depending on what they care about and on models of the situation that vary from person to person.
I mean, think of everything that would need to go wrong in order to scream “The emperor is naked!”:
They would need to be certain about what is going through the emperor’s mind. It seems more likely that the empero...