The hostile telepaths problem

Valentine

18 min read

•

Newcomblike self-deception

•

Sketch of a real-world version

•

Possible examples in real life

•

386 The hostile telepaths problem

by Valentine

27th Oct 2024

18 min read

386

Epistemic status: model-building based on observation, with a few successful unusual predictions. Anecdotal evidence has so far been consistent with the model. This puts it at risk of seeming more compelling than the evidence justifies just yet. Caveat emptor.

Imagine you're a very young child. Around, say, three years old.

You've just done something that really upsets your mother. Maybe you were playing and knocked her glasses off the table and they broke.

Of course you find her reaction uncomfortable. Maybe scary. You're too young to have detailed metacognitive thoughts, but if you could reflect on why you're scared, you wouldn't be confused: you're scared of how she'll react.

She tells you to say you're sorry.

You utter the magic words, hoping that will placate her.

And she narrows her eyes in suspicion.

"You sure don't look sorry. Say it and mean it."

Now you have a serious problem. You don't have an internal "actually mean it" button. And yet here's Mom peering into your soul and demanding that you both have that button and press it. Trying to appease her didn't work. She needs you to be different — and she's checking.

What can you do now?

This is a template for what I've come to call "the hostile telepaths problem". I think it's a common feature of social problems. The hostile telepaths problem is when you're dealing with a being (a) who can kind of read your internal experiences and (b) whom you don't trust won't make your situation worse due to what they find in you.

There are lots of solutions to the hostile telepaths problem. I don't claim to know all of them. But recognizing some common ones has helped clarify a lot of my thinking — particularly around self-deception and akrasia.

And getting very clear on the nature of the problem makes identifying real solutions way easier. This fact produces some previously-surprising-to-me predictions, especially for trauma processing and for making emotionally difficult decisions.

I'll try to spell out what I mean with some theory and a few examples.

Newcomblike self-deception

There's one really tricky solution to the hostile telepaths problem. It deserves some special front-loaded attention before I name some other solutions.

Here I'll try to spell out its logic with a modification of Newcomb's Problem.

Suppose that instead of Omega, you're dealing with Omega-V. Omega-V is an extremely good but imperfect predictor of someone's box choice.

Through some clever sleuthing, you deduce why. Omega-V is scanning only folks' visual thinking for a little while prior to their box choice. It's extremely good at tracking the difference between the mental visualizations of one-boxers vs. two-boxers… provided they don't know Omega-V is scanning specifically their visual thinking.

So now you have an edge. Maybe you can trick Omega-V. Maybe you can visually think like a one-boxer and use other modes of reasoning (like inner talk) to plan on actually two-boxing. You just have to be very careful to have any "leaks" of reasoning into visualization look totally normal for a one-boxer.

If you decide it's worth the risk to try for this deception, the visual reasoning part of your mind has to be kind of siloed from the rest. It's like you have a subagent that truly honestly believes you're a one-boxer. It's the non-visual reasoning parts of you that are aware of the real plan.

Another way of saying this is: in normal Newcomb's Problem, you can't tell whether you're in the real world being faced with Omega's offer or if you're instead Omega's simulation of real-you and you're revealing what real-you will do. The situation is similar with Omega-V, but in this case it's specifically your visual reasoner (viewed as kind of a subagent) that has to be built in such a way that it can't tell whether it's a simulation. Many other parts of you can tell perfectly well what's going on!^[1] But for the deception to work, the way in which you plan on two-boxing cannot be something your inner visual reasoner can even deduce.

Now, let's take this one step further.

Let's suppose we're instead dealing with Omega-C. Also an imperfect predictor. But this time you learn that Omega-C scans specifically all of — and only — your conscious thinking.

At this point, the right strategy is to consciously decide you're going to one-box.

But some people still find themselves two-boxing and winning. Even while consciously intending to one-box. Sometimes they get confused and say the wrong thing or push the wrong button. Some of them trip and accidentally grab both boxes. Some laugh and grab both boxes and find them both full.

Assuming that at least some of those outcomes are strategic… how?

Hint: now you (by which I mean conscious parts of you) are maybe the siloed part. Basically an illusion meant to trick Omega-C.

The thing is, you cannot tell. Because if you could, then the deception — if it were there — wouldn't work.

Which is to say: on the inside, you cannot tell the difference between running this Omega-C deception strategy and honestly truly just being a one-boxer.

And furthermore, it's actually to your advantage not to check. Even if you were to find some way of being able to tell the difference!

But because of that, if you're aware of this deception strategy, it might work best if you honestly (from your perspective) look for some way to tell if you're running it. Actively trying to "come clean". That's a pretty powerful signal to Omega-C that you really are a one-boxer.

So if you can make it so that it seems to you like you really are looking, including accounting for awareness of possible non-conscious "shadow" reasoning, and you find none, but you still two-box in the end…

…well, you win!

…while the conscious parts of you are confused and surprised. And possibly quite dismayed.

Sketch of a real-world version

I call this pattern "Newcomblike self-deception" as a nod to Nate Soares' point that Newcomblike problems are the norm.

The deal is that people are kind of telepathic, in the sense of using things like vocal tone and body language and microexpressions and so on to intuitively deduce what might otherwise be hidden in others. They're scanning for things like intent and strategy and emotional tone. Often this telepathy is in service of synching up ("Hey, you seem upset, is something wrong?"), but it doubles as threat detection.

This telepathy is imperfect. Which means that sometimes Newcomblike self-deception is in fact a viable strategy.

I'd like to name one way I think this type of self-deception can actually happen in a person. It might be the main way it happens, or it might be rare. I honestly don't know. But it's one I've in fact seen in myself^[2] and I think I've observed in some others.

By some mysterious method, it's possible to contract your awareness — by which I mean, the space of things you're actually aware of can be smaller than the space of sensory inputs (including mental experiences like thoughts and memories). Lots of people experience this when watching TV (losing awareness of the room), or when deep in flow work (not noticing hunger for hours while programming).

If you construct a sort of fake self in your mind, and then contract your awareness around that fake self, it can seem to you on the inside like you really are the way depicted in the fake self. Like it's not fake, it really is who you are.

If you also build up explanations to your fictitious self about why things outside that fiction either are consistent with it or don't matter, then you both (a) can honestly display to hostile telepaths that you (here meaning fake you) are being fully sincere in not hiding anything and (b) possibly give the telepaths ways of discounting the unavoidable signals that you (here meaning you holistically) are hiding something.

For instance, as a child whose mother says to you "Say you're sorry and mean it", you might be able to strategically misinterpret your fear of Mom's Wrath as "being really sorry". As long as you're not aware that that's what you're doing, it might work very well! She might read your distress as you really meaning it. ("I'm sorry I'm sorry I won't do it again please Mom I'm sorry…!") And you can keep yourself from being aware of this whole strategy by keeping your awareness contracted on the fictitious version of yourself that's "bad" and "very sorry", and keeping your understanding of the real problem outside of your awareness.

Possible examples in real life

Here are some examples I think I've actually seen — in culture, in others, and in myself:

I think the thing with kids that I sketched above really does happen. More generally, I think similar applications of Newcomblike self-deception are the root cause of (a certain very common kind of) shame: it's a strategic mislabeling of one's pain as being about one's "flaws".
Relatedly, lots of folk mislabel their experience as "I hate math." Most people I've talked to who say this actually hate the coercion and gaslighting used almost universally in math classes. The real problems most folk are focused on in math class are social, like "Appease the teacher" and "Get Mom & Dad off my back." But teachers and parents might insist to a student that "you need to try harder" with the math itself while seeming to sort of telepathically scan them for whether they are in fact trying. I think this can sometimes lead students to strategically mislabel their distress about the situation to themselves.
Gurus getting involved in sex scandals. I'm sure that at least some of them have been very sincere about what amounts to real Jungian shadow work. But somehow all that sincerity mysteriously ends up hiding and serving (instead of revealing and dealing with) an underlying drive to just get laid.
Likewise people "accidentally" cheating. Sometimes folk really are just surprising-to-them vulnerable in some situation and don't have the right kind of discipline when they turn out to need it. But the fact that that ever happens can act as a cover. It's especially obvious in cases of repeated "accidental" cheating.
I've seen four friends, as mothers, stay with and defend abusive partners (boyfriends or husbands) for years. She'd often insist that he's just stressed, or it's a frequent misunderstanding but they love each other, etc. In three cases it became possible for her to consider that he might be abusive after a change in her work gave her enough money to support herself and her child without him if need be. In the fourth case, the mother got a lot of social support such as a place to live and people she trusted to take care of her and her child, and then she had room to consider her partner's actions as abusive.
If I'm upset with a friend and I'm worried that they can't handle what I'm upset with them about, sometimes I can't think straight about what my problem with them is while I'm talking to them. My mind gets foggy, my concepts seem mushy even to me, the words I remember from journaling about it before now form what feels like a gibberish argument, etc. Often this fog suddenly clears up if I get a vivid sense from my friend that our friendship will be fine after we talk. It also gets clearer if the issue is so big that I realize I'm fine with them not being in my life after we talk.
Badly wanting someone to like you can make them like you less. So how do you get them to like you? Not by being aware that you're asking that question! But maybe if you do things for them without knowing that's why you're doing them…? ("Oh, I forgot Bob likes sushi! I just got some because I felt like it, honest!") And maybe if you add an extra dose of self-loathing ("God, I'm being a creep, aren't I? I always do that!") you can pass ~~Omega-C's~~ others' scrutiny here by eliciting care & concern when you might otherwise get caught.

I'm not trying to be exhaustive here. There are tons more examples.

Ending the need for self-deception

By this model, to end (Newcomblike) self-deception, we have to remove the need for it. This means solving each instance of the hostile telepath problem some other way.

This is kind of tricky in practice. When you use self-deception to deal with a hostile telepath, you can't know that that's what you're doing. You^[5] can't even know which hostile telepath problem you're solving! So how do you come up with another solution?

I don't have a provably general answer, but I have a pretty general approach that makes sense to me and has clearly worked several times. I'll share that approach here.

Welcome self-deception

First is welcoming that I'll self-deceive.

But this isn't "Well, I'm going to do it anyway, so I might as well be okay with it." That's nonsense: you probably can't just "be okay" with it. And trying probably makes the problem worse!^[6]

I mean something more wholehearted. If I self-deceive, it's because it's the best solution I have to some hostile telepath problem. If I don't have a better solution, then I want to keep deceiving myself. I don't just tolerate it. I actively want it there. I'll fight to keep it there!

This is somewhat akin to dealing with Omega-C by saying:

Look, I know it's possible I'm running a deception strategy. I could spend a bunch of energy trying to suss it out as a costly signal that it's not there. But at a policy level I'm just not going to do that. Not because I have evidence that I'm not following up on, but because I don't want to add stress to myself in the world where I really am self-deceiving. Since I'm doing this regardless of whether the deception strategy is running, it's not information about whether I'm secretly trying to two-box.

This relieves pressure. If I have some sense that I'm self-deceiving, and my attitude is to back the deception instead of trying to penetrate it, then the hidden part of me running the deception doesn't have to engage in an internal arms race with me. We become same-sided.

Look away when directed to

Once I really back my own self-deception, it becomes easier to notice signs I'm doing it.

This works way better if I trust my occlumency skills here. If I don't feel like I have to reveal the self-deceptions I notice to others, and I trust that I can and will hide it from others if need be, then I'm still safe from hostile telepaths.

Seeing where I self-deceive doesn't mean I see what the deception is. In practice it's more indirect than that. What I mean are things like:

Revealed preferences. (Akin to noticing I two-boxed "by accident".)
My mind suddenly going foggy.
Forgetting what I was thinking about.
Mental chatter getting loud.
Suddenly being very disinterested in what I'm focused on.
Getting abruptly absorbed in something unrelated.
My attention scattering.
Losing awareness of my body, or parts of my body, or my body drives (like hunger).
Body activation signs: holding my breath, tensing my shoulders, quickened speech, etc.
Energy crash or getting really sleepy. (Like a freeze response.)
A sudden addictive impulse.
I feel shame, inadequacy, or otherwise think I'm broken or flawed or bad in some way.
Etc.

I don't mean this as an exhaustive list. Nor do I mean it as things to look out for. Nor do I mean that these always imply that self-deception is going on.

What I mean is, there are things a person does to maintain self-deception. If you basically promise the strategic not-conscious-to-you part that you really will respect the strategy, then it doesn't have to keep you so firmly out of the loop. Then you can potentially start picking up on some signposts like these ones.

Part of the deal is, when you notice such a possible signpost, you look away. You notice it and you drop the inquiry. Because until you have a non-self-deceptive strategy for whatever the real problem is, you don't want to break the one strategy you have.

For instance, sometimes I'll think about responding to an email… and I start getting sleepy. If I push, I start wanting to watch YouTube. These are signs that something in me doesn't trust it's safe for me to look there. Maybe it involves a decision that requires me to ask myself an unsafe question. I don't know — and I don't try to figure it out. At least not right away. Instead I back off and direct my attention elsewhere. Maybe I go cook something, or take a walk. I consciously distract myself from the tension point.

In my experience, this alone can often eliminate most of the stress involved in self-deception. It becomes fine. Annoying, glitchy, but no longer fraught with anxiety and self-doubt.

Hypothesize without checking

After a while I kind of get a "negative space" sense of what the self-deception is about. I continue not to look, out of something like respect. But I still have a hint.

Like if there's an email I keep freezing around. I can tell there's something there. I might even have some intuitive guesses about what it is!

…but I do not check. I don't introspect on whether my guesses feel right.

Instead, I hypothesize. What hostile telepath problem might someone in my shoes be trying to solve such that this behavior arises?

For instance, let's suppose the person is asking for me to run an event this weekend. I might hypothesize like this, intentionally referring to myself in third person:

Maybe Valentine doesn't actually want to do it, but he's scared that letting them know will make them think he's actually uninterested in them in general, which might have them closing opportunities he wants with them in the future.

Importantly, I am not introspectively checking. I'm not asking if I think the above really is what's going on with me. I'm just noticing that, viewing myself in third person, this model does seem to fit the evidence.

I'm also not trying to construct a plan to verify what's going on! Here Nature wants her secrets kept. I do not try to peek under her skirt.

Instead, I notice what Valentine (i.e., me in third person) in this hypothetical could maybe do instead of Newcomblike self-deception. What would be a viable alternative strategy for him?

Maybe Valentine could meditate on their possible disapproval, and come up with a plan for what happens next in which he's okay. (Building power.)

At this point I could just implement this possible solution. I don't have to check if it's relevant to my situation: there's not much cost in leaving myself a line of retreat this way.

If it turns out there's been Newcomblike self-deception going on, and if this hypothetical solution really did resolve the core problem that the self-deception was solving, then the self-deception should basically just lift.^[7]

And if I still have an ugh field around the email, then I haven't addressed the real problem yet. Which is fine. Not ideal, but I'm still going to back any self-deception that might be there while I don't have a better option!

I can repeat this process. Hypothesize without checking, implement solutions that would work in the hypothetical, and find out what happens.

…at least unless and until I start getting frozen about this process. That might mean I'm getting too close to understanding the strategy before it's safe to do so.

Then I back off.

Does this solve self-deception?

I don't know.

I didn't originally set out to make sense of self-deception. I was just trying to understand why people sometimes view themselves as flawed and in need of fixing.

It just turned out that that question was tied to a lot of others. Self-deception being one of them. A lot of them unified by considering the problem of hostile telepaths.

It seems worth noting that a bunch of the method I describe here — particularly the "hypothesize without checking" part — is derived. It amounted to a prediction that I tested and discovered worked as the model anticipated.

Likewise, occlumency being helpful. There might be other explanations for why getting better at privacy makes more thoughts thinkable. But I derived it from this one. And, again, it (anecdotally) seems to have worked as predicted.

These approaches work remarkably well on shame too, by the way. I might write a separate post on shame. Its logic is a bit different, but with a few adjustments I've found that shame dissolves extremely well in contact with these ideas.

With all that said, I don't think I'm in a position to say that I've solved self-deception. I don't know how I could know that. I'm not even convinced I've solved Newcomblike self-deception! My method seems plausibly general, but I don't have even the sketch of a necessity argument yet.

So, more work needed.

Summary

It seems to me that self-deception is solving a real problem. If we don't solve that real problem differently in a given instance, then in that instance we can't stop self-deceiving.

It seems to me that the real problem is (at least sometimes) hostile telepaths.

When I view hostile telepaths as the real problem I'm trying to solve, the perspective suggests what alternative solutions might look like, and it lets me check whether a given approach even can work as a solution.

And it seems to me that when I implement those alternative solutions, the result is sometimes that self-deception visibly falls away, non-mysteriously. It becomes obvious to me what was going on, and why.

I don't know if this model captures all cases of what we might want to call "self-deception". Maybe it does. But my impression is that it at least captures some cases that matter, and quite a lot of them.

^{^}
Note that having non-visual ways of thinking isn't enough to know you're not a simulation. What tells you you're not an Omega-V simulation is that you can reason in ways that (a) cannot be derived from your visual thinking and (b) change what you in fact do.
^{^}
Of course, this is something I became aware of after unraveling the structure in a few cases. It's not something that reveals itself while the structure works.
^{^}
By "psychopath" I mean someone with the cluster B personality disorder. I don't mean something derogatory. Nor am I (necessarily) referring to Gervais Principle psychopaths.
^{^}
To be clear, "hostile telepath" is a role, not an identity. Someone is a hostile telepath to you when they're scanning your mind and you don't trust they won't create problems for you based on what they find. Someone being a hostile telepath is less like them being a criminal and more like them being your lover or your foe. I say this because it's not a solution to identify "the hostile telepaths" in a community and reform or expel them; that approach is gibberish made of confused reification.
^{^}
If I were carefully describing this from the outside, I'd say that your false self can't know. "Self-deception" is really false self deception (as a strategy for deceiving hostile telepaths). The thing is, on the inside it doesn't feel like "your false self". That's the whole point! I'm describing this model in a way that's hopefully legible to the internal experience of actually running the strategy. Otherwise any instructions might make theoretical sense but won't be actionable. Sadly, this way of talking results in some ambiguities — precisely because the whole point of the strategy is to make something difficult to see clearly. Hopefully you can correct for this confusion as needed, sort of shifting to third-person and renaming things when the theory isn't clear.
^{^}
Why? Well, you need to "be okay" with it. But you're not. So what do you do with the fact that you're not okay with it? Loosely speaking, you've just turned your own conscious mind into an internal hostile telepath!

^{^}
In practice I find that not only does this work quite often, but now it sometimes works once I think of the alternative solution. I don't always need to implement it first. It feels to me like this result comes from having built internal trust that I really can and will respect my need for some strategy.

Glossary

occlumency

Omega-V

one-boxer

two-box

Omega-C

Newcomb's Problem

Jungian shadow work

Show Unapproved

^{^}

Note that having non-visual ways of thinking isn't enough to know you're not a simulation. What tells you you're not an Omega-V simulation is that you can reason in ways that (a) cannot be derived from your visual thinking and (b) change what you in fact do.

^{^}

Of course, this is something I became aware of after unraveling the structure in a few cases. It's not something that reveals itself while the structure works.

^{^}

By "psychopath" I mean someone with the cluster B personality disorder. I don't mean something derogatory. Nor am I (necessarily) referring to Gervais Principle psychopaths.

^{^}

To be clear, "hostile telepath" is a role, not an identity. Someone is a hostile telepath to you when they're scanning your mind and you don't trust they won't create problems for you based on what they find. Someone being a hostile telepath is less like them being a criminal and more like them being your lover or your foe. I say this because it's not a solution to identify "the hostile telepaths" in a community and reform or expel them; that approach is gibberish made of confused reification.

^{^}

If I were carefully describing this from the outside, I'd say that your false self can't know. "Self-deception" is really false self deception (as a strategy for deceiving hostile telepaths). The thing is, on the inside it doesn't feel like "your false self". That's the whole point! I'm describing this model in a way that's hopefully legible to the internal experience of actually running the strategy. Otherwise any instructions might make theoretical sense but won't be actionable. Sadly, this way of talking results in some ambiguities — precisely because the whole point of the strategy is to make something difficult to see clearly. Hopefully you can correct for this confusion as needed, sort of shifting to third-person and renaming things when the theory isn't clear.

^{^}

Why? Well, you need to "be okay" with it. But you're not. So what do you do with the fact that you're not okay with it? Loosely speaking, you've just turned your own conscious mind into an internal hostile telepath!

^{^}

In practice I find that not only does this work quite often, but now it sometimes works once I think of the alternative solution. I don't always need to implement it first. It feels to me like this result comes from having built internal trust that I really can and will respect my need for some strategy.

^{^}

"Curated", a term which here means "This just got emailed to 30,000 people, of whom typically half open the email, and it gets shown at the top of the frontpage to anyone who hasn't read it for ~1 week."

Self-DeceptionNewcomb's ProblemSubagentsConsciousnessRationality

Curated

386

The hostile telepaths problem

44Gordon Seidoh Worley

4Valentine

2[anonymous]

14Gordon Seidoh Worley

2transhumanist_atom_understander

2MikkW

2Ratios

2João Ribeiro Medeiros

New Comment

89 comments, sorted by

top scoring

Click to highlight new comments since: Today at 5:56 PM

Some comments are truncated due to high volume. (⌘F to expand all)Change truncation settings

[-]Gordon Seidoh Worley8mo4411

Some cultures used to, and maybe still do, have a solution to the hostile telepaths problem you didn't list: perform rituals even if you don't mean them.

If a child breaks their mom's glasses, the mom doesn't care if they are really sorry or not. All she cares about is if they perform the sorry-I-broke-your-glasses ritual, whatever that looks like. That's all that's required.

The idea is that the meaning comes later. We have some non-central instances of this in Western culture. For example, most US school children recite the Pledge of Allegiance every day (or at least they used to). I can remember not fully understanding what the words meant until I was in middle school, but I just went along with it. And wouldn't you know it, it worked! I do have an allegiance to the United States as a concept.

The world used to be more full of these rituals and strategies for appeasing hostile telepaths, who just chose not to use their telepathy because everyone agreed it didn't matter so long as the rituals were performed. But the spread of Christianity and Islam has brought a demand for internalized control of behaviors to much of the world, and with it we get problems like shame and guilt.

Now I'm not saying that performing rituals even if you don't mean them is a good solution. There are a lot of tradeoffs to consider, and guilt and shame offer some societal benefits that enable higher trust between strangers. But it is an alternative solution, and one that, as my Pledge of Allegiance example suggests, does sometimes work.

4Valentine8mo

Ah, yep! True that! Your point relates more directly to my main interest, memetics. I bet there are memes that encourage both (a) these rituals and (b) the telepathic attacks that make those rituals necessary.

2[anonymous]8mo

Can you explain how it caused that, and maybe what it feels like? (I find it alarming that being forced to recite a pledge as a child can actually have that effect -- I knew humans were culturally programmable, but not that {forcing someone to say "I endorse x!" when they don't know what it means nor want to say it} every day would actually cause them to endorse x later on. Actually, I notice I'm skeptical that that was the real cause in your case; what's your reason for believing it was the cause?) (No pressure to answer my questions of course - interpret them as statements of curiosity rather than requests in the human/social sense)

[-]Gordon Seidoh Worley8mo145

I'm sure my allegiance to these United States was not created just by reciting the Pledge thousands of times. In fact, I resented the Pledge for a lot of my life, especially once I learned more about its history.

But if I'm honest with myself, I do feel something like strong support for the ideals of the United States, much stronger than would make sense if someone had convinced me as an adult that its founding principals were a good idea. The United States isn't just my home. I yearn for it to be great, to embody its values, and to persist, even as I disagree with many of the details of how we're implementing the dream of the founders today.

Why do I think the Pledge mattered? It helped me get the feeling right. Once I had positive feelings about the US, of course I wanted to actually like the US. I latched onto the part of it that resonates with me: the founding principals. Someone else might be attracted to something else, or maybe would even find they don't like the United States, but stay loyal to it because they have to.

I'm also drawing on my experience with other fake-it-until-you-make-it rituals. For example, I and many people really have come to feel more grateful for the th... (read more)

[-]Kaj_Sotala8mo175

I bet something similar could work for getting kids to appologize.

Also, for getting them to say thank you. When kids are at a certain age, adults frequently seem to be reminding them to say thank you for gifts and such; I have a vague memory of adults also reminding me of this, when I was at that age. But these days I automatically say thank you for various things, and mean it.

[-]sitomin7248mo4111

Corollaries:

Honesty

If you want to become more honest and less self-deceiving, acquire power
If you want to make other people more honest and less self-deceiving, provide them with power (including power to protect themselves from you)
If you know someone who is more powerful than you but cant guarantee an upper bound on their power (and future power), then occlumency no longer works

Unboundedness

If you want an unlimited amount of power (such as a utility maximiser), there will almost always be coalitions of people more powerful than you against whom self-deception works
As long as there exist (hostile) coalitions of people unboundedly more powerful than you, completely removing self-deception from yourself is impossible

More than just yourself

If you want more examples of honesty and lack of self-deception available to you, ask powerful people to speak about their life experience. If you want these examples to be public, make them public
If you want two agents hostile to each other to both simultaneously be honest and not self-deceiving, provide them defensive rather than offensive power
If you want to achieve world peace, consider building defensive but not offensive power

... (read more)

2Chris Lakin8mo

What would you say that the main types of power are? My list (for humans): physical security, financial security, social security, emotional security (this one you can only give yourself though)

1fuli6mo

That’s a complicated question. At an individual level you have value alignment (people who agree with your values) and incentive alignment (people who disagree with your values but do what you want anyway because incentives). Value alignment is mostly persuasion and having enough attention of people. Incentive alignment is everything on Maslow hierarchy. You can reward or penalise others in terms of their physical safety, in terms of food and water, in terms of social approval of family and friends, in terms of providing them meaning in life, etc. (Which is basically the stuff you’re saying) There’s another lens to look at this which is, how do you get a lot of leverage over reality. Naval Ravikant quotes three forms of leverage - labour, capital and anything that replicate at zero cost on the internet. There’s more nuance to this but at a high level I agree - having a lot of people who will listen to you is power, having a lot of money is power and publishing information/code/media/games/etc that affect millions of lives is power.

[-]Ivan Vendrov8mo299

I like this a lot! A few scattered thoughts

This theory predicts and explains "therapy-resistant dissociation", or the common finding that none of the "woo" exercises like focusing, meditation, etc, actually work. (c.f. Scott's experience as described in https://www.astralcodexten.com/p/are-woo-non-responders-defective). If there's an active strategy of self-deception, you'd expect people to react negatively (or learn to not react via yet deeper levels of self-deception) to straightforward attempts to understand and untangle one's psychology.
It matches and extends Robert Trivers' theory of self-deception, wherein he predicts that when your mind is the site of a conflict between two sub-parts, the winning one will always be subconscious, because the conscious mind is visible to the subconscious but not vice versa, and being visible makes you weak. Thus, counterintuitively, the mind we are conscious of - in your phrase the false self - is always the losing part.
It connects to a common question I have for people doing meditation seriously - why exactly do you want to make the subconscious conscious? Why is it such a good thing to "become more conscious"? Now I can make the question mor

... (read more)

[-]Kaj_Sotala8mo336

Now I can make the question more precise - why do you think it's safe to have more access to your thoughts and feelings than your subconscious gave you? And how exactly do you plan to deal with all the hostile telepaths out there (possibly including parts of yourself?).

An answer I'd give is that for a lot of people, most of the hostile telepaths are ultimately not that dangerous if you're confident enough to be able to deal with them. As Valentine mentioned, often it's enough to notice that you are actually not anymore in the kind of a situation where the strategies would be necessary.

Unfortunately, many of the strategies also behave in such a way as to make themselves necessary, or to prevent the person from noticing that they could be abandoned:

Maybe I had a parent that wanted me to be dependent on them, so that they could control me. Even if I manage to break away from that parent, I may still have the belief that if someone wants to control me, then I have to genuinely believe that I cannot escape their control or they'll hurt me. This belief will tend to get me into abusive relationships... and then that strategy again becomes necessary for protecting me while in the relations

... (read more)

3PoignardAzur8mo

Interesting. I've had friends who had this "really needs to apologize when they think they might have upset me" thing, and something I noticed is that they when they don't over-apologize they feel the need to point it out too. I never thought too deeply about it, but reading you, I'm thinking maybe their internal experience was "I just felt really uncomfortable for a moment and I still overcame my discomfort, I'm proud of that, I should tell him about it".

4Kaj_Sotala8mo

Sounds plausible to me. Alternatively, telling you that they didn't over-apologize still communicates that they would have over-apologized in different circumstances, so it can be a covert way of still delivering that apology.

[-]Chris Lakin8mo273

This reminds me… maybe muscle tension is a frequent solution to this problem?

Some context: Lately I've been wondering, Why do we often experience feelings as things in the body? For example, why do I feel anxiety in my chest rather than just “knowing” I'm anxious?

For example, my previous chronic neck pain seemed to be related to information that manifested in my neck:

I suspect the feeling in my neck represented the information "I have the choice to leave the social situation I'm in right now" and/or "I am disliking/suppressing myself."

Why might this feeling have manifested in my neck?

What if feelings use the body as a screen to communicate information with others? If you have a certain feeling in your chest, maybe others can see that.

BUT: What if a feeling represents information that your system doesn't want other people to know? Hostile telepaths problem.

Im my case:

The feeling represented the awareness that I was insecure, and there were probably situations (probably social situations) in which it partially benefited me to be partially unaware of the fact that I was insecure.

Well, in that case, your system could create muscle tension to "jam the signal... (read more)

[-]Valentine8mo102

Oh huh. Yeah. It's not a solution by itself since there are lots of other cues hostile telepaths can use. But rigidity might dampen what they can read for sure!

This is testable. It predicts that improved skill with occlumency and/or gaining power should sometimes cause a release of chronic tension.

7Lulie8mo

That wouldn’t be a test of the theory that hostile telepaths use muscle cues, since those things could cause muscle release for other reasons (as per Popper: tests can only be disproving, and they require a rival theory to decide between). If gaining power never causes a release of tension, that still doesn’t disprove the theory, since again they could be tracking other things as well. A more direct question would be something like: Can hostile telepaths in fact read people who are physically rigid better than people who have low muscle tension? Do their reads get better or worse when tension is added? Does it change the type of information they can read (and perhaps give more information for some axes and less for others)? My impression is muscle tension gives a big sign on your back that you are hiding something, but makes it more muddy to non-trained people what exactly is being hidden. It reminds me of Mark Lippmann’s blog post on virtual machines, and how we often have layers of virtual machines. Or in plain language: if you close your eyes and imagine your environment, and imagine making an escape within that imaginary environment, real-you might not tighten your muscles in such a way that you’d be readable. I remember hearing that when we are seriously thinking about standing up, our heart rate and blood pressure rise in anticipation, but if we just hypothesise that we might stand up and keep it very abstract, the body doesn’t start those physical processes. But it’s very obvious when someone has gone into their head! So hostile telepaths often want some kind of emoting or ‘really listening’ or ‘paying attention’ or ‘be present with me’. So, yeah it conceals some information, but then it adds other information (such as meta information about concealment). Actors might be interesting to study, here.

3Camille Berger 8mo

I read these comments a few days ago. It prompted me to try applying something inspired by what was written in the post, but immediately on my muscle tension: I slightly Focus on it, then tell myself to "side with" the tension / feeling, while also telling myself that it's Ok to do so, not trying to "bust" it or put it into words, and using chipmonk's technique (cf his blog) to explore resistance around being seen displaying "the underlying emotion". I have the very clear impression that it weakens the tension quite fast (just timed it, it took about 30 seconds). I'm not having any insight on what the tension was about specifically. That's purely subjective experience report, might be heavily biased.

3Chris Lakin8mo

I think it's true that people who have more power (whether emotional security or social status etc) generally have less muscle tension yea. But that reminds me that I should check with my clients if they accidentally experience much less muscle tension

6Matt Goldenberg8mo

IME you can usually see in someone's face or body when they have a big release, just from the release of tension. But I think it's harder to distinguish this from other hypotheses I've heard like "negative emotions are stored in the tissues" or "muscular tension is a way of stabilizing intentions."

[-]Kaj_Sotala8mo193

So in many cases, "trauma processing" can basically mean noticing you're not a child anymore. You have power. So you don't have to appease the hostile telepaths just because they're adults.

Yes, definitely. And this is also why it's often so important for the therapist - if this is done in the context of therapy - to exhibit unconditional positive regard toward the client. If the therapist is genuinely accepting of any thoughts and feelings that the client brings up, then that opens the door for the client's parts to start considering the possibility that maybe they can tell the truth and still be accepted. And once it has become possible to tell the truth to at least one person, it becomes possible to tell it to yourself as well.

(Though maybe I should say that the therapist needs to either experience unconditional positive regard toward the client, or successfully deceive themselves and the client into thinking that they do. Heh.)

One additional tangle is that often the client's issue is less about needing to act in a certain way, and more about needing to be a certain way. At some point, one frequently goes from "it's bad to break something and not be genuinely sorry on that partic... (read more)

7romeostevensit8mo

It's worth noting that many therapists break therapeutic alliance for ideological or liability reasons and this is one of the reasons that self therapy, peer therapy, llms, and workbooks can sometimes be better.

4Valentine8mo

I mean, technically they don't even need to deceive themselves. They can be consciously judgy as f**k as long as they can mask it effectively. Psychopaths might make for amazing therapists in this one way!

3Kaj_Sotala8mo

True, though I think that judgment tends to be hard to effectively mask in this kind of context (though maybe psychopaths would be able to fake it; I don't know). At least my own experience inclines me to agree with this person:

[-]kave8mo192

From the related book Elephant in the Brain:

Here is the thesis we’ll be exploring in this book: We, human beings, are a species that’s not only capable of acting on hidden motives—we’re designed to do it. Our brains are built to act in our self-interest while at the same time trying hard not to appear selfish in front of other people. And in order to throw them off the trail, our brains often keep “us,” our conscious minds, in the dark. The less we know of our own ugly motives, the easier it is to hide them from others.

3romeostevensit8mo

I was reading this earlier and it dovetails very well with this post. Framing defending yourself against hostile people and processes as primarily selfish itself serves the hostile.

[-]Kaj_Sotala8mo182

Like if there's an email I keep freezing around. I can tell there's something there. I might even have some intuitive guesses about what it is!
…but I do not check. I don't introspect on whether my guesses feel right.
Instead, I hypothesize. What hostile telepath problem might someone in my shoes be trying to solve such that this behavior arises?

I tried doing this and it felt promising, and then I noticed a familiar feeling of wanting tell a person affected by my possible self-deception how I'd now solved the problem and would behave differently from now on. And I remembered that on each previous time when I'd had that feeling and told the other person something like that, my behavior had in fact not changed at all as a consequence.

And now I'm chuckling at myself.

1PoignardAzur8mo

Yeah, bad habits are a bitch.

[-]Vanessa Kosoy8mo176

I've been thinking along very similar lines for a while (my inside name for this is "mask theory of the mind": consciousness is a "mask"). But my personal conclusion is very different. While self-deception is a valid strategy in many circumstances, I think that it's too costly when trying to solve an extremely difficult high-stakes problem (e.g. stopping the AI apocalypse). Hence, I went in the other direction: trying to self-deceive little, and instead be self-honest about my^[1] real motivations, even if they are "bad PR". In practice, this means never making excuses to myself such as "I wanted to do A, but I didn't have the willpower so I did B instead", but rather owning the fact I wanted to do B and thinking how to integrate this into a coherent long-term plan for my life.

My solution to "hostile telepaths" is diving other people into ~3 categories:

People that are adversarial or untrustworthy, either individually or as representatives of the system on behalf of which they act. With such people, I have no compunction to consciously lie ("the Jews are not in the basement... I packed the suitcase myself...") or act adversarially.
People that seem cooperative, so that they deser

... (read more)

7romeostevensit8mo

Agree with the approach with the caveat that some people in group 2 are naive cooperators and therefore second order defectors since they are suckers for group 1. Eg the person who will tell the truth to the Nazis out of mistaken theories of ethics or just behavioral conditioning.

1Matt Vincent7mo

I think that kind of person is included in group 1:

6Valentine8mo

Yep. I'm not sure why you think this is a "very different" conclusion. I'd say the same thing about myself. The key question is how to handle the cases where becoming conscious of a "bad PR" motivation means it might get exposed. And you answer that! In part at least. You divide people into three categories based on (a) whether you need occlumency with them at all and (b) whether you need to use occlumency on the fact that you're using occlumency. I don't think of it in terms this explicit, but it's pretty close to what I do now. People get to see me to the extent that I trust them with what I show them. And that's conscious. Am I misunderstanding you somehow? I both agree and partly disagree. I tagged your comment with where. Totally, yes, having a real and meaningful shared problem means we want a truth-seeking community. Strong agreement. But I think how we "strive" to be truth-seeking might be extremely important. If it's a virtue instead of an engineering consideration, and if people are shamed or punished for having non-truth-seeking behaviors, then the collective "striving" being talked about will encourage individual self-deception and collective untalkaboutability. It's an example of inducing adaptive entropy. Relatedly: mathematicians don't have truth-seeking collaboration because they're trying hard to be truth-seeking. They're trying to solve problems, and they can verify whether their proposed solutions actually solve the problems they're working on. That means truth-seeking is more useful for what they're doing than any alternatives are. There's no need for focusing on the Virtue of Seeking Truth as a culture. Likewise, there's no Virtue of Using a Hammer in carpentry. What puts someone in category 2 or 3 for me isn't something I can strive for. It's more like, I can be open to the possibility and be willing to look for how they and I interact. Then I discover how my trust of them shifts. If I try to trust people more than I do, I end up in

1Keenan Pepper8mo

AKA integrating the ego-dystonic into the homunculus

[-]Ninety-Three8mo*123

By "psychopath" I mean someone with the cluster B personality disorder.

There isn't a cluster B personality disorder called psychopathy. Psychopathy has never been a formal disorder and the only time we've ever been close to it is way back in 1952 when the DSM-1 had a condition called "Sociopathic Personality Disturbance". The closest you'll get these days is Antisocial Personality Disorder, which is a garbage bin diagnosis that covers a fairly broad range of antisocial behaviours, including the thing most people have in mind when they say "psychopath", but also plenty of other personality archetypes that don't seem particularly psychopathic, like adrenaline junkies and people with impulse control issues.

5Seth Herd8mo

Okay; so what's the reality about the people we're thinking of when we say psychopathic? The term seems to still be in use among some professionals, for bad or good reasons. A garbage bin diagnosis seems like a step down if psychopathy or sociopathy was pointing to a more specific set of attitudes and tendencies.

7Ninety-Three8mo

I think Valentine gave a good description of psychopath as "people who are naturally unconstrained by social pressures and have no qualms breaking even profound taboos if they think it'll benefit them", where just eyeballing human nature, that seems to be a "real" category that would show up as a distinct blip in a graph of human behaviour and not just "how constrained by social pressures people are is a normally distributed property and people get called psychopaths in linear proportion to how far left they are on the bell curve".

5Valentine8mo

Cool. I knew there at least used to be "antisocial personality disorder", which I thought was under cluster B along with narcissism and borderline. And I thought "psychopathy" was a different term for APD. Thanks for the correction. The main thing I wanted to gesture at there is that I wasn't using "psychopath" as something derogatory. I didn't mean "bad guys". I meant something more like "people who are naturally unconstrained by social pressures and have no qualms breaking even profound taboos if they think it'll benefit them". (I just now made that up.) It seems to me that it's a pretty specifically different mental/emotional architecture.

5Ninety-Three8mo

Yep, your intended meaning about the distinctive mental architecture was pretty clear, just wanted to offer the factual correction.

[-]Tao Lin8mo111

I'm often surprised how little people notice, adapt to, or even punish self deception. It's not very hard to detect when someone's deceiving them self, people should notice more and disincentivise that

[-]Ratios8mo1210

This reads to me as, "We need to increase the oppression even more."

9Valentine8mo

A few notes: * Sometimes this is obviously true. I agree. * It's a curious question why many folk turn their attention away from someone else's self-deception when it's obvious. Often they don't, but sometimes they do. Why they (we) do that is an interesting question worthy of some sincere curiosity. * Confirmation bias. You don't notice the cases where you don't pick up on someone else's self-deception. Boy oh boy do I disagree. If someone's only option for dealing with a hostile telepath is self-deception, and then you come in and punish them for using it, thou art a dick. Like, do you think it helps the abused mothers I named if you punish them somehow for not acknowledging their partners' abuse? Does it even help the social circle around them? Even if the "hostile telepath" model is wrong or doesn't apply in some cases, people self-deceive for some reason. If you don't dialogue with that reason at all and just create pain and misery for people who use it, you're making some situation you don't understand worse. I agree that getting self-deception out of a culture is a great idea. I want less of it in general. But we don't get there by disincentivizing it.

2jimmy8mo

If that's their only option, and the hostility in your telepathy is antisocial, then yes. In some cases though, people do have other options and their self-deception is offensive, so hostile telepathy is pro-social. For example, it would probably help those mothers if the men knew to anticipate punishment for not acknowledging their abuse of their partners. I bet at least one of those abusive husbands/boyfriends will give his side of the story that's a bit more favorable than "I'm a bad guy, lol", and that it will start to fall apart when pressed. In those cases, he'll have to choose between admitting wrongdoing or playing dumb, and people often do their best to play really dumb. The self-deception there is a ploy to steal someone else's second box, so fuck that guy. I think the right response is to ignore the "self" part of the deception and treat it like any other deception. If it's okay to lie to the Nazis about hiding Jews, then it's okay to deceive yourself into believing it too. If we're going to make it against the law to lie under oath, then making it legal so long as they lie to themselves too is only going to increase the antisocial deception.

[-]Ben Pace8mo*103

Curated!^[1]

I think this is an excellent post on a tricky subject. I found here an articulate description of a great many internal experiences and thoughts I've had but have never well-named or seen written down clearly (e.g. 'occlumency' is a skill I have practiced a lot). I find this topic pretty hard to talk and think openly about, in large part due to the adversarial dynamics, so I am especially grateful for this post (and the ensuing discussion section). One of my favorite posts on LW this year, I think.

Personally, I frame the "Having power" solution as "Gaining independence". I think power is a bit goodhartable on in a corruptible way, and the true goal is to be able to think whichever thoughts you'd think if you had no influences on you, not the thoughts you'd think if you had immense power.

^{^}
"Curated", a term which here means "This just got emailed to 30,000 people, of whom typically half open the email, and it gets shown at the top of the frontpage to anyone who hasn't read it for ~1 week."

6Valentine8mo

Ah yeah, I think "gaining independence" is a better descriptor of (what I meant by) that solution type.

[-]Measure8mo82

it's not information about whether I'm secretly trying to two-box

It's still Bayesian evidence. Someone with a different policy (always deeply investigating themselves), could get Omega-C to have a higher credence of them one-boxing. We'd have to specify how sure Omega has to be to offer the large payment (and what priors Omega has) to know if the choice of policy matters.

2Valentine8mo

I think I disagree. I'll add some precision to point out how. Happy to hear if I'm missing something. E is Bayesian evidence of X if E is more likely to happen when X is true than when it's not. If Bob says "As a policy, I'm not going to check whether I'm running an Omega-C deception", that's equally likely whether Bob is running a deception or not. (Hence the "as a policy" part.) It just fully happens in both cases. So from Omega-C's point of view, it's not Bayesian evidence that distinguishes between the two versions of Bob. It would be evidence if the choice were made from a stance of "Oh shoot, that might be self-deception! Well, I'm now going to adopt the no-looking policy so that I don't have to check it!" Then yeah, sure, that's clearly evidence — which is precisely why that method of deciding not to look isn't what can work. The policy of always deeply investigating oneself can produce evidence for Omega-C, but the act of choosing that policy might not. Choosing the policy not to look just doesn't produce evidence. Or at least that's how it seems to me.

[-]Measure8mo113

The fact that Bob has this policy in the first place is more likely when he's being self-deceptive. Sure, some people will glomorize even when they have nothing to hide, but more often it will be the result of Bob noticing that he's the sort of person who might have something to hide.

It's a general rule that if E is strong evidence for X, then ~E is at least weak evidence for ~X.

7gwern8mo

A fun fictional example here is Bester's The Demolished Man: how do you plan & carry out an assassination when telepaths are routinely eavesdropping on your mind? The protagonist visits a company musician, requesting a musical earworm for a company song to help the workers' health or something; alas! the earworm gets stuck in his head, and so all any telepath hears is the earworm. And you can't blame a man for having an earworm stuck in his head, now can you? He has an entirely legitimate reason for that to be there, which 'explains away' the evidence of the deception hypothesis that telepathic-immunity would otherwise support.

5Valentine8mo

I don't know if that's true. It might be. But some possible counterpoints: * People can distrust systems that demand they check. "You have nothing to fear if you have nothing to hide" can get a response of "No" even from people who don't have anything to hide. * If someone subconsciously thinks they can pull off the illusion of honestly looking while in fact finding nothing, they become more likely to choose to look because they're self-deceiving. * Someone with a policy of not looking might be better at making their own self-deception unnecessary. Sure, that way of deciding doesn't work. Likewise, if you're inclined to decide you're going to dig into possible sources of self-deception because you think it's unlikely that you have any, then you can't do this trick. The hypothetical respect for any self-deception that might be there needs to be unconditional on its existence. Otherwise, for the reason you say, it doesn't work as well. (…with some caveats about how people are imperfect telepaths, so some fuzz in implementation here is in practice fine.) That said, I think you're right in that if Omega-C is looking only at the choice of whether to look or not, then yes, Omega-C would be right to take the choice as evidence of a deception. But the whole point is that Omega-C can read what conscious processes you're using, and can see that you're deciding for a glomerizing reason. That's why why you choose what you do matters so much here. Not just what you choose. Conservation of expected evidence is what makes looking relevant. It's not what makes deciding to look relevant. If I decide to appease Omega-C by looking, and then I find that I'm self-deceiving, the fact that I chose to look gets filtered. The fact that this is possible is why not finding evidence can matter at all. Otherwise it'd just be a charade. Relatedly: I have a coin in my pocket. I don't feel like checking it for bias. Does that make it more likely that the coin is biased? Maybe.

[-]LintzA8mo70

This jogged a lot of thinking about how it fits into various modalities. I think the lack of an actual solution to hostile mind-reading might be a flaw in several modalities I've tried which could be part of why I've struggled to have the progress I made with them stick. Many of these at least point toward alternative methods of dealing with self-deception which could be useful and I think authentic relating suggests at least one idea for an alternative method of occlumency which feels a little more virtuous (definitely felt some aversion to your solutions... (read more)

3Kaj_Sotala8mo

I wouldn't put it as strongly as to say that it's a crucial part of every IFS session. It can sometimes be a very useful question and approach, sure, but I've had/facilitated plenty of great sessions that didn't use that question at all. And there are people who that question just doesn't resonate with.

[-]romeostevensit8mo73

I can secondhand lend some affirmation to the newcomb case. A friend with DID from a childhood with a BPD mom later became a meditator and eventually rendered transparent the shell game that was being played with potentially dangerous preferences and goals to keep them out of consciousness, since the mom was extremely good at telepathy and was hostile for the standard BPD reason: other beings with other goals are inherently threatening to their extremely fragile sense of their own preferences and goals.

Another solution is illegible-ization/orthogonalizatio... (read more)

2Valentine8mo

Oh yeah, that's a cool example. You mean something like, look boring to them? Like, I don't care how good Putin is at reading people, I just don't have anything he wants, so I'm safe as long as I keep (apparently) not having anything he wants?

[-]romeostevensit8mo103

Yes, though this often involves some self deception about your true utility function. I suspect that some ace people did this to themselves to avoid zero sum competition they expect to painfully lose.

[-]Chris Lakin8mo75

I'm very glad you wrote this

[-]lemonhope8mo50

What gaslighting goes on in math class?

[-]Valentine8mo111

A few examples:

Framing kids as "disruptive" or "inattentive" or otherwise having the wrong nature if they feel disengaged. This is after informing them what they're going to study without consulting what's relevant or interesting to them, and then using social power to require them to study those things. But the problem is supposedly the student, not the system.
Claiming that they'll need these math tools later in life, and that this justifies adults pressuring the kids to learn those skills now. (This is more bullshit-flavored than gaslight-flavored, but I think they're psychological neighbors.)
Pretending that because a word problem touches on a topic kids care about, the math is relevant to what the kids like about that topic.
Insisting that forcing kids to take math classes is for their own good, and if the kids don't see why or don't agree, then they should believe the adults over their own sense of things.

It makes me so angry. It's perfectly antithetical to the essence of math as I see it.

2Matt Vincent7mo

This question might be independent from my other one, so I'm putting it in a separate comment. What's your primary solution to the problems that you list? Do you think that it can be mostly solved by teachers--e.g. by not exaggerating the applicability of the course material--or do you think that it requires a systemic solution--e.g. by sending the disruptive and inattentive kids to a class (potentially a quite unconventional one) that they're more interested in? I ask because I'm considering changing careers to become a high school math teacher, and I'd like to avoid using insidious psychological techniques on my students--doubly so if the techniques would cause my students to develop a long-term aversion to mathematics.

2Valentine7mo

You ask a good question. I have a lot of thoughts about it. Different answers at different levels. Like, what should a civilization do vs. what should a parent do vs. what should a teacher do? Different answers. The overall theme, though, is to remove coercion and appeal to native fascination instead. If you have something of value to the student to offer, then in practice there's a way to either (a) show the student that value or (b) earn the student's trust that you're tracking what they care about such that when you say "Trust me" they know there's something good there even if they can't see it for themselves just yet. If you're aiming to be a teacher… well, it's tricky because last I checked, the systems you're embedded in impose mandatory coercion. You have to cover certain topics, often in a certain order, within a certain window of time, etc. Especially since "No Child Left Behind" tied funding to test scores. And parents get mad and start rattling sabres if their kids come back from math class with a bunch of weird stuff the parents don't recognize. Although maybe that was just the Boomers. But that said! There are clever ways of working within these social constraints. If you can do that, the overall thrust for a teacher is to prioritize being curious about how the students are thinking rather than on getting them to understand certain concepts. The lion's share of work for a really good math teacher is in identifying zinger questions. You have to see how a student is thinking about a problem, and follow their contours of reasoning, and notice where it's going to run them into trouble. You could just tell them about the trouble, but it's far more effective to ask them to explain something or figure out something that will lead them right to the paradox spot. After a while you'll probably develop a really rich repertoire of such questions. And maybe more preciously, you'll be familiar with a vast library of thinking styles that students actually use in

2Matt Vincent7mo

Would you say the same of most other class subjects? I ask because, with the exceptions of reading and persuasive writing, I don't think that any conventional school subject is more applicable to the average person's life than grade-school math. Yes, people can get through life with an astonishing ignorance of mathematics, but they can get through life with an even more astonishing ignorance of social studies, literature, and the sciences. In my opinion, the purpose of public basic education is twofold: 1. Identify the children who are talented at a given subject so that we can rapidly and efficiently develop their skill to a point that it becomes useful to society. 2. Intellectually immunize the general population against low-effort fraudsters and other bad actors. Unfortunately, (2) requires most people to spend years learning about subjects that they don't care about. Do you have a different philosophy of education, a different ranking of subjects' importance, or something else?

2Valentine7mo

I was homeschooled and then studied math education, so I'm not sure. But my passing impression is (a) yes, it applies to most methods of teaching in schools regardless of subject; but (b) math taught this way is particularly damaging. I want to emphasize that this is my impression. I'm also not entirely sure why math seems to be more damaging. I have guesses. I just observe that e.g. literature hatred or music phobia aren't nearly as prevalent as math trauma is. Best as I can tell. Well, sure. But people will also pick up the math they need as they need it for the most part. That's true of most subjects really. I didn't learn to read in school. I went to kindergarten before being homeschooled, and they were teaching us the alphabet and some basic words, but I could already read books by then. I learned to read because I wanted to read. There's something very weird in our cultural groundwater around what teaching is. It's like we start with a prescription of subjects and then default to coercion to get students to "know" those subjects. Why? If it's relevant to their lives, we could learn to point out the connection in a way that feels alive to them. If we can't do that, then what makes us so sure that it's relevant for them? Yeah I do. I think the most imporant function of widespread education is to make good citizens. Which is to say, children put through an education system need to come out of it better able to engage with the system that runs their civilization, including the education process for the next generation. In the United States, I think that puts civics as the most important subject. It's really key that citizens understand how their government works, what the checks and balances are, how jury nullification works, what forms of corruption actually do arise even within the current system, etc. Otherwise they don't know how to participate in the government that's supposedly "by the people, for the people". This is vastly more important than l

2lemonhope8mo

Your examples fit the definition quite well. Apparently this is in the dictionary now. https://www.merriam-webster.com/dictionary/gaslighting

[-]jwray8mo40

My experience is very different. I feel unitary, without any IFS or jungian shadow or other sort of subconscious parts trying to deceive my conscious self. I violate quite a lot of social norms without feeling any shame or guilt about it, because I've got an 'internal scorecard'. So long as I'm true to my own values/morality, and I can protect myself with some combination of power / occlumency / disengaging, all three of which come easily to me, social norms don't matter in private.

5Valentine8mo

To me this is exciting. I deduced that the mental architecture you're describing should be possible. It's extremely cool to hear someone just name it as a lived experience. Like, what would a mind that's actually systematically free of Newcomblike self-deception have to be like, assuming the hostile telepaths problem is real? This is one possible solution. Assuming I haven't misunderstood what you're describing!

3Freyja6mo

FWIW I’m pretty confident this is me too; you can ask me about it any time you like—I would love to figure out/replicate what I think I have going here, to find out if it’s teachable/shareable (There’s -one- area of life where I’m less confident I have full access, so it isn’t fair to say I feel 100% this way—but 94-98%)

1[anonymous]6mo

they wrote: what are your values/morality and what happens if you're not true to them?

2Freyja5mo

My values/morality are too complicated and contextual for me to be able to describe/list them easily, but if I’m not true to them, I feel some sort of phenomenological consequence—an emotional reaction (grief, anger etc), or a distinct lack of clarity (cognitive fuzziness, a drained feeling, fatigue); there are probably other signs too but those are obvious ones.

2Giskard7mo

Non-sarcastically, it must be AMAZING to be you.

[-]VaRuna8mo42

I think this is a great outline of how these strategies form. A very similar idea is described in The Elephant in the Brain, but this is straightforwardly written and more visceral in a way I felt the book (and most other attempts to describe it) lacked. Kudos!

The drive to be "perfectly rational" and push all slivers of self-deception out with force is, I think, one of the core psychological errors made in rationalist circles (including the writing) for exactly the reasons you lined out. Well explained!

Honesty, and specifically self-honesty, is held as one... (read more)

[-]Hastings8mo4-2

Organizations and communities can also face hostile telepaths. My pet theory that sort of crystalized while reading this is that p-hacking is academia’s response to a hostile telepath that banned publication of negative results.

This of course sucks for non traditional researchers and especially journalists who don’t even subconsciously know that p=0.05002 r=1e-7 “breakthrough in finding relationship between milk consumption and toenail fungus” is code for “We have conclusively found no effect and want to broadcast to the community that there is no effect here; yet we cannot ever consciously acknowledging that we found nothing because our mortgages depend on fooling a hostile telepath into believing this is something”

[-]CuoreDiVetro8mo40

This is coherent with my experience. I'm pretty sure there are other problems solved by self-deception other than hostile telepaths. One other such problems solved by self-deception which I'm pretty sure I've seen in people is preserving motivation: if something is really important for me and I need to put in a lot of effort to make it happen and probability of success is very low (let's say epsilon), and if know that the probability of success is epsilon would totally annihilate my motivation to work towards it, then maybe hiding to myself that low probab... (read more)

2Matt Vincent7mo

What exactly is your hypothesis? Is it something like: P1) People are irrationally averse to actions that have a positive expected value and a low probability of success. P2) Self-deception enables people to ignore the low probability of success. C) Self-deception is adaptive. I tried to test this reasoning by referencing the research that Daniel Kahneman (co-coiner of the term "planning fallacy") has done about optimism. He has many criticisms of over-optimism among managers/executives, as well as more ordinary people (e.g. those who pursue self-employment). However, he also notes that, for a given optimistic individual, their optimism may have a variety of personal, social, and societal benefits, ranging from good mood and health to inspiring leadership and economic innovation. He goes so far as to say, "If you are allowed one wish for your child, seriously consider wishing him or her optimism.". (Thinking Fast and Slow, p. 255) Altogether, I'm think I'm missing a subtlety that would enable me to deduce the circumstances in which a bias towards optimism would be beneficial. Given that, I'm unable to test your hypothesis.

[-]Freyja6mo32

The ideas in this post remind me both of David Schnarch’s book Brain Talk (and in particular the concept of mind mapping which is central to the book) and also Leverage’s Self-Alignment System, which includes a step almost identical to your ‘hypothesise without checking’ step as a way to address situations where you get hijacked while trying to introspect.

Also I think cultures in which honesty/vulnerability is valorised and privacy/saving face is denigrated limit people’s options for responding to hostile telepaths more than cultures in which privacy... (read more)

[-]tcheasdfjkl6mo30

I like this post. But also the part of it I found most interesting was this footnote bit:

Loosely speaking, you've just turned your own conscious mind into an internal hostile telepath!

bc I think I do that kind of a lot, but also am somewhat sensitive to at least some kinds of things that feel like self-deception or thought-avoidance, and really dislike that feeling, so I do tend to probe at things that feel suspicious in that kind of way, which sometimes adds up to pretty unhelpful thought spirals where I'm chasing my thoughts and emotions around and getti... (read more)

[-]transhumanist_atom_understander7mo20

An example important in my life is planning: I "couldn't" make long-term plans or complete my to-do list as long as my "to-do list" was just a list of obligations rather than anything I really wanted done. More generally, I think plans "on paper" are especially easy case, since they don't take a telepath. For example, see the planning fallacy and Robin Hanson's comment that managers prefer the biased estimates. Getting to a corporate level, there's accounting. A cool related image is in episode two of Twin Peaks when Josie opens the safe and finds two ledg... (read more)

[-]MikkW8mo*20

This post does a good job of laying out compelling arguments for thoughts adjacent to areas I've previously already enjoyed thinking about.

For the record, this sentence popped into my head while reading this: "Wait, but what if I'm Omega-V, and [Valentine] is a two boxer?"

(Edit: the context for this thought is my previous thoughts having read other posts by Valentine, which I find both quite elucidating, but also somehow have left me feeling a bit creeped out; that being said, my opinion about this post itself is strongly positive)

[-]Ratios8mo25

It is worth noting that Ziz has already proposed the same idea in False Faces, although I think Valentine did a better job of systematizing and explaining the reasons for its existence.

Another interesting direction of thought is the connection to Gregory Bateson’s theory that double binds cause schizophrenia. Spitballing here: it could be that a double bind triggers an attempt to construct a "false face" (a self-deceptive module), similar to a normal situation involving a hostile telepath. However, because the double bind is contradictory, the internal mec... (read more)

[-]João Ribeiro Medeiros8mo20

Very powerful reasoning. I would add that a relevant form of self-deception that should be investigated in this framework is religious faith, given its place as as foundational to societies worldwide.

Religious faith seems like an optimal form of solution to hostile telepaths problem, in certain contexts it seems like a mixture of the three solutions you outlined. (Newcomblike self-deception, Having power and Occlumency)

Religious faith seems to provide psychological power through feelings of absolute certainty and over-confidence that religious people... (read more)

[-]Kabir Kumar8mo10

I thought this was going to be an allegory for interpretability.

[-]NickH8mo1-2

I like this except for the reference to "Newcomblike" problems, which, I feel, is misleading and obfuscates the whole point of Newcomb's paradox. Newcomb's paradox is about decision theory - If you allow cheating then it is no longer Newcomb's paradox. This article is about psychology (and possibly deceptive AI) - cheating is always a possible solution .

[-]lemonhope8mo12

Regarding this

Such as the moms in the abusive partners example above: each one could acknowledge her self-deception once it was safe for her abusive partner to know too. She got enough power (financial or social) to protect herself and her child, making the telepathic scan no longer a dire threat.

I would add that most abusive people don't really like crushing their loved ones and it is sometimes easy to get them to stop, eg by having a peer of the abuser get a private word with the two parties separately. I think it is common for there to be simple mis... (read more)

4Valentine8mo

In broad strokes I agree with you. Here I was sharing my observation of four cases where a friend was involved this way. One case might have been miscommunication but it doesn't seem likely to me. The other three definitely weren't. In one of those I personally knew the guy; I liked him, but he was also emotionally very unstable and definitely not a safe father. I don't think the abuse was physical in any of those four cases.

3lemonhope8mo

Aw man we used the same word for different things again

[-]Lorec8mo*10

I think this means that if you care both about (a) wholesomeness and (b) ending self-deception, it's helpful to give yourself full permission to lie as a temporary measure as needed. Creating space for yourself so you can (say) coherently build power such that it's safe for you to eventually be fully honest.

The first sentence here, I think, verbalizes something important.

The second [instrumental-power] is a bad justification, to the extent that we're talking about game-theoretic power [as opposed to power over reductionistic, non-mentalizing Nature]. LD... (read more)

3Valentine8mo

I think the word "power" might be creating some confusion here. I mean something pretty specific and very practical. I'm not sure how to precisely define it, but here are some examples: * If someone threatens to freak out at you if you disagree with them, and you tend to get overwhelmed and panic when the freak out at you, then they have a kind of power over you. Building power here probably looks like learning to experience them freaking out without you getting overwhelmed. * If someone pays for your rent and food but might stop if they get any hint that you're gay, it might not be safe to even ask yourself honestly whether you are. You build power here by getting an income, or a source of rent and food, that doesn't depend on the hostile telepathic benefactor. * If your lover gets turned on by you politically agreeing with them and turned off by disagreement, you might find your political views drifting toward theirs for "unrelated" reasons. One way to build power here is to get other access to sex. Another is to diminish your libido. Another is to break up with them. (Not saying any of these are a great idea. I'm just naming what the solution of "building power" might look like here.) I'm not familiar with LDT. I can't comment on that part. Sorry if that means what I just said misses your point.

1Lorec8mo

! I'm genuinely impressed if you wrote this post without having a mental frame for the concepts drawn from LDT. LDT says that, for the purposes of making quasi-Kantian [not really Kantian but that's the closest thing I can gesture at OTOH that isn't just "read the Yudkowsky"] correct decisions, you have to treat the hostile telepaths as copies of yourself. Indexical uncertainty, ie not knowing whether you're in Omega's simulation or the real world, means that, even if "I would never do that", if someone is "doing that" to me, in ways I can't ignore, I have to act as though I might ever be in a situation where I'm basically forced to "do that". I can still preferentially withhold reward from copies of myself that are executing quasi-threats, though. And in fact this is correct because it minimizes quasi-threats in the mutual copies-of-myself negotiating equilibrium. "Acquire the ability to coerce, rather than being coerced by, other agents in my environment", is not a solution to anything - because the quasi-Rawlsian [again, not really Rawlsian, but I don't have any better non-Yudkowsky reference points OTOH] perspective means that if you precommit to acquire power, you end up in expectation getting trodden on just as much as you trod on the other copies of you. So you're right back where you started. Basically, you have to control things orthogonal to your position in the lineup, to robustly improve your algorithm for negotiating with others. And I think "be willing to back deceptions" is in fact such a socially-orthogonal improvement.

2Valentine8mo

Thanks. :) And thanks for explaining. I'm not sure what "quasi-Kantian" or "quasi-Rawlsian" mean, and I'm not sure which piece of Eliezer's material you're gesturing toward, so I think I'm missing some key steps of reasoning. But on the whole, yeah, I mean defensive power rather than offensive. The offensive stuff is relevant only to the extent that it works for defense. At least that's how it seems to me! I haven't thought about it very carefully. But the whole point is, what could make me safe if a hostile telepath discovers a truth in me? The "build power" family of solutions is based on neutralizing the relevance of the "hostile" part. I think you're saying something more sophisticated than this. I'm not entirely sure what it is. Like here you say: I'm not sure what "the lineup" refers to, so I don't know what it means for something to be orthogonal to my position in it. I think I follow and agree with what you're saying if I just reason in terms of "setting up arms races is bad, all else being equal". Or to be more precise, if I take the dangers of adaptive entropy seriously and I view "create adaptive entropy to get ahead" as a confused pseudo-solution. It might be that that's my LDT-like framework.

1Lorec8mo

I once thought "slack mattered more than any outcome". But whose slack? It's wonderful for all humans to have more slack. But there's a huge game-theoretic difference between the species being wealthier, and thus wealthier per capita, and being wealthy/high-status/dominant/powerful relative to other people. The first is what I was getting at by "things orthogonal to the lineup"; the second is "the lineup". Trying to improve your position relative to copies of yourself in a way that is zero-sum is "the rat race", or "the Red Queen's race", where running will ~only ever keep you in the same place, and cause you and your mirror-selves to expend a lot of effort that is useless if you don't enjoy it. [I think I enjoy any amount of "the rat race", which is part of why I find myself doing any of it, even though I can easily imagine tweaking my mind such that I stop doing it and thus exit an LDT negotiation equilibrium where I need to do it all the time. But I only like it so much, and only certain kinds.]

[-]Kabir Kumar8mo0-1

I think this is really along the wrong path and misunderstanding a lot of things, but so far along the incorrect path of thought and misunderstanding so much, that it's hard to untangle

3Kabir Kumar8mo

To be a bit less useless - I think this fundamentally misses the problem of respect and actually being able to communicate with yourself and fully do things, if you've done so - and that you can do these when you have full faith and respect in yourself (meaning all of yourself - may include love as well, not sure how necessary that is for this). Could maybe be done in other ways as well, but I find those less beautiful, personally.

[-]normienorm7mo-40

Classic less wrong: this is all completely understood by normies. People build a narrative around their own existence, and justify and rationalize, and those self deceptions (obviously like basically all phycological traits) can sometimes be beneficial.

And then your insight is to apply a variant of normy mindfulness, e.g., cultivate awareness and self acceptance before trying to change.