Building up to an Internal Family Systems model

Kaj_Sotala

Introduction

Internal Family Systems (IFS) is a psychotherapy school/technique/model which lends itself particularly well to being used alone or with a peer. For years, I had noticed that many of the kinds of people who put a lot of work into developing their emotional and communication skills, some within the rationalist community and some outside it, kept mentioning IFS.

So I looked at the Wikipedia page about the IFS model, and bounced off, since it sounded like nonsense to me. Then someone brought it up again, and I thought that maybe I should reconsider. So I looked at the WP page again, thought “nah, still nonsense”, and continued to ignore it.

This continued until I participated in CFAR mentorship training last September, and we had a class on CFAR’s Internal Double Crux (IDC) technique. IDC clicked really well for me, so I started using it a lot and also facilitating it for some friends. However, once we started using it on more emotional issues (as opposed to just things with empirical facts pointing in different directions), we started running into some weird things, which it felt like IDC couldn’t quite handle… things which reminded me of how people had been describing IFS. So I finally read up on it, and have been successfully applying it ever since.

In this post, I’ll try to describe and motivate IFS in terms which are less likely to give people in this audience the same kind of a “no, that’s nonsense” reaction as I initially had.

Epistemic status

This post is intended to give an argument for why something like the IFS model could be true and a thing that works. It’s not really an argument that IFS is correct. My reason for thinking in terms of IFS is simply that I was initially super-skeptical of it (more on the reasons for my skepticism later), but then started encountering things which it turned out IFS predicted - and I only found out about IFS predicting those things after I familiarized myself with it.

Additionally, I now feel that IFS gives me significantly more gears for understanding the behavior of both other people and myself, and it has been significantly transformative in addressing my own emotional issues. Several other people who I know report it having been similarly powerful for them. On the other hand, aside from a few isolated papers with titles like “proof-of-concept” or “pilot study”, there seems to be conspicuously little peer-reviewed evidence in favor of IFS, meaning that we should probably exercise some caution.

I think that, even if not completely correct, IFS is currently the best model that I have for explaining the observations that it’s pointing at. I encourage you to read this post in the style of learning soft skills - trying on this perspective, and seeing if there’s anything in the description which feels like it resonates with your experiences.

But before we talk about IFS, let’s first talk about building robots. It turns out that if we put together some existing ideas from machine learning and neuroscience, we can end up with a robot design that pretty closely resembles IFS’s model of the human mind.

What follows is an intentionally simplified story, which is simpler than either the full IFS model or a full account that would incorporate everything that I know about human brains. Its intent is to demonstrate that an agent architecture with IFS-style subagents might easily emerge from basic machine learning principles, without claiming that all the details of that toy model would exactly match human brains. A discussion of what exactly IFS does claim in the context of human brains follows after the robot story.

Wanted: a robot which avoids catastrophes

Suppose that we’re building a robot that we want to be generally intelligent. The hot thing these days seems to be deep reinforcement learning, so we decide to use that. The robot will explore its environment, try out various things, and gradually develop habits and preferences as it accumulates experience. (Just like those human babies.)

Now, there are some problems we need to address. For one, deep reinforcement learning works fine in simulated environments where you’re safe to explore for an indefinite duration. However, it runs into problems if the robot is supposed to learn in a real life environment. Some actions which the robot might take will result in catastrophic consequences, such as it being damaged. If the robot is just doing things at random, it might end up damaging itself. Even worse, if the robot does something which could have been catastrophic but narrowly avoids harm, it might then forget about it and end up doing the same thing again!

How could we deal with this? Well, let’s look at the existing literature. Lipton et al. (2016) proposed what seems like a promising idea for addressing the part about forgetting. Their approach is to explicitly maintain a memory of danger states - situations which are not the catastrophic outcome itself, but from which the learner has previously ended up in a catastrophe. For instance, if “being burned by a hot stove” is a catastrophe, then “being about to poke your finger in the stove” is a danger state. Depending on how cautious we want to be and how many preceding states we want to include in our list of danger states, “going near the stove” and “seeing the stove” can also be danger states, though then we might end up with a seriously stove-phobic robot.

In any case, we maintain a separate storage of danger states, in such a way that the learner never forgets about them. We use this storage of danger states to train a fear model: a model which is trying to predict the probability of ending up in a catastrophe from some given novel situation. For example, maybe our robot poked its robot finger at the stove in our kitchen, but poking its robot finger at stoves in other kitchens might be dangerous too. So we want the fear model to generalize from our stove to other stoves. On the other hand, we don’t want it to be stove-phobic and run away at the mere sight of a stove. The task of our fear model is to predict exactly how likely it is for the robot to end up in a catastrophe, given some situation it is in, and then make it increasingly disinclined to end up in the kinds of situations which might lead to a catastrophe.

This sounds nice in theory. On the other hand, Lipton et al. are still assuming that they can train their learner in a simulated environment, and that they can label catastrophic states ahead of time. We don’t know in advance every possible catastrophe our robot might end up in - it might walk off a cliff, shoot itself in the foot with a laser gun, be beaten up by activists protesting technological unemployment, or any number of other possibilities.

So let’s take inspiration from humans. We can’t know beforehand every bad thing that might happen to our robot, but we can identify some classes of things which are correlated with catastrophe. For instance, being beaten or shooting itself in the foot will cause physical damage, so we can install sensors which indicate when the robot has taken physical damage. If these sensors - let’s call them “pain” sensors - register a high amount of damage, we consider the situation to have been catastrophic. When they do, we save that situation and the situations preceding it to our list of dangerous situations. Assuming that our robot has managed to make it out of that situation intact and can do anything in the first place, we use that list of dangerous situations to train up a fear model.
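The "save that situation and the situations preceding it" step can be sketched in a few lines. This is only a toy illustration, not the method from Lipton et al.: the class name `DangerMemory`, the `pain_threshold` cutoff, and the `lookback` window are all assumptions made up for the example.

```python
from collections import deque


class DangerMemory:
    """Toy store of danger states, filled whenever pain crosses a threshold."""

    def __init__(self, pain_threshold=0.8, lookback=2):
        self.pain_threshold = pain_threshold  # assumed cutoff for "catastrophic"
        # Rolling window: the current situation plus `lookback` predecessors.
        self.recent = deque(maxlen=lookback + 1)
        self.danger_states = []  # kept permanently, used later to train a fear model

    def observe(self, situation, pain):
        """Record a situation; return True if it was judged catastrophic."""
        self.recent.append(situation)
        if pain >= self.pain_threshold:
            # Save the catastrophic situation and the situations preceding it.
            for s in self.recent:
                if s not in self.danger_states:
                    self.danger_states.append(s)
            return True
        return False


memory = DangerMemory(pain_threshold=0.8, lookback=2)
episode = [
    ("enter kitchen", 0.0),
    ("see stove", 0.0),
    ("reach toward stove", 0.1),
    ("touch hot stove", 0.9),
]
for situation, pain in episode:
    memory.observe(situation, pain)

print(memory.danger_states)
```

How far back `lookback` reaches is exactly the caution trade-off mentioned earlier: a larger window makes the robot safer but also more stove-phobic.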

At this point, we notice that this is starting to remind us of our experience with humans: for example, the infamous Little Albert experiment. A human baby was allowed to play with a laboratory rat, but each time that he saw the rat, a researcher made a loud scary sound behind his back. Soon Albert started getting scared whenever he saw the rat - and then he got scared of furry things in general.

Something like Albert’s behavior could be implemented very simply using something like Hebbian conditioning to get a learning algorithm which picks up on some features of the situation, and then triggers a panic reaction whenever it re-encounters those same features. For instance, it registers that the sight of fur and loud sounds tend to coincide, and then it triggers a fear reaction whenever it sees fur. This would be a basic fear model, and a “danger state” would be “seeing fur”.
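As a rough sketch of what such a conditioning-based fear model might look like, here is a toy version. The update rule is Hebbian in spirit (an association grows only when a feature and the scary stimulus are active together), but the saturation term, the learning rate, and the panic threshold are my own simplifying assumptions, not anything taken from the literature.

```python
from collections import defaultdict


class HebbianFearModel:
    """Toy fear model: associates observed features with a scary stimulus."""

    def __init__(self, learning_rate=0.25, panic_threshold=0.5):
        self.learning_rate = learning_rate      # assumed value
        self.panic_threshold = panic_threshold  # assumed value
        self.weights = defaultdict(float)       # feature -> association strength

    def learn(self, features, scare):
        """Strengthen associations for features that co-occur with a scare in [0, 1]."""
        for f in features:
            # Grows only when feature and scare are both present; saturates at 1.0.
            self.weights[f] += self.learning_rate * scare * (1.0 - self.weights[f])

    def fear(self, features):
        """Fear level: the strongest learned association among the given features."""
        return max((self.weights.get(f, 0.0) for f in features), default=0.0)

    def panics(self, features):
        return self.fear(features) >= self.panic_threshold


model = HebbianFearModel()
for _ in range(5):
    # The rat appears together with the loud sound.
    model.learn({"fur", "white", "small animal"}, scare=1.0)
model.learn({"wooden blocks"}, scare=0.0)  # quiet play, no scare

print(model.panics({"fur", "brown", "large animal"}))  # a different furry thing
print(model.panics({"wooden blocks"}))
```

Because the model keys on individual features rather than whole situations, the fear learned from the rat carries over to anything else with fur, which is the kind of overgeneralization Albert showed.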

Wanting to keep things simple, we decide to use this kind of an approach as the fear model of our robot. Also, having read Consciousness and the Brain, we remember a few basic principles about how those human brains work, which we decide to copy because we’re lazy and don’t want to come up with entirely new principles:

There’s a special network of neurons in the brain, called the global neuronal workspace. The contents of this workspace are roughly the same as the contents of consciousness.
We can thus consider consciousness a workspace which many different brain systems have access to. It can hold a single “chunk” of information at a time.
The brain has multiple different systems doing different things. When a mental object becomes conscious (that is, is projected into the workspace by a subsystem), many systems will synchronize their processing around analyzing and manipulating that mental object.

So here is our design:

The robot has a hardwired system scanning for signs of catastrophe. This system has several subcomponents. One of them scans the “pain” sensors for signs of physical damage. Another system watches the “hunger” sensors for signs of low battery.
Any of these “distress” systems can, alone or in combination, feed a negative reward signal into the global workspace. This tells the rest of the system that this is a bad state, from which the robot should escape.
If a certain threshold level of “distress” is reached, the current situation is designated as catastrophic. All other priorities are suspended and the robot will prioritize getting out of the situation. A memory of the situation and the situations preceding it are saved to a dedicated storage.
After the experience, the memory of the catastrophic situation is replayed in consciousness for analysis. This replay is used to train up a separate fear model which effectively acts as a new “distress” system.
As the robot walks around its environment, sensory information about the surroundings will enter its consciousness workspace. When it plans future actions, simulated sensory information about how those actions would unfold enters the workspace. Whenever the new fear model detects features in either kind of sensory information which it associates with the catastrophic events, it will feed “fear”-type “distress” into the consciousness workspace.

So if the robot sees things which remind it of poking at a hot stove, it will be inclined to go somewhere else; if it imagines doing something which would cause it to poke at the hot stove, then it will be inclined to imagine doing something else.
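To make the "alone or in combination" part of the design concrete, here is a minimal sketch of how the hardwired distress systems might be combined. Everything specific here is an assumption for illustration: the sensor names `damage` and `battery`, the choice to simply add the distress levels together, and the threshold value.

```python
def assess(sensors, catastrophe_threshold=0.8):
    """Combine hardwired distress systems into a reward signal and a catastrophe flag.

    `sensors` maps hypothetical sensor names to readings in [0, 1].
    """
    distress = {
        "pain": sensors.get("damage", 0.0),            # physical damage
        "hunger": 1.0 - sensors.get("battery", 1.0),   # low battery
    }
    total = sum(distress.values())
    return {
        "distress": distress,
        "reward": -total,  # negative reward fed into the workspace
        "catastrophic": total >= catastrophe_threshold,
    }


mild = assess({"damage": 0.1, "battery": 0.9})
severe = assess({"damage": 0.7, "battery": 0.5})
print(mild["catastrophic"], severe["catastrophic"])
```

In the second call neither system reaches the threshold alone, but together they do, so the situation is designated catastrophic. A learned fear model would plug in as one more entry in `distress`.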

Introducing managers

But is this actually enough? We've now basically set up an algorithm which warns the robot when it sees things which have previously preceded a bad outcome. This might be enough for dealing with static tasks, such as not burning yourself at a stove. But it seems insufficient for dealing with things like predators or technological unemployment protesters, who might show up in a wide variety of places and actively try to hunt you down. By the time you see a sign of them, you're already in danger. It would be better if we could learn to avoid them entirely, so that the fear model would never even be triggered.

As we ponder this dilemma, we surf the web and run across this blog post summarizing Saunders, Sastry, Stuhlmüller & Evans (2017). They are also concerned with preventing reinforcement learning agents from running into catastrophes, but have a somewhat different approach. In their approach, a reinforcement learner is allowed to do different kinds of things, which a human overseer then allows or blocks. A separate “blocker” model is trained to predict which actions the human overseer would block. In the future, if the robot would ever take an action which the “blocker” predicts the human overseer would disallow, it will block that action. In effect, the system consists of two separate subagents, one subagent trying to maximize rewards and the other subagent trying to block non-approved actions.

Since our robot has a nice modular architecture into which we can add various subagents which are listening in and taking actions, we decide to take inspiration from this idea. We create a system for spawning dedicated subprograms which try to predict and block actions which would cause the fear model to be triggered. In theory, this is unnecessary: given enough time, even standard reinforcement learning should learn to avoid the situations which trigger the fear model. But again, trial-and-error can take a very long time to learn exactly which situations trigger fear, so we dedicate a separate subprogram to the task of pre-emptively figuring it out.

Each fear model is paired with a subagent that we’ll call a manager. While the fear model has associated a bunch of cues with the notion of an impending catastrophe, the manager learns to predict which situations would cause the fear model to trigger. Despite sounding similar, these are not the same thing: one indicates when you are already in danger, the other is trying to figure out what you can do to never end up in danger in the first place. A fear model might learn to recognize signs which technological unemployment protesters commonly wear. Whereas a manager might learn the kinds of environments where the fear model has noticed protesters before: for instance, near the protester HQ.

Then, if a manager predicts that a given action (such as going to the protester HQ) would eventually trigger the fear model, it will block that action and promote some other action. We can use the interaction of these subsystems to try to ensure that the robot only feels fear in situations which already resemble the catastrophic situation so much as to actually be dangerous. At the same time, the robot will be unafraid to take safe actions in situations from which it could end up in a danger zone, but which are themselves safe to be in.

As an added benefit, we can recycle the manager component to also do the same thing as the blocker component in the Saunders et al. paper originally did. That is, if the robot has a human overseer telling it in strict terms not to do some things, it can create a manager subprogram which models that overseer and likewise blocks the robot from doing things which the model predicts that the overseer would disapprove of.
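A manager of this kind could be sketched as a simple frequency-based predictor. This is a toy stand-in, not the blocker from Saunders et al.: the `Manager` class, its `block_threshold`, and the rule of treating unseen actions as safe are all assumptions I am making for the example.

```python
from collections import defaultdict


class Manager:
    """Toy manager: predicts which actions end up triggering the fear model."""

    def __init__(self, block_threshold=0.5):
        self.block_threshold = block_threshold  # assumed cutoff for blocking
        self.taken = defaultdict(int)      # action -> times taken
        self.triggered = defaultdict(int)  # action -> times fear fired afterwards

    def record(self, action, fear_triggered):
        self.taken[action] += 1
        if fear_triggered:
            self.triggered[action] += 1

    def predicted_trigger_rate(self, action):
        if self.taken[action] == 0:
            return 0.0  # no evidence yet: treated as safe (a simplifying assumption)
        return self.triggered[action] / self.taken[action]

    def choose(self, proposed, alternatives):
        """Let the proposed action through, or block it and promote an alternative."""
        if self.predicted_trigger_rate(proposed) < self.block_threshold:
            return proposed
        # `alternatives` must be non-empty; pick the one least likely to trigger fear.
        return min(alternatives, key=self.predicted_trigger_rate)


manager = Manager()
for fired in [True, True, False, True]:
    manager.record("go to protester HQ", fired)
for fired in [False, False, False]:
    manager.record("go to the park", fired)

print(manager.choose("go to protester HQ", ["go to the park", "stay home"]))
```

The same structure would work for modeling an overseer: replace "the fear model fired" with "the overseer disapproved" as the thing being predicted.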

Putting together a toy model

If the robot does end up in a situation where the fear model is sounding an alarm, then we want to get it out of the situation as quickly as possible. It may be worth spawning a specialized subroutine just for this purpose. Technological unemployment activists could, among other things, use flamethrowers that set the robot on fire. So let’s call these types of subprograms dedicated to escaping from the danger zone, firefighters.

So how does the system as a whole work? First, the different subagents act by sending into the consciousness workspace various mental objects, such as an emotion of fear, or an intent to e.g. make breakfast. If several subagents are submitting identical mental objects, we say that they are voting for the same object. On each time-step, one of the submitted objects is chosen at random to become the contents of the workspace, with each object having a chance to be selected that’s proportional to its number of votes. If a mental object describing a physical action (an “intention”) ends up in the workspace and stays chosen for several time-steps, then that action gets executed by a motor subsystem.

Depending on the situation, some subagents will have more votes than others. E.g. a fear model submitting a fear object gets a number of votes proportional to how strongly it is activated. Besides the specialized subagents we’ve discussed, there’s also a default planning subagent, which is just taking whatever actions (that is, sending to the workspace whatever mental objects) it thinks will produce the greatest reward. This subagent only has a small number of votes.
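The voting scheme is simple enough to simulate directly. Below is a minimal sketch under a few assumptions of my own: votes stay fixed during the run, an intention is executed once it has been the workspace content for `hold_steps` consecutive time-steps, and the function names and vote counts are invented for the example.

```python
import random


def select_object(votes, rng):
    """Pick one mental object with probability proportional to its votes."""
    objects = list(votes)
    weights = [votes[o] for o in objects]
    return rng.choices(objects, weights=weights, k=1)[0]


def run_workspace(votes, intentions, steps, hold_steps, rng):
    """Simulate the workspace and return the intentions that got executed."""
    executed = []
    current, streak = None, 0
    for _ in range(steps):
        chosen = select_object(votes, rng)
        # Count how long the same object has stayed in the workspace.
        streak = streak + 1 if chosen == current else 1
        current = chosen
        # Only intentions (not emotions) are passed on to the motor subsystem.
        if current in intentions and streak >= hold_steps:
            executed.append(current)
            streak = 0  # start counting afresh after the action is executed
    return executed


rng = random.Random(0)
votes = {"fear of stove": 6, "intend: back away": 3, "intend: make breakfast": 1}
intentions = {"intend: back away", "intend: make breakfast"}
executed = run_workspace(votes, intentions, steps=1000, hold_steps=2, rng=rng)

print(executed.count("intend: back away"), executed.count("intend: make breakfast"))
```

With these made-up numbers, the weakly-voted breakfast intention gets executed far less often than the intention to back away, even though nothing ever forbids it outright.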

Finally, there’s a self-narrative agent which is constructing a narrative of the robot’s actions as if it was a unified agent, for social purposes and for doing reasoning afterwards. After the motor system has taken an action, the self-narrative agent records this as something like “I, Robby the Robot, made breakfast by cooking eggs and bacon”, transmitting this statement to the workspace and saving it to an episodic memory store for future reference.

Consequences of the model

Is this design any good? Let’s consider a few of its implications.

First, in order for the robot to take physical actions, the intent to do so has to be in its consciousness for a long enough time for the action to be taken. If there are any subagents that wish to prevent this from happening, they must muster enough votes to bring into consciousness some other mental object replacing that intention before it’s been around for enough time-steps to be executed by the motor system. (This is analogous to the concept of the final veto in humans, where consciousness is the last place to block pre-consciously initiated actions before they are taken.)
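A bit of arithmetic shows how much leverage this gives to a blocking subagent. Assuming that votes stay fixed and that each time-step is an independent draw (both simplifications of mine), an intention holding a given share of the votes survives a particular run of `hold_steps` consecutive draws with probability equal to that share raised to the power `hold_steps`.

```python
def execution_probability(intention_votes, opposing_votes, hold_steps):
    """Chance that an intention wins `hold_steps` consecutive draws.

    Assumes fixed votes and independent draws on each time-step.
    """
    share = intention_votes / (intention_votes + opposing_votes)
    return share ** hold_steps


# An intention with 3 votes needs to stay in the workspace for 3 time-steps.
print(execution_probability(3, 1, hold_steps=3))  # one opposing vote
print(execution_probability(3, 3, hold_steps=3))  # three opposing votes
```

Going from one opposing vote to three drops the chance from roughly 42% to 12.5%, so a subagent does not need a majority to veto an action fairly reliably.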

Second, the different subagents do not see each other directly: they only see the consequences of each other’s actions, as that’s what’s reflected in the contents of the workspace. In particular, the self-narrative agent has no access to information about which subagents were responsible for generating which physical action. It only sees the intentions which preceded the various actions, and the actions themselves. Thus it might easily end up constructing a narrative which creates the internal appearance of a single agent, even though the system is actually composed of multiple subagents.

Third, even if the subagents can’t directly see each other, they might still end up forming alliances. For example, if the robot is standing near the stove, a curiosity-driven subagent might propose poking at the stove (“I want to see if this causes us to burn ourselves again!”), while the default planning system might propose cooking dinner, since that’s what it predicts will please the human owner. Now, a manager trying to prevent a fear model agent from being activated, will eventually learn that if it votes for the default planning system’s intentions to cook dinner (which it saw earlier), then the curiosity-driven agent is less likely to get its intentions into consciousness. Thus, no poking at the stove, and the manager’s and the default planning system’s goals end up aligned.

Fourth, this design can make it really difficult for the robot to even become aware of the existence of some managers. A manager may learn to support any other mental processes which block the robot from taking specific actions. It does this by voting in favor of mental objects which orient behavior towards anything else. This might manifest as something subtle, such as a mysterious lack of interest towards something that sounds like a good idea in principle, or just repeatedly forgetting to do something, as the robot always seems to get distracted by something else. The self-narrative agent, not having any idea of what’s going on, might just explain this as “Robby the Robot is forgetful sometimes” in its internal narrative.

Fifth, the default planning subagent here is doing something like rational planning, but given its weak voting power, it’s likely to be overruled if other subagents disagree with it (unless some subagents also agree with it). If some actions seem worth doing, but there are managers which are blocking them and the default planning subagent doesn’t have an explicit representation of those managers, this can manifest as all kinds of procrastinating behaviors and numerous failed attempts for the default planning system to “try to get itself to do something”, using various strategies. But as long as the managers keep blocking those actions, the system is likely to remain stuck.

Sixth, the purpose of both managers and firefighters is to keep the robot out of a situation that has been previously designated as dangerous. Managers do this by trying to pre-emptively block actions that would cause the fear model agent to activate; firefighters do this by trying to take actions which shut down the fear model agent after it has activated. But the fear model agent activating is not actually the same thing as being in a dangerous situation. Thus, both managers and firefighters may fall victim to Goodhart’s law, doing things which block the fear model while being irrelevant for escaping catastrophic situations.

For example, “thinking about the consequences of going to the activist HQ” is something that might activate the fear model agent, so a manager might try to block just thinking about it. This has the obvious consequence that the robot can’t think clearly about that issue. Similarly, once the fear model has already activated, a firefighter might Goodhart by supporting any action which helps activate an agent with a lot of voting power that’s going to think about something entirely different. This could result in compulsive behaviors which were effective at pushing the fear aside, but useless for achieving any of the robot’s actual aims.
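The gap between the proxy (the fear signal) and the actual goal (being out of danger) can be shown with a tiny example. The actions and all of the numbers below are hypothetical, chosen only to illustrate how optimizing the proxy and optimizing the goal can pick different actions.

```python
# Hypothetical effects of each action:
# (change in the fear signal, change in actual danger). Negative means a reduction.
ACTIONS = {
    "leave the danger zone": (-0.6, -0.7),
    "look at engrossing pictures": (-0.8, 0.0),
    "keep going": (0.0, 0.0),
}


def firefighter_choice(actions):
    """Pick the action that lowers the fear signal the most (the proxy)."""
    return min(actions, key=lambda a: actions[a][0])


def safest_choice(actions):
    """Pick the action that lowers actual danger the most (the real goal)."""
    return min(actions, key=lambda a: actions[a][1])


print(firefighter_choice(ACTIONS))
print(safest_choice(ACTIONS))
```

A firefighter that is only rewarded for shutting down the fear model prefers the distraction, since it quiets the signal more effectively than actually leaving does.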

At worst, this could cause loops of mutually activating subagents pushing in opposite directions. First, a stove-phobic robot runs away from the stove as it was about to make breakfast. Then a firefighter trying to suppress that fear, causes the robot to get stuck looking at pictures of beautiful naked robots, which is engrossing and thus great for removing the fear of the stove. Then another fear model starts to activate, this one afraid of failure and of spending so much time looking at pictures of beautiful naked robots that the robot won’t accomplish its goal of making breakfast. A separate firefighter associated with this second fear model has learned that focusing the robot’s attention on the pictures of beautiful naked robots even more is the most effective action for keeping this new fear temporarily subdued. So the two firefighters are allied and temporarily successful at their goal, but then the first one - seeing that the original stove fear has disappeared - turns off. Without the first firefighter’s votes supporting the second firefighter, the fear manages to overwhelm the second firefighter, causing the robot to rush into making breakfast. This again activates its fear of the stove, but if the fear of failure remains strong enough, it might overpower its fear of the stove so that the robot manages to make breakfast in time...

Hmm. Maybe this design isn’t so great after all. Good thing we noticed these failure modes, so that there aren’t any mind architectures like this going around being vulnerable to them!

The Internal Family Systems model

But enough hypothetical robot design; let’s get to the topic of IFS. The IFS model hypothesizes the existence of three kinds of “extreme parts” in the human mind:

Exiles are said to be parts of the mind which hold the memory of past traumatic events, which the person did not have the resources to handle. They are parts of the psyche which have been split off from the rest and are frozen in time of the traumatic event. When something causes them to surface, they tend to flood the mind with pain. For example, someone may have an exile associated with times when they were romantically rejected in the past.
Managers are parts that have been tasked with keeping the exiles permanently exiled from consciousness. They try to arrange a person’s life and psyche so that exiles never surface. For example, managers might keep someone from reaching out to potential dates due to a fear of rejection.
Firefighters react when exiles have been triggered, and try to either suppress the exile’s pain or distract the mind from it. For example, after someone has been rejected by a date, they might find themselves drinking in an attempt to numb the pain.
Some presentations of the IFS model simplify things by combining Managers and Firefighters into the broader category of Protectors, and so only talk about Exiles and Protectors.

Exiles are not limited to being created from the kinds of situations that we would commonly consider seriously traumatic. They can also be created from things like relatively minor childhood upsets, as long as the child didn’t feel like they could handle the situation.

IFS further claims that you can treat these parts as something like independent subpersonalities. You can communicate with them, consider their worries, and gradually persuade managers and firefighters to give you access to the exiles that have been kept away from consciousness. When you do this, you can show them that you are no longer in the situation which was catastrophic before, and now have the resources to handle it if something similar were to happen again. This heals the exile, and also lets the managers and firefighters assume better, healthier roles.

As I mentioned in the beginning, when I first heard about IFS, I was turned off by it for several different reasons. For instance, here were some of my thoughts at the time:

The whole model about some parts of the mind being in pain, and other parts trying to suppress their suffering. The thing about exiles was framed in terms of a part of the mind splitting off in order to protect the rest of the mind against damage. What? That doesn’t make any evolutionary sense! A traumatic situation is just sensory information for the brain, it’s not literal brain damage: it wouldn’t have made any sense for minds to evolve in a way that caused parts of it to split off, forcing other parts of the mind to try to keep them suppressed. Why not just… never be damaged in the first place?
That whole thing about parts being personalized characters that you could talk to. That… doesn’t describe anything in my experience.
Also, how does just talking to yourself fix any trauma or deeply ingrained behaviors?
IFS talks about everyone having a “True Self”. Quote from Wikipedia: “IFS also sees people as being whole, underneath this collection of parts. Everyone has a true self or spiritual center, known as the Self to distinguish it from the parts. Even people whose experience is dominated by parts have access to this Self and its healing qualities of curiosity, connectedness, compassion, and calmness. IFS sees the therapist's job as helping the client to disentangle themselves from their parts and access the Self, which can then connect with each part and heal it, so that the parts can let go of their destructive roles and enter into a harmonious collaboration, led by the Self.” That… again did not sound particularly derived from any sensible psychology.

Hopefully, I’ve already answered my past self’s concerns about the first point. The model itself talks in terms of managers protecting the mind from pain, exiles being exiled from consciousness in order for their pain to remain suppressed, etc. Which is a reasonable description of the subjective experience of what happens. But the evolutionary logic - as far as I can guess - is slightly different: to keep us out of dangerous situations.

The story of the robot describes the actual “design rationale”. Exiles are in fact subagents which are “frozen in the time of a traumatic event”, but they didn’t split off to protect the rest of the mind from damage. Rather, they were created as an isolated memory block to ensure that the memory of the event wouldn’t be forgotten. Managers then exist to keep the person away from such catastrophic situations, and firefighters exist to help escape them. Unfortunately, this setup is vulnerable to various failure modes, similar to those that the robot is vulnerable to.

With that said, let’s tackle the remaining problems that I had with IFS.

Personalized characters

IFS suggests that you can experience the exiles, managers and firefighters in your mind as something akin to subpersonalities - entities with their own names, visual appearances, preferences, beliefs, and so on. Furthermore, this isn’t inherently dysfunctional, nor indicative of something like Dissociative Identity Disorder. Rather, even people who are entirely healthy and normal may experience this kind of “multiplicity”.

Now, it’s important to note right off that not everyone has this to a major extent: you don’t need to experience multiplicity in order for the IFS process to work. For instance, my parts feel more like bodily sensations and shards of desire than subpersonalities, but IFS still works super-well for me.

In the book Internal Family Systems Therapy, Richard Schwartz, the developer of IFS, notes that if a person’s subagents play well together, then that person is likely to feel mostly internally unified. On the other hand, if a person has lots of internal conflict, then they are more likely to experience themselves as having multiple parts with conflicting desires.

I think that this makes a lot of sense, assuming the existence of something like a self-narrative subagent. If you remember, this is the part of the mind which looks at the actions that the mind-system has taken, and then constructs an explanation for why those actions were taken. (See e.g. the posts on the limits of introspection and on the Apologist and the Revolutionary for previous evidence for the existence of such a confabulating subagent with limited access to our true motivations.) As long as all the exiles, managers and firefighters are functioning in a unified fashion, the most parsimonious model that the self-narrative subagent might construct is simply that of a unified self. But if the system keeps being driven into strongly conflicting behaviors, then it can’t necessarily make sense of them from a single-agent perspective. Then it might naturally settle on something like a multiagent approach and experience itself as being split into parts.

Kevin Simler, in Neurons Gone Wild, notes how people with strong addictions seem particularly prone to developing multi-agent narratives:

This American Life did a nice segment on addiction a few years back, in which the producers — seemingly on a lark — asked people to personify their addictions. "It was like people had been waiting all their lives for somebody to ask them this question," said the producers, and they gushed forth with descriptions of the 'voice' of their inner addict:

"The voice is irresistible, always. I'm in the thrall of that voice."

"Totally out of control. It's got this life of its own, and I can't tame it anymore."

"I actually have a name for the voice. I call it Stan. Stan is the guy who tells me to have the extra glass of wine. Stan is the guy who tells me to smoke."

This doesn’t seem like it explains all of it, though. I’ve frequently been very dysfunctional, and have always found very intuitive the notion of the mind being split into parts. Yet I mostly still don’t seem to experience my subagents as anywhere near as person-like as some others clearly do. I know at least one person who ended up finding IFS because of having all of these talking characters in their head, and who was looking for something that would help them make sense of it. Nothing like that has ever been the case for me: I did experience strongly conflicting desires, but they were just that, strongly conflicting desires.

I can only surmise that it has something to do with the same kinds of differences which cause some people to think mainly verbally, others mainly visually, and others yet in some other hard-to-describe modality. Some fiction writers spontaneously experience their characters as real people who speak to them and will even bother the writer when at the supermarket, and some others don’t.

It’s been noted that the mechanisms which we use to model ourselves and other people overlap - not very surprisingly, since both we and other people are (presumably) humans. So it seems reasonable that some of the mechanisms for representing other people would sometimes also end up spontaneously recruited for representing internal subagents or coalitions of them.

Why should this technique be useful for psychological healing?

Okay, suppose it’s possible to access our subagents somehow. Why would just talking with these entities in your own head, help you fix psychological issues?

Let’s consider that a person having exiles, managers and firefighters is costly in the sense of constraining that person’s options. If you never want to do anything that would cause you to see a stove, that limits quite a bit of what you can do. I strongly suspect that many forms of procrastination and failure to do things we’d like to do are mostly a manifestation of overactive managers. So it’s important not to create those kinds of entities unless the situation really is one which should be designated as categorically unacceptable to end up in.

The theory for IFS mentions that not all painful situations turn into trauma: just ones in which we felt helpless and like we didn’t have the necessary resources for dealing with it. This makes sense, since if we were capable of dealing with it, then the situation can’t have been that catastrophic. The aftermath of the immediate event is important as well: a child who ends up in a painful situation doesn’t necessarily end up traumatized, if they have an adult who can put the event in a reassuring context afterwards.

But situations which used to be catastrophic and impossible for us to handle before, aren’t necessarily that any more. It seems important to have a mechanism for updating that cache of catastrophic events and for disassembling the protections around it, if the protections turn out to be unnecessary.

How does that process usually happen, without IFS or any other specialized form of therapy?

Often, by talking about your experiences with someone you trust. Or writing about them in private or in a blog.

In my post about Consciousness and the Brain, I mentioned that once a mental object becomes conscious, many different brain systems synchronize their processing around it. I suspect that the reason why many people have such a powerful urge to discuss their traumatic experiences with someone else, is that doing so is a way of bringing those memories into consciousness in detail. And once you’ve dug up your traumatic memories from their cache, their content can be re-processed and re-evaluated. If your brain judges that you now do have the resources to handle that event if you ever end up in it again, or if it’s something that simply can’t happen anymore, then the memory can be removed from the cache and you no longer need to avoid it.
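To make the re-evaluation step a bit more concrete, here is a toy sketch in Python. It is only an illustration of the idea above, not a claim about how the brain implements it: the `CachedEvent` class, the scalar `resources_needed` score and the `reevaluate` function are all hypothetical names invented for this example.

```python
from dataclasses import dataclass


@dataclass
class CachedEvent:
    """A remembered situation that was once flagged as catastrophic."""
    description: str
    resources_needed: int         # hypothetical score: coping capacity the event demands
    still_possible: bool = True   # can this situation even happen anymore?


def reevaluate(cache, current_resources):
    """Return the events that stay flagged after being consciously re-processed.

    An event is dropped from the cache if it can no longer happen, or if the
    person's current resources are enough to handle it.
    """
    return [
        event for event in cache
        if event.still_possible and event.resources_needed > current_resources
    ]


cache = [
    CachedEvent("touching a hot stove", resources_needed=2),
    CachedEvent("being called lazy in class", resources_needed=5, still_possible=False),
    CachedEvent("public humiliation", resources_needed=9),
]

# As an adult, the person has more resources than when the events were cached.
remaining = reevaluate(cache, current_resources=6)
print([event.description for event in remaining])
```

In this toy version, only the event that still exceeds the person's current resources stays in the cache. The important assumption is that nothing gets re-evaluated unless it is first brought up for inspection, which is the role that talking or writing plays in the text above.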

I think it’s also significant that, while something like just writing about a traumatic event is sometimes enough to heal, often it’s more effective if you have a sympathetic listener who you trust. Traumas often involve some amount of shame: maybe you were called lazy as a kid and are still afraid of others thinking that you are lazy. Here, having friends who accept you and are willing to nonjudgmentally listen while you talk about your issues, is by itself an indication that the thing that you used to be afraid of isn’t a danger anymore: there exist people who will stay by your side despite knowing your secret.

Now, when you are talking to a friend about your traumatic memory, you will be going through cached memories that have been stored in an exile subagent. A specific memory circuit - one of several circuits specialized for the act of holding painful memories - is active and outputting its contents into the global workspace, from which they are being turned into words.

Meaning that, in a sense, your friend is talking directly to your exile.

Could you hack this process, so that you wouldn’t even need a friend, and could carry this process out entirely internally?

In my earlier post, I remarked that you could view language as a way of joining two people’s brains together. A subagent in your brain outputs something that appears in your consciousness, you communicate it to a friend, it appears in their consciousness, subagents in your friend’s brain manipulate the information somehow, and then they send it back to your consciousness.

If you are telling your friend about your trauma, you are in a sense joining your workspaces together, and letting some subagents in your workspace, communicate with the “sympathetic listener” subagents in your friend’s workspace.

So why not let a “sympathetic listener” subagent in your workspace, hook up directly with the traumatized subagents that are also in your own workspace?

I think that something like this happens when you do IFS. You are using a technique designed to activate the relevant subagents in a very specific way, which allows for this kind of a “hooking up” without needing another person.

For instance, suppose that you are talking to a manager subagent which wants to hide the fact that you’re bad at something, and starts reacting defensively whenever the topic is brought up. Now, one way by which its activation could manifest, is feeding those defensive thoughts and reactions directly into your workspace. In such a case, you would experience them as your own thoughts, and possibly as objectively real. IFS calls this “blending”; I’ve also previously used the term “cognitive fusion” for what’s essentially the same thing.

Instead of remaining blended, you then use various unblending / cognitive defusion techniques that highlight the way by which these thoughts and emotions are coming from a specific part of your mind. You could think of this as wrapping extra content around the thoughts and emotions, and then seeing them through the wrapper (which is obviously not-you), rather than experiencing the thoughts and emotions directly (which you might experience as your own). For example, the IFS book Self-Therapy suggests this unblending technique (among others):

Allow a visual image of the part [subagent] to arise. This will give you the sense of it as a separate entity. This approach is even more effective if the part is clearly a certain distance away from you. The further away it is, the more separation this creates.

Another way to accomplish visual separation is to draw or paint an image of the part. Or you can choose an object from your home that represents the part for you or find an image of it in a magazine or on the Internet. Having a concrete token of the part helps to create separation.

I think of this as something like, you are taking the subagent in question, routing its responses through a visualization subsystem, and then you see a talking fox or whatever. And this is then a representation that your internal subsystems for talking with other people can respond to. You can then have a dialogue with the part (verbally or otherwise) in a way where its responses are clearly labeled as coming from it, rather than being mixed together with all the other thoughts in the workspace. This lets the content coming from the sympathetic-listener subagent and the exile/manager/firefighter subagent be kept clearly apart, allowing you to consider the emotional content as you would as an external listener, preventing you from drowning in it. You’re hacking your brain so as to work as the therapist and client at the same time.
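The "wrapper" idea can be sketched as a few lines of Python. This is purely a metaphor in code: the `Thought` class and the `experience` and `unblend` functions are hypothetical names made up for this illustration, and the only point is the difference between unlabeled content and content that carries a label saying where it came from.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Thought:
    content: str
    source: Optional[str] = None  # None means blended: no label, so it feels like "me"


def experience(thought):
    """How the thought shows up in the workspace."""
    if thought.source is None:
        # Blended: the content is experienced directly, as one's own view.
        return thought.content
    # Unblended: the same content, seen through a wrapper naming its source.
    return f"[{thought.source} says] {thought.content}"


def unblend(thought, part_name):
    """Wrap the thought with a label, leaving its content unchanged."""
    return Thought(thought.content, source=part_name)


blended = Thought("Everyone will see that I'm bad at this.")
print(experience(blended))
print(experience(unblend(blended, "manager")))
```

Note that unblending in this sketch changes nothing about the content itself. It only adds the label, which is what lets a separate "listener" respond to the content instead of being swept along by it.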

The Self

IFS claims that, below all the various parts and subagents, there exists a “true self” which you can learn to access. When you are in this Self, you exhibit the qualities of “calmness, curiosity, clarity, compassion, confidence, creativity, courage, and connectedness”. Being at least partially in Self is said to be a prerequisite for working with your parts: if you are not, then you are not able to evaluate their models objectively. The parts will sense this, and as a result, they will not share their models properly, preventing the kind of global re-evaluation of their contents that would update them.

This was the part that I was initially the most skeptical of, and which made me most frequently decide that IFS was not worth looking at. I could easily conceptualize the mind as being made up of various subagents. But then it would just be numerous subagents all the way down, without any single one that could be designated the “true” self.

But let’s look at IFS’s description of how exactly to get into Self. You check whether you seem to be blended with any part. If you are, you unblend from it. Then you check whether you might also be blended with some other part. If you are, you unblend from it also. You then keep doing this until you can find no part that you might be blended with. All that’s left are those “eight Cs”, which just seem to be a kind of a global state, with no particular part that they would be coming from.

I now think that “being in Self” represents a state where no particular subagent is getting a disproportionate share of voting power, and everything is processed by the system as a whole. Remember that in the robot story, catastrophic states were situations in which the organism should never end up. A subagent kicking in to prevent that from happening is a kind of a priority override to normal thinking. It blocks you from being open and calm and curious because some subagent thinks that doing so would be dangerous. If you then turn off or suspend all those priority overrides, then the mind’s default state absent any override seems to be one with the qualities of the Self.
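Here is a minimal Python sketch of that "voting power" picture. It is a toy model under assumptions of my own: the `Subagent` class, the `override_weight` of 10 and the function names are hypothetical, chosen only to show how a single active override can dominate, and how the default state reappears once all overrides are off.

```python
from dataclasses import dataclass


@dataclass
class Subagent:
    name: str
    override_active: bool = False  # True while the part is blended / has taken over


def voting_shares(subagents, override_weight=10.0):
    """Share of influence per subagent; an active override counts for much more."""
    weights = {
        agent.name: (override_weight if agent.override_active else 1.0)
        for agent in subagents
    }
    total = sum(weights.values())
    return {name: weight / total for name, weight in weights.items()}


def in_self(subagents):
    """'Self' here just means that no priority override is currently active."""
    return not any(agent.override_active for agent in subagents)


def unblend_all(subagents):
    """Check for a blended part, unblend from it, and repeat until none is left."""
    while not in_self(subagents):
        blended = next(agent for agent in subagents if agent.override_active)
        blended.override_active = False


agents = [
    Subagent("planner"),
    Subagent("stove-avoider", override_active=True),
    Subagent("explorer"),
]
print(in_self(agents), voting_shares(agents))
unblend_all(agents)
print(in_self(agents), voting_shares(agents))
```

With one override active, the overriding subagent holds most of the influence. Once every override is suspended, the shares are equal and there is no single part that the resulting state could be attributed to, which is how I read the "eight Cs" having no particular source.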

This actually fits at least one model of the function of positive emotions pretty well. Fredrickson (1998) suggests that an important function of positive emotions is to make us engage in activities such as play, exploration, and savoring the company of other people. Doing these things has the effect of building up skills, knowledge, social connections, and other kinds of resources which might be useful for us in the future. If there are no active ongoing threats, then that implies that the situation is pretty safe for the time being, making it reasonable to revert to a positive state of being open to exploration.

The Internal Family Systems Therapy book makes a somewhat big deal out of the fact that everyone, even the most traumatized people, ultimately has a Self which they can access. It explains this in terms of the mind being organized to protect against damage, and with parts always splitting off from the Self when it would otherwise be damaged. I think the real explanation is much simpler: the mind is not accumulating damage, it is just accumulating a longer and longer list of situations not considered safe.

As an aside, this model feels like it makes me less confused about confidence. It seems like people are really attracted to confident people, and that to some extent it’s also possible to fake confidence until it becomes genuine. But if confidence is so attractive and we can fake it, why hasn’t evolution just made everyone confident by default?

Turns out that it has. The reason why faked confidence gradually turns into genuine confidence is that by forcing yourself to act in confident ways which felt dangerous before, your mind gets information indicating that this behavior is not as dangerous as you originally thought. That gradually turns off those priority overrides that kept you out of Self originally, until you get there naturally.

The reason why being in Self is a requirement for doing IFS, is the existence of conflicts between parts. For instance, recall the stove-phobic robot having a firefighter subagent that caused it to retreat from the stove into watching pictures of beautiful naked robots. This triggered a subagent which was afraid of the naked-robot-watching preventing the robot from achieving its goals. If the robot now tried to do IFS and talk with the firefighter subagent that caused it to run away from stoves, this might bring to mind content which activated the exile that was afraid of not achieving things. Then that exile would keep flooding the mind with negative memories, trying to achieve its priority override of “we need to get out of this situation”, and preventing the process from proceeding. Thus, all of the subagents that have strong opinions about the situation need to be unblended from, before integration can proceed.

IFS also has a separate concept of “Self-Leadership”. This is a process where various subagents eventually come to trust the Self, so that they allow the person to increasingly remain in Self even in various emergencies. IFS views this as a positive development, not only because it feels nice, but because doing so means that the person will have more cognitive resources available for actually dealing with the emergency in question.

I think that this ties back to the original notion of subagents being generated to invoke priority overrides for situations which the person originally didn’t have the resources to handle. Many of the subagents IFS talks about seem to emerge from childhood experiences. A child has many fewer cognitive, social, and emotional resources for dealing with bad situations, in which case it makes sense to just categorically avoid them, and invoke special overrides to ensure that this happens. A child’s cognitive capacities, models of the world, and abilities to self-regulate are also less developed, so she may have a harder time staying out of dangerous situations without having some priority overrides built in. An adult, however, typically has many more resources than a child does. Even when faced with an emergency situation, it can be much better to be able to remain calm and analyze the situation using all of one’s subagents, rather than having a few of them take over all the decision-making. Thus, it seems to me - both theoretically and practically - that developing Self-Leadership is really valuable.

That said, I do not wish to imply that it would be a good goal to never have negative emotions. Sometimes blending with a subagent, and experiencing resulting negative emotions, is the right thing to do in that situation. Rather than suppressing negative emotions entirely, Self-Leadership aims to get to a state where any emotional reaction tends to be endorsed by the mind-system as a whole. Thus, if feeling angry or sad or bitter or whatever feels appropriate to the situation, you can let yourself feel so, and then give yourself to that emotion without resisting it. As a result, negative emotions become less unpleasant to experience, since there are fewer subagents trying to fight against them. Also, if it turns out that being in a negative emotional state is no longer useful, the system as a whole can just choose to move back into Self.

Final words

I’ve now given a brief summary of the IFS model, and explained why I think it makes sense. This is of course not enough to establish the model as true. But it might help in making the model plausible enough to at least try out.

I think that most people could benefit from learning and doing IFS on themselves, either alone or together with a friend. I’ve been saying that exiles/managers/firefighters tend to be generated from trauma, but it’s important to realize that these events don’t need to be anything immensely traumatic. The kinds of ordinary, normal childhood upsets that everyone has had can generate these kinds of subagents. Remember, just because you think of a childhood event as trivial now, doesn’t mean that it felt trivial to you as a child. Doing IFS work, I’ve found exiles related to memories and events which I thought left no negative traces, but actually did.

Remember also that it can be really hard to notice the presence of some managers: if they are doing their job effectively, then you might never become aware of them directly. “I don’t have any trauma so I wouldn’t benefit from doing IFS” isn’t necessarily correct. Rather, the cues that I use for detecting a need to do internal work are:

Do I have the qualities associated with Self, or is something blocking them?
Do I feel like I’m capable of dealing with this situation rationally, and doing the things which feel like good ideas on an intellectual level?
Do my emotional reactions feel like they are endorsed by my mind-system as a whole, or is there a resistance to them?

If not, there is often some internal conflict which needs to be addressed - and IFS, combined with some other practices such as Focusing and meditation, has been very useful in learning to solve those internal conflicts.

Even if you don’t feel convinced that doing IFS personally would be a good idea, I think adopting its framework of exiles, managers and firefighters is useful for better understanding the behavior of other people. Their dynamics will be easier to recognize in other people if you’ve had some experience recognizing them in yourself, however.

If you want to learn more about IFS, I would recommend starting with Self-Therapy by Jay Earley. In terms of What/How/Why books, my current suggestions would be:

How: Self-Therapy by Jay Earley.
What: Internal Family Systems Therapy, by Richard Schwartz
Why: The Power of Focusing, by Ann Weiser Cornell (technically not about IFS, but AWC’s variant of Focusing gets very close to IFS, and is excellent for conveying the right mindset for it)

This post was written as part of research supported by the Foundational Research Institute. Thank you to everyone who provided feedback on earlier drafts of this article: Eli Tyre, Elizabeth Van Nostrand, Jan Kulveit, Juha Törmänen, Lumi Pakkanen, Maija Haavisto, Marcello Herreshoff, Qiaochu Yuan, and Steve Omohundro.

Thanks for writing this! I am very excited that this post exists. I think what this model suggests about procrastination and addiction alone (namely, that they're things that managers and firefighters are doing to protect exiles) are already huge, and resonate strongly with my experience.

In the beginning of 2018 I experienced a dramatic shift that I still don't quite understand; my sense of it at the time was that there was this crippling fear / shame that had been preventing me from doing almost anything, that suddenly lifted (for several reasons, it's a long story). That had many dramatic effects, and one of the most noticeable ones was that I almost completely stopped wanting to watch TV, read manga, play video games, or any of my other addiction / procrastination behaviors. It became very clear that the purpose of all of those behaviors was numbing and distraction ("general purpose feeling obliterators" used by firefighters, as waveman says in another comment) from how shitty I felt all the time, and after the shift I basically felt so good that I didn't want or need to do that anymore.

(This lasted for awhile but not forever; I crashed hard in Se...

things that had been too scary for me to think about became thinkable (e.g. regrettable dynamics in my romantic relationships), and I think this is a crucial observation for the rationality project. When you have exile-manager-firefighter dynamics going on and you don't know how to unblend from them, you cannot think clearly about anything that triggers the exile, and trying to make yourself do it anyway will generate tremendous internal resistance in one form or another (getting angry, getting bored, getting sleepy, getting confused, all sorts of crap), first from managers trying to block the thoughts and then from firefighters trying to distract you from the thoughts. Top priority is noticing that this is happening and then attending to the underlying emotional dynamics.

Yes!

Valentine has also written some good stuff on this, in e.g. The Art of Grieving Well:

I think the first three so-called “stages of grief” — denial, anger, and bargaining — are avoidance behaviors. They’re attempts to distract oneself from the painful emotional update. Denial is like trying to focus on anything other than the hurt foot, anger is like clutching and yelling and getting mad at the situation,

...

Curated.

The internal family systems model has seen a lot of discussion in various rationalist and rationalist-adjacent places, but:

a) usually among people who were already familiar with it,

b) usually with a vague disclaimer of being a fake-framework, without delving into the details of where the limits of the framework lie or how to contextualize it in a broader reductionist worldview.

I think it's been a long time coming for someone to write up a comprehensive case for why the model is worth taking seriously, placing it in terms that can be concretely reasoned about, built off of and/or falsified.

Really what I want is for Kaj's entire sequence to be made into a book. Barring that, I'll settle for nominating this post.

4Raemon5y

I endorse people using nominations to specify things that don't-quite-fit into the schema we laid out. I think nominating an entire sequence is a reasonable thing to do, and figuring out how to fit that into our overall review/publishing system is an important question. I don't know of a better way to do that other than to just encourage people to spell out what they wish to happen, and then... see what ad-hoc systems we can think of while processing that.

2Ben Pace5y

e.g. that happened with embedded agency and to some extent babble and prune

Have you read Minsky's _Society of Mind_? It is an AI-flavored psychological model of subagents that draws heavily on psychotherapeutic ideas. It seems quite similar in flavor to what you propose here. It inspired generations of students at the MIT AI Lab (although attempts to code it never worked out).

3Kaj_Sotala7y

I looked at the beginning of it a bit before writing this post, but at least the beginning of it gave the impression that its subagents were very low-level (IIRC, it started with an example of building a tower of blocks, or taking some similar physical action, using many different subagents) and overall it had a strong vibe of 80's AI, so then it didn't feel like the most useful thing to be reading.

1eggsyntax2y

Do you happen to recall where you got that information? I've wondered occasionally what later became of Minsky's approach; it's intuitively pretty compelling. I'd love to find a source of info on follow-up work.

1eggsyntax2y

Here's one later discussion I found, from 2003, by Push Singh at MIT's Media Lab. It attempts to summarize the implementable parts of the book, and talks about its history and more recent developments. A couple of interesting things:
* Unlike David's source, it says that 'Despite the great popularity of the book The Society of Mind, there have been few attempts to implement very much of the theory.'
* It says that Minsky's The Emotion Machine, forthcoming at the time, is in part a sequel to SoM. I haven't read it, so can't vouch for the accuracy of that statement.

1Kenny7y

I've read that book. One thing I think it's missing, if I'm remembering it correctly, is any interplay between 'bottom-up' and 'top-down' sub-agents. That seems to be a key dynamic à la perceptual control theory.

I've been attempting to use IFS for years without having read much more than brief summaries of it. This post put me on a much firmer footing with it and I was able to much more clearly categorize a bunch of things that have been happening over the past six months or so. Then over the weekend I had a low-level background internal screaming going on, and while my first couple rounds of attempts at resolving it only helped a little, I was finally able to isolate the issue and fix what turned out to be a massive misalignment. I have not felt this aligned in years.

So thank you very, very much for writing this.

4Kaj_Sotala7y

Whoa, glad you found it that useful! Thank you for letting me know. :) I do recommend reading at least Self-Therapy too, it mentions a number of details which I left out of this explanation, and which might be useful to know about when addressing future issues.

My 2 cents:

1 cent: It seems that sub-personalities do not actually exist, but are created by the human mind at the moment of query. The best way to explain this is to look at improvisation theatre, as described in Valentine's post Intelligent Social Web. The consequence of this non-actual existence of the subpersonalities is that we could have different expectations about types of personalities, and still get therapeutically useful and consistent-sounding results. For example, some people try to cure psychological problems by making a person to rem...

3Trinley Goldenberg7y

I've come to a similar conclusion that subagents are something like belief clusters. Which themselves are a closer-to-the-metal leaky abstraction of what's actually going on. However I'm open to the idea that Kaj's model is the right one here.

2avturchin7y

In fact, different people have different levels of schizotypy or, maybe, it would be better called fractionness of mind. On one side are purely monolithic humans, and on the other are people with genuine multiple personality disorder, which is very rare.

3waveman7y

This is one good way to rationalize them. It doesn't really much matter whether this is true or not.

It doesn't really much matter whether this is true or not.

I think it matters from the perspective that if subagents are simulated at query time, then a non-subagent model should be able to produce similar results to IFS, with fewer complications.

In my own experience comparing subagent-oriented approaches (e.g. IFS, Core Transformation) with non-subagent ones, the non-subagent ones generally require less work to figure out what is going on, because simulating parts that want to hide or deflect stuff is more energy-intensive and frustrating than just helping someone notice that they are hiding or deflecting things.

For example, when I segregate my own desires into parts, it increases the odds of an argument or of parts withholding information or motives, vs. presupposing that all my desires are mine and that I have good reasons even for doing apparently self-destructive things.

That being said, I can think of all kinds of situations where IFS as a metaphor would be superior to more direct approaches... but they all involve people for whom the subagent metaphor is an easier introduction to metacognition, and/or the stuff being dealt with is traumatic enough that you really want to keep

...

I'm actually kind of surprised that IFS seems so popular in rationalist-space, as I would've thought rationalists more likely to bite the bullet and accept the existence of their unendorsed desires as a simple matter of fact.

Some reasons for the popularity of IFS which seem true to me, and independent of whether you accept your desires:

It's the main modality that rationalists happen to know which lets you do this kind of thing at all. The other popular one is Focusing, which isn't always framed in terms of subagents, but in terms of the memory reconsolidation model it basically only does accessing; de- and reconsolidation will only happen to the extent that the accessing happens to trigger the brain's spontaneous mismatch detection systems. (Also the Bio-Emotive Framework has gotten somewhat popular of late, but that's a very recent development.)
Rationalists tend to really like reductionism, in the sense of breaking complex systems down into simpler parts that you can reason about. IFS is good at giving you various gears about how minds operate, and e.g. turning previously incomprehensible emotional reactions into a completely sensible chain of part

...

5pjeby6y

I don't think IFS is good reductionism, though. That is, presupposing subagents in general is not a reduction in complexity from "you're an agent". That's not actually reducing anything! It's just multiplying entities contra Occam. Now, if IFS said, "these are specific subagents that basically everyone has, that bias towards learning specific types of evolution-advantaged behavior", then that would actually be a reduction. If IFS said, "brains have modules for these types of mental behavior", (e.g. hiding, firefighting, etc.), then that would also be a reduction. But dividing people into lots of mini-people isn't a reduction. The way I reduce the same landscape of things is to group functional categories of mental behavior as standard modules, and treat the specific things people are reacting to and the actual behaviors as data those modules operate on. This model doesn't require any sort of agency, because it's just rules and triggers. (And things like "critical voices" are just triggered mental behavior or memories, not actual agents, which is why they can often be disrupted by changing the way they sound -- e.g. making them seductive or squeaky -- while keeping the content the same. If there were an "agent" desiring to criticize, this technique wouldn't make any sense.) As for compassion, the equivalent in what I'm doing would be the connect stage in collect/connect/correct:
* "Collect" is getting information about the problem, determining a specific trigger and automatic emotional response (the thing that we will test post-reconsolidation to ensure we got it)
* "Connect" is surfacing the inner experience and memory or belief that drives the response, either as a prediction or learned evaluation
* "Correct" is the reconsolidation part: establishing contradiction and generating new predictions, before checking if the automatic response from "Collect" changed
All of these require one to be able to objectively observe and communicate inner experience without

(adding to my other comment)

dividing people into lots of mini-people isn't a reduction.

And like, the post you're responding to just spent several thousand words building up a version of IFS which explicitly doesn't have "mini-people" and where the subagents are much closer to something like reinforcement learning agents which just try to prevent/achieve something by sending different objects to consciousness, and learn based on their success in doing so...

4pjeby6y

The presented model of Exiles, Managers, Firefighters, etc. all describes "parts" doing things, but the same ideas can be expressed without using the idea of "parts", which makes that idea redundant. For example, here is a simpler description of the same categories of behavior: Voila! The same three things (Exile, Firefighter, Manager), described in less text and without the need for a concept of "parts". I'm not saying this model is right and the IFS model is wrong, just that IFS isn't very good at reductionism and fails Occam's razor because it literally multiplies entities beyond necessity. From this discussion and the one on reconsolidation, I would hazard a guess that to the extent IFS is more useful than some non-parts-based (non-partisan?) approach, it is because one's treatment of the "parts" (e.g. with compassion) can potentially trigger a contradiction and therefore reconsolidation. (I would hypothesize, though, that in most cases this is a considerably less efficient way to do it than directly going after the actual reconsolidation.) Also, as I mentioned earlier, there are times when the UTE (thing we're Unwilling To Experience) is better kept conceptually dissociated rather than brought into the open, and in such a case the model of "parts" is a useful therapeutic metaphor. But "therapeutic metaphor" and "reductionist model" are not the same thing. IFS has a useful metaphor -- in some contexts -- but AFAICT it is not a very good model of behavior, in the reductionist sense of modeling. If I try to steelman this argument, I have to taboo "agent", since otherwise the definition of subagent is recursive and non-reductionistic. I can taboo it to "thing", in which case I get "things which just try to prevent/achieve something", and now I have to figure out how to reduce "try"... is that try iteratively? When do they try? How do they know what to try? As far as I can tell, the answers to all the important questions for actual understanding are pure hand

9Kaj_Sotala6y

If it was just that brief description, then sure, the parts metaphor would be unnecessary. But the IFS model contains all kinds of additional predictions and applications which make further use of those concepts. For example, firefighters are called that because "they are willing to let the house burn down to contain the fire"; that is, when they are triggered, they typically act to make the pain stop, without any regard for consequences (such as loss of social standing). At the same time, managers tend to be terrified of exactly the kind of lack of control that's involved with a typical firefighter response. This makes firefighters and managers typically polarized - mutually opposed - with each other. Now, it's true that you don't need to use the "part" expression for explaining this. But if we only talked about various behaviors getting reinforced, we wouldn't predict that the system simultaneously considers a loss of a social standing to be a bad thing, and that it also keeps reinforcing behaviors which cause exactly that thing. Now, obviously it can still be explained in a more sophisticated reinforcement model, in which you talk about e.g. differing prioritizations in different situations, and some behavioral routines kicking in at different situations... ...but if at the end, this comes down to there being two distinct kinds of responses depending on whether you are trying to avoid a situation or are already in it, then you need names for those two categories anyway. So why not go with "manager" and "firefighter" while you're at it? And sure, you could call it, say, "a response pattern" instead of "part" - but the response pattern is still physically instantiated in some collection of neurons, so it's not like "part" would be any less correct, or worse at reductionism. Either way, you still get a useful model of how those patterns interact to cause different kinds of behavior. I agree that the practical usefulness of IFS is distinct from the question of w

For example, firefighters are called that because “they are willing to let the house burn down to contain the fire”; that is, when they are triggered, they typically act to make the pain stop, without any regard for consequences (such as loss of social standing). At the same time, managers tend to be terrified of exactly the kind of lack of control that’s involved with a typical firefighter response. This makes firefighters and managers typically polarized - mutually opposed - with each other.

In my experience, this distinction merely looks like normal reinforcement: you can be short-term reinforced to do things that are against your interests in the long-term. This happens with virtually every addictive behavior; in fact, Dodes’ theory of addiction is that people feel better the moment they decide to drink, gamble, etc., and it is that decision that is immediately reinforced, while the downsides of the action are still distant. (Indeed, he notes that people often make that decision hours in advance of the actual behavior.)

If we only talked about various behaviors getting reinforced, we wouldn’t predict that the system simultaneously considers a loss of a social standing to b


The content of this and the other comment thread seems to be overlapping, so I'll consolidate (pun intended) my responses to this one. Before we go on, let me check that I've correctly understood what I take to be your points.

Does the following seem like a fair summary of what you are saying?

Re: IFS as a reductionist model:

Good reductionism involves breaking down complex things into simpler parts. IFS "breaks down" behavior into mini-people inside our heads, each mini-person being equally complex as a full psyche. This isn't simplifying anything.
Talking about subagents/parts or using intentional language causes people to assign things properties that they actually don't have. If you say that a thermostat "wants" the temperature to be something in particular, or that a part "wants" to keep you safe, then you will predict its behavior to be more flexible and strategic than it really is.
The real mechanisms behind emotional issues aren't really doing anything agentic, such as strategically planning ahead for the purpose of achieving a goal. Rather they are relatively simple rules which are used to trigger built-in subsystems that


7pjeby6y

Excellent summary! There are a couple of areas where you may have slightly over-stated my claims, though: I wouldn't say that IFS claims each mini-person is equally complex, only that the reduction here is just a separation of goals or concerns, and does not reduce the complexity of having agency. And this is particularly important because it is the elimination of the idea of smart or strategic agency that allows one to actually debug brains. Compare to programming: when writing a program, one intends for it to behave in a certain way. Yet bugs exist, because the mapping of intention to actual rules for behavior is occasionally incomplete or incorrectly matched to the situation in which the program operates. But, so long as the programmer thinks of the program as acting according to the programmer's intention (as opposed to whatever the programmer actually wrote), it is hard for that programmer to actually debug the program. Debugging requires the programmer to discard any mental models of what the program is "supposed to" do, in order to observe what the program is actually doing... which might be quite wrong and/or stupid. In the same way, I believe that ascribing "agency" to subsets of human behavior is a similar instance of being blinded by an abstraction that doesn't match the actual thing. We're made up of lots of code, and our problems can be considered bugs in the code... even if the behavior the code produces was "working as intended" when it was written. ;-) I don't claim that IFS assumes dedicated per-instance hardware; but it seems kind of implied. My understanding is that IFS at least assumes that parts are agents that 1) do things, 2) can be conversed with as if they were sentient, and 3) can be reasoned or negotiated with. That's more than enough to view it as not reducing "agency". But the article that we are having this discussion on does try to model a system with dedicated agents actually existing (whether in hardware or software), so at l

7Kaj_Sotala6y

Here's my reply! Got article-length, so I posted it separately.

2Kaj_Sotala6y

Thanks for the clarifications! I'll get back to you with my responses soon-ish.

8Eli Tyre6y

This is a great comment, and I'm glad you wrote it. I'm rereading it several times over to try and get a handle on everything that you're saying here. In particular, I really like the "muscle" vs. "part" distinction. I've been pondering lately, when I should just squash an urge or desire, and when I should dialogue with it, and this distinction brings some things into focus. I have some clarifying questions though: I don't know what you mean by this at all. Can you give (or maybe point to) an example? --- This is fascinating. When I read your stressing out example, my thought was basically "wow. It seems crazy-difficult to surface the core underlying assumptions". But you think that this is harder, in the IFS framework. That is amazing, and I want to know more. In practice, how do you go about eliciting the rules and then emotionally significant instances? Maybe in the context of this example, how do you get from "I seem to be overly stressed about stuff" to the memory of your mother yelling at you? --- I'm trying to visualize someone doing IFS or IDC, and connect it to what you're saying here, but so far, I don't get it. What are the "examples"? Instances that are counter to the rule / schema of some part? (e.g. some part of me believes that if I ever change my mind about something important, then no one will love me, so I come up with an example of when this isn't or wasn't true?) --- Given that, doesn't it make sense to break down the different parts of a RL policy into parts? If different parts of a policy are acting at cross purposes, it seems like it is useful to say "part 1 is doing X-action, and part 2 is doing Y-action." ...But you would say that it is even better to say "this system, as a whole is doing both X-action, and Y-action"?

I don't know what you mean by this at all. Can you give (or maybe point to) an example?

So, let's take the example of my mother stressing over deadlines. Until I reconsolidated that belief structure... or hell, since UTEB seems to call it a "schema", let's just call it that. I had a schema that said I needed to be stressed out if the goal was serious. I wasn't aware of that, though: it just seemed like "serious projects are super stressful and I never know what to do", except wail and grind my teeth (figuratively speaking) until stuff gets done.

Now, I was aware I was stressed, and knew this wasn't helpful, so I did all sorts of things to calm down. People (like my wife) would tell me everything was fine, I was doing great, go easier/don't be so hard on yourself, etc. I would try practicing self-compassion, but it didn't do anything, except maybe momentarily, because structurally, being not-stressed was incompatible with my schema.

In fact, a rather weird thing happened: the more I managed to let go of judgments I had about how well I was doing, and the better I got at being self-compassionate, the worse I felt. It wasn't the same kind of stress, but it was actually worse, d


4Eli Tyre6y

I wonder how much of this discussion comes down to a different extensional referent of the word "part". According to my view, I would call "the reinforced pattern to activate the 'distress' muscle [in some specific set of circumstances]" a part. That's the thing that I would want to dialogue with. In contrast, I would not call the "distress muscle" itself a part, because (as you say) the distress muscle doesn't have anything like "beliefs" that could update. In that frame, do you still have an objection?

3pjeby6y

And I don't understand how you could "dialogue" with such a thing, except in the metaphorical sense where debugging is a "dialogue" with the software or hardware in question. I don't ask a stimulus-response pattern to explain itself, I dialogue with the client or with my inner experience by trying things or running queries, and the answers I get back are whatever the machine does in response. I don't pretend that the behavior pattern is a coherent entity with which I can have a conversation in English, as for me that approach has only ever resulted in confusion, or at best some occasionally good but largely irreproducible results. And I specifically coach clients not to interpret those responses they get, but just to report the bare fact of what is seen or felt or heard, because the purpose is not to have a conversation but to conduct an investigation or troubleshooting process. A stimulus-response pattern doesn't have goals or fears; goals or fears are things we have, that we get from our SR rules as emergent properties. That's why treating them as intentional agents makes no sense to me: they're what our agency is made of, but they themselves are not a class of thing that could even comprehend such a thing as the notion of agency. Schemas are mental models, not utilitarian agents... not even in a theoretical sense! Humans don't weigh utility, we have an action planner system that queries our predictive model for "what looks like something good to do in this situation", and whatever comes back fastest tends to win, with emotionally weighted stuff or stuff tagged by certain mental muscles getting wired into faster routes. To put it another way, I think the thing you're thinking you can dialogue with is actually a spandrel of sorts, and it's a higher-level unit than what I work with. IFS, in ascribing intention, necessarily has to look at more complex elements than raw, minuscule, almost "atomic" stimulus-response patterns, because that's what's required if you

2Kaj_Sotala6y

I'm not sure why IFS's exile-manager-firefighter model doesn't fit this description? E.g. modeling something like my past behavior of compulsive computer gaming as a loop of inner critic manager pointing out that I should be doing something -> exile being triggered and getting anxious -> gaming firefighter seeking to suppress the anxiety with a game -> inner critic manager increasing the level of criticism and triggering the other parts further, has felt like a reduction to simpler components, rather than modeling it as "little people". They're basically just simple trigger-action rules too, like "if there is something that Kaj should be doing and he isn't getting around doing it, start ramping up an increasing level of reminders". There's also Janina Fisher's model of IFS parts being linked to various specific defense systems. The way I read the first quote in the linked comment, she does conceptualize IFS parts as something like state-dependent memory; for exiles, this seems like a particularly obvious interpretation even when looking at the standard IFS descriptions of them, which talk about them being stuck at particular ages and events. Certainly one can get the effect without compassion too, but compassion seems like a particularly effective and easy way of doing it. Especially given that in IFS you just need to ask parts to step aside until you get to Self, and then the compassion is generated automatically.
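The gaming loop described at the start of this comment can be sketched as a handful of trigger-action rules. This is only an illustrative toy: the state fields, the anxiety threshold, and the update amounts are all assumptions made up for the sketch, not anything specified by IFS.

```python
# Toy sketch of the critic -> exile -> firefighter -> critic loop.
# All names, numbers, and thresholds are hypothetical illustrations.

ANXIETY_THRESHOLD = 2  # assumed level at which the firefighter rule fires


def inner_critic(state):
    # Manager rule: if something should be done and isn't, ramp up reminders.
    if state["task_pending"] and not state["working"]:
        state["criticism"] += 1


def exile(state):
    # Exile rule: criticism triggers anxiety in proportion to its level.
    state["anxiety"] += state["criticism"]


def gaming_firefighter(state):
    # Firefighter rule: once anxiety is high enough, suppress it with a game.
    if state["anxiety"] >= ANXIETY_THRESHOLD:
        state["gaming_sessions"] += 1
        state["anxiety"] = 0


def simulate(steps):
    state = {
        "task_pending": True,
        "working": False,
        "criticism": 0,
        "anxiety": 0,
        "gaming_sessions": 0,
    }
    for _ in range(steps):
        inner_critic(state)
        exile(state)
        gaming_firefighter(state)
    return state


if __name__ == "__main__":
    print(simulate(3))
```

Each rule only reads and writes a few fields of shared state; none of them plans ahead. In this toy, the task never gets done, so criticism keeps rising and the gaming rule keeps firing, which is the self-reinforcing shape of the loop described above.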

2pjeby6y

Because this description creates a new entity for each thing that happens, such that the total number of entities under discussion is "count(subject matter) times count(strategies)" instead of "count(subject matter) plus count(strategies)". By simple math, a formulation which uses brain modules for strategies plus rules they operate on, is fewer entities than one entity for every rule+strategy combo. And that's not even looking at the brain as a whole. If you model "inner criticism" as merely reinforcement-trained internal verbal behavior, you don't need even one dedicated brain module for inner criticism, let alone one for each kind of thing being criticized! Similarly, you can model most types of self-distraction behaviors as simple negative reinforcement learning: i.e., they make pain go away, so they're reinforced. So you get "firefighting" for free as a side-effect of the brain being able to learn from reinforcement, without needing to posit a firefighting agent for each kind of deflecting behavior. And nowhere in these descriptions is there any implication of agency, which is critical to actually producing a reductionist model of human behavior. Turning a human from one agent into multiple agents doesn't reduce anything.
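As a rough illustration of the "times" versus "plus" counting argument above, here is a minimal sketch. The subject and strategy lists are made-up examples chosen only to make the arithmetic concrete.

```python
from itertools import product

# Hypothetical example lists; neither is taken from IFS or from the discussion.
subjects = ["work", "exercise", "social life", "finances"]
strategies = ["inner criticism", "distraction", "avoidance"]

# One dedicated entity for every (subject, strategy) combination.
per_combo_entities = list(product(subjects, strategies))

# General-purpose strategy mechanisms plus the rules (subjects) they operate on.
shared_entities = strategies + subjects

print(len(per_combo_entities))  # count(subject matter) * count(strategies)
print(len(shared_entities))     # count(subject matter) + count(strategies)
```

With 4 subjects and 3 strategies, the per-combination formulation has 12 entities and the shared formulation has 7; the gap widens as either list grows.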

2Kaj_Sotala6y

It seems to me that the emotional schemas that Unlocking the Emotional Brain talks about, are basically the same as what IFS calls parts. You didn't seem to object to the description of schemas; does your objection also apply to them? IFS in general is very vague about how exactly the parts are implemented on a neural level. It's not entirely clear to me what kind of a model you are arguing against and what kind of a model you are arguing for instead, but I would think that IFS would be compatible with both. I agree that reinforcement learning definitely plays a role in which parts/behaviors get activated, and discussed that in some of my later posts [1 2]; but there need to be some innate hardwired behaviors which trigger when the organism is in sufficient pain. An infant which needs help cries; it doesn't just try out different behaviors until it hits upon one which gets it help and which then gets reinforced. And e.g. my own compulsive behaviors tend to have very specific signatures which do not fit together with your description; e.g. a desire to keep playing a game can get "stuck on" way past the time when it has stopped being beneficial. Such as when I've slept in between and I just feel a need to continue the game as the first thing in the morning, and there isn't any pain to distract myself from anymore, but the compulsion will produce pain. This is not consistent with a simple "behaviors get reinforced" model, but it is more consistent with a "parts can get stuck on after they have been activated" model. Not sure what you mean by agency?

It seems to me that the emotional schemas that Unlocking the Emotional Brain talks about, are basically the same as what IFS calls parts. You didn't seem to object to the description of schemas; does your objection also apply to them?

AFAICT, there's a huge difference between UTEB's "schema" (a "mental model of how the world functions", in their words) and IFS' notion of "agent" or "part". A "model" is passive: it merely outputs predictions or evaluations, which are then acted on by other parts of the brain. It doesn't have any goals, it just blindly maps situations to "things that might be good to do or avoid". An "agent" is implicitly active and goal-seeking, whereas a model is not. "Model" implies a thing that one might change, whereas an "agent" might be required to change itself, if a change is to happen.

UTEB also describes the schema as "wordlessly [defining] how the world is" -- which is quite coherent (no pun intended) with my own models of mindhacking. I'm actually looking forward to reading UTEB in full, as the introduction makes it sound like the models I've developed of how this stuff works, are quite similar to theirs.

(Indeed, my own approach is specifically tar


6Eli Tyre6y

Cool. That makes sense. Well, when I talk with people at CFAR workshops, fairly often someone will have the problem of "akrasia" and they'll conceptualize it, more or less, as "my system 1 is stupid and doesn't understand that working harder at my job is the only thing that matters, and I need tools to force my S1 to do the right thing." And then I might suggest that they try on the frame where "the akrasia part", is actually an intelligent "agent" trying to optimize for their own goals (instead of a foreign, stupid entity, that they have to subdue). If the akrasia was actually right, why would that be? And they realize that they hate their job, and obviously their life would be terrible if they spent more of their time working at their terrible job. [I'm obviously simplifying somewhat, but this exact pattern does come up over and over again at CFAR workshops.] That is, in practice, the part, or subagent framing helps at least some people to own their desires more, not less. [I do want to note that you explicitly said, "What I am saying, and have been saying, is that nominalizing behavior patterns as "parts" or "agents" is bad reductionism, independent of its value as a therapeutic metaphor."] --- This doesn't seem right in my personal experience, because the "agents" are all me. I'm conceptualizing the parts of myself as separate from each other, because it's easier to think about that way, but I'm not disowning or disassociating from any of them. It's all me.

4pjeby6y

So my response to that is to say, "ok, let's get empirical about that. When does this happen, exactly? If you think about working harder right now, what happens?" Or, "What happens if you don't work harder at your job?" In other words, I immediately try to drop to a stimulus-response level, and reject all higher-level interpretive frameworks, except insofar as they give me ideas of where to drop my depth charges, so to speak. :) I usually don't bring that kind of thing up until a point has been reached where the client can see that empirically. For example, if I've asked them to imagine what happens if they get their wish and are now working harder at their job... and they notice that they feel awful or whatever. And then I don't need to address the intentionality at all. And sometimes, the real problem has nothing to do with the work and everything to do with a belief that they aren't a good person unless they work more, so it doesn't matter how terrible it is... but also, the very fact that they're guilty about not working more may be precisely the thing they're avoiding by not working! In other words, sometimes an intentional model fails because brains are actually pretty stupid, and have design flaws such that trying to view them as having sensible or coherent goals simply doesn't work. For example, our action planning subsystem is really bad at prioritizing between things we feel good about doing vs. things we feel bad about not doing. It wants to avoid the things we feel bad about not doing, because when we think about them, we feel bad. That part of our brains doesn't understand things like "logical negation" or "implicative reasoning", it just processes things based on their emotional tags. (i.e., "bad = run away") And I'm also not saying I never do anything that's a modeling of intention. But I get there bottom-up, not top-down, and it only comes up in a few places. Also, most of the intentional models I use are for things that pass through the brain

6Trinley Goldenberg6y

This is confusing Dissociation and Integration. I made a 2x2 that helps disambiguate. http://mattgoldenberg.net/wp-content/uploads/2019/10/2x2s-Integration-vs.-Association-2x2.jpg

4pjeby6y

Interesting diagram. I don't really understand it, though, because to me it looks like Focusing is on the wrong side, since Focusing deals in a unified "felt sense" rather than disparate parts -- at least to my understanding of it. Actually, I'm generally confused because without the mental state used by Focusing, Core Transformation, the Work, and Sedona don't work properly, if at all. So I don't understand how it could be separate. Similarly, I can see how CBT could be considered dissociated, but not Focusing. Anyway, when I referred to "dissociating", above, I meant it in the casual sense of people wanting to dis-associate, as in, "I'm not with him..." Not the technical sense of a dissociative experience or D.I.D., though one can also have the desire to detach or disconnect from one's experience in a dissociative way. In general, I was using the term to suggest something like, "the spectrum of ways people try to make an experience unreal or to deny its significance", which includes a variety of strategies including disavowal, denial, and deflection, as well as actual dissociation in the technical sense.

2Trinley Goldenberg6y

In general I think you're quite confused about how most people use the parts terminology if you think felt senses aren't referring to parts, which typically represent a "belief cluster" and visual, kinesthetic, or auditory representation of that belief cluster, often that's anthropomorphized. Note that parts can be different sizes, and you can have a "felt sense" related to a single belief, or clusters of beliefs. You're confusing dissociation and integration here again, so I'll just address the dissociation part. Note that all the things I'm saying here are ORTHOGONAL to the issue of "parts". Yes, focusing is in one sense embodied and experiential as opposed to something like CBT. However, this stuff exists on a gradient, and in focusing the embodiment is explicitly dissociated from and viewed as other. Here's copypasta from twitter: Here's a quote from http://focusing.org that points towards a dissociative stance: " When some concern comes, DO NOT GO INSIDE IT. Stand back, say "Yes, that’s there. I can feel that, there." Let there be a little space between you and that." I've heard an acquaintance describe a session with Anne Weiser-Cornell where they kept trying to say "this is my feeling" and she kept correcting to "this feeling in my body", which again is more of a dissociative stance. Now, is focusing looking to CAUSE dissociation? No, it's using dissociation as a tool because oftentimes people get so caught up in the trees they can't see the forest. For those people, that small bit of dissociation is useful. Similarly, tools that are associated are often useful for people who tend to view themselves as "other". If people tend to dissociate, it can be useful to realize that this is "me". > Anyway, when I referred to "dissociating", above, I meant it in the casual sense of people wanting to dis-associate, as in, "I'm not with him..." Not the technical sense of a dissociative experience. Me as well. I still maintain that viewing things as parts rathe

4pjeby6y

I was under the impression that IFS calls that "unblending", just as ACT calls it "de-fusing". I personally view it more as a stance of detachment or curious, neutral observation. But I don't object to someone saying "I feel X", because that's already one step removed from "X"! If somebody says, "everything is awful" they're blended or fused or whatever you want to call it. They're taking the map as equivalent to the territory. Saying, "It feels like everything is awful" or "I feel awful" is already one level of detachment, and an okay place to start from. In common psychotherapy, I believe the term "dissociation" is usually associated with much greater levels of detachment than this, unless you're talking about NLP. The difference in degree is probably why ACT and IFS and others have specialized terms like "unblending" to distinguish between this lesser level of detachment, and the type of dissociative experience that comes with say, trauma, where people experience themselves as not even being in their body. Honestly, if somebody is so "in their head" that they don't experience their feelings, I have to go the opposite route of making them more associated and less detached, and I have plenty of tools for provoking feelings in order to access them. I don't want complete dissociation from feelings, nor complete blending with them, and ISTM that almost everything on your chart is actually targeted at that same sweet spot or "zone" of detached-but-not-too-detached. In touch with your experience, but neither absorbed by it nor turning your back on it. Anyway, I think maybe I understand the terms you're using now, and hopefully you understand the ones I'm using. Within your model I still don't know what you'd call what I'm doing, since my "Collect" and "Connect" phases would seem to be in the quadrant with Focusing, while my "Correct" phase explicitly uses The Work and variations on it. And my model doesn't have a notion of parts outside of mental muscles or a metap

4Trinley Goldenberg6y

I think a proper method should be everywhere. There's not a "correct" box, only a correct box for a given person at a given time in a given situation.

6Qiaochu_Yuan6y

Wow, thank you for writing this. This really clarified something for me that I'm in the process of digesting.

5DaystarEld6y

I will note that, in my own practice, IFS and subagents are never presented as "separate from you," but rather "parts of you." What you're describing sounds more like what Narrative Therapy sometimes does, in externalizing and personifying the Anger or Addiction or whatever, and then working to better understand its influences on you and your ability to influence it and so on, though the framing on that can also vary greatly between one practitioner and another. Insofar as some people use IFS to "other" their internal desires or behaviors, this feels like it's naturally determined by the "client" more than anything. Some people just find the idea of breaking themselves down into sub-agents or "child vs teenage vs adult self" really clicks with the way they relate to their competing desires and goals, without quite giving up "responsibility" for them... but that opens up a new conversation about how important the sense of "responsibility" for our flaws actually is toward addressing them, which also probably depends a lot on how motivated the client is toward change.

4pjeby6y

Yes, but "part of you" can still be disowning/deflection. It allows one to remain disidentified from the "part", i.e., "oh, it's just that part of me, it's not really me". It allows you to disclaim endorsement of the "part's" values. I can see how it might work for some people. I just avoid it because the clients I work with usually have a metric ton of stuff they're other-ing or judging themselves about or disavowing, so dealing with that issue is already on the critical path for getting done what they came to me for. (And the people who come to me talking about how wonderful IFS is, frequently seem to be the ones with the worst denial issues, so that's probably why I get a bit passionate about explaining why, at least for them, it's a really bad idea to keep doing that.) But yeah, any modality can be abused by anybody in order to keep themselves from changing, and all self-help advice can be trivially weaponized for self-destruction. After all, somebody could easily take what I'm saying about IFS and turn it into ammunition to punish themselves more, because they need to "take responsibility" for all their awful, awful parts. ;-) That being said, I don't say that people need to "take responsibility", just that they need to admit the truth about what they want. It's okay to wish you didn't want something you want, but trying to pretend you don't want it or that it's not you who wants it isn't always a viable coping strategy, and in fact is often crazy-making. That is, the brain's decision-making system appears to be able to handle, "I want this but it's not a good idea", much better and more sanely than it handles "I want to not know that I want this"! The latter is just begging to end up with compulsive behaviors outside of conscious control (because if they could control the behavior, it would mean that they're the one who's doing the wanting).

2Eli Tyre6y

Huh. This does not resonate with my experience, but I will henceforth be on the lookout for this.

9pjeby6y

To be fair, I doubt that my sample size of such individuals is statistically significant. But in the few times a client has brought up IFS and either enthusiastically extolled it or seemed to be wanting me to validate it as something they should try, it seemed to me to be related to either the person's schema of helplessness (i.e., these parts are doing this to me), or of denial (i.e., I would be successful if I could just fix all these broken parts!), which IMO are both treating the parts metaphor as a way to support and sustain the very dysfunctions that were causing their problems in the first place. In general, I suspect people are naturally attracted to the worst possible modes of therapy for fixing their problems, at least if they know anything about the therapy in question! (And I include myself in that, since I've avoided therapy generally since a bad experience with it in college, and for a long time avoided any self-help modality that involved actually being self-compassionate or anything other than supporting my "fix my broken stuff so I can get on with life" attitude. It's possible that with the right approach and therapist I could potentially have changed faster, once you count all the time I spent researching and developing my methods, all the failures and blind alleys. But I'm happy with the outcome, since more people are being helped than just me, and getting people out of the kinds of pain I suffered is rewarding in its own way.)

4Eli Tyre6y

I've always disliked the term "subagent", but this sentence seems to capture what I mean when I'm talking about psychological "parts". So I think I agree with you about the ontological status of parts, but I can't tell, if you're making some bolder claim. What are you imagining would be the case if IFS was literally true, and subagents were real, instead of "just a metaphor"? . . . In fact, I dislike the word "subagent", because it imports implications that might not hold. A part might be agent-like, but it also might be closer to an urge or a desire or an impulse. To my understanding the key idea of the "parts" framing, is that I should assume, by default, that each part is acting from a model, a set of beliefs about the world or my goals. That is, my desire/ urge / reflex, is not "mindless": it can update. Overall this makes your comment read to me as "these things are not really [subagents], they're just reactions that have [these specific properties of subagents]."

2pjeby6y

Well, for one thing, that they would intelligently shift their behavior to achieve their outcomes, rather than stupidly continuing things that don't work any more. That would be one implication of agency. Also, if IFS were literally true, and "subagents" were the atomic unit of behavior, then the UTEB model shouldn't work, and neither should mine or many other modalities that operate on smaller, non-intentional units. Ah! Now we're getting somewhere. In my frame, an urge, desire or impulse is a reaction. The "response" in stimulus-response. Which is why I want to pin down "when does this thing happen?", to get the stimulus part that goes with it. I see it differently: we have mental models of the world, that contain "here are some things that might be good to do in certain situations", where "things to do" can include "how you should feel, so as to bias towards a certain category of behaviors that might be helpful based on what we know". (And the actions or feelings listed in the model can be things other people did or felt!) In other words, the desire or urge is the output of a lookup table, and the lookup table can be changed. But both the urge and the lookup table are dumb, passive, and prefer not to update if at all possible. (To the extent that information processed through the lookup table will be distorted to reinforce the validity of what's already in the lookup table.) Even in the cases where somebody makes a conscious decision to pursue a goal, (e.g. a child thinking "I'll be good so my parents will love me", or "I'll be perfect so nobody can reject me"), that's just slapping an urge or desire into the lookup table, basically. It doesn't mean we pursue it in any systematic or even sane way! So, what you're seeing as a coherent "part", I see as a collection of assorted interacting machinery that, when it works, could maybe be seen as an intelligent goal-seeking agent... but mostly is dumb machinery subject to all kinds of weird breakage scenarios, tur

1lemonhope6y

As with real and fake memories, I think if you’re careful then you can mainly deal with real ones

Wow. So glad I ended up on a Goodreads review for the IFS main book and this article was recommended. Just wanted to say thank you for the metaphor presented, really helpful.

2Kaj_Sotala5y

Glad it was of use! :)

So I finally read up on it, and have been successfully applying it ever since.

Could you give some examples of where you've been applying IFS and how it's been helpful in those situations?

So I find IFS, Focusing, IDC, and some aspects of TMI-style meditation to basically have blended together into one big hybrid technique for me; they all feel like different aspects of what's essentially the same skill of "listening to what your subagents want and bringing their desires into alignment with each other"; IFS has been the thing that gave me the biggest recent boost, but it's not clear to me that I'm always doing "entirely pure IFS", even though I think there's nearly always a substantial IFS component. (Probably most important has been the part about getting into Self, which wasn't a concept I explicitly had before this.)

That said, a few examples. I already mentioned a few in an earlier post:

My experience is that usually if I have an unpleasant emotion, I will try to do one of two things: either reject it entirely and push it out of my mind, or buy into the story that it’s telling and act accordingly. Once I learned the techniques for getting into Self, I got the ability to sort of… just hang out with the emotion, neither believing it to be absolutely true nor needing to show it to be false. And then if I e.g. had feelings o

... (read more)

I am not OP but I can give an example.

As background there are some activities that are general purpose feeling obliterators and thus are commonly used by firefighters: binge-eating, drinking alcohol, drugs, sex, TV, video games...

I have been fighting with my weight for many (26!) years. I did lose a lot of weight but still at BMI 26 and could not get off that last 7kg. Using the IFS process I identified the firefighters which used eating to make various feelings go away:

Social stress, anxiety about food being available (from when I was young = "Jimmi"), feelings of emotional deprivation (childhood situation), feelings of frustration when I could not understand something, feeling tired, feeling frightened (childhood situation)

Once I connected with these protectors and made friends with them, connected (with their permission) with the original exiles, and established that the problems have solutions, I have been able to stick to my diet for 50 days straight and lose 2.5kg in less than two months. This takes me almost half way to my target.

As an example of how much has changed, I have had a packet of chocolate biscuits in my refrigerator for the last few weeks with no drama at... (read more)

The back-and-forth (here and elsewhere) between Kaj & pjeby was an unusually good, rich, productive discussion, and it would be cool if the book could capture some of that. Not sure how feasible that is, given the sprawling nature of the discussion.

Nomination for 2019 review:

I originally tried to read Self-Therapy, but bounced off of it because it was aimed too much at people with major life-impacting traumas. This post was much more approachable, and I liked the robot metaphor. Since reading it, I started to notice the ways in which my own mind is behaving like a manager or firefighter with respect to embarrassing incidents in the past.

I came back to this post because I was thinking about Scott's criticism of subminds where he complains about "little people who make you drink beer because they like beer".

I'd already been considering how your robot model is nice for seeing why something submind-y would be going on. However, I was still confused about thinking about these various systems as basically people who have feelings and should be negotiated with, using basically the same techniques I'd use to negotiate with people.

Revisiting, the "Personalized characters" section was pretty useful

... (read more)

I really enjoyed this post and starting with the plausible robot design was really helpful for me accessing the IFS model. I also enjoyed reflecting on your previous objections as a structure for the second part.

The part with repeated unblending sounds reminiscent of the "Clearing a space" stage of Focusing, in which one acknowledges and sets slightly to the side the problems in one's life. Importantly, you don't "go inside" the problems (I take 'going inside' to be more-or-less experiencing the affect associated with the problems). This seems pretty simil

... (read more)

6Kaj_Sotala7y

Thanks, that's very nice and specific feedback. :) Yeah, these feel basically like the same kind of thing. I find that Focusing and IFS have basically blended into some hybrid technique for me, with it being hard to tell the difference anymore. Possibly combined with other related practices, such as Focusing: Elimination of internal conflicts, increased well-being due to improved access to Self, better ability to do things which feel like worth doing. The personal examples in my other comment may give a better idea.

I've read a lot of books in the self-help/therapy/psychology cluster, but this is the first which gives a clear and plausible model of why the mental structure they're all working with (IFS exiles, EMDR unprocessed memories, trauma) has enough fitness-enhancing value to evolve despite the obvious costs.

I'm a little late to the party, but I just read through and did the exercises of Self-Therapy last week and am feeling very excited about how many components of the model "clicked" with me. Reading this post gave me insights into why those components resonated with me, so thank you very much for taking the time to write up this supremely helpful post!

The one aspect of the model that I've been having a lot of trouble with, which I view as problematic since the entire model essentially hinges on this practice, is to have an "organic... (read more)

4Kaj_Sotala5y

Happy to hear that the post was useful to you! First piece of advice: don't do that. :-) I feel pretty comfortable saying that this approach is guaranteed not to produce any results. Intellectualizing parts will basically only give you the kind of information that you could produce by intellectual analysis, and for intellectual analysis you don't need IFS in the first place. Even if your guesses are right, they will not produce the kind of emotional activation that's necessary for change. A few thoughts on what to do instead... It sounds (correct me if I'm wrong) like you are giving the part a visual appearance by thinking of the nature of the problem, and choosing an image which seems suitably symbolic of it; then you try to interact with that image. In that case, you are picking a mental image, but the image isn't really "connected" to the part, so the interaction is not going to work. What you want to do is to first get into contact with the part, and then let a visual image emerge on its own. (An important note: parts don't have to have a visual appearance! I expect that one could do IFS even if one had aphantasia. If you try to get a visual appearance and nothing comes up, don't force it, just work with what you do have.) So I would suggest doing something like this: * Think of some concrete situation in which you usually procrastinate. If you have a clear memory of a particular time, let that memory come back to mind. Or you could imagine that you are about to do something that you've usually been procrastinating on. Or you could just pick something that you've been procrastinating on and try doing it right now, to get that procrastination response. * Either way, what you are going for are the kinds of thoughts, feelings, and bodily sensations that are normally associated with you procrastinating. Pay particular attention to any sensations in your body. Whatever it is that you are experiencing, try describing it out loud. For example: "when I think of

3Peter Chang5y

Thank you so much for your detailed response! That makes a lot of sense. I think I need to focus on working with my "impatience" part before I can truly get into the kind of patient and tolerant Self that you are describing. I think I might have gotten a bit derailed due to my experience training for memory competitions. I had to come up with 2700+ very specific visual images of characters each corresponding to a pair of playing cards, and so I've developed this sometimes-annoying habit of quickly making a tenuous association between any information I process and some figure familiar to me. Paying careful attention to the relatively-reliable physical sensations that are triggered with particular trailheads and starting from there sounds like a great idea. Thanks again!

Seems like directly entering a Catastrophic situation (burning hand on hot stove) without going through Distress would lead to a more severe Manager (or Exile), like PTSD. I.e., a soldier walking into a firefight and being shot vs. being shot by a sniper. Related: losing a limb suddenly vs. having it amputated (with advance warning) seems to make it more likely you'd have Phantom Limb pain b/c your mind never registered the limb was missing.

I'm finding it fruitful to consider the "exiles" discussion in this post alongside Hunting the Shadow.

5Kaj_Sotala7y

It doesn't really fit nicely into the simplified version of IFS that I presented in this post, but in the context of Hunting the Shadow, it's worth noting that some protector parts can get exiled too.

2Kaj_Sotala7y

(I now talk about exiled protectors a bit in "Subagents, neural Turing machines, thought selection, and blindspots"; quite relevant for the topic of hunting one's shadow, if I may say so myself)

This is a great post; particularly in how you narrate bouncing off of it and then building a model by which it or something like it is plausible.

I actually had the luck of having an in-person demonstration of this (IFS-style therapy) from someone in the LW/rationalist community years ago and I've been discussing it and recommending it to others ever since.

Wow, this is all very interesting.

I have been using this framework for a bit and I think I have found some important clues about some exile-manager-firefighter dynamics in myself. Although I'm just starting and I still have to clarify my next steps, I feel hopeful that this is the right direction.

There are some things which I would like to know more about. Feel free to answer any.

Which agent should the sympathetic listener be talking to? The manager, the exile, or both?

Assuming that one correctly identifies which thoughts (and ultimately, which situat... (read more)

5Kaj_Sotala7y

Glad to hear it's been of use! First with any of the managers which might be protecting the exiles. Eventually they might give access to the exile, but it's important to not try to rush through them. You only go to the exile after the managers have agreed to give you access to it: bypassing them risks causing damage because the managers had concerns which weren't taken into account. (Self-Therapy has detailed instructions on this.) You might e.g. end up exposing an exile in a situation where you don't have the resources to handle it, and then instead of healing the exile, you end up worsening the original trauma. That will also have the added effect of making your managers less likely to trust you with access to the exile again. Though sometimes I've had exiles pop up pretty spontaneously, without needing to negotiate with managers. In those situations I've just assumed that all managers are fine with this, since there's no sense of a resistance to contacting the exile. If that happens then it's probably okay, but if it feels like any managers are getting in the way, then address their concerns as much as possible. (As the instructor said in an IFS training I did: "to go fast, you need to go slow".) IFS also recommends checking back with the managers after healing the exile, so that they can see that the exile is actually healed now and that they can behave differently in the future. Also, you may want to keep checking back with the exile for a while afterwards, to ensure that it's really been healed. Depends. I think that either are possible, but I don't have a hard and fast rule: usually I've just gone with whatever felt more right. But I'd guess that in the situations where you can get parts to update just by talking to them, it's in situations where you've already accumulated plenty of evidence about how things are, and the relevant parts just need to become aware of them. E.g. if you had some challenge which was very specifically about your childhood enviro

5Elo7y

One of the skills here is an open minded flow of discussion between parts. To get to an open minded discussion, the agents who are shutting down discussions need to form an agreement to discuss. That means no distraction, no sleepiness, no anxiety around the conversation. This open discussion can be done for one part at a time or for the global, "discussions are safe" paradigm. If "discussions are safe", then it's possible to ask the question, "what can't we talk about?" and find content/parts there. (there's still things I don't need to talk about very much, but I have no problem with them and talking about them. For example I prefer to look in an optimistic direction and point my mind there but I have no problem digging up all the fears, doubts and discomforts if that's needed)

4sampe7y

Thank you very much for the detailed reply! You answered all my questions. I got the Self-Therapy audiobook after writing my comment. Looks great so far.

Really enjoyed the post, thanks!

I started the Earley book and it's definitely a struggle. I usually can handle "soft skills" books like this one without getting frustrated by the vague, hand-wavy models (I really enjoyed Gendlin's Focusing, for example), but this one's been especially hard. That said, having your model in mind while I'm reading has kept me going, as I'm using it as a sort of Rosetta Stone for some of Earley's claims.

When I first read the post, I expected that "family systems" would be related to Hellinger's family constellations: this is a different method of psychotherapy which assumes a completely different set of "subagents" to define the human mind and its problems. Hellinger's constellation method assumes that a person's actual family relations have the biggest impact on the person's wellbeing (and motivation), and that the family structure is somehow internalised. This family structure could be invoked by group of people (assign... (read more)

3Kaj_Sotala7y

Wow. I didn't expect to see a therapy approach based on morphic fields.

1avturchin7y

I don't think its rational part is based on any "morphic fields". If a person thinks that her mother is a god and her father was a devil, and has suppressed any thoughts about the grandfather, that is an expected (but damaged) family structure imprinted in her brain, and she will repeat it when she tries to build her own relationships. The best way to learn more about family constellations is just to try one in a local group; at least in my case, it helped me to resolve a long conflict with my mother. A less effective way may be to read Bert Hellinger's early books: they provide a theory, but without some experience it may look a little strange.

A visceral, real world example:

Workers who are killed who can't let go of their tools because it's part of their identity. I suspect there is a Part (in IFS parlance) that tells them "this is your identity".

From the book Range (highly recommended):

In four separate fires in the 1990s, twenty-three elite wildland firefighters refused orders to drop their tools and perished beside them. Even when Rhoades eventually dropped his chainsaw, he felt like he was doing something unnatural. Weick found similar phenomena in Navy seamen who ignore... (read more)

Gensler is a practical/applied framework of Freud, whose influence continues to grow in the humanities (outside of the psychology department, wherever that chimera sits). Most of the commentary above would benefit from a basic understanding of primary Freud (Interpretation of Dreams, Ego and Id, Basic Introduction, Civilization and its Discontents). The key to Freud is his dogged insistence on the importance of non-empirical structures (metaphor, analogy) to human thought. My personal belief is that these are incidental artifacts of the development of lang... (read more)

This is very similar to the Lifespan Integration Therapy which I had in April 2020. The logic of this therapy is to connect you with your memories and dissolve the past traumas. I think I greatly benefited from it because I have stopped being afraid of certain moments of my life associated with having depression.

In general, I am reading this sequence because one of my dreams is to understand what consciousness and enlightenment are. There are few gears in my current models of these phenomena.

A psychologist told me that the newer "version" of this is Coherence Therapy. I've only just started to read up on this.

I've gotten enormous benefit just from being aware of my "parts" without even distinguishing b/t what role they play. Just realizing that they aren't having the effect they THINK they are.

5Kaj_Sotala6y

See my later post for a discussion of coherence therapy and its connection to IFS. :)

1MrAnalogy@gmail.com6y

yep, read that GREAT post. Any other suggestions for a starting point on Coherence Therapy?

2Kaj_Sotala6y

Thanks! I got some value out of this training guide, though it's primarily aimed at people who already have some therapy training.

(This lasted for awhile but not forever; I crashed hard in Se... (read more)

things that had been too scary for me to think about became thinkable (e.g. regrettable dynamics in my romantic relationships), and I think this is a crucial observation for the rationality project. When you have exile-manager-firefighter dynamics going on and you don't know how to unblend from them, you cannot think clearly about anything that triggers the exile, and trying to make yourself do it anyway will generate tremendous internal resistance in one form or another (getting angry, getting bored, getting sleepy, getting confused, all sorts of crap), first from managers trying to block the thoughts and then from firefighters trying to distract you from the thoughts. Top priority is noticing that this is happening and then attending to the underlying emotional dynamics.

Yes!

Valentine has also written some good stuff on this, in e.g. The Art of Grieving Well:

I think the first three so-called “stages of grief” — denial, anger, and bargaining — are avoidance behaviors. They’re attempts to distract oneself from the painful emotional update. Denial is like trying to focus on anything other than the hurt foot, anger is like clutching and yelling and getting mad at the situation,

... (read more)

Curated.

The internal family systems model has seen a lot of discussion in various rationalist and rationalist-adjacent places, but:

a) usually among people who were already familiar with it,

b) usually with a vague disclaimer of being a fake-framework, without delving into the details of where the limits of the framework lay or how to contextualize it in a broader reductionist worldview.

Really what I want is for Kaj's entire sequence to be made into a book. Barring that, I'll settle for nominating this post.

4Raemon5y

2Ben Pace5y

e.g. that happened with embedded agency and to some extent babble and prune

3Kaj_Sotala7y

1eggsyntax2y

1Kenny7y

So thank you very, very much for writing this.

4Kaj_Sotala7y

My 2 cents:

3Trinley Goldenberg7y

2avturchin7y

3waveman7y

This is one good way to rationalize them. It doesn't really much matter whether this is true or not.

It doesn't really much matter whether this is true or not.

I think it matters from the perspective that if subagents are simulated at query time, then a non-subagent model should be able to produce similar results to IFS, with fewer complications.

... (read more)

I'm actually kind of surprised that IFS seems so popular in rationalist-space, as I would've thought rationalists more likely to bite the bullet and accept the existence of their unendorsed desires as a simple matter of fact.

Some reasons for the popularity of IFS which seem true to me, and independent of whether you accept your desires:

It's the main modality that rationalists happen to know which lets you do this kind of thing at all. The other popular one is Focusing, which isn't always framed in terms of subagents, but in terms of the memory reconsolidation model it basically only does accessing; de- and reconsolidation will only happen to the extent that the accessing happens to trigger the brain's spontaneous mismatch detection systems. (Also the Bio-Emotive Framework has gotten somewhat popular of late, but that's a very recent development.)
Rationalists tend to really like reductionism, in the sense of breaking complex systems down into simpler parts that you can reason about. IFS is good at giving you various gears about how minds operate, and e.g. turning previously incomprehensible emotional reactions into a completely sensible chain of part

... (read more)

5pjeby6y

(adding to my other comment)

dividing people into lots of mini-people isn't a reduction.

4pjeby6y

9Kaj_Sotala6y

For example, firefighters are called that because “they are willing to let the house burn down to contain the fire”; that is, when they are triggered, they typically act to make the pain stop, without any regard for consequences (such as loss of social standing). At the same time, managers tend to be terrified of exactly the kind of lack of control that’s involved with a typical firefighter response. This makes firefighters and managers typically polarized - mutually opposed - with each other.

If we only talked about various behaviors getting reinforced, we wouldn’t predict that the system simultaneously considers a loss of a social standing to b

... (read more)

Does the following seem like a fair summary of what you are saying?

Re: IFS as a reductionist model:

Good reductionism involves breaking down complex things into simpler parts. IFS "breaks down" behavior into mini-people inside our heads, each mini-person being equally complex as a full psyche. This isn't simplifying anything.
Talking about subagents/parts or using intentional language causes people to assign things properties that they actually don't have. If you say that a thermostat "wants" the temperature to be something in particular, or that a part "wants" to keep you safe, then you will predict its behavior to be more flexible and strategic than it really is.
The real mechanisms behind emotional issues aren't really doing anything agentic, such as strategically planning ahead for the purpose of achieving a goal. Rather they are relatively simple rules which are used to trigger built-in subsystems that

... (read more)

7pjeby6y

7Kaj_Sotala6y

Here's my reply! Got article-length, so I posted it separately.

2Kaj_Sotala6y

Thanks for the clarifications! I'll get back to you with my responses soon-ish.

8Eli Tyre6y

I don't know what you mean by this at all. Can you give (or maybe point to) an example?

... (read more)

4Eli Tyre6y

3pjeby6y

2Kaj_Sotala6y

2pjeby6y

2Kaj_Sotala6y

It seems to me that the emotional schemas that Unlocking the Emotional Brain talks about, are basically the same as what IFS calls parts. You didn't seem to object to the description of schemas; does your objection also apply to them?

(Indeed, my own approach is specifically tar

... (read more)

6Eli Tyre6y

4pjeby6y

6Trinley Goldenberg6y

This is confusing Dissociation and Integration. I made a 2x2 that helps disambiguate. http://mattgoldenberg.net/wp-content/uploads/2019/10/2x2s-Integration-vs.-Association-2x2.jpg

4pjeby6y

2Trinley Goldenberg6y

4pjeby6y

4Trinley Goldenberg6y

I think a proper method should be everywhere. There's not a "correct" box, only a correct box for a given person at a given time in a given situation.

6Qiaochu_Yuan6y

Wow, thank you for writing this. This really clarified something for me that I'm in the process of digesting.

5DaystarEld6y

4pjeby6y

2Eli Tyre6y

Huh. This does not resonate with my experience, but I will henceforth be on the lookout for this.

9pjeby6y

4Eli Tyre6y

2pjeby6y

