Dangers of steelmanning / principle of charity

gothgirl420666

As far as I can tell, most people around these parts consider the principle of charity and its super saiyan form, steelmanning, to be Very Good Rationalist Virtues. I basically agree and I in fact operate under these principles more or less automatically now. HOWEVER, no matter how good the rule is, there are always exceptions, which I have found myself increasingly concerned about.

This blog post that I found in the responses to Yvain's anti-reactionary FAQ argues that even though the ancient Romans had welfare, this policy was motivated not by concern for the poor or by a desire for equality like our modern welfare policies, but instead "the Roman dole was wrapped up in discourses about a) the might and wealth of Rome and b) goddess worship... The dole was there because it made the emperor more popular and demonstrated the wealth of Rome to the people. What’s more, the dole was personified as Annona, a goddess to be worshiped and thanked."

So let's assume this guy is right, and imagine that an ancient Roman travels through time to the present day. He reads an article by some progressive arguing (using the rationale one would typically use) that Obama should increase unemployment benefits. "This makes no sense," the Roman thinks to himself. "Why would you give money to someone who doesn't work for it? Why would you reward lack of virtue? Also, what's this about equality? Isn't it right that an upper class exists to rule over a lower class?" Etc.

But fortunately, between when he hopped out of the time machine and when he found this article, a rationalist found him and explained to him steelmanning and the principle of charity. "Ah, yes," he thinks. "Now I remember what the rationalist said. I was not being so charitable. I now realize that this position kind of makes sense, if you read between the lines. Giving more unemployment benefits would, now that I think about it, demonstrate the power of America to the people, and certainly Annona would approve. I don't know why whoever wrote this article didn't just come out and say that, though. Maybe they were confused".

Hopefully you can see what I'm getting at. When you regularly use the principle of charity and steelmanning, you run the risk of:

1. Sticking rigidly to a certain worldview/paradigm/established belief set, even as you find yourself willing to consider more and more concrete propositions. The Roman would have done better to really read what the modern progressive's logic was, think about it, and try to see where he was coming from than to automatically filter it through his own worldview. If he consistently does this he will never find himself considering alternative ways of seeing the world that might be better.

2. Falsely developing the sense that your worldview/paradigm/established belief set is more popular than it is. Pretty much no one today holds the same values that an ancient Roman did, but if the Roman goes around being charitable all the time then he will probably see his own beliefs reflected back at him a fair amount.

3. Taking arguments more seriously than you possibly should. I feel like I see people in rationalist communities say stuff like "this argument by A sort of makes sense, you just need to frame it in objective, consequentialist terms like blah blah blah blah blah" all the time, and then follow with what looks to me like a completely original thought that I've never seen before. But why didn't A just frame her argument in objective, consequentialist terms? Do we assume that what she wrote was sort of a telephone-game approximation of what was originally a highly logical consequentialist argument? If so, where can I find that argument? And if not, why are we assuming that A is a crypto-consequentialist when she probably isn't? And if we're sure that objective, consequentialist logic is The Way To Go, then shouldn't we be very skeptical of arguments that seem like their basis is in some other reasoning system entirely?

4. Just having a poor model of people's beliefs in general, which could lead to problems.

Hopefully this made sense, and I'm sorry if this is something that's been pointed out before.

Steelmanning is not a courtesy or a service to my interlocutor. It is a service to me. It is my attempt to build the strongest case I can against my position, so I can shatter it or see it survive the challenge. The interlocutor might not agree, if I were to ask them, that my steelmanned argument is really stronger than theirs; that's no matter. I'm not doing it for them, I'm doing it for myself.

Steelmanning is always done for your own sake. It always says something new that the original owner of the argument didn't think of or at least didn't say. When put back into the discussion, it should be introduced explicitly as your words.

Remember, the steelmanned argument is your creation and is meant for you. You owe it to yourself to test your beliefs with it, but not necessarily in the context of this conversation. Not because concealing it is an easier way to victory, but rather because what's steelmanned for you might not be steelmanned or even interesting to your interlocutor. Their argument said A, and you may have found a way to strengthen it further to say B, but they might not want to claim B, to defend B, to agree that B is stronger than A. That said, if you do think the steelmanned argument would be useful to them, by all means introduce it, but explicitly as your own.

I agree, and this is sort of what I find problematic; I'll explain in a second. (Notice that all four "risks" I mentioned are risks to the Roman and not the progressive.)

Now, going to the example in the post, where the ancient Roman chooses to interpret a progressive argument for increasing welfare as "really" carrying between the lines the ancient Roman rationale. He is not doing a charitable reading of his interlocutor's words - they would definitely not agree that this is what they meant to say. And he is not steelmanning anything either, because he hasn't strengthened an argument against his own position; rather, he fortified his existing beliefs by manufacturing another fake confirmation. If he were to modify the progressive's argument in some way that would make it harder for him to interpret it in the ancient-Roman sense, that would be steelmanning.

I think I was a little unclear here, sorry. Imagine that the Roman is already against increasing welfare, for whatever reason. He first reads the progressive article and thinks that the progressive's argument is dumb. He then remembers steelmanning and re-interprets the article as arguing that welfare reform would incur Annona's favor. He finally realizes that the position isn't that bad when seen in this light, and begins to be a little less certain that increasing welfare would be a bad idea. This is sort of what I was imagining when I wrote the post. The belief that's being tested is not the entire ancient Roman worldview, it's whether or not welfare should be increased.

The thing is, when the Roman creates the new argument "increasing welfare would incur Annona's favor", that's a completely new idea that he came up with himself, and as such it should be held skeptically. Imagine if Annona in fact liked welfare when it was in the form of gold coins and hated it when it's in the form of a vague baseless digital currency, and the Roman had no idea, not being an Annonan priest. However, he might mistakenly think that the fact that the idea "we should increase welfare for equality" is fairly popular and held by smart people is authority for the idea "increasing welfare would incur Annona's favor", but in fact these are pretty distinct ideas.

I feel like the steelmanning process usually outputs a new argument that you can look at and say "yeah, that kind of makes sense". But I was reading some of the "Tupac is alive" conspiracy theories the other day, and I thought they kind of make sense. For me, an argument kind of making sense is pretty bad evidence for its truth - good evidence would be if I read the argument, then the rebuttals, then the rebuttals to the rebuttals, then the rebuttals to the rebuttals to the rebuttals, etc. until I finally found a point where I could say "okay, that really does make sense". But I haven't had the time, or likely the ability, to do this with most arguments, so I usually form my beliefs off of vague intuitions around authority. What I guess I'm afraid of is that I'll conflate my original steelman with a superficially similar popular argument, and then these intuitions will get corrupted and I'll be confused.

Obviously the Roman thing is a pretty dumb cartoony example and it seems too obvious to fall for in real life, but I feel like this usually works on a much more subtle, implicit level, and in fact I think that's why I have a lot of trouble putting it into words. I find this topic really confusing to talk about, so hopefully I didn't say anything too dumb. I think I mainly agree with your post, though, and what everyone else is saying. Again, I think steelmanning is 90% a good thing.
