LESSWRONG
LW

All of Screwtape's Comments + Replies

Interest In Conflict Is Instrumentally Convergent

I think of Ziz and co as less likely than 2std out, for about the reasons you give. I tend to give 200 as the rough number of organizers and groups, since I get a bit under that for ACX Everywhere meetups in a given season. If we're asking per-event, Dirk's ~5,000 number sounds low (off the top of my head, San Diego does frequent meetups but only the ACX Everywheres wind up on LessWrong, and there are others like that) but I'd believe 5,000~10,000.

Book Review: The Art of Happiness

Screwtape2mo42

And I wouldn’t really try to figure out how to be happy by looking at what religions say about it.

People have been thinking about the problem for thousands of years, most of the written answers we've got come from religion and philosophy. Maybe they're all terrible answers but virtue of scholarship, sometimes I read a book and check.

(Which is—I’m sorry to say it—perfectly illustrated by your whole “let’s look at the Wikipedia page for each religion and see how many times it mentions ‘happy’ or ‘happiness’” thing. Come on! Obviously that is a nonsensical th

... (read more)

2Said Achmiz2mo

Don’t get me wrong—I appreciate the book review! Yeah, of course. All of the above. (I forget if the book was about Buddhism specifically or comparative religion more generally; it was in college, which was a while ago. The rest, relatively recent. I mean, Buddhism has gotten a lot of coverage here on LW, among other things. Then there’s David Chapman’s writings on Buddhism… it’s not like there’s any shortage of sources!) I am kind of confused by this reaction. I mean, you said a thing (“Buddhism is unusually attentive to the whole suffering and joy thing”), and as far as I could tell, this was you saying it, not you reporting a claim that was made in the book. (Am I mistaken about this?) And I am asking: do you actually for real believe this? If so, why do you believe it? I’m not asking for citations, like I’m a reviewer of an academic paper that you submitted. It’s not a criticism that you need to address in order to placate me. Answers I would expect might include things like: “Ah, no, this is not my belief, this is a thing the book says.” “Yep, I think that this is actually true, because [reasons].” “Eh, I dunno, I guess it’s more like a vague impression, but maybe it’s wrong; I haven’t thought about it too hard… you think I’m wrong about this? Say more?” “Actually I was being sarcastic. I don’t really think that!” “Yes I more or less believe this but I can see why many people would take the opposite view, it’s complicated, but anyhow it’s not critical to the review.” “Of course I believe it, why wouldn’t I? Isn’t it very obviously and uncontroversially true? Why, do you claim otherwise…? On what basis?” Any of those would make sense as an answer to my question! Ok, see, this sounds like “actually I was reporting a claim made in the book”, and if that’s the full explanation then, cool, that does in fact answer my question. (Obviously it could instead be that the book claims this but your line about it was still you describing your own independent bel

Book Review: The Art of Happiness

Screwtape2mo20

I disagree with #1

Okay, let me try starting from before that point then.

There's a bunch of things commonly referred to as emotions. Happiness is one. Anger is another. Sadness is a third. This list is not exhaustive but there's lots of lists of emotions, here's one.

Have you ever felt any of these, such that you could say "gosh, I'm really angry right now" as a fact about the world?

basically agree with #3, and agree with what #4 says but almost certainly not with what you mean by it. (I also—obviously—disagree with the notion that these things form so

... (read more)

2Said Achmiz2mo

I do not agree with this. (But if you consider happiness to be an emotion, then it makes even less sense to optimize for it!) I think many of the things on that list are not emotions. (I mean, surely “creative” is not an emotion? “Respected”? “Intimate”? “Exposed”? “Sceptical”? “Judgmental”??) Of course. You wrote (#1): I think that thinking of happiness as a “state” is basically a mistake even if there’s some technical sense in which it’s true. I also do not think that people are very good at noticing whether they’re happy or unhappy. I wouldn’t go so far as to say “can’t” but it’s not like checking whether you’re angry (which usually has relatively standard and straightforwardly checkable physiological correlates). I expect that the average person asking “am I happy right now” is more likely than not to get a wrong or nonsensical answer. You also wrote (#2): As above, I don’t think that happiness is a “state” in a useful sense of the word, but I do think that humans can (and most, probably, do) prefer to be happy rather than not being happy. And (#3): Yep, generally true. One can quibble with this (e.g. the caveats that you give in parentheses), but as a basic pattern it is true. But: Ah, but notice the difference: “make a person happier” vs. “make a person happy”. The latter makes sense if happiness is just an emotional state that one can be in. But if that’s not right—if happiness is more like a characteristic of a person’s experience over time, for example—then the latter form doesn’t make much sense anymore. But the former makes sense either way.

Book Review: The Art of Happiness

Screwtape2mo30

It sounds like you've got the same thing going on that the book is talking about, and I'm either labeling a different thing "compassion" in my head than you and the Dalai Lama are talking about or something else is going on.

Book Review: The Art of Happiness

Screwtape2mo20

Possible but from reading the book I think that interpretation is unlikely to be correct. The authors seemed to think empathy itself brought some joy even in these cases.

Interest In Conflict Is Instrumentally Convergent

Screwtape2mo42

Well… hard to say. The LW mods now pass that threshold^[1], but then again they’re not beginning now; they began eight years ago.

My sense is that if the mods had waited to start trying to moderate things until they met this threshold, they wouldn't wind up ever meeting it. There's a bit of, if you can't bench press 100lbs now, try benching 20lbs now and you'll be able to do 100lbs in a couple years, but if you just wait a couple years before starting you won't be able to then either.

Ideally there's a way to speed that up and among the ideas I have for that ... (read more)

2Hastings2mo

fwiw, these are what I'd say a 2std failure case of a rationalist meetup looks like https://www.wired.com/story/delirious-violent-impossible-true-story-zizians/ https://variety.com/2025/tv/news/julia-garner-caroline-ellison-ftx-series-netflix-1236385385/ https://www.wired.com/story/book-excerpt-the-optimist-open-ai-sam-altman/ (Ways my claim could be false: there could have been way more than 150 rationalist meetups, so that these are lower than 2 std, or these could not have, at any point in their development, counted as rationalist meetups, or ziz, sam, and eliezer could have intended these outcomes, so these don't count as failures)

2Said Achmiz2mo

FWIW, my experience is that rationalist meetup organizers are in fact mostly terrible at picking a location and at bringing snacks. (That’s mostly not the kind of failure mode that is relevant to our discussion here—just an observation.) Anyhow… All of this (including the sentiment in the preceding paragraph) would be true in the absence of adversarial optimization… but that is not the environment we’re dealing with. (Also, just to make sure we’re properly calibrating our intuitions: −2std is 1 in 50.) No, I don’t think that’s it. (And I gave up on the “a rationalist meetup aught to have some rationality practiced” notion a long, long time ago.)

Book Review: The Art of Happiness

Screwtape2mo20

. . . okay, wait, looking at your other comments below in this thread was this supposed to be a joke?

1Said Achmiz2mo

I mean, the bit about the food was mostly intended as humor. And I wouldn’t really try to figure out how to be happy by looking at what religions say about it. Now, if I were inclined to do so, I’d look at Judaism, like I said—but that’s at least partially a reductio ad absurdum; “which religion is the most joyful” strikes me as a somewhat silly question, and “Judaism” is a somewhat silly answer. It’s… not wrong? (IMO, of course.) But, like… this whole endeavor is just the wrong way of thinking about the matter. (Which is—I’m sorry to say it—perfectly illustrated by your whole “let’s look at the Wikipedia page for each religion and see how many times it mentions ‘happy’ or ‘happiness’” thing. Come on! Obviously that is a nonsensical thing to be doing here! It isn’t even a “quick and dirty approximation” to anything; it’s just noise.) But the first paragraph—no, that’s a serious question/comment. I really was surprised to read the claim that Buddhism is somehow unusually joyful or unusually “attentive to” joy or can be expected to have better insights about joy, etc. On the contrary, Buddhism strikes me as a religion which is deeply anti-life and pro-death (very “life goals of dead people”). I would not even consider going to such people for insights about joy, of all things. Hence my somewhat incredulous comment.

Book Review: The Art of Happiness

Screwtape2mo60

The ham sandwich replacement works pretty well in my head, I don't understand the Olympic medal analogy, and the lottery tickets thing is confusing to me. I agree people can be mistaken about what they want (though I don't default to assuming that) and I'm confused what you think a lot of people actually want and mistake for wanting to be happy? We seem to have some kind of communication problem or alternately pretty different experiences of happiness, and I'm not sure what's going on here. I'm glad we're on the same page about thinking by the clock and be... (read more)

0Said Achmiz2mo

Huh. Yeah, this amount of communication failure surprises me. I did expect that you’d disagree with at least some of that, but not that it wouldn’t get across at all… I disagree with #1, partially disagree with #2 (the first sentence is ok-ish but the given test is basically meaningless and so can’t be the basis for any of this reasoning), basically agree with #3, and agree with what #4 says but almost certainly not with what you mean by it. (I also—obviously—disagree with the notion that these things form some kind of meaningful sequence of specificity or claim strength etc.) Sure, I didn’t think that you were saying any of those things, so this all makes sense.

Book Review: The Art of Happiness

Screwtape2mo41

Largest world religions by followers: Christianity, Islam, Hinduism, Buddhism, Folk Religion. I know Christianity decently, and while joy is something you're sometimes supposed to get out of a life lived in accordance with God it isn't the point. (There are many Christian variants, I haven't dug into every interpretation, I think I'm generally right here.) Islam, I thought I read the Quran at one point but admit I can't remember much of anything. For Hinduism I read the Bhagavad Gita and remember the outline and some choice quotes, I assert that happiness ... (read more)

2Said Achmiz2mo

A helpful analysis.

2Said Achmiz2mo

Of course, but step one is not exactly under one’s control, as a Jew… hardly seems fair to count that against the religion itself! Matter of taste, I suppose…

Book Review: The Art of Happiness

Screwtape2mo64

Do you want to be happy?

"No" is an answer. Maybe you want personal growth, maybe you want the accomplishment of some goal, maybe you want a ham sandwich. If you did want to be happy, and commenting on LessWrong posts did not make you happy, then I think you should consider what you're doing for a couple minutes then go do something else which will make you happier.

Lots of people want to be happy. Uh, citation needed and I don't have one. I think actively pursuing happiness is a pretty normal thing to do, albeit subject to all the usual mistakes that people... (read more)

2Said Achmiz2mo

But… that transformation doesn’t work, at all. Actively pursuing a ham sandwich makes perfect sense. (And if you want a ham sandwich, asking whether what you are doing will in fact get you a ham sandwich is likewise a sensible thing to do.) But actively pursuing happiness, specifically, doesn’t make sense. That’s my point. I mean, it’s clearly better than the alternative (it’s not like I want to not be happy, much less want to be unhappy), but on the other hand, I don’t “want to be happy” in the same sense that I want most other things. Wanting to be happy seems like like, say, wanting to get an Olympic gold medal. If you train for the Olympics in some sport, and compete, and win, then you will get a gold medal, and that’s cool. On the other hand, suppose that due to some geopolitical reasons or something, all the other athletes in that sport stay home that year, and you “compete” against nobody, and “win”, and get a gold medal. Did you get what you wanted? Or, how about if you blackmail all the judges with compromising photographs, and they award you a gold medal even though you performed terribly and clearly lost? Conversely, what if you compete in the Olympics, and win, but your opponent blackmailed the judges, and you don’t get a gold medal, even though you “should”. Is that better or worse than the preceding two scenarios? Seems like… better, no? Clearly the gold medal wasn’t actually the point… right? But if I ask you “do you want to get an Olympic gold medal”, “no” seems like the wrong answer. Well, but what about “actively pursuing an Olympic gold medal”? That sort of makes sense, but… not really. “Winning the Olympics (in whatever sport)”, that makes sense. And one does get gold medals for that. But the medal itself is not exactly the point. I certainly won’t demand a citation for this, but I think that probably a lot more people mistakenly think that they “want to be happy” than actually want to be happy. Well… it’s hard to know how to take this. Li

Interest In Conflict Is Instrumentally Convergent

Screwtape2mo20

Er, sorry, I think you might’ve misread my comment? What I was saying was that the more seriously the people with the power and authority take the problem, the better it is. (I think that perhaps you got the direction backwards from how I wrote it? Your response would make sense if I had said “directly proportional”, it seems to me.)

"And the severity of the failure will be inversely proportional to how seriously the people with the power and authority take this problem, and to how much effort they put into addressing it."

Hrm. Yes, I seem to have read it di... (read more)

2Said Achmiz2mo

Well… hard to say. The LW mods now pass that threshold[1], but then again they’re not beginning now; they began eight years ago. Yes… essentially, this boils down to a pattern which I have seen many, many times. It goes like this: A: You are trying to do X, which requires Y. But you don’t have Y. B: Well, sure, I mean… not exactly, no. I mean, mostly, sort of… (a bunch more waffling, eventually ending with…) Yeah, we don’t have Y. A: So you can’t do X. B: Well, we have to, I’m afraid… A: That’s too bad, because you can’t. As we’ve established. B: Well, what are we going to do, just not do X? A: Right. B: Unacceptable! We have to! A: You are not going to successfully do X. That will either be because you stop trying, or because you try but fail. B: Not doing X is not an option! B tries to do X B fails to do X, due to the lack of Y A: Yep. B: Well, we have to do our best! A: Your best would be “stop trying to do X”. B ignores A, continues trying to do X and predictably failing, wasting resources and causing harm indefinitely (or until external circumstances terminate the endeavor, possibly causing even more harm in the process) In this case: a bunch of people who are completely unqualified to run meetups are trying to run meetups. Can they run meetups well? No, they cannot. What should they do? They should not run meetups. Then who will run the meetups? Nobody. Now, while reading the above, you might have thought: “obviously B should be trying to acquire Y, in order to successfully do X!”. I agree. But that does not look like “do X anyway, and maybe we’ll acquire Y in the process”. (Y, in this case, is “the skills that we’ve been discussing in this comment thread”.) It has to be a goal-directed effort, with the explicit purpose of acquiring those skills. It can be done while also starting to actually run meetups, but only with an explicit awareness and serious appreciation of the problem, and with serious effort being continuously put in to mitigate

Interest In Conflict Is Instrumentally Convergent

Screwtape2mo20

I disagree your scenario is more realistic.

“Now let’s say that the system is already full of bad actors (as it probably is). They have considered what and who might stop them, and what they might do about it. The system will, of course, be corrupt, and the people running that system will be horrible or incompetent. It is an obvious, straightforward move to promote memes that prevent this from being rectified.”

I think that happens sometimes, and higher pressure scenarios are more likely to be targets for this. Most of my disagreement is that I think m... (read more)

9Said Achmiz2mo

Yes, perhaps most people are trying to do the right thing, but (a) they are mostly not trying very hard, and (b) trying to do the right thing is just not anywhere close to sufficient for actually doing the right thing. It is extremely easy to just find yourself doing the wrong thing, if you are not systematically and effectively avoiding all the things that nudge you toward doing the wrong thing. This is why I have emphasized, throughout this discussion, that I am not accusing anyone in particular of bad faith or bad character, and that not only should you trust no one, you should not even trust yourself, because “trying to do the right thing” is not sufficient even from your own perspective. I do not, but I will take the question as a compliment. Yes. But there is a difference between making a considered judgment not to verify every step of some process, or not to check every instance, etc., and simply not having thought about it. (At the very least, the former decision can be revisited, re-evaluated, updated—the latter decision cannot even be acknowledged or accounted for, because it was never made in the first place!) And, of course—as per my other comment—it may well be that the answer to “if we had to do this the ‘proper’ way, then we couldn’t do it at all” is “then you shouldn’t do it at all”. Re: the “every police force” commentary—the whole “make any move toward trying to change the system” was mostly intended to qualify the “harmless weirdoes” scenario, not to necessarily apply to all people of any sort. (But also, the “moved into the criminal category” distinction is pretty important. But this is a tangent at this point, so let’s table it for now…) FWIW, I think that approaches to conflict resolution in in-person meetups and on online forums should differ considerably, for many reasons, but certainly in large part due to the different ways in which evaluation of bad actors / problems / etc. can/must happen in those two types of contexts. I would not

Interest In Conflict Is Instrumentally Convergent

Screwtape2mo20

I'm trying to come up with people that I think actually reach the standard you're describing. I think I know maybe ten, of which two have any time or interest in handling meetup conflicts.

I do agree there's some big failures that can happen when the people with authority to solve the problem take it very seriously, put a lot of effort into addressing it, and screw up. I don't agree that relationship is inversely proportional; if I imagine say, a 0 effort organizer who does nothing vs a 0.1 effort organizer who only moderates to say "shut up or leave"... (read more)

2Said Achmiz2mo

Er, sorry, I think you might’ve misread my comment? What I was saying was that the more seriously the people with the power and authority take the problem, the better it is. (I think that perhaps you got the direction backwards from how I wrote it? Your response would make sense if I had said “directly proportional”, it seems to me.) “Better not to begin” wouldn’t be “4chan”, it would be “nothing”. I agree that the moderators on Less Wrong aren’t quite at the level we’re talking about, but they’re certainly closer than most people in most places. (And many of what I perceive to be mistakes in moderation policy are traceable to the gap between their approach, and the sort of approach I am describing here.) At the very least, it’s clear that the LW mods have considerable experience with having to evaluate advice that does, in fact, come from their (our) enemies.

Interest In Conflict Is Instrumentally Convergent

Screwtape2mo20

I would be delighted to have the social equivalent of a zero trust conflict resolution system that everyone who interacted with it could understand and where the system could also maintain confidentiality as needed. I'm in favour of the incremental steps towards that I can make. In the abstract, I agree the procedure for evaluating advice should work even if it comes from bitter enemies. I do not think my personal evaluation procedure is currently robust enough to handle that, though tsuyoku naritai, someday maybe it will be.

The main context I encounter th... (read more)

2Said Achmiz2mo

Yes, but in the absence of this, every other approach is doomed to failure. And the severity of the failure will be inversely proportional to how seriously the people with the power and authority take this problem, and to how much effort they put into addressing it. Respectfully, I disagree. I think that this is the only standard that yields workable results. If it cannot be satisfied even approximately, even in large part (if not in whole), then better not to begin.

Interest In Conflict Is Instrumentally Convergent

Screwtape2mo20

That is not the argument I'm trying to make.

The argument I'm trying to make is that conflict resolution is hard in a particular way that approximately nothing else in running events or communities is hard; it's potentially adversarial, therefore taking the advice of people with strongly held and seemingly sensible advice can be a trap.

The bullet pointed personas are not a load bearing part of this thesis. If it would help, try dropping everything from "let's be reductive" to "what they might do about it". I think the only other part I'm directly referencin... (read more)

2Said Achmiz2mo

The idea that situations/problems that involve conflicts, and require resolving conflicts, are more challenging than situations/problems that don’t involve conflicts, is trivial. If someone thinks that conflict handling should be simple then of course that person is an idiot. If this were all that you were saying, then it would hardly be worthy of a post. However. In the OP you write: But a more realistic scenario would be: “Now let’s say that the system is already full of bad actors (as it probably is). They have considered what and who might stop them, and what they might do about it. The system will, of course, be corrupt, and the people running that system will be horrible or incompetent. It is an obvious, straightforward move to promote memes that prevent this from being rectified.” (You might protest that sure, this happens, but you know that you are not a bad actor, right? Even if nobody else can be sure of this, you at least can! And you’re talking to people who also know that they are not bad actors. And I say that even this is false. You don’t know this. The people in your target audience—other organizers, etc.—also don’t know this about themselves. And certainly nobody else should take your (or their) word for it.) You write: But in fact this should read more like this: “Okay, but why should you trust me? Good question; the answer is that you definitely shouldn’t trust me—especially since complaint handling is part of my role, so I have a professional interest, which makes me exceptionally likely to be a bad actor. Do not trust! Verify! If you can’t verify, assume treachery until proven otherwise!” This is an admirable goal and I applaud it. My comments are aimed precisely at this goal also. You can read them as saying “your setup does not and cannot succeed at this, so long as you take the approach described in the OP”. I… didn’t claim that any police force criminalizes everyone who makes any move towards trying to change the system, so… no bet!

Interest In Conflict Is Instrumentally Convergent

Screwtape2mo40

Not sure how referential "you" vs general "you" you're using when you're talking about assuming some "you" is the KGB. I do think it's useful to build a system which does not assume the watchman is perfectly trustworthy and good. In my own case, one of the first things I did once I started to realize how tricky this part of my role might be was write down a method for limited auditing of myself. That said:

Your own approach and policies should work unproblematically even if everyone assumes that you are basically the KGB. (This is especially true if you are

... (read more)

2Said Achmiz2mo

Indeed. But really, I wouldn’t say “suspicious”, exactly; I’d say “yes, it makes perfect sense that you would say this”. This isn’t even an accusation, or anything like that. It’s just the logical outcome of the setup. The question is, can a bad actor shake everyone’s trust in the system? If they can, then is it really a good system? The best answer to “should I trust you[r system]?” isn’t “yes, you should, and here is why”. It’s “you don’t have to”.

Interest In Conflict Is Instrumentally Convergent

Screwtape2mo84

Like I said, I don't have a solution. At least, not one I'm confident and certain of. I have other essays in the pipeline with (optimistically) pieces of it.

I don't think it's doomed. Most security experts a bank would reasonably hire are not bank robbers, you know? I assume that's true anyway, I'm not in that field but somehow my bank account goes un-robbed.

Checking where wildly different spheres agree seems promising. The source of advice here that I trust the most comes from a social worker who I knew for years who hadn't heard of the rationalist commun... (read more)

1samuelshadrach2mo

If you're good at it, you can purchase the knowledge without giving them a position of power. Intelligence agencies purchase zero days from hackers on black market. Foreign spies can be turned using money to become double agents.

3Viliam2mo

Yes, it would be useful to know how exactly that happens. I suspect that a part of the answer is how formal employment and long-term career changes the cost:benefit balance. Like, if you are not employed as a security expert, and rob a bank, you have an X% chance of getting Y money, and Z% chance of ending up in prison. If you get hired as a security expert, that increases the X, but probably even more increases the Z (you would be the obvious first suspect), and you probably get a nice salary so that somewhat reduces the temptation of X% chance at Y. So even if you hire people who are tempted to rob a bank, you kinda offer them a better deal on average? Another part of the answer is distributing the responsibility, and letting the potential bad actors keep each other in check. You don't have one person overseeing all security systems in the bank without any review. One guy places the cameras, another guy checks whether all locations are recorded. One guy knows a password to a sensitive system (preferably different people for different sensitive systems), another guy writes the code that logs all activities in the system. You pay auditors, external penetration testers, etc. There is also reputation. If someone worked in several banks, and they those banks didn't get robbed, maybe it is safe to hire that person. (Or they play a long con. Then again, many criminals probably don't have patience for too long plans.) What about your first job? You probably get a role with less responsibility. And they probably check your background? ...also, sometimes the banks do get robbed; they probably do not always make it public news. So I guess there is no philosophically elegant solution to the problem, just a bunch of heuristics that together reduce the risk to the acceptable level (or rather, we get used to whatever is the final level). So... yeah, it makes sense to learn the heuristics... and there will be obvious objections... and some of the heuristics will be expensive

1Richard Horvath2mo

Any opinion on this regarding being a somewhat good solution? https://www.lesswrong.com/posts/Q3huo2PYxcDGJWR6q/how-to-corner-liars-a-miasma-clearing-protocol

Interest In Conflict Is Instrumentally Convergent

Screwtape2mo72

And yet I notice that the view that “people who have opinions about how [whatever] should be done are unusually likely to be bad actors who want me to do [whatever] in such a way as to benefit them, therefore I should be suspicious of their motives and suggestions” is memetically adaptive. Whenever you come across this idea, it is to your benefit to immediately adopt it—after all, it means that you will thenceforth need to spend less effort evaluating people’s suggestions and opinions, and have a new and powerful reason to reject criticism. And the idea pr

... (read more)

8Said Achmiz2mo

Probably nobody, but then again, your sourdough is probably not impinging on anyone’s interests, either. Baking a loaf of sourdough doesn’t really come with opportunities to exploit other people for your own gain, etc. So of course there’s not going to be much controversy. But whenever there is controversy, usually due to the existence of genuinely competing interests, then motives for sabotage become plausible, whereupon it immediately becomes tempting to declare that those who think that you ought to be doing things differently are just trying to sabotage you. I agree, it certainly is an adversarial situation—and not only sometimes, but most of the time. And I agree that you should not uncritically trust advice that you hear from any sources. In fact, you shouldn’t even trust advice that you hear from yourself. Consider your bank example again. You might think: “hmm, that guy has an odd amount of knowledge of, and/or interest in, internal bank practices and security and so on; suspicious!”. Then you learn that he works at a bank himself, so it turns out that his knowledge and interest aren’t suspicious after all—great, cancel that red flag. No! Wrong! Don’t cancel it! Put it back! Raise two red flags! (“An analysis by the American Bankers Association concluded that 65% to 70% of fraud dollar losses in banks are associated with insider fraud.”) Suspect everyone, especially the people you’ve already decided to trust! But of course “suspect” is exactly the wrong word here. If you’re having to suspect people, you’ve already lost. Consider computer security. I ask about the security software that your company is using to protect your customers’ data—could I see the code? Which cryptographic algorithms do you use? You’re suspicious; what do I need this information for? Who should be allowed to have this sort of knowledge? And of course the right answer is “absolutely everyone”. It should be fully public. If your setup is such that it even makes sense to ask this

The Lizardman and the Black Hat Bobcat

Screwtape2mo20

In a sense, Scott and I have the power to make decisions based on whatever kind and level of knowledge we want. I could go to Scott and say something like hey, I've got a bad vibe here, zero Verifiable Documented Evidence but it's an intuition, lets permanently and publicly ban this person. Or skip going to Scott- I think I only loop him in on ~10% of conflicts I'm aware of.

How much do you trust my intuition? How much do I trust it?

I trust local ACX organizers a lot. If someone comes to me and says the local ACX organizer banned them from the local meetup,... (read more)

Kodo and Din

Screwtape2mo30

I think context clues do usually make the difference between signal as in cell signal vs. kodo clear. I'm less confident that context will usually make the difference between signal as in signal and the noise vs. kodo clear. Most conversations I have with other people where I'd want to use it, I expect they won't have this concept and it's not worth pausing whatever conversation we were having to explain kodo.

(Like, prior to me writing this up I think there were maybe a hundred people in the world who'd heard these terms used this way, because there ... (read more)

Kodo and Din

Screwtape2mo10

Signal feels overloaded to me and kodo seems like a much more narrow concept.

"I've got pretty good cell signal." "The elite spend a lot of time virtue signalling." "The castaways used smoke signals to get rescued." "Careful configuration is required to prevent signal interference between network nodes." "I think there's a pretty good signal in the polling data." "Ted's not actually being mean to me, he's just counter-signalling."

I'd basically agree that kodo is a subcategory of signal, in the same way that chicken is a subcategory of bird. The narrower con... (read more)

3rsaarelm2mo

It seems like the neologism is mostly capturing the meaning of signal from Shannon's information theory (which "signal and noise" points towards anyway), where you frame things by having yes/no questions you want to have answered and observations that answer your questions are signals and observations that do not are noise. So if you need to disambiguate, "signal (in the information-theoretic sense)" could be a way to say it.

purple fire2mo1520

I mean this is a summary of a talk I didn't see so I want to reserve some judgement. But at the same time I just can't imagine being in a situation where I don't want to use the word signal because the other person might think I'm talking about cell signal, so instead I bust out "kodo"

In general, I think we should be pretty wary of taking basic ideas and dressing them up in fancy words. It serves no purpose other than in-group signaling.

4Said Achmiz2mo

So, are “kodo” and “din” just “‘signal’ (specifically as used in the phrase ‘signal to noise ratio’)” and “noise (for example, as used in the phrase ‘signal to noise ratio’)”, respectively? Hmm, but you say: … but I don’t understand the distinction, or what “kodo” means, then. (The poll example is unenlightening; I can’t map the word as you are using it there to any concept I am aware of.)

Kamelo: A Rule-Based Constructed Language for Universal, Logical Communication

Screwtape3mo60

I love a good conlang. This one feels like an interesting start, though I'll be upfront and say I don't think this is a bottleneck on anything AI related.

Some thoughts and questions, in no particular order:

"Here's how words are made" is a start. What's the grammar like? I think that's where a lot of ambiguity creeps in to language.
Am I allowed to stop an encoding partway? For instance, am I allowed to say kakasu meti su to just mean noun, it's a fruiting plant in the Rosaceae family, or do I have to keep going to be grammatically correct?
I kind of like th

... (read more)

3Saif Khan3mo

Hey, Thanks so much for diving into Kamelo, you’ve nailed exactly the kind of questions I’m wrestling with. Grammar & Ambiguity You're totally right — grammar is where ambiguity really enters. Right now, Kamelo doesn’t have a fixed grammar yet. But the idea is: * Word order is generally SVO (subject-verb-object), like English. * Modifiers (adjectives, adverbs) follow what they modify. * Punctuation-like tokens may act as "semantic closers" to end a branch of a conceptual tree. Stopping Mid-Structure Yes! You can stop mid-encoding. That's a key principle: Kamelo is compressible based on shared context, like how we say "the fruit" instead of "a Rosaceae angiosperm of genus Malus". The idea is to transmit enough meaning for the moment, and go deeper if needed. That’s why a base like kakasu meti su ("noun, fruiting plant, [Rosaceae]") could be totally valid in conversation, and even shorten further in high-context. Phoneme Clarification (IPA) This is still flexible, but currently considering: SyllableIPANoteska/ka/like "car"me/me/like "meh"ti/ti/like "tea"su/su/like "soo"lo/lo/like "low" The goal is max distinctiveness across modalities — so these sounds are spread in mouth shape, tongue placement, and timing (good for speech-to-sign or tactile mapping later). Pronouns Pronouns aren’t fixed "words" like in English. Instead, they act like references. For example: * ka → "living entity" * Then you can say lo after that in the same convo to refer back to that entity. So something like: ka ti = "the dog" lo me = "it is happy" (Assuming me = happy or emotive state) They behave more like pointing mechanisms in programming, and are scope-bound to context. Final Thoughts You're spot on: 5 syllables is limiting — Kamelo is intentionally extreme, like a design provocation. It pushes me to see how much abstraction and compression can be done before the system collapses. Future iterations might have 12–20 syllables for balance.

The Bell Curve of Bad Behavior

Screwtape3mo104

Debbie's particular shape is arranged in part to isolate honesty and predictability as useful. If I'd just had her hiding bad things and confabulating good things I'd worry the takeaway would be solely that doing bad things or having a bad average was the problem, so I set her up such that the average stayed put and the curve just flattened out. I think the individual pieces do make sense though, if not in that particular combination.

Hiding good actions happens due to humbleness or status regulation or shyness or just because it's private.

A church ne

Screwtape3mo30

Frontpage is mostly what the admins and mods think is worth frontpaging, plus what users upvote. It's also a positional good, there can only be so many things on the front page. This is a more specific and useful question though! Yeah, if the LW team frontpaged more AI governance and less of everything else, and the average user upvoted more AI governance and less of everything else, the frontpage would have more AI governance on it. I wouldn't be a fan, but I'd understand the move that was the goal. My understanding is that's not the goal.

Not having a use... (read more)

Why does LW not put much more focus on AI governance and outreach?

Screwtape3mo30

I don't have the technical AI Safety skillset myself. My guess is to show up with specific questions if you need a technical answer, try and make a couple of specific contacts you can run big plans past or reach out to if you unexpectedly get traction, and use your LessWrong presence to establish a pointer to you and your work so people looking for what you're doing can find you. That seems worthwhile. After that, maybe crosspost when it's easy? Zvi might be a good example, where it's relatively easy to crosspost between LessWrong and Substack, though he's... (read more)

1Katalina Hernandez3mo

Thank you very much for your advice! Actually helps, and thanks for running that search too :).

Why does LW not put much more focus on AI governance and outreach?

Screwtape3mo60

It might be useful for you to taboo "LessWrong" at least briefly.

I have a spiel that may turn into a post someday about how communities aren't people, the short version being that if you ask "why doesn't the community do X?" the answer is usually that no individual in the community took it upon themselves to be the hero. Other times, someone did, but the result didn't look like the community doing X it looks like individuals doing X.

Is the question "why does the average user on this website not put much more focus on AI Governance and outreach?" Half of Le... (read more)

2Severin T. Seehrich3mo

Good catch! My implicit question was about what ends up on the frontpage, i.e. some mix of version 1 and 3. A friend of mine answered the sociological side of that question to my satisfaction: Many of the most competent people already pivoted to governance/outreach. But they don't have much use for in-group signalling, so they have quantitatively much less posts on the frontpage than others.

2Katalina Hernandez3mo

I would argue that it is people in AI Governance (the corporate "Reponsible AI" kind) that should also make an effort to learn more about AI Safety. I know, because I am one of them, and I do not know of many others that have AI Safety as a key research topic in their agenda. I am currently working on resources to improve AI Safety literacy amongst policy people, tech lawyers, compliance teams etc. Stress-Testing Reality Limited | Katalina Hernández | Substack My question to you is: any advice for the rare few in AI Governance that are here? I sometimes post with the hope of getting technical insights from AI Safety researchers. Do you think it's worth the effort?

Paper

Screwtape3mo30

I'll speak up for notecards: I use binder clips to sort them by category or date once in a while. While they are a bit small for complex or detailed drawings, in a pinch you can lay them slightly overlapping (perhaps with a little tape on the back) and get as big a sheet as you want. They won't replace my sketchbook for doing portraiture anytime soon, but that's a minority of my paper time.

Overall, I love this post and I like hearing other people's approaches to paper!

The Lizardman and the Black Hat Bobcat

Screwtape3mo20

This post feels like it may have been written in response to some specific interpersonal drama. If it was, then I'd like to make it clear that I have absolutely no idea what it was and therefore no opinion on it. I just think this is a useful concept in general.

Thumbs up, I appreciate knowing it lands even for people with no idea of the specific cases.

Other than the murder thing, I'm talking about something I've seen more than once. Like I said in the post, part of what I'm supposed to do for ACX meetups is handle complaints, which creates some unusu... (read more)

The Lizardman and the Black Hat Bobcat

Screwtape3mo30

Basically agreed.

Though also relevant is the degree of maliciousness required and what the subject might get out of it. In the "bobcat instead of office chair" example, this is pretty willful willingness to cause physical harm and the sender doesn't really get anything out of it other than sadistic kicks and making the world much weirder. If the sender sent a much cheaper chair model, there's a less weird motivation (they keep the change) and there's less extra work involved.

The Lizardman and the Black Hat Bobcat

Screwtape3mo114

I'm going to note I'm having a little trouble parsing your sentences here.

Strong downvoted for not just saying what you're really thinking to the person you have a criticism about which is almost definitely wrong.

I think the thing you're saying is that you downvoted because you think instead of writing this essay, I should have told a specific person that I think they're being some kind of jerk (mailing metaphorical bobcats) to a small number of people while being nice to the majority of people. Further, that I'm incorrect about how bad the jerkishness is.... (read more)

The Lizardman and the Black Hat Bobcat

Screwtape3mo80

Yep, and also as things scale you just get less information about everyone.

An random local meetup might fit in one room, sometimes splitting into two rooms so it's easier to have multiple conversations. I can have line of sight to everyone at once and hear it if voices start getting raised. With meetups in ten cities, I can at least wave at most attendees, and have had a couple hours of conversation with the organizers. With meetups in a hundred cities, I have only demographic guesses about who the attendees are, and it takes time and effort to... (read more)

The Lizardman and the Black Hat Bobcat

Screwtape3mo40

Somewhat agreed.

I'm trying to point at something loosely in this vicinity in section V, about hunting in packs - replace "one of them has three good friends" with "one of them paid three people" - where sometimes a bunch of negative reports are happening because someone is making up or deeply exaggerating accusations and routing them to you through different sources. I don't know that it's my first assumption; I currently think "Erin is mailing metaphorical bobcats to a small number of people" happens more often than "Frank is coordinating a bun... (read more)

2Said Achmiz3mo

Yes, of course. But your odds that it’s a competitor’s plot should also go up—and will end up higher by far. (This is one of the myriad examples of what Jaynes called the “resurrection of dead hypotheses”.)

Socially Graceful Degradation

Screwtape3mo50

Huh. That article does not have as much information as I want on how that election process works, but I'll swap to William The Conqueror as an example. Thanks for pointing it out.

It's the second example I've had to swap which probably should dock me some kind of points here, though I still feel pretty good about the overall thesis.

Socially Graceful Degradation

Screwtape3mo40

Even if a skill isn't as useful if you're the only one to know it, if the skill is still somewhat useful that can work. I like literacy as an example; crazy good if most people have it, still useful if only you have it, usually obvious pretty quickly if other people don't have it.

Individual and group rationality are pretty relevant here. In a sense, one thing I'm pointing at is a way to bootstrap (some) rationality skills from the easier individual domain in to the harder group domain; focus on places where the same skill is relevant in both arenas. It's a... (read more)

2024 Unofficial LessWrong Survey Results

Screwtape3mo40

If I try this again next year I plan to use the exact same text and values on both sides, which hopefully will clear up most of that kind of issue. It doesn't really fix marginal value, but I'm not sure that's fatal to this kind of analysis- I can quote a reasonable price for an apple even though my marginal value of apples drops very fast by the time I hit three digits of apples. I could try and fix this by picking things I think people value vaguely the same but then we miss out on catching scope insensitivity.

2024 Unofficial LessWrong Survey Results

Screwtape3mo20

11.2% is if I remove the CFAR attendees. 36.8% is if I remove the non-attendees. Possibly this is a backwards way of setting things up but I think it's right?

Say I have a general population and I know how many pushups they can do on average (call this Everyone Average), and I remove everyone who goes to the gym and see how many pushups those remaining can do on average (Call this Gym-Removed Average) and then I go back to the general population again this time removing everyone who doesn't go to the gym (Call this No-Gym-Removed Average.)

This i... (read more)

2Garrett Baker3mo

Ok first, when naming things I think you should do everything you can to not use double-negatives. So you should say "gym average" or "no gym average". Its shorter, and much less confusing. Second, I'm still confused. Translating what you said, we'd have "no gym removed average" -> "gym average" (since you remove everyone who doesn't go to the gym meaning the only people remaining go to the gym), and "gym removed average" -> "no gym average" (since we're removing everyone who goes to the gym meaning the only remaining people don't go to the gym). Therefore we have, gym average = no gym removed average < gym removed average = no gym average So it looks like the gym doesn't help, since those who don't go to the gym have a higher average number of pushups they can do than those who go to the gym.

Socially Graceful Degradation

Screwtape4mo20

No, I think I'm actually just wrong here and River is correct. I don't know how I wound up with the clockwise rule in my head but I just checked the new driver's pamphlet and it's first to the intersection. Updated.

3AnthonyC4mo

The clockwise rule is what you are supposed to do if people arrive to the intersection at the same time. If exactly two people going opposite directions arrive at the same time and aren't both going straight, then the one going straight goes before the one turning, or the one turning right goes before the one turning left. At least, that's how I and everyone I know was taught, and no, those of us who asked what "at the same time" actually means never got a straight answer.

2024 Unofficial LessWrong Survey Results

Screwtape4mo20

but predicted that it was instead about sensitivity to subtle changes in the wording of questions.

If I try this again next year I'm inclined to keep the wording the same instead of trying to be subtle.

Regarding the dutch book numbers: it seems like, for each of the individual-question presentations of that data, you removed the outliers. When performing the dutch book calculations, however, it seems like you keep the outliers in.

Yep. Well, in the individual reports I reported the version with the outliers, and then sometimes did another pass without outlie... (read more)

2Alice Blair4mo

I think your reasoning-as-stated there is true and I'm glad that you showed the full data. I suggested removing outliers for dutch book calculations because I suspected that the people who were wild outliers on at least one of their answers were more likely to be wild outliers on their ability to resist dutch books; I predict that the thing that causes someone to say they value a laptop at one million bikes is pretty often just going to be "they're unusually bad at assigning numeric values to things." The actual origin of my confusion was "huh, those dutch book numbers look really high relative to my expectations, this reminds me of earlier in the post when the other outliers made numbers really high." I'd be interested to see the outlier-less numbers here, but I respect if you don't have the spoons for that given that the designated census processing time is already over.

2024 Unofficial LessWrong Survey Results

Screwtape4mo20

Hrm. I guess what would be helpful here would be a sense of the range; the average briers floated around .20 to .23, and I don't have a sense of whether that's a tight clustering with a bit of noise or a meaningful difference. To use running a mile as a comparison, differences of seconds mostly aren't important (except at high levels) but differences of minutes are, right?

2Garrett Baker4mo

Yeah, I don't know the answer here, but I will also say that one flaw of the brier score is that its not even clear that these sorts of differences will be even all that meaningful. Like, what you actually want to know is, how much more information does one group here give over the other groups here, and how much credence should we assign to each of the groups (acting as if they were each hypotheses in a Bayes update) given their predictions on the data we have. And for that, you can just run the bayes update. The brier score was chosen for forecasters as far as I can tell because its more fun than scoring yourself based on log-odds (equivalent to the bayes update thing). Its less sensitive to horribly bad predictions, and it has a bounded "how bad can you be". Its also easier to explain and think about, and has a different incentive landscape for those trying to maximize their scores, which may be useful if you're trying to elicit good predictions. But if you're trying to determine who you should listen to (ie in what proportion you should update your model given so-and-so says such-and-such) you can't do better than a Bayes update (given the constraints), so just use that!

2024 Unofficial LessWrong Survey Results

Screwtape4mo20

If Other is larger than I expect, I think of that as a reason to try and figure out what the parts of Other are. Amusingly enough for the question, I'm optimistic about solving this by letting people do more free response and having an LLM sift through the responses.

2mako yass4mo

Contemplating an argument that free response rarely gets more accurate results for questions like this because listing the most common answers as checkboxes helps respondents to remember all of the answers that're true for of them.

2024 Unofficial LessWrong Survey Results

Screwtape4mo30

Thank you! I felt quite clever setting it up.

2024 Unofficial LessWrong Survey Results

Screwtape4mo20

Yeah, I should probably add a bit at the start or end of that section that everything in it is potentially selection effect. I don't know how to look at the thing I'm curious about without that.

Thinking out loud: If you get a random selection of people from the Pushup Club and count how many pushups they can do, then do the same for general population, the difference could be selection effect. People who like doing pushups are more likely to go to pushup club in the first place, and more likely to stick with it. But I can't realistically pay a bunch of Mec... (read more)

2024 Unofficial LessWrong Survey Results

Screwtape4mo20

No, I think that's correct.

There's 107 people who answered above 200, 21 who answered exactly 200, and 113 people who answered below 200. The second quartile (aka the median) is 200. But nobody guessed a negative number, so the people who guessed low aren't pulling the mean down that much. Meanwhile 33 people guessed 1000 or higher, and they can yank the mean a lot without doing that much to the median. If you're asking people to generate numbers, you tend to get whole number quartiles because nobody guesses there's 100.5 stations.

Imagine a the set [1,1,1,2,2,2,2,100,100]. The average is ~23.444, but the median is 2.

Or have I misunderstood the thing that you think needs to be corrected?

2Sherrinford4mo

Sorry, I confused something (not about the median, but about the 400). Thanks for illustrating.

2024 Unofficial LessWrong Survey Results

Screwtape4mo20

Wouldn't that get rid of all of the table of contents?

Ideally I'd have a hierarchy of headings. I think what's happening is it picks up some (but not all) lines that are entirely bold, and treats those as a sort of Heading 4.

1dirk4mo

Sorry, I meant to change only the headings you didn't want (but that won't work for text that's already paragraph-style, so I suppose that wouldn't fix the bold issue in any case; I apologize for mixing things up!). Testing it out in a draft, it seems like having paragraph breaks before and after a single line of bold text might be what triggers index inclusion? In which case you can likely remove the offending entries by replacing the preceding or subsequent paragraph break with a shift-enter (still hacky, but at least addressing the right problem this time XD).

2024 Unofficial LessWrong Survey Results

Screwtape4mo20

Future Survey Discussion thread

2024 Unofficial LessWrong Survey Results

Screwtape4mo20

A Screwtape Point (and upvotes) to whoever can tell me how to fix the table of contents.

1dirk4mo

A relatively easy solution (which would, unfortunately, mess with your formatting; not sure if there's a better one that doesn't do that) might be to convert everything you don't want in there to paragraph style instead of heading 1/2/3

HPMOR Anniversary Parties: Coordination, Resources, and Discussion

Screwtape4mo20

That's just plain unfortunate.

HPMOR Anniversary Parties: Coordination, Resources, and Discussion

Screwtape4mo20

Huh. Let me check with the local organizer and see if they have an update.

HPMOR Anniversary Guide

Screwtape4mo30

Ooh, please share the butter beer recipe?

I was leaning towards reading part of Dumbledore's letter in 119. There are a lot of funny, silly lines I want to quote throughout the day but this piece is short and poignant.

There can only be one king upon the chessboard.
There can only be one piece whose value is beyond price.
That piece is not the world, it is the world's peoples, wizard and Muggle alike, goblins and house-elves and all.
While survives any remnant of our kind, that piece is yet in play, though the stars should die in heaven.
And if that

... (read more)

1thegreatnick4mo

https://www.favfamilyrecipes.com/butterbeer/#wprm-recipe-container-20304 Haven't made it yet but it's going to be a hopefully fun experiment with all of us

HPMOR Anniversary Guide

Screwtape4mo40

What shape is the screen?

This one is probably my favourite for an event banner.

1Steven K Zuber4mo

That’s the one we used the first time and the one we’re using this time around. :)

1ProgramCrafter4mo

It's 16:9 (modulo possible changes in the venue). I have seen your banner and it is indeed one of the best choices out there! For announcing the event I preferred another one.