All of YonatanK's Comments + Replies

The way you stated this makes it seem as though your conclusion about why the Democrats lost (and, by extension, what they need to do to avoid losing in the future) is obviously correct. But the Median Voter Theorem you invoked is a conditional statement, and I don't think it's at all obvious that its conditions held for the 2024 US presidential election.

3AnthonyC
That's true, and you're right, the way I wrote my comment overstates the case. Every individual election is complicated, and there's more than one axis of variation differentiating candidates and voters. The whole process of Harris becoming the candidate made this particular election weird in a number of ways. And as a share of the electorate, there are many fewer swing voters than there used to be a few decades ago, and they are not conveniently sorted into large, coherent blocs.

And yet, it's also true that as few as ~120,000 votes in WI, MI, and PA could have swung the result, three moderate states that have flipped back and forth across each of the past four presidential elections. Only slightly more would have been needed for several other combinations of states. It's not some deep mystery who lives in the Rust Belt, or what positions on which issues a few tens of thousands of voters on the fence might care about. It's not like those issues are uncorrelated, either. And if you look at the last handful of elections, a similar OOM of voters in a similar set of states could have swung things either way, each time. And it's true that Harris underperformed Biden-2020 by vote share in every state but Utah (and 37.7% vs 37.8% does not matter to the outcome in any plausible scenario). If I'm reading the numbers correctly, she also received fewer votes numerically than Biden in all but 6 states.

So yes: I can very easily imagine scenarios where you're right, and the fact that we don't meet the theoretical assumptions necessary for the median voter theorem to apply means we can't assume an approximation of it in practice. It's even possible, if the Dems had really started organizing sustained and wide-ranging GOTV campaigns fifteen years ago, that there could be the kinds of blue wave elections I keep getting told are demographic inevitabilities just around the corner, as long as they keep moving further towards the current set of progressive policy goals. But what I cannot imagine is that,

What I find lacking is any depth to the retrospection. Hanania's willingness to update his position clears a rather low bar. To go further, one has to search for systematic causes of the error. For example, being wrong along with the markets seems like a good opportunity to examine the hidden incentives that cause the market to get it wrong, rather than shrugging and saying "who could have known?"

I feel the question misstates the natsec framing by jumping to the later stages of AGI and ASI. This is important because it leads to a misunderstanding of the rhetoric that convinces normal non-futurists, who aren't spending their days thinking about superintelligence.

The American natsec framing is about an effort to preserve the status quo in which the US is the hegemon. It is a conservative appeal with global reach, which works because Pax Americana has been relatively peaceful and prosperous. Anything that threatens American dominance, including giving... (read more)

Richard, reading this piece in light of other pieces you've written/delivered about Effective Altruism, such as your Lessons from my time in Effective Altruism and your recent provocative talk at EA Global Boston, leads me to wonder what it is (if anything) that leads you to self-identify as an Effective Altruist? There may not be any explicit EA shibboleth, but EA nevertheless seems to me to entail a set of particular methods, and once you have moved beyond enough of them it may not make any sense to call oneself an Effective Altruist.

My mental mo... (read more)

A relevant, very recent opinion piece that has been syndicated around the country, explaining the universal value of faith:
https://www.latimes.com/opinion/story/2025-03-29/it-is-not-faith-that-divides-us 

There's a gap in the Three Mile Island/Chernobyl/Fukushima analogy, because those disasters were all in the peaceful uses of nuclear power. I'm not saying that they didn't also impact the nuclear arms race, only that, for completeness, the arms race dynamics have to be considered as well.

There are at least 3 levers of social influence, and I suspect that we undervalue the 3rd when it comes to getting anything done, especially on AI safety. They are 1) government policy, 2) the actions of firms, and 3) low-coordination behaviors of individuals influenced by norms. There is a subclass of #3 that is having its day in the sun: the behaviors of employees of the US federal government, a.k.a. the "Deep State." If their behaviors didn't matter, there wouldn't be a perceived need by the Trump administration to purge them en masse (and replace with lo... (read more)

That summary doesn't sound to me to be in the neighborhood of the intended argument. I would be grateful if you pointed to passages that suggest that reading so that I can correct them (DM me if that's preferable).

Where I see a big disconnect is your conclusion that "AI will have an incentive to do X." The incentives that the essay discusses are human incentives, not those of a hypothetical artificial agent.

Your post is about the recurring patterns of avoidance you're observing, and doesn't mention the impact on those who are more receptive and willing to engage. Nevertheless, I figure you'd still appreciate examples of the latter:

A link to the GD website was sent to me by a relatively distant acquaintance. This is a person of not-insignificant seniority at one of the FAANGs, whose current job is now hitched to AI. They have no specific idea about my own thoughts about AI risk, so my inference is that they sent it to anyone they deem sufficiently wonky. The ... (read more)

There are two kinds of beliefs, those that can be affirmed individually (true independently of what others do) and those that depend on others acting as if they believe the same thing. They are, in other words, agreements. One should be careful not to conflate the two.

What you describe as "neutrality" seems to me to be a particular way of framing institutional forbearance and similar terms of cooperation in the face of the possibility of unrestrained competition and mutual destruction. When agreements collapse, it is not because these terms were unworkable (except in the trivial sense that, well, they weren't invulnerable to gaming and so on) but because cooperation between humans can always break down.

1David James
Right. Some such agreements are often called social contracts. One catch is that a person born into them may not understand their historical origin or practical utility, much less agree with them.

@AnthonyC I may be mistaken, but I took @M. Y. Zuo to be offering a reductio ad absurdum response to your comment about not being indifferent between the two ways of dying. The 'which is a worse way to die' debate doesn't respond to what I wrote. I said

With respect to the survival prospects for the average human, this [whether or not the dying occurs by AGI] seems to me to be a minor detail.

I did not say that no one should care about the difference. 

But the two risks are not in competition; they are complementary. If your concern about misalignment is... (read more)

You have a later response to some clarifying comments from me, so this may be moot, but I want to call out that my emphasis is on the behavior of human agents who are empowered by automation that may fall well short of AGI. A "pivotal act" is a very germane idea, but rather than the pivotal act of the first AGI eliminating would-be AGI competitors, this act is carried out by humans taking out their human rivals.

It is pivotal because once the target population size has been achieved, competition ends, and further development of the AI technology can be halted as unnecessarily risky.

If an unaligned AI by itself can do near-world-ending damage, an identically powerful AI that is instead alignable to a specific person can do the same damage.

If you mean that as the simplified version of my claim, I don't agree that it is equivalent.

Your starting point, with a powerful AI that can do damage by itself, is wrong. My starting point is groups of people whom we would not currently consider to be sources of risk, who become very dangerous as novel weaponry, along with changes in relations of economic production, unlock the means and the motive ... (read more)

2Shankar Sivarajan
What was the old title? Something like "Misaligned AI doesn't kill people, misaligned people do"? This new one sounds like it could make for a good slogan too: "To the average American, gun control is more lethal than guns."

I agree, and I attempted to emphasize the winner-take-all aspect of AI in my original post.

The intended emphasis isn't on which of the two outcomes is preferable, or how to comparatively allocate resources to prevent them. It's on the fact that there is no difference between alignment and misalignment with respect to the survival expectations of the average person.

4AnthonyC
Ok, then that I understand. I do not think it follows that I should be indifferent between those two ways of me dying. Both are bad, but only one of them necessarily destroys everything I value. In any case, I think it's much more likely that a group using an aligned-as-defined-here AGI would kill (almost) everyone by accident rather than intentionally.

The title was intended as an ironic allusion to a slogan used by the National Rifle Association in the U.S. to dismiss calls for tighter restrictions on gun ownership. I expected this allusion to be easily recognizable, but see now that it was probably a mistake.

1[comment deleted]
1kave
I recognised the allusion but also disliked the title.
5habryka
Oh, I totally recognized it, but like, the point of that slogan is to make a locally valid argument that guns are indeed incapable of killing people without being used by people. That is not true of AIs, so it seems like it doesn't apply.

An argument for danger of human-directed misuse doesn't work as an argument against dangers of AI-directed agentic activity.

 

I agree. But I was not trying to argue against the dangers of AI-directed agentic activity. The thesis is not that "alignment risk" is overblown, nor is the point a comparison of the risks; it's that those risks accumulate such that the technology is guaranteed to be lethal for the average person. This is significant because the risk of misalignment is typically thought to be accepted because of rewards that will be broadly shared... (read more)

3Seth Herd
That's an excellent summary sentence. It seems like that would be a useful statement in advocating for AI slowdown/shutdown.

I'm surprised by the lack of follow-up to this post and the accompanying thread, which took place in the immediate aftermath of the October 7th massacre. A lot has happened since then -- new data against which the original thinking could be evaluated. Also time has provided opportunity to self-educate about the conflict, which a few people admitted to not knowing a lot about. Given the human misery that has only worsened since the OP started asking questions, I would think that a follow-up would be a worthy exercise. @Annapurna ?

Ever since first hearing the music of the Disney movie "Encanto," I've been sneering at the lyrics "stars don't shine they burn/ and constellations shift" -- because, no, of course constellations don't shift -- without really stopping to think about it. Caught in my epistemic arrogance again!

Oops, Samuel beat me to the punch by 2 minutes.

You've already noted that it doesn't really matter, but I thought I'd help fill in the blanks.

The current global regime of sovereign nation-states that we take for granted is the product of the 20th century. It's not like an existing sovereign nation-state belonging to the Palestinians was carved up by external powers and arbitrarily handed to Jews. Rather, the disintegration of empires created opportunities for local nationalist movements to arise, creating new countries based on varying and competing unifying or dividing factors such as language, tribal ... (read more)

This response is not unreasonable, but the description of "WW2-style solution" seems ignorant of the fact that Israel did occupy Gaza for decades, and had something very similar to a "puppet government" there, in the form of the Fatah party in control of the Palestinian Authority in the West Bank. Israel unilaterally withdrew in 2005, and Hamas violently took over in 2007.

The rest of it operates under the hypothesis that Hamas is opposed to the objective interests of the Palestinians of Gaza. This ends up being tautological if objective self-interest is de... (read more)

3Viliam
I don't know much about the history of Israel, but Wikipedia says that Fatah fought against the IDF and was considered a terrorist organization by Israel. If that is true, it is not the kind of "puppet government" I suggested installing.

A helpful counterpoint.

Why would the human beings have to be suicidal, if they can also have the AI provide them with a vaccine?

3avturchin
Technically, in that case not all humans will be dead - but given vaccine failure rates and virus mutation rates, it will eventually kill everybody. Also, a vaccine-generating AI would have to be more complex. A virus generator could be just a generative AI trained on a database of virus code, which would also predict the virulence and mortality of any newly generated code.

Thank you. If I understand your explanation correctly, you are saying that there are alignment solutions that are rooted in more general avoidance of harm to currently living humans. If these turn out to be the only feasible solutions to the not-killing-all-humans problem, then they will produce not-killing-most-humans as a side-effect. Nuke analogy: if we cannot build/test a bomb without igniting the whole atmosphere, we'll pass on bombs altogether and stick to peaceful nuclear energy generation.

It seems clear that such limiting approaches would be avoide... (read more)

Hi all, writing here to introduce myself and to test the waters for a longer post.

I am an experienced software developer by trade. I have an undergraduate degree in mathematics and a graduate degree in a field of applied moral and political philosophy. I am a Rawlsian by temperament but an Olsonian by observation of how the world seems to actually work. My knowledge of real-world AI is in the neighborhood of the layman's. I have "Learning Tensorflow.js" on my nightstand, which I promised my spouse not to start into until after the garage has been tidied.

No... (read more)

9metachirality
Formal alignment proposals avoid this problem by doing metaethics, mostly something like determining what a person would want if they were perfectly rational (so no cognitive biases or logical errors), otherwise basically omniscient, and had an unlimited amount of time to think about it. This is called reflective equilibrium. I think this approach would work for most people, even pretty terrible people. If you extrapolated a terrorist who commits acts of violence for some supposed greater good, for example, they'd realize that the reasoning they used to determine that said acts of violence were good was wrong.

Corrigibility, on the other hand, is more susceptible to this problem and you'd want to get the AI to do a pivotal act, for example, destroying every GPU to prevent other people from deploying harmful AI, or unaligned AI for that matter.

Realistically, I think that most entities who'd want to use a superintelligent AI like a nuke would probably be too short-sighted to care about alignment, but don't quote me on that.