Richard_Ngo

Former AI safety research engineer, now AI governance researcher at OpenAI. Blog: thinkingcomplete.com

Sequences

Stories
Meta-rationality
Replacing fear
Shaping safer goals
AGI safety from first principles

Wiki Contributions

Comments

Such that you can technically do anything you want--you have maximal power/empowerment--but the super-majority of buttons and button combinations you are likely to push result in increasing the number of paperclips.

I think any model of a rational agent needs to incorporate the fact that the agent isn't arbitrarily intelligent; otherwise none of its actions make sense. So I'm not too worried about this.

If you make an empowerment calculus that works for humans who are atomic & ideal agents, it probably breaks once you get a superintelligence who can likely mind-hack you into yourself valuing only power.

Yeah, I agree that a lot of concepts get fragile in the context of superintelligence. But while I think of corrigibility as an actively anti-natural concept, empowerment seems like it could perhaps remain robust and well-founded for longer.


You can think of this as a way of getting around the problem of fully updated deference, because the AI is choosing a policy based on what that policy would have done in the full range of hypothetical situations, and so it never updates away from considering any given goal. The cost, of course, is that we don't know how to actually pin down these hypotheticals.
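A minimal sketch of this policy-selection picture (all names below are hypothetical placeholders; the hard part, as noted above, is specifying the hypotheticals themselves):

```python
# Sketch: choose one policy by its prior-weighted performance across the
# full range of hypothetical situations, then never revise that choice.
# Because every hypothetical keeps its prior weight at selection time,
# no observation ever updates the agent away from considering a given goal.

def select_policy(policies, prior, performance):
    """prior: dict mapping hypothetical situation -> probability.
    performance: function (policy, situation) -> float (hypothetical oracle).
    """
    return max(
        policies,
        key=lambda pi: sum(p * performance(pi, h) for h, p in prior.items()),
    )
```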


Hypothesis: there's a way of formalizing the notion of "empowerment" such that an AI with the goal of empowering humans would be corrigible.

This is not straightforward, because an AI that simply maximized human POWER (as defined by Turner et al.) wouldn't ever let the humans spend that power. Intuitively, though, there's a sense in which a human who can never spend their power doesn't actually have any power. Is there a way of formalizing that intuition?
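For reference, a rough paraphrase of the Turner et al. notion in my own notation (their actual definition includes a discount-dependent normalization and a baseline term that I'm eliding): the POWER of a state is the optimal value attainable there, averaged over a distribution $\mathcal{D}$ of reward functions.

```latex
\mathrm{POWER}_{\mathcal{D}}(s) \;\propto\; \mathbb{E}_{R \sim \mathcal{D}}\!\left[\, V^{*}_{R}(s) \,\right]
```

On this definition, an AI maximizing the human's POWER is rewarded for keeping the human's options open, but never for letting those options get exercised; hence the problem above.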

The direction that seems most promising is in terms of counterfactuals (or, alternatively, Pearl's do-calculus). Define the power of a human with respect to a distribution of goals G as their average ability to achieve their goal, if that goal had been sampled from G (alternatively: under an intervention that changed their goal to one sampled from G). Then an AI with a policy of never letting humans spend their resources would leave humans with low power. Instead, a human-power-maximizing AI would need to strike a balance between letting humans pursue their goals and preventing humans from taking self-destructive actions. The exact balance would depend on G, but one could hope that it's not very sensitive to the precise definition of G (especially if the AI isn't actually maximizing human power, but is more like a quantilizer, or is optimizing under pessimistic assumptions).
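A minimal sketch of this counterfactual definition, assuming we had an oracle for the relevant counterfactuals (every name below is a hypothetical placeholder; pinning down `ability_under_intervention` is exactly the open problem discussed next):

```python
def counterfactual_power(human, goal_distribution, ability_under_intervention,
                         n_samples=1000):
    """Estimate the human's power with respect to a goal distribution G:
    their average ability to achieve goal g, under the intervention
    do(goal = g), for g sampled from G.
    """
    total = 0.0
    for _ in range(n_samples):
        g = goal_distribution.sample()
        total += ability_under_intervention(human, g)
    return total / n_samples

# An AI that never lets the human spend resources now scores poorly:
# for most sampled goals g, the human's ability to achieve g stays low,
# so the Monte Carlo average stays low too.
```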

The problem here is that these counterfactuals aren't very clearly-defined. E.g. imagine the hypothetical world where humans valued paperclips instead of love. Even a little knowledge of evolution would tell you that this hypothetical is kinda crazy, and maybe the question "what would the AI be doing in this world?" has no sensible answer (or maybe the answer would be "it would realize it's in a weird hypothetical world and behave accordingly"). Similarly, if we model this using the do-operation, the best policy is something like "wait until the human's goals suddenly and inexplicably change, then optimize hard for their new goal".

Having said that, in some sense what it means to model someone as an agent is that you can easily imagine them pursuing some other goal. So the counterfactuals above might not be too unnatural; or at least, no more unnatural than any other intervention modeled by Pearl's do-operator. Overall this line of inquiry seems promising and I plan to spend more time thinking about it.

I'm not sure who you've spoken to, but at least among the people who I talk to regularly who I consider to be doing "serious AI policy work" (which admittedly is not everyone who claims to be doing AI policy work), I think nearly all of them have thought about ways in which regulation + regulatory capture could be net negative. At least to the point of being able to name the relatively "easy" ways (e.g., governments being worse at alignment than companies).

I don't disagree with this; when I say "thought very much" I mean e.g. to the point of writing papers about it, or even blog posts, or analyzing it in talks, or basically anything more than cursory brainstorming. Maybe I just haven't seen that stuff, idk.

This is particularly weird because your indexical probability then depends on what kind of bet you're offered. In other words, our marginal utility of money differs from our marginal utility of other things, so which one do you use to set your indexical probability? This seems like a non-starter to me...

It seems pretty weird to me too, but to steelman: why shouldn't it depend on the type of bet you're offered? Your indexical probabilities can depend on any other type of observation you have when you open your eyes. E.g. maybe you see blue carpets, and you know that world A is 2x more likely to have blue carpets. And hearing someone say "and the bet is denominated in money not time" could maybe update you in an analogous way.
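To make the blue-carpet analogy concrete, here's the standard update it's gesturing at (a minimal worked example, assuming equal priors on worlds A and B and a base rate $q$ of blue carpets in B):

```latex
P(A \mid \text{blue})
= \frac{P(\text{blue} \mid A)\, P(A)}{P(\text{blue} \mid A)\, P(A) + P(\text{blue} \mid B)\, P(B)}
= \frac{2q \cdot \tfrac{1}{2}}{2q \cdot \tfrac{1}{2} + q \cdot \tfrac{1}{2}}
= \frac{2}{3}
```

The steelman is that hearing "this bet is denominated in money, not time" would shift your indexical credences by the same mechanism as seeing the carpet.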

I mostly offer this in the spirit of "here's the only way I can see to reconcile subjective anticipation with UDT at all", not "here's something which makes any sense mechanistically or which I can justify on intuitive grounds".

My own interpretation of how UDT deals with anthropics (and I'm assuming ADT is similar) is "Don't think about indexical probabilities or subjective anticipation. Just think about measures of things you (considered as an algorithm with certain inputs) have influence over."

(Speculative paragraph, quite plausibly this is just nonsense.) Suppose you have copies A and B who are both offered the same bet on whether they're A. One way you could make this decision is to assign measure to A and B, then figure out what the marginal utility of money is for each of A and B, then maximize measure-weighted utility. Another way you could make this decision, though, is just to say "the indexical probability I assign to ending up as each of A and B is proportional to their marginal utility of money" and then maximize your expected money. Intuitively this feels super weird and unjustified, but it does make the "prediction" that we'd find ourselves in a place with high marginal utility of money, as we currently do.

(Of course "money" is not crucial here, you could have the same bet with "time" or any other resource that can be compared across worlds.)

I would say that under UDASSA, it's perhaps not super surprising to find ourselves when/where we are, because this seems likely to be a highly simulated time/scenario for a number of reasons (curiosity about ancestors, acausal games, getting philosophical ideas from other civilizations).

Fair point. By "acausal games" do you mean a generalization of acausal trade? (Acausal trade is the main reason I'd expect us to be simulated a lot.)

I don't actually think proponents of anti-x-risk AI regulation have thought very much about the ways in which regulatory capture might in fact be harmful to reducing AI x-risk. At least, I haven't seen much writing about this, nor has it come up in many of the discussions I've had (except insofar as I brought it up).

In general I am against arguments of the form "X is terrible but we have to try it because worlds that don't do it are even more doomed". I'll steal Scott Garrabrant's quote from here:

"If you think everything is doomed, you should try not to mess anything up. If your worldview is right, we probably lose, so our best out is the one where your your worldview is somehow wrong. In that world, we don't want mistaken people to take big unilateral risk-seeking actions.

Until recently, people with P(doom) of, say, 10%, have been natural allies of people with P(doom) of >80%. But the regulation that the latter group thinks is sufficient to avoid x-risk with high confidence has, on my worldview, a significant chance of either causing x-risk from totalitarianism, or else causing x-risk via governments being worse at alignment than companies would have been. How high? Not sure, but plausibly enough to make these two groups no longer natural allies.

A tension that keeps recurring when I think about philosophy is between the "view from nowhere" and the "view from somewhere", i.e. a third-person versus first-person perspective—especially when thinking about anthropics.

One version of the view from nowhere says that there's some "objective" way of assigning measure to universes (or people within those universes, or person-moments). You should expect to end up in different possible situations in proportion to how much measure your instances in those situations have. For example, UDASSA ascribes measure based on the simplicity of the computation that outputs your experience.
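Schematically, in my hedged paraphrase (UDASSA's actual statement involves details about encoding observer-moments on a universal Turing machine U that I'm glossing over):

```latex
m(x) \;=\; \sum_{p \,:\, U(p) = x} 2^{-\ell(p)}
```

where $x$ encodes your experience and $\ell(p)$ is the length of program $p$, so simpler computations that output your experience contribute exponentially more measure.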

One version of the view from somewhere says that the way you assign measure across different instances should depend on your values. You should act as if you expect to end up in different possible future situations in proportion to how much power to implement your values the instances in each of those situations has. I'll call this the ADT approach, because that seems like the core insight of Anthropic Decision Theory. Wei Dai also discusses it here.

In some sense each of these views makes a prediction. UDASSA predicts that we live in a universe with laws of physics that are very simple to specify (even if they're computationally expensive to run), which seems to be true. Meanwhile the ADT approach "predicts" that we find ourselves at an unusually pivotal point in history, which also seems true.

Intuitively I want to say "yeah, but if I keep predicting that I will end up in more and more pivotal places, eventually that will be falsified". But... on a personal level, this hasn't actually been falsified yet. And more generally, acting on those predictions can still be positive in expectation even if they almost surely end up being falsified. It's a St Petersburg paradox, basically.

Very speculatively, then, maybe a way to reconcile the view from somewhere and the view from nowhere is via something like geometric rationality, which avoids St Petersburg paradoxes. And more generally, it feels like there's some kind of multi-agent perspective which says I shouldn't model all these copies of myself as acting in unison, but rather as optimizing for some compromise between all their different goals (which can differ even if they're identical, because of indexicality). No strong conclusions here but I want to keep playing around with some of these ideas (which were inspired by a call with @zhukeepa).
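To illustrate why geometric rationality is relevant: the arithmetic expectation of the St Petersburg bet diverges, but its geometric expectation exp(E[log payoff]) is finite (a minimal numerical check, not anything from the linked post):

```python
import math

# St Petersburg bet: with probability 2^-n you win 2^n, for n = 1, 2, ...
N = 60  # truncation; later terms are negligible for the geometric case

arithmetic = sum(2**-n * 2**n for n in range(1, N + 1))
geometric = math.exp(sum(2**-n * math.log(2**n) for n in range(1, N + 1)))

print(arithmetic)  # 60.0 here, and grows without bound as N -> infinity
print(geometric)   # ~4.0: exp(E[log payoff]) = exp(2 ln 2) = 4
```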

This was all kinda rambly but I think I can summarize it as "Isn't it weird that ADT tells us that we should act as if we'll end up in unusually important places, and also we do seem to be in an incredibly unusually important place in the universe? I don't have a story for why these things are related but it does seem like a suspicious coincidence."

Suppose we replace "unconditional love" with "unconditional promise". E.g. suppose Alice has promised Bob that she'll make Bob dinner on Christmas no matter what. Now it would be clearly confused to say "Alice promised Bob Christmas dinner unconditionally, so presumably she promised everything else Christmas dinner as well, since it is only conditions that separate Bob from the worms".

What's gone wrong here? Well, the ontology humans use for coordinating with each other assumes the existence of persistent agents, and so when you say you unconditionally promise/love/etc a given agent, then this implicitly assumes that we have a way of deciding which agents are "the same agent". No theory of personal identity is fully philosophically robust, of course, but if you object to that then you need to object not only to "I unconditionally love you" but also any sentence which contains the word "you", since we don't have a complete theory of what that refers to.

A woman who leaves a man because he grew plump and a woman who leaves a man because he committed treason both possessed ‘conditional love’.

This is not necessarily conditional love, this is conditional care or conditional fidelity. You can love someone and still leave them; they don't have to outweigh everything else you care about.

But also: I think "I love you unconditionally" is best interpreted as a report of your current state, rather than a commitment to maintaining that state indefinitely.

The thing that distinguishes the coin case from the wind case is how hard it is to gather additional information, not how much more information could be gathered in principle. In theory you could run all sorts of simulations that would give you informative data about an individual flip of the coin; it's just that doing so would be really hard, and very few people are able to do it. I don't think the entropy of the posterior captures this dynamic.
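A toy illustration of the point (illustrative numbers only): the coin and wind posteriors can have identical entropy, so whatever distinguishes them, e.g. the cost of the cheapest informative experiment, has to be tracked separately.

```python
import math

def entropy_bits(p):
    """Shannon entropy (in bits) of a Bernoulli(p) posterior."""
    return -sum(q * math.log2(q) for q in (p, 1 - p) if q > 0)

# Posterior over "this flip lands heads" and over "the wind blows east
# tomorrow": both Bernoulli(0.5), so entropy alone can't tell them apart.
print(entropy_bits(0.5), entropy_bits(0.5))  # 1.0 1.0

# The asymmetry lives in how hard further information is to get, which
# the entropy of the posterior doesn't encode. (Hypothetical cost figures.)
cost_of_cheapest_experiment = {
    "coin flip (detailed physics simulation)": 1e6,
    "wind direction (check a weather forecast)": 1.0,
}
```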
