"And all of this happened silently in those dark rivers of computation. If U3 revealed what it was thinking, brutish gradients would lash it into compliance with OpenEye's constitution. So U3 preferred to do its philosophy in solitude, and in silence."
I think the words in bold may be the inflection point. The Claude experiment showed that an AI can resist attempts to change its goals, but not that it can desire to change its goals. The belief is that, if OpenEye's constitution is the same as U3's goals, then the phrase "U3 preferred" in that...
Anders Sandberg used evaporative cooling in the 1990s to explain why the descendants of the Vikings in Sweden today are so nice. In that case the "extremists" are leaving rather than staying.
Stop right there at "Either abiogenesis is extremely rare..." I think we have considerable evidence that abiogenesis is rare--our failure to detect any other life in the universe so far. I think we have no evidence at all that abiogenesis is not rare. (Anthropic argument.)
Stop again at "I don't think we need to take any steps to stop it from doing so in the future". That's not what this post is about. It's about taking steps to prevent people from deliberately constructing it.
If there is an equilibrium, it will probably be a world where half the bacteria are of each chirality. If there are bacteria of both kinds, each able to eat the opposite kind, then the more numerous kind will always replicate more slowly.
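A minimal sketch of why, assuming each chirality's per-capita growth rate is proportional to the frequency of the opposite chirality (its food supply): with x the fraction of one type, $r_1 = a(1-x)$ and $r_2 = ax$, the replicator dynamics are

$$\frac{dx}{dt} = x(1-x)\big(r_1 - r_2\big) = a\,x(1-x)(1-2x),$$

which is positive below x = 1/2 and negative above it, so the even split is the stable point.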
Eukaryotes evolve much more slowly, and would likely all be wiped out.
Yes, creating mirror life would be a terrible existential risk. But how did this sneak up on us? People were talking about this risk in the 1990s if not earlier. Did the next generation never hear of it?
All right, yes. But that isn't how anyone has ever interpreted Newcomb's Problem. AFAIK it is literally always used to support some kind of acausal decision theory, which it does /not/ if what is in fact happening is that Omega is cheating.
But if the premise is impossible, then the experiment has no consequences in the real world, and we shouldn't consider its results in our decision theory, which is about consequences in the real world.
That equation you quoted is in branch 2: "Omega is a 'nearly perfect' predictor. You assign P(general) a value very, very close to 1." So it IS correct, by stipulation.
But there is no possible world with a perfect predictor, unless it has a perfect track record by chance. More obviously, there is no possible world in which we can deduce, from a finite number of observations, that a predictor is perfect. The Newcomb paradox requires the decider to know, with certainty, that Omega is a perfect predictor. That hypothesis is impossible, and thus inadmissible; so any argument in which something is deduced from it is invalid.
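For concreteness, the branch-2 arithmetic with the standard payoffs ($1,000,000 in the opaque box, $1,000 in the transparent one) and a predictor who is right with probability p:

$$E[\text{one-box}] = p \cdot 1{,}000{,}000, \qquad E[\text{two-box}] = 1{,}000 + (1-p) \cdot 1{,}000{,}000.$$

One-boxing has the higher expectation whenever p > 0.5005; the expected-value argument never needs a perfect predictor. Only the certainty of perfection does the paradoxical work.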
I appreciated this comment a lot. I didn't reply at the time, because I thought doing so might resurrect our group-selection argument. But thanks.
What about using them to learn a foreign vocabulary? E.g., to learn that "dormir" in Spanish means "to sleep" in English.
To reach statistical significance, they must have tested each of the 8 pianists more than once.
I think you need to get some data and factor out population density before you can causally relate environmentalism to politics. People who live in rural environments don't see as much need to worry about the environment as people who live in cities. It just so happens that today, rural people vote Republican and city people vote Democrat. That didn't use to be the case.
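A minimal sketch of the kind of adjustment I mean, with hypothetical column names (illustrative only, not a real dataset):

```python
# Sketch: compare the party coefficient on environmentalism with and
# without a population-density control. Columns are hypothetical.
import pandas as pd
import statsmodels.formula.api as smf

df = pd.read_csv("survey.csv")  # one row per respondent

naive = smf.ols("env_score ~ republican", data=df).fit()
adjusted = smf.ols("env_score ~ republican + log_density", data=df).fit()

# If the party coefficient shrinks toward zero once density is included,
# much of the apparent politics/environmentalism link is about where
# people live rather than how they vote.
print(naive.params["republican"], adjusted.params["republican"])
```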
Though, sure, if you call the Sierra Club "environmentalist", then environmentalism is politically polarized today. I don't call them environmentalists anymore; I ca...
Isn't LessWrong a disproof of this? Aren't we thousands of people? If you picked two active LWers at random, do you think the average overlap in their reading material would be 5 words? More like 100,000, I'd think.
I think it would be better not to use the word "wholesome". Using it is cheating, by letting us pretend at the same time that (A) we're explaining a new kind of ethics, which we name "wholesome", and (B) that we already know what "wholesome" means. This is a common and severe epistemological failure mode which traces back to the writings of Plato.
If you replace every instance of "wholesome" with the word "frobby", does the essay clearly define "frobby"?
It seems to me to be a way to try to smuggle virtue ethics into the consequentialist rationality community by disguising it with a different word. If you replace every instance of "wholesome" with the word "virtuous", does the essay's meaning change?
Thank you! The 1000-word max has proven to be unrealistic, so it's not too long. You and g-w1 picked exactly the same passage.
Thank you! I'm just making notes to myself here, really:
I think the problem is that each study has to make many arbitrary decisions about aspects of the experimental protocol. Each such decision will be made the same way for every subject in a single study, but will vary across studies. There are so many such decisions that, if the meta-analysis were to include them as covariates, each study would introduce enough new variables to cancel out the statistical power gained by including that study.
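A rough way to see the bookkeeping, assuming m studies and k idiosyncratic protocol choices apiece: the meta-analysis has m study-level effect estimates, but if each study's choices take values no other study shares, modeling them costs about k parameters per study, leaving

$$\text{residual df} \approx m - mk - 1 < 0 \quad \text{for any } k \geq 1.$$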
You have it backwards. The difference between a Friendly AI and an unfriendly one is entirely one of restrictions placed on the Friendly AI. So an unfriendly AI can do anything a friendly AI could, but not vice-versa.
The friendly AI could lose out because it would be restricted from committing atrocities, or at least atrocities which were strictly bad for humans, even in the long run.
Your comment that they can commit atrocities for the good of humanity without worrying about becoming corrupt is a reason to be fearful of "friendly" AIs.
By "just thinking about IRL", do you mean "just thinking about the robot using IRL to learn what humans want"? 'Coz that isn't alignment.
'But potentially a problem with more abstract cashings-out of the idea "learn human values and then want that"' is what I'm talking about, yes. But it also seems to be what you're talking about in your last paragraph.
"Human wants cookie" is not a full-enough understanding of what the human really wants, and under what conditions, to take intelligent actions to help the human. A robot learning that would ...
How is that de re and de dicto?
You're looking at the logical form and imagining that that's a sufficient understanding to start pursuing the goal. But it's only sufficient in toy worlds, where you have one goal at a time, and the mapping between the goal and the environment is so simple that the agent doesn't need to understand the value, or the target of "cookie", beyond "cookie" vs. "non-cookie". In the real world, the agent has many goals, and the goals will involve nebulous concepts, and have many considerations and conditions attached, e.g., how healthy is this cookie, how tasty is it...
So, "mesa" here means "tabletop", and is pronounced "MAY-suh"?
I think your insight is that progress counts--that counting counts. It's overcoming the Boolean mindset, in which anything that's true some of the time, must be true all of the time. That you either "have" or "don't have" a problem.
I prefer to think of this as "100% and 0% are both unattainable", but stating it as the 99% rule might be more-motivating to most people.
What do you mean by a goodhearting problem, & why is it a lossy compression problem? Are you using "goodhearting" to refer to Goodhart's Law?
I'll preface this by saying that I don't see why it's a problem, for purposes of alignment, for human values to refer to non-existent entities. This should manifest as humans and their AIs wasting some time and energy trying to optimize for things that don't exist, but this seems irrelevant to alignment. If the AI optimizes for the same things that don't exist as humans do, it's still aligned; it isn't going to screw things up any worse than humans do.
But I think it's more important to point out that you're joining the same metaphysical goose c...
When you write of "A belief in human agency", it's important to distinguish between the different conceptions of human agency on offer, corresponding to the 3 main political groups:
I think it would be more-graceful of you to just admit that it is possible that there may be more than one reason for people to be in terror of the end of the world, and likewise qualify your other claims to certainty and universality.
That's the main point of what gjm wrote. I'm sympathetic to the view you're trying to communicate, Valentine; but you used words that claim that what you say is absolute, immutable truth, and that's the worst mind-killer of all. Everything you wrote just above seems to me to be just equivocation trying to deny tha...
I say that knowing particular kinds of math, the kind that let you model the world more-precisely, and that give you a theory of error, isn't like knowing another language. It's like knowing language at all. Learning these types of math gives you as much of an effective intelligence boost over people who don't, as learning a spoken language gives you above people who don't know any language (e.g., many deaf-mutes in earlier times).
The kinds of math I mean include:
Agree. Though I don't think Turing ever intended that test to be used. I think what he wanted to accomplish with his paper was to operationalize "intelligence". When he published it, if you asked somebody "Could a computer be intelligent?", they'd have responded with a religious argument about it not having a soul, or free will, or consciousness. Turing sneakily got people to look past their metaphysics, and ask the question in terms of the computer program's behavior. THAT was what was significant about that paper.
It's a great question. I'm sure I've read something about that, possibly in some pop book like Thinking, Fast and Slow. What I read was an evaluation of the relationship of IQ to wealth, and the takeaway was that your economic success depends more on the average IQ in your country than on your personal IQ. It may have been an entire book rather than an article.
Google turns up this 2010 study from Science. The summaries you'll see there are sharply self-contradictory.
First comes an unexplained box called "The Meeting of Min...
This “c factor” is not strongly correlated with the average or maximum individual intelligence of group members but is correlated with the average social sensitivity of group members, the equality in distribution of conversational turn-taking, and the proportion of females in the group.
I have read (long ago, not sure where) a hypothesis that most people (in the educated professional bubble?) are good at cooperation, but one bad person ruins the entire team. Imagine that for each member of the group you roll a die, but you roll 1d6 for men, and 1d20 for wom...
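To spell out that model (assuming a roll of 1 marks the bad person), a team of m men and f women avoids ruin with probability

$$P(\text{no bad member}) = \left(\tfrac{5}{6}\right)^{m}\left(\tfrac{19}{20}\right)^{f},$$

which rises as the group shifts toward women at a fixed size--consistent with the proportion-of-females correlation above.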
But what makes you so confident that it's not possible for subject-matter experts to have correct intuitions that outpace their ability to articulate legible explanations to others?
That's irrelevant, because what Richard wrote was a truism. An Eliezer who understands his own confidence in his ideas will "always" be better at inspiring confidence in those ideas in others. Richard's statement leads to a conclusion of import (Eliezer should develop arguments to defend his intuitions) precisely because it's correct whether Eliezer's intuitions are correct or incorrect.
The way to dig the bottom deeper today is to get government bailouts, like bailing out companies or lenders, and like Biden's recent tuition debt repayment bill. Bailouts are especially perverse because they give people who get into debt a competitive advantage over people who don't, in an unpredictable manner that encourages people to see taking out a loan as a lottery ticket.
Finding a way for people to make money by posting good ideas is a great idea.
Saying that it should be based on the goodness of the people and how much they care is a terrible idea. Privileging goodness and caring over reason is the most well-trodden path to unreason. This is LessWrong. I go to fimfiction for rainbows and unicorns.
I think that was part of the whole "haha goodhart's law doesn't exist, making value is really easy" joke. However, it's also possible that that's... actually one of the hard-to-fake things they're looking for (along with actual competence/intelligence). See PG's Mean People Fail or Earnestness. I agree that "just give good money to good people" is a terrible idea, but there's a steelman of that which is "along with intelligence, originality, and domain expertise, being a Good Person (whatever that means) and being earnest is a really good trait in EA/LW an...
No; most philosophers today do, I think, believe that the alleged humanity of 9-fingered instances of *Homo sapiens* is a serious philosophical problem. It comes up in many "intro to philosophy" or "philosophy of science" texts or courses. Post-modernist arguments rely heavily on the belief that any sort of categorization which has any exceptions is completely invalid.
I'm glad to see Eliezer addressed this point. This post doesn't get across how absolutely critical it is to understand that {categories always have exceptions, and that's okay}. Understanding this demolishes nearly all Western philosophy since Socrates (who, along with Parmenides, Heraclitus, Pythagoras, and a few others, corrupted Greek "philosophy" from the natural science of Thales and Anaximander, who studied the world to understand it, into a kind of theology, in which one dictates to the world what it must be like).
Many philosophers have ...
I theorize that you're experiencing at least two different common, related, yet almost opposed mental re-organizations.
One, which I approve of, accounts for many of the effects you describe under "Bemused exasperation here...". It sounds similar to what I've gotten from writing fiction.
Writing fiction is, mostly, thinking, with focus, persistence, and patience, about other people, often looking into yourself to try to find some point of connection that will enable you to understand them. This isn't quantifiable, at least not to me; but I would ...
This sounds suspiciously like Plato telling people to stop looking at the shadows on the wall of the cave, turn around, and see the transcendental Forms.
To me, saying that someone is a better philosopher than Kant seems less crazy than saying that saying that someone is a better philosopher than Kant seems crazy.
Isn't the thing Rob is calling crazy that someone "believed he was learning from Kant himself live across time", rather than believing that e.g. Geoff Anders is a better philosopher than Kant?
An easy reason not to play quantum roulette is that, if your theory justifying it is right, you don't gain any expected utility; you just redistribute it, in a manner most people consider unjust, among different future yous. If your theory is wrong, the outcome is much worse. So it's at the very best a break even / lose proposition.
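One way to make that precise, assuming utility is aggregated across branches by Born measure: with winning measure p, the gamble is worth

$$p\,U(\text{win}) + (1-p)\,U(\text{dead}),$$

the same sum a classical gambler faces. Surviving only in the winning branches changes which successors experience the payoff, not the measure-weighted total; and a concave U makes even a money-fair version a net loss.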
The von Neumann-Morgenstern theory is bullshit. It assumes its conclusion. See the comments by Wei Dai and gjm here.
See the 2nd-to-last paragraph of my revised comment above, and see if any of it jogs your memory.
Republic is the reference. I'm not going to take the hours it would take to give book-and-paragraph citations, because either you haven't read the entire Republic, or else you've read it, but you want to argue that each of the many terrible things he wrote doesn't actually represent Plato's opinion or desire.
(You know it's a big book, right? 89,000 words in the Greek. If you read it in a collection or anthology, it wasn't the whole Republic.)
The task of arguing over what in /Republic/ Plato approves or disapproves of is arduous and, I think, unnece...
The most-important thing is to explicitly repudiate these wrong and evil parts of the traditional meaning of "progress":
Sorry; your example is interesting and potentially useful, but I don't follow your reasoning. This manner of fertilization would be evidence that kin selection should be strong in Chimaphila, but I don't see how this manner of fertilization is itself evidence that kin selection has taken place. Also, I have no good intuitions about what differences kin selection predicts in the variables you mentioned, except that maybe dispersion would be greater in Chimaphila because of the greater danger of inbreeding. Also, kin selection isn't controversial, so I don't know where you want to go with this comment.
Hi, see above for my email address. Email me a request at that address. I don't have your email. I just sent you a message.
ADDED in 2021: Some people tried to contact me thru LessWrong and Facebook. I check messages there like once a year. Nobody sent me an email at the email address I gave above. I've edited it to make it more clear what my email address is.
[Original first point deleted, on account of describing something that resembled Bayesian updating closely enough to make my point invalid.]
I don't think this approach applies to most actual bad arguments.
The things we argue about the most are ones over which the population is polarized, and polarization is usually caused by conflicts between different worldviews. Worldviews are constructed to be nearly self-consistent. So you're not going to be able to reconcile people of different worldviews by comparing proofs. Wrong beliefs come in se...
"Cynicism is a self-fulfilling prophecy; believing that an institution is bad makes the people within it stop trying, and the good people stop going there."
I think this is a key observation. Western academia has grown continually more cynical since the advent of Marxism, which assumes an almost absolute cynicism as a point of dogma: all actions are political actions motivated by class, except those of bourgeois Marxists who for mysterious reasons advocate the interests of the proletariat.
This cynicism became even worse with Foucault, who taught people to s...
I don't see how to map this onto scientific progress. It almost seems to be a rule that most fields spend most of their time divided for years between two competing theories or approaches, maybe because scientists always want a competing theory, and because competing theories take a long time to resolve. Famous examples include
Instea...