LESSWRONG
LW

All of DragonGod's Comments + Replies

Announcement: Learning Theory Online Course

I'm currently working through Naive Set Theory (alongside another text). I'll take this as a recommendation to work through the other textbooks later.

My maths level is insufficient for the course I'd guess.

I would appreciate it if videos of the meetings could be recorded. Or maybe I should just stick around and hope this will be run again next year.

We probably won't just play status games with each other after AGI

DragonGod4mo100

When I saw the beginning/title I thought the post would be a refutation of the material scarcity thesis; I found myself disappointed it is not.

6Matthew Barnett4mo

I suppose that means it might be worth writing an additional post that more directly responds to the idea that AGI will end material scarcity. I agree that thesis deserves a specific refutation.

DragonGod's Shortform

DragonGod5mo60

There is not an insignificant sense of guilt/of betraying myself from 2023 and my ambitions from before.

And I don't want to just end up doing irrelevant TCS research that only a few researchers in a niche field will ever care about.

It's not high impact research.

And it's mostly just settling. I get the sense that I enjoy theoretical research, I don't currently feel poised to contribute to the AI safety problem, I seem to have an unusually good (at least it appears so to my limited understanding) opportunity to pursue a boring TCS PhD in some niche field tha... (read more)

DragonGod's Shortform

DragonGod5mo70

I still want to work on technical AI safety eventually.

I feel like I'm on quite far off path from directly being useful in 2025 than I felt in 2023.

And taking a detour to do a TCS PhD that isn't directly pertinent to AI safety (current plan) feels like not contributing.

Cope is that becoming a strong TCS researcher will make me better poised to contribute to the problem, but short timelines could make this path less viable.

[Though there's nothing saying I can't try to work on AI on the side even if it isn't the focus of my PhD.]

6DragonGod5mo

There is not an insignificant sense of guilt/of betraying myself from 2023 and my ambitions from before. And I don't want to just end up doing irrelevant TCS research that only a few researchers in a niche field will ever care about. It's not high impact research. And it's mostly just settling. I get the sense that I enjoy theoretical research, I don't currently feel poised to contribute to the AI safety problem, I seem to have an unusually good (at least it appears so to my limited understanding) opportunity to pursue a boring TCS PhD in some niche field that few people care about. I don't think I'll be miserable pursuing the boring TCS PhD or not enjoy it, or anything of the sort. It's just not directly contributing to what I wanted to contribute to. It's somewhat sad and it's undignified (but it's less undignified than the path I thought I was on at various points in the last 15 months).

(The) Lightcone is nothing without its people: LW + Lighthaven's big fundraiser

DragonGod6mo123

I think LW is a valuable intellectual hub and community.

Haven't been an active participant of recent, but it's still a service I occasionally find myself relying on explicitly, and I prefer the world where it continues to exist.

[I donated $20. Am unemployed and this is a nontrivial fraction of my disposable income.]

DeepSeek beats o1-preview on math, ties on coding; will release weights

DragonGod6mo40

o1's reasoning trace also does this for different languages (IIRC I've seen Chinese and Japanese and other languages I don't recognise/recall), usually an entire paragraph not a word, but when I translated them it seemed to make sense in context.

Change My Mind: Thirders in "Sleeping Beauty" are Just Doing Epistemology Wrong

DragonGod7mo64

This is not a rhetorical question:) What do you mean by "probability" here?

Yeah, since posting this question:

I have updated towards thinking that it's in a sense not obvious/not clear what exactly "probability" is supposed to be interpreted as here.

And once you pin down an unambiguous interpretation of probability the problem dissolves.

I had a firm notion in mind for what I thought probability meant. But Rafael Harth's answer really made me unconfident that the notion I had in mind was the right notion of probability for the question.

7TsviBT7mo

I think the question is underdefined. Some bets are posed once per instance of you, some bets are posed once per instance of a world (whatever that means), etc.

Change My Mind: Thirders in "Sleeping Beauty" are Just Doing Epistemology Wrong

DragonGod7mo20

I have not read all of them!

Change My Mind: Thirders in "Sleeping Beauty" are Just Doing Epistemology Wrong

DragonGod7mo20

My current position now is basically:

Actually, I'm less confident and now unsure.
Harth's framing was presented as an argument re: the canonical Sleeping Beauty problem.
And the question I need to answer is: "should I accept Harth's frame?"
I am at least convinced that it is genuinely a question about how we define probability.
There is still a disconnect though.
While I agree with the frequentist answer, it's not clear to me how to backgpropagate this in a Bayesian framework.
Suppose I treat myself as identical to all other agents in the reference class.
I kno

DragonGod7mo70

I'm curious how your conception of probability accounts for logical uncertainty?

3Rafael Harth7mo

I count references within each logical possibility and then multiply by their "probability". Here's a super contrived example to explain this. Suppose that if the last digit of pi is between 0 and 3, Sleeping Beauty experiments work as we know them, whereas if it's between 4 and 9, everyone in the universe is miraculously compelled to interview Sleeping Beauty 100 times if the coin is tails. In this case, I think P(coin heads|interviewed) is 0.4⋅13+0.6⋅1101. So it doesn't matter how many more instances of the reference class there are in one logical possibility; they don't get "outside" their branch of the calculation. So in particular, the presumptuous philosopher problem doesn't care about number of classes at all. In practice, it seems super hard to find genuine examples of logical uncertainty and almost everything is repeated anyway. I think the presumptuous philosopher problem is so unintuitive precisely because it's a rare case of actual logical uncertainty where you genuinely cannot count classes.

Change My Mind: Thirders in "Sleeping Beauty" are Just Doing Epistemology Wrong

DragonGod7mo150

So in this case, I agree that like if this experiment is repeated multiple times and every Sleeping Beauty version created answered tails, the reference class of Sleeping Beauty agents would have many more correct answers than if the experiment is repeated many times and every sleeping Beauty created answered heads.

I think there's something tangible here and I should reflect on it.

I separately think though that if the actual outcome of each coin flip was recorded, there would be a roughly equal distribution between heads and tails.

And when I was thinking t... (read more)

7Rafael Harth7mo

What I'd say is that this corresponds to the question, "someone tells you they're running the Sleeping Beauty experiment and just flipped a coin; what's the probability that it's heads?". Difference reference class, different distribution; probability now is 0.5. But this is different from the original question, where we are Sleeping Beauty.

Measure7mo101

I separately think though that if the actual outcome of each coin flip was recorded, there would be a roughly equal distribution between heads and tails.

Importantly, this is counting each coinflip as the "experiment", whereas the above counts each awakening as the "experiment". It's okay that different experiments would see different outcome frequencies.

2DragonGod7mo

My current position now is basically:

Change My Mind: Thirders in "Sleeping Beauty" are Just Doing Epistemology Wrong

DragonGod7mo31

I mean I am not convinced by the claim that Bob is wrong.

Bob's prior probability is 50%. Bob sees no new evidence to update this prior so the probability remains at 50%.

I don't favour an objective notion of probabilities. From my OP:

2. Bayesian Reasoning

Probability is a property of the map (agent's beliefs), not the territory (environment).

For an observation O to be evidence for a hypothesis H, P(O|H) must be > P(O|¬H).

The wake-up event is equally likely under both Heads and Tails scenarios, thus provides no new information to update priors.

The o

... (read more)

5Charlie Steiner7mo

Yes, Bob is right. Because the probability is not a property of the coin. It's 'about' the coin in a sense, but it also depends on Bob's knowledge, including knowledge about location in time (Dave) or possible worlds (Carol).

Change My Mind: Thirders in "Sleeping Beauty" are Just Doing Epistemology Wrong

DragonGod7mo4-1

I mean I think the "gamble her money" interpretation is just a different question. It doesn't feel to me like a different notion of what probability means, but just betting on a fair coin but with asymmetric payoffs.

The second question feels closer to actually an accurate interpretation of what probability means.

7Gurkenglas7mo

https://www.lesswrong.com/posts/Mc6QcrsbH5NRXbCRX/dissolving-the-question

Uncertainty in all its flavours

DragonGod1yΩ120

i.e. if each forecaster $w \in W$ has an first-order belief $f (w) \in B (S)$ , and $w \in B (S)$ is your second-order belief about which forecaster is correct, then $(w ⊳_{W S} f) \in B (S)$ should be your first-order belief about the election.

I think there might be a typo here. Did you instead mean to write: " $w \in B (W)$ " for the second order beliefs about the forecasters?

Order Matters for Deceptive Alignment

DragonGod2y51

The claim is that given the presence of differential adversarial examples, the optimisation process would adjust the parameters of the model such that it's optimisation target is the base goal.