Humans have always been misaligned. Things now are probably significantly better in terms of human alignment than at almost any time in history (citation needed) due to high levels of education and broad agreement about many things that we take for granted (e.g. the limits of free trade are debated, but there has never been so much free trade). So you would need to think that something important was different now for there to be some kind of new existential risk.
One candidate is that as tech advances, the amount of damage a small misaligned group could do is g...
One tip for research of this kind is to not only measure recall, but also precision. It's easy to block 100% of dangerous prompts by blocking 100% of prompts, but obviously that doesn't work in practice. The actual task that labs are trying to solve is to block as many unsafe prompts as possible while rarely blocking safe prompts, or in other words, looking at both precision and recall.
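The degenerate "block everything" policy makes this concrete. A quick sketch (function and variable names are mine, just for illustration):

```python
def precision_recall(predictions, labels):
    """Precision and recall for a binary 'block this prompt' classifier.

    predictions/labels: lists of booleans, True = flagged / actually dangerous.
    """
    tp = sum(p and l for p, l in zip(predictions, labels))
    fp = sum(p and not l for p, l in zip(predictions, labels))
    fn = sum(not p and l for p, l in zip(predictions, labels))
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall

# A classifier that blocks everything gets perfect recall but terrible precision.
labels    = [True, False, False, False]  # 1 dangerous prompt out of 4
block_all = [True, True, True, True]
print(precision_recall(block_all, labels))  # → (0.25, 1.0)
```

100% recall, 25% precision: useless in practice, which is why you have to report both numbers.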
Of course with truly dangerous models and prompts, you do want ~100% recall, and in that situation it's fair to say that nobody should ever be able to build a bioweapon. But...
The pivotal act link is broken, fyi.
Gemini V2 (1206 experimental which is the larger model) one boxes, so.... progress?
I'm probably too conflicted to give you advice here (I work on safety at Google DeepMind), but you might want to think through, at a gears level, what could concretely happen with your work that would lead to bad outcomes. Then you can balance that against positives (getting paid, becoming more familiar with model outputs, whatever).
You might also think about how your work compares to whoever would replace you on average, and what implications that might have as well.
This is great data! I'd been wondering about this myself.
Where were you measuring air quality? How far from the stove? Same place every time?
Practicing LLM prompting?
I haven't heard the p-zombie argument before, but I agree that it's at least some Bayesian evidence that we're not in a sim.
Probably 3 needs to be developed further, but this is the first new piece of evidence I've seen since I first encountered the simulation argument in like 2005.
Are we playing the question game because the thread was started by Rosencranz? Is China doing well in the EV space a bad thing?
Is it the case that the tech would exist without him? I think that's pretty unclear, especially for SpaceX, where despite other startups in the space, nobody else managed to radically reduce the cost per launch in a way that transformed the industry.
Even for Tesla, which seems more pedestrian (heh) now, there were a number of years where they had the only viable car in the market. It was only once they proved it was feasible that everyone else piled in.
Progress in ML looks a lot like, we had a different setup with different data and a tweaked algorithm and did better on this task. If you want to put an asterisk on o3 because it trained in some specific way that's different from previous contenders, then basically every ML advance is going to have a similar asterisk. Seems like a lot of asterisking.
Hm, I think the main thrust of this post misses something, which is that different conditions, even contradictory conditions, can easily happen locally. Obviously, it can be raining in San Francisco and sunny in LA, and you can have one person wearing a raincoat in SF and the other on the beach in LA with no problem, even if they are part of the same team.
I think this is true of wealth inequality.
Carnegie or Larry Page or Warren Buffett got their money in a non-exploitative way, by being better than others at something that was extremely socially valuable....
It seems very strange to me to say that they cheated, when the public training set is intended to be used exactly for training. They did what the test specified! And they didn't even use all of it.
The whole point of the test is that some training examples aren't going to unlock the rest of it. What training definitely does is teach the model how to output the JSON in the right format, and likely how to think about what to even do with these visual puzzles.
Do we say that humans aren't a general intelligence even though for ~all valuable tasks, you have to take some time to practice, or someone has to show you, before you can do it well?
More pointedly, I didn't see anyone complaining about the previous champion doing 100%-ARC-only online training while trying to solve ARC, so why would you complain about weaker offline training as a small part of a giant pretraining corpus?
(Generating millions of examples to train on, yes, people did complain about that and arguably that is 'cheating', but 'not using froze...
Why does RL necessarily mean that AIs are trained to plan ahead?
"Reliable fact recall is valuable, but why would o1 pro be especially good at it? It seems like that would be the opposite of reasoning, or of thinking for a long time?"
Current models were already good at identifying and fixing factual errors when run over a response and asked to critique and fix it. It works maybe 80% of the time at identifying whether there's a mistake, and can fix it at a somewhat lower rate.
So not surprising at all that a reasoning loop can do the same thing. Possibly there's some other secret sauce in there, but just critiquing and fixing mistakes is probably enough to see the reported gains in o1.
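As a sketch of the control flow I mean by a critique-and-fix loop (the `toy_model` here is a stand-in I made up, not any real API):

```python
def critique_and_fix(model, draft, max_rounds=3):
    """Loop: ask the model to critique an answer, then to fix it, until clean."""
    answer = draft
    for _ in range(max_rounds):
        critique = model(f"Find any factual errors in: {answer}")
        if "no errors" in critique.lower():
            break
        answer = model(f"Fix these errors: {critique}\n\n{answer}")
    return answer

# Toy stand-in for a real model, just to show the loop terminating.
def toy_model(prompt):
    if prompt.startswith("Find"):
        return "No errors." if "1969" in prompt else "The year is wrong; it was 1969."
    return prompt.rsplit("\n\n", 1)[-1].replace("1968", "1969")

fixed = critique_and_fix(toy_model, "Apollo 11 landed in 1968.")
print(fixed)  # → Apollo 11 landed in 1969.
```

If the critic catches mistakes ~80% of the time per pass, a few rounds of this compounds into a noticeably lower error rate, which is consistent with the reported o1 gains.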
Aha, thanks, that makes sense.
One way this could happen is searching for jailbreaks in the space of paraphrases and synonyms of a benign prompt.
Why would this produce fake/unlikely jailbreaks? If the paraphrases and such are natural, then isn't the nearness to a real(istic) prompt enough to suggest that the jailbreak found is also realistic? Of course you can adversarially generate super unrealistic things, but does that necessarily happen with paraphrasing-type attacks?
You may recall certain news items last February around Gemini and diversity that wiped many billions off of Google's market cap.
There's a clear financial incentive to make sure that models say things within expected limits.
There's also this: https://www.wired.com/story/air-canada-chatbot-refund-policy/
Really cool project! And the write-up is very clear.
In the section about options for reducing the hit to helpfulness, I was surprised you didn't mention scaling the vector you're adding or subtracting -- did you try different weights? I would expect that you can tune the strength of the intervention by weighting the difference in means vector up or down.
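i.e. something like this (numpy sketch with made-up names: `h` stands in for a hidden activation, `v` for the difference-of-means vector):

```python
import numpy as np

def steer(h, v, alpha):
    """Shift activation h along the (normalized) difference-of-means direction v.

    alpha tunes the strength: 0 = no intervention, negative = subtract.
    """
    v_hat = v / np.linalg.norm(v)
    return h + alpha * v_hat

rng = np.random.default_rng(0)
h = rng.normal(size=8)  # stand-in for a residual-stream activation
v = rng.normal(size=8)  # stand-in for mean(harmful) - mean(harmless)
weak, strong = steer(h, v, 0.5), steer(h, v, 4.0)
```

Sweeping alpha would let you trade off refusal behavior against the helpfulness hit directly.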
The usual reason is compounding. If you have an asset that is growing over time, paying taxes from it means not only do you have less of it now, but the amount you pulled out now won't compound indefinitely into the future. You want to compound growth for as long as possible on as much capital as possible. If you could diversify without paying capital gains you would, but since the choice is something like, get gains on $100 in this one stock, or get gains on $70 in this diversified basket of stocks, you might stay with the concentrated position even if you would prefer to be diversified.
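With made-up numbers (7%/yr growth for 30 years, 30% tax paid up front if you diversify, ignoring taxes at the end):

```python
def compound(principal, rate, years):
    """Value of `principal` growing at `rate` per year for `years` years."""
    return principal * (1 + rate) ** years

concentrated = compound(100, 0.07, 30)  # keep the $100 position
diversified  = compound(70, 0.07, 30)   # pay the tax now, reinvest the $70
print(round(concentrated), round(diversified))  # → 761 533
```

The haircut compounds too, which is why people hang on to concentrated positions they would otherwise prefer to diversify.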
This reminds me of a Brin short story which I think exactly discusses what you're talking about: https://www.davidbrin.com/tankfarm.htm
Cool concept. I'm a bit puzzled by one thing though -- presumably every time you use a tether, it slows down and drops to a lower orbit. How do you handle that? Is the idea that it's so much more massive than the rockets it's boosting that its slowdown is negligible? Or do we have to go spin it back up every so often?
One way to regain energy is to run the tether in reverse - drop something from a faster orbit back into the atmosphere, siphoning off some of its energy along the way. If every time you sent one spacecraft up another was lined up to come back down, that would save a lot of trouble.
But you'll still need to do orbital corrections, offset atmospheric drag, and allow for imbalances, so yeah, it would seem like you still need a pretty beefy means of propulsion on this thing, which is oddly unmentioned for being key to the whole design.
Tethers can theoretically use more efficient propulsion because their thrust requirements are lower. The argon Hall effect thrusters on Starlink satellites have around 7x the specific impulse (fuel efficiency) of Starship engines, while needing ~7x the energy per unit of impulse (since KE = mv^2/2 while impulse = mv) and producing a tiny fraction of the thrust. This energy could come from a giant solar panel rather than the fuel, and every once in a while it could be refueled with a big tanker of liquid argon.
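A quick check of that energy ratio (for a fixed impulse J = mv, the energy is E = mv^2/2 = Jv/2, so energy per unit impulse scales linearly with exhaust velocity; the velocity figure below is illustrative, not an exact engine spec):

```python
def energy_per_impulse(exhaust_velocity):
    # Impulse J = m*v and energy E = m*v^2/2 = J*v/2,
    # so energy per unit of impulse is just v/2.
    return exhaust_velocity / 2

chem_v = 3_300       # m/s, illustrative chemical-rocket exhaust velocity
hall_v = 7 * chem_v  # ~7x the specific impulse
ratio = energy_per_impulse(hall_v) / energy_per_impulse(chem_v)
print(ratio)  # → 7.0
```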
"If you are playing with a player who thinks that "all reds" is a strong hand, it can take you many, many hands to figure out that they're overestimating their hands instead of just getting anomalously lucky with their hidden cards while everyone else folds!"
As you guessed, this is wrong. If someone is playing a lot of hands, your first hypothesis is that they are too loose and making mistakes. At that point, each additional hand they play is evidence in favor of fishiness, and you can quickly become confident that they are bad.
Mistakes in the other direct...
That all sounds right, but I want to invert your setup.
If someone is playing too many hands, your first hypothesis is that they are too loose and making mistakes. If someone folds for 30 minutes, then steals the blinds once, then folds some more, you will have a hard time telling whether they're playing wrong or have had a bad run of cards.
But in either case, it is going to be significantly harder for them to tell, from inside their own still-developing understanding of the game, whether the things that are happening to them are evidence about their own mi...
I wonder if there's a way to give the black-box recommender a different objective function. CTR is bad for the obvious clickbait reasons, but signals for user interaction are still valuable if you can find the right signal to use.
I would propose that returning to the site some time in the future is a better signal of quality than CTR, assuming the future is far enough away. You could try a week, a month, and a quarter.
This is maybe a good time to use reinforcement learning, since the signal is far away from the decision you need to make. When someone interacts with an article, reward the things they interacted with n weeks ago. Combined with karma, I bet that would be a better signal than CTR.
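The delayed-credit-assignment part could be as simple as this sketch (all names and the data shapes are mine):

```python
from collections import defaultdict
from datetime import date, timedelta

def delayed_rewards(impressions, return_visits, delay_weeks=4):
    """Credit each shown article if the user came back within ~delay_weeks.

    impressions: list of (user, article, date shown)
    return_visits: set of (user, date of a later visit)
    """
    window = timedelta(weeks=delay_weeks)
    reward = defaultdict(float)
    for user, article, shown in impressions:
        if any(u == user and shown < d <= shown + window
               for u, d in return_visits):
            reward[article] += 1.0
    return dict(reward)

rewards = delayed_rewards(
    [("alice", "a1", date(2024, 1, 1)), ("bob", "a2", date(2024, 1, 1))],
    {("alice", date(2024, 1, 20))},
)
print(rewards)  # → {'a1': 1.0}
```

You'd then feed these delayed rewards (rather than clicks) into whatever learner ranks the articles.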
Children are evidently next word completers.
I would be very unhappy if a non-disparagement agreement were sprung on me when I left the company. And I would be very reluctant to sign one entering any company.
Luckily we don't have those at Google DeepMind.
I work at DeepMind and have been influenced by METR. :)
If you want a far future fictional treatment of this kind of situation, I recommend Surface Detail by Iain Banks.
I think your model is a bit simplistic. METR has absolutely influenced the behavior of the big labs, including DeepMind. Even if all impact goes through the big labs, you could have more influence outside of the lab than as one of many employees within. Being the head of a regulatory agency that oversees the labs sets policy in a much more direct way than a mid level exec within the company can.
I went back to finish college as an adult, and my main surprise was how much fun it was. It probably depends on what classes you have left, but I took every AI class offered and learned a ton that is still relevant to my work today, 20 years later. Even the general classes were fun -- it turns out it's easy to be an excellent student if you're used to working a full work week, and being a good student is way more pleasant and less stressful than being a bad one, or at least it was for me.
I'm not sure what you should do necessarily, but given that you're t...
This is very well written and compelling. Thanks for posting it!
This is a great post. I knew that at the top end of the income distribution in the US people have more kids, but didn't understand how robust the relationship seems to be.
I think the standard evbio explanation here would ride on status -- people at the top of the tribe can afford to expend more resources for kids, and also have more access to opportunities to have kids. That would predict that we wouldn't see a radical change as everyone got more rich -- the curve would slide right and the top end of the distribution would have more kids but not necessaril...
Heh, that's why I put "strong" in there!
One big one is that the first big spreading event happened at a wet market where people and animals are in close proximity. You could check densely peopled places within some proximity of the lab to figure out how surprising it is that it happened in a wet market, but certainly animal spillover is much more likely where there are animals.
Edit: also it's honestly kind of a bad sign that you aren't aware of evidence that tends against your favored explanation, since that mostly happens during motivated reasoning.
We're here to test the so-called tower of babel theory. What if, due to some bizarre happenstance, humanity had thousands of languages that change all the time instead of a single universal language like all known intelligent species?
You should ignore the EY style "no future" takes when thinking about your future. This is because if the world is about to end, nothing you do will matter much. But if the world isn't about to end, what you do might matter quite a bit -- so you should focus on the latter.
One quick question to ask yourself is: are you more likely to have an impact on technology, or on policy? Either one is useful. (If neither seems great, then consider earning to give, or just find a way to add value in society in other ways.)
Once you figure that out, the next step is almos...
I agree that it's bad to raise a child in an environment of extreme anxiety. Don't do that.
Also try to avoid being very doomy and anxious in general, it's not a healthy state to be in. (Easier said than done, I realize.)
I think you should have a kid if you would have wanted one without recent AI progress. Timelines are still very uncertain, and strong AGI could still be decades away. Parenthood is strongly value creating and extremely rewarding (if hard at times) and that's true in many many worlds.
In fact it's hard to find probable worlds where having kids is a really bad idea, IMO. If we solve alignment and end up in AI utopia, having kids is great! If we don't solve alignment and EY is right about what happens in a fast takeoff world, it doesn't really matter if you ha...
If we don't solve alignment and EY is right about what happens in a fast takeoff world, it doesn't really matter if you have kids or not.
This IMO misses the obvious fact that you spend your life with a lot more anguish if you think that not just you, but your kid is going to die too. I don't have a kid but everyone who does seems to describe a feeling of protectiveness that transcends any standard "I really care about this person" one you could experience with just about anyone else.
Having kids does mean less time to help AI go well, so maybe it’s not so much of a good idea if you’re one of the people doing alignment work.
The thing you're missing is called instruction tuning. You gather a series of prompt/response pairs and fine tune the model over that data. Do it right and you have a chatty model.
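Concretely, the data prep step looks something like this (the chat template and names here are made up; the actual format varies by model):

```python
def format_example(prompt, response, eos="</s>"):
    """Render one prompt/response pair into a single training string.

    The loss is then computed over the response tokens (prompt tokens are
    often masked out), which is what makes the tuned model 'chatty'.
    """
    return f"User: {prompt}\nAssistant: {response}{eos}"

pairs = [
    ("What is 2+2?", "2+2 is 4."),
    ("Name a primary color.", "Red is a primary color."),
]
corpus = [format_example(p, r) for p, r in pairs]
```

Fine-tune a pretrained model on a large corpus of strings like these and it learns to continue "User: ... Assistant:" with a helpful response instead of arbitrary next-word completion.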
Thanks, Zvi, these roundups are always interesting.
I have one small suggestion, which is that you limit yourself to one Patrick link per post. He's an interesting guy but his area is quite niche, and if people want his fun stories about banking systems they can just follow him. I suspect that people who care about those things already follow him, and people who don't aren't that interested to read four items from him here.
I feel like a lot of the issues in this post are that the published RSPs are not very detailed and most of the work to flesh them out is not done. E.g. the comparison to other risk policies highlights lack of detail in various ways.
I think it takes a lot of time and work to build out something with lots of analysis and detail, potentially years of work to really do it right. And yes, much of that work hasn't happened yet.
But I would rather see labs post the work they are doing as they do it, so people can give feedback and input. If labs do so, the framewo...
Thanks for your comment.
I feel like a lot of the issues in this post are that the published RSPs are not very detailed and most of the work to flesh them out is not done.
I strongly disagree with this. In my opinion, a lot of the issue is that RSPs have been thought from first principles without much consideration for everything the risk management field has done, and hence doing wrong stuff without noticing.
It's not a matter of how detailed they are; they get the broad principles wrong. As I argued (the entire table is about this) I think...
I agree with all of this. It's what I meant by "it's up to all of us."
It will be a signal of how things are going if, in a year, we still have only vague policies, or if there has been real progress in operationalizing the safety levels, detection, what the right reactions are, etc.
I think there are two paths, roughly, that RSPs could send us down.
But I also suspect that people on the more cynical side aren't going to be persuaded by a post like this. If you think that companies are pretending to care about safety but really are just racing to make $$, there's probably not much to say at this point other than, let's see what happens next.
This seems wrong to me. We can say all kinds of things, like:
If you think that Anthropic and other labs that adopt these are fundamentally well meaning and trying to do the right thing, you'll assume that we are by default heading down path #1. If you are more cynical about how companies are acting, then #2 may seem more plausible.
I disagree that what you think about a lab's internal motivations should be very relevant here. For any particular lab/government adopting any particular RSP, you can just ask, does having this RSP make it easier or harder to implement future good legislation? My sense is that the answ...
New York City Mayor Eric Adams has been using ElevenLabs AI to create recordings of him in languages he does not speak and using them for robocalls. This seems pretty not great.
Can you say more about why you think this is problematic? Recording his own voice for a robocall is totally fine, so the claim here is that AI involvement makes it bad?
Yes he should disclose somewhere that he's doing this, but deepfakes with the happy participation of the person whose voice is being faked seems like the best possible scenario.
FWIW as an executive working on safety at Google, I basically never consider my normal working activities in light of what they would do to Google's stock price.
The exception is around public communication. There I'm very careful because it's asymmetrical -- I could potentially cause a PR disaster that would affect the stock, but I don't see how I could give a talk that's so good that it helps it.
Maybe a plug pulling situation would be different, but I also think it's basically impossible for it to be a unilateral situation, and if we're in such a moment, I hardly think any damage would be contained to Google's stock price, versus say the market as a whole.
Hmm, that is something that seems pretty likely to change, I think?
I expect safety researchers to be consulted quite a bit on regulations that will affect Google pretty heavily, and e.g. any given high-level safety researcher currently has a decent chance to testify in front of Congress, and like, I would want them to feel comfortable taking actions that definitely would have a large effect on the Google stock price (like saying that Google's AGI program should be shut down completely, or nationalized, or Google should be held liable for some damages caused by its AI systems).
How much do you think that your decisions affect Google's stock price? Yes maybe more AI means a higher price, but on the margin how much will you be pushing that relative to a replacement AI person? And mostly the stock price fluctuates on stuff like how well the ads business is doing, macro factors, and I guess occasionally whether we gave a bad demo.
It feels to me like the incentive is just so diffuse that I wouldn't worry about it much.
Your idea of just donating extra gains also seems fine.
As I said in the dialogue, I think as a safety engineer, especially as someone who might end up close to the literal or metaphorical "stop button", the effect here seems to me to be potentially quite large, especially in aggregate.
That's not correct, or at least not how my Google stock grants work. The price is locked in at grant time, not vest time. In practice what that means is that you get x shares every month, which counts as income when multiplied by the current stock price.
And then you can sell them or whatever, including having a policy that automatically sells them as soon as they vest.
The star ratings are an improvement; I had also felt that "breakthrough" was overselling many of the items last week.
However, stars are very generic and don't capture the concept of a breakthrough very well. You could consider a lightbulb.
I also asked chatgpt to create an emoji of an AI breakthrough, and after some iteration it came up with this: https://photos.app.goo.gl/sW2TnqDEM5FzBLdPA
Use it if you like it!
Thanks for putting together this roundup, I learn things from it every time.
I agree with this.
Consider a hypothetical: there are two drugs we could use to execute prisoners sentenced to death. One of them causes excruciating pain; the other does not, but costs more.
Would we feel that we might as well use the torture drug? After all, the dude is dead afterward, so he doesn't care either way.
I have a pretty strong intuition that those drugs are not similar. Same thing with the anesthesia example.
I work at GDM so obviously take that into account here, but in my internal conversations about external benchmarks we take cheating very seriously -- we don't want eval data to leak into training data, and have multiple lines of defense to keep that from happening. It's not as trivial as you might think to avoid, since papers and blog posts and analyses can sometimes have specific examples from benchmarks in them, unmarked -- and while we do look for this kind of thing, there's no guarantee that we will be perfect at finding them. So it's completely possib...