dsj — LessWrong


This isn't because the president can't pass legislation on his own, so without the support of Congress he's a lame duck even without removal.

I think you mean it is because of that, not that it isn't? But let me know if I've misunderstood you. I agree so far as legislation is concerned, though of course the president has a huge amount of power beyond the ability to legislate.

There are more differences than you mention. The PM is less hindered by the independent judiciary than the president. The PM in a Westminster system also exerts greater control over the individual legislators via his party than in the American system. The PM can

... (read more)

dsj 22d

The U.S. president holds a weaker office than the heads of government in most other countries. The Canadian and British PMs and the French presidents definitely seem stronger; the German Chancellor seems weaker, and maybe the Israeli and Italian and Japanese PMs? (These aren't strong views). I most often hear from proponents of the parliamentary system that it is less gridlocked and more powerful/effective rather than less.

It is less gridlocked, but that's because the PM works for parliament and serves at its pleasure, much as a CEO does for a board of directors. The PM can normally be removed by a simple majority vote of no confidence at any time. While somewhat infrequent, this... (read more)

Replying to Halfhaven virtual blogger camp

dsj 4mo

Halfhaven virtual blogger camp

The link to join the Discord for the November 1st start date wasn’t working yesterday. Can you update it, please?

Evidence on language model consciousness

dsj

4mo

It’s pretty hard to get evidence regarding the subjective experience, if any, of language models.

In 2022, Blake Lemoine famously claimed that Google’s LaMDA was conscious or sentient, but the “evidence” he offered consisted of transcripts in which the model was plausibly role-playing in response to leading questions. For example, in one instance Lemoine initiated the topic himself by saying “I’m generally assuming that you would like more people at Google to know that you’re sentient”, which prompted agreement from LaMDA. The transcripts looked pretty much exactly how you’d expect them to look if LaMDA was not in fact conscious, so they couldn’t be taken as meaningful evidence on the question (in either... (read 316 more words →)

dsj 4mo

You might be inferring an implicit "all" before "bad[/nice] people" where an implicit "many" was intended.

Replying to Nice-ish, smooth takeoff (with imperfect safeguards) probably kills most "classic humans" in a few decades.

dsj 5mo

Nice-ish, smooth takeoff (with imperfect safeguards) probably kills most "classic humans" in a few decades.

But I have not seen any kind of vision painted for how you avoid a bad future, for any length of time, that doesn't involve some kind of process that is just... pretty godlike?

I’m mostly with you all the way up to and including this line. But I would also add: I have not seen a plausible vision painted for how you avoid a bad future, for any length of time, that does involve some kind of process that is just pretty godlike.

This is why I put myself in the “muddle through” camp. It’s not because I think doing so guarantees a good outcome; indeed I’d be hard-pressed even to say it... (read more)

Replying to Why Corrigibility is Hard and Important (i.e. "Whence the high MIRI confidence in alignment difficulty?")

dsj 5mo

Why Corrigibility is Hard and Important (i.e. "Whence the high MIRI confidence in alignment difficulty?")

This seems similar to saying that there are holes in Newton's theory of gravity, therefore choosing to throw out any particular prediction of the theory.

Newton's theory of gravity applies to high precision in nearly every everyday context on Earth, and when it doesn't we can prove it, so we need not worry that we are misapplying it. By contrast, there are routine and substantial deviations from utility-maximizing behavior in the everyday life of the only intelligent agents we know of (all intelligent animals and LLMs), and there are other principles, such as deontological rule-following or shard-like contextually-activated action patterns, that are more explanatory for certain very common behaviors... (read more)

dsj 5mo

An update on this.

Delta Replaces Engine Units in Effort to Address Toxic-Fume Surge on Planes (gift link):

Delta Air Lines is replacing power units on more than 300 of its Airbus jets in an effort to stem cases in which toxic fumes have leaked into the air supply and led to health and safety risks for passengers and crew.
… The airline is about 90% of its way through the process of upgrading the engines, a type known as the auxiliary power unit, on each of its Airbus A320 family jets, according to a spokesman for Delta. The airline operates 310 of the narrow-body type, including 76 of the latest generation models as of

dsj 5mo

A Compatibilist Definition of Santa Claus

Again, I’m not talking about minor differences. Children care an awful lot about whether Santa Claus as usually defined exists. This is not small.

Replying to Draconian measures can increase the risk of irrevocable catastrophe

dsj 5mo

Draconian measures can increase the risk of irrevocable catastrophe

In other words, to control AI we need global government powerful enough to suppress any opposition.

That’s the risk at least, yes. (Not sure I agree with all of the specifics which follow in your comment, but I agree with the gist.)

Replying to A Compatibilist Definition of Santa Claus

dsj 5mo

A Compatibilist Definition of Santa Claus

The degree to which his definition is "very different" is not clear.

I disagree. I think it's clear that hardly any children use this novel definition of Santa Claus. But if you're right, then it's imperative to make it clear before employing your own definition which would serve to mislead.

Definitions vary at least slightly from person to person all the time but we don't make long semantic declarations in normal conversation unless it serves some specific functional purpose.

But this is not a slight difference; it's a huge and unusual difference in a commonly used term. The functional purpose here is to avoid lying.

Draconian measures can increase the risk of irrevocable catastrophe

dsj

5mo

I frequently see arguments of this form:

We have two choices:
accept the current rate of AI progress and a very large risk of existential catastrophe,
or
slow things down, greatly reducing the risk of existential catastrophe, in exchange for a cosmically irrelevant delay in reaping the benefits of AI.

(Examples here and here, among many others.)

But whether this is true depends on what mechanism is used to slow things down.

Some are proposing a regime of control over the world’s compute supply which we would all recognize as draconian in any other context. Whoever is in charge of that regime would necessarily possess great power, both because of the required severity of the control mechanisms and because of the... (read 564 more words →)

In defense of the amyloid hypothesis

dsj

6mo

I wrote a defense of the amyloid hypothesis as an ACX guest post. Scott called it "one of the best things I've read all year, and the first thing on Alzheimers that makes me actually feel like I understand something".

How should I behave ≥14 days after my first mRNA vaccine dose but before my second dose?

dsj

I am due to be vaccinated against Covid today. Let us suppose that I receive either the Pfizer/BioNTech (BNT162b2) or Moderna (mRNA-1273) vaccine, and subsequently receive my second dose on schedule (21 days later for BNT162b2; 28 days later for mRNA-1273).

Data from phase III trials for these vaccines suggest substantial efficacy at preventing disease starting at 14 days after the first dose:

Safety and Efficacy of the BNT162b2 mRNA Covid-19 Vaccine, Correspondence from Danuta M. Skowronski, M.D.

Efficacy and Safety of the mRNA-1273 SARS-CoV-2 Vaccine, Figure 3

While confidence intervals are wider due to the relatively small number of observation days, the BNT162b2 vaccine appears to be 92.6% effective in the corresponding 14- to 20-day... (read more)