oumuamua — LessWrong

LESSWRONG
LW

Replying toMonthly Roundup #29: April 2025

Monthly Roundup #29: April 2025

I would assume this is because wasting time (which is to the detriment of your opponent, and which he cannot control) in the first example is a not instrumental to achieving your goal. It is merely a side-effect. "Thou shall not profit from wasting time".

If playing optimally involves making decisions that make the game go longer (such as waiting to draw additional countermagic or whatever), so be it.

That said, I'm surprised Zvi said "match wp" here - I assume this is an oversight on his part. He should just have written "game wp".

Replying toAI #104: American State Capacity on the Brink

oumuamua1y

AI #104: American State Capacity on the Brink

I just tried multiplying 13-digit numbers with o3-mini (high). My approach was to ask it to explain a basic multiplication algorithm to me, and then carry it out. On the first try it was lazy and didn't actually follow the algorithm (it just told me "it would take a long time to actually carry out all the shifts and multiplications...", and it got the result wrong.

Then I told it to follow the algorithm, even if it is time consuming, and it did, and the result was correct.

So I'm not sure about the take that

The fact that something that has ingested the entirety of human literature can’t figure out how to generalize multiplication past 13

... (read more)

Replying toAI #91: Deep Thinking

oumuamua1y

AI #91: Deep Thinking

If future more capable models are indeed actively resisting their alignment training, and this is happening consistently, that seems like an important update to be making?

Could someone explain to me what this resisting behavior during alignment training looked like in practice?

Did the model outright say "I don't want to do this?", did it produce nonsensical results, did it become deceptive, did it just ... not work?

This claim seems very interesting if true, is there any further information on this?

Replying toMonthly Roundup #23: October 2024

oumuamua1y

Monthly Roundup #23: October 2024

glamorize

glomarize is the word I believe you want to use.

Replying toSlave Morality: A place for every man and every man in his place

oumuamua1y

Slave Morality: A place for every man and every man in his place

As a native German speaker I believe I can expand upon, and slightly disagree with, your definition.

I suspect that a significant portion of the misunderstanding about slave morality comes from the fact that the german word "Moral" (which is part of the Netzschean-term "Sklavenmoral") has two possible meanings, depending on context: Morality and morale, and it is the latter which I consider to be the more apt translation in this case.

Nietzsche was really speaking about slave morale. It is important to understand that slave morality is not an ethical system or a set of values, rather it is a mindset which facilitates by psychological mechanism the adoption of certain values and moral systems.

To... (read more)

Replying toAI #78: Some Welcome Calm

oumuamua1y

AI #78: Some Welcome Calm

I don't think the primary decision makers at Nvidia do believe AGI is likely to be developed soon. I think they are hyping AI because it makes them money, but not really believing that progress will continue all the way to AGI in the near future.

I agree - and if they are at all rational they have expended significant resources to find out whether this belief is justified or not, and I'd take that seriously. If Nvidia do not believe that AGI is likely to be developed soon, I think they are probably right - and this makes more sense if there in fact aren't any 5-level models around and scaling really... (read more)

Replying toAI #78: Some Welcome Calm

oumuamua1y

AI #78: Some Welcome Calm

But how would this make sense from a financing perspective? If the company reveals that they are in posession of a 5-level model they'd be able to raise money at a much higher valuation. Just imagine what would happen to Alphabet stock if they proved posession of something significantly smarter than GPT4.

Also, the fact that Nvidia is selling its GPUs rather than keeping them all for itself does seem like some kind of evidence against this. If it were really all just a matter of scaling, why not cut everyone off and rush forward? They have more than enough resources by now to pay the foremost experts millions of dollars a year, and they'd have the best equipment too. Seems like a no-brainer if AGI was around the corner.

Replying toAI #78: Some Welcome Calm

oumuamua1y*

AI #78: Some Welcome Calm

Similarly, he claims that the bill does not acknowledge trade-offs, but the reasonable care standard is absolutely centered around trade-offs of costs against benefits.

Could somebody elaborate on this?

My understanding is that if a company releases an AI model knowing it can be easily exploited ('jailbroken'), they could be held legally responsible - even if the model's potential economic benefits far outweigh its risks.

For example, if a model could generate trillions in economic value but also enable billions in damages through cyberattacks, would releasing it be illegal despite the net positive impact?

Furthermore, while the concept of 'reasonable care' allows for some risk, doesn't it prohibit companies from making decisions based solely on overall... (read more)

Replying toGuide to SB 1047

oumuamua1y

Guide to SB 1047

1, Yes, but they also require far more money to do all the good stuff as well! I’m not saying there isn’t a tradeoff involved here.

2, Yes, I’ve read that. I was saying that this is a pretty low bar, since an ordinary person isn’t good at writing viruses. I’m afraid that the bill might have the effect of making competent jailbreakable models essentially illegal, even if they don’t pose an existential risk (in which case that would be necessary ofc.), and even if their net value for society is positive, because there is a lot of software out there that‘s insecure and that a reasonably competent coding AI could exploit and cause >500 MM in damages.

I’m saying that it might be better to tell companies to git gud at computer security and accept the fact that yes, an AI will absolutely try to break their stuff, and that they won’t get to sue Anthropic if something happens.

Replying toGuide to SB 1047

oumuamua1y

Guide to SB 1047

Correct me if I'm wrong, but it seems to me that something this law implies is that it's only legal to release jailbreakable models if they (more or less) suck.

Got something that can write a pretty good computer virus or materially enable somebody to do it? Illegal under SB1047, and I think the costs might outweigh the benefits here. If your software is so vulnerable that an LLM can hack it, that should be a you problem. Maybe use an LLM to fix it, I don't know. The benefit of AI systems intelligent enough to do that (but too stupid to pose actual existential risks) seems greater than the downside of initial... (read more)