Ben Winchester — LessWrong

LESSWRONG
LW

Replying toThe Cluster Structure of Thingspace

As for where else these ideas can be found, philosophers have been working on conceptual vagueness intensely since the mid-20th century, and cluster concepts were a relatively early innovation. The philosophical literature also has the benefit of being largely free of nebulous speculations about cognition and needless formalism ... The literature also uses terminology in the ordinary way familiar to everybody engaging these issues professionally ... and avoids the invention of needless terms like "thingspace", which mainly achieve the isolation of LessWrong from the external literature.

I think there's some validity to this critique. I read The Cluster Structure of Thingspace (TCSOTS) and was asking myself "isn't this just talking about the problem... (read more)

Replying toVernor Vinge, who coined the term "Technological Singularity", dies at 79

Ben Winchester2y

Vernor Vinge, who coined the term "Technological Singularity", dies at 79

Yeah, maybe it's less the OODA loop involvement and more that "bad things" lead to a kind of activated nervous system that predisposes us to reactive behavior ("react" as opposed to "reflect/respond").

To me, the bad loops are more "stimulus -> react without thinking" than "observe, orient, decide, act". You end up hijacked by your reactive nervous system.

Replying toOn the Gladstone Report

Ben Winchester2y

On the Gladstone Report

"One problem is that due to algorithmic improvements, any FLOP threshold we set now is going to be less effective at reducing risk to acceptable levels in the future."

And this goes doubly so if we explicitly incentivize low-FLOP models. When models are non-negligibly FLOP-limited by law, then FLOP-optimization will become a major priority for AI researchers.

This reminds me of Goodhart's Law, which states “when a measure becomes a target, it ceases to be a good measure."

I.e., if FLOPs are supposed to be a measure of an AI's danger, and we then limit/target FLOPs in order to limit AGI danger, then that targeting itself interferes or nullifies the effectiveness of FLOPs as a measure of danger.

It is (unfortunately) self-defeating. At a minimum, you need to re-evaluate regularly the connection between FLOPs and danger: it will be a moving target. Is our regulatory system up to that task?